; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G06340 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G06340
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr02:6072711..6080606
RNA-Seq ExpressionClc02G06340
SyntenyClc02G06340
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR003340 - B3 DNA binding domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR015300 - DNA-binding pseudobarrel domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571981.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0068.43Show/hide
Query:  LFIGARNSLSRAH-HWKIEGNELQFSDGILQQKMVESNLTYEECRRQRLEENKKRMEELNLNKLADALKFSSPKSSPTKQLKRPRQPLDKSSFSVRRSSR
        LF+ A   LSR     KIEGN+LQF D  LQ+KMVESNLTYEECRRQRLEENKKRMEELNLNKLADALK SSPKSSPTKQLKRPRQPLD SS SVRRSSR
Subjt:  LFIGARNSLSRAH-HWKIEGNELQFSDGILQQKMVESNLTYEECRRQRLEENKKRMEELNLNKLADALKFSSPKSSPTKQLKRPRQPLDKSSFSVRRSSR

Query:  FADKPPPNYKEVPIEPLPGIRRTYQRRDLLNRIYASQEERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPVHFCKTHLPLEDEMLTLVDED
        FADKPPP+YKE PIEPL G+RRTYQRRDLLNR+YAS  ERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPVHFCK HLPLEDEMLTLVDED
Subjt:  FADKPPPNYKEVPIEPLPGIRRTYQRRDLLNRIYASQEERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPVHFCKTHLPLEDEMLTLVDED

Query:  DNEFQTKYLADKTGLSGGWRGFSIDHQLVDGDALVFQLTKPTEFKVYIIRAYNLEDREDTNEDSDVTQLEKSSKRNTKSSGHKSRANNSEDKGDNGEDSA
        +NEFQTKYLA+KTGLSGGWRGFSIDHQLVDGD LVFQLTKPTEFKVYIIRAYNLEDRE+T+EDSDVTQLE + KRNT S                G+ + 
Subjt:  DNEFQTKYLADKTGLSGGWRGFSIDHQLVDGDALVFQLTKPTEFKVYIIRAYNLEDREDTNEDSDVTQLEKSSKRNTKSSGHKSRANNSEDKGDNGEDSA

Query:  DVSQLEKSGKKTTKSSGRKSRANKSKDKADNGADLDVPELEKSGKRITRSSRQKNFVSLSSLWTVVIPNKKGLGIFRSEMASLMAVRRARTPILISSFFK
         + QL                               +P +  S K++ +  + +     SS               RS   S + ++     +L  S + 
Subjt:  DVSQLEKSGKKTTKSSGRKSRANKSKDKADNGADLDVPELEKSGKRITRSSRQKNFVSLSSLWTVVIPNKKGLGIFRSEMASLMAVRRARTPILISSFFK

Query:  VRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSP
        +                                                                                                   
Subjt:  VRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSP

Query:  QPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALV
                                   SYPQY NPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS QLNNQAGIQR  AQN A NALV
Subjt:  QPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALV

Query:  SPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWH
        SPIDELRR CGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CGSMSDA+RVFD+MPDR+I+SWH
Subjt:  SPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWH

Query:  FMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVE
         M+KGYADNGLGDEGLELFENMK+LGLQPDSQTFLFVMSACASA+AVEEGF+YFESMKNDYHI P+MDHYLGLLGILGEPGHINEAFEYV+KLPMEPTVE
Subjt:  FMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVE

Query:  VWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK
        VWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK
Subjt:  VWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAK

Query:  EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  EQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

KAG7011659.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-30489.95Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI ISSF KVRSPLPS FTF CGN+T+TLIKALSTSA  +DFSNFP PPQQPS SDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQQGSPSN + YRSQNQS  PNPGF RQGQSY+Q GNPNSWNPPNQSYPQY NPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS Q
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQR  AQN A NALVSPIDELRR CGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQPDSQTFLFVMSACASA+AVEEGF+YFESMKNDYHI P+MDHYLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

XP_022952757.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita moschata]3.3e-30389.61Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGN+T+TLIKALSTSA  +DFSNFP PPQQPSSSDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQQGSPSNQ+ YRSQNQS  PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS Q
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQ  GAQN   NALVSPIDELRR CGEGK+KEAVELLKEGVKADADCFH  FELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+M DR+I+SWH M+KGYADNGLGDEGLELFENMK+LGL P+SQTFLFVMSACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKR AISMLDGKNRI EFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

XP_022972422.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita maxima]2.1e-30289.78Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGNQT+TLIKALSTSA  +DFSNFP PPQQPSSS PR+ Q Q GSPSQVH PSGNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQ GSPSNQ+  RSQNQS  PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS Q
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQ  GAQN A NALVSPIDELRR CGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMSACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLP+EPTVEVWETLKNYA+IHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

XP_038887834.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial [Benincasa hispida]0.0e+0093.87Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMA RRARTPIL+SSFFKVRSPLPSRFTFTCG+QTDTLIKALSTSAI NDFSNFPPPPQQPSSSDPR+RQAQ G  SQVHPP+GNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQQGSPSNQ  YRSQNQSPQPNPGFS QGQ Y+Q GNPNSWNPPNQSYPQYQNPSQ NPQNF YQQ+RGPNQWNNQ QGYPQ GRPEQRNPQVE SNQ
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQR GAQNQ  NA VS IDELRR+CGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+MPDRNIDSWHFMMKGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMD YLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAV NKI TPPPKKRSAI+MLDGKNRISEFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

TrEMBL top hitse value%identityAlignment
A0A1S3BZP9 pentatricopeptide repeat-containing protein At2g156909.9e-29884.98Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI +SSFFKVR PL S FTFT  NQT+TLIK LSTSAI +DFSNFP  PQQPSSS P +RQ QWGSPSQV+PPS NFN QSFSEFQN 
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSW--------------------------------NPPNQSYPQYQNPSQANPQNFNYQQ
        DY QQG+PSNQL YRSQ+QSPQPNPGFSR+GQSY+Q G  NSW                                NPPNQSYPQYQNPSQ NP NFNYQQ
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSW--------------------------------NPPNQSYPQYQNPSQANPQNFNYQQ

Query:  QRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFD
        QRGPNQWNNQ QG+PQFGR E RNPQ ENSNQLNNQA IQR G QNQA NALVSPIDELRR CGEGK+KEAVELLK+GVKAD DCFHLLFELCGKSKS D
Subjt:  QRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFD

Query:  NAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAV
        NAKVVHDYFLQSTCRSDLQLNN+VLEMYGRCGSMSDARRVFD+MPDRNIDSWH MMKGYADNGLGDEGLELFENMK LGLQP+SQTFL+VMSACASA+AV
Subjt:  NAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAV

Query:  EEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRS
        EEGFLYFESMKNDYHITPD +HYLGLLGILGEPGHI+EAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKI TPPPKKRS
Subjt:  EEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRS

Query:  AISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR
        AISML+GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR
Subjt:  AISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSR

Query:  IVGRELIVRDNKRFHHFKD
        IVGRELIVRDNKRFHHFKD
Subjt:  IVGRELIVRDNKRFHHFKD

A0A5D3C491 Pentatricopeptide repeat-containing protein3.6e-29278.86Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI +SSFFKVR PL S FTFT  NQT+TLIK LSTSAI +DFSNFP  PQQPSSS P +RQ QWGSPSQV+PPS NFN QSFSEFQN 
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSW---------------------------------------------------------
        DY QQG+PSNQL YRSQ+QSPQPNPGFSR+GQSY+Q G  NSW                                                         
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSW---------------------------------------------------------

Query:  -----------------------NPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNAL
                               NPPNQSYPQYQNPSQ NP NFNYQQQRGPNQWNNQ QG+PQFGR E RNPQ ENSNQLNNQA IQR G QNQA NAL
Subjt:  -----------------------NPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNAL

Query:  VSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSW
        VSPIDELRR CGEGK+KEAVELLK+GVKAD DCFHLLFELCGKSKS DNAKVVHDYFLQSTCRSDLQLNN+VLEMYGRCGSMSDARRVFD+MPDRNIDSW
Subjt:  VSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSW

Query:  HFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV
        H MMKGYADNGLGDEGLELFENMK LGLQP+SQTFL+VMSACASA+AVEEGFLYFESMKNDYHITPD +HYLGLLGILGEPGHI+EAFEYV+KLPMEPTV
Subjt:  HFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTV

Query:  EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA
        EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKI TPPPKKRSAISML+GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA
Subjt:  EVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEA

Query:  KEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

A0A6J1C4C3 pentatricopeptide repeat-containing protein At2g156905.4e-29686.55Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQ-----AQWGSPSQVHPPSGNFNNQSFS
        MASLMAVRRAR PIL SSFFKVR PLPS F+F+CGNQT+T IKALSTSAI ND+SNF P PQQ  +SDPR  Q      QWG+PSQVHPPSGNFNNQSFS
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQ-----AQWGSPSQVHPPSGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQN---PSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRN
        EFQNRDYVQQGS  NQ+ Y+SQN+   PNPGFS+QGQ Y+Q GNPNSWNPPNQSYPQ QN   PS  NPQNFNYQQQRGPNQWNNQ QGYPQ G P QRN
Subjt:  EFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQN---PSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRN

Query:  PQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKV
        PQVEN NQLNNQ G+Q  GAQ QA NALV PIDELRRLCG+GK+KEAVELLKEGVKADADCFH++FELCGKSKSFDNAK+VHDYFLQSTCR DLQLNNKV
Subjt:  PQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKV

Query:  LEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYL
        LEMYG+CGSMSDARRVFD+MPDRNIDSWH M+KGYADNGLGDEGLELFENMK+LGLQP+SQTFL+VMSACAS +AVEEGF+YFESMKNDYHI P+MDHYL
Subjt:  LEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYL

Query:  GLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDE
        GLLGILGEPGHINEAFEYV+KLPMEPTVEVWETLKNYARIHG+VDLEDYAEELIV LDPTKA  NKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDE
Subjt:  GLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDE

Query:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

A0A6J1GL94 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like1.6e-30389.61Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGN+T+TLIKALSTSA  +DFSNFP PPQQPSSSDPR+ Q QWGSPSQVH PSGNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQQGSPSNQ+ YRSQNQS  PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS Q
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQ  GAQN   NALVSPIDELRR CGEGK+KEAVELLKEGVKADADCFH  FELCGKSKSF+NAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+M DR+I+SWH M+KGYADNGLGDEGLELFENMK+LGL P+SQTFLFVMSACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKR AISMLDGKNRI EFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

A0A6J1I4R8 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like1.0e-30289.78Show/hide
Query:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR
        MASLMAVRR RTPI ISSF KVRSPLPS FTF+CGNQT+TLIKALSTSA  +DFSNFP PPQQPSSS PR+ Q Q GSPSQVH PSGNFNNQSFSEFQNR
Subjt:  MASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNR

Query:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ
        DYVQ GSPSNQ+  RSQNQS  PNPGF RQGQSY+Q GNPNSWNPPNQSYPQYQNPSQ NPQNFNYQQQR PNQW+NQ QG PQFG+P QRN Q ENS Q
Subjt:  DYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQ

Query:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG
        LNNQAGIQ  GAQN A NALVSPIDELRR CGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYG+CG
Subjt:  LNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCG

Query:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE
        SMSDA+RVFD+MPDR+I+SWH M+KGYADNGLGDEGLELFENMK+LGLQP+SQTFLFVMSACASA+AVEEGF+YFESMKNDYHI PDMDHYLGLLGILGE
Subjt:  SMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGE

Query:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM
        PGHINEAFEYV+KLP+EPTVEVWETLKNYA+IHGDVDLEDYAEELIVDLDPTKA SNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLKALKAM
Subjt:  PGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRISEFRNPTLYKDDEKLKALKAM

Query:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
Subjt:  KEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210652.9e-6836.01Show/hide
Query:  GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYAD
        GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +N +L++Y RCG + +A+ +FD M D+N  SW  ++ G A 
Subjt:  GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYAD

Query:  NGLGDEGLELFENMKQL-GLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKN
        NG G E +ELF+ M+   GL P   TF+ ++ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+EY+K +PM+P V +W TL  
Subjt:  NGLGDEGLELFENMKQL-GLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKN

Query:  YARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK
           +HGD DL ++A   I+ L+P                      + +  ++     KK    S+++  NR+ EF      +P       KLK +   ++
Subjt:  YARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
         +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH AIK++S++  RE++VRD  RFHHFK+
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

Q680H3 Pentatricopeptide repeat-containing protein At2g255802.6e-7740.41Show/hide
Query:  IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDS
        I+E    C  GK+K+A+     L       D      L ++CG+++    AK VH     S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++
Subjt:  IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDS

Query:  WHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPT
        W  +++ +A NG G++ +++F   K+ G  PD Q F  +  AC     V+EG L+FESM  DY I P ++ Y+ L+ +   PG ++EA E+V+++PMEP 
Subjt:  WHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPT

Query:  VEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK-----------AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK----
        V+VWETL N +R+HG+++L DY  E++  LDPT+             ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Subjt:  VEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK-----------AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK----

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHNA+KIMS IVGRE+I RD KRFH  K+
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial9.9e-6935.01Show/hide
Query:  NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNI
        NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y ++S  +      N +L+MY + GS+ DAR++FD +  R++
Subjt:  NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNI

Query:  DSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPME
         SW+ ++  YA +G G E +  FE M+++G++P+  +FL V++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +LG  G +N A  +++++P+E
Subjt:  DSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPME

Query:  PTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL-
        PT  +W+ L N  R+H + +L  YA E + +LDP                        V  K+     KK  A S ++ +N I  F  N   +   E++ 
Subjt:  PTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL-

Query:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
            + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKD
Subjt:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

Q9SUU7 Pentatricopeptide repeat-containing protein At4g32450, mitochondrial4.0e-7834.9Show/hide
Query:  TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQG
        TC  +  +L   LST+A+R  F N       P++ +P        S   +   +G N   QS   FQ   Y Q  +P +                    G
Subjt:  TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQG

Query:  QSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLC
        Q+ +     N +N  NQSY ++      N +N N+Q   G + +     G PQ       + Q ++S                       S +DEL  +C
Subjt:  QSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLC

Query:  GEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY
         EGK+K+AVE++K    EG   D      + +LCG +++   AKVVH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +++ +
Subjt:  GEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY

Query:  ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLK
        A NG G++ ++ F   KQ G +PD + F  +  AC     + EG L+FESM  +Y I P M+HY+ L+ +L EPG+++EA  +V+   MEP V++WETL 
Subjt:  ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLK

Query:  NYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDEKLKALKA-MKEQGYVPDTRY
        N +R+HGD+ L D  ++++  LD ++      +   P K S +       M  G N          IS   N  LY     LK+LK  M E GYVP ++ 
Subjt:  NYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDEKLKALKA-MKEQGYVPDTRY

Query:  VLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
         LHD+DQE+K++ L  H+ER A     + TPAR+ +R++KNLR+C DCHNA+K+MS+IVGRELI RD KRFHH KD
Subjt:  VLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

Q9ZQE5 Pentatricopeptide repeat-containing protein At2g15690, mitochondrial4.5e-13849.51Show/hide
Query:  MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFS
        M+SLMA+R ART   + I S  ++RS  P   S+F F+ G      IK LSTSA  ND+                H+  Q GSPSQ   P   +  QSF 
Subjt:  MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLKYRSQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQANPQNFNYQQQRGPNQWNNQK
        + QN+    Q  P +  ++ +Q+  Q PQ     P  G  R    GQ+  Q G  + +   N  +    PQY  Q P    P N NYQ Q    Q +NQ 
Subjt:  EFQNRDYVQQGSPSNQLKYRSQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQANPQNFNYQQQRGPNQWNNQK

Query:  QGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQALNALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD
        Q Y PQ    +Q+ PQ   +SNQ  NQ            +N +  P  ++E+ RLC     K+A+ELL +G   D +CF LLFE C   KS +++K VHD
Subjt:  QGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQALNALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD

Query:  YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYF
        +FLQS  R D +LNN V+ M+G C S++DA+RVFD+M D+++DSWH MM  Y+DNG+GD+ L LFE M + GL+P+ +TFL V  ACA+   +EE FL+F
Subjt:  YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYF

Query:  ESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDG
        +SMKN++ I+P  +HYLG+LG+LG+ GH+ EA +Y++ LP EPT + WE ++NYAR+HGD+DLEDY EEL+VD+DP+KAV NKI TPPPK     +M+  
Subjt:  ESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDG

Query:  KNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI
        K+RI EFRN T YKD+ K  A K  K   YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LI
Subjt:  KNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI

Query:  VRDNKRFHHFKD
        VRDNKRFHHFKD
Subjt:  VRDNKRFHHFKD

Arabidopsis top hitse value%identityAlignment
AT2G15690.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-13949.51Show/hide
Query:  MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFS
        M+SLMA+R ART   + I S  ++RS  P   S+F F+ G      IK LSTSA  ND+                H+  Q GSPSQ   P   +  QSF 
Subjt:  MASLMAVRRARTP--ILISSFFKVRSPLP---SRFTFTCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLKYRSQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQANPQNFNYQQQRGPNQWNNQK
        + QN+    Q  P +  ++ +Q+  Q PQ     P  G  R    GQ+  Q G  + +   N  +    PQY  Q P    P N NYQ Q    Q +NQ 
Subjt:  EFQNRDYVQQGSPSNQLKYRSQN--QSPQ-----PNPGFSRQ---GQSYSQPGNPNSWNPPNQSY----PQY--QNPSQANPQNFNYQQQRGPNQWNNQK

Query:  QGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQALNALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD
        Q Y PQ    +Q+ PQ   +SNQ  NQ            +N +  P  ++E+ RLC     K+A+ELL +G   D +CF LLFE C   KS +++K VHD
Subjt:  QGY-PQFGRPEQRNPQ-VENSNQLNNQAGIQRQGAQNQALNALVSP--IDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHD

Query:  YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYF
        +FLQS  R D +LNN V+ M+G C S++DA+RVFD+M D+++DSWH MM  Y+DNG+GD+ L LFE M + GL+P+ +TFL V  ACA+   +EE FL+F
Subjt:  YFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYF

Query:  ESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDG
        +SMKN++ I+P  +HYLG+LG+LG+ GH+ EA +Y++ LP EPT + WE ++NYAR+HGD+DLEDY EEL+VD+DP+KAV NKI TPPPK     +M+  
Subjt:  ESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDG

Query:  KNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI
        K+RI EFRN T YKD+ K  A K  K   YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LI
Subjt:  KNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELI

Query:  VRDNKRFHHFKD
        VRDNKRFHHFKD
Subjt:  VRDNKRFHHFKD

AT2G25580.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-7840.41Show/hide
Query:  IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDS
        I+E    C  GK+K+A+     L       D      L ++CG+++    AK VH     S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++
Subjt:  IDELRRLCGEGKMKEAVE----LLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDS

Query:  WHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPT
        W  +++ +A NG G++ +++F   K+ G  PD Q F  +  AC     V+EG L+FESM  DY I P ++ Y+ L+ +   PG ++EA E+V+++PMEP 
Subjt:  WHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPT

Query:  VEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK-----------AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK----
        V+VWETL N +R+HG+++L DY  E++  LDPT+             ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Subjt:  VEVWETLKNYARIHGDVDLEDYAEELIVDLDPTK-----------AVSNKISTPPPKKRSAISMLDG-KNRISEFR--NPTLYKDDEKLKALKAMK----

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
        E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHNA+KIMS IVGRE+I RD KRFH  K+
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-7035.18Show/hide
Query:  NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNI
        NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y ++S  +      N +L+MY + GS+ DAR++FD +  R++
Subjt:  NALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNI

Query:  DSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPME
         SW+ ++  YA +G G E +  FE M+++G++P+  +FL V++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +LG  G +N A  +++++P+E
Subjt:  DSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPME

Query:  PTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL-
        PT  +W+ L N  R+H + +L  YA E + +LDP                        V  K+     KK  A S ++ +N I  F  N   +   E++ 
Subjt:  PTVEVWETLKNYARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-RNPTLYKDDEKL-

Query:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDA
            + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKDA
Subjt:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDA

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-6936.01Show/hide
Query:  GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYAD
        GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +N +L++Y RCG + +A+ +FD M D+N  SW  ++ G A 
Subjt:  GKMKEAVELLKE----GVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYAD

Query:  NGLGDEGLELFENMKQL-GLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKN
        NG G E +ELF+ M+   GL P   TF+ ++ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+EY+K +PM+P V +W TL  
Subjt:  NGLGDEGLELFENMKQL-GLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKN

Query:  YARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK
           +HGD DL ++A   I+ L+P                      + +  ++     KK    S+++  NR+ EF      +P       KLK +   ++
Subjt:  YARIHGDVDLEDYAEELIVDLDP---------------------TKAVSNKISTPPPKKRSAISMLDGKNRISEF-----RNPTLYKDDEKLKALKA-MK

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
         +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH AIK++S++  RE++VRD  RFHHFK+
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD

AT4G32450.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-7934.9Show/hide
Query:  TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQG
        TC  +  +L   LST+A+R  F N       P++ +P        S   +   +G N   QS   FQ   Y Q  +P +                    G
Subjt:  TCGNQTDTLIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSG-NFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQG

Query:  QSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLC
        Q+ +     N +N  NQSY ++      N +N N+Q   G + +     G PQ       + Q ++S                       S +DEL  +C
Subjt:  QSYSQPGNPNSWNPPNQSYPQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLC

Query:  GEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY
         EGK+K+AVE++K    EG   D      + +LCG +++   AKVVH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +++ +
Subjt:  GEGKMKEAVELLK----EGVKADADCFHLLFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGY

Query:  ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLK
        A NG G++ ++ F   KQ G +PD + F  +  AC     + EG L+FESM  +Y I P M+HY+ L+ +L EPG+++EA  +V+   MEP V++WETL 
Subjt:  ADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEEGFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLK

Query:  NYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDEKLKALKA-MKEQGYVPDTRY
        N +R+HGD+ L D  ++++  LD ++      +   P K S +       M  G N          IS   N  LY     LK+LK  M E GYVP ++ 
Subjt:  NYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAI------SMLDGKN---------RISEFRNPTLYKDDEKLKALKA-MKEQGYVPDTRY

Query:  VLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD
         LHD+DQE+K++ L  H+ER A     + TPAR+ +R++KNLR+C DCHNA+K+MS+IVGRELI RD KRFHH KD
Subjt:  VLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCATGTAGCTGAGCCTCGGCCTTTACCAGCCCCGTTTGAATTCAAATCAAACTACGGAGCTCTCTTTATTGGCGCTCGAAATTCTCTCTCAAGAGCCCATCACTG
GAAAATCGAAGGGAACGAGCTTCAGTTCAGCGACGGAATCCTACAACAGAAAATGGTGGAATCGAACCTGACTTACGAGGAATGCCGACGCCAGAGGTTGGAAGAGAATA
AGAAGAGGATGGAAGAGCTTAATCTGAACAAACTTGCCGATGCTCTGAAATTTTCCAGCCCTAAATCCTCTCCGACCAAACAGCTAAAGCGTCCTCGTCAGCCACTCGAT
AAGTCGTCTTTCAGTGTGAGAAGGTCTAGCCGTTTTGCCGATAAGCCCCCTCCGAACTATAAGGAGGTGCCCATTGAACCACTTCCAGGTATAAGAAGGACTTATCAAAG
GAGAGATTTGCTGAATCGGATTTATGCTTCACAAGAAGAAAGGCAATATGCTATTGACAGAGCAAGAGACCTTCAATCTAGCCTGGAATCTAGGTACCCCAGTTTTGTGA
AGCCCATGCTTCAATCACATGTCACAGGGGGATTTTGGCTGGGTCTACCAGTTCACTTTTGCAAGACACACCTTCCCCTTGAGGATGAAATGCTAACTCTGGTTGACGAG
GATGATAATGAGTTCCAAACAAAATACCTTGCCGATAAAACAGGTCTCAGTGGTGGTTGGAGAGGGTTTTCCATTGATCATCAGTTAGTAGATGGGGATGCTTTGGTGTT
TCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGAGCATACAATTTAGAAGACAGAGAAGATACCAATGAGGATTCTGATGTCACCCAATTGGAAAAAAGTA
GCAAAAGAAATACCAAATCATCAGGGCATAAATCCAGGGCAAATAATTCTGAGGATAAAGGAGATAATGGTGAGGATTCAGCAGATGTCTCTCAGTTGGAAAAAAGTGGC
AAAAAAACTACTAAATCATCAGGGCGTAAAAGCAGGGCAAATAAATCCAAAGATAAAGCAGATAACGGCGCGGATTTGGATGTCCCCGAGCTGGAAAAAAGTGGCAAAAG
AATTACTAGATCATCAAGGCAAAAGAATTTTGTCAGTCTTAGCTCTCTCTGGACTGTTGTAATTCCAAACAAAAAGGGTTTAGGCATTTTCCGATCAGAAATGGCGTCTC
TCATGGCGGTTCGGCGTGCTCGAACCCCTATTCTTATCTCCTCCTTCTTCAAGGTACGGTCTCCTCTCCCTTCTCGTTTCACCTTCACTTGTGGAAATCAGACAGATACC
CTAATCAAAGCCCTAAGCACCTCGGCAATCCGTAACGATTTCTCAAATTTTCCTCCTCCGCCGCAACAACCTTCTTCGTCTGACCCTCGACATCGTCAAGCCCAGTGGGG
CTCGCCGAGCCAGGTTCATCCTCCGAGTGGAAATTTTAATAATCAGTCGTTCTCGGAGTTTCAGAATCGCGATTATGTTCAACAGGGAAGCCCCAGTAATCAATTGAAGT
ATCGGAGTCAGAATCAGAGCCCTCAACCCAATCCTGGATTTTCCCGGCAGGGTCAGAGCTATAGTCAACCCGGTAACCCTAATTCGTGGAATCCTCCAAATCAAAGCTAC
CCGCAGTATCAAAATCCTTCGCAGGCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAACAATCAAAAACAGGGATATCCACAATTTGGAAG
GCCTGAACAGCGTAACCCACAAGTAGAGAATTCTAATCAGTTGAATAATCAGGCTGGGATTCAAAGGCAAGGTGCTCAAAATCAAGCACTAAATGCCCTTGTATCTCCTA
TTGACGAACTGCGGCGCCTTTGTGGAGAGGGGAAGATGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATTTGTTGTTTGAACTA
TGTGGGAAATCCAAGTCATTTGACAATGCTAAAGTAGTTCATGATTACTTTTTACAGTCAACTTGTAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGG
GAGATGTGGAAGCATGAGTGATGCACGGAGAGTGTTCGACTATATGCCTGATAGAAATATTGATTCTTGGCATTTTATGATGAAAGGATATGCTGATAATGGATTGGGTG
ATGAGGGTCTGGAGTTATTTGAGAATATGAAGCAGCTAGGGTTGCAACCCGATTCACAAACTTTCCTTTTTGTTATGTCAGCTTGTGCTAGTGCGAATGCTGTGGAAGAA
GGATTTCTGTACTTTGAGTCAATGAAAAATGATTATCATATTACCCCAGACATGGATCATTATTTGGGGCTTTTAGGTATTCTTGGAGAACCAGGACACATTAATGAGGC
TTTCGAGTATGTTAAGAAACTGCCAATGGAGCCCACAGTTGAGGTATGGGAGACTTTAAAGAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTATGCAGAGG
AGCTAATTGTTGATCTGGACCCGACAAAAGCTGTTTCTAATAAGATATCGACACCACCTCCCAAAAAACGGTCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTAGT
GAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTGAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATCGA
TCAAGAGGCCAAAGAGCAGGCATTGCTGTATCACAGTGAACGATTGGCAATCGCATATGGATTGATCAGTACTCCGGCACGAACGCCTCTTAGGATCATTAAGAACCTAA
GGATCTGCGGTGATTGTCATAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTAATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTCATGTAGCTGAGCCTCGGCCTTTACCAGCCCCGTTTGAATTCAAATCAAACTACGGAGCTCTCTTTATTGGCGCTCGAAATTCTCTCTCAAGAGCCCATCACTG
GAAAATCGAAGGGAACGAGCTTCAGTTCAGCGACGGAATCCTACAACAGAAAATGGTGGAATCGAACCTGACTTACGAGGAATGCCGACGCCAGAGGTTGGAAGAGAATA
AGAAGAGGATGGAAGAGCTTAATCTGAACAAACTTGCCGATGCTCTGAAATTTTCCAGCCCTAAATCCTCTCCGACCAAACAGCTAAAGCGTCCTCGTCAGCCACTCGAT
AAGTCGTCTTTCAGTGTGAGAAGGTCTAGCCGTTTTGCCGATAAGCCCCCTCCGAACTATAAGGAGGTGCCCATTGAACCACTTCCAGGTATAAGAAGGACTTATCAAAG
GAGAGATTTGCTGAATCGGATTTATGCTTCACAAGAAGAAAGGCAATATGCTATTGACAGAGCAAGAGACCTTCAATCTAGCCTGGAATCTAGGTACCCCAGTTTTGTGA
AGCCCATGCTTCAATCACATGTCACAGGGGGATTTTGGCTGGGTCTACCAGTTCACTTTTGCAAGACACACCTTCCCCTTGAGGATGAAATGCTAACTCTGGTTGACGAG
GATGATAATGAGTTCCAAACAAAATACCTTGCCGATAAAACAGGTCTCAGTGGTGGTTGGAGAGGGTTTTCCATTGATCATCAGTTAGTAGATGGGGATGCTTTGGTGTT
TCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGAGCATACAATTTAGAAGACAGAGAAGATACCAATGAGGATTCTGATGTCACCCAATTGGAAAAAAGTA
GCAAAAGAAATACCAAATCATCAGGGCATAAATCCAGGGCAAATAATTCTGAGGATAAAGGAGATAATGGTGAGGATTCAGCAGATGTCTCTCAGTTGGAAAAAAGTGGC
AAAAAAACTACTAAATCATCAGGGCGTAAAAGCAGGGCAAATAAATCCAAAGATAAAGCAGATAACGGCGCGGATTTGGATGTCCCCGAGCTGGAAAAAAGTGGCAAAAG
AATTACTAGATCATCAAGGCAAAAGAATTTTGTCAGTCTTAGCTCTCTCTGGACTGTTGTAATTCCAAACAAAAAGGGTTTAGGCATTTTCCGATCAGAAATGGCGTCTC
TCATGGCGGTTCGGCGTGCTCGAACCCCTATTCTTATCTCCTCCTTCTTCAAGGTACGGTCTCCTCTCCCTTCTCGTTTCACCTTCACTTGTGGAAATCAGACAGATACC
CTAATCAAAGCCCTAAGCACCTCGGCAATCCGTAACGATTTCTCAAATTTTCCTCCTCCGCCGCAACAACCTTCTTCGTCTGACCCTCGACATCGTCAAGCCCAGTGGGG
CTCGCCGAGCCAGGTTCATCCTCCGAGTGGAAATTTTAATAATCAGTCGTTCTCGGAGTTTCAGAATCGCGATTATGTTCAACAGGGAAGCCCCAGTAATCAATTGAAGT
ATCGGAGTCAGAATCAGAGCCCTCAACCCAATCCTGGATTTTCCCGGCAGGGTCAGAGCTATAGTCAACCCGGTAACCCTAATTCGTGGAATCCTCCAAATCAAAGCTAC
CCGCAGTATCAAAATCCTTCGCAGGCGAACCCTCAAAATTTCAATTATCAGCAACAAAGAGGCCCTAACCAATGGAACAATCAAAAACAGGGATATCCACAATTTGGAAG
GCCTGAACAGCGTAACCCACAAGTAGAGAATTCTAATCAGTTGAATAATCAGGCTGGGATTCAAAGGCAAGGTGCTCAAAATCAAGCACTAAATGCCCTTGTATCTCCTA
TTGACGAACTGCGGCGCCTTTGTGGAGAGGGGAAGATGAAAGAAGCTGTTGAATTATTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATTTGTTGTTTGAACTA
TGTGGGAAATCCAAGTCATTTGACAATGCTAAAGTAGTTCATGATTACTTTTTACAGTCAACTTGTAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGG
GAGATGTGGAAGCATGAGTGATGCACGGAGAGTGTTCGACTATATGCCTGATAGAAATATTGATTCTTGGCATTTTATGATGAAAGGATATGCTGATAATGGATTGGGTG
ATGAGGGTCTGGAGTTATTTGAGAATATGAAGCAGCTAGGGTTGCAACCCGATTCACAAACTTTCCTTTTTGTTATGTCAGCTTGTGCTAGTGCGAATGCTGTGGAAGAA
GGATTTCTGTACTTTGAGTCAATGAAAAATGATTATCATATTACCCCAGACATGGATCATTATTTGGGGCTTTTAGGTATTCTTGGAGAACCAGGACACATTAATGAGGC
TTTCGAGTATGTTAAGAAACTGCCAATGGAGCCCACAGTTGAGGTATGGGAGACTTTAAAGAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTATGCAGAGG
AGCTAATTGTTGATCTGGACCCGACAAAAGCTGTTTCTAATAAGATATCGACACCACCTCCCAAAAAACGGTCTGCAATTAGCATGCTTGATGGGAAGAACAGGATTAGT
GAATTCAGAAATCCAACTCTCTACAAAGATGATGAAAAATTGAAGGCTTTGAAGGCAATGAAAGAACAAGGGTATGTGCCAGATACTAGATATGTTCTTCACGATATCGA
TCAAGAGGCCAAAGAGCAGGCATTGCTGTATCACAGTGAACGATTGGCAATCGCATATGGATTGATCAGTACTCCGGCACGAACGCCTCTTAGGATCATTAAGAACCTAA
GGATCTGCGGTGATTGTCATAATGCCATCAAAATCATGTCTAGAATTGTTGGGAGAGAGTTAATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGCTTGA
Protein sequenceShow/hide protein sequence
MCHVAEPRPLPAPFEFKSNYGALFIGARNSLSRAHHWKIEGNELQFSDGILQQKMVESNLTYEECRRQRLEENKKRMEELNLNKLADALKFSSPKSSPTKQLKRPRQPLD
KSSFSVRRSSRFADKPPPNYKEVPIEPLPGIRRTYQRRDLLNRIYASQEERQYAIDRARDLQSSLESRYPSFVKPMLQSHVTGGFWLGLPVHFCKTHLPLEDEMLTLVDE
DDNEFQTKYLADKTGLSGGWRGFSIDHQLVDGDALVFQLTKPTEFKVYIIRAYNLEDREDTNEDSDVTQLEKSSKRNTKSSGHKSRANNSEDKGDNGEDSADVSQLEKSG
KKTTKSSGRKSRANKSKDKADNGADLDVPELEKSGKRITRSSRQKNFVSLSSLWTVVIPNKKGLGIFRSEMASLMAVRRARTPILISSFFKVRSPLPSRFTFTCGNQTDT
LIKALSTSAIRNDFSNFPPPPQQPSSSDPRHRQAQWGSPSQVHPPSGNFNNQSFSEFQNRDYVQQGSPSNQLKYRSQNQSPQPNPGFSRQGQSYSQPGNPNSWNPPNQSY
PQYQNPSQANPQNFNYQQQRGPNQWNNQKQGYPQFGRPEQRNPQVENSNQLNNQAGIQRQGAQNQALNALVSPIDELRRLCGEGKMKEAVELLKEGVKADADCFHLLFEL
CGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEMYGRCGSMSDARRVFDYMPDRNIDSWHFMMKGYADNGLGDEGLELFENMKQLGLQPDSQTFLFVMSACASANAVEE
GFLYFESMKNDYHITPDMDHYLGLLGILGEPGHINEAFEYVKKLPMEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAVSNKISTPPPKKRSAISMLDGKNRIS
EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDA