; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G015530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G015530
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr07:32043765..32052884
RNA-Seq ExpressionClCG07G015530
SyntenyClCG07G015530
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152299.1 pentatricopeptide repeat-containing protein At4g20090 isoform X1 [Cucumis sativus]4.2e-29188.19Show/hide
Query:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL
        TIR++SALLK +  HF+G SSHFFSTS TT HIAIAPRALAR PTSRTAP PR+P+T GS+DVVNSVC+LLSNKN  T NLDLDHLLKRFK  L+SD VL
Subjt:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH GR+SCRT SICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+ TAIE F EM RAGLVPTR+A+NILIG+LCSLSAKEGAVE+V+V ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST

Query:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAANKL+LVPS+FV VQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVM+ALCEHRHV+EAS+LFGRMLS
Subjt:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN++RCAPDHVTYSALIHAYGE R+WSAAY LLKEMLS GMSP FHVYSIVDKLMREHGQIDLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA

Query:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        +KWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIYVRD FESAFQKKGKFKIARELLQKMD VHQHES TRNSS
Subjt:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]3.6e-29087.5Show/hide
Query:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL
        TIR++SALLK +  HF+GFSSHFFSTS TTKHIAIAPRAL R PTSRTAP PR+P+T GS+DVVNSVC+LLSNKN  T NLD++HLLKRFK  L+SDLVL
Subjt:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH GR+SCRT SICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+ TAIE F EM RAGLVPTRSA NILIG+LCSLSAKEGA+E+V+VRST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST

Query:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAANKL LVPS+FV VQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVM+ALCEHRH++EAS+LFGRMLS
Subjt:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN++RCAPDHVTYSALIHAYGE RNWSAAY LLKEMLS GMSP FHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA

Query:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        +KWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIYVRD FESAFQKKGKFKIARELLQKMD VHQHES TRNSS
Subjt:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]2.0e-28587.09Show/hide
Query:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT
        MLS STIRNASA LKFV    YGFSS+  STSST K  AIAPRALAR PTSRTAPIPRA D    TD V+SVC+LLSNKNH TTNL+LDHLLKRFK TL+
Subjt:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+ GR+SCRT SICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRFGTA+EVFDEM RAGLVPTRSA+NILIGDLCSLSAKEGAVE+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV

Query:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF
        +VRST RPFTVLVPNVNPKSGAI+ AVGVFWAAN+LALVPS FVIV+LISELCRLGQMQEAIRVLKVVE +KLRC EECYS+VMQALCEHR V+EAS+LF
Subjt:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNR+RC PDHVTYSALIHAYGE RNWSAAYSLLKEMLS G+SP FHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL

Query:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        CLKL +KWE+QILQKLCKQGQL  AYEK+KSMLEKGF+PPIYVRD FESAFQKKGKFKIARELLQ MD VH+HES +R +S
Subjt:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]3.1e-28687.44Show/hide
Query:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT
        MLS STIRNASA LKFV    YGFSS+  STSSTTK  AIAPRALAR PTSRTA IPRA D    TD V+SVC+LLSNK+H TTNL+LDHLLKRFK TL+
Subjt:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+ GR+SCRT SICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRFGTA+EVFDEM RA LVPTRSA+NILIGDLCSLSAKEGAVE+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV

Query:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF
        +VRST RPFTVLVPNVNPKSGAIEPAVGVFWAAN++ALVPSAFVIV+LISELCRLGQMQEAIRVLKVVE +KLRC EECYS+VMQALCEHR V+EAS+LF
Subjt:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNR+RC PDHVTYSALIHAYGE RNWSAAYSLLKEMLS G+SP FHVYSIVDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL

Query:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        CLKL +KWE+QILQKLCKQGQL AAYEK+KSMLEKGF+PPIYVRD FESAFQKKGKFKIARELLQ MD VH+HES TR +S
Subjt:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]1.3e-30891.91Show/hide
Query:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT
        MLS++TIRNASA LKF P HFYGFSSHFFSTS+ TKHIAIAPRALAR PTSRTAPIPRA DT GS+DVVNSVC+LLSNKNH T NLDLDHLLKRFK TL+
Subjt:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDE+VVEYMADFLGRRKLFDDMKCLLVTVSSH GRLSCRT SICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTAL+IFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSA+NILIGDLCSLSAKEGAVE+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV

Query:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF
        +VRST RPFTVLVPNVNPKSGAIEPAVG+FWAANKLALVPSAFVIVQLISELCRLGQMQEAI+VLKVVEGDKLRCAEECYSVVM+ALCEHRHVEEAS+LF
Subjt:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL
        GR+LSQGMKPKLAIYNS+ICMLCK+GNL+DAERVFKIMNR+RCAPDHVTYS+LIHAYGETRNWSAAYSLLKEMLS GMSP FH+YS+VDKLMREHGQIDL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL

Query:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        CLKL +KWEAQILQKLCK GQL+AAYEKMKSMLEKGF+PPIYVRD+FESAFQKKGKFKIARELLQK+D VHQHES TRNSS
Subjt:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein1.3e-27787.98Show/hide
Query:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL
        TIR++SALLK +  HF+G SSHFFSTS TT HIAIAPRALAR PTSRTAP PR+P+T GS+DVVNSVC+LLSNKN  T NLDLDHLLKRFK  L+SD VL
Subjt:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH GR+SCRT SICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+ TAIE F EM RAGLVPTR+A+NILIG+LCSLSAKEGAVE+V+V ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST

Query:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAANKL+LVPS+FV VQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVM+ALCEHRHV+EAS+LFGRMLS
Subjt:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN++RCAPDHVTYSALIHAYGE R+WSAAY LLKEMLS GMSP FHVYSIVDKLMREHGQIDLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA

Query:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKK
        +KWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIYVRD FESAFQKK
Subjt:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.7e-29087.5Show/hide
Query:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL
        TIR++SALLK +  HF+GFSSHFFSTS TTKHIAIAPRAL R PTSRTAP PR+P+T GS+DVVNSVC+LLSNKN  T NLD++HLLKRFK  L+SDLVL
Subjt:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH GR+SCRT SICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+ TAIE F EM RAGLVPTRSA NILIG+LCSLSAKEGA+E+V+VRST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST

Query:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAANKL LVPS+FV VQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVM+ALCEHRH++EAS+LFGRMLS
Subjt:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN++RCAPDHVTYSALIHAYGE RNWSAAY LLKEMLS GMSP FHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA

Query:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        +KWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIYVRD FESAFQKKGKFKIARELLQKMD VHQHES TRNSS
Subjt:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

A0A5D3CYL1 Pentatricopeptide repeat-containing protein6.4e-27787.25Show/hide
Query:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL
        TIR++SALLK +  HF+GFSSHFFSTS TTKHIAIAPRAL R PTSRTAP PR+P+T GS+DVVNSVC+LLSNKN  T NLD++HLLKRFK  L+SDLVL
Subjt:  TIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMG+RFD SVVEYMADFLGRRKLFDDMKCLLVTV SH GR+SCRT SICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSNVIIGLCKFGR+ TAIE F EM RAGLVPTRSA NILIG+LCSLSAKEGA+E+V+VRST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRST

Query:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAANKL LVPS+FV VQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVM+ALCEHRH++EAS+LFGRMLS
Subjt:  PRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN++RCAPDHVTYSALIHAYGE RNWSAAY LLKEMLS GMSP FHVYS+VDKLMREHGQ+DLCLKL 
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLA

Query:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKK
        +KWEAQILQKLCKQGQLEAAYEKMKSMLEKG  PPIYVRD FESAFQKK
Subjt:  VKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKK

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like9.8e-28687.09Show/hide
Query:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT
        MLS STIRNASA LKFV    YGFSS+  STSST K  AIAPRALAR PTSRTAPIPRA D    TD V+SVC+LLSNKNH TTNL+LDHLLKRFK TL+
Subjt:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+ GR+SCRT SICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRFGTA+EVFDEM RAGLVPTRSA+NILIGDLCSLSAKEGAVE+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV

Query:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF
        +VRST RPFTVLVPNVNPKSGAI+ AVGVFWAAN+LALVPS FVIV+LISELCRLGQMQEAIRVLKVVE +KLRC EECYS+VMQALCEHR V+EAS+LF
Subjt:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNR+RC PDHVTYSALIHAYGE RNWSAAYSLLKEMLS G+SP FHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL

Query:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        CLKL +KWE+QILQKLCKQGQL  AYEK+KSMLEKGF+PPIYVRD FESAFQKKGKFKIARELLQ MD VH+HES +R +S
Subjt:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like1.5e-28687.44Show/hide
Query:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT
        MLS STIRNASA LKFV    YGFSS+  STSSTTK  AIAPRALAR PTSRTA IPRA D    TD V+SVC+LLSNK+H TTNL+LDHLLKRFK TL+
Subjt:  MLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPIPRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLT

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+ GR+SCRT SICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSN+IIGLCKFGRFGTA+EVFDEM RA LVPTRSA+NILIGDLCSLSAKEGAVE+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAVERV

Query:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF
        +VRST RPFTVLVPNVNPKSGAIEPAVGVFWAAN++ALVPSAFVIV+LISELCRLGQMQEAIRVLKVVE +KLRC EECYS+VMQALCEHR V+EAS+LF
Subjt:  KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNR+RC PDHVTYSALIHAYGE RNWSAAYSLLKEMLS G+SP FHVYSIVDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDL

Query:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS
        CLKL +KWE+QILQKLCKQGQL AAYEK+KSMLEKGF+PPIYVRD FESAFQKKGKFKIARELLQ MD VH+HES TR +S
Subjt:  CLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS

SwissProt top hitse value%identityAlignment
Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial1.1e-2824.74Show/hide
Query:  QILMNYRLLGRAKTLEF----FSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKF
        ++L  + +LGRA  L +     ++S L  G+  +  V E +A  L  R +    +  LVTVS+ +         +C++     GRV EAL L + M  ++
Subjt:  QILMNYRLLGRAKTLEF----FSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHMGRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAV
        G +PD + +  +L  LCK   +      AL +FR++E          YS VI  LCK G F  A+ +F+EM   G+       + LIG LC+    +   
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIGDLCSLSAKEGAV

Query:  ERVKVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEAS
        + ++         ++  N+ P        + VF    KL          +L +E+   G   + I    +++G    C E C             + EA+
Subjt:  ERVKVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEAS

Query:  NLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQ
         +F  M+S+G +P +  Y+ +I   CK   +DD  R+F+ ++ +   P+ +TY+ L+  + ++   +AA  L +EM+S G+ P    Y I+   + ++G+
Subjt:  NLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQ

Query:  IDLCLKLAVKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKM
        ++  L++  K +             I+  +C   +++ A+    S+ +KG  P +   +       KKG    A  L +KM
Subjt:  IDLCLKLAVKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKM

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397105.1e-2926.72Show/hide
Query:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFGTAIEVFDEM
        T +I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R + L    P+  SY+ VI GLC+ GR      V  EM
Subjt:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIEL----PDKYSYSNVIIGLCKFGRFGTAIEVFDEM

Query:  NRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVLVPNVNP---------KSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQ
        NR G        N LI   C    KEG   +  V         L P+V           K+G +  A+          L P+      L+    + G M 
Subjt:  NRAGLVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVLVPNVNP---------KSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQ

Query:  EAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGE
        EA RVL+ +  +    +   Y+ ++   C    +E+A  +   M  +G+ P +  Y++V+   C+  ++D+A RV + M  +   PD +TYS+LI  + E
Subjt:  EAIRVLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGE

Query:  TRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKI
         R    A  L +EML  G+ P    Y+                         ++   C +G LE A +    M+EKG  P +       +   K+ + + 
Subjt:  TRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKI

Query:  ARELLQKM
        A+ LL K+
Subjt:  ARELLQKM

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011103.9e-2924.81Show/hide
Query:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAG
        TL+I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ VI GLCK G++  A EVF EM R+G
Subjt:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAG

Query:  LVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVL--------VPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRV
        L P  +    L+ + C    K   VE  KV S  R   V+        + ++  +SG ++ A+  F +  +  L+P   +   LI   CR G +  A+ +
Subjt:  LVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVL--------VPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRV

Query:  LKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWS
           +           Y+ ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F+ M  +R   D VTY+ L+  +G+  +  
Subjt:  LKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWS

Query:  AAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELL
         A  +  +M+S  + P    YSI                        ++  LC +G L  A+     M+ K   P + + ++    + + G        L
Subjt:  AAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELL

Query:  QKM
        +KM
Subjt:  QKM

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.6e-3025.26Show/hide
Query:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAG
        ++++ +    ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y++VI GLCK G    A+EV D+M    
Subjt:  TLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAG

Query:  LVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVLVPNVNPKSGAIE---------PAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIR
          P     N LI  LC  +  E A E  +V ++      ++P+V   +  I+          A+ +F         P  F    LI  LC  G++ EA+ 
Subjt:  LVPTRSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVLVPNVNPKSGAIE---------PAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIR

Query:  VLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNW
        +LK +E      +   Y+ ++   C+     EA  +F  M   G+      YN++I  LCK   ++DA ++   M  E   PD  TY++L+  +    + 
Subjt:  VLKVVEGDKLRCAEECYSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNW

Query:  SAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFHPP
          A  +++ M S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP
Subjt:  SAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGFHPP

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655601.7e-2924.48Show/hide
Query:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIG
        R+ EA+ LF +M+    C P    +  ++ +LC  E   E ++    +      P+ ++Y+ +I  LC   +F  A E+  +M   GL+P     N LI 
Subjt:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPTRSALNILIG

Query:  DLCSLSAKEGAVERV------KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEEC
          C     E AV+ V      K+    R +  L+      +  +  A+GV     +  ++P       LI   CR G    A R+L ++    L   +  
Subjt:  DLCSLSAKEGAVERV------KVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEEC

Query:  YSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMS
        Y+ ++ +LC+ + VEEA +LF  +  +G+ P + +Y ++I   CK G +D+A  + + M  + C P+ +T++ALIH          A  L ++M+  G+ 
Subjt:  YSVVMQALCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMS

Query:  PQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKM
        P                         V  +  ++ +L K G  + AY + + ML  G  P  +   TF   + ++G+   A +++ KM
Subjt:  PQFHVYSIVDKLMREHGQIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKM

Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)2.1e-4655.5Show/hide
Query:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD
        MG      S+ W +S N    +  DE+ I E   + + S     A  G++ KL +K+SQVLLEG+VE          +DDL R+KSLTD+DL++L+GC+D
Subjt:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD

Query:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL
        LGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+S     P+      ++PIANWKISSPGD+P+DVKARLK+WAQAVA T  L
Subjt:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL

AT1G05870.2 Protein of unknown function (DUF1685)2.1e-4655.5Show/hide
Query:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD
        MG      S+ W +S N    +  DE+ I E   + + S     A  G++ KL +K+SQVLLEG+VE          +DDL R+KSLTD+DL++L+GC+D
Subjt:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD

Query:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL
        LGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+S     P+      ++PIANWKISSPGD+P+DVKARLK+WAQAVA T  L
Subjt:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL

AT1G05870.3 Protein of unknown function (DUF1685)2.1e-4655.5Show/hide
Query:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD
        MG      S+ W +S N    +  DE+ I E   + + S     A  G++ KL +K+SQVLLEG+VE          +DDL R+KSLTD+DL++L+GC+D
Subjt:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD

Query:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL
        LGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+S     P+      ++PIANWKISSPGD+P+DVKARLK+WAQAVA T  L
Subjt:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL

AT1G05870.4 Protein of unknown function (DUF1685)2.1e-4655.5Show/hide
Query:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD
        MG      S+ W +S N    +  DE+ I E   + + S     A  G++ KL +K+SQVLLEG+VE          +DDL R+KSLTD+DL++L+GC+D
Subjt:  MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVE---------DEDDLMRTKSLTDEDLDELKGCVD

Query:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL
        LGFGFSYDEIPELCNTLPALELCYSMSQK++DD Q KSPE+S     P+      ++PIANWKISSPGD+P+DVKARLK+WAQAVA T  L
Subjt:  LGFGFSYDEIPELCNTLPALELCYSMSQKYMDDHQ-KSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL

AT2G31560.1 Protein of unknown function (DUF1685)2.7e-4668.12Show/hide
Query:  AKNKLLKKRSQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDD----HQKSPESSPFSAVPTDSGS
        ++ KL KK+SQVLLEG+ ++D+DDL R KSLTD+DL+ELKGC+DLGFGFSYDEIPELCNTLPALELCYSMSQK++DD    H KS E    S  PT    
Subjt:  AKNKLLKKRSQVLLEGF-VEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPALELCYSMSQKYMDD----HQKSPESSPFSAVPTDSGS

Query:  SVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL
          ++PIANWKISSPGD P+DVKARLK+WAQ VA T  L
Subjt:  SVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTGATTTCTTTAGCGAATCTGAAACGTGGGTTTCTTCTGTGAACGGAATTCGAGAAGACCCAGATGATGAAACCTCCATTGAAGAAGGTGAAGGAATTGGGTT
GGATTCGGATTTGGAGGGGTCGGCTCTAATGGGGGCGAAGAACAAGTTGTTGAAGAAGAGGAGTCAGGTTTTGCTTGAAGGGTTTGTGGAGGATGAGGATGATTTGATGA
GGACGAAGAGCTTGACGGATGAGGATCTTGATGAGCTTAAAGGGTGTGTGGATCTAGGGTTTGGTTTCAGCTACGATGAGATTCCGGAGCTCTGCAACACTTTGCCGGCC
TTGGAACTCTGTTATTCCATGAGCCAAAAGTACATGGACGACCACCAGAAGTCGCCGGAGAGCTCGCCGTTTTCGGCGGTGCCGACGGACTCTGGTTCGTCCGTGTCCAG
TCCGATTGCGAATTGGAAGATCTCCAGTCCTGGTGATCATCCAGAAGATGTGAAGGCAAGGCTCAAATTTTGGGCTCAGGCAGTGGCATTAACCTTTAGCCTTATGGTTG
ATGACATTGCACGCTTCCAAGAAAAGATAGATCGAATGAATGAACAAACGGGTCATTTGATGAATCCGACTTATGATGATGAAGAGATGAAGAAGAAGAATGTGTCAGAG
GCCGGGAATGTTACAGTCTCGATGTCATCTCTACTTCTGATTTCTAGTCTTCCTGAGGGCCTTGATGCCAAAGGAAAAGAAAATGTTTGGTTGAGGTTGTATAAATTGAA
AGAGCGAGAATTTTCGAACTTATTGGTTCCGCTTCAGCTTGCGGAATCGTCGATGTTGAGCAGGAGCACGATTAGGAATGCTTCGGCGTTGCTCAAATTCGTTCCTTTTC
ATTTTTATGGCTTCTCTTCACACTTTTTCAGCACTTCATCGACCACAAAGCACATTGCCATAGCTCCAAGAGCTCTTGCAAGAACACCCACTTCACGAACTGCTCCAATC
CCTCGGGCTCCGGACACCCCCGGCTCTACCGATGTCGTAAATTCAGTATGTGCTTTACTTTCAAACAAAAATCACCTAACAACTAATCTCGATCTTGATCATTTATTGAA
AAGGTTCAAACACACCTTAACTTCGGATCTCGTTCTTCAAATTCTGATGAATTATAGGCTGTTGGGTCGGGCTAAAACGCTTGAATTCTTCTCTTGGTCTGGATTGCAAA
TGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAGGAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTGACTGTGTCGTCTCATATG
GGTCGGCTTTCTTGTCGAACATTATCAATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTAGGGAAGCACTTTGCTTGTTCGAAGAAATGGAACCAAAATTTGGGTG
TAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCAACTGGGGAATTGATTGATACTGCTCTAACCATTTTCAGAAGAATTGAATTGC
CTGATAAATATTCATACAGTAATGTCATTATCGGATTGTGTAAATTTGGTCGGTTTGGTACAGCTATTGAAGTGTTTGATGAAATGAATAGGGCAGGTTTGGTACCTACT
CGATCTGCTCTGAACATTCTCATTGGTGATTTGTGTTCGTTGAGTGCCAAAGAAGGGGCTGTAGAACGAGTTAAGGTCAGAAGTACACCTAGACCTTTTACCGTTCTAGT
TCCAAATGTGAATCCGAAGAGTGGAGCCATTGAACCTGCAGTTGGAGTTTTTTGGGCAGCTAATAAGCTGGCTTTAGTTCCCAGTGCTTTTGTTATAGTTCAGCTCATCT
CAGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGGGTATTGAAGGTTGTTGAGGGTGACAAGCTAAGATGTGCTGAAGAGTGTTATTCTGTTGTGATGCAAGCG
TTGTGCGAACATCGCCACGTGGAAGAAGCTAGTAATCTGTTTGGGAGGATGCTTTCTCAGGGCATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGCATGTTATG
CAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGGGAAAGATGCGCACCTGATCATGTTACTTATTCGGCGTTAATCCACGCCTATGGTGAAA
CTAGGAATTGGTCGGCAGCCTACAGTTTATTGAAGGAAATGCTGAGTTTTGGCATGTCTCCTCAGTTTCATGTGTATAGTATAGTGGATAAACTAATGAGGGAACATGGG
CAAATTGATCTGTGCTTGAAACTGGCAGTGAAATGGGAAGCCCAAATTTTGCAGAAGCTTTGTAAACAAGGACAACTGGAGGCCGCATATGAAAAGATGAAGTCAATGCT
TGAAAAGGGTTTTCATCCTCCTATCTATGTTAGAGATACTTTTGAGAGTGCATTTCAAAAGAAGGGTAAGTTTAAGATTGCACGGGAGTTGCTACAGAAGATGGACAGAG
TCCACCAACATGAGTCATTAACCAGAAATTCATCTTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTAATTTCACTCTGCTTTATAAAGCCAAGCTCCTCTCTCCGGCTCTGTTGCGGACACGTTTTAAAGAAAGAGAAGCGAATTCGAAAAAGAAAGAAAGACAACCT
AACAAAGAAAAATGGGAAGTGATTTCTTTAGCGAATCTGAAACGTGGGTTTCTTCTGTGAACGGAATTCGAGAAGACCCAGATGATGAAACCTCCATTGAAGAAGGTGAA
GGAATTGGGTTGGATTCGGATTTGGAGGGGTCGGCTCTAATGGGGGCGAAGAACAAGTTGTTGAAGAAGAGGAGTCAGGTTTTGCTTGAAGGGTTTGTGGAGGATGAGGA
TGATTTGATGAGGACGAAGAGCTTGACGGATGAGGATCTTGATGAGCTTAAAGGGTGTGTGGATCTAGGGTTTGGTTTCAGCTACGATGAGATTCCGGAGCTCTGCAACA
CTTTGCCGGCCTTGGAACTCTGTTATTCCATGAGCCAAAAGTACATGGACGACCACCAGAAGTCGCCGGAGAGCTCGCCGTTTTCGGCGGTGCCGACGGACTCTGGTTCG
TCCGTGTCCAGTCCGATTGCGAATTGGAAGATCTCCAGTCCTGGTGATCATCCAGAAGATGTGAAGGCAAGGCTCAAATTTTGGGCTCAGGCAGTGGCATTAACCTTTAG
CCTTATGGTTGATGACATTGCACGCTTCCAAGAAAAGATAGATCGAATGAATGAACAAACGGGTCATTTGATGAATCCGACTTATGATGATGAAGAGATGAAGAAGAAGA
ATGTGTCAGAGGCCGGGAATGTTACAGTCTCGATGTCATCTCTACTTCTGATTTCTAGTCTTCCTGAGGGCCTTGATGCCAAAGGAAAAGAAAATGTTTGGTTGAGGTTG
TATAAATTGAAAGAGCGAGAATTTTCGAACTTATTGGTTCCGCTTCAGCTTGCGGAATCGTCGATGTTGAGCAGGAGCACGATTAGGAATGCTTCGGCGTTGCTCAAATT
CGTTCCTTTTCATTTTTATGGCTTCTCTTCACACTTTTTCAGCACTTCATCGACCACAAAGCACATTGCCATAGCTCCAAGAGCTCTTGCAAGAACACCCACTTCACGAA
CTGCTCCAATCCCTCGGGCTCCGGACACCCCCGGCTCTACCGATGTCGTAAATTCAGTATGTGCTTTACTTTCAAACAAAAATCACCTAACAACTAATCTCGATCTTGAT
CATTTATTGAAAAGGTTCAAACACACCTTAACTTCGGATCTCGTTCTTCAAATTCTGATGAATTATAGGCTGTTGGGTCGGGCTAAAACGCTTGAATTCTTCTCTTGGTC
TGGATTGCAAATGGGGTATCGGTTTGATGAGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAGGAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTGACTGTGT
CGTCTCATATGGGTCGGCTTTCTTGTCGAACATTATCAATTTGTATCAGATTTTTGGGTAGGCAAGGGAGGGTTAGGGAAGCACTTTGCTTGTTCGAAGAAATGGAACCA
AAATTTGGGTGTAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCAACTGGGGAATTGATTGATACTGCTCTAACCATTTTCAGAAG
AATTGAATTGCCTGATAAATATTCATACAGTAATGTCATTATCGGATTGTGTAAATTTGGTCGGTTTGGTACAGCTATTGAAGTGTTTGATGAAATGAATAGGGCAGGTT
TGGTACCTACTCGATCTGCTCTGAACATTCTCATTGGTGATTTGTGTTCGTTGAGTGCCAAAGAAGGGGCTGTAGAACGAGTTAAGGTCAGAAGTACACCTAGACCTTTT
ACCGTTCTAGTTCCAAATGTGAATCCGAAGAGTGGAGCCATTGAACCTGCAGTTGGAGTTTTTTGGGCAGCTAATAAGCTGGCTTTAGTTCCCAGTGCTTTTGTTATAGT
TCAGCTCATCTCAGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGGGTATTGAAGGTTGTTGAGGGTGACAAGCTAAGATGTGCTGAAGAGTGTTATTCTGTTG
TGATGCAAGCGTTGTGCGAACATCGCCACGTGGAAGAAGCTAGTAATCTGTTTGGGAGGATGCTTTCTCAGGGCATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATT
TGCATGTTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGGGAAAGATGCGCACCTGATCATGTTACTTATTCGGCGTTAATCCACGC
CTATGGTGAAACTAGGAATTGGTCGGCAGCCTACAGTTTATTGAAGGAAATGCTGAGTTTTGGCATGTCTCCTCAGTTTCATGTGTATAGTATAGTGGATAAACTAATGA
GGGAACATGGGCAAATTGATCTGTGCTTGAAACTGGCAGTGAAATGGGAAGCCCAAATTTTGCAGAAGCTTTGTAAACAAGGACAACTGGAGGCCGCATATGAAAAGATG
AAGTCAATGCTTGAAAAGGGTTTTCATCCTCCTATCTATGTTAGAGATACTTTTGAGAGTGCATTTCAAAAGAAGGGTAAGTTTAAGATTGCACGGGAGTTGCTACAGAA
GATGGACAGAGTCCACCAACATGAGTCATTAACCAGAAATTCATCTTGA
Protein sequenceShow/hide protein sequence
MGSDFFSESETWVSSVNGIREDPDDETSIEEGEGIGLDSDLEGSALMGAKNKLLKKRSQVLLEGFVEDEDDLMRTKSLTDEDLDELKGCVDLGFGFSYDEIPELCNTLPA
LELCYSMSQKYMDDHQKSPESSPFSAVPTDSGSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVALTFSLMVDDIARFQEKIDRMNEQTGHLMNPTYDDEEMKKKNVSE
AGNVTVSMSSLLLISSLPEGLDAKGKENVWLRLYKLKEREFSNLLVPLQLAESSMLSRSTIRNASALLKFVPFHFYGFSSHFFSTSSTTKHIAIAPRALARTPTSRTAPI
PRAPDTPGSTDVVNSVCALLSNKNHLTTNLDLDHLLKRFKHTLTSDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHM
GRLSCRTLSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNVIIGLCKFGRFGTAIEVFDEMNRAGLVPT
RSALNILIGDLCSLSAKEGAVERVKVRSTPRPFTVLVPNVNPKSGAIEPAVGVFWAANKLALVPSAFVIVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMQA
LCEHRHVEEASNLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRERCAPDHVTYSALIHAYGETRNWSAAYSLLKEMLSFGMSPQFHVYSIVDKLMREHG
QIDLCLKLAVKWEAQILQKLCKQGQLEAAYEKMKSMLEKGFHPPIYVRDTFESAFQKKGKFKIARELLQKMDRVHQHESLTRNSS