CuGenDBv2

Sed0027917 (gene) of Chayote v1 genome

Gene ID: Sed0027917
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG02:26415080..26417362
RNA-Seq Expression: Sed0027917
Synteny: Sed0027917
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002625 - Smr domain
                  IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAG7015666.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]    0.0e+00    81.73
Query:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS
        MAA LSS +D+  +  P   L  +SP+ R    K+  +LC S          + PSLSEQL+ LS STLS +    ESHL   PKS WVNPTK K SVLS
Subjt:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS

Query:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI
        LQRQKRSSYSYNP M +LK FA  LNA DSSE AF A L+++PHPPTKENALL+LN+LKPW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGRQFQ I
Subjt:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI

Query:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG
        E LAN+MI TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEE L+LYERGRASGWKPDP TFSVLGKMFGEAG
Subjt:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG

Query:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM
        DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EMI+SGITPN KTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNM
Subjt:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM

Query:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC
        CADLGLEEEAEKLFEEMKKSE SRPDSWSYTAMLNIHGSGGNVK+SMELFEEM+ELGV INVM CTCLIQCLGKA ++DDLVRVF+V ++KG+KPDDRLC
Subjt:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC

Query:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH
        GCLLSVVSLCDN EDI+KV  CLQQANPKLV FVNLLQQNDITF+V+KDEFR IL +TATEARRPFCNCLIDICR QNL +RAHELL+LGSL+GLYPGLH
Subjt:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH

Query:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP
        NKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV REEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPF++REDRAGWF+ATREDVVAWVHSR P
Subjt:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP

Query:  SVATTA
        SVAT A
Subjt:  SVATTA

XP_008448710.1 PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis melo]    0.0e+00    81.86
Query:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP
        MAA LSS +D   +P P               L LL SSS   RK + I   S   + PSLS+QL+ LS +TLS+ P   E+ L  +PKS WVNPTK K 
Subjt:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP

Query:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR
        SVLSLQRQKRSSYSYNP M DLK+FA  LNACDSS E +F AAL ++PHPPTKENALL+LN+L+PW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGR
Subjt:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR

Query:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM
        QFQ IE LANDM+ TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDP TFSVLGKM
Subjt:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM

Query:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
        FGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEM++SGITPN KTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
Subjt:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN

Query:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP
        TLLNMCADLGLEEEAEKLFEEMKKS+ SRPDSWSYTAMLNI+GSGGNVK+SMELFEEM++LGVEINVM CTCLIQCLGK+G++DDLVRVFNV VQKGIKP
Subjt:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP

Query:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL
        DDRLCGCLLSVVSLCDN+EDINKV  CLQQANPKLV FVNLLQQN ITFEV+K+EFRNILS+TA+EARRPFCNCLIDICR QNLRERAHELL+LGSL+GL
Subjt:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL

Query:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV
        YPGLHNKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV R+EALPELLSAQTGAGTHRFSQGLANSFASHV+KLAAPFQ+REDRAGWF+ATRED+V WV
Subjt:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV

Query:  HSRAPSVATTA
        HSR PSV  TA
Subjt:  HSRAPSVATTA

XP_022145326.1 pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Momordica charantia]    0.0e+00    85.21
Query:  LLLLLSSSPVGRKKARILCCS-PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACD
        L +L SSS   RK  +    S  + PSLS+QL+ LS +TLS  PK  ESHL  +PKS WVNPTK K SVLSLQRQKRSSYSYNP M +LK+FA+ LNACD
Subjt:  LLLLLSSSPVGRKKARILCCS-PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACD

Query:  SSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKC
        SSE AF AAL ++PHPPTKENALL+LN+LKPW K Q+FF+WIK+QNLFP++TIFYNVAMKSLRYGRQFQ IE LAN+MI +G+ LDNITYSTIITCA KC
Subjt:  SSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKC

Query:  GRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD
         RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD
Subjt:  GRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD

Query:  AMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWS
        AMGKAGKPGFARSLFDEMI+SGITPN KTLTALVKIYGKARWARDAL LWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSE SRPDSWS
Subjt:  AMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWS

Query:  YTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPK
        YTAMLNIHGSGGNVK+SMELFEEM+ELGVEINVMGCTCLIQCLGKA ++DDLVRVF+V VQKGIKPDDRLCGCLLSVVSLCDN+EDINKV  CLQQANP 
Subjt:  YTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPK

Query:  LVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMST
        LV F+NLLQQN ITFEVVK+EFR IL +TATEARRPFCNCLIDICR QNLRERAHELL+LGSL+GLYPGLHNKTE+EWCLDVRSLSVGAAQTALEEWM+T
Subjt:  LVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMST

Query:  LSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT
        LSKIV REEALP+LLSAQTGAGTHRFSQGLANSFASHVEKLAAPF++REDRAGWF+ATRED+V+WVHSR PSVA T
Subjt:  LSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT

XP_022923274.1 pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like [Cucurbita moschata]    0.0e+00    81.73
Query:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS
        MAA LSS +D+  +  P   L  +SP+ R    K+  +LC S          + PSLSEQL+ LS STLS +    ESHL   PKS WVNPTK K SVLS
Subjt:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS

Query:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI
        LQRQKRSSYSYNP M +LK FA  LNA DSSE AF A L+++PHPPTKENALL+LN+LKPW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGRQFQ I
Subjt:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI

Query:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG
        E LAN+MI TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEE L+LYERGRASGWKPDP TFSVLGKMFGEAG
Subjt:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG

Query:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM
        DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EMI+SGITPN KTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNM
Subjt:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM

Query:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC
        CADLGLEEEAEKLFEEMKKSE SRPDSWSYTAMLNIHGSGGNVK+SMELFEEM+ELGV INVM CTCLIQCLGKA ++DDLVRVF+V V+KG++PDDRLC
Subjt:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC

Query:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH
        GCLLSVVSLCDN EDI+KV  CLQQANPKLV FVNLLQQNDITF+V+KDEFR IL +TATEARRPFCNCLIDICR QNL +RAHELL+LGSL+GLYPGLH
Subjt:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH

Query:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP
        NKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV REEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPF++REDRAGWF+ATREDVVAWVHSR P
Subjt:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP

Query:  SVATTA
        SVAT A
Subjt:  SVATTA

XP_038906217.1 pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Benincasa hispida]    0.0e+00    82.79
Query:  MAASLSSCVDVNPRP-------------PPLLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSV
        MAA LSS +D+N +P               L ++ +SS   RK + I   S   + PSLSEQL+ LS +TLS+ P   ESHL  +PKS WVNPTK K SV
Subjt:  MAASLSSCVDVNPRP-------------PPLLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSV

Query:  LSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQF
        LSLQRQKRSSYSYNP M DLK+FA  LNACDSS E AF AAL ++PHPPTKENALL+LN+L+PW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGRQF
Subjt:  LSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQF

Query:  QRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFG
        Q +E LAN+MI TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDP TFSVLGKMFG
Subjt:  QRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFG

Query:  EAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTL
        EAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEM++SGITPN KTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTL
Subjt:  EAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTL

Query:  LNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDD
        LNMCADLGLEEEAEKLFEEMKKSE SRPDSWSYTAMLNIHGSGGNVK+SMELFEEM++LGVEINVM CTCLIQCLGK+G++D+LVRVFNV VQKGIKPDD
Subjt:  LNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDD

Query:  RLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP
        RLCGCLLSVVSLCDN+EDINKV  CLQQA+PKLV FVNLLQQNDITFEVVK+EFRNIL +TATEARRPFCNCLIDICR QNLRERAHELL+LGSL+GLYP
Subjt:  RLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP

Query:  GLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHS
        GLHNKTEAEWCLDVRSLSVGAAQTALEEWM TLSKIV REEALPELLSAQTGAGTH+FSQGLANSFASHV+KLAAPFQ+REDRAGWF+ATRED+V WVHS
Subjt:  GLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHS

Query:  RAPSVATTA
        R PSVA TA
Subjt:  RAPSVATTA

TrEMBL top hits    e value    % identity    Alignment
A0A0A0L6K8 Smr domain-containing protein    0.0e+00    81.29
Query:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP
        MA  LSS +D+  +P P               L LL SSS   RK + +   S   + PSLSEQL+ LS +TLS+ P   E+ L  +PKS WVNPTK K 
Subjt:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP

Query:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR
        SVLSLQRQKRSSYSYNP M DLK+FA  LNACDSS+ A F AAL ++PHPPTKENALL+LN+L+PW K  LFF+WIK+QNLFP++TIFYNVAMKSLRYGR
Subjt:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR

Query:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM
        QFQ IE LAN+MI  G+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGW PDP TFSVLGKM
Subjt:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM

Query:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
        FGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEM++SGITPN KTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
Subjt:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN

Query:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP
        TLLNMCADLGLEEEAE LFEEMKKS+ SRPDSWSYTAMLNI+GSGGNVK+SMELFEEM+ELGVEINVM CTCLIQCLGK+G++DDLVRVFNV VQKGIKP
Subjt:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP

Query:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL
        DDRLCGCLLSV+SLC N+EDINKV  CLQQANPKLV F+NLLQQNDITFEVVK+EFRNIL +TA EARRPFCNCLIDICR QNLRERAHELL+LGSL+GL
Subjt:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL

Query:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV
        YPGLHNKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV REEALPELLSAQTGAGTHRFSQGLANSFASHV+KLAAPFQ+REDRAGWF+ATRED+V WV
Subjt:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV

Query:  HSRAPSVATTA
        HSR PSVA TA
Subjt:  HSRAPSVATTA

A0A1S3BKZ1 pentatricopeptide repeat-containing protein At5g46580, chloroplastic    0.0e+00    81.86
Query:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP
        MAA LSS +D   +P P               L LL SSS   RK + I   S   + PSLS+QL+ LS +TLS+ P   E+ L  +PKS WVNPTK K 
Subjt:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP

Query:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR
        SVLSLQRQKRSSYSYNP M DLK+FA  LNACDSS E +F AAL ++PHPPTKENALL+LN+L+PW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGR
Subjt:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR

Query:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM
        QFQ IE LANDM+ TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDP TFSVLGKM
Subjt:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM

Query:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
        FGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEM++SGITPN KTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
Subjt:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN

Query:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP
        TLLNMCADLGLEEEAEKLFEEMKKS+ SRPDSWSYTAMLNI+GSGGNVK+SMELFEEM++LGVEINVM CTCLIQCLGK+G++DDLVRVFNV VQKGIKP
Subjt:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP

Query:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL
        DDRLCGCLLSVVSLCDN+EDINKV  CLQQANPKLV FVNLLQQN ITFEV+K+EFRNILS+TA+EARRPFCNCLIDICR QNLRERAHELL+LGSL+GL
Subjt:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL

Query:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV
        YPGLHNKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV R+EALPELLSAQTGAGTHRFSQGLANSFASHV+KLAAPFQ+REDRAGWF+ATRED+V WV
Subjt:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV

Query:  HSRAPSVATTA
        HSR PSV  TA
Subjt:  HSRAPSVATTA

A0A5A7UFR2 Pentatricopeptide repeat-containing protein    0.0e+00    81.86
Query:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP
        MAA LSS +D   +P P               L LL SSS   RK + I   S   + PSLS+QL+ LS +TLS+ P   E+ L  +PKS WVNPTK K 
Subjt:  MAASLSSCVDVNPRPPP---------------LLLLLSSSPVGRKKARILCCS--PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKP

Query:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR
        SVLSLQRQKRSSYSYNP M DLK+FA  LNACDSS E +F AAL ++PHPPTKENALL+LN+L+PW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGR
Subjt:  SVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSS-EPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGR

Query:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM
        QFQ IE LANDM+ TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDP TFSVLGKM
Subjt:  QFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKM

Query:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
        FGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAGKPGFARSLFDEM++SGITPN KTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN
Subjt:  FGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYN

Query:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP
        TLLNMCADLGLEEEAEKLFEEMKKS+ SRPDSWSYTAMLNI+GSGGNVK+SMELFEEM++LGVEINVM CTCLIQCLGK+G++DDLVRVFNV VQKGIKP
Subjt:  TLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKP

Query:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL
        DDRLCGCLLSVVSLCDN+EDINKV  CLQQANPKLV FVNLLQQN ITFEV+K+EFRNILS+TA+EARRPFCNCLIDICR QNLRERAHELL+LGSL+GL
Subjt:  DDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGL

Query:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV
        YPGLHNKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV R+EALPELLSAQTGAGTHRFSQGLANSFASHV+KLAAPFQ+REDRAGWF+ATRED+V WV
Subjt:  YPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWV

Query:  HSRAPSVATTA
        HSR PSV  TA
Subjt:  HSRAPSVATTA

A0A6J1CW10 pentatricopeptide repeat-containing protein At5g46580, chloroplastic    0.0e+00    85.21
Query:  LLLLLSSSPVGRKKARILCCS-PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACD
        L +L SSS   RK  +    S  + PSLS+QL+ LS +TLS  PK  ESHL  +PKS WVNPTK K SVLSLQRQKRSSYSYNP M +LK+FA+ LNACD
Subjt:  LLLLLSSSPVGRKKARILCCS-PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACD

Query:  SSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKC
        SSE AF AAL ++PHPPTKENALL+LN+LKPW K Q+FF+WIK+QNLFP++TIFYNVAMKSLRYGRQFQ IE LAN+MI +G+ LDNITYSTIITCA KC
Subjt:  SSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKC

Query:  GRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD
         RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEEVL+LYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD
Subjt:  GRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLD

Query:  AMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWS
        AMGKAGKPGFARSLFDEMI+SGITPN KTLTALVKIYGKARWARDAL LWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSE SRPDSWS
Subjt:  AMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWS

Query:  YTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPK
        YTAMLNIHGSGGNVK+SMELFEEM+ELGVEINVMGCTCLIQCLGKA ++DDLVRVF+V VQKGIKPDDRLCGCLLSVVSLCDN+EDINKV  CLQQANP 
Subjt:  YTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPK

Query:  LVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMST
        LV F+NLLQQN ITFEVVK+EFR IL +TATEARRPFCNCLIDICR QNLRERAHELL+LGSL+GLYPGLHNKTE+EWCLDVRSLSVGAAQTALEEWM+T
Subjt:  LVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMST

Query:  LSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT
        LSKIV REEALP+LLSAQTGAGTHRFSQGLANSFASHVEKLAAPF++REDRAGWF+ATRED+V+WVHSR PSVA T
Subjt:  LSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT

A0A6J1E5X7 pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like    0.0e+00    81.73
Query:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS
        MAA LSS +D+  +  P   L  +SP+ R    K+  +LC S          + PSLSEQL+ LS STLS +    ESHL   PKS WVNPTK K SVLS
Subjt:  MAASLSSCVDVNPRPPPLLLLLSSSPVGR----KKARILCCS---------PQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLS

Query:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI
        LQRQKRSSYSYNP M +LK FA  LNA DSSE AF A L+++PHPPTKENALL+LN+LKPW K  LFF+WIKTQNLFP++TIFYNVAMKSLRYGRQFQ I
Subjt:  LQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRI

Query:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG
        E LAN+MI TG+ LDNITYSTIITCA KC RFDKA+EWFERMY TGLMPDEVTYSAILDVYANLGKVEE L+LYERGRASGWKPDP TFSVLGKMFGEAG
Subjt:  EHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAG

Query:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM
        DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG+PGFARSLF+EMI+SGITPN KTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNM
Subjt:  DYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNM

Query:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC
        CADLGLEEEAEKLFEEMKKSE SRPDSWSYTAMLNIHGSGGNVK+SMELFEEM+ELGV INVM CTCLIQCLGKA ++DDLVRVF+V V+KG++PDDRLC
Subjt:  CADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLC

Query:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH
        GCLLSVVSLCDN EDI+KV  CLQQANPKLV FVNLLQQNDITF+V+KDEFR IL +TATEARRPFCNCLIDICR QNL +RAHELL+LGSL+GLYPGLH
Subjt:  GCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLH

Query:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP
        NKTE EWCLDVRSLSVGAAQTALEEWM TLSKIV REEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPF++REDRAGWF+ATREDVVAWVHSR P
Subjt:  NKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAP

Query:  SVATTA
        SVAT A
Subjt:  SVATTA

SwissProt top hits    e value    % identity    Alignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic    1.5e-125    39.02
Query:  DPKPQESHLPR--QPKSR------WVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALR-QLPHPPTKENALLLLN--ALKP
        DP P     P   +P S       WVNP    P    + R +  S       A L + A  L AC+++E A  AAL+   P PP++++A+++LN  A   
Subjt:  DPKPQESHLPR--QPKSR------WVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALR-QLPHPPTKENALLLLN--ALKP

Query:  WPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDV
           A L   W           I YNV +K LR  R +   E L  +M+R GV  DN T+ST+I+CA  CG   KA+EWF++M   G  PD +TYSA++D 
Subjt:  WPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDV

Query:  YANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLT
        Y + G  E  L LY+R RA  W+ DPV  S + K+   +G++DG + V +EMK+I V+PNLVVYNT+LDAMG+A +P   +++  EM+   + P+  T  
Subjt:  YANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLT

Query:  ALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS--EKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGV
         L+  Y +AR+  DA+ ++  M+     +D +LYN LL+MCAD+G  +EAE++F +MK S    S+PDSWSY++M+ ++ S  NV  +  +  EM+E G 
Subjt:  ALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS--EKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGV

Query:  EINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNT--EDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILS
        + N+   T LI+C GK G+ DD+VR F +L   GI PDDR CGCLLSV +   NT  E++ KV++C++++N +L   V LL     + E  ++  R +L 
Subjt:  EINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNT--EDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILS

Query:  QTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTL-SKIVLREEALPELLSAQTGAGTHRF
         +    + P+CNCL+D+C   N  E+A  LL      G+Y  +  +T+ +W L +R LSVGAA T L  WM+ L + +    E LP LL   TG G + +
Subjt:  QTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTL-SKIVLREEALPELLSAQTGAGTHRF

Query:  S-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT
        S +GLA  F +H+++L APF    D+AGWF+ T      W+ S+A S   T
Subjt:  S-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic    1.9e-125    38.14
Query:  KARILCCSPQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSR------WVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAA
        +A  L   P+ PS S    R+S   + D P P     P   +S       WVNP    P    L R +  S       A L A A  L AC++ E   AA
Subjt:  KARILCCSPQTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSR------WVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAA

Query:  ALR-QLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNL-FPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKA
        AL    P PP++++A+++LN     P A +   W   +N     + I YNVA+K+LR  R++   E L  +M+R GV  DN T+ST+I+CA  CG   KA
Subjt:  ALR-QLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNL-FPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKA

Query:  LEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG
        +EWFE+M + G  PD +TYSA++D Y   G  E  L LY+R RA  W+ DPV  + + ++   +G++DG + V +EMK+  V+PNLVVYNT+LDAMG+A 
Subjt:  LEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAG

Query:  KPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS--EKSRPDSWSYTAM
        +P   +++  E++     PN  T   L+  Y +AR+  DA+ ++  M+     +D +LYN LL+MCAD+G  EEAE++F +MK S   +S+PDSWSY++M
Subjt:  KPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKS--EKSRPDSWSYTAM

Query:  LNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEF
        + ++   GNV  +  +  EM+E G + N+   T LI+C GKAG+ DD+VR F +L   GI PDDR CGCLL+V +     +++ KV+ C+ +++ +L   
Subjt:  LNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEF

Query:  VNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKI
        V LL       E +++    +L       R P+CNCL+D+    +  E+A  LL +    G+Y  +  +T+ +W L +R LSVGAA T L  WMS L   
Subjt:  VNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKI

Query:  VLREEALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSR
        +   + LP LL   TG G + +S +GLA  F SH+++L APF    D+AGWF+ T      W+ ++
Subjt:  VLREEALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSR

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic    4.7e-124    36.31
Query:  PPPLLLLLSSSPVGRKKARILCCSPQTPSLSEQ--LERLSASTLSDDPKPQESHL---------PRQPKSR-WVNPTKRKPSVLSLQRQKRSSYSYNPHM
        P PL  LLS  P    ++ +   +P +     +  L+    S     P+ ++S L         P   KS  WVNP  + P    L+R+     SY+   
Subjt:  PPPLLLLLSSSPVGRKKARILCCSPQTPSLSEQ--LERLSASTLSDDPKPQESHL---------PRQPKSR-WVNPTKRKPSVLSLQRQKRSSYSYNPHM

Query:  ADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLD
        + L   A  L+AC  +E      +        +++A++ LN +     A L  + +        + I YNV MK  R  +  ++ E L ++M+  G+  D
Subjt:  ADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLD

Query:  NITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSI
        N T++TII+CA + G   +A+EWFE+M + G  PD VT +A++D Y   G V+  L+LY+R R   W+ D VTFS L +++G +G+YDG + + +EMK++
Subjt:  NITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSI

Query:  EVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
         V+PNLV+YN L+D+MG+A +P  A+ ++ ++I +G TPN  T  ALV+ YG+AR+  DAL ++  M+  G  +  ILYNTLL+MCAD    +EA ++F+
Subjt:  EVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE

Query:  EMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTED
        +MK  E   PDSW++++++ ++   G V ++     +M E G E  +   T +IQC GKA +VDD+VR F+ +++ GI PDDR CGCLL+V++    +E+
Subjt:  EMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTED

Query:  INKVVACLQQANPKLVEFVNLL-QQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSL
        I K++ C+++A PKL + V +L ++ +    V K E   ++    ++ ++ + NCLID+C   N  ERA E+L LG  + +Y GL +K+  +W L ++SL
Subjt:  INKVVACLQQANPKLVEFVNLL-QQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSL

Query:  SVGAAQTALEEWMSTLSKIVLRE-EALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATTA
        S+GAA TAL  WM+ LS+  L   E  P LL   TG G H++S +GLA  F SH+++L APF    D+ GWF+ T     AW+ SR  +   +A
Subjt:  SVGAAQTALEEWMSTLSKIVLRE-EALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATTA

Q8GYP6 Pentatricopeptide repeat-containing protein At1g18900    3.9e-46    23.82
Query:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF
        PA   AL+ L        A  +L  +  +  A  FF+W+K Q  F  D   Y   + +L   +QF  I  L ++M+R G   + +TY+ +I    +    
Subjt:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF

Query:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG
        ++A+  F +M   G  PD VTY  ++D++A  G ++  + +Y+R +A G  PD  T+SV                                   +++ +G
Subjt:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG

Query:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA
        KAG    A  LF EM+  G TPN  T   ++ ++ KAR  ++AL L+  M++ G+  D + Y+ ++ +    G  EEAE +F EM++ +   PD   Y  
Subjt:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA

Query:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE
        ++++ G  GNV+K+ + ++ M+  G+  NV  C  L+    +  K+ +   +   ++  G++P  +    LLS  +   +  D+      +         
Subjt:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE

Query:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS
        F+  +       E V+   + F +++     E++R   + ++D       +E A  +  + +   ++P  L  K+ + W +++  +S G A TAL   ++
Subjt:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS

Query:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV
           K +L     P  +   TG G      G  +     VE+L     +PF      +G F+ + E +  W+
Subjt:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic | e value: 3.2e-266 | % identity: 66.82
Query:  QTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENA
        +TPSLSEQL+ LSA+TL    + +++ +  +PKS WVNPT+ K SVLSLQRQKRS+YSYNP + DL+AFA  LN+   +E + F + L ++PHPP ++NA
Subjt:  QTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENA

Query:  LLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDE
        LL+LN+L+ W K   FF+W+K+++LFP++TIFYNV MKSLR+GRQFQ IE +A +M++ GV LDNITYSTIITCA +C  ++KA+EWFERMY TGLMPDE
Subjt:  LLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDE

Query:  VTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSG
        VTYSAILDVY+  GKVEEVL+LYER  A+GWKPD + FSVLGKMFGEAGDYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AGKPG ARSLF+EM+++G
Subjt:  VTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSG

Query:  ITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFE
        +TPN KTLTALVKIYGKARWARDAL LWE M++  WPMDFILYNTLLNMCAD+GLEEEAE+LF +MK+S + RPD++SYTAMLNI+GSGG  +K+MELFE
Subjt:  ITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFE

Query:  EMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEF
        EM++ GV++NVMGCTCL+QCLGKA ++DD+V VF++ +++G+KPDDRLCGCLLSV++LC+++ED  KV+ACL++AN KLV FVNL+      +E VK+EF
Subjt:  EMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEF

Query:  RNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAG
        + +++ T  EARRPFCNCLIDICR  N  ERAHELL+LG+L GLYPGLHNKT  EW LDVRSLSVGAA+TALEEWM TL+ I+ R+E LPEL  AQTG G
Subjt:  RNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAG

Query:  THRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT
        THRFSQGLANSFA H+++L+APF+ + DR G F+AT+ED+V+W+ S+ P + T+
Subjt:  THRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT

Arabidopsis top hits | e value | % identity | Alignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.8e-47 | % identity: 23.82
Query:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF
        PA   AL+ L        A  +L  +  +  A  FF+W+K Q  F  D   Y   + +L   +QF  I  L ++M+R G   + +TY+ +I    +    
Subjt:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF

Query:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG
        ++A+  F +M   G  PD VTY  ++D++A  G ++  + +Y+R +A G  PD  T+SV                                   +++ +G
Subjt:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG

Query:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA
        KAG    A  LF EM+  G TPN  T   ++ ++ KAR  ++AL L+  M++ G+  D + Y+ ++ +    G  EEAE +F EM++ +   PD   Y  
Subjt:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA

Query:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE
        ++++ G  GNV+K+ + ++ M+  G+  NV  C  L+    +  K+ +   +   ++  G++P  +    LLS  +   +  D+      +         
Subjt:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE

Query:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS
        F+  +       E V+   + F +++     E++R   + ++D       +E A  +  + +   ++P  L  K+ + W +++  +S G A TAL   ++
Subjt:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS

Query:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV
           K +L     P  +   TG G      G  +     VE+L     +PF      +G F+ + E +  W+
Subjt:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.8e-47 | % identity: 23.82
Query:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF
        PA   AL+ L        A  +L  +  +  A  FF+W+K Q  F  D   Y   + +L   +QF  I  L ++M+R G   + +TY+ +I    +    
Subjt:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF

Query:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG
        ++A+  F +M   G  PD VTY  ++D++A  G ++  + +Y+R +A G  PD  T+SV                                   +++ +G
Subjt:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG

Query:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA
        KAG    A  LF EM+  G TPN  T   ++ ++ KAR  ++AL L+  M++ G+  D + Y+ ++ +    G  EEAE +F EM++ +   PD   Y  
Subjt:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA

Query:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE
        ++++ G  GNV+K+ + ++ M+  G+  NV  C  L+    +  K+ +   +   ++  G++P  +    LLS  +   +  D+      +         
Subjt:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE

Query:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS
        F+  +       E V+   + F +++     E++R   + ++D       +E A  +  + +   ++P  L  K+ + W +++  +S G A TAL   ++
Subjt:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS

Query:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV
           K +L     P  +   TG G      G  +     VE+L     +PF      +G F+ + E +  W+
Subjt:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.8e-47 | % identity: 23.82
Query:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF
        PA   AL+ L        A  +L  +  +  A  FF+W+K Q  F  D   Y   + +L   +QF  I  L ++M+R G   + +TY+ +I    +    
Subjt:  PAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRF

Query:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG
        ++A+  F +M   G  PD VTY  ++D++A  G ++  + +Y+R +A G  PD  T+SV                                   +++ +G
Subjt:  DKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMG

Query:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA
        KAG    A  LF EM+  G TPN  T   ++ ++ KAR  ++AL L+  M++ G+  D + Y+ ++ +    G  EEAE +F EM++ +   PD   Y  
Subjt:  KAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTA

Query:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE
        ++++ G  GNV+K+ + ++ M+  G+  NV  C  L+    +  K+ +   +   ++  G++P  +    LLS  +   +  D+      +         
Subjt:  MLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVE

Query:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS
        F+  +       E V+   + F +++     E++R   + ++D       +E A  +  + +   ++P  L  K+ + W +++  +S G A TAL   ++
Subjt:  FVNLLQQNDITFEVVK---DEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYP-GLHNKTEAEWCLDVRSLSVGAAQTALEEWMS

Query:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV
           K +L     P  +   TG G      G  +     VE+L     +PF      +G F+ + E +  W+
Subjt:  TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFQMREDRAGWFMATREDVVAWV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein | e value: 3.3e-125 | % identity: 36.31
Query:  PPPLLLLLSSSPVGRKKARILCCSPQTPSLSEQ--LERLSASTLSDDPKPQESHL---------PRQPKSR-WVNPTKRKPSVLSLQRQKRSSYSYNPHM
        P PL  LLS  P    ++ +   +P +     +  L+    S     P+ ++S L         P   KS  WVNP  + P    L+R+     SY+   
Subjt:  PPPLLLLLSSSPVGRKKARILCCSPQTPSLSEQ--LERLSASTLSDDPKPQESHL---------PRQPKSR-WVNPTKRKPSVLSLQRQKRSSYSYNPHM

Query:  ADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLD
        + L   A  L+AC  +E      +        +++A++ LN +     A L  + +        + I YNV MK  R  +  ++ E L ++M+  G+  D
Subjt:  ADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLD

Query:  NITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSI
        N T++TII+CA + G   +A+EWFE+M + G  PD VT +A++D Y   G V+  L+LY+R R   W+ D VTFS L +++G +G+YDG + + +EMK++
Subjt:  NITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSI

Query:  EVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE
         V+PNLV+YN L+D+MG+A +P  A+ ++ ++I +G TPN  T  ALV+ YG+AR+  DAL ++  M+  G  +  ILYNTLL+MCAD    +EA ++F+
Subjt:  EVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFE

Query:  EMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTED
        +MK  E   PDSW++++++ ++   G V ++     +M E G E  +   T +IQC GKA +VDD+VR F+ +++ GI PDDR CGCLL+V++    +E+
Subjt:  EMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTED

Query:  INKVVACLQQANPKLVEFVNLL-QQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSL
        I K++ C+++A PKL + V +L ++ +    V K E   ++    ++ ++ + NCLID+C   N  ERA E+L LG  + +Y GL +K+  +W L ++SL
Subjt:  INKVVACLQQANPKLVEFVNLL-QQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSL

Query:  SVGAAQTALEEWMSTLSKIVLRE-EALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATTA
        S+GAA TAL  WM+ LS+  L   E  P LL   TG G H++S +GLA  F SH+++L APF    D+ GWF+ T     AW+ SR  +   +A
Subjt:  SVGAAQTALEEWMSTLSKIVLRE-EALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATTA

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein | e value: 2.3e-267 | % identity: 66.82
Query:  QTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENA
        +TPSLSEQL+ LSA+TL    + +++ +  +PKS WVNPT+ K SVLSLQRQKRS+YSYNP + DL+AFA  LN+   +E + F + L ++PHPP ++NA
Subjt:  QTPSLSEQLERLSASTLSDDPKPQESHLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPA-FAAALRQLPHPPTKENA

Query:  LLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDE
        LL+LN+L+ W K   FF+W+K+++LFP++TIFYNV MKSLR+GRQFQ IE +A +M++ GV LDNITYSTIITCA +C  ++KA+EWFERMY TGLMPDE
Subjt:  LLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAMKSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDE

Query:  VTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSG
        VTYSAILDVY+  GKVEEVL+LYER  A+GWKPD + FSVLGKMFGEAGDYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AGKPG ARSLF+EM+++G
Subjt:  VTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSG

Query:  ITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFE
        +TPN KTLTALVKIYGKARWARDAL LWE M++  WPMDFILYNTLLNMCAD+GLEEEAE+LF +MK+S + RPD++SYTAMLNI+GSGG  +K+MELFE
Subjt:  ITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFE

Query:  EMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEF
        EM++ GV++NVMGCTCL+QCLGKA ++DD+V VF++ +++G+KPDDRLCGCLLSV++LC+++ED  KV+ACL++AN KLV FVNL+      +E VK+EF
Subjt:  EMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINKVVACLQQANPKLVEFVNLLQQNDITFEVVKDEF

Query:  RNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAG
        + +++ T  EARRPFCNCLIDICR  N  ERAHELL+LG+L GLYPGLHNKT  EW LDVRSLSVGAA+TALEEWM TL+ I+ R+E LPEL  AQTG G
Subjt:  RNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMSTLSKIVLREEALPELLSAQTGAG

Query:  THRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT
        THRFSQGLANSFA H+++L+APF+ + DR G F+AT+ED+V+W+ S+ P + T+
Subjt:  THRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATT


Sequences
CDS sequence
ATGCCAAAATGTAGTCCATTGGTTTTTCAAAACTCAATTAAAAAGAAAAAGAAAAAGAAAAAGAAAGAAAAGAAAAGAAAAAAAAGTTTGGTGAAGGTGTGGATGGAGAA
TGCGATAAGGCAGTGGTGTTCTCCCATGGCGGCTTCCCTTTCAAGCTGCGTGGATGTGAACCCAAGGCCGCCGCCGTTGTTGTTGTTGTTGTCCAGTTCCCCTGTTGGGC
GAAAGAAGGCGAGGATCCTGTGTTGTTCCCCCCAAACCCCCTCTCTATCCGAGCAGCTGGAGAGGCTGTCCGCATCCACGCTTTCCGATGATCCCAAACCCCAGGAATCC
CATCTCCCGCGGCAGCCCAAGTCGAGGTGGGTCAACCCCACGAAGCGCAAGCCCTCCGTCCTTTCCCTCCAAAGGCAGAAGCGCTCTTCCTACTCCTACAACCCCCACAT
GGCCGACCTCAAAGCCTTCGCCCGCCTCCTCAACGCCTGCGATTCCTCCGAGCCCGCCTTCGCCGCCGCCCTCCGCCAACTCCCCCATCCTCCCACCAAGGAAAACGCGC
TTCTCCTTCTCAACGCCTTGAAGCCCTGGCCCAAGGCTCAGCTCTTCTTCCATTGGATCAAGACCCAGAACCTGTTTCCTCTCGACACCATCTTCTACAACGTGGCTATG
AAGTCCTTGAGGTATGGCAGGCAGTTTCAGCGTATCGAACATCTTGCCAATGACATGATTCGCACTGGGGTTCACCTTGATAACATTACTTATTCTACTATAATCACTTG
TGCTAACAAGTGCGGTAGGTTTGATAAGGCTTTGGAGTGGTTTGAGCGCATGTATAACACTGGATTGATGCCTGATGAGGTTACTTACTCCGCTATTTTAGATGTTTATG
CTAATTTGGGGAAGGTGGAGGAGGTTCTTACTTTGTATGAAAGAGGGAGGGCGAGTGGCTGGAAGCCCGACCCTGTTACTTTCTCCGTCTTGGGGAAGATGTTTGGGGAA
GCAGGGGATTATGATGGGATTATGTATGTTTTGCAAGAAATGAAGTCTATTGAAGTGCAGCCTAATCTTGTGGTCTACAACACTTTGTTGGATGCCATGGGGAAGGCTGG
GAAGCCTGGTTTTGCGAGGAGCCTGTTCGACGAGATGATTCAGTCGGGGATAACCCCCAACGCAAAGACCTTGACTGCTTTGGTAAAGATTTATGGAAAGGCAAGGTGGG
CTCGGGATGCTTTAGACTTGTGGGAGCGGATGAGGTCGAATGGCTGGCCAATGGACTTTATTTTGTATAATACATTGTTGAATATGTGTGCTGACCTTGGCTTGGAGGAG
GAAGCTGAGAAGCTCTTTGAAGAGATGAAGAAGTCCGAGAAGTCTAGGCCGGATAGTTGGAGCTACACGGCGATGTTGAATATACATGGTAGTGGAGGTAATGTAAAAAA
ATCCATGGAGTTGTTTGAAGAAATGATCGAGTTGGGTGTTGAGATTAATGTGATGGGATGCACTTGTTTGATTCAATGCTTGGGGAAAGCTGGGAAAGTTGATGATCTAG
TCCGAGTTTTCAACGTTCTAGTACAGAAAGGAATTAAGCCAGATGACAGACTTTGTGGCTGTTTGCTGTCTGTTGTGTCCTTGTGTGACAATACTGAAGATATTAACAAG
GTAGTCGCTTGTCTGCAACAAGCTAACCCAAAGTTAGTTGAGTTTGTAAATCTTCTGCAACAAAATGACATTACCTTTGAAGTTGTCAAGGACGAATTCAGAAACATTCT
CAGCCAGACTGCCACGGAAGCCCGAAGACCCTTCTGCAATTGCCTAATTGATATATGTCGAACCCAAAATCTTCGAGAGAGAGCTCACGAACTGCTCTTCTTGGGAAGTC
TGCATGGACTGTACCCAGGCTTACACAACAAAACCGAAGCTGAATGGTGCCTCGATGTTCGATCTCTATCAGTAGGTGCCGCTCAGACTGCACTTGAAGAATGGATGTCA
ACTCTGTCGAAAATTGTACTACGAGAAGAAGCATTACCTGAATTGTTATCAGCTCAAACGGGTGCAGGAACTCATAGATTTTCTCAAGGACTAGCCAATTCATTTGCTTC
TCATGTAGAGAAACTTGCTGCTCCGTTTCAAATGCGAGAAGACCGGGCCGGTTGGTTCATGGCCACAAGGGAGGATGTAGTTGCATGGGTTCATTCAAGGGCGCCATCTG
TGGCTACCACAGCTTAA
mRNA sequence
TTTAGTTATGCCAAAATGTAGTCCATTGGTTTTTCAAAACTCAATTAAAAAGAAAAAGAAAAAGAAAAAGAAAGAAAAGAAAAGAAAAAAAAGTTTGGTGAAGGTGTGGA
TGGAGAATGCGATAAGGCAGTGGTGTTCTCCCATGGCGGCTTCCCTTTCAAGCTGCGTGGATGTGAACCCAAGGCCGCCGCCGTTGTTGTTGTTGTTGTCCAGTTCCCCT
GTTGGGCGAAAGAAGGCGAGGATCCTGTGTTGTTCCCCCCAAACCCCCTCTCTATCCGAGCAGCTGGAGAGGCTGTCCGCATCCACGCTTTCCGATGATCCCAAACCCCA
GGAATCCCATCTCCCGCGGCAGCCCAAGTCGAGGTGGGTCAACCCCACGAAGCGCAAGCCCTCCGTCCTTTCCCTCCAAAGGCAGAAGCGCTCTTCCTACTCCTACAACC
CCCACATGGCCGACCTCAAAGCCTTCGCCCGCCTCCTCAACGCCTGCGATTCCTCCGAGCCCGCCTTCGCCGCCGCCCTCCGCCAACTCCCCCATCCTCCCACCAAGGAA
AACGCGCTTCTCCTTCTCAACGCCTTGAAGCCCTGGCCCAAGGCTCAGCTCTTCTTCCATTGGATCAAGACCCAGAACCTGTTTCCTCTCGACACCATCTTCTACAACGT
GGCTATGAAGTCCTTGAGGTATGGCAGGCAGTTTCAGCGTATCGAACATCTTGCCAATGACATGATTCGCACTGGGGTTCACCTTGATAACATTACTTATTCTACTATAA
TCACTTGTGCTAACAAGTGCGGTAGGTTTGATAAGGCTTTGGAGTGGTTTGAGCGCATGTATAACACTGGATTGATGCCTGATGAGGTTACTTACTCCGCTATTTTAGAT
GTTTATGCTAATTTGGGGAAGGTGGAGGAGGTTCTTACTTTGTATGAAAGAGGGAGGGCGAGTGGCTGGAAGCCCGACCCTGTTACTTTCTCCGTCTTGGGGAAGATGTT
TGGGGAAGCAGGGGATTATGATGGGATTATGTATGTTTTGCAAGAAATGAAGTCTATTGAAGTGCAGCCTAATCTTGTGGTCTACAACACTTTGTTGGATGCCATGGGGA
AGGCTGGGAAGCCTGGTTTTGCGAGGAGCCTGTTCGACGAGATGATTCAGTCGGGGATAACCCCCAACGCAAAGACCTTGACTGCTTTGGTAAAGATTTATGGAAAGGCA
AGGTGGGCTCGGGATGCTTTAGACTTGTGGGAGCGGATGAGGTCGAATGGCTGGCCAATGGACTTTATTTTGTATAATACATTGTTGAATATGTGTGCTGACCTTGGCTT
GGAGGAGGAAGCTGAGAAGCTCTTTGAAGAGATGAAGAAGTCCGAGAAGTCTAGGCCGGATAGTTGGAGCTACACGGCGATGTTGAATATACATGGTAGTGGAGGTAATG
TAAAAAAATCCATGGAGTTGTTTGAAGAAATGATCGAGTTGGGTGTTGAGATTAATGTGATGGGATGCACTTGTTTGATTCAATGCTTGGGGAAAGCTGGGAAAGTTGAT
GATCTAGTCCGAGTTTTCAACGTTCTAGTACAGAAAGGAATTAAGCCAGATGACAGACTTTGTGGCTGTTTGCTGTCTGTTGTGTCCTTGTGTGACAATACTGAAGATAT
TAACAAGGTAGTCGCTTGTCTGCAACAAGCTAACCCAAAGTTAGTTGAGTTTGTAAATCTTCTGCAACAAAATGACATTACCTTTGAAGTTGTCAAGGACGAATTCAGAA
ACATTCTCAGCCAGACTGCCACGGAAGCCCGAAGACCCTTCTGCAATTGCCTAATTGATATATGTCGAACCCAAAATCTTCGAGAGAGAGCTCACGAACTGCTCTTCTTG
GGAAGTCTGCATGGACTGTACCCAGGCTTACACAACAAAACCGAAGCTGAATGGTGCCTCGATGTTCGATCTCTATCAGTAGGTGCCGCTCAGACTGCACTTGAAGAATG
GATGTCAACTCTGTCGAAAATTGTACTACGAGAAGAAGCATTACCTGAATTGTTATCAGCTCAAACGGGTGCAGGAACTCATAGATTTTCTCAAGGACTAGCCAATTCAT
TTGCTTCTCATGTAGAGAAACTTGCTGCTCCGTTTCAAATGCGAGAAGACCGGGCCGGTTGGTTCATGGCCACAAGGGAGGATGTAGTTGCATGGGTTCATTCAAGGGCG
CCATCTGTGGCTACCACAGCTTAACCAGCCATGTATTTAGCTTCTGTGAAGCACCTTTCATGGCTACATTGTTTGAGTTTTTT
Protein sequence
MPKCSPLVFQNSIKKKKKKKKKEKKRKKSLVKVWMENAIRQWCSPMAASLSSCVDVNPRPPPLLLLLSSSPVGRKKARILCCSPQTPSLSEQLERLSASTLSDDPKPQES
HLPRQPKSRWVNPTKRKPSVLSLQRQKRSSYSYNPHMADLKAFARLLNACDSSEPAFAAALRQLPHPPTKENALLLLNALKPWPKAQLFFHWIKTQNLFPLDTIFYNVAM
KSLRYGRQFQRIEHLANDMIRTGVHLDNITYSTIITCANKCGRFDKALEWFERMYNTGLMPDEVTYSAILDVYANLGKVEEVLTLYERGRASGWKPDPVTFSVLGKMFGE
AGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGKPGFARSLFDEMIQSGITPNAKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEE
EAEKLFEEMKKSEKSRPDSWSYTAMLNIHGSGGNVKKSMELFEEMIELGVEINVMGCTCLIQCLGKAGKVDDLVRVFNVLVQKGIKPDDRLCGCLLSVVSLCDNTEDINK
VVACLQQANPKLVEFVNLLQQNDITFEVVKDEFRNILSQTATEARRPFCNCLIDICRTQNLRERAHELLFLGSLHGLYPGLHNKTEAEWCLDVRSLSVGAAQTALEEWMS
TLSKIVLREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLAAPFQMREDRAGWFMATREDVVAWVHSRAPSVATTA