; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0016520 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0016520
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr10:1832167..1834683
RNA-Seq ExpressionPI0016520
SyntenyPI0016520
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452406.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cucumis melo]5.1e-24382.64Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------
        RMFLDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYST                                             
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------

Query:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF
                                 LIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFD+DIKLDLTAYTVIISGF
Subjt:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF

Query:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL
        HRVGRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGLSKHGYLQ+ARRY VKEKANEILYTVFIDAL
Subjt:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL

Query:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
        CKEGNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
Subjt:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI

Query:  LVRGYHNQGNVAAISGLYDEMRKRGITVED
        L+RGYHNQGN AAISGL+DEMRKRGITVED
Subjt:  LVRGYHNQGNVAAISGLYDEMRKRGITVED

XP_008452410.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01740 isoform X4 [Cucumis melo]2.1e-25295.22Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
        RMFLDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYSTLIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN

Query:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL
        VDDAIKYINQMFD+DIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGL
Subjt:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL

Query:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL
        SKHGYLQ+ARRY VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GL
Subjt:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL

Query:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        AEKGLMIEAKQVFDDMLNKGITPDFVAYDIL+RGYHNQGN AAISGL+DEMRKRGITVED
Subjt:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

XP_011654465.1 pentatricopeptide repeat-containing protein At2g01740 isoform X1 [Cucumis sativus]5.3e-25695.65Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILF GFAKVYMKNEAFMY GLMWKYCLPSIVTYGTFVDMFCKMGDM+M N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
        RMFLDM+KVGIVP+LVVFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN

Query:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL
        VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADVVTLSALMDGL
Subjt:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL

Query:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL
        SKHGYLQEARRYLVKE ANEILYTVFIDALCKEGNLD+AEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEH+EPDLLTYSSLI GL
Subjt:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL

Query:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        AEKGLMIEAKQVFDDMLNKGITPDFV+YDIL+RGYHNQGN AAISGL+DEMRKRGI VED
Subjt:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

XP_011654466.1 pentatricopeptide repeat-containing protein At2g01740 isoform X2 [Cucumis sativus]3.4e-25595.84Show/hide
Query:  FGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMF
        FGCSPDI SYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILF GFAKVYMKNEAFMY GLMWKYCLPSIVTYGTFVDMFCKMGDM+M NRMF
Subjt:  FGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMF

Query:  LDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDD
        LDM+KVGIVP+LVVFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDD
Subjt:  LDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDD

Query:  AIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKH
        AIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADVVTLSALMDGLSKH
Subjt:  AIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKH

Query:  GYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEK
        GYLQEARRYLVKE ANEILYTVFIDALCKEGNLD+AEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEH+EPDLLTYSSLI GLAEK
Subjt:  GYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEK

Query:  GLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        GLMIEAKQVFDDMLNKGITPDFV+YDIL+RGYHNQGN AAISGL+DEMRKRGI VED
Subjt:  GLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

XP_038891429.1 pentatricopeptide repeat-containing protein At2g01740 [Benincasa hispida]4.6e-24490.65Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        M +FGCSPDI SYNSLLDGYC+SYQIQKACFLV+RVRGCELNRPDLVMFNILF G AKVYMKNEAFMYLGLMWKYCLPS+VTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
        RMFLDM+KVG++P+LVVFSSLIDGYCKAGSLDVAFEYFERM+E SVRPNEFTYS LIDGC KHGML RADSLFEKMLSA ILPNCTVYTSIIDGHFKKGN
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN

Query:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL
        VDD IKYIN MFDR+IKLDLTAYTVIISGFHRVGR DKSMEAAEYV KNGLLPDRIILTAIMD HFKAGN+KEALNAYKILL+KGFE DVVT SAL+DGL
Subjt:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL

Query:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL
         KHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAE+ IKEMSEAGFV DK+VYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GL
Subjt:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL

Query:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        AEKGLMIEAKQVFDDM+NKGITPD V+YDIL+RGYHNQGN AAISGL+DEMRKRGIT+ED
Subjt:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

TrEMBL top hitse value%identityAlignment
A0A0A0KPG7 Uncharacterized protein6.6e-22895.64Show/hide
Query:  MFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVR
        MFNILF GFAKVYMKNEAFMY GLMWKYCLPSIVTYGTFVDMFCKMGDM+M NRMFLDM+KVGIVP+LVVFSSLIDGYCKAGSLDVAFEYFERMKE SVR
Subjt:  MFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVR

Query:  PNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVA
        PNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVA
Subjt:  PNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVA

Query:  KNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMS
        KNGLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADVVTLSALMDGLSKHGYLQEARRYLVKE ANEILYTVFIDALCKEGNLD+AEKMIKEMS
Subjt:  KNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMS

Query:  EAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGL
        EAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEH+EPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGITPDFV+YDIL+RGYHNQGN AAISGL
Subjt:  EAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGL

Query:  YDEMRKRGITVED
        +DEMRKRGI VED
Subjt:  YDEMRKRGITVED

A0A1S3BTQ4 pentatricopeptide repeat-containing protein At2g01740 isoform X22.5e-24382.64Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------
        RMFLDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYST                                             
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------

Query:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF
                                 LIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFD+DIKLDLTAYTVIISGF
Subjt:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF

Query:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL
        HRVGRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGLSKHGYLQ+ARRY VKEKANEILYTVFIDAL
Subjt:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL

Query:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
        CKEGNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
Subjt:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI

Query:  LVRGYHNQGNVAAISGLYDEMRKRGITVED
        L+RGYHNQGN AAISGL+DEMRKRGITVED
Subjt:  LVRGYHNQGNVAAISGLYDEMRKRGITVED

A0A1S3BUK0 pentatricopeptide repeat-containing protein At2g01740 isoform X12.5e-24382.64Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------
        RMFLDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYST                                             
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST---------------------------------------------

Query:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF
                                 LIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFD+DIKLDLTAYTVIISGF
Subjt:  -------------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGF

Query:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL
        HRVGRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGLSKHGYLQ+ARRY VKEKANEILYTVFIDAL
Subjt:  HRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL

Query:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
        CKEGNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI
Subjt:  CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDI

Query:  LVRGYHNQGNVAAISGLYDEMRKRGITVED
        L+RGYHNQGN AAISGL+DEMRKRGITVED
Subjt:  LVRGYHNQGNVAAISGLYDEMRKRGITVED

A0A1S3BUW7 pentatricopeptide repeat-containing protein At2g01740 isoform X41.0e-25295.22Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN
        MS FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM N
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMAN

Query:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
        RMFLDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYSTLIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN
Subjt:  RMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGN

Query:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL
        VDDAIKYINQMFD+DIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGL
Subjt:  VDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGL

Query:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL
        SKHGYLQ+ARRY VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GL
Subjt:  SKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGL

Query:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        AEKGLMIEAKQVFDDMLNKGITPDFVAYDIL+RGYHNQGN AAISGL+DEMRKRGITVED
Subjt:  AEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

A0A1S4DZF8 pentatricopeptide repeat-containing protein At2g01740 isoform X31.6e-24282.73Show/hide
Query:  FGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMF
        FGCSPDI SYNSLLDGYCSS QIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKY LPSIVTYGTFVDMFCKMGDMEM NRMF
Subjt:  FGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMF

Query:  LDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST------------------------------------------------
        LDM+KVGIVP+L+VFSSLIDGYCKAGSLDVAFEYFERMKE SVRPNEFTYST                                                
Subjt:  LDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYST------------------------------------------------

Query:  ----------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRV
                              LIDGCSK GMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFD+DIKLDLTAYTVIISGFHRV
Subjt:  ----------------------LIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRV

Query:  GRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDALCKE
        GRFDKSMEAAEYVAK GLLPDRIILTAIMD HFKAGNIKEALNAYKILL KGFEADV TLSALMDGLSKHGYLQ+ARRY VKEKANEILYTVFIDALCKE
Subjt:  GRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDALCKE

Query:  GNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVR
        GNLDEAEKMIKEMSEAGFVPDKFVYTS IAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLI GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDIL+R
Subjt:  GNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVR

Query:  GYHNQGNVAAISGLYDEMRKRGITVED
        GYHNQGN AAISGL+DEMRKRGITVED
Subjt:  GYHNQGNVAAISGLYDEMRKRGITVED

SwissProt top hitse value%identityAlignment
P0C894 Putative pentatricopeptide repeat-containing protein At2g021501.5e-6431.25Show/hide
Query:  GCSPDIASYNSLLDGYCSSYQI-QKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRM
        G  PD  +YNS++DG+    ++    CF       C    PD++ +N L   F K   +      Y  +      P++V+Y T VD FCK G M+ A + 
Subjt:  GCSPDIASYNSLLDGYCSSYQI-QKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRM

Query:  FLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVD
        ++DM +VG+VP+   ++SLID  CK G+L  AF     M +  V  N  TY+ LIDG      +  A+ LF KM +A ++PN   Y ++I G  K  N+D
Subjt:  FLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVD

Query:  DAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSK
         A++ +N++  R IK DL  Y   I G   + + + +      + + G+  + +I T +MDA+FK+GN  E L+    +     E  VVT   L+DGL K
Subjt:  DAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSK

Query:  HGYLQEARRYLVK------EKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSL
        +  + +A  Y  +       +AN  ++T  ID LCK+  ++ A  + ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y+SL
Subjt:  HGYLQEARRYLVK------EKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSL

Query:  ICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        + GL+    + +A+   ++M+ +GI PD V    +++ ++  G +     L   + K  +   D
Subjt:  ICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial1.6e-6130.81Show/hide
Query:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM
        F  +G+ W     ++ +Y   +   C++G ++ A+ + L M   G  P ++ +S++++GYC+ G LD  ++  E MK   ++PN + Y ++I    +   
Subjt:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM

Query:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH
        LA A+  F +M+   ILP+  VYT++IDG  K+G++  A K+  +M  RDI  D+  YT IISGF ++G   ++ +    +   GL PD +  T +++ +
Subjt:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH

Query:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS
         KAG++K+A   +  ++  G   +VVT + L+DGL K G L  A   L     +  + N   Y   ++ LCK GN++EA K++ E   AG   D   YT+
Subjt:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS

Query:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI
         +   CK G + KA  + K M+ + ++P ++T++ L+ G    G++ + +++ + ML KGI P+   ++ LV+ Y  + N+ A + +Y +M  RG+
Subjt:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI

Q9C8T7 Pentatricopeptide repeat-containing protein At1g633309.0e-5728.7Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELN-RPDLVMFNILFKGFAKVYMKNEAFMYLG-LMWKYCLPSIVTYGTFVDMFCKMGDMEM
        M   G  P I + +SLL+GYC   +I  A  LV+++   E+  RPD + F  L  G       +EA   +  ++ + C P++VTYG  V+  CK GD+++
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELN-RPDLVMFNILFKGFAKVYMKNEAFMYLG-LMWKYCLPSIVTYGTFVDMFCKMGDMEM

Query:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK
        A  +   M    I   +V+F+++ID  CK   +D A   F+ M+   +RPN  TYS+LI     +G  + A  L   M+   I PN   + ++ID   K+
Subjt:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK

Query:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD
        G   +A K  + M  R I  D+  Y  +I+GF    R DK+ +  E++      PD      ++    K+  +++    ++ +  +G   D VT + L+ 
Subjt:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD

Query:  GLSKHGYLQEARRYLVKEKANE------ILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLT
        GL   G    A++ + K+  ++      + Y++ +D LC  G L++A ++   M ++    D ++YT+ I  +CK G +   + +   +  + ++P+++T
Subjt:  GLSKHGYLQEARRYLVKEKANE------ILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLT

Query:  YSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMR
        Y+++I GL  K L+ EA  +   M   G  PD   Y+ L+R +   G+ AA + L  EMR
Subjt:  YSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMR

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397108.7e-6029.3Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMA
        M + GC P++ +YN+L+DGYC   +I    F + R    +   P+L+ +N++  G  +   MK  +F+   +  +      VTY T +  +CK G+   A
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMA

Query:  NRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKG
          M  +M++ G+ P ++ ++SLI   CKAG+++ A E+ ++M+   + PNE TY+TL+DG S+ G +  A  +  +M      P+   Y ++I+GH   G
Subjt:  NRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKG

Query:  NVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDG
         ++DAI  +  M ++ +  D+ +Y+ ++SGF R    D+++     + + G+ PD I  ++++    +    KEA + Y+ +L  G   D          
Subjt:  NVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDG

Query:  LSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS-----
                            E  YT  I+A C EG+L++A ++  EM E G +PD   Y+  I  L KQ    +A  +  ++  E   P  +TY      
Subjt:  LSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS-----

Query:  ----------SLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRG
                  SLI G   KG+M EA QVF+ ML K   PD  AY+I++ G+   G++     LY EM K G
Subjt:  ----------SLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRG

Q9ZUA2 Pentatricopeptide repeat-containing protein At2g017401.1e-13148.47Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCE--LNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEM
        M  FGC PD+ SYNSL+DG+C +  I+ A  ++  +R     + +PD+V FN LF GF+K+ M +E F+Y+G+M K C P++VTY T++D FCK G++++
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCE--LNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEM

Query:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK
        A + F  M +  + P++V F+ LIDGYCKAG L+VA   ++ M+   +  N  TY+ LIDG  K G + RA+ ++ +M+   + PN  VYT+IIDG F++
Subjt:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK

Query:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD
        G+ D+A+K++ +M ++ ++LD+TAY VIISG    G+  ++ E  E + K+ L+PD +I T +M+A+FK+G +K A+N Y  L+ +GFE DVV LS ++D
Subjt:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD

Query:  GLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIC
        G++K+G L EA  Y   EKAN+++YTV IDALCKEG+  E E++  ++SEAG VPDKF+YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI 
Subjt:  GLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIC

Query:  GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI
        GLA KGLM+EA+QVFD+MLN GI+PD   +D+L+R Y  +GN+AA S L  +M++RG+
Subjt:  GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.1e-6230.81Show/hide
Query:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM
        F  +G+ W     ++ +Y   +   C++G ++ A+ + L M   G  P ++ +S++++GYC+ G LD  ++  E MK   ++PN + Y ++I    +   
Subjt:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM

Query:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH
        LA A+  F +M+   ILP+  VYT++IDG  K+G++  A K+  +M  RDI  D+  YT IISGF ++G   ++ +    +   GL PD +  T +++ +
Subjt:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH

Query:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS
         KAG++K+A   +  ++  G   +VVT + L+DGL K G L  A   L     +  + N   Y   ++ LCK GN++EA K++ E   AG   D   YT+
Subjt:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS

Query:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI
         +   CK G + KA  + K M+ + ++P ++T++ L+ G    G++ + +++ + ML KGI P+   ++ LV+ Y  + N+ A + +Y +M  RG+
Subjt:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.1e-6230.81Show/hide
Query:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM
        F  +G+ W     ++ +Y   +   C++G ++ A+ + L M   G  P ++ +S++++GYC+ G LD  ++  E MK   ++PN + Y ++I    +   
Subjt:  FMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGM

Query:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH
        LA A+  F +M+   ILP+  VYT++IDG  K+G++  A K+  +M  RDI  D+  YT IISGF ++G   ++ +    +   GL PD +  T +++ +
Subjt:  LARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAH

Query:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS
         KAG++K+A   +  ++  G   +VVT + L+DGL K G L  A   L     +  + N   Y   ++ LCK GN++EA K++ E   AG   D   YT+
Subjt:  FKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYL-----VKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTS

Query:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI
         +   CK G + KA  + K M+ + ++P ++T++ L+ G    G++ + +++ + ML KGI P+   ++ LV+ Y  + N+ A + +Y +M  RG+
Subjt:  WIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI

AT2G01740.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.9e-13348.47Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCE--LNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEM
        M  FGC PD+ SYNSL+DG+C +  I+ A  ++  +R     + +PD+V FN LF GF+K+ M +E F+Y+G+M K C P++VTY T++D FCK G++++
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCE--LNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEM

Query:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK
        A + F  M +  + P++V F+ LIDGYCKAG L+VA   ++ M+   +  N  TY+ LIDG  K G + RA+ ++ +M+   + PN  VYT+IIDG F++
Subjt:  ANRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKK

Query:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD
        G+ D+A+K++ +M ++ ++LD+TAY VIISG    G+  ++ E  E + K+ L+PD +I T +M+A+FK+G +K A+N Y  L+ +GFE DVV LS ++D
Subjt:  GNVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMD

Query:  GLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIC
        G++K+G L EA  Y   EKAN+++YTV IDALCKEG+  E E++  ++SEAG VPDKF+YTSWIA LCKQGNL+ AF +K RMVQE +  DLL Y++LI 
Subjt:  GLSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLIC

Query:  GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI
        GLA KGLM+EA+QVFD+MLN GI+PD   +D+L+R Y  +GN+AA S L  +M++RG+
Subjt:  GLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGI

AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-6531.25Show/hide
Query:  GCSPDIASYNSLLDGYCSSYQI-QKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRM
        G  PD  +YNS++DG+    ++    CF       C    PD++ +N L   F K   +      Y  +      P++V+Y T VD FCK G M+ A + 
Subjt:  GCSPDIASYNSLLDGYCSSYQI-QKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRM

Query:  FLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVD
        ++DM +VG+VP+   ++SLID  CK G+L  AF     M +  V  N  TY+ LIDG      +  A+ LF KM +A ++PN   Y ++I G  K  N+D
Subjt:  FLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVD

Query:  DAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSK
         A++ +N++  R IK DL  Y   I G   + + + +      + + G+  + +I T +MDA+FK+GN  E L+    +     E  VVT   L+DGL K
Subjt:  DAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSK

Query:  HGYLQEARRYLVK------EKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSL
        +  + +A  Y  +       +AN  ++T  ID LCK+  ++ A  + ++M + G VPD+  YTS +    KQGN+L+A  ++ +M +  ++ DLL Y+SL
Subjt:  HGYLQEARRYLVK------EKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSL

Query:  ICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED
        + GL+    + +A+   ++M+ +GI PD V    +++ ++  G +     L   + K  +   D
Subjt:  ICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRGITVED

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.2e-6129.3Show/hide
Query:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMA
        M + GC P++ +YN+L+DGYC   +I    F + R    +   P+L+ +N++  G  +   MK  +F+   +  +      VTY T +  +CK G+   A
Subjt:  MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKV-YMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMA

Query:  NRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKG
          M  +M++ G+ P ++ ++SLI   CKAG+++ A E+ ++M+   + PNE TY+TL+DG S+ G +  A  +  +M      P+   Y ++I+GH   G
Subjt:  NRMFLDMVKVGIVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKG

Query:  NVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDG
         ++DAI  +  M ++ +  D+ +Y+ ++SGF R    D+++     + + G+ PD I  ++++    +    KEA + Y+ +L  G   D          
Subjt:  NVDDAIKYINQMFDRDIKLDLTAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDG

Query:  LSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS-----
                            E  YT  I+A C EG+L++A ++  EM E G +PD   Y+  I  L KQ    +A  +  ++  E   P  +TY      
Subjt:  LSKHGYLQEARRYLVKEKANEILYTVFIDALCKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYS-----

Query:  ----------SLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRG
                  SLI G   KG+M EA QVF+ ML K   PD  AY+I++ G+   G++     LY EM K G
Subjt:  ----------SLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGNVAAISGLYDEMRKRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCTTTGGGTGCTCGCCTGATATTGCGTCCTACAATTCTTTGTTAGATGGGTATTGTTCGAGTTATCAAATTCAGAAAGCTTGCTTTCTTGTGAATAGAGTTCG
TGGGTGTGAGTTAAATAGGCCTGATTTGGTTATGTTTAATATACTGTTTAAGGGGTTTGCTAAAGTTTATATGAAAAATGAGGCATTTATGTATTTGGGTTTGATGTGGA
AATACTGTTTGCCTAGTATTGTTACTTACGGTACGTTTGTTGATATGTTCTGTAAGATGGGGGATATGGAGATGGCTAATAGAATGTTTTTGGATATGGTGAAGGTTGGG
ATTGTGCCTCATTTGGTTGTTTTTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAGTTTGGATGTCGCATTTGAATACTTTGAGAGAATGAAGGAATATTCGGTTCG
GCCGAATGAGTTTACATATTCCACGTTGATTGATGGTTGCTCCAAGCATGGGATGTTGGCAAGAGCTGACTCTTTGTTTGAAAAGATGTTGAGTGCTAGTATTCTGCCTA
ATTGTACGGTTTACACTTCAATAATAGATGGTCATTTTAAGAAGGGAAATGTAGACGATGCGATAAAGTATATTAATCAGATGTTTGATCGAGATATAAAACTAGATTTA
ACAGCATATACGGTAATTATCTCAGGCTTTCATAGAGTTGGTAGGTTTGATAAATCAATGGAAGCTGCAGAATATGTGGCGAAGAATGGATTACTTCCTGATAGGATAAT
ATTGACAGCTATTATGGATGCCCATTTCAAAGCTGGAAACATAAAAGAAGCTCTGAATGCATACAAAATATTACTCACGAAGGGTTTCGAGGCTGATGTTGTGACTCTTT
CCGCCCTAATGGATGGCCTATCCAAGCATGGGTATTTGCAGGAGGCTAGACGGTATTTGGTCAAAGAAAAGGCCAATGAAATTCTATATACAGTGTTTATAGATGCACTA
TGCAAGGAGGGTAATTTAGATGAAGCTGAGAAAATGATTAAGGAGATGTCTGAGGCAGGGTTTGTTCCAGATAAATTTGTGTACACTTCGTGGATTGCTGAACTTTGCAA
GCAAGGAAATTTGCTGAAGGCTTTCATGGTTAAGAAAAGGATGGTTCAAGAGCATATTGAACCTGATTTATTAACTTATAGCTCCCTCATTTGTGGTTTGGCTGAGAAGG
GGCTAATGATAGAAGCCAAACAGGTTTTTGATGACATGTTAAATAAAGGAATCACTCCAGATTTTGTTGCTTATGACATCCTCGTTAGAGGATATCATAATCAGGGTAAT
GTAGCTGCGATTTCAGGTCTATATGACGAAATGAGAAAGAGAGGAATCACTGTTGAAGATTAG
mRNA sequenceShow/hide mRNA sequence
CATAATTCCAAAAGGAGTATACATACGTTTTTCCCACCAAAATCCCTTCCTTGGTTCAAAAACTTGTGAAGTGCCGCTAGGGTCTAAGGCCAAAACCAAAACCCCTTCAA
CCAAACTCATCTTCTATCCTCGAGTGTCTTTCTTCAACATTCTCTCCCACTCAAACTCATCTTCGAATGGTCAAAGAAGTCCTCCAATTCTTGGCTCACCTCCGACGAAT
CTCCCGCTTTCCCACCCCTTTCATCTGCAACAAGCTTCTCCACTCTCTCATCAACTCCGGCTGCGGCGACCTCTCCGCCAAATTGCTTTTCCACCTCCTCTCCAAAGGGT
ATACTCCTCATCCATCTTCTTTCAATTCCATTATCTCCTTCTTCTGTAGATCAGGGAATGTGAAATTTTCGAACAGATTTTCATTTCAATGTCTAGCTTTGGGTGCTCGC
CTGATATTGCGTCCTACAATTCTTTGTTAGATGGGTATTGTTCGAGTTATCAAATTCAGAAAGCTTGCTTTCTTGTGAATAGAGTTCGTGGGTGTGAGTTAAATAGGCCT
GATTTGGTTATGTTTAATATACTGTTTAAGGGGTTTGCTAAAGTTTATATGAAAAATGAGGCATTTATGTATTTGGGTTTGATGTGGAAATACTGTTTGCCTAGTATTGT
TACTTACGGTACGTTTGTTGATATGTTCTGTAAGATGGGGGATATGGAGATGGCTAATAGAATGTTTTTGGATATGGTGAAGGTTGGGATTGTGCCTCATTTGGTTGTTT
TTAGCTCCTTGATTGATGGCTATTGCAAGGCTGGGAGTTTGGATGTCGCATTTGAATACTTTGAGAGAATGAAGGAATATTCGGTTCGGCCGAATGAGTTTACATATTCC
ACGTTGATTGATGGTTGCTCCAAGCATGGGATGTTGGCAAGAGCTGACTCTTTGTTTGAAAAGATGTTGAGTGCTAGTATTCTGCCTAATTGTACGGTTTACACTTCAAT
AATAGATGGTCATTTTAAGAAGGGAAATGTAGACGATGCGATAAAGTATATTAATCAGATGTTTGATCGAGATATAAAACTAGATTTAACAGCATATACGGTAATTATCT
CAGGCTTTCATAGAGTTGGTAGGTTTGATAAATCAATGGAAGCTGCAGAATATGTGGCGAAGAATGGATTACTTCCTGATAGGATAATATTGACAGCTATTATGGATGCC
CATTTCAAAGCTGGAAACATAAAAGAAGCTCTGAATGCATACAAAATATTACTCACGAAGGGTTTCGAGGCTGATGTTGTGACTCTTTCCGCCCTAATGGATGGCCTATC
CAAGCATGGGTATTTGCAGGAGGCTAGACGGTATTTGGTCAAAGAAAAGGCCAATGAAATTCTATATACAGTGTTTATAGATGCACTATGCAAGGAGGGTAATTTAGATG
AAGCTGAGAAAATGATTAAGGAGATGTCTGAGGCAGGGTTTGTTCCAGATAAATTTGTGTACACTTCGTGGATTGCTGAACTTTGCAAGCAAGGAAATTTGCTGAAGGCT
TTCATGGTTAAGAAAAGGATGGTTCAAGAGCATATTGAACCTGATTTATTAACTTATAGCTCCCTCATTTGTGGTTTGGCTGAGAAGGGGCTAATGATAGAAGCCAAACA
GGTTTTTGATGACATGTTAAATAAAGGAATCACTCCAGATTTTGTTGCTTATGACATCCTCGTTAGAGGATATCATAATCAGGGTAATGTAGCTGCGATTTCAGGTCTAT
ATGACGAAATGAGAAAGAGAGGAATCACTGTTGAAGATTAGTTCACCAAATATTGAAGCCTGGTCTTGTTCTTCCCTGGTCTTGTTCTTCAATTTATACATCAGCCTTCC
ATACGGGAAGCCGAGATGTATGGGATTGACGTCTGCTGACTGGATCGTGATTTTGTTGACTATGAAGGATATTTCAGACGAAAACAAAAATATATAGAGAGAGTGATTCA
AACTCAAAGTCATGTTCCTTCATATATTCCTTTGCTTATCACAAGTTATGATTATGAAAGCTCCCAAATGGTTAAATTTTGCCAATCACCTTTGGGTTATCGATATAAGT
CATAAGATTTTGTTCTTCTCATTCTACTGTCTAGTTGCAACATAAGATCAGCCAAGTGGATGGCATTGTTTGAGATTCCACCTGATGGATTGCTTGTAGTGTTGGGGAAT
TTCAACTCATTGGCTGAAAGGTATATATCAATTCTTGCAAGCTAGAGCTTTTATATGACCAAAATGGAAACACATTGGTATTATTGCATATGAGGAAGCTCTCATCAGAA
ACAGAAACAAGCTATTGTATAATTTGTCACTTCCTGCACTTGATGAATATGATTTACTTTGAGCACTTTGATGATGTATTGTAAAACTTTCCAATGTGAATTACGATTAT
TTGAA
Protein sequenceShow/hide protein sequence
MSSFGCSPDIASYNSLLDGYCSSYQIQKACFLVNRVRGCELNRPDLVMFNILFKGFAKVYMKNEAFMYLGLMWKYCLPSIVTYGTFVDMFCKMGDMEMANRMFLDMVKVG
IVPHLVVFSSLIDGYCKAGSLDVAFEYFERMKEYSVRPNEFTYSTLIDGCSKHGMLARADSLFEKMLSASILPNCTVYTSIIDGHFKKGNVDDAIKYINQMFDRDIKLDL
TAYTVIISGFHRVGRFDKSMEAAEYVAKNGLLPDRIILTAIMDAHFKAGNIKEALNAYKILLTKGFEADVVTLSALMDGLSKHGYLQEARRYLVKEKANEILYTVFIDAL
CKEGNLDEAEKMIKEMSEAGFVPDKFVYTSWIAELCKQGNLLKAFMVKKRMVQEHIEPDLLTYSSLICGLAEKGLMIEAKQVFDDMLNKGITPDFVAYDILVRGYHNQGN
VAAISGLYDEMRKRGITVED