CuGenDBv2

Spg005794 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg005794
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold11:111260..114394
RNA-Seq Expression: Spg005794
Synteny: Spg005794
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease
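
The genome location in this record is given as a sequence name followed by a start..end range. As a minimal sketch of how that field could be split and the gene span computed, the Python below assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names parse_location and span_length are hypothetical, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("scaffold11:111260..114394")
    print(seqid, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only span_length would need to change.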


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 86.84)
Query:  MNLLHPKGPSTKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQL
        MNL++PK    KV    +S+  LL   N  S  SMSIRTSAFATVTLLRSLTL  SQCHHHF         L IPTYSAKGR+QLPRIPAFASSS  E L
Subjt:  MNLLHPKGPSTKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQL

Query:  VHDRDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCL
        V+DRDSP+ESEE LCSPYS  AEG      FASADLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAYV VHCL
Subjt:  VHDRDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCL

Query:  RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLS
        RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGYQPRLS
Subjt:  RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLS

Query:  LHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSW
        LHNSLF+AL+SKPGDLSKHHLKQAEFIYHN+ TTGLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEMQQAGIEEEREVL+SILRASSK+GDVMEAERSW
Subjt:  LHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSW

Query:  LKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL
        LKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E FREMEQLN  SAAAYQTIIGILCK +++ LAES+M  FIKSNLKPL PAYVDLMNMFFNLSLHDKL
Subjt:  LKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL

Query:  ELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR
        ELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RAEEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR
Subjt:  ELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR

Query:  KEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWP
        KE++KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEF + CSTHS LRRHIYEQYHEWLH ASKLSDSD DIPYKFCTVSHSYFGFYADQFWP
Subjt:  KEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWP

Query:  QGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAG
        +GHPAIPNLIHRWLSPRVLAYWYMYGG RI SGD +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA 
Subjt:  QGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAG

Query:  S-DSEGALIETENINFDIQSYSDEEASN
        S + E A  ET NINFD QS SDEEAS+
Subjt:  S-DSEGALIETENINFDIQSYSDEEASN
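
The unlabeled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch of how one block could be tallied, the Python below counts those columns. The function name midline_stats is hypothetical, and the strings are assumed to have had the "Query:" prefix and its matching indentation removed already. Counts from a single block will not reproduce the % identity reported for the whole hit.

```python
def midline_stats(query, midline):
    """Return (identical, positive, columns) for one alignment block.

    identical: columns where the midline shows a residue letter
    positive:  identical columns plus '+' (conservative) columns
    columns:   block width, taken from the query line
    """
    # Trailing blanks on the midline are often lost in text dumps,
    # so pad it back out to the width of the query line.
    midline = midline.ljust(len(query))
    identical = sum(1 for m in midline if m.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(query)


if __name__ == "__main__":
    # Final block of the first GenBank hit in this record.
    query = "S-DSEGALIETENINFDIQSYSDEEASN"
    midline = "S + E A  ET NINFD QS SDEEAS+"
    identical, positive, columns = midline_stats(query, midline)
    print(f"identical={identical} positive={positive} columns={columns}")
```

Summing the three counts over every block of a hit, then dividing identical by columns, gives an identity estimate over the aligned region.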

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata] (e value: 0.0e+00; % identity: 89.04)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTL  SQCHHHF         L IPTYSAKGR+QLPRIPAFASSS  E LV+DRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEM QAGIEEEREVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS SAAAYQTIIGILCKF+++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF + CSTHS LRRHI+EQYHEWLHPASKLSDSD DIPYKFCTVSHSYFGFYADQFWP+GHP IPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA S + E A  ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima] (e value: 0.0e+00; % identity: 89.42)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTL  SQCH+HF         L IPTYSAKGR+QLPRIPAFASSS  E LV+DRDSP+ESEE LCSPYSN AE       FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEMQQAGIEEEREVL+SILRASSK+GDVMEAERSWLK+K+FDGSMPSQAFVYKMEVYAKVGNPMKALE 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS S+AAYQTIIGILCKF+++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF + CSTHS LRRH+YEQYHEWLHPASKLSDSD DIPYKFCTVSHSYFGFYADQFWP+GHPAIPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA + + E A+ ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 88.79)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTLS   CHHHF         L IPTYSAKGR+QLPRIPAFASSS  E LVHDRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAH PGTLIRLLNAQRKWM+QDDAAYV VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIY GLIWLHSYQDT+DKERI+SLRKEMQQAGIEEEREVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF +  STHS LRRHIYEQYHEWLHPASK SDSD DIPYKFCTVSHSYFGFYADQFWP+GHPAIPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQA S + E A+ ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

XP_023525582.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 88.79)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTLS   CHHHF         L IPTYSAKGR+QL RIPAFASSS  E LVHDRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAYV VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIY GLIWLHSYQDT+DKERI+SLRKEMQQAGIEEEREVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF +  STHS LRRHIYEQYHEWLHPASK SDSD DIPYKFCTVSHSYFGFYADQFWP+GHPAIPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQA S + E A+ ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein (e value: 0.0e+00; % identity: 85.19)
Query:  VFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF
        VFSMSI TSAF+TVT LRSLTLSLS  HH+F         LF+P YS K R+QLPRI AFAS SF +QLV+D DSPSESEEHL S +SN  +GFH+EN F
Subjt:  VFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL
        AS DLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPA KPGT+IRLLNAQ+KWM QDDA Y+ VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA
        VT+GLELHKD+YGGLIWLHSYQDTID+ERIVSLRKEMQQAGI+EEREVLLSILRASSKMGDVMEAE+ W +LK  DG+MPSQAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA

Query:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LE FREMEQLNST+AAAYQTIIGILCKFQ IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES
        +RAEEIFSQM+TNGEIG+NARSCNIIL GYLL GNY+KAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEV+KP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRIL
        D+ RKNHRIQFEF + C THS+LRRHIYEQYH+WLH ASKL+D D+DIPYKFCTVSHSYFGFYADQFWP+G  AIPNLIHRWLSPRVLAYWYMYGG R  
Subjt:  DEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRIL

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
        SGDILLKLKGSHEGVEKIVKSLREKS++CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LK+S QA S +  G L  +ENINFD +S S EE SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 0.0e+00; % identity: 87.94)
Query:  VFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF
        VFSMSI TSAF+TVTLLRSLTLSLS  HH+F         LFI +YS K R QLPRI AFAS SF +QLV+DRDSPSESEEHL SPYSN  +GFH+EN F
Subjt:  VFSMSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL
        AS DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA Y+TVHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA
        VT+GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGI+EE+EVLLSILRASSKMGDV+EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA

Query:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LE FREMEQLNST+AAAYQTIIGILCKFQ+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES
        +RAEEIFSQM+TNGEIGVNARSCN+IL GYLLFGNY+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEV+KP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRIL
        DE RKNHRIQFEF K C THS+LRRHIYEQYH+WLH ASKL+D DIDIPYKFCTVSHSYFGFYADQFWP+G   IPNLIHRWLSPR LAYWYMYGG R  
Subjt:  DEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRIL

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGALIETENINFDIQSYSDEEASN
        SGDILLKLKGSHEGVEKIVKSLREKSM+CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QA S + G L ETENINFD QS S EE SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGALIETENINFDIQSYSDEEASN

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 0.0e+00; % identity: 89.04)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTL  SQCHHHF         L IPTYSAKGR+QLPRIPAFASSS  E LV+DRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEM QAGIEEEREVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS SAAAYQTIIGILCKF+++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF + CSTHS LRRHI+EQYHEWLHPASKLSDSD DIPYKFCTVSHSYFGFYADQFWP+GHP IPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA S + E A  ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 0.0e+00; % identity: 89.42)
Query:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATVTLLRSLTL  SQCH+HF         L IPTYSAKGR+QLPRIPAFASSS  E LV+DRDSP+ESEE LCSPYSN AE       FASA
Subjt:  MSIRTSAFATVTLLRSLTLSLSQCHHHF---------LFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVF DIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEMQQAGIEEEREVL+SILRASSK+GDVMEAERSWLK+K+FDGSMPSQAFVYKMEVYAKVGNPMKALE 
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLNS S+AAYQTIIGILCKF+++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD
        RKNHRIQFEF + CSTHS LRRH+YEQYHEWLHPASKLSDSD DIPYKFCTVSHSYFGFYADQFWP+GHPAIPNLIHRWLSPRVLAYWYMYGG RI SGD
Subjt:  RKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA + + E A+ ET NINFD QS SDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGS-DSEGALIETENINFDIQSYSDEEASN

A0A6P3ZHH7 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 0.0e+00; % identity: 68.42)
Query:  GPSTKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLTLSLSQC-HHHFLFIPTYS--AKGRQQLPRIPAFASS-SFAEQLVHDRDSPSESEE
        G  + +   PNS     S   LA+  S+S+R+S+F   +LLRSLTLSLS C HHH  F P ++       +  R+PA +SS +FAEQL       S +EE
Subjt:  GPSTKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLTLSLSQC-HHHFLFIPTYS--AKGRQQLPRIPAFASS-SFAEQLVHDRDSPSESEE

Query:  HLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVY
        +      +E E F YE SFAS DLKHL +P LEVKEL+ELPEQWRRSKLAWLCKELPAHKP TL+R+LNAQ+KW+RQ+DA YV VHC+RIRENE  FRVY
Subjt:  HLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVY

Query:  KWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSK
        KWMMQQHWYRFD+ALATKLADYMGKERKFSKCRE+F DIINQG VPSESTFHIL+VAYLS PVQGC+EEA +IYNRMIQLGGYQPRLSLHNSLFR+++ K
Subjt:  KWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSK

Query:  PGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPS
        PG  SK +LKQAEFI+HNL TTGLE+HKDIY GLIWLHS+QDT+DKER+ +LR  MQQAGIEE REVL+S+LRA SK GDV EAE++W KL   D   PS
Subjt:  PGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPS

Query:  QAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC
        QAFVY+MEV+AK GN  K+LE FR+M++ LNSTS  AY  +I ILC+ Q++ELAES+M  F+ S LKPLMP+YVDLM+M+F+L LHDK+EL F QCL+KC
Subjt:  QAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC

Query:  KPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKL
        +PNRTIY+IYLDSLVK  NL +AEEIF QMQ +G IGV+ARSCNIILSGYL  G+Y+KAEKIYDLMCQK+YDI+  LMEK+DYVLSLSRK V+KP+SLKL
Subjt:  KPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKL

Query:  SKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIH
        SKEQREILVGLLLGGL+IESDE RKNH ++FEF +    HS+L+RHI++QYHEWLHP+ K +D+  DIP +F T+SHSYFGFYADQFWP+G   IP LIH
Subjt:  SKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIH

Query:  RWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEG-ALIET
        RWLSPRVLAYWYMYGG+R  SGDILLKLKG+ E VEKIVK+L+ +S+ C+VK+KGRV+WIG LG+N+TWFWKL EP+I+DDLKDSL+ G ++ G +  ET
Subjt:  RWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEG-ALIET

Query:  ENINFDIQSYSDEEASN
        ENI+F+  S SDE+AS+
Subjt:  ENINFDIQSYSDEEASN

SwissProt top hits (e value, % identity, alignment)
O04491 Putative pentatricopeptide repeat-containing protein At1g09680 (e value: 1.0e-10; % identity: 24.36)
Query:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL
        +  FR+ K  M++   R D    + L + + KE K      +F ++  +G +P++  F  LI  +      G I+     Y +M+   G QP + L+N+L
Subjt:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL

Query:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKA
              K GD     L  A  I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE +R    +++    K G V++AER+  ++  
Subjt:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
                 +   M+ + K G+     +  +EM+      S   Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

O82178 Pentatricopeptide repeat-containing protein At2g35130 (e value: 3.9e-10; % identity: 19.86)
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

P0C8Q6 Putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial (e value: 7.9e-11; % identity: 23.64)
Query:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I    +A   G ++EAS++++R+ ++G   P    +N L  
Subjt:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR

Query:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAG-IEEEREVLLSILRASSKMGDVMEAERSWLKLKA
        A       +SK +    E +   L        H D +     L  Y +T   ER +S+  E+   G ++E    +L +  +  K G V +A      L+ 
Subjt:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAG-IEEEREVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
         D  +  + +   +  + K     KA + F +M ++  +   A Y  +IG LCK +D+E+A S+     +S + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic    5.9e-232    53.75
Query:  PRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAE-GFHYENSFASADLKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQ
        P IPA AS+   E L+ D D   E E+        E E G     ++A+AD +  + +P L V EL+ELPEQWRRS++AWLCKELPA+K  T  R+LNAQ
Subjt:  PRIPAFASSSFAEQLVHDRDSPSESEEHLCSPYSNEAE-GFHYENSFASADLKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQ

Query:  RKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS
        RKW+ QDDA YV VHCLRIR N+ AFRVY WM++QHW+RF++ALAT++AD +G++ K  KCREVF  ++ QG VP+ESTFHILIVAYLS P   C+EEA 
Subjt:  RKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS

Query:  TIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSI
        TIYN+MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAEF+YHN+VTT L++HKD+Y GLIWLHSYQD ID+ERI++LRKEM+QAG +E  +VL+S+
Subjt:  TIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSI

Query:  LRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMP
        +RA SK G+V E E +W  +      +P QA+V +ME YA+ G PMK+L+ F+EM+  N   + A+Y  II I+ K  ++++ E +M  FI+S++K LMP
Subjt:  LRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMP

Query:  AYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKY
        A++DLM M+ +L +H+KLELTF +C+ +C+PNR +Y+IYL+SLVKVGN+ +AEE+F +M  NG IG N +SCNI+L GYL   +Y KAEK+YD+M +KKY
Subjt:  AYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKY

Query:  DIDPPLMEKLDYVLSLSRKEVR-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPY
        D+    +EKL   L L++K ++ K VS+KL +EQREIL+GLLLGG  +ES   R  H + F+FQ+  + HS+LR HI+E++ EWL  AS+  D    IPY
Subjt:  DIDPPLMEKLDYVLSLSRKEVR-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPY

Query:  KFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSH-EGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATW
        +F T+ H +F F+ DQF+ +G P +P LIHRWL+PRVLAYW+M+GG ++ SGDI+LKL G + EGVE+IV SL  +S+  KVKRKGR +WIG  GSNA  
Subjt:  KFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSH-EGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATW

Query:  FWKLIEPFILDDLKDSLQAGSDSEGA--LIETENINFDIQSYSDEE
        FW++IEP +L++    +     S G+    +T+  + D    SD E
Subjt:  FWKLIEPFILDDLKDSLQAGSDSEGA--LIETENINFDIQSYSDEE

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic    2.0e-256    55.21
Query:  STKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLT-LSLSQCHHHFLFIPTYSAKGRQQLPRIPAFASSSFAEQ---LVHDRDSPSESEEHL
        S+ V  T  +  +L SN N+ +  S   R+ +F+ +    S +  SL +   H +                P F ++S A++    V      +ESEE +
Subjt:  STKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLT-LSLSQCHHHFLFIPTYSAKGRQQLPRIPAFASSSFAEQ---LVHDRDSPSESEEHL

Query:  CSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFR
             +EA GF    S A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++VHC+RIRENET FR
Subjt:  CSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFR

Query:  VYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRAL
        VY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVF D++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PRLSLHNSLFRAL
Subjt:  VYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRAL

Query:  MSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGS
        +SK G +    LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS QD +D  RI SLR+EM++AG +E +EV++S+LRA +K G V E ER+WL+L   D  
Subjt:  MSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGS

Query:  MPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL
        +PSQAFVYK+E Y+KVG+  KA+E FREME+ +   + + Y  II +LCK Q +EL E++M  F +S  KPL+P+++++  M+F+L LH+KLE+ F QCL
Subjt:  MPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL

Query:  EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRK-PV
        EKC+P++ IY+IYLDSL K+GNL +A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEV+K P 
Subjt:  EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRK-PV

Query:  SLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIP
        S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF++    H +L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P IP
Subjt:  SLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIP

Query:  NLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGAL
         LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+  S+S   +
Subjt:  NLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGAL

Query:  IETENINFDIQSYSD
         E E  + + +S SD
Subjt:  IETENINFDIQSYSD
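
The %identity column can be related to the middle line of each alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch. The following is a minimal sketch of that count, assuming the usual BLAST midline convention; the function name `block_stats` is hypothetical, and the midline string is transcribed from the final block above, so its spacing may not match the source exactly. The value it yields applies to this one block only, whereas the 55.21 reported for the hit is computed over the whole alignment.

```python
def block_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, length) for one alignment block.

    Assumes `midline` is already trimmed to the same width as `query`
    (the 8-character "Query:  " prefix removed).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(ch.isalpha() for ch in midline)   # letters = exact matches
    positives = identical + midline.count("+")        # '+' = conservative change
    return identical, positives, len(query)


# Final block of the alignment above (illustrative transcription).
query = "IETENINFDIQSYSD"
midline = " E E  + + +S SD"

identical, positives, length = block_stats(query, midline)
print(f"identities: {identical}/{length} ({100 * identical / length:.2f}%)")
print(f"positives:  {positives}/{length} ({100 * positives / length:.2f}%)")
```

Summing `identical` and `length` over every block of a hit, rather than averaging per-block percentages, is what would approximate the reported whole-hit figure.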

Arabidopsis top hits    e value    % identity    Alignment
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein    7.3e-12    24.36
Query:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL
        +  FR+ K  M++   R D    + L + + KE K      +F ++  +G +P++  F  LI  +      G I+     Y +M+   G QP + L+N+L
Subjt:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL

Query:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKA
              K GD     L  A  I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE +R    +++    K G V++AER+  ++  
Subjt:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
                 +   M+ + K G+     +  +EM+      S   Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

AT2G15820.1 endonucleases    1.4e-257    55.21
Query:  STKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLT-LSLSQCHHHFLFIPTYSAKGRQQLPRIPAFASSSFAEQ---LVHDRDSPSESEEHL
        S+ V  T  +  +L SN N+ +  S   R+ +F+ +    S +  SL +   H +                P F ++S A++    V      +ESEE +
Subjt:  STKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLT-LSLSQCHHHFLFIPTYSAKGRQQLPRIPAFASSSFAEQ---LVHDRDSPSESEEHL

Query:  CSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFR
             +EA GF    S A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++VHC+RIRENET FR
Subjt:  CSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFR

Query:  VYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRAL
        VY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVF D++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PRLSLHNSLFRAL
Subjt:  VYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRAL

Query:  MSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGS
        +SK G +    LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS QD +D  RI SLR+EM++AG +E +EV++S+LRA +K G V E ER+WL+L   D  
Subjt:  MSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGS

Query:  MPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL
        +PSQAFVYK+E Y+KVG+  KA+E FREME+ +   + + Y  II +LCK Q +EL E++M  F +S  KPL+P+++++  M+F+L LH+KLE+ F QCL
Subjt:  MPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL

Query:  EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRK-PV
        EKC+P++ IY+IYLDSL K+GNL +A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEV+K P 
Subjt:  EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRK-PV

Query:  SLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIP
        S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF++    H +L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P IP
Subjt:  SLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLLRRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIP

Query:  NLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGAL
         LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+  S+S   +
Subjt:  NLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGAL

Query:  IETENINFDIQSYSD
         E E  + + +S SD
Subjt:  IETENINFDIQSYSD

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.8e-11    19.86
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein    2.8e-11    19.86
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

AT5G08310.1 Tetratricopeptide repeat (TPR)-like superfamily protein    5.6e-12    23.64
Query:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I    +A   G ++EAS++++R+ ++G   P    +N L  
Subjt:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR

Query:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAG-IEEEREVLLSILRASSKMGDVMEAERSWLKLKA
        A       +SK +    E +   L        H D +     L  Y +T   ER +S+  E+   G ++E    +L +  +  K G V +A      L+ 
Subjt:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAG-IEEEREVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
         D  +  + +   +  + K     KA + F +M ++  +   A Y  +IG LCK +D+E+A S+     +S + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP


Sequences
CDS sequence
ATGTCACTCTCGTTCCTGCTGAAACATGGTTTGCTTCTCACTAAGATGAACCTTCTACATCCTAAGGGTCCGTCCACAAAAGTTCTCTTCACTCCCAACTCTGCATTCGC
CTTGCTGTCTAACCGTAACCTTGCTTCGGTTTTCTCCATGTCCATTCGTACCTCTGCCTTTGCCACTGTCACCCTTCTGCGTTCTCTCACTCTTTCCCTCTCTCAATGCC
ATCACCACTTCCTCTTTATCCCAACATATTCTGCAAAAGGACGGCAACAACTTCCGCGAATTCCTGCCTTTGCTTCCAGTTCTTTCGCTGAACAGTTGGTACACGACCGG
GATTCCCCGTCCGAGTCTGAAGAGCACTTGTGTTCTCCATACAGTAACGAGGCCGAGGGTTTTCATTATGAAAATAGTTTTGCGTCGGCAGATTTGAAACACTTGGGAAC
GCCTGCGCTTGAAGTCAAGGAGCTAGACGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGCCAGCACATAAGCCGGGAACATTGATAC
GGCTGCTTAATGCTCAGAGGAAATGGATGAGGCAGGATGATGCGGCCTATGTCACCGTGCATTGTTTGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGG
ATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGGGAGGTATTTGGTGATAT
AATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATACTGATTGTTGCTTACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATC
GTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTCATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCT
GAGTTTATATATCACAATCTGGTAACAACTGGACTCGAGTTACATAAAGATATATATGGTGGTCTAATTTGGTTACATAGTTATCAGGATACTATTGACAAAGAAAGGAT
AGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCGAGGAGGAAAGAGAAGTCCTCTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTGATGGAAGCAGAAA
GATCGTGGCTTAAACTCAAGGCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGGTGGGTAACCCGATGAAAGCTTTGGAAACA
TTTAGGGAGATGGAGCAGTTGAACTCTACTAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGATATAGAACTTGCAGAATCCATCATGGCAGG
CTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAATTAGAGTTAACCTTCTCCCAGTGCC
TTGAGAAGTGTAAGCCCAATCGTACGATTTACAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGGCTGAAGAAATATTTAGTCAGATGCAAACAAAT
GGAGAAATTGGTGTAAATGCTCGTTCATGCAACATCATTTTAAGTGGGTATCTGTTATTTGGGAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAA
GTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTTAGGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGA
TTTTAGTAGGGTTGTTGTTAGGTGGCCTGGAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCAAAAGAAATGTAGCACCCACTCTCTTTTG
AGGAGACACATATATGAGCAATATCACGAGTGGTTACATCCTGCTTCAAAGTTGAGCGATAGTGATATAGATATACCATATAAATTCTGCACCGTTTCACATTCATATTT
TGGTTTCTATGCAGATCAGTTTTGGCCACAAGGCCATCCTGCAATCCCTAATCTAATTCATAGGTGGCTTTCTCCTCGCGTTCTTGCCTACTGGTATATGTATGGAGGCT
ACAGGATACTGTCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAAGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAAAAGTCCATGTATTGCAAGGTGAAAAGG
AAAGGCAGGGTGTATTGGATAGGTTTACTCGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGTCTACAGGCAGGCAG
CGATTCTGAGGGGGCTTTAATTGAAACTGAAAATATCAACTTTGATATTCAATCTTATTCTGATGAGGAGGCTTCTAATTAA
mRNA sequence
ATGTCACTCTCGTTCCTGCTGAAACATGGTTTGCTTCTCACTAAGATGAACCTTCTACATCCTAAGGGTCCGTCCACAAAAGTTCTCTTCACTCCCAACTCTGCATTCGC
CTTGCTGTCTAACCGTAACCTTGCTTCGGTTTTCTCCATGTCCATTCGTACCTCTGCCTTTGCCACTGTCACCCTTCTGCGTTCTCTCACTCTTTCCCTCTCTCAATGCC
ATCACCACTTCCTCTTTATCCCAACATATTCTGCAAAAGGACGGCAACAACTTCCGCGAATTCCTGCCTTTGCTTCCAGTTCTTTCGCTGAACAGTTGGTACACGACCGG
GATTCCCCGTCCGAGTCTGAAGAGCACTTGTGTTCTCCATACAGTAACGAGGCCGAGGGTTTTCATTATGAAAATAGTTTTGCGTCGGCAGATTTGAAACACTTGGGAAC
GCCTGCGCTTGAAGTCAAGGAGCTAGACGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGCCAGCACATAAGCCGGGAACATTGATAC
GGCTGCTTAATGCTCAGAGGAAATGGATGAGGCAGGATGATGCGGCCTATGTCACCGTGCATTGTTTGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGG
ATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGGGAGGTATTTGGTGATAT
AATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATACTGATTGTTGCTTACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATC
GTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTCATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCT
GAGTTTATATATCACAATCTGGTAACAACTGGACTCGAGTTACATAAAGATATATATGGTGGTCTAATTTGGTTACATAGTTATCAGGATACTATTGACAAAGAAAGGAT
AGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCGAGGAGGAAAGAGAAGTCCTCTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTGATGGAAGCAGAAA
GATCGTGGCTTAAACTCAAGGCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGGTGGGTAACCCGATGAAAGCTTTGGAAACA
TTTAGGGAGATGGAGCAGTTGAACTCTACTAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGATATAGAACTTGCAGAATCCATCATGGCAGG
CTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAATTAGAGTTAACCTTCTCCCAGTGCC
TTGAGAAGTGTAAGCCCAATCGTACGATTTACAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGGCTGAAGAAATATTTAGTCAGATGCAAACAAAT
GGAGAAATTGGTGTAAATGCTCGTTCATGCAACATCATTTTAAGTGGGTATCTGTTATTTGGGAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAA
GTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTTAGGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGA
TTTTAGTAGGGTTGTTGTTAGGTGGCCTGGAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCAAAAGAAATGTAGCACCCACTCTCTTTTG
AGGAGACACATATATGAGCAATATCACGAGTGGTTACATCCTGCTTCAAAGTTGAGCGATAGTGATATAGATATACCATATAAATTCTGCACCGTTTCACATTCATATTT
TGGTTTCTATGCAGATCAGTTTTGGCCACAAGGCCATCCTGCAATCCCTAATCTAATTCATAGGTGGCTTTCTCCTCGCGTTCTTGCCTACTGGTATATGTATGGAGGCT
ACAGGATACTGTCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAAGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAAAAGTCCATGTATTGCAAGGTGAAAAGG
AAAGGCAGGGTGTATTGGATAGGTTTACTCGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGTCTACAGGCAGGCAG
CGATTCTGAGGGGGCTTTAATTGAAACTGAAAATATCAACTTTGATATTCAATCTTATTCTGATGAGGAGGCTTCTAATTAA
Protein sequence
MSLSFLLKHGLLLTKMNLLHPKGPSTKVLFTPNSAFALLSNRNLASVFSMSIRTSAFATVTLLRSLTLSLSQCHHHFLFIPTYSAKGRQQLPRIPAFASSSFAEQLVHDR
DSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKW
MMQQHWYRFDYALATKLADYMGKERKFSKCREVFGDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQA
EFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIEEEREVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMQTN
GEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKCSTHSLL
RRHIYEQYHEWLHPASKLSDSDIDIPYKFCTVSHSYFGFYADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGYRILSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKR
KGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGSDSEGALIETENINFDIQSYSDEEASN
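
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the helper name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()          # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten and last six codons of the CDS listed above.
print(translate("ATGTCACTCTCGTTCCTGCTGAAACATGGT"))  # MSLSFLLKHG
print(translate("GAGGAGGCTTCTAATTAA"))              # EEASN
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the concatenated protein lines would extend the same check to the whole record.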