; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023441 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023441
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr7:48243717..48246633
RNA-Seq ExpressionLag0023441
SyntenyLag0023441
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.92Show/hide
Query:  MNLLHPKGPSTKVLFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQL
        MNL++PK    KV     S+ T+L N   +S  SMSIRTSAFATV LLRSLTL  SQCH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E L
Subjt:  MNLLHPKGPSTKVLFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQL

Query:  VHDRDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCL
        V+DRDSP+ESEE LCSPYS  AEG      FASADLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAYV VHCL
Subjt:  VHDRDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCL

Query:  RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLS
        RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGYQPRLS
Subjt:  RIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLS

Query:  LHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSW
        LHNSLF+AL+SKPGDLSKHHLKQAEFIYHN+ TTGLELHKDIYGGLIWLHSYQ+T+DKERI+SLRKEMQQAGIEEE+EVL+SILRASSK+GDVMEAERSW
Subjt:  LHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSW

Query:  LKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL
        LKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E FREMEQLN  SAAAYQTIIGILCK +++ LAES+M  FIKSNLKPL PAYVDLMNMFFNLSLHDKL
Subjt:  LKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL

Query:  ELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSR
        ELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSR
Subjt:  ELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSR

Query:  KEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQK-----------------------------------KY-------------QFWP
        KE++KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEF +                                   K+             QFWP
Subjt:  KEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQK-----------------------------------KY-------------QFWP

Query:  QGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-
        +GHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA 
Subjt:  QGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-

Query:  GRDSEGALIETENINFYSQSYSDEEASN
          + E A  ET NINF SQS SDEEAS+
Subjt:  GRDSEGALIETENINFYSQSYSDEEASN

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+0082.87Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTL  SQCH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E LV+DRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQ+T+DKERI+SLRKEM QAGIEEE+EVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS SAAAYQTIIGILCKF+++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHP IPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA   + E A  ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0083.5Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTL  SQCH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E LV+DRDSP+ESEE LCSPYSN AE       FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQ+T+DKERI+SLRKEMQQAGIEEE+EVL+SILRASSK+GDVMEAERSWLK+K+FDGSMPSQAFVYKMEVYAKVGNPMKALE 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS S+AAYQTIIGILCKF+++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGR-DSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA   + E A+ ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGR-DSEGALIETENINFYSQSYSDEEASN

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0082.87Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTLS   CH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E LVHDRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAH PGTLIRLLNAQRKWM+QDDAAYV VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIY GLIWLHSYQ+T+DKERI+SLRKEMQQAGIEEE+EVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQA   + E A+ ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN

XP_023525582.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0082.87Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTLS   CH HFRC N++IR+L IPTY AKGR+QL RIPAF SSS  E LVHDRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAYV VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIY GLIWLHSYQ+T+DKERI+SLRKEMQQAGIEEE+EVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQA   + E A+ ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0079.42Show/hide
Query:  VFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF
        VFSMSI TSAF+TV  LRSLTLSLS  H +F C NHII TLF+P Y  K R+QLPRI AF S SF +QLV+D DSPSESEEHL S +SN  +GFH+EN F
Subjt:  VFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL
        AS DLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPA KPGT+IRLLNAQ+KWM QDDA Y+ VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA
        VT+GLELHKD+YGGLIWLHSYQ+TID+ERIVSLRKEMQQAGI+EE+EVLLSILRASSKMGDVMEAE+ W +LK  DG+MPSQAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA

Query:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNL
        LE FREMEQLNST+AAAYQTIIGILCKFQ IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIY+IYLDSLVKVGNL
Subjt:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNL

Query:  NRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES
        +R EEIFSQM+TNGEIG+NARSCNIIL GYLL GNY+KAEKIYDLMCQK+Y IDPPLMEKL+Y+LSLSRKEV+KP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRIL
        D+ RKNHRIQFEF +                                                  QFWP+G  AIPNLIHRWLSPRVLAYWYMYGGCR  
Subjt:  DEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRIL

Query:  SGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN
        SGDILLKLKGSHEGV+KIVKSL EKS++CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LK+S QA   +  G L  +ENINF S+S S EE SN
Subjt:  SGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0082.16Show/hide
Query:  VFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF
        VFSMSI TSAF+TV LLRSLTLSLS  H +F   NHII TLFI +Y  K R QLPRI AF S SF +QLV+DRDSPSESEEHL SPYSN  +GFH+EN F
Subjt:  VFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL
        AS DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA Y+TVHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA
        VT+GLELHKDIYGGLIWLHSYQ+TIDKERIVSLRKEMQQAGI+EEKEVLLSILRASSKMGDV+EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKA

Query:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNL
        LE FREMEQLNST+AAAYQTIIGILCKFQ+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYLDSLVKVGNL
Subjt:  LETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNL

Query:  NRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES
        +R EEIFSQM+TNGEIGVNARSCN+IL GYLLFGNY+KAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKEV+KP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRIL
        DE RKNHRIQFEF K                                                  QFWP+G   IPNLIHRWLSPR LAYWYMYGGCR  
Subjt:  DEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRIL

Query:  SGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGRDSEGALIETENINFYSQSYSDEEASN
        SGDILLKLKGSHEGV+KIVKSL EKSM+CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QA   + G L ETENINF SQS S EE SN
Subjt:  SGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGRDSEGALIETENINFYSQSYSDEEASN

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0082.87Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTL  SQCH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E LV+DRDSP+ESEE LCSPYS  AEG      FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL TT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQ+T+DKERI+SLRKEM QAGIEEE+EVL+SILRASSK+GDVMEAERSWLKLK+FDGSMPSQAFVYKMEVYAKVGNPMKA E 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS SAAAYQTIIGILCKF+++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHP IPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA   + E A  ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA-GRDSEGALIETENINFYSQSYSDEEASN

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0083.5Show/hide
Query:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA
        MSIRTSAFATV LLRSLTL  SQCH HFRC N++IR+L IPTY AKGR+QLPRIPAF SSS  E LV+DRDSP+ESEE LCSPYSN AE       FASA
Subjt:  MSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASA

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVTT
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTT

Query:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET
        GLELHKDIYGGLIWLHSYQ+T+DKERI+SLRKEMQQAGIEEE+EVL+SILRASSK+GDVMEAERSWLK+K+FDGSMPSQAFVYKMEVYAKVGNPMKALE 
Subjt:  GLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALET

Query:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT
        FREMEQLNS S+AAYQTIIGILCKF+++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIY+IYL+SLVKVGNL+R 
Subjt:  FREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRT

Query:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQMQTNGEIGV+ARSCNIILSGYLL G+YLKAEKIYDLMCQKKY IDPPLMEKLDYVLSLSRKE++KPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD
        RKNHRIQFEF +                                   K+             QFWP+GHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFQK-----------------------------------KY-------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGD

Query:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGR-DSEGALIETENINFYSQSYSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQA   + E A+ ET NINF SQS SDEEAS+
Subjt:  ILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGR-DSEGALIETENINFYSQSYSDEEASN

A0A6P3ZHH7 pentatricopeptide repeat-containing protein At2g15820, chloroplastic4.7e-29064.03Show/hide
Query:  GPSTKVLFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSS-SFAEQLVHDRDS
        G  + +   PNS     S   LA+  S+S+R+S+F+   LLRSLTLSLS C QH  C+    R +F P   A  +    R+PA  SS +FAEQL      
Subjt:  GPSTKVLFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSS-SFAEQLVHDRDS

Query:  PSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENE
         S +EE+      +E E F YE SFAS DLKHL +P LEVKEL+ELPEQWRRSKLAWLCKELPAHKP TL+R+LNAQ+KW+RQ+DA YV VHC+RIRENE
Subjt:  PSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIRENE

Query:  TAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLF
          FRVYKWMMQQHWYRFD+ALATKLADYMGKERKFSKCRE+FDDIINQG VPSESTFHIL+VAYLS PVQGC+EEA +IYNRMIQLGGYQPRLSLHNSLF
Subjt:  TAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLF

Query:  RALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAF
        R+++ KPG  SK +LKQAEFI+HNL TTGLE+HKDIY GLIWLHS+Q+T+DKER+ +LR  MQQAGIEE +EVL+S+LRA SK GDV EAE++W KL   
Subjt:  RALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAF

Query:  DGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFS
        D   PSQAFVY+MEV+AK GN  K+LE FR+M++ LNSTS  AY  +I ILC+ Q++ELAES+M  F+ S LKPLMP+YVDLM+M+F+L LHDK+EL F 
Subjt:  DGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFS

Query:  QCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRK
        QCL+KC+PNRTIY IYLDSLVK  NL + EEIF QMQ +G IGV+ARSCNIILSGYL  G+Y+KAEKIYDLMCQK+Y I+  LMEK+DYVLSLSRK V+K
Subjt:  QCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRK

Query:  PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPA
        P+SLKLSKEQREILVGLLLGGL+IESDE RKNH ++FEF +                                                  QFWP+G   
Subjt:  PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKY------------------------------------------------QFWPQGHPA

Query:  IPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGRDSEG
        IP LIHRWLSPRVLAYWYMYGG R  SGDILLKLKG+ E V+KIVK+L  +S+ C+VK+KGRV+WIG LG+N+TWFWKL EP+I+DDLKDSL+ G ++ G
Subjt:  IPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGRDSEG

Query:  -ALIETENINFYSQSYSDEEASN
         +  ETENI+F S S SDE+AS+
Subjt:  -ALIETENINFYSQSYSDEEASN

SwissProt top hitse value%identityAlignment
O04491 Putative pentatricopeptide repeat-containing protein At1g096803.4e-1124.36Show/hide
Query:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL
        +  FR+ K  M++   R D    + L + + KE K      +FD++  +G +P++  F  LI  +      G I+     Y +M+   G QP + L+N+L
Subjt:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL

Query:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKA
              K GD     L  A  I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE ++    +++    K G V++AER+  ++  
Subjt:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
                 +   M+ + K G+     +  +EM+      S   Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

O82178 Pentatricopeptide repeat-containing protein At2g351309.8e-1120.09Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI

Query:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + N  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID

P0C8Q6 Putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial2.6e-1124Show/hide
Query:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I    +A   G ++EAS++++R+ ++G   P    +N L  
Subjt:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR

Query:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAG-IEEEKEVLLSILRASSKMGDVMEAERSWLKLKA
        A       +SK +    E +   L        H D +     L  Y NT   ER +S+  E+   G ++E    +L +  +  K G V +A      L+ 
Subjt:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAG-IEEEKEVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
         D  +  + +   +  + K     KA + F +M ++  +   A Y  +IG LCK +D+E+A S+     +S + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic2.8e-20751.54Show/hide
Query:  PRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAE-GFHYENSFASADLKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQ
        P IPA  S+   E L+ D D   E E+        E E G     ++A+AD +  + +P L V EL+ELPEQWRRS++AWLCKELPA+K  T  R+LNAQ
Subjt:  PRIPAFPSSSFAEQLVHDRDSPSESEEHLCSPYSNEAE-GFHYENSFASADLKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQ

Query:  RKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS
        RKW+ QDDA YV VHCLRIR N+ AFRVY WM++QHW+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C+EEA 
Subjt:  RKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS

Query:  TIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSI
        TIYN+MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAEF+YHN+VTT L++HKD+Y GLIWLHSYQ+ ID+ERI++LRKEM+QAG +E  +VL+S+
Subjt:  TIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSI

Query:  LRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMP
        +RA SK G+V E E +W  +      +P QA+V +ME YA+ G PMK+L+ F+EM+  N   + A+Y  II I+ K  ++++ E +M  FI+S++K LMP
Subjt:  LRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMP

Query:  AYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKY
        A++DLM M+ +L +H+KLELTF +C+ +C+PNR +Y IYL+SLVKVGN+ + EE+F +M  NG IG N +SCNI+L GYL   +Y KAEK+YD+M +KKY
Subjt:  AYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKY

Query:  AIDPPLMEKLDYVLSLSRKEVR-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKY---------------------------------
         +    +EKL   L L++K ++ K VS+KL +EQREIL+GLLLGG  +ES   R  H + F+FQ+                                   
Subjt:  AIDPPLMEKLDYVLSLSRKEVR-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKY---------------------------------

Query:  ---------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSH-EGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATW
                       QF+ +G P +P LIHRWL+PRVLAYW+M+GG ++ SGDI+LKL G + EGV++IV SL  +S+  KVKRKGR +WIG  GSNA  
Subjt:  ---------------QFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSH-EGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATW

Query:  FWKLIEPFILDD
        FW++IEP +L++
Subjt:  FWKLIEPFILDD

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic4.1e-23552.11Show/hide
Query:  LFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPT-------YFAKGRQQLPRIPAFPSSSFAEQ---LVHD
        L + +S+   ++  N++S+ S     ++ +T  L RSL+ SL +    +      +R L I T       +F+    + P  P F ++S A++    V  
Subjt:  LFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPT-------YFAKGRQQLPRIPAFPSSSFAEQ---LVHD

Query:  RDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHC
            +ESEE +     +EA GF    S A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++VHC
Subjt:  RDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHC

Query:  LRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR
        +RIRENET FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PR
Subjt:  LRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR

Query:  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAER
        LSLHNSLFRAL+SK G +    LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS Q+ +D  RI SLR+EM++AG +E KEV++S+LRA +K G V E ER
Subjt:  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAER

Query:  SWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLH
        +WL+L   D  +PSQAFVYK+E Y+KVG+  KA+E FREME+ +   + + Y  II +LCK Q +EL E++M  F +S  KPL+P+++++  M+F+L LH
Subjt:  SWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLH

Query:  DKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLS
        +KLE+ F QCLEKC+P++ IYNIYLDSL K+GNL +  ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY I+PPLMEKLDY+LS
Subjt:  DKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLS

Query:  LSRKEVRK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKYQ----------------------------------------------F
        L +KEV+K P S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF++  Q                                              +
Subjt:  LSRKEVRK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKYQ----------------------------------------------F

Query:  WPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ
        WP+G P IP LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGV+K+VK+L  KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+
Subjt:  WPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ

Query:  AGRDSEGALIETE--NINFYSQSYSDEEASN
           +S   + E E  +INF S S   ++  N
Subjt:  AGRDSEGALIETE--NINFYSQSYSDEEASN

Arabidopsis top hitse value%identityAlignment
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-1224.36Show/hide
Query:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL
        +  FR+ K  M++   R D    + L + + KE K      +FD++  +G +P++  F  LI  +      G I+     Y +M+   G QP + L+N+L
Subjt:  ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSL

Query:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKA
              K GD     L  A  I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE ++    +++    K G V++AER+  ++  
Subjt:  FRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
                 +   M+ + K G+     +  +EM+      S   Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLNST-SAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP

AT2G15820.1 endonucleases2.9e-23652.11Show/hide
Query:  LFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPT-------YFAKGRQQLPRIPAFPSSSFAEQ---LVHD
        L + +S+   ++  N++S+ S     ++ +T  L RSL+ SL +    +      +R L I T       +F+    + P  P F ++S A++    V  
Subjt:  LFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPT-------YFAKGRQQLPRIPAFPSSSFAEQ---LVHD

Query:  RDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHC
            +ESEE +     +EA GF    S A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++VHC
Subjt:  RDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHC

Query:  LRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR
        +RIRENET FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PR
Subjt:  LRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR

Query:  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAER
        LSLHNSLFRAL+SK G +    LKQAEFI+HN+VTTGLE+ KDIY GLIWLHS Q+ +D  RI SLR+EM++AG +E KEV++S+LRA +K G V E ER
Subjt:  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAER

Query:  SWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLH
        +WL+L   D  +PSQAFVYK+E Y+KVG+  KA+E FREME+ +   + + Y  II +LCK Q +EL E++M  F +S  KPL+P+++++  M+F+L LH
Subjt:  SWLKLKAFDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQ-LNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLH

Query:  DKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLS
        +KLE+ F QCLEKC+P++ IYNIYLDSL K+GNL +  ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY I+PPLMEKLDY+LS
Subjt:  DKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLS

Query:  LSRKEVRK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKYQ----------------------------------------------F
        L +KEV+K P S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF++  Q                                              +
Subjt:  LSRKEVRK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQKKYQ----------------------------------------------F

Query:  WPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ
        WP+G P IP LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGV+K+VK+L  KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+
Subjt:  WPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ

Query:  AGRDSEGALIETE--NINFYSQSYSDEEASN
           +S   + E E  +INF S S   ++  N
Subjt:  AGRDSEGALIETE--NINFYSQSYSDEEASN

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-1220.09Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI

Query:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + N  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-1220.09Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   V+V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYVTVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTI

Query:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII
               L  EM+    +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A E F  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYK--MEVYAKVGNPMKALETFREMEQLN-STSAAAYQTII

Query:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + N  L+   ++G   + E+I ++M+ NG    +  
Subjt:  GILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYNIYLDSLVKVGNLNRTEEIFSQMQTNGEIGVNAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYAID

AT5G08310.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-1224Show/hide
Query:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I    +A   G ++EAS++++R+ ++G   P    +N L  
Subjt:  AFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFR

Query:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAG-IEEEKEVLLSILRASSKMGDVMEAERSWLKLKA
        A       +SK +    E +   L        H D +     L  Y NT   ER +S+  E+   G ++E    +L +  +  K G V +A      L+ 
Subjt:  ALMSKPGDLSKHHLKQAEFIYHNL-VTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAG-IEEEKEVLLSILRASSKMGDVMEAERSWLKLKA

Query:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP
         D  +  + +   +  + K     KA + F +M ++  +   A Y  +IG LCK +D+E+A S+     +S + P
Subjt:  FDGSMPSQAFVYKMEVYAKVGNPMKALETFREMEQLN-STSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTCTCGCTTCTGCTGAAACATGGTTTGCTTCTCACTAAGATGAACCTTCTGCATCCTAAGGGTCCGTCCACAAAAGTTCTCTTCACTCCCAACTCTGCATTCAC
CTTGCTGTCTAACCGTAACCTTGCTTCGGTTTTCTCCATGTCCATTCGTACCTCTGCCTTTGCCACTGTCCCCCTTCTGCGTTCTCTCACTCTTTCCCTCTCTCAATGCC
ATCAACACTTCCGTTGCCACAATCACATCATCCGTACTCTCTTTATCCCAACATATTTTGCAAAAGGACGGCAACAACTTCCGCGAATTCCTGCCTTTCCTTCCAGTTCT
TTCGCTGAACAGTTGGTACACGATCGGGATTCCCCGTCCGAGTCTGAAGAGCACTTGTGTTCTCCATACAGTAACGAGGCCGAGGGTTTTCATTATGAAAATAGTTTTGC
GTCGGCAGATTTGAAACACTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTAGACGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGC
CCGCGCATAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATGAGGCAGGATGATGCGGCCTATGTCACCGTGCATTGTTTGCGTATTCGCGAAAAC
GAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTC
AAAGTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCTTACCTTAGTGCACCTGTTCAAGGATGCA
TAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTCATGAGCAAACCAGGGGAT
TTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATCTGGTAACAACTGGACTCGAGTTACATAAAGATATATATGGTGGTCTAATTTGGTTACATAGTTA
TCAGAATACTATAGACAAAGAAAGGATAGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCGAGGAGGAAAAAGAAGTCCTTTTGTCCATCTTGAGAGCAAGCTCAA
AAATGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTCAAGGCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGGTG
GGTAACCCGATGAAAGCTTTGGAAACATTTAGGGAGATGGAGCAGTTGAACTCTACAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGATAT
AGAACTTGCAGAATCCATCATGGCAGGCTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATA
AATTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAGCCCAATCGTACGATTTACAACATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGACTGAA
GAAATATTTAGTCAGATGCAAACAAATGGAGAAATTGGTGTGAATGCTCGTTCATGCAACATCATTTTAAGTGGGTATCTGTTATTTGGGAATTATTTGAAGGCTGAAAA
AATATATGATTTGATGTGTCAGAAAAAGTATGCCATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTCAGGAAGCCAGTAAGCT
TGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGCCTGGAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCAA
AAGAAATATCAGTTTTGGCCACAAGGCCATCCTGCAATCCCTAATCTAATTCATAGGTGGCTTTCTCCTCGCGTTCTTGCCTACTGGTATATGTATGGAGGCTGCAGGAT
ACTGTCAGGGGATATTTTACTGAAGTTAAAGGGAAGTCATGAAGGTGTTAAGAAGATTGTTAAATCTCTGAGTGAAAAGTCCATGTATTGCAAGGTGAAAAGGAAAGGCA
GGGTGTATTGGATAGGTTTACTCGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGTCTACAGGCAGGCCGTGATTCT
GAGGGGGCTTTAATTGAAACTGAAAATATCAACTTTTATAGTCAATCTTATTCTGATGAGGAGGCTTCTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACTCTCGCTTCTGCTGAAACATGGTTTGCTTCTCACTAAGATGAACCTTCTGCATCCTAAGGGTCCGTCCACAAAAGTTCTCTTCACTCCCAACTCTGCATTCAC
CTTGCTGTCTAACCGTAACCTTGCTTCGGTTTTCTCCATGTCCATTCGTACCTCTGCCTTTGCCACTGTCCCCCTTCTGCGTTCTCTCACTCTTTCCCTCTCTCAATGCC
ATCAACACTTCCGTTGCCACAATCACATCATCCGTACTCTCTTTATCCCAACATATTTTGCAAAAGGACGGCAACAACTTCCGCGAATTCCTGCCTTTCCTTCCAGTTCT
TTCGCTGAACAGTTGGTACACGATCGGGATTCCCCGTCCGAGTCTGAAGAGCACTTGTGTTCTCCATACAGTAACGAGGCCGAGGGTTTTCATTATGAAAATAGTTTTGC
GTCGGCAGATTTGAAACACTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTAGACGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGC
CCGCGCATAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATGAGGCAGGATGATGCGGCCTATGTCACCGTGCATTGTTTGCGTATTCGCGAAAAC
GAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTC
AAAGTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCTTACCTTAGTGCACCTGTTCAAGGATGCA
TAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTCATGAGCAAACCAGGGGAT
TTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATCTGGTAACAACTGGACTCGAGTTACATAAAGATATATATGGTGGTCTAATTTGGTTACATAGTTA
TCAGAATACTATAGACAAAGAAAGGATAGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCGAGGAGGAAAAAGAAGTCCTTTTGTCCATCTTGAGAGCAAGCTCAA
AAATGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTCAAGGCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGGTG
GGTAACCCGATGAAAGCTTTGGAAACATTTAGGGAGATGGAGCAGTTGAACTCTACAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGATAT
AGAACTTGCAGAATCCATCATGGCAGGCTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATA
AATTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAGCCCAATCGTACGATTTACAACATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGACTGAA
GAAATATTTAGTCAGATGCAAACAAATGGAGAAATTGGTGTGAATGCTCGTTCATGCAACATCATTTTAAGTGGGTATCTGTTATTTGGGAATTATTTGAAGGCTGAAAA
AATATATGATTTGATGTGTCAGAAAAAGTATGCCATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTCAGGAAGCCAGTAAGCT
TGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGCCTGGAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCAA
AAGAAATATCAGTTTTGGCCACAAGGCCATCCTGCAATCCCTAATCTAATTCATAGGTGGCTTTCTCCTCGCGTTCTTGCCTACTGGTATATGTATGGAGGCTGCAGGAT
ACTGTCAGGGGATATTTTACTGAAGTTAAAGGGAAGTCATGAAGGTGTTAAGAAGATTGTTAAATCTCTGAGTGAAAAGTCCATGTATTGCAAGGTGAAAAGGAAAGGCA
GGGTGTATTGGATAGGTTTACTCGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGTCTACAGGCAGGCCGTGATTCT
GAGGGGGCTTTAATTGAAACTGAAAATATCAACTTTTATAGTCAATCTTATTCTGATGAGGAGGCTTCTAATTAA
Protein sequenceShow/hide protein sequence
MSLSLLLKHGLLLTKMNLLHPKGPSTKVLFTPNSAFTLLSNRNLASVFSMSIRTSAFATVPLLRSLTLSLSQCHQHFRCHNHIIRTLFIPTYFAKGRQQLPRIPAFPSSS
FAEQLVHDRDSPSESEEHLCSPYSNEAEGFHYENSFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYVTVHCLRIREN
ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGD
LSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQNTIDKERIVSLRKEMQQAGIEEEKEVLLSILRASSKMGDVMEAERSWLKLKAFDGSMPSQAFVYKMEVYAKV
GNPMKALETFREMEQLNSTSAAAYQTIIGILCKFQDIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYNIYLDSLVKVGNLNRTE
EIFSQMQTNGEIGVNARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYAIDPPLMEKLDYVLSLSRKEVRKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFQ
KKYQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRILSGDILLKLKGSHEGVKKIVKSLSEKSMYCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAGRDS
EGALIETENINFYSQSYSDEEASN