; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G007720 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G007720
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr01:4004553..4008774
RNA-Seq ExpressionCmoCh01G007720
SyntenyCmoCh01G007720
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0098.52Show/hide
Query:  MNLVNPKPKVSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAES
        MNLVNPKPKVSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVE LVYDRDSPAES
Subjt:  MNLVNPKPKVSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAES

Query:  EEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMM
        EEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMM
Subjt:  EEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMM

Query:  QQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL
        QQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEE+S IYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL
Subjt:  QQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL

Query:  SKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFV
        SKHHLKQAEFIYHN+ATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFV
Subjt:  SKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFV

Query:  YKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRT
        YKMEVYAKVGNPMKAFEIFREMEQLN ISAAAYQTIIGILCK EEVTLAESVME FIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRT
Subjt:  YKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRT

Query:  IYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR
        IYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR
Subjt:  IYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR

Query:  EILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSP
        EILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHI+EQYHEWLH ASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSP
Subjt:  EILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSP

Query:  RVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINF
        RVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINF
Subjt:  RVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINF

Query:  DSQSDSDEEASS
        DSQSDSDEEASS
Subjt:  DSQSDSDEEASS

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+00100Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0097.08Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTLPFSQCH+HFRC NYVIRSL IPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYS GAE FASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGY PRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNL TTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIYGGLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSWLK+KSFDGSMPSQAFVYKMEVYAKVGNPMKA EIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNSIS+AAYQTIIGILCKFEEVTLAESVM GFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHEDCSTHS LRRH++EQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGV KIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAD+LN+EKA NETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0096.95Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTL F  CHHHFRCRNYVIRSL IPTYSAKGRRQLPRIPAFASSSSVEALV+DRDSPAESEEPLCSPYSTGAEGFASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPA  PGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEE+S IYNRMIQLGGY+PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNL TTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIY GLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNS+SAAAYQTIIGILCK EEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHED STHSRLRRHI+EQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGVAKIVKSL EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQADSLNMEKA NETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

XP_023525582.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0096.95Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTL F  CHHHFRCRNYVIRSL IPTYSAKGRRQL RIPAFASSSSVEALV+DRDSPAESEEPLCSPYSTGAEGFASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEE+S IYNRMIQLGGY+PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNL TTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIY GLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNS+SAAAYQTIIGILCK EEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHED STHSRLRRHI+EQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGVAKIVKSL EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQADSLNMEKA NETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0082.89Show/hide
Query:  SMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAE------GFAS
        SMSI TSAF+TVT LRSLTL  S  HH+F C N++I +L +P YS K RRQLPRI AFAS S V+ LVYD DSP+ESEE L S +S G +      GFAS
Subjt:  SMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAE------GFAS

Query:  ADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLAD
         DLKHLG P LEVKELDELPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQ+KWM QDDA YLIVHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLAD
Subjt:  ADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLAD

Query:  YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLAT
        YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLH+SLF+ALVSKPGDLSKHHLKQAEFIYHNL T
Subjt:  YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLAT

Query:  TGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFE
        +GLELHKD+YGGLIWLHSYQDT+D+ERI+SLRKEM QAGI+EEREVL+SILRASSK+GDVMEAE+ W +LK  DG+MPSQAFVYKMEVYAK+G PMKA E
Subjt:  TGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFE

Query:  IFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDR
        IFREMEQLNS +AAAYQTIIGILCKF+ + LAES+M GFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDR
Subjt:  IFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDR

Query:  AEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE
        AEEIFSQM+TNGEIG++ARSCNIIL GYLL G+Y+KAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESD+
Subjt:  AEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE

Query:  GRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSG
         RKNHRIQFEFH +C THS LRRHI+EQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPRVLAYWYMYGGCR SSG
Subjt:  GRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSG

Query:  DFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        D +LKLKGS EGV KIVKSLREKS+ CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LK+S QADSLN+    N + NINFDS+SDS EE S+
Subjt:  DFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0084.65Show/hide
Query:  SMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAE------GFAS
        SMSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYS G +      GFAS
Subjt:  SMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAE------GFAS

Query:  ADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLAD
         DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLAD
Subjt:  ADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLAD

Query:  YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLAT
        YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNL T
Subjt:  YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLAT

Query:  TGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFE
        +GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEM QAGI+EE+EVL+SILRASSK+GDV+EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA E
Subjt:  TGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFE

Query:  IFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDR
        IFREMEQLNS +AAAYQTIIGILCKF+E+ LAES+M GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDR
Subjt:  IFREMEQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDR

Query:  AEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE
        AEEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE
Subjt:  AEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE

Query:  GRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSG
         RKNHRIQFEFH++C THS LRRHI+EQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSG
Subjt:  GRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSG

Query:  DFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        D +LKLKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QADSLN+    NET NINFDSQSDS EE S+
Subjt:  DFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+00100Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0097.08Show/hide
Query:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG
        MSIRTSAFATVTLLRSLTLPFSQCH+HFRC NYVIRSL IPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYS GAE FASADLKHLG
Subjt:  MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLG

Query:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
        APALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK
Subjt:  APALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERK

Query:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK
        FSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGY PRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNL TTGLELHK
Subjt:  FSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
        DIYGGLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSWLK+KSFDGSMPSQAFVYKMEVYAKVGNPMKA EIFREMEQ
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
        LNSIS+AAYQTIIGILCKFEEVTLAESVM GFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
Subjt:  LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
        MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI

Query:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
        QFEFHEDCSTHS LRRH++EQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK
Subjt:  QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK

Query:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS
        GSREGV KIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQAD+LN+EKA NETYNINFDSQSDSDEEASS
Subjt:  GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS

A0A7N2MZ74 LAGLIDADG_2 domain-containing protein0.0e+0066.79Show/hide
Query:  VSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIP--TYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSP
        ++SS   L++  SS+  +     ++++ LRSL+L  S    H   R++  R++  P  + S K ++  P + A ++SSS    V       E+ E     
Subjt:  VSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIP--TYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSP

Query:  YSTGAEGF-------ASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMM
         S+ +E          S DLKHL A +L+VKELDELPEQWRRSKLAWLCKELPA K GTL+R+LNAQRKWM+Q DA Y+ VHC+RIRENE  F+VYKWMM
Subjt:  YSTGAEGF-------ASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMM

Query:  QQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL
        QQHWYRFD+ALATKLADYMGKERKFSKCRE+FDDIINQG VP ESTFH+LIVAYLS+ IQGC+EE+ +IYNRMIQLGGY+PRLSLHNSLF+ALVSKPG  
Subjt:  QQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL

Query:  SKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFV
        SKH+LKQAEFI+HN+ T+GLE+HKDIYGGLIWLHSYQDT+DK+RI SLRKEM  AGIEE REVLVSILRA SK GDV + E++WLKL   +G +PSQAFV
Subjt:  SKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFV

Query:  YKMEVYAKVGNPMKAFEIFREM-EQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNR
        YKME Y+K+G PMK+ EIFREM EQL S + AA+  II ILCK +EV LAES+M  FI SNLKPL P+Y+D+M+M+FNLSLHDKLEL FSQCLEKC+PNR
Subjt:  YKMEVYAKVGNPMKAFEIFREM-EQLNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNR

Query:  TIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQ
        TIYSIYL+SLV VGNLD+AEEIF+QM ++G I V  RSCN IL GYL SG+Y+KAEKIYDLMCQKKYDID PLMEKLDYVLSLSRKE+KKPVSLKLSKEQ
Subjt:  TIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQ

Query:  REILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLS
        RE+LVGLLLGGL IESDE RKNH ++FE  E+ STHS L+RHIH++YHEWLHP+ + S+   DIPY+F T+SHSYFGFYADQFWP+G P+IP LIHRWLS
Subjt:  REILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLS

Query:  PRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNIN
        PR LAYWYMYGG R SSGD +LKLKG+ EG  K+VK+L+ KS+ C+VK++GRV+WIG LGSN++WFWKLIEP++LDDLKD L+A     E    ET + +
Subjt:  PRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNIN

Query:  FDSQSDSDEEASS
        +DS  +SDE  S+
Subjt:  FDSQSDSDEEASS

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351302.2e-1019.73Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   + V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE +  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV

Query:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII
               L  EM     +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII

Query:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR
            +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +P+  + +  LN   ++G   + E+I ++M+ NG       
Subjt:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR

Query:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI
        + NI+++ Y  +G   + E+++  + +K +   P ++     + + SRK++
Subjt:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.6e-1122.22Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYG
        RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N++  +   +  +        +E +   +   GL      + 
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYG

Query:  GLI-WLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN
          I  L      +D  RI S  +     G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M    
Subjt:  GLI-WLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN

Query:  -SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
           S  +Y  ++  LCK   ++ A++++    ++ + P    Y  L++ + ++   D  +    + +   C PN    +I L+SL K+G +  AEE+  +
Subjt:  -SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI
        M   G  G+   +CNII+ G   SG+  KA +I
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic9.8e-23253.79Show/hide
Query:  PRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKH-LGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQD
        P IPA A  S++E+L+ D D   E E+          E +A+AD +  + +P L V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQRKW+ QD
Subjt:  PRIPAFASSSSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKH-LGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQD

Query:  DAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMI
        DA Y+ VHCLRIR N+ AFRVY WM++QHW+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C+EE+ TIYN+MI
Subjt:  DAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMI

Query:  QLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKL
        Q+GGY+PRLSLHNSLF+ALVSK G  +K++LKQAEF+YHN+ TT L++HKD+Y GLIWLHSYQD +D+ERI++LRKEM QAG +E  +VLVS++RA SK 
Subjt:  QLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKL

Query:  GDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMN
        G+V E E +W  +      +P QA+V +ME YA+ G PMK+ ++F+EM+  N   + A+Y  II I+ K  EV + E +M  FI+S++K L PA++DLM 
Subjt:  GDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMN

Query:  MFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLM
        M+ +L +H+KLELTF +C+ +C+PNR +Y+IYL SLVKVGN+++AEE+F +M  NG IG + +SCNI+L GYL + DY KAEK+YD+M +KKYD+    +
Subjt:  MFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLM

Query:  EKLDYVLSLSRKEIK-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSH
        EKL   L L++K IK K VS+KL +EQREIL+GLLLGG  +ES   R  H + F+F ED + HS LR HIHE++ EWL  AS+  D  + IPY+F T+ H
Subjt:  EKLDYVLSLSRKEIK-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSH

Query:  SYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK-GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEP
         +F F+ DQF+ +G PV+P LIHRWL+PRVLAYW+M+GG ++ SGD VLKL  G+ EGV +IV SL  +S++ KVKRKGR +WIG  GSNA  FW++IEP
Subjt:  SYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK-GSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEP

Query:  FILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEE
         +L++    +  +  ++     +      D+ +DSD++
Subjt:  FILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEE

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic3.8e-1020.12Show/hide
Query:  GTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---
        G++ R L+  +  +  +D A +        + + + R++K+M +Q W + +  + T +   +G+E    KC EVFD++ +QG   S  ++  LI AY   
Subjt:  GTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---

Query:  ---------LSAPIQGCIEESSTIYNRMIQ------------LG--------GYQPRLSLHNSLFKALVSKP-GDLSKHHLKQAEFIYHNLATTGLELHK
                 L       I  S   YN +I             LG        G QP +  +N+L  A   +  GD       +AE ++  +   G+    
Subjt:  ---------LSAPIQGCIEESSTIYNRMIQ------------LG--------GYQPRLSLHNSLFKALVSKP-GDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
          Y  L+   ++      E++  L  EM   G   +      +L A +K G + EA   + ++++   +  +  +   + ++ + G      ++F EM+ 
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNS-ISAAAYQTIIGILCK---FEEVTL--------------------------------AESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTF
         N+   AA Y  +I +  +   F+EV                                  A  +++    +++ P   AY  ++  F   +L+++  + F
Subjt:  LNS-ISAAAYQTIIGILCK---FEEVTL--------------------------------AESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTF

Query:  SQCLE-KCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR
        +   E    P+   +   L S  + G +  +E I S++  +G I  +  + N  +  Y   G + +A K Y  M + + D D   +E +  V S +R
Subjt:  SQCLE-KCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.7e-25555.32Show/hide
Query:  SSSTVLLNSTSSSSMSIRTSAF-ATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPT--------YSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESE
        SSSTV + + + SS+S   +   ++ TL RSL+  FS   H        +R L I T        +S    R  P   A +++      V       ESE
Subjt:  SSSTVLLNSTSSSSMSIRTSAF-ATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPT--------YSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESE

Query:  EPLCSPYSTGAEGFASADLKHLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYK
        E +      G    A  D++++    +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RLLNAQ+KW++Q+DA Y+ VHC+RIRENET FRVY+
Subjt:  EPLCSPYSTGAEGFASADLKHLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYK

Query:  WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSK
        WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  ++GC+EE+ ++YNRMIQLGGY+PRLSLHNSLF+ALVSK
Subjt:  WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSK

Query:  PGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPS
         G +    LKQAEFI+HN+ TTGLE+ KDIY GLIWLHS QD VD  RI SLR+EM +AG +E +EV+VS+LRA +K G V E ER+WL+L   D  +PS
Subjt:  PGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPS

Query:  QAFVYKMEVYAKVGNPMKAFEIFREMEQ-LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKC
        QAFVYK+E Y+KVG+  KA EIFREME+ +   + + Y  II +LCK ++V L E++M+ F +S  KPL P+++++  M+F+L LH+KLE+ F QCLEKC
Subjt:  QAFVYKMEVYAKVGNPMKAFEIFREMEQ-LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKC

Query:  KPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK
        +P++ IY+IYL+SL K+GNL++A ++F++M+ NG I VSARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KE+KK P S+K
Subjt:  KPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK

Query:  LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLI
        LSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF E+   H  L+++IH+Q+ EWLHP S   +    IP++F +V HSYFGFYA+ +WP+G P IP LI
Subjt:  LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLI

Query:  HRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ--ADSLNMEKAA
        HRWLSP  LAYWYMY G + SSGD +L+LKGS EGV K+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+  ++SL+  K A
Subjt:  HRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ--ADSLNMEKAA

Query:  NETYNINFDSQSDSDEE
         E  +INF S SD  ++
Subjt:  NETYNINFDSQSDSDEE

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 22.7e-1120.12Show/hide
Query:  GTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---
        G++ R L+  +  +  +D A +        + + + R++K+M +Q W + +  + T +   +G+E    KC EVFD++ +QG   S  ++  LI AY   
Subjt:  GTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---

Query:  ---------LSAPIQGCIEESSTIYNRMIQ------------LG--------GYQPRLSLHNSLFKALVSKP-GDLSKHHLKQAEFIYHNLATTGLELHK
                 L       I  S   YN +I             LG        G QP +  +N+L  A   +  GD       +AE ++  +   G+    
Subjt:  ---------LSAPIQGCIEESSTIYNRMIQ------------LG--------GYQPRLSLHNSLFKALVSKP-GDLSKHHLKQAEFIYHNLATTGLELHK

Query:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ
          Y  L+   ++      E++  L  EM   G   +      +L A +K G + EA   + ++++   +  +  +   + ++ + G      ++F EM+ 
Subjt:  DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQ

Query:  LNS-ISAAAYQTIIGILCK---FEEVTL--------------------------------AESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTF
         N+   AA Y  +I +  +   F+EV                                  A  +++    +++ P   AY  ++  F   +L+++  + F
Subjt:  LNS-ISAAAYQTIIGILCK---FEEVTL--------------------------------AESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTF

Query:  SQCLE-KCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR
        +   E    P+   +   L S  + G +  +E I S++  +G I  +  + N  +  Y   G + +A K Y  M + + D D   +E +  V S +R
Subjt:  SQCLE-KCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR

AT2G15820.1 endonucleases1.2e-25655.32Show/hide
Query:  SSSTVLLNSTSSSSMSIRTSAF-ATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPT--------YSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESE
        SSSTV + + + SS+S   +   ++ TL RSL+  FS   H        +R L I T        +S    R  P   A +++      V       ESE
Subjt:  SSSTVLLNSTSSSSMSIRTSAF-ATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPT--------YSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESE

Query:  EPLCSPYSTGAEGFASADLKHLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYK
        E +      G    A  D++++    +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RLLNAQ+KW++Q+DA Y+ VHC+RIRENET FRVY+
Subjt:  EPLCSPYSTGAEGFASADLKHLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYK

Query:  WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSK
        WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  ++GC+EE+ ++YNRMIQLGGY+PRLSLHNSLF+ALVSK
Subjt:  WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSK

Query:  PGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPS
         G +    LKQAEFI+HN+ TTGLE+ KDIY GLIWLHS QD VD  RI SLR+EM +AG +E +EV+VS+LRA +K G V E ER+WL+L   D  +PS
Subjt:  PGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPS

Query:  QAFVYKMEVYAKVGNPMKAFEIFREMEQ-LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKC
        QAFVYK+E Y+KVG+  KA EIFREME+ +   + + Y  II +LCK ++V L E++M+ F +S  KPL P+++++  M+F+L LH+KLE+ F QCLEKC
Subjt:  QAFVYKMEVYAKVGNPMKAFEIFREMEQ-LNSISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKC

Query:  KPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK
        +P++ IY+IYL+SL K+GNL++A ++F++M+ NG I VSARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KE+KK P S+K
Subjt:  KPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK

Query:  LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLI
        LSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF E+   H  L+++IH+Q+ EWLHP S   +    IP++F +V HSYFGFYA+ +WP+G P IP LI
Subjt:  LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLI

Query:  HRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ--ADSLNMEKAA
        HRWLSP  LAYWYMY G + SSGD +L+LKGS EGV K+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L++LK+ L+  ++SL+  K A
Subjt:  HRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQ--ADSLNMEKAA

Query:  NETYNINFDSQSDSDEE
         E  +INF S SD  ++
Subjt:  NETYNINFDSQSDSDEE

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-1222.22Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYG
        RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N++  +   +  +        +E +   +   GL      + 
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYG

Query:  GLI-WLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN
          I  L      +D  RI S  +     G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M    
Subjt:  GLI-WLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN

Query:  -SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ
           S  +Y  ++  LCK   ++ A++++    ++ + P    Y  L++ + ++   D  +    + +   C PN    +I L+SL K+G +  AEE+  +
Subjt:  -SISAAAYQTIIGILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQ

Query:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI
        M   G  G+   +CNII+ G   SG+  KA +I
Subjt:  MQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-1119.73Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   + V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE +  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV

Query:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII
               L  EM     +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII

Query:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR
            +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +P+  + +  LN   ++G   + E+I ++M+ NG       
Subjt:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR

Query:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI
        + NI+++ Y  +G   + E+++  + +K +   P ++     + + SRK++
Subjt:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-1119.73Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   + V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE +  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPR---LSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGLIWLHSYQDTV

Query:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII
               L  EM     +       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SISAAAYQTII

Query:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR
            +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +P+  + +  LN   ++G   + E+I ++M+ NG       
Subjt:  GILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSAR

Query:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI
        + NI+++ Y  +G   + E+++  + +K +   P ++     + + SRK++
Subjt:  SCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTCGTAAACCCTAAGCCTAAGGTTTCATCATCGACAGTTCTTCTGAACTCTACTTCGAGTTCCTCCATGTCCATTCGAACCTCTGCCTTCGCCACCGTCACCCT
TCTCCGCTCTCTCACTCTTCCCTTCTCTCAATGCCACCACCACTTCCGTTGCCGGAACTACGTCATCCGTTCTCTCTGTATCCCAACATATTCAGCGAAAGGACGACGAC
AACTTCCGAGAATTCCTGCCTTTGCTTCCAGTTCTTCCGTTGAAGCGTTGGTGTATGACCGGGATTCCCCGGCCGAATCTGAAGAGCCTTTGTGTTCTCCATACAGTACT
GGCGCTGAGGGGTTTGCGTCGGCGGATTTGAAACACTTGGGAGCGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGTAGATCCAAATTGGCTTG
GCTTTGTAAAGAATTGCCAGCACAGAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATGAAGCAGGATGACGCGGCCTATCTCATCGTGCATTGTT
TGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGC
AAGGAACGGAAGTTCTCGAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCCTACCTTAGTGC
ACCTATCCAAGGATGCATAGAGGAATCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAAAGCTCTGG
TGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTCATATATCACAATCTGGCAACAACTGGACTTGAGTTGCATAAGGATATATATGGTGGTCTA
ATTTGGCTACATAGTTATCAGGATACTGTAGACAAAGAAAGGATAATGTCACTAAGGAAAGAAATGCATCAAGCAGGAATTGAGGAAGAAAGAGAAGTCCTTGTATCCAT
CTTGAGAGCGAGCTCGAAATTGGGGGATGTGATGGAAGCAGAGAGATCGTGGCTTAAACTTAAGTCTTTTGATGGTAGCATGCCATCCCAGGCTTTTGTTTACAAAATGG
AAGTATATGCAAAGGTGGGTAATCCGATGAAAGCTTTCGAGATATTTAGGGAGATGGAGCAGTTGAACTCTATAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTA
TGTAAATTTGAAGAGGTAACACTAGCAGAATCCGTCATGGAAGGCTTCATAAAGAGTAATTTAAAGCCCCTCAAGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAA
TTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCAAATCGTACTATTTACAGCATATATTTGAACTCTTTGGTAAAAGTTGGTA
ATCTCGACAGGGCTGAAGAAATATTTAGTCAGATGCAAACAAATGGAGAAATTGGTGTAAGTGCTCGTTCATGCAACATTATTTTAAGTGGGTACCTGTTAAGTGGGGAT
TATTTGAAGGCTGAAAAAATATATGACTTGATGTGTCAGAAAAAGTACGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGAT
TAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTATTAGGTGGCCTGGAGATCGAATCTGATGAAGGGAGGAAGAATCATAGGA
TCCAATTTGAATTCCACGAAGATTGTAGCACCCACTCTCGTTTGAGGAGACACATACATGAGCAATATCATGAGTGGTTACATCCTGCTTCAAAGTTAAGCGATAGTGAT
ACAGATATACCATATAAATTCTGCACCGTTTCACATTCATATTTTGGTTTCTACGCCGATCAGTTTTGGCCACGAGGCCATCCTGTAATCCCTAATCTAATTCACCGGTG
GCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGATATCGTCAGGAGATTTCGTACTGAAGCTAAAGGGAAGTCGTGAGGGTGTTGCGAAGATTG
TTAAATCTCTGAGAGAAAAGTCCATGTCTTGCAAGGTCAAAAGGAAGGGCAGGGTGTATTGGATAGGCTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAA
CCTTTCATTCTGGATGACTTGAAAGATAGTTTACAGGCAGACAGCCTCAACATGGAGAAGGCTGCAAATGAAACTTACAATATCAACTTTGATAGTCAATCTGATTCCGA
TGAGGAGGCGTCCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCTCGTAAACCCTAAGCCTAAGGTTTCATCATCGACAGTTCTTCTGAACTCTACTTCGAGTTCCTCCATGTCCATTCGAACCTCTGCCTTCGCCACCGTCACCCT
TCTCCGCTCTCTCACTCTTCCCTTCTCTCAATGCCACCACCACTTCCGTTGCCGGAACTACGTCATCCGTTCTCTCTGTATCCCAACATATTCAGCGAAAGGACGACGAC
AACTTCCGAGAATTCCTGCCTTTGCTTCCAGTTCTTCCGTTGAAGCGTTGGTGTATGACCGGGATTCCCCGGCCGAATCTGAAGAGCCTTTGTGTTCTCCATACAGTACT
GGCGCTGAGGGGTTTGCGTCGGCGGATTTGAAACACTTGGGAGCGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGTAGATCCAAATTGGCTTG
GCTTTGTAAAGAATTGCCAGCACAGAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATGAAGCAGGATGACGCGGCCTATCTCATCGTGCATTGTT
TGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGC
AAGGAACGGAAGTTCTCGAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCCTACCTTAGTGC
ACCTATCCAAGGATGCATAGAGGAATCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAAAGCTCTGG
TGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTCATATATCACAATCTGGCAACAACTGGACTTGAGTTGCATAAGGATATATATGGTGGTCTA
ATTTGGCTACATAGTTATCAGGATACTGTAGACAAAGAAAGGATAATGTCACTAAGGAAAGAAATGCATCAAGCAGGAATTGAGGAAGAAAGAGAAGTCCTTGTATCCAT
CTTGAGAGCGAGCTCGAAATTGGGGGATGTGATGGAAGCAGAGAGATCGTGGCTTAAACTTAAGTCTTTTGATGGTAGCATGCCATCCCAGGCTTTTGTTTACAAAATGG
AAGTATATGCAAAGGTGGGTAATCCGATGAAAGCTTTCGAGATATTTAGGGAGATGGAGCAGTTGAACTCTATAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTA
TGTAAATTTGAAGAGGTAACACTAGCAGAATCCGTCATGGAAGGCTTCATAAAGAGTAATTTAAAGCCCCTCAAGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAA
TTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCAAATCGTACTATTTACAGCATATATTTGAACTCTTTGGTAAAAGTTGGTA
ATCTCGACAGGGCTGAAGAAATATTTAGTCAGATGCAAACAAATGGAGAAATTGGTGTAAGTGCTCGTTCATGCAACATTATTTTAAGTGGGTACCTGTTAAGTGGGGAT
TATTTGAAGGCTGAAAAAATATATGACTTGATGTGTCAGAAAAAGTACGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGAT
TAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTATTAGGTGGCCTGGAGATCGAATCTGATGAAGGGAGGAAGAATCATAGGA
TCCAATTTGAATTCCACGAAGATTGTAGCACCCACTCTCGTTTGAGGAGACACATACATGAGCAATATCATGAGTGGTTACATCCTGCTTCAAAGTTAAGCGATAGTGAT
ACAGATATACCATATAAATTCTGCACCGTTTCACATTCATATTTTGGTTTCTACGCCGATCAGTTTTGGCCACGAGGCCATCCTGTAATCCCTAATCTAATTCACCGGTG
GCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGATATCGTCAGGAGATTTCGTACTGAAGCTAAAGGGAAGTCGTGAGGGTGTTGCGAAGATTG
TTAAATCTCTGAGAGAAAAGTCCATGTCTTGCAAGGTCAAAAGGAAGGGCAGGGTGTATTGGATAGGCTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAA
CCTTTCATTCTGGATGACTTGAAAGATAGTTTACAGGCAGACAGCCTCAACATGGAGAAGGCTGCAAATGAAACTTACAATATCAACTTTGATAGTCAATCTGATTCCGA
TGAGGAGGCGTCCAGTTAGTACAAGAATTTTGGCTCTAATTCGCCGAATGATGAATTCCTTCAACGTTGAATGGTTTTGGGAGGCTTTGTAAATCAAATAGGGTCTTCCT
TGTCCAATTTGGAAGATGTAAATACCTAAATGGAGATAAAATGAGCTTATGTTCTGATTATGTATTGTTGTAGCATTCTTTTCTTATTTTTTATTTTTAAATTTTAATAT
AGGTTATATGCTGGTTTGGAAGTCTTACAAAAATCACTTATATATTTTGTTCTTGTAAACTCTCAGAATGTTCATGTTAATTATTGTTCTTTTTTAATATCTATTTTGTA
ACCACAGCTGTTCCAGCGTCAATCCTCTCCATGTGAACCTTGTTTGGGAAAGCAAACTACCAAAGAAGGTTTAAATTTTCCTTTAGCAGGAGCTTAAATTCAGAGAGGCA
AACTTCGACCATCGCTTTGCGGCCAGAGGGTTGTTCTTCACGGACATTTTGGGATTGAAGATGAATTGTGATTGCCATTTTGATTGTTTAGAGAAGACATGTCTGGAAGG
AAAAGGTGCAACCTCTTAAATAGCAAATTGTTGCTTGCTTGATTGAGGTATATTGGAGAAGTTCACTGGACTGAATTAGACTGCATTAGCTTTTGTACTCAGAATGCAAT
ATTATAATTATTAGACTGAGAAGAACTGAATTATTTATGTAGATATTAAATATTTCACACGTTGACATAGTCTATAGAATGCGTGGGAGAAGAAGTTACAAGTAATTGAC
GGAAAAAGAAATCTGTGCTTACCAAGGATATACACTAGGACGAATAATTATTCTATTAAAAATTTGTCGATAGTACGTATTATTGGTATATATAGTAATTATCGATCA
Protein sequenceShow/hide protein sequence
MNLVNPKPKVSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASSSSVEALVYDRDSPAESEEPLCSPYST
GAEGFASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMG
KERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHKDIYGGL
IWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGIL
CKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGD
YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSD
TDIPYKFCTVSHSYFGFYADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIE
PFILDDLKDSLQADSLNMEKAANETYNINFDSQSDSDEEASS