; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008764 (gene) of Snake gourd v1 genome

Gene IDTan0008764
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLAGLIDADG_2 domain-containing protein
Genome locationLG10:59418593..59424897
RNA-Seq ExpressionTan0008764
SyntenyTan0008764
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.04Show/hide
Query:  LRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGF
        L N  S  SMSIRTSAFAT+TLL SLT   SQC+HHFRC NY+IR+L IPTYSAKGRRQLP+IPAFASSS VE LVYDRDS +E EE   SPYS GAEG 
Subjt:  LRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGF

Query:  HFENSFASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDY
             FASAD KHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWY+FDY
Subjt:  HFENSFASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDY

Query:  ALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAE
        ALATKLADY+GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAE
Subjt:  ALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAE

Query:  FIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKV
        FIYHN+  TGLELHK IYGGLIWLHSYQDTVDKERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDVMEAERSWLKLK FDGSMPSQAFVYKMEVY+KV
Subjt:  FIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKV

Query:  GNPMKALEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSL
        GNPMKA EIFREMEQLN  SAA YQTIIGILCK +E+ LAES+M  FIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SL
Subjt:  GNPMKALEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSL

Query:  VKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLG
        VKVGNL+RAEEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLG
Subjt:  VKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLG

Query:  GLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMY
        GLEIESDEGRKNHRIQFEFH+  STHSRLRRHIYEQYHEWLH ASKLSD+D DIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMY
Subjt:  GLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMY

Query:  GGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEE
        GGCRI SGD +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD+LQA SLN+E A  ET NI+FD+QSDSDEE
Subjt:  GGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEE

Query:  ASN
        AS+
Subjt:  ASN

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+0088.66Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT   SQC+HHFRC NY+IR+L IPTYSAKGRRQLP+IPAFASSS VE LVYDRDS +E EE   SPYS GAEG      FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAYL VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+S IYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL  T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IYGGLIWLHSYQDTVDKERI+SLRKEM QAGI+EEREVL+SILRASSK+GDVMEAERSWLKLK FDGSMPSQAFVYKMEVY+KVGNPMKA EI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ SAA YQTIIGILCKF+E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHSRLRRHI+EQYHEWLHPASKLSD+D DIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD+LQA SLN+E A  ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0089.17Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT   SQC++HFRC NY+IR+LSIPTYSAKGRRQLP+IPAFASSS VE LVYDRDS +E EE   SPYSNGAE       FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAYL VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS IYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLV T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IYGGLIWLHSYQDTVDKERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDVMEAERSWLK+K FDGSMPSQAFVYKMEVY+KVGNPMKALEI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ S+A YQTIIGILCKF+E+ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHS LRRH+YEQYHEWLHPASKLSD+D DIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD+LQA +LNLE A+ ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0088.41Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT S   C+HHFRC NY+IR+LSIPTYSAKGRRQLP+IPAFASSS VE LV+DRDS +E EE   SPYS GAEG      FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPAH PGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLV T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IY GLIWLHSYQDTVDKERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDVMEAERSWLKLK FDGSMPSQAFVYKMEVY+KVGNPMKA EI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ SAA YQTIIGILCK +E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHSRLRRHIYEQYHEWLHPASK SD+D DIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD LQA SLN+E A+ ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

XP_023525582.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0088.41Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT S   C+HHFRC NY+IR+LSIPTYSAKGRRQL +IPAFASSS VE LV+DRDS +E EE   SPYS GAEG      FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAY+ VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEEAS IYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLV T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IY GLIWLHSYQDTVDKERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDVMEAERSWLKLK FDGSMPSQAFVYKMEVY+KVGNPMKA EI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ SAA YQTIIGILCK +E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHSRLRRHIYEQYHEWLHPASK SD+D DIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD LQA SLN+E A+ ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0084.57Show/hide
Query:  VFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSF
        VFSMSI TSAF+T+T L SLT SLS  +H+F C N+II TL +P YS K RRQLP+I AFAS SFV+ LVYD DS SE EE   S +SNG +GFHFEN F
Subjt:  VFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSF

Query:  ASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKL
        AS D KHLGTP LEVKELDELPEQWRRSK+AWLCKELPA KPGT+IRLLNAQ+KWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWY+FDYAL+TKL
Subjt:  ASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKL

Query:  ADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADY+GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS IYNRMIQLGGYQPRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKA
        V +GLELHK +YGGLIWLHSYQDT+D+ERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAE+ W +LKY DG+MPSQAFVYKMEVY+K+G PMKA
Subjt:  VRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKA

Query:  LEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLN+T+AA YQTIIGILCKFQ I+LAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIES
        +RAEEIFSQMETNGEIG+NARSCNIIL GYLL GNY+KAEKIYDLMCQK+YDID  LMEKL+Y+LSLSRKEVKKP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIW
        D+ RKNHRIQFEFH+   THS LRRHIYEQYH+WLH ASKL+D D+DIPYKFCTVSHSYFGFYADQFWPRG  AIPNLIHRWLSPRVLAYWYMYGGCR  
Subjt:  DEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIW

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
        SGDILLKLKGSHEGVEKIVKSLREKS++CKVKRKG +YWIGLLGSNATWF KLIEPFILD LK++ QA SLNL   L  +ENI+FD++SDS EE SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0086.95Show/hide
Query:  VFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSF
        VFSMSI TSAF+T+TLL SLT SLS  +H+F   N+II TL I +YS K  RQLP+I AFAS SFV+ LVYDRDS SE EE   SPYSNG +GFHFEN F
Subjt:  VFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSF

Query:  ASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKL
        AS D KHLGTPALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA YLTVHCLRIRENETAFRVYKWMMQQHWY+FDYAL+TKL
Subjt:  ASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKL

Query:  ADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADY+GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS IYNRMIQLGGYQPRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKA
        V +GLELHK IYGGLIWLHSYQDT+DKERIVSLRKEMQQAGIKEE+EVLLSILRASSKMGDV+EAER W KLKY DG+MP QAFVYKMEVY+K+G PMKA
Subjt:  VRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKA

Query:  LEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLN+T+AA YQTIIGILCKFQEI+LAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIES
        +RAEEIFSQMETNGEIGVNARSCN+IL GYLL GNY+KAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKEVKKP+SLKLSKEQREILVGLLLGGLEIES
Subjt:  NRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIES

Query:  DEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIW
        DE RKNHRIQFEFHK   THS LRRHIYEQYH+WLH ASKL+D DIDIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR  
Subjt:  DEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIW

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
        SGDILLKLKGSHEGVEKIVKSLREKSM+CKVKRKG +YWIGLLGSNATWF KLIEPFILDDLK++ QA SLNL   L ETENI+FD+QSDS EE SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0088.66Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT   SQC+HHFRC NY+IR+L IPTYSAKGRRQLP+IPAFASSS VE LVYDRDS +E EE   SPYS GAEG      FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWM+QDDAAYL VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+S IYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL  T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IYGGLIWLHSYQDTVDKERI+SLRKEM QAGI+EEREVL+SILRASSK+GDVMEAERSWLKLK FDGSMPSQAFVYKMEVY+KVGNPMKA EI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ SAA YQTIIGILCKF+E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHSRLRRHI+EQYHEWLHPASKLSD+D DIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD+LQA SLN+E A  ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0089.17Show/hide
Query:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA
        MSIRTSAFAT+TLL SLT   SQC++HFRC NY+IR+LSIPTYSAKGRRQLP+IPAFASSS VE LVYDRDS +E EE   SPYSNGAE       FASA
Subjt:  MSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTYSAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASA

Query:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY
        D KHLG PALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM+QDDAAYL VHCLRIRENETAFRVYKWMMQQHWY+FDYALATKLADY
Subjt:  DFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADY

Query:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT
        +GKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEAS IYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLV T
Subjt:  LGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRT

Query:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI
        GLELHK IYGGLIWLHSYQDTVDKERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDVMEAERSWLK+K FDGSMPSQAFVYKMEVY+KVGNPMKALEI
Subjt:  GLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEI

Query:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA
        FREMEQLN+ S+A YQTIIGILCKF+E+ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNL+RA
Subjt:  FREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRA

Query:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG
        EEIFSQM+TNGEIGV+ARSCNIILSGYLLSG+YLKAEKIYDLMCQKKYDID  LMEKLDYVLSLSRKE+KKPVSLKLSKEQREILVGLLLGGLEIESDEG
Subjt:  EEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGLEIESDEG

Query:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD
        RKNHRIQFEFH+  STHS LRRH+YEQYHEWLHPASKLSD+D DIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI SGD
Subjt:  RKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN
         +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGLLGSNATWF KLIEPFILDDLKD+LQA +LNLE A+ ET NI+FD+QSDSDEEAS+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN

A0A7N2MZ74 LAGLIDADG_2 domain-containing protein0.0e+0066.43Show/hide
Query:  SPTVPLTPNSAF-TVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCN-HHFRCHNYIIRTLSIP--TYSAKGRRQLPQIPAFASSS----FVEPLV
        S T+ LT +  F + L   NP   F M       ++++ L SL+ SLS    HHF  H +  R +S P  + S K ++  P + A ++SS    FVE L 
Subjt:  SPTVPLTPNSAF-TVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCN-HHFRCHNYIIRTLSIP--TYSAKGRRQLPQIPAFASSS----FVEPLV

Query:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLR
         + ++   ++   ++  S     F F+ +  S D KHL   +L+VKELDELPEQWRRSKLAWLCKELPAHK GTL+R+LNAQRKWMRQ DA Y+ VHC+R
Subjt:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTVHCLR

Query:  IRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSL
        IRENE  F+VYKWMMQQHWY+FD+ALATKLADY+GKERKFSKCRE+FDDIINQG VP ESTFH+LIVAYLS+ +QGC+EEA  IYNRMIQLGGY+PRLSL
Subjt:  IRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSL

Query:  HNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWL
        HNSLFRAL+SKPG  SKH+LKQAEFI+HN+V +GLE+HK IYGGLIWLHSYQDT+DK+RI SLRKEM+ AGI+E REVL+SILRA SK GDV + E++WL
Subjt:  HNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWL

Query:  KLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREM-EQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL
        KL + +G +PSQAFVYKME YSK+G PMK+LEIFREM EQL +T+ A +  II ILCK QE++LAES+M  FI SNLKPL P+Y+D+M+M+FNLSLHDKL
Subjt:  KLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREM-EQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKL

Query:  ELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR
        EL FSQCLEKC+PNRTIYSIYLDSLV VGNL++AEEIF+QM ++G I V+ RSCN IL GYL SG Y+KAEKIYDLMCQKKYDIDS LMEKLDYVLSLSR
Subjt:  ELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR

Query:  KEVKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWP
        KEVKKPVSLKLSKEQRE+LVGLLLGGL IESDE RKNH ++FE  +  STHS L+RHI+++YHEWLHP+ + S++  DIPY+F T+SHSYFGFYADQFWP
Subjt:  KEVKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWP

Query:  RGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAG
        +G P IP LIHRWLSPR LAYWYMYGG R  SGDILLKLKG+ EG EK+VK+L+ KS+ C+VK++GRV+WIG LGSN++WF KLIEP++LDDLKD L+AG
Subjt:  RGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAG

Query:  SLNLESALIETENIDFDNQSDSDEEASN
            E+ L ET++  +D+  +SDE  SN
Subjt:  SLNLESALIETENIDFDNQSDSDEEASN

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351301.9e-1221.02Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   ++V     ++ ++   V +W++++  +Q D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE A ++   M Q     P+   ++++N+    LM + G     + ++A  ++  + R   +     Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV

Query:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII
               L  EM+    K       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME YS+ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII

Query:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++ME NG    +  
Subjt:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR

Query:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y  +G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.4e-1023.35Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALM--SKPGDLSKHHLKQAEFIYHNLVRTGLELHKHI
        RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N++  +     +  D  K   K  E     LV   +  +  I
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALM--SKPGDLSKHHLKQAEFIYHNLVRTGLELHKHI

Query:  YGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSI-LRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQL
              L      +D  RI S  +  +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M   
Subjt:  YGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSI-LRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQL

Query:  NT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFS
            S  +Y  ++  LCK   +  A++I+    ++ + P    Y  L++ + ++   D  +    + +   C PN    +I L SL K+G ++ AEE+  
Subjt:  NT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFS

Query:  QMETNGEIGVNARSCNIILSGYLLSGNYLKAEKI
        +M   G  G++  +CNII+ G   SG   KA +I
Subjt:  QMETNGEIGVNARSCNIILSGYLLSGNYLKAEKI

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic1.9e-22552.81Show/hide
Query:  PQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQR
        P IPA AS+  +E L+ D D   E E+           G     ++A+AD +  + +P L V EL+ELPEQWRRS++AWLCKELPA+K  T  R+LNAQR
Subjt:  PQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKH-LGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQR

Query:  KWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASI
        KW+ QDDA Y+ VHCLRIR N+ AFRVY WM++QHW++F++ALAT++AD LG++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C+EEA  
Subjt:  KWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASI

Query:  IYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSIL
        IYN+MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAEF+YHN+V T L++HK +Y GLIWLHSYQD +D+ERI++LRKEM+QAG  E  +VL+S++
Subjt:  IYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSIL

Query:  RASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPA
        RA SK G+V E E +W  +      +P QA+V +ME Y++ G PMK+L++F+EM+  N   + A+Y  II I+ K  E+ + E +M  FI+S++K LMPA
Subjt:  RASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPA

Query:  YVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYD
        ++DLM M+ +L +H+KLELTF +C+ +C+PNR +Y+IYL+SLVKVGN+ +AEE+F +M  NG IG N +SCNI+L GYL + +Y KAEK+YD+M +KKYD
Subjt:  YVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYD

Query:  IDSRLMEKLDYVLSLSRKEVK-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYK
        + +  +EKL   L L++K +K K VS+KL +EQREIL+GLLLGG  +ES   R  H + F+F +  + HS LR HI+E++ EWL  AS+  D+   IPY+
Subjt:  IDSRLMEKLDYVLSLSRKEVK-KPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYK

Query:  FCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSH-EGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWF
        F T+ H +F F+ DQF+ +G P +P LIHRWL+PRVLAYW+M+GG ++ SGDI+LKL G + EGVE+IV SL  +S+  KVKRKGR +WIG  GSNA  F
Subjt:  FCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSH-EGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWF

Query:  LKLIEPFILDDLKDNLQAGSLNLESALIETENI-DFDNQSDSDEEASN
         ++IEP +L++      A  +  E + I ++   D D  SD D + S+
Subjt:  LKLIEPFILDDLKDNLQAGSLNLESALIETENI-DFDNQSDSDEEASN

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic2.2e-1120.72Show/hide
Query:  GTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---
        G++ R L+  +  +  +D A +        + + + R++K+M +Q W + +  + T +   LG+E    KC EVFD++ +QG   S  ++  LI AY   
Subjt:  GTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---

Query:  ---------LSAPVQGCIEEASIIYNRMIQ------------LG--------GYQPRLSLHNSLFRALMSKP-GDLSKHHLKQAEFIYHNLVRTGLELHK
                 L       I  + + YN +I             LG        G QP +  +N+L  A   +  GD       +AE ++  +   G+    
Subjt:  ---------LSAPVQGCIEEASIIYNRMIQ------------LG--------GYQPRLSLHNSLFRALMSKP-GDLSKHHLKQAEFIYHNLVRTGLELHK

Query:  HIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ
          Y  L+   ++      E++  L  EM   G   +      +L A +K G + EA   + +++    +  +  +   + ++ + G      ++F EM+ 
Subjt:  HIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ

Query:  LNT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLM-----------------------------------PAYVDLMNMFFNLSLHDKLELTF
         NT   AATY  +I +  +    K   ++    ++ N++P M                                    AY  ++  F   +L+++  + F
Subjt:  LNT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLM-----------------------------------PAYVDLMNMFFNLSLHDKLELTF

Query:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR
        +   E    P+   +   L S  + G +  +E I S++  +G I  N  + N  +  Y   G + +A K Y  M + + D D R +E +  V S +R
Subjt:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic7.6e-25154.61Show/hide
Query:  SPTVPLTPNSAFTVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPT--------YSAKGRRQLPQIPAFASSSFVEPLV
        S TV +T    F + SL + P++ + S         TL  SL+FSL    H        +R LSI T        +S    R  P   A +++      V
Subjt:  SPTVPLTPNSAFTVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPT--------YSAKGRRQLPQIPAFASSSFVEPLV

Query:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTV
             ++E EE       + A GF    S A  D +++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++V
Subjt:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTV

Query:  HCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASIIYNRMIQLGGYQ
        HC+RIRENET FRVY+WM QQ+WY+FD+ L TKLA+YLGKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA  +YNRMIQLGGY+
Subjt:  HCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASIIYNRMIQLGGYQ

Query:  PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEA
        PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+V TGLE+ K IY GLIWLHS QD VD  RI SLR+EM++AG +E +EV++S+LRA +K G V E 
Subjt:  PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEA

Query:  ERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ-LNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLS
        ER+WL+L   D  +PSQAFVYK+E YSKVG+  KA+EIFREME+ +   + + Y  II +LCK Q+++L E++M  F +S  KPL+P+++++  M+F+L 
Subjt:  ERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ-LNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLS

Query:  LHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYV
        LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GNL +A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+  LMEKLDY+
Subjt:  LHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYV

Query:  LSLSRKEVKK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFY
        LSL +KEVKK P S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF +    H  L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFY
Subjt:  LSLSRKEVKK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFY

Query:  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLK
        A+ +WP+G P IP LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  F KLIEP +L++LK
Subjt:  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLK

Query:  DNLQAGSLNLESAL-IETENIDFDNQSDSDEEASN
        ++L+  S +L++    E ++I+F + SD  ++  N
Subjt:  DNLQAGSLNLESAL-IETENIDFDNQSDSDEEASN

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 21.5e-1220.72Show/hide
Query:  GTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---
        G++ R L+  +  +  +D A +        + + + R++K+M +Q W + +  + T +   LG+E    KC EVFD++ +QG   S  ++  LI AY   
Subjt:  GTLIRLLNAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---

Query:  ---------LSAPVQGCIEEASIIYNRMIQ------------LG--------GYQPRLSLHNSLFRALMSKP-GDLSKHHLKQAEFIYHNLVRTGLELHK
                 L       I  + + YN +I             LG        G QP +  +N+L  A   +  GD       +AE ++  +   G+    
Subjt:  ---------LSAPVQGCIEEASIIYNRMIQ------------LG--------GYQPRLSLHNSLFRALMSKP-GDLSKHHLKQAEFIYHNLVRTGLELHK

Query:  HIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ
          Y  L+   ++      E++  L  EM   G   +      +L A +K G + EA   + +++    +  +  +   + ++ + G      ++F EM+ 
Subjt:  HIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ

Query:  LNT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLM-----------------------------------PAYVDLMNMFFNLSLHDKLELTF
         NT   AATY  +I +  +    K   ++    ++ N++P M                                    AY  ++  F   +L+++  + F
Subjt:  LNT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLM-----------------------------------PAYVDLMNMFFNLSLHDKLELTF

Query:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR
        +   E    P+   +   L S  + G +  +E I S++  +G I  N  + N  +  Y   G + +A K Y  M + + D D R +E +  V S +R
Subjt:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSR

AT2G15820.1 endonucleases5.4e-25254.61Show/hide
Query:  SPTVPLTPNSAFTVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPT--------YSAKGRRQLPQIPAFASSSFVEPLV
        S TV +T    F + SL + P++ + S         TL  SL+FSL    H        +R LSI T        +S    R  P   A +++      V
Subjt:  SPTVPLTPNSAFTVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPT--------YSAKGRRQLPQIPAFASSSFVEPLV

Query:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTV
             ++E EE       + A GF    S A  D +++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW+RQ+DA Y++V
Subjt:  YDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMRQDDAAYLTV

Query:  HCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASIIYNRMIQLGGYQ
        HC+RIRENET FRVY+WM QQ+WY+FD+ L TKLA+YLGKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA  +YNRMIQLGGY+
Subjt:  HCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASIIYNRMIQLGGYQ

Query:  PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEA
        PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+V TGLE+ K IY GLIWLHS QD VD  RI SLR+EM++AG +E +EV++S+LRA +K G V E 
Subjt:  PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEA

Query:  ERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ-LNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLS
        ER+WL+L   D  +PSQAFVYK+E YSKVG+  KA+EIFREME+ +   + + Y  II +LCK Q+++L E++M  F +S  KPL+P+++++  M+F+L 
Subjt:  ERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQ-LNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLS

Query:  LHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYV
        LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GNL +A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+  LMEKLDY+
Subjt:  LHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYV

Query:  LSLSRKEVKK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFY
        LSL +KEVKK P S+KLSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF +    H  L+++I++Q+ EWLHP S   + DI IP++F +V HSYFGFY
Subjt:  LSLSRKEVKK-PVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFY

Query:  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLK
        A+ +WP+G P IP LIHRWLSP  LAYWYMY G +  SGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  F KLIEP +L++LK
Subjt:  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILLKLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLK

Query:  DNLQAGSLNLESAL-IETENIDFDNQSDSDEEASN
        ++L+  S +L++    E ++I+F + SD  ++  N
Subjt:  DNLQAGSLNLESAL-IETENIDFDNQSDSDEEASN

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-1223.35Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALM--SKPGDLSKHHLKQAEFIYHNLVRTGLELHKHI
        RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N++  +     +  D  K   K  E     LV   +  +  I
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPRLSLHNSLFRALM--SKPGDLSKHHLKQAEFIYHNLVRTGLELHKHI

Query:  YGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSI-LRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQL
              L      +D  RI S  +  +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M   
Subjt:  YGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSI-LRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQL

Query:  NT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFS
            S  +Y  ++  LCK   +  A++I+    ++ + P    Y  L++ + ++   D  +    + +   C PN    +I L SL K+G ++ AEE+  
Subjt:  NT-TSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIFS

Query:  QMETNGEIGVNARSCNIILSGYLLSGNYLKAEKI
        +M   G  G++  +CNII+ G   SG   KA +I
Subjt:  QMETNGEIGVNARSCNIILSGYLLSGNYLKAEKI

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-1321.02Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   ++V     ++ ++   V +W++++  +Q D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE A ++   M Q     P+   ++++N+    LM + G     + ++A  ++  + R   +     Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV

Query:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII
               L  EM+    K       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME YS+ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII

Query:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++ME NG    +  
Subjt:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR

Query:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y  +G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-1321.02Show/hide
Query:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   DD   ++V     ++ ++   V +W++++  +Q D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAHKPGTLIRLL-NAQRKWMRQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV
        E T+ +LI AY  A   G IE A ++   M Q     P+   ++++N+    LM + G     + ++A  ++  + R   +     Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTV

Query:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII
               L  EM+    K       +++ A ++ G   +AE  + +L+  DG  P   +VY   ME YS+ G P  A EIF  M+ +      A+Y  ++
Subjt:  DKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFDGSMPSQAFVYK--MEVYSKVGNPMKALEIFREMEQLN-TTSAATYQTII

Query:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR
            +      AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   + E+I ++ME NG    +  
Subjt:  GILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNAR

Query:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y  +G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLSGNYLKAEKIYDLMCQKKYDID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAACTAGCAAAGCCCAACCTTCAAAACAAACACCACCTTCCCATCGCATTCTTCCATCTCTTGCGCGACACTACGGCGAAACGGCTCGGCCACGACAAACGGCGAC
AAGCCTTTGCTTCCAGTCGCCAACAGTCCCTCTCACTCCCAACTCTGCATTCACTGTTCTGTCTCTCCGTAACCCTCCTTCGGTTTTCTCCATGTCCATTCGTACCTCTG
CCTTTGCCACTATCACCCTTCTCCCTTCTCTCACTTTTTCCCTCTCTCAATGCAATCACCACTTTCGTTGCCACAACTACATCATCCGTACTCTCTCTATCCCAACATAT
TCTGCAAAAGGACGACGACAACTTCCGCAAATTCCTGCCTTTGCTTCCAGTTCTTTCGTCGAGCCGTTGGTGTACGACCGGGATTCCCTGTCCGAGTATGAAGAGCGCTC
GTATTCCCCATACAGTAACGGGGCTGAGGGTTTTCATTTTGAAAATAGTTTTGCGTCGGCGGATTTCAAACACTTGGGAACGCCTGCACTTGAAGTGAAGGAGCTGGACG
AGTTGCCGGAGCAATGGCGTCGATCCAAATTGGCTTGGCTTTGCAAAGAATTGCCTGCGCATAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATG
AGGCAGGATGACGCGGCCTATCTCACCGTGCATTGTTTGCGTATTCGCGAAAACGAGACTGCTTTTAGGGTATACAAGTGGATGATGCAACAACATTGGTACCAATTTGA
TTATGCTTTAGCTACTAAGCTTGCTGATTACTTGGGAAAGGAACGGAAGTTCTCAAAGTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAAT
CCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTATCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCA
CGTCTTAGCTTGCACAATTCTCTCTTTAGGGCTCTCATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATACCACAATCTGGTAAGAAC
TGGACTCGAGTTACATAAACATATATATGGCGGTCTAATTTGGCTGCATAGTTATCAGGATACTGTAGACAAAGAAAGGATAGTGTCACTAAGAAAAGAAATGCAACAAG
CTGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTTAAGTATTTTGAT
GGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATTCAAAGGTGGGTAACCCGATGAAAGCTTTGGAGATATTTAGGGAGATGGAGCAGCTGAACACTAC
AAGTGCTGCAACATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGAGATAAAACTTGCAGAATCCATCATGGCAGGCTTCATAAAGAGTAATTTAAAACCCCTCA
TGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATT
TACAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGGCTGAAGAAATATTTAGTCAGATGGAAACAAATGGAGAAATTGGTGTAAATGCTCGTTCATG
CAACATCATTTTAAGTGGGTATCTGTTAAGTGGGAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTATGACATTGATTCTCGATTAATGGAGA
AACTTGATTATGTCCTCAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTAAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGCCTG
GAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCACAAAAAATGGAGCACCCACTCTCGTTTGAGGAGACACATATATGAGCAATATCACGA
GTGGTTACATCCTGCTTCAAAGTTGAGCGATAATGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCAC
GAGGCCATCCTGCAATCCCTAATCTAATTCACAGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTACATGTATGGAGGCTGCAGGATATGGTCAGGGGATATTTTACTG
AAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAAAAGTCCATGTATTGCAAGGTGAAACGGAAGGGCAGGGTGTATTGGATAGGTTTACT
TGGAAGCAACGCCACGTGGTTCTTGAAACTCATTGAACCTTTCATTCTGGATGACTTAAAAGATAATCTACAGGCAGGCAGCCTTAACTTGGAGAGTGCGTTAATTGAAA
CTGAAAATATCGATTTTGATAATCAATCTGATTCTGATGAGGAGGCTTCTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAACTAGCAAAGCCCAACCTTCAAAACAAACACCACCTTCCCATCGCATTCTTCCATCTCTTGCGCGACACTACGGCGAAACGGCTCGGCCACGACAAACGGCGAC
AAGCCTTTGCTTCCAGTCGCCAACAGTCCCTCTCACTCCCAACTCTGCATTCACTGTTCTGTCTCTCCGTAACCCTCCTTCGGTTTTCTCCATGTCCATTCGTACCTCTG
CCTTTGCCACTATCACCCTTCTCCCTTCTCTCACTTTTTCCCTCTCTCAATGCAATCACCACTTTCGTTGCCACAACTACATCATCCGTACTCTCTCTATCCCAACATAT
TCTGCAAAAGGACGACGACAACTTCCGCAAATTCCTGCCTTTGCTTCCAGTTCTTTCGTCGAGCCGTTGGTGTACGACCGGGATTCCCTGTCCGAGTATGAAGAGCGCTC
GTATTCCCCATACAGTAACGGGGCTGAGGGTTTTCATTTTGAAAATAGTTTTGCGTCGGCGGATTTCAAACACTTGGGAACGCCTGCACTTGAAGTGAAGGAGCTGGACG
AGTTGCCGGAGCAATGGCGTCGATCCAAATTGGCTTGGCTTTGCAAAGAATTGCCTGCGCATAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAGGAAATGGATG
AGGCAGGATGACGCGGCCTATCTCACCGTGCATTGTTTGCGTATTCGCGAAAACGAGACTGCTTTTAGGGTATACAAGTGGATGATGCAACAACATTGGTACCAATTTGA
TTATGCTTTAGCTACTAAGCTTGCTGATTACTTGGGAAAGGAACGGAAGTTCTCAAAGTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAAT
CCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTATCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCA
CGTCTTAGCTTGCACAATTCTCTCTTTAGGGCTCTCATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATACCACAATCTGGTAAGAAC
TGGACTCGAGTTACATAAACATATATATGGCGGTCTAATTTGGCTGCATAGTTATCAGGATACTGTAGACAAAGAAAGGATAGTGTCACTAAGAAAAGAAATGCAACAAG
CTGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAAAATGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTTAAGTATTTTGAT
GGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTCTATTCAAAGGTGGGTAACCCGATGAAAGCTTTGGAGATATTTAGGGAGATGGAGCAGCTGAACACTAC
AAGTGCTGCAACATATCAGACAATTATTGGGATTTTATGTAAATTTCAAGAGATAAAACTTGCAGAATCCATCATGGCAGGCTTCATAAAGAGTAATTTAAAACCCCTCA
TGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATT
TACAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTCAACAGGGCTGAAGAAATATTTAGTCAGATGGAAACAAATGGAGAAATTGGTGTAAATGCTCGTTCATG
CAACATCATTTTAAGTGGGTATCTGTTAAGTGGGAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTATGACATTGATTCTCGATTAATGGAGA
AACTTGATTATGTCCTCAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTAAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGCCTG
GAGATAGAATCTGATGAAGGGAGGAAGAATCATAGAATCCAATTTGAATTCCACAAAAAATGGAGCACCCACTCTCGTTTGAGGAGACACATATATGAGCAATATCACGA
GTGGTTACATCCTGCTTCAAAGTTGAGCGATAATGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCAC
GAGGCCATCCTGCAATCCCTAATCTAATTCACAGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTACATGTATGGAGGCTGCAGGATATGGTCAGGGGATATTTTACTG
AAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAAAAGTCCATGTATTGCAAGGTGAAACGGAAGGGCAGGGTGTATTGGATAGGTTTACT
TGGAAGCAACGCCACGTGGTTCTTGAAACTCATTGAACCTTTCATTCTGGATGACTTAAAAGATAATCTACAGGCAGGCAGCCTTAACTTGGAGAGTGCGTTAATTGAAA
CTGAAAATATCGATTTTGATAATCAATCTGATTCTGATGAGGAGGCTTCTAATTAATACAAGAATTTTAGTTGTTAGTCCCATGATCTGTTGGATTCTTTAATTCGCCAA
ATGATGAAATCCTTCAATGTTGCTTGGTTTTGGGAGGCTTTGTAAATCAAATAGGATCTTCCTTGACCAATTCGGAACTTGTAAATACCTAAATGGATATGTAACAAACT
TATGTTCTGATTGTGTGTATTGTTGTAGCA
Protein sequenceShow/hide protein sequence
MLTSKAQPSKQTPPSHRILPSLARHYGETARPRQTATSLCFQSPTVPLTPNSAFTVLSLRNPPSVFSMSIRTSAFATITLLPSLTFSLSQCNHHFRCHNYIIRTLSIPTY
SAKGRRQLPQIPAFASSSFVEPLVYDRDSLSEYEERSYSPYSNGAEGFHFENSFASADFKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWM
RQDDAAYLTVHCLRIRENETAFRVYKWMMQQHWYQFDYALATKLADYLGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASIIYNRMIQLGGYQP
RLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVRTGLELHKHIYGGLIWLHSYQDTVDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAERSWLKLKYFD
GSMPSQAFVYKMEVYSKVGNPMKALEIFREMEQLNTTSAATYQTIIGILCKFQEIKLAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTI
YSIYLDSLVKVGNLNRAEEIFSQMETNGEIGVNARSCNIILSGYLLSGNYLKAEKIYDLMCQKKYDIDSRLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGL
EIESDEGRKNHRIQFEFHKKWSTHSRLRRHIYEQYHEWLHPASKLSDNDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRIWSGDILL
KLKGSHEGVEKIVKSLREKSMYCKVKRKGRVYWIGLLGSNATWFLKLIEPFILDDLKDNLQAGSLNLESALIETENIDFDNQSDSDEEASN