; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004701 (gene) of Chayote v1 genome

Gene IDSed0004701
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:1829583..1834342
RNA-Seq ExpressionSed0004701
SyntenySed0004701
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607381.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.38Show/hide
Query:  LNSPSVFSMSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH
        LNS S  SMSIRTSAFAT  LLRSL +        F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYS  AE   
Subjt:  LNSPSVFSMSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH

Query:  ENAFASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYAL
           FASADLKH G+PALEVK+LDELPEQWRRSKLAWLCKELPA KPGTL+RLLN Q KW+KQDDA Y+ VHCLRIREN TAFRVYKWMMQQHWYRFDYAL
Subjt:  ENAFASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYAL

Query:  ATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFI
        ATKLADYMGKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCI+EA  IYNRMIQLGGYQPRL LHNSLF+AL+SKPG+LSKHHLKQAEFI
Subjt:  ATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFI

Query:  YHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGN
        YHN+ TTGLELHKDIYGGLIWLHSYQDT+DKERI S+R++MQQAGIEEEREVL+SILRASSK+GDV EAE+SW KLK+FDG+MPSQAFVYKMEVYAK+GN
Subjt:  YHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGN

Query:  PMKALEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVK
        PMKA EIFREMEQLN   AAAYQTII ILCK + V LAESVM  FIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVK
Subjt:  PMKALEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVK

Query:  VGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGL
        VGNL+RAEEIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL
Subjt:  VGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGL

Query:  KIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGG
        +IE++EG KNHR+ FEFH+  STH+RL+RHIYEQY EWLH ASKLSDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGG
Subjt:  KIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGG

Query:  CRLPSGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        CR+ SG+F+LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKDS+QA SLN+E+  NET NINFD QSDSDEEAS
Subjt:  CRLPSGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+0082.07Show/hide
Query:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +        F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYS  AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPA KPGTL+RLLN Q KW+KQDDA YL VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCI+E+ TIYNRMIQLGGYQPRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNL TTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIYGGLIWLHSYQDT+DKERI S+R++M QAGIEEEREVL+SILRASSK+GDV EAE+SW KLK+FDG+MPSQAFVYKMEVYAK+GNPMKA EIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  AAAYQTII ILCKF+ V LAESVM GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+RL+RHI+EQY EWLHPASKLSDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKDS+QA SLN+E+  NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0082.45Show/hide
Query:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +      + F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYSN AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPAHKPGTL+RLLN Q KW+KQDDA YL VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI+EA TIYNRMIQLGGY PRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNLVTTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIYGGLIWLHSYQDT+DKERI S+R++MQQAGIEEEREVL+SILRASSK+GDV EAE+SW K+K+FDG+MPSQAFVYKMEVYAK+GNPMKALEIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  +AAYQTII ILCKF+ V LAESVM+GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+ L+RH+YEQY EWLHPASKLSDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKDS+QA +LNLE+ +NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

XP_023521219.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0081.69Show/hide
Query:  MSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +S       F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYS  AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPAH PGTL+RLLN Q KW+KQDDA Y+ VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCI+EA  IYNRMIQLGGY+PRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNLVTTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIY GLIWLHSYQDT+DKERI S+R++MQQAGIEEEREVL+SILRASSK+GDV EAE+SW KLK+FDG+MPSQAFVYKMEVYAK+GNPMKA EIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  AAAYQTII ILCK + V LAESVM GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+RL+RHIYEQY EWLHPASK SDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKD +QA SLN+E+ +NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

XP_023525582.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0081.69Show/hide
Query:  MSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +S       F   NY + SL  PT S   RRQ   +PA +S S  E L+ DRDSP+ESEE L SPYS  AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPAHKPGTL+RLLN Q KW+KQDDA Y+ VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCI+EA  IYNRMIQLGGY+PRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNLVTTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIY GLIWLHSYQDT+DKERI S+R++MQQAGIEEEREVL+SILRASSK+GDV EAE+SW KLK+FDG+MPSQAFVYKMEVYAK+GNPMKA EIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  AAAYQTII ILCK + V LAESVM GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+RL+RHIYEQY EWLHPASK SDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSL EKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKD +QA SLN+E+ +NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0078.64Show/hide
Query:  VFSMSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH-ENAF
        VFSMSI TSAF+T   LRSL +S       F   N+ + +LF P  SV  RRQ P + A +SGSF + L+ D DSPSESEE L S +SN  +  H EN F
Subjt:  VFSMSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH-ENAF

Query:  ASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKL
        AS DLKH G+P LEVK+LDELPEQWRRSK+AWLCKELPA KPGT++RLLN Q KW+ QDDATYL VHCLRIREN TAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNL
        ADYMGKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI+EA TIYNRMIQLGGYQPRL LH+SLFRAL+SKPG+LSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKA
        VT+GLELHKD+YGGLIWLHSYQDTID+ERI S+R++MQQAGI+EEREVLLSILRASSKMGDV EAEK W++LK  DGNMPSQAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKA

Query:  LEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLNST AAAYQTII ILCKFQ +ELAES+M+GFI+SNLKPL PAYVDLMNMFFNL+L DK++LTFS CLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIET
        +RAEEIF++M+TN  IG+NARSCN+IL GYLL GN++KAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKE REILVGLLLGGL+IE+
Subjt:  NRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIET

Query:  EEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLP
        ++  KNHR+ FEFH+   TH+ L+RHIYEQY +WLH ASKL+D D+D+PY+FCT+SH+YFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR  
Subjt:  EEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLP

Query:  SGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        SG+ LLKLKGSHEGVEKIVKSLREKS+HCKVKRKG +YWIGL GSNATWFWKLIEPFIL+ LK+S QA SLNL   LN +ENINFD +SDS EE S
Subjt:  SGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0081.03Show/hide
Query:  VFSMSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH-ENAF
        VFSMSI TSAF+T  LLRSL +S       F   N+ + +LF  + SV + RQ P + A +SGSF + L+ DRDSPSESEE L SPYSN  +  H EN F
Subjt:  VFSMSIRTSAFATAALLRSLAVS------RFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH-ENAF

Query:  ASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKL
        AS DLKH G+PALEVK+LDELPEQWRRSKLAWLCKELPA KPGT++RLLN Q KW+ QDDATYLTVHCLRIREN TAFRVYKWMMQQHWYRFDYAL+TKL
Subjt:  ASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKL

Query:  ADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNL
        ADYMGKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI+EA TIYNRMIQLGGYQPRL LH+SLFRAL+SKPG+LSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNL

Query:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKA
        VT+GLELHKDIYGGLIWLHSYQDTIDKERI S+R++MQQAGI+EE+EVLLSILRASSKMGDV EAE+ W+KLK  DGNMP QAFVYKMEVYAK+G PMKA
Subjt:  VTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKA

Query:  LEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLNST AAAYQTII ILCKFQ +ELAES+M+GFI+SNLKPL PAYVD+MNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  NRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIET
        +RAEEIF++M+TN  IGVNARSCN+IL GYLL GN++KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKE REILVGLLLGGL+IE+
Subjt:  NRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIET

Query:  EEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLP
        +E  KNHR+ FEFH+   TH+ L+RHIYEQY +WLH ASKL+D DID+PY+FCT+SH+YFGFYADQFWPRG  TIPNLIHRWLSPRALAYWYMYGGCR  
Subjt:  EEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLP

Query:  SGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        SG+ LLKLKGSHEGVEKIVKSLREKSMHCKVKRKG +YWIGL GSNATWFWKLIEPFIL+DLK+S QA SLNL   LNETENINFD QSDS EE S
Subjt:  SGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0082.07Show/hide
Query:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +        F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYS  AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPA KPGTL+RLLN Q KW+KQDDA YL VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCI+E+ TIYNRMIQLGGYQPRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNL TTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIYGGLIWLHSYQDT+DKERI S+R++M QAGIEEEREVL+SILRASSK+GDV EAE+SW KLK+FDG+MPSQAFVYKMEVYAK+GNPMKA EIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  AAAYQTII ILCKF+ V LAESVM GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+RL+RHI+EQY EWLHPASKLSDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKDS+QA SLN+E+  NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0082.45Show/hide
Query:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD
        MSIRTSAFAT  LLRSL +      + F   NY + SL  PT S   RRQ P +PA +S S  E L+ DRDSP+ESEE L SPYSN AE      FASAD
Subjt:  MSIRTSAFATAALLRSLAV------SRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASAD

Query:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM
        LKH G+PALEVK+LDELPEQWRRSKLAWLCKELPAHKPGTL+RLLN Q KW+KQDDA YL VHCLRIREN TAFRVYKWMMQQHWYRFDYALATKLADYM
Subjt:  LKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYM

Query:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG
        GKERKF+KCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI+EA TIYNRMIQLGGY PRL LHNSLF+AL+SKPG+LSKHHLKQAEFIYHNLVTTG
Subjt:  GKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF
        LELHKDIYGGLIWLHSYQDT+DKERI S+R++MQQAGIEEEREVL+SILRASSK+GDV EAE+SW K+K+FDG+MPSQAFVYKMEVYAK+GNPMKALEIF
Subjt:  LELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIF

Query:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE
        REMEQLNS  +AAYQTII ILCKF+ V LAESVM+GFIKSNLKPL PAYVDLMNMFFNLSLHDK++LTFS CLEKCKPNRTIYSIYL+SLVKVGNL+RAE
Subjt:  REMEQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAE

Query:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT
        EIF++MQTN  IGV+ARSCN+ILSGYLLSG++LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKE REILVGLLLGGL+IE++EG 
Subjt:  EIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGT

Query:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF
        KNHR+ FEFH+  STH+ L+RH+YEQY EWLHPASKLSDSD D+PY+FCT+SH+YFGFYADQFWPRGHP IPNLIHRWLSPR LAYWYMYGGCR+ SG+F
Subjt:  KNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEF

Query:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS
        +LKLKGS EGV KIVKSLREKSM CKVKRKGRVYWIGL GSNATWFWKLIEPFIL+DLKDS+QA +LNLE+ +NET NINFD QSDSDEEAS
Subjt:  LLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEAS

A0A6P3ZHH7 pentatricopeptide repeat-containing protein At2g15820, chloroplastic7.3e-30765.52Show/hide
Query:  NLAFTPNSAFTFLCNLNSPSVFSMSIRTSAFATAALLRS--LAVSRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLY
        +LA  PNS+ +FL      +  S+S+R+S+F   +LLRS  L++S     +     +FTP  S    + F      SSG+F E L        E+    +
Subjt:  NLAFTPNSAFTFLCNLNSPSVFSMSIRTSAFATAALLRS--LAVSRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLY

Query:  SPYSNAAEISHENAFASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMM
        S         +E +FAS DLKH  SP LEVK+L+ELPEQWRRSKLAWLCKELPAHKP TL+R+LN Q KW++Q+DATY+ VHC+RIREN   FRVYKWMM
Subjt:  SPYSNAAEISHENAFASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMM

Query:  QQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGEL
        QQHWYRFD+ALATKLADYMGKERKF+KCRE+FDDIINQG VPSESTFHIL+VAYLS PVQGC++EAC+IYNRMIQLGGYQPRL LHNSLFR++I KPG  
Subjt:  QQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGEL

Query:  SKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFV
        SK +LKQAEFI+HNL TTGLE+HKDIY GLIWLHS+QDT+DKER+ ++R  MQQAGIEE REVL+S+LRA SK GDV EAEK+W KL   D   PSQAFV
Subjt:  SKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFV

Query:  YKMEVYAKLGNPMKALEIFREMEQ-LNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNR
        Y+MEV+AK GN  K+LEIFR+M++ LNST   AY  +I ILC+ Q VELAESVM  F+ S LKPLMP+YVDLM+M+F+L LHDKV+L F  CL+KC+PNR
Subjt:  YKMEVYAKLGNPMKALEIFREMEQ-LNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNR

Query:  TIYSIYLDSLVKVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEG
        TIY+IYLDSLVK  NL +AEEIF +MQ +  IGV+ARSCN+ILSGYL SG+++KAEKIYDLMCQK+YDI+  LMEK+DYVLSLSRK VKKP+SLKLSKE 
Subjt:  TIYSIYLDSLVKVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEG

Query:  REILVGLLLGGLKIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLS
        REILVGLLLGGLKIE++E  KNH L FEF++ S  H+ LKRHI++QY EWLHP+ K +D+  D+P RF TISH+YFGFYADQFWP+G  TIP LIHRWLS
Subjt:  REILVGLLLGGLKIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLS

Query:  PRALAYWYMYGGCRLPSGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENIN
        PR LAYWYMYGG R  SG+ LLKLKG+ E VEKIVK+L+ +S++C+VK+KGRV+WIG  G+N+TWFWKL EP+I++DLKDS++ G   +     ETENI+
Subjt:  PRALAYWYMYGGCRLPSGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLNETENIN

Query:  FDGQSDSDEEAS
        F+  SDSDE+AS
Subjt:  FDGQSDSDEEAS

SwissProt top hitse value%identityAlignment
O04491 Putative pentatricopeptide repeat-containing protein At1g096808.5e-1023.51Show/hide
Query:  KWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISK
        K  M++   R D    + L + + KE K      +FD++  +G +P++  F  LI  +      G ID     Y +M+   G QP + L+N+L       
Subjt:  KWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISK

Query:  PGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPS
         G      L  A  I   ++  GL   K  Y  LI    +    D E    IR++M Q GIE +R    +++    K G V +AE++ R++         
Subjt:  PGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPS

Query:  QAFVYKMEVYAKLGNPMKALEIFREMEQLNST-RAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP
          +   M+ + K G+     ++ +EM+          Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  QAFVYKMEVYAKLGNPMKALEIFREMEQLNST-RAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171403.5e-1121.49Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHL---KQAEFIYHNLVTTGLELHKD
        RE+FD++  +GC P+E TF IL+  Y  A   G  D+   + N M +  G  P   ++N++  +   +        +    + E +  ++VT    +   
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHL---KQAEFIYHNLVTTGLELHKD

Query:  IYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSI-LRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ
           G +        +D  RI S     +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M  
Subjt:  IYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSI-LRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ

Query:  LN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIF
                +Y  ++  LCK   +  A++++    ++ + P    Y  L++ + ++   D  K    + +   C PN    +I L SL K+G ++ AEE+ 
Subjt:  LN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIF

Query:  AEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKI
         +M   +  G++  +CN+I+ G   SG   KA +I
Subjt:  AEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKI

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic1.5e-22752.02Show/hide
Query:  PPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASADLKH-FGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNK
        P +PA +S      L  D D   E EE  +  +          A+A+AD +    SP L V +L+ELPEQWRRS++AWLCKELPA+K  T  R+LN Q K
Subjt:  PPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISHENAFASADLKH-FGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNK

Query:  WLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTI
        W+ QDDATY+ VHCLRIR N  AFRVY WM++QHW+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C++EACTI
Subjt:  WLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTI

Query:  YNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILR
        YN+MIQ+GGY+PRL LHNSLFRAL+SK G  +K++LKQAEF+YHN+VTT L++HKD+Y GLIWLHSYQD ID+ERI ++R++M+QAG +E  +VL+S++R
Subjt:  YNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILR

Query:  ASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQLN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAY
        A SK G+VAE E +W  +     ++P QA+V +ME YA+ G PMK+L++F+EM+  N     A+Y  II I+ K   V++ E +M+ FI+S++K LMPA+
Subjt:  ASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQLN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAY

Query:  VDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDI
        +DLM M+ +L +H+K++LTF  C+ +C+PNR +Y+IYL+SLVKVGN+ +AEE+F EM  N +IG N +SCN++L GYL + ++ KAEK+YD+M +KKYD+
Subjt:  VDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDI

Query:  DPPLMEKLDYVLSLSRKEVK-KPVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRF
            +EKL   L L++K +K K VS+KL +E REIL+GLLLGG ++E+      H + F+F + S+ H+ L+ HI+E++ EWL  AS+  D    +PY+F
Subjt:  DPPLMEKLDYVLSLSRKEVK-KPVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRF

Query:  CTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFW
         TI H +F F+ DQF+ +G P +P LIHRWL+PR LAYW+M+GG +LPSG+ +LKL G + EGVE+IV SL  +S+  KVKRKGR +WIG  GSNA  FW
Subjt:  CTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFW

Query:  KLIEPFILEDLKDSV-QAGSLNLERGLNETENINFDGQSDSDEE
        ++IEP +L +    V Q GS     G  +T+  + D    SD E
Subjt:  KLIEPFILEDLKDSV-QAGSLNLERGLNETENINFDGQSDSDEE

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011103.8e-1021.49Show/hide
Query:  KERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKH-HLKQAEFIYHNLVTTG
        K+ K  K       +  +G  P   T++ LI AY S   +G ++EA  + N M    G+ P +  +N++          L KH   ++A+ ++  ++ +G
Subjt:  KERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKH-HLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQ-DTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEI
        L      Y  L+     + D ++ E++ S   DM+   +  +     S++   ++ G++ +A   +  +K       +  +   ++ Y + G    A+ +
Subjt:  LELHKDIYGGLIWLHSYQ-DTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEI

Query:  FREM-EQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP-------LMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLV
          EM +Q  +     Y TI+  LCK + +  A+ + +   +  L P       L+  +  L N+   + L  K+K       ++ + +   Y+  LD   
Subjt:  FREM-EQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP-------LMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLV

Query:  KVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLM
        KVG+++ A+EI+A+M + E++     S +++++     G+  +A +++D M  K  +I P +M
Subjt:  KVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLM

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic4.7e-25059.27Show/hide
Query:  EVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKC
        EV++L+ELPE+WRRSKLAWLCKE+P HK  TL+RLLN Q KW++Q+DATY++VHC+RIREN T FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF KC
Subjt:  EVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKC

Query:  REVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIY
        REVFDD++NQG VPSESTFHIL+VAYLS+  V+GC++EAC++YNRMIQLGGY+PRL LHNSLFRAL+SK G +    LKQAEFI+HN+VTTGLE+ KDIY
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIY

Query:  GGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ-LN
         GLIWLHS QD +D  RI S+R +M++AG +E +EV++S+LRA +K G V E E++W +L + D  +PSQAFVYK+E Y+K+G+  KA+EIFREME+ + 
Subjt:  GGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ-LN

Query:  STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQ
            + Y  II +LCK Q VEL E++M  F +S  KPL+P+++++  M+F+L LH+K+++ F  CLEKC+P++ IY+IYLDSL K+GNL +A ++F EM+
Subjt:  STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQ

Query:  TNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLL
         N  I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEVKK P S+KLSK+ RE+LVGLLLGGL+IE+++  K+H + 
Subjt:  TNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLL

Query:  FEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKG
        FEF + S  H  LK++I++Q+REWLHP S   + DI +P+ F ++ H+YFGFYA+ +WP+G P IP LIHRWLSP +LAYWYMY G +  SG+ +L+LKG
Subjt:  FEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKG

Query:  SHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLN-ETENINFDGQSDSDEE
        S EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +LE+LK+ ++  S +L+     E ++INF   SD  ++
Subjt:  SHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLN-ETENINFDGQSDSDEE

Arabidopsis top hitse value%identityAlignment
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein6.0e-1123.51Show/hide
Query:  KWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISK
        K  M++   R D    + L + + KE K      +FD++  +G +P++  F  LI  +      G ID     Y +M+   G QP + L+N+L       
Subjt:  KWMMQQHWYRFDYALATKLADYMGKERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISK

Query:  PGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPS
         G      L  A  I   ++  GL   K  Y  LI    +    D E    IR++M Q GIE +R    +++    K G V +AE++ R++         
Subjt:  PGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPS

Query:  QAFVYKMEVYAKLGNPMKALEIFREMEQLNST-RAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP
          +   M+ + K G+     ++ +EM+          Y  ++  LCK   ++ A+ ++   +   + P
Subjt:  QAFVYKMEVYAKLGNPMKALEIFREMEQLNST-RAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP

AT1G62670.1 rna processing factor 21.5e-0922.35Show/hide
Query:  KFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELH
        K ++   + D ++ +GC P   T+ +++        +G  D A  + N+M Q G  +P + ++N++        G     H+  A  ++  + T G+  +
Subjt:  KFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELH

Query:  KDIYGGLI-WLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREM
           Y  LI  L +Y    D  R+ S   DM +  I  +     +++ A  K G + EAEK +         M  ++    +  Y+ L N     +   E 
Subjt:  KDIYGGLI-WLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREM

Query:  EQLNSTRAA--------AYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVG
        +Q+     +         Y T+I+  CK++ VE    V     +  L      Y  L+   F     D  +  F + + +   PN   Y+  LD L K G
Subjt:  EQLNSTRAA--------AYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVG

Query:  NLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMC
         L +A  +F  +Q ++ +     + N+++ G   +G   K E  +DL C
Subjt:  NLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMC

AT2G15820.1 endonucleases3.3e-25159.27Show/hide
Query:  EVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKC
        EV++L+ELPE+WRRSKLAWLCKE+P HK  TL+RLLN Q KW++Q+DATY++VHC+RIREN T FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF KC
Subjt:  EVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFAKC

Query:  REVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIY
        REVFDD++NQG VPSESTFHIL+VAYLS+  V+GC++EAC++YNRMIQLGGY+PRL LHNSLFRAL+SK G +    LKQAEFI+HN+VTTGLE+ KDIY
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIY

Query:  GGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ-LN
         GLIWLHS QD +D  RI S+R +M++AG +E +EV++S+LRA +K G V E E++W +L + D  +PSQAFVYK+E Y+K+G+  KA+EIFREME+ + 
Subjt:  GGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ-LN

Query:  STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQ
            + Y  II +LCK Q VEL E++M  F +S  KPL+P+++++  M+F+L LH+K+++ F  CLEKC+P++ IY+IYLDSL K+GNL +A ++F EM+
Subjt:  STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQ

Query:  TNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLL
         N  I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEVKK P S+KLSK+ RE+LVGLLLGGL+IE+++  K+H + 
Subjt:  TNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLL

Query:  FEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKG
        FEF + S  H  LK++I++Q+REWLHP S   + DI +P+ F ++ H+YFGFYA+ +WP+G P IP LIHRWLSP +LAYWYMY G +  SG+ +L+LKG
Subjt:  FEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDIDVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKG

Query:  SHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLN-ETENINFDGQSDSDEE
        S EGVEK+VK+L+ KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +LE+LK+ ++  S +L+     E ++INF   SD  ++
Subjt:  SHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEPFILEDLKDSVQAGSLNLERGLN-ETENINFDGQSDSDEE

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-1221.49Show/hide
Query:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHL---KQAEFIYHNLVTTGLELHKD
        RE+FD++  +GC P+E TF IL+  Y  A   G  D+   + N M +  G  P   ++N++  +   +        +    + E +  ++VT    +   
Subjt:  REVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHL---KQAEFIYHNLVTTGLELHKD

Query:  IYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSI-LRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ
           G +        +D  RI S     +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++   + G  ++A  + ++M  
Subjt:  IYGGLIWLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSI-LRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQ

Query:  LN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIF
                +Y  ++  LCK   +  A++++    ++ + P    Y  L++ + ++   D  K    + +   C PN    +I L SL K+G ++ AEE+ 
Subjt:  LN-STRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCL-EKCKPNRTIYSIYLDSLVKVGNLNRAEEIF

Query:  AEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKI
         +M   +  G++  +CN+I+ G   SG   KA +I
Subjt:  AEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKI

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-1121.49Show/hide
Query:  KERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKH-HLKQAEFIYHNLVTTG
        K+ K  K       +  +G  P   T++ LI AY S   +G ++EA  + N M    G+ P +  +N++          L KH   ++A+ ++  ++ +G
Subjt:  KERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKH-HLKQAEFIYHNLVTTG

Query:  LELHKDIYGGLIWLHSYQ-DTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEI
        L      Y  L+     + D ++ E++ S   DM+   +  +     S++   ++ G++ +A   +  +K       +  +   ++ Y + G    A+ +
Subjt:  LELHKDIYGGLIWLHSYQ-DTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEI

Query:  FREM-EQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP-------LMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLV
          EM +Q  +     Y TI+  LCK + +  A+ + +   +  L P       L+  +  L N+   + L  K+K       ++ + +   Y+  LD   
Subjt:  FREM-EQLNSTRAAAYQTIIRILCKFQAVELAESVMSGFIKSNLKP-------LMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLV

Query:  KVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLM
        KVG+++ A+EI+A+M + E++     S +++++     G+  +A +++D M  K  +I P +M
Subjt:  KVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNHLKAEKIYDLMCQKKYDIDPPLM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCTCGCATTCACTCCCAATTCCGCGTTCACTTTTCTGTGTAATCTCAACTCCCCTTCGGTTTTCTCCATGTCCATCCGCACCTCTGCTTTCGCCACTGCCGCCCT
TCTCCGGTCTCTCGCCGTTTCCCGCTTCTCCCGCGGCAACTACACTCTCACTTCCCTCTTTACCCCAACCTGTTCCGTACATCGACGGCGGCAATTTCCGCCGGTTCCCG
CCTGTTCTTCCGGTTCTTTCGGTGAACCGTTGCTGTGTGATCGGGATTCTCCGTCCGAGTCTGAAGAGGTATTGTATTCTCCGTACAGTAATGCGGCTGAGATTTCTCAT
GAAAATGCTTTTGCGTCGGCGGATTTGAAACACTTCGGTTCGCCGGCTCTTGAAGTTAAGGACCTGGATGAGTTGCCGGAGCAATGGCGGAGATCGAAACTGGCTTGGCT
TTGTAAAGAATTGCCGGCGCATAAGCCGGGGACGTTGATGCGGCTGCTTAATGGTCAGAACAAATGGCTGAAGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGC
GTATTCGAGAAAATGTAACTGCTTTTAGGGTATACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCAACTAAGCTTGCTGATTACATGGGCAAG
GAACGGAAGTTCGCGAAGTGTCGGGAGGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACC
TGTTCAAGGATGCATAGACGAAGCATGTACCATTTACAACCGTATGATTCAGTTAGGAGGTTACCAACCGCGTCTCGGCTTGCACAATTCTCTCTTTAGAGCTCTCATAA
GCAAACCAGGGGAGTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCATAATCTGGTAACAACTGGACTTGAGTTACATAAAGATATTTATGGTGGTCTAATT
TGGCTACATAGTTATCAGGATACTATAGACAAAGAAAGGATAGCATCGATAAGGAGAGATATGCAACAAGCAGGAATTGAGGAGGAAAGAGAAGTCCTTTTGTCCATCTT
GAGAGCCAGCTCAAAGATGGGGGATGTGGCTGAAGCAGAAAAATCGTGGCGTAAACTTAAGAATTTTGATGGTAACATGCCCTCTCAAGCTTTTGTTTACAAAATGGAAG
TCTATGCCAAGCTCGGTAACCCGATGAAAGCTTTAGAGATATTTAGGGAGATGGAGCAGTTGAACTCTACAAGGGCTGCAGCATATCAGACAATTATTAGGATTTTATGT
AAATTTCAAGCGGTAGAACTTGCAGAATCCGTCATGTCAGGCTTCATAAAGAGTAATTTAAAGCCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTT
AAGCTTACATGATAAGGTAAAGTTAACCTTCTCTGATTGCCTTGAGAAGTGTAAGCCGAATCGTACTATTTACAGCATCTATTTGGATTCTTTGGTAAAAGTTGGTAATC
TCAACAGGGCTGAAGAAATATTTGCTGAGATGCAAACAAATGAAGTAATTGGTGTAAATGCTCGTTCATGCAACGTTATCTTAAGTGGGTATCTGTTAAGTGGAAATCAT
TTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTACGACATCGATCCTCCATTAATGGAGAAACTTGATTATGTTCTAAGCTTGAGTAGGAAGGAGGTTAA
GAAGCCAGTAAGCTTGAAGTTGAGTAAAGAAGGAAGAGAGATTTTAGTAGGGTTATTGTTAGGTGGCCTGAAGATAGAAACTGAAGAAGGGACAAAGAATCATAGACTCC
TATTTGAATTCCACCAAAAAAGCAGCACTCACACTCGTTTGAAGAGACACATATACGAGCAATATCGCGAGTGGTTACATCCTGCGTCAAAGTTGAGCGATAGCGATATC
GACGTGCCATATAGATTTTGCACCATTTCACACACATATTTTGGTTTCTACGCAGATCAGTTTTGGCCACGAGGTCATCCAACAATTCCTAATCTTATTCACAGGTGGCT
TTCGCCTCGTGCTCTTGCGTACTGGTATATGTACGGGGGCTGCAGGTTACCATCCGGGGAATTTTTACTGAAGCTGAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTA
AATCTCTGAGAGAAAAGTCTATGCATTGCAAGGTGAAAAGGAAGGGCAGGGTGTATTGGATAGGTTTATTTGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAACCT
TTCATTCTGGAAGACTTGAAAGATAGTGTACAGGCAGGCAGCCTTAATTTGGAGAGGGGTTTAAATGAAACTGAAAATATCAACTTTGATGGTCAATCTGATTCTGATGA
GGAGGCTTCTAAATAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAATTGTTTGTCATTTGCAAATTATTTGGGTCGGGCTTAATTAGATAAAGAAAACCTAGCTCTCTCTTCATACATCAGCCGGTCGCCGCCATTCTCCACTCCCG
AAATCACCGCCGGCAGCCATTGAAGTTCGCCTCTCTCCTCGCAGAGCTTGCCGTCTCTTCGCCGGCGACGGCGCGAAATTAGGGTTCAGGTTTTCAGTTCATCAGCCGCT
TTGTAATTCGCACGTCTTCCAAATTCGCCATGAATGTCAAAATCAGGTTATCTTCTCGGTGAAGATGAACCTCGCATTCACTCCCAATTCCGCGTTCACTTTTCTGTGTA
ATCTCAACTCCCCTTCGGTTTTCTCCATGTCCATCCGCACCTCTGCTTTCGCCACTGCCGCCCTTCTCCGGTCTCTCGCCGTTTCCCGCTTCTCCCGCGGCAACTACACT
CTCACTTCCCTCTTTACCCCAACCTGTTCCGTACATCGACGGCGGCAATTTCCGCCGGTTCCCGCCTGTTCTTCCGGTTCTTTCGGTGAACCGTTGCTGTGTGATCGGGA
TTCTCCGTCCGAGTCTGAAGAGGTATTGTATTCTCCGTACAGTAATGCGGCTGAGATTTCTCATGAAAATGCTTTTGCGTCGGCGGATTTGAAACACTTCGGTTCGCCGG
CTCTTGAAGTTAAGGACCTGGATGAGTTGCCGGAGCAATGGCGGAGATCGAAACTGGCTTGGCTTTGTAAAGAATTGCCGGCGCATAAGCCGGGGACGTTGATGCGGCTG
CTTAATGGTCAGAACAAATGGCTGAAGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATTCGAGAAAATGTAACTGCTTTTAGGGTATACAAGTGGATGAT
GCAACAACATTGGTACCGATTTGATTATGCTTTAGCAACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCGCGAAGTGTCGGGAGGTATTTGATGATATAATTA
ATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGACGAAGCATGTACCATTTACAACCGTATG
ATTCAGTTAGGAGGTTACCAACCGCGTCTCGGCTTGCACAATTCTCTCTTTAGAGCTCTCATAAGCAAACCAGGGGAGTTGTCAAAGCATCATCTTAAACAGGCTGAGTT
TATATATCATAATCTGGTAACAACTGGACTTGAGTTACATAAAGATATTTATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATAGACAAAGAAAGGATAGCAT
CGATAAGGAGAGATATGCAACAAGCAGGAATTGAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCCAGCTCAAAGATGGGGGATGTGGCTGAAGCAGAAAAATCG
TGGCGTAAACTTAAGAATTTTGATGGTAACATGCCCTCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCCAAGCTCGGTAACCCGATGAAAGCTTTAGAGATATTTAG
GGAGATGGAGCAGTTGAACTCTACAAGGGCTGCAGCATATCAGACAATTATTAGGATTTTATGTAAATTTCAAGCGGTAGAACTTGCAGAATCCGTCATGTCAGGCTTCA
TAAAGAGTAATTTAAAGCCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGGTAAAGTTAACCTTCTCTGATTGCCTTGAG
AAGTGTAAGCCGAATCGTACTATTTACAGCATCTATTTGGATTCTTTGGTAAAAGTTGGTAATCTCAACAGGGCTGAAGAAATATTTGCTGAGATGCAAACAAATGAAGT
AATTGGTGTAAATGCTCGTTCATGCAACGTTATCTTAAGTGGGTATCTGTTAAGTGGAAATCATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTACG
ACATCGATCCTCCATTAATGGAGAAACTTGATTATGTTCTAAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAAGGAAGAGAGATTTTA
GTAGGGTTATTGTTAGGTGGCCTGAAGATAGAAACTGAAGAAGGGACAAAGAATCATAGACTCCTATTTGAATTCCACCAAAAAAGCAGCACTCACACTCGTTTGAAGAG
ACACATATACGAGCAATATCGCGAGTGGTTACATCCTGCGTCAAAGTTGAGCGATAGCGATATCGACGTGCCATATAGATTTTGCACCATTTCACACACATATTTTGGTT
TCTACGCAGATCAGTTTTGGCCACGAGGTCATCCAACAATTCCTAATCTTATTCACAGGTGGCTTTCGCCTCGTGCTCTTGCGTACTGGTATATGTACGGGGGCTGCAGG
TTACCATCCGGGGAATTTTTACTGAAGCTGAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAAAAGTCTATGCATTGCAAGGTGAAAAGGAAGGG
CAGGGTGTATTGGATAGGTTTATTTGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGAAGACTTGAAAGATAGTGTACAGGCAGGCAGCCTTA
ATTTGGAGAGGGGTTTAAATGAAACTGAAAATATCAACTTTGATGGTCAATCTGATTCTGATGAGGAGGCTTCTAAATAATACAAGAATATTTGCTCTGCTCCAACATTA
AATATCTTCCATATGAATCTTTTGTGGGAAGCAAAGTATCAAAGAAGGTTTAAATTTTCCTTTAGCCCCCTTGTGTACAGAAGCTTAAATGTGGAGGGTAAACTTCGAAA
GAGCAACTCGATGGGCGGCTTCTCTTCCCCTCGACTTGCTGTTTTGTTTCCATGATGGGTAAACTCCAAATCATGTCTTCATCAATCATCCATTTGTGGCCAGAGGGTAG
TTCTTTAAACCTTTGCTGTTTTGTTAGCTTTACTATGATGACTTGTTTGATTAGGAAGAAACCATATTGTTTTGTTCCATGCAGGCGCAGTGGTTGAAAACTTGGGTTTT
GATGATATGCTCCTCTCAAGGTCCCAGTTCAAAATTCACCTATGACACTACTCTTTCGATGTCTTTGGTGCCCCGTCTAGGGACGAGCGTGGTTACATTATTTCAAAAAA
AAAAAAGTTTCTTATTTAAAAAACTTAGGGGAGAAATTGTTCCCAAGGTGCTCTTGAGTATATAGTGGAATGAAGCTCTTATTCTTGGTTATTAAAAATAGAAACTATTT
ATTTAAACGGTTGAAGCACGATAAAATATTTTGATCTATGAATGAGGAAGTTGTGTTATTTAAGGTCGAACCATTCGAAGCTCAAAAGTTGCGAACTCGAGTAATGGTTG
TACAAAATCTGTCACAAGCTTTGCTGGTTCATAAATTAACTTTTGGGGAG
Protein sequenceShow/hide protein sequence
MNLAFTPNSAFTFLCNLNSPSVFSMSIRTSAFATAALLRSLAVSRFSRGNYTLTSLFTPTCSVHRRRQFPPVPACSSGSFGEPLLCDRDSPSESEEVLYSPYSNAAEISH
ENAFASADLKHFGSPALEVKDLDELPEQWRRSKLAWLCKELPAHKPGTLMRLLNGQNKWLKQDDATYLTVHCLRIRENVTAFRVYKWMMQQHWYRFDYALATKLADYMGK
ERKFAKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIDEACTIYNRMIQLGGYQPRLGLHNSLFRALISKPGELSKHHLKQAEFIYHNLVTTGLELHKDIYGGLI
WLHSYQDTIDKERIASIRRDMQQAGIEEEREVLLSILRASSKMGDVAEAEKSWRKLKNFDGNMPSQAFVYKMEVYAKLGNPMKALEIFREMEQLNSTRAAAYQTIIRILC
KFQAVELAESVMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKVKLTFSDCLEKCKPNRTIYSIYLDSLVKVGNLNRAEEIFAEMQTNEVIGVNARSCNVILSGYLLSGNH
LKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEGREILVGLLLGGLKIETEEGTKNHRLLFEFHQKSSTHTRLKRHIYEQYREWLHPASKLSDSDI
DVPYRFCTISHTYFGFYADQFWPRGHPTIPNLIHRWLSPRALAYWYMYGGCRLPSGEFLLKLKGSHEGVEKIVKSLREKSMHCKVKRKGRVYWIGLFGSNATWFWKLIEP
FILEDLKDSVQAGSLNLERGLNETENINFDGQSDSDEEASK