; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016906 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016906
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr04:22846170..22850225
RNA-Seq ExpressionPay0016906
SyntenyPay0016906
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152074.2 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus]0.0e+0094.24Show/hide
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG
        MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRAFASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENG
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG

Query:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK
        FASVDLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK
Subjt:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK

Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHN
        LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRAL+SKPGDLSKHHLKQAEFIYHN
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHN

Query:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMK
        LVTSGLELHKD+YGGLIWLHSYQDTID+ERIVSLRKEMQQAGIKEE+EVLLSILRASSKMGDV+EAE+LWQ+LKYLDGNMP QAFVYKMEVYAKMGKPMK
Subjt:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMK

Query:  ALEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
        ALEIFREMEQLNST AAAYQTIIGILCKFQ IELAESIMAGFIESNLKPLTPAYVD+MNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
Subjt:  ALEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN

Query:  LDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE
        LDRAEEIFSQMETNGEIG+NARSCN+IL GYLL GNYMKAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE
Subjt:  LDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE

Query:  SDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRT
        SD+ERKNHRIQFEFH+NCKTHSVLRRHIYEQYHKWLHSASKLTDGD+DIPYKFCTVSHSYFGFYADQFWPRGR+ IPNLIHRWLSPR LAYWYMYGGCRT
Subjt:  SDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRT

Query:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
        SSGDILLKLKGSHEGVEKIVKSLREKS+HCKVKRKG+MYWIGLLGSNATWFWKLIEPFILD LKESTQADSLNL GVLN +ENINFDS+SDSVEETSN
Subjt:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

XP_008465080.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo]0.0e+0099.87Show/hide
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
        MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF

Query:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL
        ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL
Subjt:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA
        VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA
Subjt:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA

Query:  LEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLNST AAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES
        DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES
Subjt:  DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES

Query:  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS
        DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS
Subjt:  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN
        SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+0084.63Show/hide
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYS G +      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNL T+
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEM QAGI+EE+EVL+SILRASSK+GDV+EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA EI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI

Query:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA
        FREMEQLNS +AAAYQTIIGILCKF+E+ LAES+M GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDRA
Subjt:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA

Query:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE
        EEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE 
Subjt:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE

Query:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD
        RKNHRIQFEFH++C THS LRRHI+EQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSGD
Subjt:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
         +LKLKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QADSLN+    NET NINFDSQSDS EE S+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0084.76Show/hide
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  H++F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYSNG +       FAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEMQQAGI+EE+EVL+SILRASSK+GDV+EAER W K+K  DG+MP QAFVYKMEVYAK+G PMKALEI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI

Query:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA
        FREMEQLNS ++AAYQTIIGILCKF+E+ LAES+MAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDRA
Subjt:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA

Query:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE
        EEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE 
Subjt:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE

Query:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD
        RKNHRIQFEFH++C THS LRRH+YEQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSGD
Subjt:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
         +LKLKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QAD+LNL   +NET NINFDSQSDS EE S+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

XP_038887990.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida]0.0e+0089.17Show/hide
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAFS+VTLLRS +LSLSPYHHYF  PNHI+ T+FI  YSVK  +QLPRI +FAS S V+QLVYDRDS  ESEEHLSSPYSNG D      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY
        DLKHL  PALEVKELDELP+QWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDATYLTVHCLRIRENETAFRVYKWMMQQ WYRFDYAL+TKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRAL SKPGDLSKHHLKQAEFIYHNLVTS
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI
        GLE+HKDI GGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEE+EVLLSILRASSKMG+V+EAER WQKLK  DGNMP QAFVYKMEVYAKMGKPMKALEI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI

Query:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA
        FREMEQLNS  AAAY+TIIGILCKFQ+IELAESIM GFI+SNLKPL PAYVD+MNMFFNLSLH+KLEL FSQCLEKCKP+RTIYSIYLDSLVKVGNLDRA
Subjt:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA

Query:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE
        EEIFSQMETNGEIG+NARSCN+IL GYLLFGNY+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREILVGLLLGG+EIESDEE
Subjt:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE

Query:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD
        RKNHRIQFEF +NC THS+LRRHIYEQYH+WLHSASKL DGDIDIPYKFCTVSHSYFGFYADQFWP+G   IPNLIHRWLSPR LAYWYMYGGCRTSSGD
Subjt:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLG-VLNETENINFDSQSDSVEETSN
        ILLKLKGS EGVEKIVKSLREKSM CKVKRKGSMYWIGLLG+NATWFWKL+EPFILD LK+S +ADS NLG VLNETENINFDSQSDSVEE SN
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLG-VLNETENINFDSQSDSVEETSN

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0094.24Show/hide
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG
        MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRAFASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENG
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG

Query:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK
        FASVDLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK
Subjt:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTK

Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHN
        LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRAL+SKPGDLSKHHLKQAEFIYHN
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHN

Query:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMK
        LVTSGLELHKD+YGGLIWLHSYQDTID+ERIVSLRKEMQQAGIKEE+EVLLSILRASSKMGDV+EAE+LWQ+LKYLDGNMP QAFVYKMEVYAKMGKPMK
Subjt:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMK

Query:  ALEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
        ALEIFREMEQLNST AAAYQTIIGILCKFQ IELAESIMAGFIESNLKPLTPAYVD+MNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
Subjt:  ALEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN

Query:  LDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE
        LDRAEEIFSQMETNGEIG+NARSCN+IL GYLL GNYMKAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE
Subjt:  LDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIE

Query:  SDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRT
        SD+ERKNHRIQFEFH+NCKTHSVLRRHIYEQYHKWLHSASKLTDGD+DIPYKFCTVSHSYFGFYADQFWPRGR+ IPNLIHRWLSPR LAYWYMYGGCRT
Subjt:  SDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRT

Query:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
        SSGDILLKLKGSHEGVEKIVKSLREKS+HCKVKRKG+MYWIGLLGSNATWFWKLIEPFILD LKESTQADSLNL GVLN +ENINFDS+SDSVEETSN
Subjt:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0099.87Show/hide
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
        MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF

Query:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL
        ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL
Subjt:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA
        VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA
Subjt:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKA

Query:  LEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
        LEIFREMEQLNST AAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL
Subjt:  LEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNL

Query:  DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES
        DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES
Subjt:  DRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES

Query:  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS
        DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS
Subjt:  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTS

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN
        SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSDSVEETSN

A0A5N6QQ61 LAGLIDADG_2 domain-containing protein0.0e+0068.05Show/hide
Query:  SMSIPTSAF------STVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFE
        S + PT+ F      S+++ LRSL+LSLS +HH  HY   I    F S    K R+   +RA +  + V+ L  +   P   E    S  S+    F F+
Subjt:  SMSIPTSAF------STVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFE

Query:  NGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALS
            S+DLK L  PAL+VKEL +LPEQWRRSKLAWLCKELPA K GT+IR+LNAQRKW+ Q DATY+ VHC+RIRENET F+VYKWMMQQHWY+FD+AL+
Subjt:  NGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALS

Query:  TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIY
        TKLADYMGKERKFSKCRE+FDDIINQG VP ESTFHILI+AYLSAP+Q C+EEA +IYNRMIQLGGYQP+LSLH+ LFRAL+SKPG  SK +LKQAEFI+
Subjt:  TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIY

Query:  HNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP
        HNL+TSGLE+HKDIYGGLIWLHSYQDTID+ERI SL+KEM+ AGI+E +EVLLSILRA SK  +V EAER W KL  LDG +P+ AFVYKMEVYAK+G+P
Subjt:  HNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP

Query:  MKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVK
        MK+LEIFREM E+L+ST+ AAY  II +LCK QE+ELAES+M  FI+SNLKPLTP+Y+DMMN++FNL+LHDKLEL FSQ L+KC+PN T+YSIYLDSLV 
Subjt:  MKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVK

Query:  VGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGL
        +GNLD+AEEIF+QM +NG IGVN+RSCN IL GYL  G+Y+KAEKIYDLMCQKKY ID PLMEK+DYVLSLSR+ VKKP+S+KLSKEQREILVGLLLGGL
Subjt:  VGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGL

Query:  EIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGG
        +IESDEERK+H ++FEF +N  TH VL+RHI++QYH+WLH + K  +G  DIP +FCT+SHSYFGFYADQFWP+GR  IP LIHRWLSPR LAYWYMYGG
Subjt:  EIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGG

Query:  CRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLN-ETENINFDSQSDS
         RTSSGDILL+LKG+HEGVEK+  +L EKS+ C++KRKGS++WIG LGSN+ WFWKLIEP++LDD+K+  +A    L   + ET++I++DS S+S
Subjt:  CRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLN-ETENINFDSQSDS

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0084.63Show/hide
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYS G +      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGYQPRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNL T+
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEM QAGI+EE+EVL+SILRASSK+GDV+EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA EI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI

Query:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA
        FREMEQLNS +AAAYQTIIGILCKF+E+ LAES+M GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDRA
Subjt:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA

Query:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE
        EEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE 
Subjt:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE

Query:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD
        RKNHRIQFEFH++C THS LRRHI+EQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSGD
Subjt:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
         +LKLKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QADSLN+    NET NINFDSQSDS EE S+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0084.76Show/hide
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  H++F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYSNG +       FAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRVYKWMMQQHWYRFDYAL+TKLADY
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADY

Query:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS
        MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+
Subjt:  MGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS

Query:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI
        GLELHKDIYGGLIWLHSYQDT+DKERI+SLRKEMQQAGI+EE+EVL+SILRASSK+GDV+EAER W K+K  DG+MP QAFVYKMEVYAK+G PMKALEI
Subjt:  GLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEI

Query:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA
        FREMEQLNS ++AAYQTIIGILCKF+E+ LAES+MAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGNLDRA
Subjt:  FREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRA

Query:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE
        EEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE 
Subjt:  EEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEE

Query:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD
        RKNHRIQFEFH++C THS LRRH+YEQYH+WLH ASKL+D D DIPYKFCTVSHSYFGFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSGD
Subjt:  RKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGD

Query:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN
         +LKLKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+S QAD+LNL   +NET NINFDSQSDS EE S+
Subjt:  ILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETENINFDSQSDSVEETSN

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351305.2e-1218.03Show/hide
Query:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   V+  L +    W   DD   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++++    LM + G+          + +   K     Y+ ++    +  K      
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL

Query:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP
        ++           + +Y   ++        E+   + +++Q+ G++ +  V  +++ + S+ G    A  ++  ++++       ++   ++ Y + G  
Subjt:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP

Query:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV
          A  +F EM++L  + T  ++  ++    K +++   E+I+    E+ ++P T     M+N++  L    K+E   ++     C  + + Y+I ++   
Subjt:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV

Query:  KVGNLDRAEEIFSQME
        K G L+R EE+F +++
Subjt:  KVGNLDRAEEIFSQME

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic6.4e-22852.44Show/hide
Query:  PRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRK
        P I A AS   ++ L+ D D   E E+            F  E   A+ + + + +P L V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQRK
Subjt:  PRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRK

Query:  WMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTI
        W+ QDDATY+ VHCLRIR N+ AFRVY WM++QHW+RF++AL+T++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C+EEA TI
Subjt:  WMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTI

Query:  YNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILR
        YN+MIQ+GGY+PRLSLH+SLFRAL+SK G  +K++LKQAEF+YHN+VT+ L++HKD+Y GLIWLHSYQD ID+ERI++LRKEM+QAG  E  +VL+S++R
Subjt:  YNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILR

Query:  ASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAY
        A SK G+V E E  W  +     ++P QA+V +ME YA+ G+PMK+L++F+EM+  N     A+Y  II I+ K  E+++ E +M  FIES++K L PA+
Subjt:  ASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAY

Query:  VDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDI
        +D+M M+ +L +H+KLELTF +C+ +C+PNR +Y+IYL+SLVKVGN+++AEE+F +M  NG IG N +SCN++L GYL   +Y KAEK+YD+M +KKYD+
Subjt:  VDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDI

Query:  DPPLMEKLDYVLSLSRKEVK-KPMSLKLSKEQREILVGLLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKF
            +EKL   L L++K +K K +S+KL +EQREIL+GLLLGG  +ES  +R  H + F+F ++   HSVLR HI+E++ +WL SAS+  D    IPY+F
Subjt:  DPPLMEKLDYVLSLSRKEVK-KPMSLKLSKEQREILVGLLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKF

Query:  CTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFW
         T+ H +F F+ DQF+ +G+  +P LIHRWL+PR LAYW+M+GG +  SGDI+LKL G + EGVE+IV SL  +S+  KVKRKG  +WIG  GSNA  FW
Subjt:  CTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFW

Query:  KLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSD
        ++IEP +L++       +  ++G  + T++ + DS  D
Subjt:  KLIEPFILDDLKESTQADSLNLGVLNETENINFDSQSD

Q9M9X9 Pentatricopeptide repeat-containing protein At1g06710, mitochondrial8.4e-1020.05Show/hide
Query:  STKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFI
        ST L   + K ++  +C+ V + ++ +GC PS   F+ L+ AY ++   G    A  +  +M++  G+ P   +++ L  ++      L+   L  AE  
Subjt:  STKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFI

Query:  YHNLVTSGLELHK--------------------DIYGGLIWLHSYQDTIDKERIVS-------------LRKEMQQAGIKEEKEVLLSILRASSKMGDVV
        Y  ++ +G+ L+K                     +   +I      DT    ++++             L +EM++ G+  +      ++ +  K G + 
Subjt:  YHNLVTSGLELHK--------------------DIYGGLIWLHSYQDTIDKERIVS-------------LRKEMQQAGIKEEKEVLLSILRASSKMGDVV

Query:  EAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESI---MAG-------------FIESNL
        +A + + +++ +        +   +  Y K  K   A E+F  M  +        Y  +I   CK  ++E A  I   M G             + +++ 
Subjt:  EAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESI---MAG-------------FIESNL

Query:  KPLTPAYVDMMNMFF-NLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNG
        +P    Y  +++ F  +  + +  +L  +  +E C+PN+ +Y   +D L KVG LD A+E+ ++M  +G
Subjt:  KPLTPAYVDMMNMFF-NLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNG

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic2.4e-0919.92Show/hide
Query:  GTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---
        G++ R L+  +  +  +D   +        + + + R++K+M +Q W + +  + T +   +G+E    KC EVFD++ +QG   S  ++  LI AY   
Subjt:  GTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY---

Query:  ---------LSAPVQGCIEEASTIYNRMIQ------------LG--------GYQPRLSLHSSLFRALMSKP-GDLSKHHLKQAEFIYHNLVTSGLELHK
                 L       I  +   YN +I             LG        G QP +  +++L  A   +  GD       +AE ++  +   G+    
Subjt:  ---------LSAPVQGCIEEASTIYNRMIQ------------LG--------GYQPRLSLHSSLFRALMSKP-GDLSKHHLKQAEFIYHNLVTSGLELHK

Query:  DIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQ
          Y  L+   ++      E++  L  EM   G   +      +L A +K G + EA  ++ +++          +   + ++ + G+     ++F EM+ 
Subjt:  DIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQ

Query:  LNS-TTAAAYQTIIGILCK---FQEI--------------------------------ELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTF
         N+   AA Y  +I +  +   F+E+                                E A  I+     +++ P + AY  ++  F   +L+++  + F
Subjt:  LNS-TTAAAYQTIIGILCK---FQEI--------------------------------ELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTF

Query:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR
        +   E    P+   +   L S  + G +  +E I S++  +G I  N  + N  +  Y   G + +A K Y  M + + D D   +E +  V S +R
Subjt:  SQCLE-KCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSR

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic3.9e-25756.68Show/hide
Query:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG
        S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L    + A  SG+FV+ L       +ESEE +S   +NG GD 
Subjt:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG

Query:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFD
            N   +V  + + T   EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRENET FRVY+WM QQ+WYRFD
Subjt:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFD

Query:  YALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQ
        + L+TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PRLSLH+SLFRAL+SK G +    LKQ
Subjt:  YALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQ

Query:  AEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYA
        AEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D  RI SLR+EM++AG +E KEV++S+LRA +K G V E ER W +L  LD  +P QAFVYK+E Y+
Subjt:  AEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYA

Query:  KMGKPMKALEIFREMEQ-LNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL
        K+G   KA+EIFREME+ +   T + Y  II +LCK Q++EL E++M  F ES  KPL P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYL
Subjt:  KMGKPMKALEIFREMEQ-LNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL

Query:  DSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PMSLKLSKEQREILVG
        DSL K+GNL++A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEVKK P S+KLSK+QRE+LVG
Subjt:  DSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PMSLKLSKEQREILVG

Query:  LLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAY
        LLLGGL+IESD+E+K+H I+FEF +N + H VL+++I++Q+ +WLH  S   + DI IP++F +V HSYFGFYA+ +WP+G+  IP LIHRWLSP +LAY
Subjt:  LLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAY

Query:  WYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETE--NINFDSQS
        WYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+N+  FWKLIEP +L++LKE  +  S +L  + E E  +INF S S
Subjt:  WYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETE--NINFDSQS

Query:  DSVEETSN
        D  ++  N
Subjt:  DSVEETSN

Arabidopsis top hitse value%identityAlignment
AT1G06710.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.9e-1120.05Show/hide
Query:  STKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFI
        ST L   + K ++  +C+ V + ++ +GC PS   F+ L+ AY ++   G    A  +  +M++  G+ P   +++ L  ++      L+   L  AE  
Subjt:  STKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFI

Query:  YHNLVTSGLELHK--------------------DIYGGLIWLHSYQDTIDKERIVS-------------LRKEMQQAGIKEEKEVLLSILRASSKMGDVV
        Y  ++ +G+ L+K                     +   +I      DT    ++++             L +EM++ G+  +      ++ +  K G + 
Subjt:  YHNLVTSGLELHK--------------------DIYGGLIWLHSYQDTIDKERIVS-------------LRKEMQQAGIKEEKEVLLSILRASSKMGDVV

Query:  EAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESI---MAG-------------FIESNL
        +A + + +++ +        +   +  Y K  K   A E+F  M  +        Y  +I   CK  ++E A  I   M G             + +++ 
Subjt:  EAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREM-EQLNSTTAAAYQTIIGILCKFQEIELAESI---MAG-------------FIESNL

Query:  KPLTPAYVDMMNMFF-NLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNG
        +P    Y  +++ F  +  + +  +L  +  +E C+PN+ +Y   +D L KVG LD A+E+ ++M  +G
Subjt:  KPLTPAYVDMMNMFF-NLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNG

AT1G62670.1 rna processing factor 24.5e-1121.67Show/hide
Query:  LSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSK-PGDLSKHHLKQAE
        LS+ L  Y   +R  S+   + D +   G  P+  TF+ LI       +     EA  + +RM+   G QP L  +  +   L  +   DL+ + L + E
Subjt:  LSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSK-PGDLSKHHLKQAE

Query:  FIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKM
              +  G+ ++  I  GL       D +      +L KEM+  GI+       S++      G   +A RL   +     N     F   ++ + K 
Subjt:  FIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKM

Query:  GKPMKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLS-LHDKLELTFSQCLEKCKPNRTIYSIYLD
        GK ++A +++ EM + +   +   Y ++I   C    ++ A+ +    +  +  P    Y  ++  F     + + +E+           N   Y+I + 
Subjt:  GKPMKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLS-LHDKLELTFSQCLEKCKPNRTIYSIYLD

Query:  SLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYD
         L + G+ D A+EIF +M ++G +  N  + N +L G    G   KA  +++ + + K +
Subjt:  SLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYD

AT2G15820.1 endonucleases2.7e-25856.68Show/hide
Query:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG
        S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L    + A  SG+FV+ L       +ESEE +S   +NG GD 
Subjt:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG

Query:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFD
            N   +V  + + T   EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRENET FRVY+WM QQ+WYRFD
Subjt:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFD

Query:  YALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQ
        + L+TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PRLSLH+SLFRAL+SK G +    LKQ
Subjt:  YALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQ

Query:  AEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYA
        AEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D  RI SLR+EM++AG +E KEV++S+LRA +K G V E ER W +L  LD  +P QAFVYK+E Y+
Subjt:  AEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYA

Query:  KMGKPMKALEIFREMEQ-LNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL
        K+G   KA+EIFREME+ +   T + Y  II +LCK Q++EL E++M  F ES  KPL P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYL
Subjt:  KMGKPMKALEIFREMEQ-LNSTTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL

Query:  DSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PMSLKLSKEQREILVG
        DSL K+GNL++A ++F++M+ NG I V+ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KEVKK P S+KLSK+QRE+LVG
Subjt:  DSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKK-PMSLKLSKEQREILVG

Query:  LLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAY
        LLLGGL+IESD+E+K+H I+FEF +N + H VL+++I++Q+ +WLH  S   + DI IP++F +V HSYFGFYA+ +WP+G+  IP LIHRWLSP +LAY
Subjt:  LLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGFYADQFWPRGRQTIPNLIHRWLSPRALAY

Query:  WYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETE--NINFDSQS
        WYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+N+  FWKLIEP +L++LKE  +  S +L  + E E  +INF S S
Subjt:  WYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNLGVLNETE--NINFDSQS

Query:  DSVEETSN
        D  ++  N
Subjt:  DSVEETSN

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-1318.03Show/hide
Query:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   V+  L +    W   DD   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++++    LM + G+          + +   K     Y+ ++    +  K      
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL

Query:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP
        ++           + +Y   ++        E+   + +++Q+ G++ +  V  +++ + S+ G    A  ++  ++++       ++   ++ Y + G  
Subjt:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP

Query:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV
          A  +F EM++L  + T  ++  ++    K +++   E+I+    E+ ++P T     M+N++  L    K+E   ++     C  + + Y+I ++   
Subjt:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV

Query:  KVGNLDRAEEIFSQME
        K G L+R EE+F +++
Subjt:  KVGNLDRAEEIFSQME

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-1318.03Show/hide
Query:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   V+  L +    W   DD   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTVIRLL-NAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++++    LM + G+          + +   K     Y+ ++    +  K      
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPR---LSLHSSLFRALMSKPGD----------LSKHHLKQAEFIYHNLVTSGLELHKDIYGGL

Query:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP
        ++           + +Y   ++        E+   + +++Q+ G++ +  V  +++ + S+ G    A  ++  ++++       ++   ++ Y + G  
Subjt:  IW-----------LHSYQDTIDK-------ERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKP

Query:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV
          A  +F EM++L  + T  ++  ++    K +++   E+I+    E+ ++P T     M+N++  L    K+E   ++     C  + + Y+I ++   
Subjt:  MKALEIFREMEQLN-STTAAAYQTIIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLDSLV

Query:  KVGNLDRAEEIFSQME
        K G L+R EE+F +++
Subjt:  KVGNLDRAEEIFSQME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACTTTCATTATCCCAATCA
TATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAGCTGGTGTATGACCGGG
ATTCCCCGTCCGAATCTGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTCCATTTTGAAAATGGTTTTGCATCAGTGGATTTGAAACATTTGGGAACG
CCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACAGTGATACG
ACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTTAGGGTGTACAAGTGGA
TGATGCAACAACATTGGTATCGATTTGATTATGCTTTATCTACTAAGCTTGCCGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGAGAAGTATTTGATGATATA
ATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCG
TATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAGTTCCCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTG
AGTTTATATATCATAATTTGGTAACAAGTGGACTCGAGCTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATTGACAAAGAAAGGATA
GTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAAAGAAGTCCTTTTGTCCATCTTGAGAGCAAGCTCGAAAATGGGGGATGTAGTGGAAGCAGAAAG
ATTGTGGCAAAAACTTAAGTATTTAGATGGTAACATGCCATATCAGGCTTTTGTTTATAAAATGGAAGTCTACGCAAAGATGGGTAAACCAATGAAGGCTTTGGAGATAT
TTAGGGAGATGGAGCAGTTGAACTCTACTACTGCTGCGGCATATCAGACAATTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGAATCAATCATGGCAGGC
TTCATAGAGAGTAATTTGAAACCCCTCACGCCAGCTTATGTTGATATGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCTCAGTGCCT
TGAGAAGTGTAAACCCAATCGTACCATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTTGACAGGGCTGAAGAAATATTTAGTCAGATGGAAACAAACG
GAGAAATTGGTGTAAATGCTCGTTCATGCAACCTCATTTTATGTGGGTATCTTTTATTTGGAAATTATATGAAGGCTGAAAAGATATATGATTTGATGTGTCAGAAAAAG
TATGACATTGATCCTCCTTTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGAAAGGAGGTTAAGAAGCCAATGAGCTTGAAGTTGAGTAAAGAACAGAGGGAGAT
TTTAGTAGGGTTGTTGTTAGGTGGTCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCACAGAATCCAATTCGAATTCCACAAAAACTGTAAAACTCACTCTGTTTTGA
GGAGGCACATATATGAGCAATACCACAAGTGGTTGCATTCTGCTTCAAAGTTGACCGATGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTT
GGTTTCTATGCAGATCAGTTTTGGCCACGAGGGCGTCAGACAATCCCTAATCTTATTCACCGGTGGCTTTCACCTCGTGCTCTTGCATACTGGTATATGTATGGAGGCTG
CAGGACATCATCAGGGGATATTTTATTGAAACTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGA
AGGGCAGCATGTATTGGATAGGTTTACTTGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAGCCTTTCATTCTGGATGACTTGAAAGAAAGTACACAGGCAGACAGT
CTTAACTTGGGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGAGGAGACTTCAAATTAA
mRNA sequenceShow/hide mRNA sequence
AAGAACCCTAAGCCCAATTTAAAAATAAAAAGTAAAATTAGTTGGAAAGCGCTAAAACTTCTCCCTTTCCCTCAAGCCTAGTCACGCCTCACGCCTCCTCCCTCTGCAGA
CACTTCTCCTCCCGACGCCGTCCGCCGCAACAATTCAGTTGCTTGCTGGCTGATGCTTTGAAGTGGCTCCCAGATTCGGGTTAGTCACTCCAAACTCTGCGTTTTCTTTC
TAATCGTAATCCTCCTATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACT
TTCATTATCCCAATCATATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAG
CTGGTGTATGACCGGGATTCCCCGTCCGAATCTGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTCCATTTTGAAAATGGTTTTGCATCAGTGGATTT
GAAACATTTGGGAACGCCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGC
CGGGAACAGTGATACGACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTT
AGGGTGTACAAGTGGATGATGCAACAACATTGGTATCGATTTGATTATGCTTTATCTACTAAGCTTGCCGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGAGA
AGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAA
GTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAGTTCCCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCAT
CATCTTAAACAGGCTGAGTTTATATATCATAATTTGGTAACAAGTGGACTCGAGCTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTAT
TGACAAAGAAAGGATAGTGTCACTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAAAGAAGTCCTTTTGTCCATCTTGAGAGCAAGCTCGAAAATGGGGGATG
TAGTGGAAGCAGAAAGATTGTGGCAAAAACTTAAGTATTTAGATGGTAACATGCCATATCAGGCTTTTGTTTATAAAATGGAAGTCTACGCAAAGATGGGTAAACCAATG
AAGGCTTTGGAGATATTTAGGGAGATGGAGCAGTTGAACTCTACTACTGCTGCGGCATATCAGACAATTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGA
ATCAATCATGGCAGGCTTCATAGAGAGTAATTTGAAACCCCTCACGCCAGCTTATGTTGATATGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAA
CCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACCATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAATCTTGACAGGGCTGAAGAAATATTTAGT
CAGATGGAAACAAACGGAGAAATTGGTGTAAATGCTCGTTCATGCAACCTCATTTTATGTGGGTATCTTTTATTTGGAAATTATATGAAGGCTGAAAAGATATATGATTT
GATGTGTCAGAAAAAGTATGACATTGATCCTCCTTTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGAAAGGAGGTTAAGAAGCCAATGAGCTTGAAGTTGAGTA
AAGAACAGAGGGAGATTTTAGTAGGGTTGTTGTTAGGTGGTCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCACAGAATCCAATTCGAATTCCACAAAAACTGTAAA
ACTCACTCTGTTTTGAGGAGGCACATATATGAGCAATACCACAAGTGGTTGCATTCTGCTTCAAAGTTGACCGATGGTGATATAGATATACCATATAAATTCTGCACTGT
TTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCACGAGGGCGTCAGACAATCCCTAATCTTATTCACCGGTGGCTTTCACCTCGTGCTCTTGCATACTGGT
ATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTATTGAAACTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGAAGTCCATGCAT
TGCAAAGTGAAAAGGAAGGGCAGCATGTATTGGATAGGTTTACTTGGAAGCAATGCCACATGGTTCTGGAAACTAATTGAGCCTTTCATTCTGGATGACTTGAAAGAAAG
TACACAGGCAGACAGTCTTAACTTGGGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGAGGAGACTTCAAATTAATTTAAGAGTTTTA
GTTATTAGGCCCATTATCAGTTGGATTCCTTAATTTGCCAACTGACGGAAGCCTACAATGTTTCATGATTTTGGTAGGGTATGTTATTCACATAGGATCTTCCTTGATCA
ATTTGGAAGATGTAAATATGTAAATCGATATGTAATGCTTATGTTCTGATTCTGTGTATTGTTGTAGCATTCTTGTTCACTTTTTAATATACGGTATATGTTAGTTTGGA
AGTCATACTTTCTTTTCTTTATCTTGTTTCTTAAAAAAATATAGTTTTCTTCTTTTGATAGGTAACTTTTTAGTGGTGGTTCTTACTGGATTCGTCGGAGGAAAGGGAAC
TGCTCAGTTTTTGTAATAATTTAAATTATTATTATTTTTAATAACTATTTTGTTCCCTCCGCTGGTCCAAAATTAAGTTCCCCTCAAGAACCTCATATGGATAATGGATG
TACCAAAAATGATTTAAATTTTCCTGTAGTATCTTTGCGTATAGGAGCTCTAAATGCACAGCCGACTTCAACACTCTCAGTAGAGACACCTCAGTCGGCTTCTCTCTCCC
TCGGGTAGCAGTCTTTGTTTCCATGATGGGAAATCCCAGATCATCTCTTCTGCCTACTGCCACCATCCCATTGTGGCCGGAGTGTGGTTCTTTACGGCCATTTTTGGTAT
GTCTGCTTCCCCAAAGTATTGACGATTGTTTCGCTGAAGTTCTTTTGTTGGTCGGTTGGTGACTGAAAGCAAGCCTGAAAACTTTTAGACATTTGCAGGTCAAGCTGTTC
TTTGGCTTATTTGGTTAGAATGAAACGAAAAGATCTTTTTTTCTAATCTTTGTGGCCATAATTTATTTTAATGTCGTATGGTAAATGGCCCTTGTAGACTGTACCTTAGA
TTTTATTTTTTATGACGATACTACGATGGTG
Protein sequenceShow/hide protein sequence
MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGT
PALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDI
INQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERI
VSLRKEMQQAGIKEEKEVLLSILRASSKMGDVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTTAAAYQTIIGILCKFQEIELAESIMAG
FIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKIYDLMCQKK
YDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYF
GFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADS
LNLGVLNETENINFDSQSDSVEETSN