; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G01760 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G01760
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLAGLIDADG_2 domain-containing protein
Genome locationClcChr07:1850298..1858497
RNA-Seq ExpressionClc07G01760
SyntenyClc07G01760
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0006388 - tRNA splicing, via endonucleolytic cleavage and ligation (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004519 - endonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004860 - Homing endonuclease, LAGLIDADG
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027434 - Homing endonuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152074.2 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus]0.0e+0087.97Show/hide
Query:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENG
        VFSMSIP TSAF+TVT  RSLTLSLSPYH YFHCPNHI+ TLF+P YSVK   QL RI +FAS SFV+QLV+D DSP ESEEH+ SS+SN  DGFHFENG
Subjt:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENG

Query:  FASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATK
        FAS DLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQD+ATYL VHCLRIRENETAFRVYKWMMQQ WYRFDYAL+TK
Subjt:  FASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATK

Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN
        LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHN
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN

Query:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMK
        LVTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKMGDV+EAE+ WQ+LKY DG+MPSQAFVYKMEVYAKMG+PMK
Subjt:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMK

Query:  ALEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
        ALEIFREMEQLN T+AAAYQTIIGILCK Q IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
Subjt:  ALEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN

Query:  IYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE
        + +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIE
Subjt:  IYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE

Query:  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRT
        SD+ERKNHRIQFEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSYFGFYADQFWPRG  +IPNLIHRWLSP VLAYWYMYGGCRT
Subjt:  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRT

Query:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        SSGDILLKLKGSHEGVEKIVKSLREKS+HCKVKRKGN+YWIGLLGSNATWFWKLIEPFILDY+K+S +AD+LNL  VLN +ENINFDS+SDSV E SN
Subjt:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

XP_008465080.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo]0.0e+0089.84Show/hide
Query:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGF
        VFSMSIP TSAF+TVTL RSLTLSLSPYH YFH PNHI+ TLFI +YSVK  QL RI +FAS SFV+QLV+DRDSP ESEEH+ S YSN  DGFHFENGF
Subjt:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKL
        AS DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQD+ATYLTVHCLRIRENETAFRVYKWMMQQ WYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKA
        VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMGDVVEAER WQKLKY DG+MP QAFVYKMEVYAKMG+PMKA
Subjt:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKA

Query:  LEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI
        LEIFREMEQLN T+AAAYQTIIGILCK Q+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN+
Subjt:  LEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI

Query:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES
         +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Subjt:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES

Query:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS
        DEERKNHRIQFEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYFGFYADQFWPRG  +IPNLIHRWLSP  LAYWYMYGGCRTS
Subjt:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKG++YWIGLLGSNATWFWKLIEPFILD +K+S +AD+LNL  VLNETENINFDSQSDSV E SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata]0.0e+0084.81Show/hide
Query:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH
        TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE LV+DRDSP ESEE + S YS  A+      GFASADLKH
Subjt:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH

Query:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
        LG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ WYRFDYALATKLADYMGKE
Subjt:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE

Query:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL
        RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+GLEL
Subjt:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL

Query:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM
        HKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+EAERSW KLK FDGSMPSQAFVYKMEVYAK+G PMKA EIFREM
Subjt:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM

Query:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF
        EQLN  SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGN+ +AEEIF
Subjt:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF

Query:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH
        +QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Subjt:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH

Query:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK
        RIQFEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSP VLAYWYMYGGCR SSGD +LK
Subjt:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK

Query:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        LKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+AD+LN+E+  NET NINFDSQSDS  EAS+
Subjt:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

XP_022998786.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima]0.0e+0085.44Show/hide
Query:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH
        TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE LV+DRDSP ESEE + S YSN A+       FASADLKH
Subjt:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH

Query:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
        LG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ WYRFDYALATKLADYMGKE
Subjt:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE

Query:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL
        RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+GLEL
Subjt:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL

Query:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM
        HKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+EAERSW K+K FDGSMPSQAFVYKMEVYAK+G PMKALEIFREM
Subjt:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM

Query:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF
        EQLN  S+AAYQTIIGILCK +++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGN+ +AEEIF
Subjt:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF

Query:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH
        +QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Subjt:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH

Query:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK
        RIQFEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQFWPRGHP+IPNLIHRWLSP VLAYWYMYGGCR SSGD +LK
Subjt:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK

Query:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        LKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+ADNLNLE+ +NET NINFDSQSDS  EAS+
Subjt:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

XP_038887990.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida]0.0e+0089.75Show/hide
Query:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH
        TSAF++VTL RS +LSLSPYH YF CPNHIVRT+FIP YSVKG  QL RIPSFASSS VEQLV+DRDS  ESEEH+ S YSN AD      GFASADLKH
Subjt:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH

Query:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
        L  PALEVKELDELP+QWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+ATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
Subjt:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE

Query:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL
        RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY+PRLSLHNSLFRAL SKPGDLSKHHLKQAEFIYHNLVTSGLE+
Subjt:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL

Query:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM
        HKDI GGLIWLHSYQDTIDKERIV LRKEMQQAGIKEEREVLLSILRASSKMG+V+EAERSWQKLK FDG+MPSQAFVYKMEVYAKMG+PMKALEIFREM
Subjt:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM

Query:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF
        EQLN  +AAAY+TIIGILCK Q IELAESIM GFIKSNLKPLMPAYVDLMNMFFNLSLH+KLEL FSQCLEKCKP+RTIYSIYLDSLVKVGN+ +AEEIF
Subjt:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF

Query:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH
        +QMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGG+EIESDEERKNH
Subjt:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH

Query:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK
        RIQFEF +N N+HSLLRRHIYEQYHEWLH ASKL+DGDIDIPYKFCTVSHSYFGFYADQFWP+GHP+IPNLIHRWLSP VLAYWYMYGGCRTSSGDILLK
Subjt:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK

Query:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        LKGS EGVEKIVKSLREKSM CKVKRKG++YWIGLLG+NATWFWKL+EPFILDY+KDSL AD+ NL RVLNETENINFDSQSDSV EASN
Subjt:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

TrEMBL top hitse value%identityAlignment
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein0.0e+0087.97Show/hide
Query:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENG
        VFSMSIP TSAF+TVT  RSLTLSLSPYH YFHCPNHI+ TLF+P YSVK   QL RI +FAS SFV+QLV+D DSP ESEEH+ SS+SN  DGFHFENG
Subjt:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK--GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENG

Query:  FASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATK
        FAS DLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQD+ATYL VHCLRIRENETAFRVYKWMMQQ WYRFDYAL+TK
Subjt:  FASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATK

Query:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN
        LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHN
Subjt:  LADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHN

Query:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMK
        LVTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKMGDV+EAE+ WQ+LKY DG+MPSQAFVYKMEVYAKMG+PMK
Subjt:  LVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMK

Query:  ALEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
        ALEIFREMEQLN T+AAAYQTIIGILCK Q IELAESIMAGFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN
Subjt:  ALEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN

Query:  IYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE
        + +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KAEKIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIE
Subjt:  IYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIE

Query:  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRT
        SD+ERKNHRIQFEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSYFGFYADQFWPRG  +IPNLIHRWLSP VLAYWYMYGGCRT
Subjt:  SDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRT

Query:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        SSGDILLKLKGSHEGVEKIVKSLREKS+HCKVKRKGN+YWIGLLGSNATWFWKLIEPFILDY+K+S +AD+LNL  VLN +ENINFDS+SDSV E SN
Subjt:  SSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0089.84Show/hide
Query:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGF
        VFSMSIP TSAF+TVTL RSLTLSLSPYH YFH PNHI+ TLFI +YSVK  QL RI +FAS SFV+QLV+DRDSP ESEEH+ S YSN  DGFHFENGF
Subjt:  VFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVK-GQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGF

Query:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKL
        AS DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQD+ATYLTVHCLRIRENETAFRVYKWMMQQ WYRFDYAL+TKL
Subjt:  ASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKL

Query:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL
        ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY+PRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Subjt:  ADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL

Query:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKA
        VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMGDVVEAER WQKLKY DG+MP QAFVYKMEVYAKMG+PMKA
Subjt:  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKA

Query:  LEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI
        LEIFREMEQLN T+AAAYQTIIGILCK Q+IELAESIMAGFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGN+
Subjt:  LEIFREMEQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI

Query:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES
         +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Subjt:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES

Query:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS
        DEERKNHRIQFEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYFGFYADQFWPRG  +IPNLIHRWLSP  LAYWYMYGGCRTS
Subjt:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKG++YWIGLLGSNATWFWKLIEPFILD +K+S +AD+LNL  VLNETENINFDSQSDSV E SN
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

A0A5N6QQ61 LAGLIDADG_2 domain-containing protein0.0e+0067.51Show/hide
Query:  MSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQLRR---IPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFA
        +S+P  S+ +++   RSL+LSLS +HR     +H  R++F P +    + R+   + + + S+ VE L  +   P   E   FS+ S+    F F+    
Subjt:  MSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQLRR---IPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFA

Query:  SADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLA
        S DLK L  PAL+VKEL +LPEQWRRSKLAWLCKELPA K GTLIR+LNAQ+KW+ Q +ATY+ VHC+RIRENET F+VYKWMMQQ WY+FD+ALATKLA
Subjt:  SADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLA

Query:  DYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLV
        DYMGKERKFSKCRE+FDDIINQG VP ESTFHILI+AYLSAP+Q C+EEA +IYNRMIQLGGY+P+LSLHN LFRAL+SKPG  SK +LKQAEFI+HNL+
Subjt:  DYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLV

Query:  TSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKAL
        TSGLE+HKDIYGGLIWLHSYQDTID+ERI  L+KEM+ AGI+E REVLLSILRA SK  +V EAER+W KL   DG +P  AFVYKMEVYAK+GEPMK+L
Subjt:  TSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKAL

Query:  EIFREM-EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI
        EIFREM E+L+ TS AAY  II +LCK+Q++ELAES+M  FIKSNLKPL P+Y+D+MN++FNL+LHDKLEL FSQ L+KC+PN T+YSIYLDSLV +GN+
Subjt:  EIFREM-EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI

Query:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES
         KAEEIFNQM +NG IG+N+RSCN IL GYL  G+Y+KAEKIYDLMCQKKY ID PLMEK+DYVLSLSR+ VKKP+S+KLSKEQREIL+GLLLGGL+IES
Subjt:  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIES

Query:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS
        DEERK+H ++FEF +N ++H +L+RHI++QYHEWLHP+ K  +G  DIP +FCT+SHSYFGFYADQFWP+G P IP LIHRWLSP VLAYWYMYGG RTS
Subjt:  DEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTS

Query:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDS
        SGDILL+LKG+HEGVEK+  +L EKS+ C++KRKG+++WIG LGSN+ WFWKLIEP++LD MKD L+A    LE    ET++I++DS S+S
Subjt:  SGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDS

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0084.81Show/hide
Query:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH
        TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE LV+DRDSP ESEE + S YS  A+      GFASADLKH
Subjt:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH

Query:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
        LG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ WYRFDYALATKLADYMGKE
Subjt:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE

Query:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL
        RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIEE+STIYNRMIQLGGY+PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+GLEL
Subjt:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL

Query:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM
        HKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+EAERSW KLK FDGSMPSQAFVYKMEVYAK+G PMKA EIFREM
Subjt:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM

Query:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF
        EQLN  SAAAYQTIIGILCK +++ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGN+ +AEEIF
Subjt:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF

Query:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH
        +QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Subjt:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH

Query:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK
        RIQFEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQFWPRGHP IPNLIHRWLSP VLAYWYMYGGCR SSGD +LK
Subjt:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK

Query:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        LKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+AD+LN+E+  NET NINFDSQSDS  EAS+
Subjt:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic0.0e+0085.44Show/hide
Query:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH
        TSAFATVTL RSLTL  S  H +F C N+++R+L IPTYS KG  QL RIP+FASSS VE LV+DRDSP ESEE + S YSN A+       FASADLKH
Subjt:  TSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKG--QLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKH

Query:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE
        LG PALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQ+KW+ QD+A YL VHCLRIRENETAFRVYKWMMQQ WYRFDYALATKLADYMGKE
Subjt:  LGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKE

Query:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL
        RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+GLEL
Subjt:  RKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLEL

Query:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM
        HKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+EAERSW K+K FDGSMPSQAFVYKMEVYAK+G PMKALEIFREM
Subjt:  HKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREM

Query:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF
        EQLN  S+AAYQTIIGILCK +++ LAES+MAGFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYL+SLVKVGN+ +AEEIF
Subjt:  EQLNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIF

Query:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH
        +QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREIL+GLLLGGLEIESDE RKNH
Subjt:  NQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNH

Query:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK
        RIQFEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFYADQFWPRGHP+IPNLIHRWLSP VLAYWYMYGGCR SSGD +LK
Subjt:  RIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLK

Query:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN
        LKGS EGV KIVKSLREKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD +KDSL+ADNLNLE+ +NET NINFDSQSDS  EAS+
Subjt:  LKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDYMKDSLRADNLNLERVLNETENINFDSQSDSVGEASN

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351302.2e-1120.55Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII
               L  EM+    K       +++ A ++ G   +AE  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Subjt:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII

Query:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR
            ++     AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   K E+I  +ME NG    +  
Subjt:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

Q5G1S8 Pentatricopeptide repeat-containing protein At3g18110, chloroplastic6.3e-1122.01Show/hide
Query:  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELH
        KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G RP    +N+L  A           +L  A  ++ ++     +  
Subjt:  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELH

Query:  KDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREME
           Y  +I ++       +   +F+  E++  G   +     S+L A ++  +  + +  +Q+++          +   + +Y K G+   AL+++++M+
Subjt:  KDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREME

Query:  QLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE
         L+  +  A  Y  +I  L K+ +   A ++M+  +   +KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Subjt:  QLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE

Query:  IFNQMETNG
        ++  M ++G
Subjt:  IFNQMETNG

Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic4.1e-22853.19Show/hide
Query:  IPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWL
        IP+ AS+  +E L+ D D   E E+        E   F  E   A+ + + + +P L V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQ+KW+
Subjt:  IPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWL

Query:  GQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYN
         QD+ATY+ VHCLRIR N+ AFRVY WM++Q W+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS P   C+EEA TIYN
Subjt:  GQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYN

Query:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS
        +MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAEF+YHN+VT+ L++HKD+Y GLIWLHSYQD ID+ERI+ LRKEM+QAG  E  +VL+S++RA 
Subjt:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS

Query:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD
        SK G+V E E +W  +      +P QA+V +ME YA+ GEPMK+L++F+EM+  N   + A+Y  II I+ K+ ++++ E +M  FI+S++K LMPA++D
Subjt:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD

Query:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP
        LM M+ +L +H+KLELTF +C+ +C+PNR +Y+IYL+SLVKVGNI KAEE+F +M  NG IG N +SCNI+L GYL   +Y KAEK+YD+M +KKYD+  
Subjt:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP

Query:  PLMEKLDYVLSLSRKEVK-KPLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT
          +EKL   L L++K +K K +S+KL +EQREILIGLLLGG  +ES  +R  H + F+F ++ N+HS+LR HI+E++ EWL  AS+  D    IPY+F T
Subjt:  PLMEKLDYVLSLSRKEVK-KPLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT

Query:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKL
        + H +F F+ DQF+ +G P +P LIHRWL+P VLAYW+M+GG +  SGDI+LKL G + EGVE+IV SL  +S+  KVKRKG  +WIG  GSNA  FW++
Subjt:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSH-EGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKL

Query:  IEPFILDYMKDSLRADNLNLERVLNETENINFDSQSD
        IEP +L+     +  +  ++    + T++ + DS  D
Subjt:  IEPFILDYMKDSLRADNLNLERVLNETENINFDSQSD

Q76C99 Protein Rf1, mitochondrial1.7e-1124.05Show/hide
Query:  KERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL
        KE    K    + +++++G +P   T++ +I A   A     +++A  + N M++  G  P    +NS+        G  S    K+A      + + G+
Subjt:  KERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL

Query:  ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQ-AFVYKMEVYAKMGEPMKALEIF
        E     Y  L+          + R +F    M + G+K E     ++L+  +  G +VE       L   +G  P    F   +  YAK G+  +A+ +F
Subjt:  ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQ-AFVYKMEVYAKMGEPMKALEIF

Query:  REMEQLNCT-SAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK--CKPNRTIYSIYLDSLVKVGNIY
         +M Q     +A  Y  +IGILCKS ++E A       I   L P    Y  L++     +  ++ E    + L++  C  N   ++  +DS  K G + 
Subjt:  REMEQLNCT-SAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK--CKPNRTIYSIYLDSLVKVGNIY

Query:  KAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKI
        ++E++F  M   G +  N  + N +++GY L G   +A K+
Subjt:  KAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKI

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic1.3e-25854.74Show/hide
Query:  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSS
        SS SLA    SS   +S+   +  S+ + P++ + S          TLFRSL+ SL   HR  +    + R      +  K Q       R  P F ++S
Subjt:  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSS

Query:  FVEQ---LVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLG
          ++    V       ESEE +     +EA+GF  +   A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RLLNAQKKW+ 
Subjt:  FVEQ---LVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLG

Query:  QDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN
        Q++ATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YN
Subjt:  QDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN

Query:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS
        RMIQLGGY+PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D  RI  LR+EM++AG +E +EV++S+LRA 
Subjt:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS

Query:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD
        +K G V E ER+W +L   D  +PSQAFVYK+E Y+K+G+  KA+EIFREME+ +   + + Y  II +LCK QQ+EL E++M  F +S  KPL+P++++
Subjt:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD

Query:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP
        +  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GN+ KA ++FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+P
Subjt:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP

Query:  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT
        PLMEKLDY+LSL +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I+FEF +N  +H +L+++I++Q+ EWLHP S   + DI IP++F +
Subjt:  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT

Query:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLI
        V HSYFGFYA+ +WP+G P IP LIHRWLSP  LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+N+  FWKLI
Subjt:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLI

Query:  EPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVGEASN
        EP +L+ +K+ L+  + +L+ V   E ++INF S SD   +  N
Subjt:  EPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVGEASN

Arabidopsis top hitse value%identityAlignment
AT2G15820.1 endonucleases9.3e-26054.74Show/hide
Query:  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSS
        SS SLA    SS   +S+   +  S+ + P++ + S          TLFRSL+ SL   HR  +    + R      +  K Q       R  P F ++S
Subjt:  SSCSLADDLKSSQIRLSLPILHFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQL------RRIPSFASSS

Query:  FVEQ---LVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLG
          ++    V       ESEE +     +EA+GF  +   A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RLLNAQKKW+ 
Subjt:  FVEQ---LVHDRDSPLESEEHVFSSYSNEADGFHFENGFASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLG

Query:  QDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN
        Q++ATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YN
Subjt:  QDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYN

Query:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS
        RMIQLGGY+PRLSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D  RI  LR+EM++AG +E +EV++S+LRA 
Subjt:  RMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRAS

Query:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD
        +K G V E ER+W +L   D  +PSQAFVYK+E Y+K+G+  KA+EIFREME+ +   + + Y  II +LCK QQ+EL E++M  F +S  KPL+P++++
Subjt:  SKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQ-LNCTSAAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVD

Query:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP
        +  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GN+ KA ++FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+P
Subjt:  LMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDP

Query:  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT
        PLMEKLDY+LSL +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I+FEF +N  +H +L+++I++Q+ EWLHP S   + DI IP++F +
Subjt:  PLMEKLDYVLSLSRKEVKK-PLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCT

Query:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLI
        V HSYFGFYA+ +WP+G P IP LIHRWLSP  LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+N+  FWKLI
Subjt:  VSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLI

Query:  EPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVGEASN
        EP +L+ +K+ L+  + +L+ V   E ++INF S SD   +  N
Subjt:  EPFILDYMKDSLRADNLNLERVLN-ETENINFDSQSDSVGEASN

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-1220.55Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII
               L  EM+    K       +++ A ++ G   +AE  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Subjt:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII

Query:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR
            ++     AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   K E+I  +ME NG    +  
Subjt:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-1220.55Show/hide
Query:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS
        L+++ KE    K   ++  L +    W   D+   ++V     ++ ++   V +W++++  ++ D      L D  G++ ++ +   ++  ++    VP+
Subjt:  LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPS

Query:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI
        E T+ +LI AY  A   G IE A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     +   + Y   + ++ Y    
Subjt:  ESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI

Query:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII
               L  EM+    K       +++ A ++ G   +AE  +++L+  DG  P   +VY   ME Y++ G P  A EIF  M+ + C    A+Y  ++
Subjt:  DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYK--MEVYAKMGEPMKALEIFREMEQLNC-TSAAAYQTII

Query:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR
            ++     AE++     +  + P M +++ L++ +       K E    +  E   +P+  + +  L+   ++G   K E+I  +ME NG    +  
Subjt:  GILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINAR

Query:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID
        + NI+++ Y   G   + E+++  + +K +  D
Subjt:  SCNIILSGYLLFGNYLKAEKIYDLMCQKKYDID

AT3G18110.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-1222.01Show/hide
Query:  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELH
        KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G RP    +N+L  A           +L  A  ++ ++     +  
Subjt:  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELH

Query:  KDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREME
           Y  +I ++       +   +F+  E++  G   +     S+L A ++  +  + +  +Q+++          +   + +Y K G+   AL+++++M+
Subjt:  KDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREME

Query:  QLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE
         L+  +  A  Y  +I  L K+ +   A ++M+  +   +KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Subjt:  QLNCTS--AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE

Query:  IFNQMETNG
        ++  M ++G
Subjt:  IFNQMETNG

AT5G08310.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-1022.63Show/hide
Query:  AFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFR
        A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I    +A   G ++EAS++++R+ ++G   P    +N L  
Subjt:  AFRVYKWMMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFR

Query:  ALMSKPGDLSKHHLKQAEFIYHNL-VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYF
        A       +SK +    E +   L        H D +     L  Y +T   ER + +  E+   G  +E  +   ++ +  K G V +A    + L+  
Subjt:  ALMSKPGDLSKHHLKQAEFIYHNL-VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYF

Query:  DGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTS-AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKP
        D  +  + +   +  + K     KA ++F +M ++   +  A Y  +IG LCK + +E+A S+     +S + P
Subjt:  DGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTS-AAAYQTIIGILCKSQQIELAESIMAGFIKSNLKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCTCAATCCACCAAGAAGGGGAAAAGTTGGGATAAAGGAACTGCTAGATC
AAAGGAGATAGTGGTGTCTAGTCCTTGTGGGAGTTCTCAAAAATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAA
AGGATATTGCAAAGTCTTGCTTTTATAGTACCATTAATACAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCAATATTGGATCCAAAAGATGAAAAAGGAAGAT
ATTGCTAAGGGAATTAGACAGATTGGTCAAAAGGTGGATTCTTCTTCTTCTGTTCATCCTGGCACCAAAGTTTTGCCAATAACTGATGCCACTTTAGTAGAGCAATATAG
CAGTAAGCCTGAGAAGAAAGTTGTGGGCAATGGGGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAAATGGGCTATGGCTTCAAGATCAGAAAAAGGAG
GGAAGTTCATCACTGGCAAGGTTTCACGACTGCGGAATCGAGGAACTCTTAAAGCAGGCTTGGACGATGATCGAGGGAGTAATGACTCCCCGAAGATCAGTTTCAGATGG
GAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCTCATTCAAGAACTGTTCCATTACACTGAATTCCACTGCCATTCATGAAATCAA
TCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTACCAGAAGCTCGAGAAGCCTAAACCTGATTCTGCTCGGAATGAAAATTAAATCCGAAAAAC
TAGTCAAGGCGTTGCTCGGAACTAAATCTCGTATGCTTCCACCCTGCTCCAGTTGCTCGCTGGCTGATGACTTGAAGTCCTCCCAAATTCGGCTGTCACTCCCAATTCTG
CATTTTATATCTATCCGTAATCCTCCTTCCGTTTTCTCCATGTCCATTCCTACTACCTCTGCATTTGCCACTGTGACCCTTTTCCGTTCTCTCACTCTTTCCCTCTCTCC
ATACCATCGCTACTTTCATTGTCCCAATCACATAGTCCGTACTCTCTTTATCCCAACATATTCTGTAAAAGGACAACTTCGGCGGATTCCGTCCTTTGCTTCCAGTTCTT
TTGTTGAACAGCTGGTGCATGACCGGGATTCCCCGTTGGAGTCTGAAGAGCACGTATTTTCTTCATACAGTAATGAGGCTGATGGTTTTCATTTTGAAAATGGTTTTGCG
TCGGCGGATTTGAAACATTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCC
AGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAGGATGAAGCGACCTATCTCACTGTGCATTGTTTGCGTATTCGTGAAAACG
AGACTGCATTTAGGGTGTACAAGTGGATGATGCAACAACGTTGGTACCGATTCGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTCA
AAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGCGTGCCTAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCAT
AGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCGACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATT
TGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGGCTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTAC
CAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAA
AATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTAAGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGATGG
GTGAACCAATGAAAGCTTTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTGTACAAGTGCTGCAGCATATCAGACGATTATTGGTATTTTATGTAAATCTCAACAGATA
GAACTTGCAGAATCGATCATGGCAGGCTTCATAAAGAGTAATTTGAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAA
GTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTAGTAAAAGTTGGTAATATCTACAAGGCCGAAG
AAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCGTGCAACATCATTTTAAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAA
ATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTAAGTAGGAAGGAGGTTAAGAAGCCACTAAGCTT
GAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCAATTTGAATTCCTCA
AAAACCGGAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGTGACGGTGATATAGATATACCATATAAA
TTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTTCAATACCTAATCTAATTCACCGGTGGCTTTCACCTTGTGTTCT
TGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGA
AGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATATATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTAC
ATGAAAGATAGTCTACGGGCAGACAATCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTA
A
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCTCAATCCACCAAGAAGGGGAAAAGTTGGGATAAAGGAACTGCTAGATC
AAAGGAGATAGTGGTGTCTAGTCCTTGTGGGAGTTCTCAAAAATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAA
AGGATATTGCAAAGTCTTGCTTTTATAGTACCATTAATACAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCAATATTGGATCCAAAAGATGAAAAAGGAAGAT
ATTGCTAAGGGAATTAGACAGATTGGTCAAAAGGTGGATTCTTCTTCTTCTGTTCATCCTGGCACCAAAGTTTTGCCAATAACTGATGCCACTTTAGTAGAGCAATATAG
CAGTAAGCCTGAGAAGAAAGTTGTGGGCAATGGGGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAAATGGGCTATGGCTTCAAGATCAGAAAAAGGAG
GGAAGTTCATCACTGGCAAGGTTTCACGACTGCGGAATCGAGGAACTCTTAAAGCAGGCTTGGACGATGATCGAGGGAGTAATGACTCCCCGAAGATCAGTTTCAGATGG
GAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCTCATTCAAGAACTGTTCCATTACACTGAATTCCACTGCCATTCATGAAATCAA
TCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTACCAGAAGCTCGAGAAGCCTAAACCTGATTCTGCTCGGAATGAAAATTAAATCCGAAAAAC
TAGTCAAGGCGTTGCTCGGAACTAAATCTCGTATGCTTCCACCCTGCTCCAGTTGCTCGCTGGCTGATGACTTGAAGTCCTCCCAAATTCGGCTGTCACTCCCAATTCTG
CATTTTATATCTATCCGTAATCCTCCTTCCGTTTTCTCCATGTCCATTCCTACTACCTCTGCATTTGCCACTGTGACCCTTTTCCGTTCTCTCACTCTTTCCCTCTCTCC
ATACCATCGCTACTTTCATTGTCCCAATCACATAGTCCGTACTCTCTTTATCCCAACATATTCTGTAAAAGGACAACTTCGGCGGATTCCGTCCTTTGCTTCCAGTTCTT
TTGTTGAACAGCTGGTGCATGACCGGGATTCCCCGTTGGAGTCTGAAGAGCACGTATTTTCTTCATACAGTAATGAGGCTGATGGTTTTCATTTTGAAAATGGTTTTGCG
TCGGCGGATTTGAAACATTTGGGAACGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCC
AGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAGGATGAAGCGACCTATCTCACTGTGCATTGTTTGCGTATTCGTGAAAACG
AGACTGCATTTAGGGTGTACAAGTGGATGATGCAACAACGTTGGTACCGATTCGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAGTTCTCA
AAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGCGTGCCTAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCAT
AGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCGACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATT
TGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGGCTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTAC
CAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATCTTGAGAGCGAGCTCAAA
AATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTAAGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATGCAAAGATGG
GTGAACCAATGAAAGCTTTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTGTACAAGTGCTGCAGCATATCAGACGATTATTGGTATTTTATGTAAATCTCAACAGATA
GAACTTGCAGAATCGATCATGGCAGGCTTCATAAAGAGTAATTTGAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAA
GTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTAGTAAAAGTTGGTAATATCTACAAGGCCGAAG
AAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCGTGCAACATCATTTTAAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAA
ATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTAAGTAGGAAGGAGGTTAAGAAGCCACTAAGCTT
GAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCAATTTGAATTCCTCA
AAAACCGGAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGTGACGGTGATATAGATATACCATATAAA
TTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTTCAATACCTAATCTAATTCACCGGTGGCTTTCACCTTGTGTTCT
TGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTCTGAGAGAGA
AGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATATATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTAC
ATGAAAGATAGTCTACGGGCAGACAATCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTA
ATACAAGAGTTTTAGTTGCTGGTCCATTATCAGTTGGATTCTTTACTTTGCCAAATGACGGAAGCCTTGAATGTTTCATGGTTTTGGGAGGGTTTGTTATTCAACTAGGA
TCTTCCTTGACCAATTCGGAAGATGTAAATACTTGAATGGATAAGTAATGTTTATGTTCTGATTCTGTGTATTGTCGTAGCATTCTTGTTCACTTTAATATAGGTTAGTT
AGTTTGGAAGTCATTAAAGTTTGTTGTTTTGTCGTTGTAAATATATATTTAAAAAGGATAAGTTACCTTCATAAGACTTGCGCTTACCTACTGATTGGTTAGTAGAAATC
TTCTATTGATCACCATAAAGTTTTTGACTGGTAAGAAAGAGAG
Protein sequenceShow/hide protein sequence
MQQVLQWVFKETNEKQGQRTSQSTKKGKSWDKGTARSKEIVVSSPCGSSQKSKGFEGRKFDLKKLKSFALLCRKDIAKSCFYSTINTKRPRTQGFDNMHQYWIQKMKKED
IAKGIRQIGQKVDSSSSVHPGTKVLPITDATLVEQYSSKPEKKVVGNGESKTKTKSRMKELLKWAMASRSEKGGKFITGKVSRLRNRGTLKAGLDDDRGSNDSPKISFRW
EAESCSSISSAYSSVSAVSSFKNCSITLNSTAIHEINQYYPRRGSWITTDSECTRSSRSLNLILLGMKIKSEKLVKALLGTKSRMLPPCSSCSLADDLKSSQIRLSLPIL
HFISIRNPPSVFSMSIPTTSAFATVTLFRSLTLSLSPYHRYFHCPNHIVRTLFIPTYSVKGQLRRIPSFASSSFVEQLVHDRDSPLESEEHVFSSYSNEADGFHFENGFA
SADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDEATYLTVHCLRIRENETAFRVYKWMMQQRWYRFDYALATKLADYMGKERKFS
KCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYRPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSY
QDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLKYFDGSMPSQAFVYKMEVYAKMGEPMKALEIFREMEQLNCTSAAAYQTIIGILCKSQQI
ELAESIMAGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEK
IYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPLSLKLSKEQREILIGLLLGGLEIESDEERKNHRIQFEFLKNRNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYK
FCTVSHSYFGFYADQFWPRGHPSIPNLIHRWLSPCVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNIYWIGLLGSNATWFWKLIEPFILDY
MKDSLRADNLNLERVLNETENINFDSQSDSVGEASN