; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037535 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037535
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold11:33344381..33346126
RNA-Seq ExpressionSpg037535
SyntenySpg037535
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]3.4e-28484.55Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F GFSS+FF TS TTKHIAIA RAL RRPTSRTAP PR+ +T+ S+DVVNSVCSLL+NKN QT NLD++HLLKRFK+ L+SDLVL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRI+ PDK+SYSN+IIGLCKFGR+ TAIE F EM R GLVPTRSA NILIG+LCSLS+KEGA+EKVRVRST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST

Query:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS
         RPFTVL+PNVNPKSGAIEPAVG+FWAA +L LVPS+FV VQLISELCR+GQ QEAI++LKV+E  KLRCAEECYS+VM+ALCEHR ++EASDLFGRMLS
Subjt:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE RNWSAAY LL+EMLSLGMSPHFHVYSLVDKLMREHGQ+DLCLKLE
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        MKWEAQILQKLCK GQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK GK KIARELLQ++DGVH+HE+GT+NSS
Subjt:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]4.0e-28586.23Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+T K  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRFGTA+EVFDEM R GLVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAI+ AVGVFWAA RLALVPS FVIV+LISELCRLGQ QEAIR+LKV+E  KLRC EECYSIVMQALCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWSAAYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCK GQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ ++ +S
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]2.4e-28586.4Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+TTK  AIA RALARRPTSRTA IPRALD    TD V+SVCSLL+NK+HQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRFGTA+EVFDEM R  LVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAIEPAVGVFWAA R+ALVPSAFVIV+LISELCRLGQ QEAIR+LKV+E  KLRC EECYSIVMQALCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWSAAYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCK GQL AAYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ T+ +S
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo]2.8e-28686.06Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSSN F TS+TTK  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQT NL+LDHLLKRFKET++
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWS LQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRFGTA+EVFDEM R GLVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAIEPAVGVFWAA RLALVPSAFVIV+LI ELCRLGQ QEAIR+LKV+E  KLRC EECYSIVMQALCEHR+V+EASDL 
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRK+CVPDHVTYSALIHAYGE RNWSA YSLL++MLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCK GQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+G++ +S
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]8.1e-30288.81Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLS +T RNASA LK   L F+GFSS+FF TST TKHIAIA RALARRPTSRTAPIPRA DTL S+DVVNSVCSLL+NKNHQT NLDLDHLLKRFK+TL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMG+RFDE+VVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTAL+IFRRI+ PDK+SYSN+IIGLCKFGRFGTAIEVFDEM+R GLVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAIEPAVG+FWAA +LALVPSAFVIVQLISELCRLGQ QEAI++LKV+EG KLRCAEECYS+VM+ALCEHR VEEASDLF
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GR+LSQGMKPKLAIYNS+ICMLCK+GNL+DAERVFKIMNRKRC PDHVTYS+LIHAYGETRNWSAAYSLL+EMLSLGMSPHFH+YSLVDKLMREHGQIDL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWEAQILQKLCK GQL+AAYEK+K+MLEKG YPPIYVRD+FE AFQK GK KIARELLQ+IDGVH+HE+GT+NSS
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein1.6e-27185.58Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F G SS+FF TS TT HIAIA RALARRPTSRTAP PR+ +TL S+DVVNSVCSLL+NKN QT NLDLDHLLKRFK+ L+SD VL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRI+ PDK+SYSN+IIGLCKFGR+ TAIE F EM R GLVPTR+AVNILIG+LCSLS+KEGA+EKVRV ST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST

Query:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS
         RPFTVL+PNVNPKSGAIEPAVG+FWAA +L+LVPS+FV VQLISELCRLGQ QEAIR+LKV+EG KLRCAEECYS+VM+ALCEHR V+EASDLFGRMLS
Subjt:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE R+WSAAY LL+EMLSLGMSPHFHVYS+VDKLMREHGQIDLCLKLE
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQK
        MKWEAQILQKLCK GQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK
Subjt:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.6e-28484.55Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F GFSS+FF TS TTKHIAIA RAL RRPTSRTAP PR+ +T+ S+DVVNSVCSLL+NKN QT NLD++HLLKRFK+ L+SDLVL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGELIDTAL IFRRI+ PDK+SYSN+IIGLCKFGR+ TAIE F EM R GLVPTRSA NILIG+LCSLS+KEGA+EKVRVRST
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRST

Query:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS
         RPFTVL+PNVNPKSGAIEPAVG+FWAA +L LVPS+FV VQLISELCR+GQ QEAI++LKV+E  KLRCAEECYS+VM+ALCEHR ++EASDLFGRMLS
Subjt:  RRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE RNWSAAY LL+EMLSLGMSPHFHVYSLVDKLMREHGQ+DLCLKLE
Subjt:  QGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        MKWEAQILQKLCK GQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK GK KIARELLQ++DGVH+HE+GT+NSS
Subjt:  MKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like3.4e-28286.97Show/hide
Query:  TRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQ
        +RN S LLK  TL F  FSSNFFGTSTT   IAIA R  ARRPTSR+AP+PRALDTLSSTDVVNSVCSLL+NKNHQTTNLDLD LLKRF E L+SDLVL+
Subjt:  TRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQ

Query:  ILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
        ILMNYR+LGRAKTLEFFSWSGLQMG+RFDESVVEYMADF GRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD

Query:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTR
        NLVFNN+LYALCKKE TGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRF TA+EVF+EM+R G VPTRSAVNILIGDLCSLS+KEGAIEKVRVRSTR
Subjt:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTR

Query:  RPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQ
        RPFTVL+PNVN KSGAIEPAVGVFWAA R+ALVPS+FV+VQLISELCRLGQ QEAI +LKV+E GKLRC EEC+SIVMQALCE+RQVEEASDLFGRMLSQ
Subjt:  RPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQ

Query:  GMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEM
        GMKPKLA+YNSVICMLCKLGN+ DAERVFKIMNRKRCVPD VTYSALIHAY E  NWSAAYSLL+EMLSLGMSPHFH+YS VDKLMREHGQ+DLCLKLEM
Subjt:  GMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEM

Query:  KWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHE
        KWEAQILQKLCK GQLEAAYEKLK+MLEKG +PP YVRDAFE AFQKNGK KIARELL++I GVH  E
Subjt:  KWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHE

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like1.9e-28586.23Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+T K  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRFGTA+EVFDEM R GLVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAI+ AVGVFWAA RLALVPS FVIV+LISELCRLGQ QEAIR+LKV+E  KLRC EECYSIVMQALCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWSAAYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCK GQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ ++ +S
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like1.1e-28586.4Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+TTK  AIA RALARRPTSRTA IPRALD    TD V+SVCSLL+NK+HQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRI+ PDK+SYSNIIIGLCKFGRFGTA+EVFDEM R  LVPTRSAVNILIGDLCSLS+KEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKV

Query:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF
        RVRSTRRPFTVL+PNVNPKSGAIEPAVGVFWAA R+ALVPSAFVIV+LISELCRLGQ QEAIR+LKV+E  KLRC EECYSIVMQALCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWSAAYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCK GQL AAYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ T+ +S
Subjt:  CLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

SwissProt top hitse value%identityAlignment
Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial4.5e-2924.74Show/hide
Query:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        ++L  + +LGRA  L +     ++S L  GF  +  V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVR
        G +PD + +  +L  LCK   +   +D    +  R        YS +I  LCK G F  A+ +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVR

Query:  VRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFG
            R     +IP+V   S  I+    VF    +L          +L +E+   G   + I    +I+G    C E C             + EA+ +F 
Subjt:  VRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC
         M+S+G +P +  Y+ +I   CK   +DD  R+F+ ++ K  +P+ +TY+ L+  + ++   +AA  L QEM+S G+ P    Y ++   + ++G+++  
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI
        L++  K +             I+  +C   +++ A+    ++ +KG  P +   +       K G L  A  L +++
Subjt:  LKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745801.1e-2724.76Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      +     +  +   PD  +Y+N+I GLCK  +F  A     +M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEM

Query:  DRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVI
           GL P     N LI   C                              K G ++ A  +   A     VP  F    LI  LC  G+T  A+ +    
Subjt:  DRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVI

Query:  EGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYS
         G  ++     Y+ +++ L     + EA+ L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD  T++ LIH Y        A  
Subjt:  EGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYS

Query:  LLQEMLSLGMSPHFHVY-SLVDKLMREHGQIDLCLKLEMKWEAQ----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKL
        +L  ML  G+ P  + Y SL++ L +     D+    +   E            +L+ LC+  +L+ A   L+ M  K   P           F KNG L
Subjt:  LLQEMLSLGMSPHFHVY-SLVDKLMREHGQIDLCLKLEMKWEAQ----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKL

Query:  KIARELLQRIDGVHKHETGT
          A  L ++++  +K  + T
Subjt:  KIARELLQRIDGVHKHETGT

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.4e-3025.25Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD ++Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNI

Query:  LIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE---------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G+  EA+ +LK +E    
Subjt:  LIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE---------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKL

Query:  RCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEM
          +   Y+ ++   C+  +  EA ++F  M   G+      YN++I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  ++Q M
Subjt:  RCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEM

Query:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L +  +   A    + MLE+   PP  +  R  F       G ++ A
Subjt:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA

Query:  RELL
         + L
Subjt:  RELL

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655604.1e-3024.44Show/hide
Query:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIG
        R+ EA+ LF +M+    C P    +  ++ +LC  E   E ++    +      P+  +Y+ +I  LC   +F  A E+  +M   GL+P     N LI 
Subjt:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIG

Query:  DLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE--------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAE
          C     E A++ V +  +R+    L PN    +  I+         A+GV        ++P       LI   CR G    A R+L ++    L   +
Subjt:  DLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE--------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAE

Query:  ECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLG
          Y+ ++ +LC+ ++VEEA DLF  +  +G+ P + +Y ++I   CK G +D+A  + + M  K C+P+ +T++ALIH          A  L ++M+ +G
Subjt:  ECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLG

Query:  MSPHFHVYSLVDKLMREHGQIDLC-----------LKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQR
        + P     +++   + + G  D              K +       +Q  C+ G+L  A + +  M E G  P ++   +  + +   G+   A ++L+R
Subjt:  MSPHFHVYSLVDKLMREHGQIDLC-----------LKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQR

Query:  I
        +
Subjt:  I

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic8.5e-2823.35Show/hide
Query:  GFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALC------------
        GFR        +   LG+R+  D +  LL  + +   + +  TF+ICIR LGR G++ EA  + + M+ + GC PD + +  ++ ALC            
Subjt:  GFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALC------------

Query:  KKEPTGELIDTALTIFRRID-----------------------FPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKE-
        +K  TG      +T    +D                        PD  +++ ++  LCK G FG A +  D M   G++P     N LI  L  +   + 
Subjt:  KKEPTGELIDTALTIFRRID-----------------------FPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKE-

Query:  -----GAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCE
             G +E + V+ T   + V I +   KSG    A+  F       + P+       +  L + G+ +EA +I   ++   L      Y+++M+   +
Subjt:  -----GAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCE

Query:  HRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVD
          +++EA  L   M+  G +P + + NS+I  L K   +D+A ++F  M   +  P  VTY+ L+   G+      A  L + M+  G  P+   ++   
Subjt:  HRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVD

Query:  KLMREHGQIDLCLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIA
                              +   LCK  ++  A + L  M++ G  P ++  +       KNG++K A
Subjt:  KLMREHGQIDLCLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIA

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein7.9e-2924.76Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      +     +  +   PD  +Y+N+I GLCK  +F  A     +M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEM

Query:  DRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVI
           GL P     N LI   C                              K G ++ A  +   A     VP  F    LI  LC  G+T  A+ +    
Subjt:  DRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVI

Query:  EGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYS
         G  ++     Y+ +++ L     + EA+ L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD  T++ LIH Y        A  
Subjt:  EGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYS

Query:  LLQEMLSLGMSPHFHVY-SLVDKLMREHGQIDLCLKLEMKWEAQ----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKL
        +L  ML  G+ P  + Y SL++ L +     D+    +   E            +L+ LC+  +L+ A   L+ M  K   P           F KNG L
Subjt:  LLQEMLSLGMSPHFHVY-SLVDKLMREHGQIDLCLKLEMKWEAQ----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKL

Query:  KIARELLQRIDGVHKHETGT
          A  L ++++  +K  + T
Subjt:  KIARELLQRIDGVHKHETGT

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-3024.74Show/hide
Query:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        ++L  + +LGRA  L +     ++S L  GF  +  V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVR
        G +PD + +  +L  LCK   +   +D    +  R        YS +I  LCK G F  A+ +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVR

Query:  VRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFG
            R     +IP+V   S  I+    VF    +L          +L +E+   G   + I    +I+G    C E C             + EA+ +F 
Subjt:  VRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFG

Query:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC
         M+S+G +P +  Y+ +I   CK   +DD  R+F+ ++ K  +P+ +TY+ L+  + ++   +AA  L QEM+S G+ P    Y ++   + ++G+++  
Subjt:  RMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI
        L++  K +             I+  +C   +++ A+    ++ +KG  P +   +       K G L  A  L +++
Subjt:  LKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-3225.25Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD ++Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNI

Query:  LIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE---------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G+  EA+ +LK +E    
Subjt:  LIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE---------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKL

Query:  RCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEM
          +   Y+ ++   C+  +  EA ++F  M   G+      YN++I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  ++Q M
Subjt:  RCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEM

Query:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L +  +   A    + MLE+   PP  +  R  F       G ++ A
Subjt:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKLGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA

Query:  RELL
         + L
Subjt:  RELL

AT4G31850.1 proton gradient regulation 36.0e-2923.35Show/hide
Query:  GFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALC------------
        GFR        +   LG+R+  D +  LL  + +   + +  TF+ICIR LGR G++ EA  + + M+ + GC PD + +  ++ ALC            
Subjt:  GFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALC------------

Query:  KKEPTGELIDTALTIFRRID-----------------------FPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKE-
        +K  TG      +T    +D                        PD  +++ ++  LCK G FG A +  D M   G++P     N LI  L  +   + 
Subjt:  KKEPTGELIDTALTIFRRID-----------------------FPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKE-

Query:  -----GAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCE
             G +E + V+ T   + V I +   KSG    A+  F       + P+       +  L + G+ +EA +I   ++   L      Y+++M+   +
Subjt:  -----GAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCE

Query:  HRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVD
          +++EA  L   M+  G +P + + NS+I  L K   +D+A ++F  M   +  P  VTY+ L+   G+      A  L + M+  G  P+   ++   
Subjt:  HRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVD

Query:  KLMREHGQIDLCLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIA
                              +   LCK  ++  A + L  M++ G  P ++  +       KNG++K A
Subjt:  KLMREHGQIDLCLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIA

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein2.9e-3124.44Show/hide
Query:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIG
        R+ EA+ LF +M+    C P    +  ++ +LC  E   E ++    +      P+  +Y+ +I  LC   +F  A E+  +M   GL+P     N LI 
Subjt:  RVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIG

Query:  DLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE--------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAE
          C     E A++ V +  +R+    L PN    +  I+         A+GV        ++P       LI   CR G    A R+L ++    L   +
Subjt:  DLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIE--------PAVGVFWAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAE

Query:  ECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLG
          Y+ ++ +LC+ ++VEEA DLF  +  +G+ P + +Y ++I   CK G +D+A  + + M  K C+P+ +T++ALIH          A  L ++M+ +G
Subjt:  ECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLG

Query:  MSPHFHVYSLVDKLMREHGQIDLC-----------LKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQR
        + P     +++   + + G  D              K +       +Q  C+ G+L  A + +  M E G  P ++   +  + +   G+   A ++L+R
Subjt:  MSPHFHVYSLVDKLMREHGQIDLC-----------LKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQR

Query:  I
        +
Subjt:  I


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCACGAGCACGACTAGGAATGCTTCGGCGTTGCTCAAATCCATTACTCTTCAGTTTTTTGGCTTCTCTTCAAACTTTTTCGGCACTTCAACGACCACAAAGCA
CATTGCCATAGCTTCAAGAGCTCTGGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCCTCAGCTCTACCGATGTCGTCAATTCAGTATGTT
CTTTACTTGCGAACAAAAATCACCAAACAACTAATCTCGATCTTGATCATTTGTTGAAAAGGTTCAAAGAAACCTTAAATTCTGATCTCGTTCTTCAAATTCTGATGAAC
TATAGGCTGTTGGGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTTTCGGTTTGATGAGTCGGTGGTTGAGTACATGGCTGATTTCTTAGG
TAGAAGGAAACTGTTTGACGATATGAAGTGTCTTTTGGTGACGGTGTCGTCTCATAAGGGTCGGCTTTCTTGTAGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGC
AAGGGAGGGTTAGAGAAGCACTTTGCTTGTTTGAAGAAATGGAGCCAAAATTTGGGTGCAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAG
GAGCCAACTGGGGAATTGATTGATACTGCTCTAACAATTTTCAGAAGAATTGATTTTCCTGATAAATTTTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAG
GTTTGGTACAGCTATTGAAGTGTTTGATGAAATGGATAGGGGAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTTCCAAAG
AAGGGGCTATAGAAAAAGTTAGGGTCAGAAGTACTCGTAGACCCTTTACTGTTCTAATTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCAGCAGTTGGAGTTTTT
TGGGCAGCTTATAGGCTAGCTTTAGTTCCCAGTGCATTTGTAATAGTTCAGCTCATCTCGGAGCTTTGTCGATTAGGTCAAACGCAAGAAGCAATTAGAATATTGAAGGT
TATTGAGGGTGGCAAGCTAAGATGTGCTGAAGAGTGTTACTCCATTGTGATGCAAGCTTTGTGTGAACATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGC
TTTCTCAGGGTATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGG
AAAAGATGTGTGCCTGATCACGTTACTTATTCGGCATTAATCCATGCCTATGGTGAAACTAGGAATTGGTCGGCAGCCTACAGTTTATTGCAGGAAATGTTAAGTTTAGG
CATGTCTCCTCATTTTCATGTGTATAGTTTAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGC
AGAAGCTCTGTAAACTAGGACAACTAGAGGCTGCGTATGAGAAGCTCAAGGCAATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATGCTTTTGAGAGGGCA
TTTCAAAAGAATGGTAAGTTGAAGATTGCACGGGAGCTGCTGCAGAGGATAGATGGCGTCCACAAACATGAGACAGGAACCAAAAATTCATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGCACGAGCACGACTAGGAATGCTTCGGCGTTGCTCAAATCCATTACTCTTCAGTTTTTTGGCTTCTCTTCAAACTTTTTCGGCACTTCAACGACCACAAAGCA
CATTGCCATAGCTTCAAGAGCTCTGGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCCTCAGCTCTACCGATGTCGTCAATTCAGTATGTT
CTTTACTTGCGAACAAAAATCACCAAACAACTAATCTCGATCTTGATCATTTGTTGAAAAGGTTCAAAGAAACCTTAAATTCTGATCTCGTTCTTCAAATTCTGATGAAC
TATAGGCTGTTGGGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTTTCGGTTTGATGAGTCGGTGGTTGAGTACATGGCTGATTTCTTAGG
TAGAAGGAAACTGTTTGACGATATGAAGTGTCTTTTGGTGACGGTGTCGTCTCATAAGGGTCGGCTTTCTTGTAGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGC
AAGGGAGGGTTAGAGAAGCACTTTGCTTGTTTGAAGAAATGGAGCCAAAATTTGGGTGCAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAG
GAGCCAACTGGGGAATTGATTGATACTGCTCTAACAATTTTCAGAAGAATTGATTTTCCTGATAAATTTTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAG
GTTTGGTACAGCTATTGAAGTGTTTGATGAAATGGATAGGGGAGGTTTGGTACCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTTCCAAAG
AAGGGGCTATAGAAAAAGTTAGGGTCAGAAGTACTCGTAGACCCTTTACTGTTCTAATTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCAGCAGTTGGAGTTTTT
TGGGCAGCTTATAGGCTAGCTTTAGTTCCCAGTGCATTTGTAATAGTTCAGCTCATCTCGGAGCTTTGTCGATTAGGTCAAACGCAAGAAGCAATTAGAATATTGAAGGT
TATTGAGGGTGGCAAGCTAAGATGTGCTGAAGAGTGTTACTCCATTGTGATGCAAGCTTTGTGTGAACATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGC
TTTCTCAGGGTATGAAGCCAAAGTTGGCTATTTACAATTCTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGG
AAAAGATGTGTGCCTGATCACGTTACTTATTCGGCATTAATCCATGCCTATGGTGAAACTAGGAATTGGTCGGCAGCCTACAGTTTATTGCAGGAAATGTTAAGTTTAGG
CATGTCTCCTCATTTTCATGTGTATAGTTTAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGC
AGAAGCTCTGTAAACTAGGACAACTAGAGGCTGCGTATGAGAAGCTCAAGGCAATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATGCTTTTGAGAGGGCA
TTTCAAAAGAATGGTAAGTTGAAGATTGCACGGGAGCTGCTGCAGAGGATAGATGGCGTCCACAAACATGAGACAGGAACCAAAAATTCATCATGA
Protein sequenceShow/hide protein sequence
MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQILMN
YRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK
EPTGELIDTALTIFRRIDFPDKFSYSNIIIGLCKFGRFGTAIEVFDEMDRGGLVPTRSAVNILIGDLCSLSSKEGAIEKVRVRSTRRPFTVLIPNVNPKSGAIEPAVGVF
WAAYRLALVPSAFVIVQLISELCRLGQTQEAIRILKVIEGGKLRCAEECYSIVMQALCEHRQVEEASDLFGRMLSQGMKPKLAIYNSVICMLCKLGNLDDAERVFKIMNR
KRCVPDHVTYSALIHAYGETRNWSAAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKLGQLEAAYEKLKAMLEKGSYPPIYVRDAFERA
FQKNGKLKIARELLQRIDGVHKHETGTKNSS