; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021760 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021760
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr7:11601810..11603555
RNA-Seq ExpressionLag0021760
SyntenyLag0021760
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]3.3e-28785.07Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F GFSS+FF TS TTKHIAIA RAL RRPTSRTAP PR+ +T+ S+DVVNSVCSLL+NKN QT NLD++HLLKRFK+ L+SDLVL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGE+IDTAL IFRRI+ PDKYSYSN+IIGLCKFGR+ TAIE F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRVRST
Subjt:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L L+PS+FV VQLISELCR+GQMQEAI++LKV+E  KLRCAEECYS+VM+ LCEHR ++EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN+VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE RNWS AY LL+EMLSLGMSPHFHVYSLVDKLMREHGQ+DLCLKLE
Subjt:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        MKWEAQILQKLCKQGQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK GK KIARELLQ++DGVH+HE+GT+NSS
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]7.3e-28786.57Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+T K  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAI+ AVGVFWAANRLAL+PS FVIV+LISELCRLGQMQEAIR+LKV+E  KLRC EECYSIVMQ LCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYN VICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWS AYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ ++ +S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]4.3e-28786.75Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+TTK  AIA RALARRPTSRTA IPRALD    TD V+SVCSLL+NK+HQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRFGTA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+AL+PSAFVIV+LISELCRLGQMQEAIR+LKV+E  KLRC EECYSIVMQ LCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYN VICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWS AYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCKQGQL AAYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ T+ +S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo]5.1e-28886.4Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSSN F TS+TTK  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQT NL+LDHLLKRFKET++
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWS LQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLAL+PSAFVIV+LI ELCRLGQMQEAIR+LKV+E  KLRC EECYSIVMQ LCEHR+V+EASDL 
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYN VICMLCKLGNLDDAERVFKIMNRK+CVPDHVTYSALIHAYGE RNWS  YSLL++MLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+G++ +S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]4.3e-30388.98Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLS +T RNASA LK   L F+GFSS+FF TST TKHIAIA RALARRPTSRTAPIPRA DTL S+DVVNSVCSLL+NKNHQT NLDLDHLLKRFK+TL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMG+RFDE+VVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTAL+IFRRI+ PDKYSYSN+IIGLCKFGRFGTAIEVFDEM+RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAIEPAVG+FWAAN+LAL+PSAFVIVQLISELCRLGQMQEAI++LKV+EG KLRCAEECYS+VM+ LCEHR VEEASDLF
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GR+LSQGMKPKLAIYN +ICMLCK+GNL+DAERVFKIMNRKRC PDHVTYS+LIHAYGETRNWS AYSLL+EMLSLGMSPHFH+YSLVDKLMREHGQIDL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWEAQILQKLCK GQL+AAYEK+K+MLEKG YPPIYVRD+FE AFQK GK KIARELLQ+IDGVH+HE+GT+NSS
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein1.5e-27486.13Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F G SS+FF TS TT HIAIA RALARRPTSRTAP PR+ +TL S+DVVNSVCSLL+NKN QT NLDLDHLLKRFK+ L+SD VL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGE+IDTAL IFRRI+ PDKYSYSN+IIGLCKFGR+ TAIE F EM RAGLVPTR+AVNILIG+LCSLSAKEGA+EKVRV ST
Subjt:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L+L+PS+FV VQLISELCRLGQMQEAIR+LKV+EG KLRCAEECYS+VM+ LCEHR V+EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN+VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE R+WS AY LL+EMLSLGMSPHFHVYS+VDKLMREHGQIDLCLKLE
Subjt:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQK
        MKWEAQILQKLCKQGQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.6e-28785.07Show/hide
Query:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL
        T R++SALLK ++L F GFSS+FF TS TTKHIAIA RAL RRPTSRTAP PR+ +T+ S+DVVNSVCSLL+NKN QT NLD++HLLKRFK+ L+SDLVL
Subjt:  TTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVL

Query:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        QILMNY+LLGRAKTLEFFSWSGLQMGFRFD SVVEYMADFLGRRKLFDDMKCLLVTV SHKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST
        DNLVFNNMLYALCKKEPTGE+IDTAL IFRRI+ PDKYSYSN+IIGLCKFGR+ TAIE F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRVRST
Subjt:  DNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRST

Query:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS
         RPFTVLVPNVNPKSGAIEPAVG+FWAAN+L L+PS+FV VQLISELCR+GQMQEAI++LKV+E  KLRCAEECYS+VM+ LCEHR ++EASDLFGRMLS
Subjt:  RRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLS

Query:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE
        QGMKPKLAIYN+VICMLCKLGNLD AERVF IMN+KRC PDHVTYSALIHAYGE RNWS AY LL+EMLSLGMSPHFHVYSLVDKLMREHGQ+DLCLKLE
Subjt:  QGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLE

Query:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        MKWEAQILQKLCKQGQLEAAYEK+K+MLEKG  PPIYVRDAFE AFQK GK KIARELLQ++DGVH+HE+GT+NSS
Subjt:  MKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like8.2e-28487.15Show/hide
Query:  TRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQ
        +RN S LLK  TL F  FSSNFFGTSTT   IAIA R  ARRPTSR+AP+PRALDTLSSTDVVNSVCSLL+NKNHQTTNLDLD LLKRF E L+SDLVL+
Subjt:  TRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQ

Query:  ILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
        ILMNYR+LGRAKTLEFFSWSGLQMG+RFDESVVEYMADF GRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD

Query:  NLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
        NLVFNN+LYALCKKE TGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRF TA+EVF+EM+R G VPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
Subjt:  NLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR

Query:  RPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQ
        RPFTVLVPNVN KSGAIEPAVGVFWAANR+AL+PS+FV+VQLISELCRLGQMQEAI +LKV+E GKLRC EEC+SIVMQ LCE+RQVEEASDLFGRMLSQ
Subjt:  RPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQ

Query:  GMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEM
        GMKPKLA+YN VICMLCKLGN+ DAERVFKIMNRKRCVPD VTYSALIHAY E  NWS AYSLL+EMLSLGMSPHFH+YS VDKLMREHGQ+DLCLKLEM
Subjt:  GMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEM

Query:  KWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHE
        KWEAQILQKLCKQGQLEAAYEKLK+MLEKG +PP YVRDAFE AFQKNGK KIARELL++I GVH  E
Subjt:  KWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHE

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like3.5e-28786.57Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+T K  AIA RALARRPTSRTAPIPRALD    TD V+SVCSLL+NKNHQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAI+ AVGVFWAANRLAL+PS FVIV+LISELCRLGQMQEAIR+LKV+E  KLRC EECYSIVMQ LCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYN VICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWS AYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCKQGQL  AYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ ++ +S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like2.1e-28786.75Show/hide
Query:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN
        MLSTST RNASA LK +    +GFSS    TS+TTK  AIA RALARRPTSRTA IPRALD    TD V+SVCSLL+NK+HQTTNL+LDHLLKRFKETL+
Subjt:  MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLN

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SD VLQILMNYRL GRAKTLEFFSWSGLQMG+RFDESVVEYMADFLGRRKLFDDMKCLLVTVSS+KGR+SCRTFSICIRFLGRQGRVREALCLFEEMEP 
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGE+IDTALTIFRRI+ PDKYSYSNIIIGLCKFGRFGTA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF
        RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANR+AL+PSAFVIV+LISELCRLGQMQEAIR+LKV+E  KLRC EECYSIVMQ LCEHR+V+EASDLF
Subjt:  RVRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLF

Query:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL
        GRMLSQ MKPKLAIYN VICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE RNWS AYSLL+EMLSLG+SPHFHVYS+VDKLMRE GQ DL
Subjt:  GRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS
        CLKLEMKWE+QILQKLCKQGQL AAYEKLK+MLEKG YPPIYVRDAFE AFQK GK KIARELLQ +DGVH+HE+ T+ +S
Subjt:  CLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRIDGVHKHETGTKNSS

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200902.2e-2823.49Show/hide
Query:  SGLQMG-FRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK--------------------
        S  +MG F+  +S +  M +       FD ++ LL  +      +  R+F +  R  G+     +A+ LF  M  +F CK                    
Subjt:  SGLQMG-FRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK--------------------

Query:  -------------------PDNLVFNNMLYALCKKEPTGEMIDTALTIFR----RIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVN
                           P+ L FN ++ ALCK       +D A+ +FR    R   PD Y+Y  ++ GLCK  R   A+ + DEM   G  P+    N
Subjt:  -------------------PDNLVFNNMLYALCKKEPTGEMIDTALTIFR----RIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVN

Query:  ILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKS---------GAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGK
        +LI  LC    K+G + +V            VPN    +         G ++ AV +         IP+      LI+ L +  +  +A+R+L  +E   
Subjt:  ILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKS---------GAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGK

Query:  LRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGET-------RNWST
            +  YS+++  L +  + EEA  L+ +M  +G KP + +Y+ ++  LC+ G  ++A+ +   M    C+P+  TYS+L+  + +T       + W  
Subjt:  LRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGET-------RNWST

Query:  -----------AYSLL-----------------QEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKL--EMKWEAQ------------ILQKLCKQGQLE
                    YS+L                  +ML++G+ P    YS + K +   G +D  LKL  EM  + +            +L  LC Q  + 
Subjt:  -----------AYSLL-----------------QEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKL--EMKWEAQ------------ILQKLCKQGQLE

Query:  AAYEKLKAMLEKGSYPPIYVRDAFERAF-QKNGKLKIARELLQRI
         A + L +ML++G  P +   + F     +K+      R  L+ +
Subjt:  AAYEKLKAMLEKGSYPPIYVRDAFERAF-QKNGKLKIARELLQRI

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial1.0e-2824.32Show/hide
Query:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        ++L  + +LGRA  L +     ++S L  GF  +  V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR
        G +PD + +  +L  LCK   +   +D    +  R        YS +I  LCK G F  A+ +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  GCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFG
            R     ++P+V   S  I+    VF    +L          +L +E+   G   + I    +I+G    C E C             + EA+ +F 
Subjt:  VRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFG

Query:  RMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC
         M+S+G +P +  Y+ +I   CK   +DD  R+F+ ++ K  +P+ +TY+ L+  + ++   + A  L QEM+S G+ P    Y ++   + ++G+++  
Subjt:  RMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI
        L++  K +             I+  +C   +++ A+    ++ +KG  P +   +       K G L  A  L +++
Subjt:  LKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397102.2e-2825.74Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDF----PDKYSYSNIIIGLCKFGRFGTAIEVFDEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R +      P+  SY+ +I GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDF----PDKYSYSNIIIGLCKFGRFGTAIEVFDEM

Query:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQ
        +R G        N LI   C    KEG   +  V         L P+V           K+G +  A+          L P+      L+    + G M 
Subjt:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQ

Query:  EAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE
        EA R+L+ +       +   Y+ ++   C   ++E+A  +   M  +G+ P +  Y+ V+   C+  ++D+A RV + M  K   PD +TYS+LI  + E
Subjt:  EAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE

Query:  TRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKI
         R    A  L +EML +G+ P    Y+                         ++   C +G LE A +    M+EKG  P +           K  + + 
Subjt:  TRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKI

Query:  ARELLQRI
        A+ LL ++
Subjt:  ARELLQRI

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011107.7e-2924.38Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRI
        L P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +     LIP   +   LI   CR G +  A+ +
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRI

Query:  L-KVIEGGKLRCAEE--CYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETR
          ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F+ M  KR   D VTY+ L+  +G+  
Subjt:  L-KVIEGGKLRCAEE--CYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETR

Query:  NWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIAR
        +  TA  +  +M+S  + P    YS+                        ++  LC +G L  A+     M+ K   P + + ++  + + ++G      
Subjt:  NWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIAR

Query:  ELLQRI
          L+++
Subjt:  ELLQRI

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic4.8e-3125.5Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKL

Query:  RCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEM
          +   Y+ ++   C+  +  EA ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  ++Q M
Subjt:  RCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEM

Query:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  R  F       G ++ A
Subjt:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA

Query:  RELL
         + L
Subjt:  RELL

Arabidopsis top hitse value%identityAlignment
AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein7.1e-3024.32Show/hide
Query:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF
        ++L  + +LGRA  L +     ++S L  GF  +  V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  QILMNYRLLGRAKTLEF----FSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKF

Query:  GCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR
        G +PD + +  +L  LCK   +   +D    +  R        YS +I  LCK G F  A+ +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  GCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFG
            R     ++P+V   S  I+    VF    +L          +L +E+   G   + I    +I+G    C E C             + EA+ +F 
Subjt:  VRSTRRPFTVLVPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFG

Query:  RMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC
         M+S+G +P +  Y+ +I   CK   +DD  R+F+ ++ K  +P+ +TY+ L+  + ++   + A  L QEM+S G+ P    Y ++   + ++G+++  
Subjt:  RMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI
        L++  K +             I+  +C   +++ A+    ++ +KG  P +   +       K G L  A  L +++
Subjt:  LKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIARELLQRI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-3225.5Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKL
        LI  LC  +  E A E  RV +++     ++P+V   +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKSGAIE---------PAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKL

Query:  RCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEM
          +   Y+ ++   C+  +  EA ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD  TY++L+  +    +   A  ++Q M
Subjt:  RCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETRNWSTAYSLLQEM

Query:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  R  F       G ++ A
Subjt:  LSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKLKAMLEKGSYPP--IYVRDAFERAFQKNGKLKIA

Query:  RELL
         + L
Subjt:  RELL

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-2923.49Show/hide
Query:  SGLQMG-FRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK--------------------
        S  +MG F+  +S +  M +       FD ++ LL  +      +  R+F +  R  G+     +A+ LF  M  +F CK                    
Subjt:  SGLQMG-FRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK--------------------

Query:  -------------------PDNLVFNNMLYALCKKEPTGEMIDTALTIFR----RIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVN
                           P+ L FN ++ ALCK       +D A+ +FR    R   PD Y+Y  ++ GLCK  R   A+ + DEM   G  P+    N
Subjt:  -------------------PDNLVFNNMLYALCKKEPTGEMIDTALTIFR----RIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVN

Query:  ILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKS---------GAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGK
        +LI  LC    K+G + +V            VPN    +         G ++ AV +         IP+      LI+ L +  +  +A+R+L  +E   
Subjt:  ILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKS---------GAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGK

Query:  LRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGET-------RNWST
            +  YS+++  L +  + EEA  L+ +M  +G KP + +Y+ ++  LC+ G  ++A+ +   M    C+P+  TYS+L+  + +T       + W  
Subjt:  LRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGET-------RNWST

Query:  -----------AYSLL-----------------QEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKL--EMKWEAQ------------ILQKLCKQGQLE
                    YS+L                  +ML++G+ P    YS + K +   G +D  LKL  EM  + +            +L  LC Q  + 
Subjt:  -----------AYSLL-----------------QEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKL--EMKWEAQ------------ILQKLCKQGQLE

Query:  AAYEKLKAMLEKGSYPPIYVRDAFERAF-QKNGKLKIARELLQRI
         A + L +ML++G  P +   + F     +K+      R  L+ +
Subjt:  AAYEKLKAMLEKGSYPPIYVRDAFERAF-QKNGKLKIARELLQRI

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-3024.38Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAG
        T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM R+G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRI
        L P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +     LIP   +   LI   CR G +  A+ +
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVL--------VPNVNPKSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQEAIRI

Query:  L-KVIEGGKLRCAEE--CYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETR
          ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F+ M  KR   D VTY+ L+  +G+  
Subjt:  L-KVIEGGKLRCAEE--CYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGETR

Query:  NWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIAR
        +  TA  +  +M+S  + P    YS+                        ++  LC +G L  A+     M+ K   P + + ++  + + ++G      
Subjt:  NWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKIAR

Query:  ELLQRI
          L+++
Subjt:  ELLQRI

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-2925.74Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDF----PDKYSYSNIIIGLCKFGRFGTAIEVFDEM
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID    + R +      P+  SY+ +I GLC+ GR      V  EM
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGEMIDTALTIFRRIDF----PDKYSYSNIIIGLCKFGRFGTAIEVFDEM

Query:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQ
        +R G        N LI   C    KEG   +  V         L P+V           K+G +  A+          L P+      L+    + G M 
Subjt:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNP---------KSGAIEPAVGVFWAANRLALIPSAFVIVQLISELCRLGQMQ

Query:  EAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE
        EA R+L+ +       +   Y+ ++   C   ++E+A  +   M  +G+ P +  Y+ V+   C+  ++D+A RV + M  K   PD +TYS+LI  + E
Subjt:  EAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNRKRCVPDHVTYSALIHAYGE

Query:  TRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKI
         R    A  L +EML +G+ P    Y+                         ++   C +G LE A +    M+EKG  P +           K  + + 
Subjt:  TRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERAFQKNGKLKI

Query:  ARELLQRI
        A+ LL ++
Subjt:  ARELLQRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCACGAGCACGACTAGGAATGCTTCGGCGTTGCTCAAATCCATTACTCTTCAGTTTTTTGGCTTCTCTTCAAACTTTTTCGGCACTTCAACCACCACAAAGCA
CATTGCCATAGCTTCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCCTCAGCTCTACCGATGTCGTCAATTCAGTATGTT
CTTTACTTGCGAACAAAAATCACCAAACAACTAATCTCGATCTTGATCATTTGTTGAAAAGGTTCAAAGAAACCTTAAATTCTGATCTCGTTCTTCAAATTCTGATGAAC
TATAGGCTGTTGGGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTTTCGGTTTGATGAGTCAGTGGTTGAGTACATGGCTGATTTCTTAGG
TAGAAGGAAACTGTTTGACGATATGAAGTGTCTTTTGGTGACGGTGTCGTCTCATAAGGGTCGGCTTTCTTGTAGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGC
AAGGGAGGGTTAGAGAAGCGCTTTGCTTGTTTGAAGAAATGGAGCCAAAATTTGGGTGCAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAG
GAGCCAACTGGGGAAATGATTGATACTGCTCTAACAATTTTCAGAAGAATTGATTTTCCTGATAAATATTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAG
GTTTGGTACAGCTATTGAAGTGTTTGATGAAATGGATAGGGCAGGTTTGGTACCTACTCGATCTGCTGTAAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAAG
AAGGGGCTATAGAAAAAGTTAGGGTCAGAAGTACTCGTCGACCTTTTACTGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCTGCAGTTGGAGTTTTT
TGGGCAGCTAATAGGCTGGCTTTAATTCCCAGTGCATTTGTAATAGTTCAGCTCATCTCGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGAATACTGAAGGT
TATTGAGGGTGGCAAGCTAAGATGTGCTGAAGAGTGTTACTCCATTGTGATGCAAACTTTGTGTGAACATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGC
TTTCTCAGGGTATGAAGCCAAAGTTGGCTATTTATAATTTTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGG
AAAAGATGTGTGCCTGATCACGTTACTTATTCGGCATTAATCCATGCCTATGGTGAAACTAGGAATTGGTCGACAGCCTACAGTTTATTGCAGGAAATGTTAAGTTTAGG
CATGTCTCCTCATTTTCATGTGTATAGTTTAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGC
AGAAGCTCTGTAAACAAGGACAACTAGAGGCTGCGTATGAGAAGCTCAAGGCAATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATGCTTTTGAGAGGGCA
TTTCAAAAGAATGGTAAGTTGAAGATTGCACGGGAGCTGCTGCAGAGGATAGATGGCGTCCACAAACATGAGACAGGAACCAAAAATTCATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGCACGAGCACGACTAGGAATGCTTCGGCGTTGCTCAAATCCATTACTCTTCAGTTTTTTGGCTTCTCTTCAAACTTTTTCGGCACTTCAACCACCACAAAGCA
CATTGCCATAGCTTCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCCTCAGCTCTACCGATGTCGTCAATTCAGTATGTT
CTTTACTTGCGAACAAAAATCACCAAACAACTAATCTCGATCTTGATCATTTGTTGAAAAGGTTCAAAGAAACCTTAAATTCTGATCTCGTTCTTCAAATTCTGATGAAC
TATAGGCTGTTGGGTAGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTTTCGGTTTGATGAGTCAGTGGTTGAGTACATGGCTGATTTCTTAGG
TAGAAGGAAACTGTTTGACGATATGAAGTGTCTTTTGGTGACGGTGTCGTCTCATAAGGGTCGGCTTTCTTGTAGAACGTTTTCAATTTGTATCAGATTTTTGGGTAGGC
AAGGGAGGGTTAGAGAAGCGCTTTGCTTGTTTGAAGAAATGGAGCCAAAATTTGGGTGCAAACCTGATAATTTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAG
GAGCCAACTGGGGAAATGATTGATACTGCTCTAACAATTTTCAGAAGAATTGATTTTCCTGATAAATATTCATACAGTAATATAATTATAGGATTGTGTAAATTTGGTAG
GTTTGGTACAGCTATTGAAGTGTTTGATGAAATGGATAGGGCAGGTTTGGTACCTACTCGATCTGCTGTAAACATTCTCATTGGGGATTTGTGTTCATTGAGTGCCAAAG
AAGGGGCTATAGAAAAAGTTAGGGTCAGAAGTACTCGTCGACCTTTTACTGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCTGCAGTTGGAGTTTTT
TGGGCAGCTAATAGGCTGGCTTTAATTCCCAGTGCATTTGTAATAGTTCAGCTCATCTCGGAGCTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAGAATACTGAAGGT
TATTGAGGGTGGCAAGCTAAGATGTGCTGAAGAGTGTTACTCCATTGTGATGCAAACTTTGTGTGAACATCGTCAGGTTGAAGAAGCTAGTGATCTGTTTGGGAGGATGC
TTTCTCAGGGTATGAAGCCAAAGTTGGCTATTTATAATTTTGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAGGGTCTTCAAGATTATGAACAGG
AAAAGATGTGTGCCTGATCACGTTACTTATTCGGCATTAATCCATGCCTATGGTGAAACTAGGAATTGGTCGACAGCCTACAGTTTATTGCAGGAAATGTTAAGTTTAGG
CATGTCTCCTCATTTTCATGTGTATAGTTTAGTGGATAAACTAATGAGGGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGC
AGAAGCTCTGTAAACAAGGACAACTAGAGGCTGCGTATGAGAAGCTCAAGGCAATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATGCTTTTGAGAGGGCA
TTTCAAAAGAATGGTAAGTTGAAGATTGCACGGGAGCTGCTGCAGAGGATAGATGGCGTCCACAAACATGAGACAGGAACCAAAAATTCATCATGA
Protein sequenceShow/hide protein sequence
MLSTSTTRNASALLKSITLQFFGFSSNFFGTSTTTKHIAIASRALARRPTSRTAPIPRALDTLSSTDVVNSVCSLLANKNHQTTNLDLDHLLKRFKETLNSDLVLQILMN
YRLLGRAKTLEFFSWSGLQMGFRFDESVVEYMADFLGRRKLFDDMKCLLVTVSSHKGRLSCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKK
EPTGEMIDTALTIFRRIDFPDKYSYSNIIIGLCKFGRFGTAIEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTVLVPNVNPKSGAIEPAVGVF
WAANRLALIPSAFVIVQLISELCRLGQMQEAIRILKVIEGGKLRCAEECYSIVMQTLCEHRQVEEASDLFGRMLSQGMKPKLAIYNFVICMLCKLGNLDDAERVFKIMNR
KRCVPDHVTYSALIHAYGETRNWSTAYSLLQEMLSLGMSPHFHVYSLVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKLKAMLEKGSYPPIYVRDAFERA
FQKNGKLKIARELLQRIDGVHKHETGTKNSS