CuGenDBv2

Lsi07G014620 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G014620
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr07:21281174..21296419
RNA-Seq Expression: Lsi07G014620
Synteny: Lsi07G014620
Gene Ontology terms:
GO:0018026 - peptidyl-lysine monomethylation (biological process)
GO:0005515 - protein binding (molecular function)
GO:0016279 - protein-lysine N-methyltransferase activity (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR015353 - Rubisco LSMT, substrate-binding domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004145716.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Cucumis sativus] (e value: 4.9e-284, % identity: 89.35)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        M CYFRSN+FK ISL TPISIVPLRFIFAV++P+QSYTVTPPIKPWPQRL+P RLVAMI RQQNLDLALQIFHYAGK+HP F+HNYDTYHAII+RLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLLELQ+SGINC EDLFITVIRSYGLA RPK+A+KTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFKYS+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKNDVEGARKVFDEMP+MG+VPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGY+KQGRFTDAVKVMDEMEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVII  Y +E+KSGEALNLL+DMLEKKYIP+SALCCKVIDVLCGEGRVKEACK+WEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERG+ISS
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVG A+EVIKV EEMLDKGCL NESTY IL EGLLKLGK  E  NILSM ISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR
        GA DFKAW+LFVP FV+N++EQAN+L+KILIET+R
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR

XP_008450076.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Cucumis melo] (e value: 8.1e-287, % identity: 89.91)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        M CYFRSN+FK ISL TPISIVPLRFIFA+++P+QSYTVTPPIKPWPQRL+PKRLVAMI RQQNLDLALQIFHYAGKFHP FSHNYDTYHAIIHRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLLELQ++GINC EDLFITVIRSYGLAGRPK+A+KTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFKYS+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGYIK+GRFTDAVKVMDEMEENGVEPNDVTY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVII AY +E+KSGEALNLL+DMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFE G+ISS
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLI GMCE+GELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLK+G AEEVIKV EEMLDKGCL NESTYS+L EGLLKLGKG E FNILSM IS+
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR
        GA DFKAWH  +P FV+N++EQ NML+KILIET+R
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR

XP_022154201.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Momordica charantia] (e value: 1.5e-288, % identity: 90.81)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MWCY   NK K ISL + I+IVPLRFIFAVDS +QSYTVTPPIKPWPQRLYP+RLV+MII QQNLDLALQIFH+AGKFHPGFSHNYDTYHAI+HRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLL+LQ+SGI CGEDLFITVIRSYGLAGRPKLAVKTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFK+S SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKND+EGARKVFDEMPAMGMVPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENG++PND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWE+LLKKNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFL+VG AEEVIKV EEMLDKGCLPNESTYSILAEGLLKLGKGGEF NILSMFISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET
        G  DFK WHLFVP+FV NMDEQ+N+L+KILIET
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET

XP_022982687.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Cucurbita maxima] (e value: 1.1e-283, % identity: 89.87)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MW  FR NKF P SL +PIS + LRFIF+VDS +QSYTVTPPIKPWPQRLYPKRLVAM+IRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLL EL+NSGINCGEDLFI+VIR+YGLA RPK+AVKTF+RIQTFGVRRSVRSLNTLLNALVQN RFS VHLLFK+SRSKFGVV NVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALC+KNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYV+R DM  AKRVFGELFDHGWLPDATTYTILM+GYIK GRFT+AVKVMDEMEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKY+PSSALCCKVIDVLC EGRVKE CKLWEKLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+P+EFTYNMLIKGFLKVG AEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEF NILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET
        G  D K WHLFVP+FV NMDEQANML+KILIET
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET

XP_038893383.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Benincasa hispida] (e value: 4.7e-303, % identity: 95.51)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MWCYFRSNKFK ISLRTPI IVPLRFIFAVDSP+QSYTVTPPIKPWPQRLYPKRLV+MIIRQQNLDLALQIFHYAGK+HPGFSHNYDTYHAII+RLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFKYSRSKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
         LCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKYIPSSALCCKVIDVLC EGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PN+FTYNMLIKGFLKVG AEEVIK+AEEMLDKGCLPNESTYSILAEGLLKLGKGGEF NILSMFISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR
        G  DFKAWHLFVPQFV+N+DEQANMLDKILIET R
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LDM6 Uncharacterized protein (e value: 2.4e-284, % identity: 89.35)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        M CYFRSN+FK ISL TPISIVPLRFIFAV++P+QSYTVTPPIKPWPQRL+P RLVAMI RQQNLDLALQIFHYAGK+HP F+HNYDTYHAII+RLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLLELQ+SGINC EDLFITVIRSYGLA RPK+A+KTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFKYS+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKNDVEGARKVFDEMP+MG+VPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGY+KQGRFTDAVKVMDEMEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVII  Y +E+KSGEALNLL+DMLEKKYIP+SALCCKVIDVLCGEGRVKEACK+WEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERG+ISS
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVG A+EVIKV EEMLDKGCL NESTY IL EGLLKLGK  E  NILSM ISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR
        GA DFKAW+LFVP FV+N++EQAN+L+KILIET+R
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR

A0A1S3BPE8 pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 3.9e-287, % identity: 89.91)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        M CYFRSN+FK ISL TPISIVPLRFIFA+++P+QSYTVTPPIKPWPQRL+PKRLVAMI RQQNLDLALQIFHYAGKFHP FSHNYDTYHAIIHRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLLELQ++GINC EDLFITVIRSYGLAGRPK+A+KTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFKYS+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGYIK+GRFTDAVKVMDEMEENGVEPNDVTY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVII AY +E+KSGEALNLL+DMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFE G+ISS
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLI GMCE+GELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLK+G AEEVIKV EEMLDKGCL NESTYS+L EGLLKLGKG E FNILSM IS+
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR
        GA DFKAWH  +P FV+N++EQ NML+KILIET+R
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIETYR

A0A6J1DN12 pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 7.1e-289, % identity: 90.81)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MWCY   NK K ISL + I+IVPLRFIFAVDS +QSYTVTPPIKPWPQRLYP+RLV+MII QQNLDLALQIFH+AGKFHPGFSHNYDTYHAI+HRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLLL+LQ+SGI CGEDLFITVIRSYGLAGRPKLAVKTF+RIQTFGVRRSVRSLNTLLNALVQNNRFS VHLLFK+S SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALCKKND+EGARKVFDEMPAMGMVPNVVTYTTILGGYVSR DM GAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENG++PND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWE+LLKKNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFL+VG AEEVIKV EEMLDKGCLPNESTYSILAEGLLKLGKGGEF NILSMFISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET
        G  DFK WHLFVP+FV NMDEQ+N+L+KILIET
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET

A0A6J1F4B0 pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 1.4e-281, % identity: 89.49)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MW   R NKF P S   PI+ +PLRFIFAVDS +QSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGK+HPGFSHNYDTYHAIIHRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLL EL+NSGINCGEDLFI+VIR+YGLAGRPK+AVK F+RIQTFGVRRSVRSLNTLLNALVQN RFS VHLLFK+SRSKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALC+KNDVEGARKV DEMPAMGMVPNVVTYTTILGGYV+R DM  AKRVFGELFDHGWLPDATTYTILM+GYIK GRFT+AVKVMDEMEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKY+PSSALCCKVIDVLCGEGRVKE CKLW KLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFLKVG AEEVI+VAEEMLDKGCLPNESTYSILAE LLKLGK GEF NILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET
        G  D KAWHLFVP+FV NMDEQANML+KILIET
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET

A0A6J1J5I0 pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 5.3e-284, % identity: 89.87)
Query:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
        MW  FR NKF P SL +PIS + LRFIF+VDS +QSYTVTPPIKPWPQRLYPKRLVAM+IRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR
Subjt:  MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK
        AFEPVESLL EL+NSGINCGEDLFI+VIR+YGLA RPK+AVKTF+RIQTFGVRRSVRSLNTLLNALVQN RFS VHLLFK+SRSKFGVV NVFTCNILIK
Subjt:  AFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY
        ALC+KNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYV+R DM  AKRVFGELFDHGWLPDATTYTILM+GYIK GRFT+AVKVMDEMEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTY

Query:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAY REKKSGEALNLL+DMLEKKY+PSSALCCKVIDVLC EGRVKE CKLWEKLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+P+EFTYNMLIKGFLKVG AEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEF NILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISS

Query:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET
        G  D K WHLFVP+FV NMDEQANML+KILIET
Subjt:  GAADFKAWHLFVPQFVTNMDEQANMLDKILIET

SwissProt top hits (e value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090 (e value: 1.4e-55, % identity: 31.98)
Query:  TYHAIIHRLSRARAFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIR-IQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKY---SR
        T  ++I   + +  F+ VE LL  ++       E  FI V R+YG A  P  AV  F R +  F  +RSV+S N++LN ++    +      + Y   S 
Subjt:  TYHAIIHRLSRARAFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIR-IQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKY---SR

Query:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVK
            + PN  + N++IKALCK   V+ A +VF  MP    +P+  TY T++ G      +  A  +  E+   G  P    Y +L+DG  K+G  T   K
Subjt:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVK

Query:  VMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI
        ++D M   G  PN+VTY  +I     + K  +A++LL  M+  K IP+      +I+ L  + R  +A +L   + ++    +  I S LI  L KEG  
Subjt:  VMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI

Query:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLL
         EA  L+ +  E+G   +++ Y+ L+ G+C  G+  EA  + + M+  GC+PN +TY+ L+KGF K G  EE ++V +EM   GC  N+  YS+L +GL 
Subjt:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLL

Query:  KLGKGGEFFNILSMFISSG
         +G+  E   + S  ++ G
Subjt:  KLGKGGEFFNILSMFISSG

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 (e value: 1.6e-56, % identity: 28.43)
Query:  LYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLEL-QNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQ
        L PK + A+I  Q++   AL++F+   K   GF H   TY ++I +L     FE +E +L+++ +N G +  E +++  +++YG  G+ + AV  F R+ 
Subjt:  LYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLEL-QNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQ

Query:  TFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKR
         +    +V S N +++ LV +  F   H ++   R + G+ P+V++  I +K+ CK +    A ++ + M + G   NVV Y T++GG+      A    
Subjt:  TFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKR

Query:  VFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRV
        +FG++   G     +T+  L+    K+G   +  K++D++ + GV PN  TY + I+   +  +   A+ ++  ++E+   P       +I  LC   + 
Subjt:  VFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRV

Query:  KEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNE-FERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFL
        +EA     K++ +   PD+   +TLI   CK G +  A ++  +    G +    TY +LI G+C  GE   A  L+++ L KG  PN   YN LIKG  
Subjt:  KEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNE-FERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFL

Query:  KVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSG-AADFKAWHLFVPQFVT--NMDEQANMLDKIL-------IETYRSL
          G   E  ++A EM +KG +P   T++IL  GL K+G   +   ++ + IS G   D   +++ +  + T   M+    +LD +L       + TY SL
Subjt:  KVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSG-AADFKAWHLFVPQFVT--NMDEQANMLDKIL-------IETYRSL

Query:  LTG
        L G
Subjt:  LTG

Q9FFE3 Pentatricopeptide repeat-containing protein At5g16420, mitochondrial (e value: 4.1e-193, % identity: 63.35)
Query:  IQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNS--GINCGEDLFITVIRS
        +Q Y T  PPIKPWPQRL+PKRLV+MI +QQN+DLALQIF YAGK HPGF+HNYDTYH+I+ +LSRARAF+PVESL+ +L+NS   I CGE+LFI ++R+
Subjt:  IQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNS--GINCGEDLFITVIRS

Query:  YGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTY
        YGLAGR + +++ F+RI  FGV+RSVRSLNTLLN L+QN RF  VH +FK S+  FG+ PN+FTCN+L+KALCKKND+E A KV DE+P+MG+VPN+VTY
Subjt:  YGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTY

Query:  TTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIP
        TTILGGYV+R DM  AKRV  E+ D GW PDATTYT+LMDGY K GRF++A  VMD+ME+N +EPN+VTYGV+I A  +EKKSGEA N+  +MLE+ ++P
Subjt:  TTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIP

Query:  SSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEK
         S+LCCKVID LC + +V EAC LW K+LK NC PDNA+ STLIHWLCKEG + EARKLF+EFE+GSI SLLTYNTLIAGMCE GEL EA RLWDDM E+
Subjt:  SSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEK

Query:  GCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSGAADFKAWHLFVPQFVTNMDEQANMLDKIL
         C PN FTYN+LI+G  K GN +E ++V EEML+ GC PN++T+ IL EGL KLGK  +   I+SM + +G  D ++W LF+ +F   +D+    L ++L
Subjt:  GCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSGAADFKAWHLFVPQFVTNMDEQANMLDKIL

Query:  IE
         E
Subjt:  IE

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic (e value: 2.2e-61, % identity: 27.66)
Query:  KFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESL
        KF P S+   +++    F   +  P  + +   P          K L+  +  Q +   AL++F+ A K  P FS     Y  I+ RL R+ +F+ ++ +
Subjt:  KFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESL

Query:  LLELQNSGINCGEDLFITVIRSYG-LAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKND
        L ++++S    G   F+ +I SY     + ++       I  FG++      N +LN LV  N    V +      S +G+ P+V T N+LIKALC+ + 
Subjt:  LLELQNSGINCGEDLFITVIRSYG-LAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKND

Query:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEM-EENGVEPNDVTYGVIIEA
        +  A  + ++MP+ G+VP+  T+TT++ GY+   D+ GA R+  ++ + G      +  +++ G+ K+GR  DA+  + EM  ++G  P+  T+  ++  
Subjt:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEM-EENGVEPNDVTYGVIIEA

Query:  YSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYN
          +      A+ ++  ML++ Y P       VI  LC  G VKEA ++ ++++ ++C+P+    +TLI  LCKE  + EA +L      +G +  + T+N
Subjt:  YSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYN

Query:  TLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNIL-SMFISSGAAD
        +LI G+C       A  L+++M  KGC P+EFTYNMLI      G  +E + + ++M   GC  +  TY+ L +G  K  K  E   I   M +   + +
Subjt:  TLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNIL-SMFISSGAAD

Query:  FKAWHLFVPQFVTN--MDEQANMLDKILIE-------TYRSLLTGY
           ++  +     +  +++ A ++D++++E       TY SLLT +
Subjt:  FKAWHLFVPQFVTN--MDEQANMLDKILIE-------TYRSLLTGY

Q9SS81 Pentatricopeptide repeat-containing protein At3g09060 (e value: 6.4e-61, % identity: 32.03)
Query:  WPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFI
        +P+ L PK ++ ++  ++N   A  +F  A + HPG++H+   YH I+ RLS  R    V  ++  +++    C ED+ ++VI++YG    P  A+  F 
Subjt:  WPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFI

Query:  RI-QTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMA
        R+ + FG   ++RS NTLLNA V+  ++  V  LF Y  +  GV PN+ T N+LIK  CKK + E AR   D M   G  P+V +Y+T++        + 
Subjt:  RI-QTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMA

Query:  GAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDE-MEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLC
         A  +F E+ + G  PD T Y IL+DG++K+     A+++ D  +E++ V PN  T+ ++I   S+                                 C
Subjt:  GAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDE-MEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLC

Query:  GEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNML
          GRV +  K+WE++ +     D    S+LIH LC  GN+ +A  +FNE  ER +   ++TYNT++ G C  G++ E+  LW  M  K  V N  +YN+L
Subjt:  GEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNML

Query:  IKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSG
        IKG L+ G  +E   +   M  KG   +++TY I   GL   G   +   ++    SSG
Subjt:  IKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSG

Arabidopsis top hits (e value, % identity, alignment)
AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.6e-62, % identity: 27.66)
Query:  KFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESL
        KF P S+   +++    F   +  P  + +   P          K L+  +  Q +   AL++F+ A K  P FS     Y  I+ RL R+ +F+ ++ +
Subjt:  KFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESL

Query:  LLELQNSGINCGEDLFITVIRSYG-LAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKND
        L ++++S    G   F+ +I SY     + ++       I  FG++      N +LN LV  N    V +      S +G+ P+V T N+LIKALC+ + 
Subjt:  LLELQNSGINCGEDLFITVIRSYG-LAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKND

Query:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEM-EENGVEPNDVTYGVIIEA
        +  A  + ++MP+ G+VP+  T+TT++ GY+   D+ GA R+  ++ + G      +  +++ G+ K+GR  DA+  + EM  ++G  P+  T+  ++  
Subjt:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEM-EENGVEPNDVTYGVIIEA

Query:  YSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYN
          +      A+ ++  ML++ Y P       VI  LC  G VKEA ++ ++++ ++C+P+    +TLI  LCKE  + EA +L      +G +  + T+N
Subjt:  YSREKKSGEALNLLSDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYN

Query:  TLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNIL-SMFISSGAAD
        +LI G+C       A  L+++M  KGC P+EFTYNMLI      G  +E + + ++M   GC  +  TY+ L +G  K  K  E   I   M +   + +
Subjt:  TLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNIL-SMFISSGAAD

Query:  FKAWHLFVPQFVTN--MDEQANMLDKILIE-------TYRSLLTGY
           ++  +     +  +++ A ++D++++E       TY SLLT +
Subjt:  FKAWHLFVPQFVTN--MDEQANMLDKILIE-------TYRSLLTGY

AT5G14260.1 Rubisco methyltransferase family protein (e value: 1.4e-196, % identity: 80.85)
Query:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV
        +  R    R++C  SSSDTLVA GS KED +  +  +KKE D+  DLK WM KNGLPPCKV+L+E+ +HD+ H+PIHYVAASEDL+ GDVAFSVP+SLVV
Subjt:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV

Query:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV
        TLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS WYPYIRELDRQRGRGQL  ESPLLWSE ELDYL GSPTK EVLERAEGIK+EYNELDTV
Subjt:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV

Query:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK
        WFMAGSLFQQYP+DIPTEAFSFEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPLLAY SNCKAMLTA+DGAVELVVDRPYKAG+ I VWCGPQPN+K
Subjt:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK

Query:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE
        LLLNYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNG+LS Q F V  GKE+EAV DMLPYLRLGY++DPSEMQSVISSQGPV   SPCMERA+L+
Subjt:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE

Query:  QVADYFKRRLAGYPTTLSEDESL
        Q+A+YF RRL+GYPTT  ED++L
Subjt:  QVADYFKRRLAGYPTTLSEDESL

AT5G14260.2 Rubisco methyltransferase family protein (e value: 1.4e-196, % identity: 80.85)
Query:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV
        +  R    R++C  SSSDTLVA GS KED +  +  +KKE D+  DLK WM KNGLPPCKV+L+E+ +HD+ H+PIHYVAASEDL+ GDVAFSVP+SLVV
Subjt:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV

Query:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV
        TLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS WYPYIRELDRQRGRGQL  ESPLLWSE ELDYL GSPTK EVLERAEGIK+EYNELDTV
Subjt:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV

Query:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK
        WFMAGSLFQQYP+DIPTEAFSFEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPLLAY SNCKAMLTA+DGAVELVVDRPYKAG+ I VWCGPQPN+K
Subjt:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK

Query:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE
        LLLNYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNG+LS Q F V  GKE+EAV DMLPYLRLGY++DPSEMQSVISSQGPV   SPCMERA+L+
Subjt:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE

Query:  QVADYFKRRLAGYPTTLSEDESL
        Q+A+YF RRL+GYPTT  ED++L
Subjt:  QVADYFKRRLAGYPTTLSEDESL

AT5G14260.3 Rubisco methyltransferase family protein (e value: 1.4e-196; % identity: 80.85)
Query:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV
        +  R    R++C  SSSDTLVA GS KED +  +  +KKE D+  DLK WM KNGLPPCKV+L+E+ +HD+ H+PIHYVAASEDL+ GDVAFSVP+SLVV
Subjt:  NKERPTRRRNVCSASSSDTLVA-GSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLEEKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVV

Query:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV
        TLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS WYPYIRELDRQRGRGQL  ESPLLWSE ELDYL GSPTK EVLERAEGIK+EYNELDTV
Subjt:  TLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLAGSPTKKEVLERAEGIKKEYNELDTV

Query:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK
        WFMAGSLFQQYP+DIPTEAFSFEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPLLAY SNCKAMLTA+DGAVELVVDRPYKAG+ I VWCGPQPN+K
Subjt:  WFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYKAGESIAVWCGPQPNSK

Query:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE
        LLLNYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNG+LS Q F V  GKE+EAV DMLPYLRLGY++DPSEMQSVISSQGPV   SPCMERA+L+
Subjt:  LLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPV---SPCMERAMLE

Query:  QVADYFKRRLAGYPTTLSEDESL
        Q+A+YF RRL+GYPTT  ED++L
Subjt:  QVADYFKRRLAGYPTTLSEDESL

AT5G16420.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.9e-194; % identity: 63.35)
Query:  IQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNS--GINCGEDLFITVIRS
        +Q Y T  PPIKPWPQRL+PKRLV+MI +QQN+DLALQIF YAGK HPGF+HNYDTYH+I+ +LSRARAF+PVESL+ +L+NS   I CGE+LFI ++R+
Subjt:  IQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLLELQNS--GINCGEDLFITVIRS

Query:  YGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTY
        YGLAGR + +++ F+RI  FGV+RSVRSLNTLLN L+QN RF  VH +FK S+  FG+ PN+FTCN+L+KALCKKND+E A KV DE+P+MG+VPN+VTY
Subjt:  YGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTY

Query:  TTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIP
        TTILGGYV+R DM  AKRV  E+ D GW PDATTYT+LMDGY K GRF++A  VMD+ME+N +EPN+VTYGV+I A  +EKKSGEA N+  +MLE+ ++P
Subjt:  TTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIP

Query:  SSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEK
         S+LCCKVID LC + +V EAC LW K+LK NC PDNA+ STLIHWLCKEG + EARKLF+EFE+GSI SLLTYNTLIAGMCE GEL EA RLWDDM E+
Subjt:  SSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEK

Query:  GCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSGAADFKAWHLFVPQFVTNMDEQANMLDKIL
         C PN FTYN+LI+G  K GN +E ++V EEML+ GC PN++T+ IL EGL KLGK  +   I+SM + +G  D ++W LF+ +F   +D+    L ++L
Subjt:  GCVPNEFTYNMLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSGAADFKAWHLFVPQFVTNMDEQANMLDKIL

Query:  IE
         E
Subjt:  IE


Sequences
CDS sequence
ATGTGGTGCTACTTCCGCTCCAACAAATTCAAACCCATTTCACTCCGCACCCCAATTTCCATTGTCCCTCTTCGTTTCATCTTCGCCGTTGATTCTCCTATTCAGTCCTA
CACCGTCACGCCCCCCATCAAACCCTGGCCGCAGCGTCTCTATCCCAAACGCCTCGTCGCTATGATCATTCGCCAACAGAATCTCGACCTCGCCCTTCAAATCTTCCACT
ACGCCGGCAAATTTCATCCCGGCTTTTCCCACAACTACGATACCTATCATGCAATCATTCATCGTCTCTCTCGCGCTCGAGCTTTTGAACCTGTCGAGTCTTTGCTCCTG
GAATTGCAGAATTCTGGTATCAATTGCGGTGAGGATTTGTTCATTACTGTGATTAGAAGCTATGGGCTTGCGGGTCGCCCGAAATTGGCTGTGAAAACATTTATACGCAT
TCAAACCTTCGGTGTTCGACGCTCGGTGAGGTCGTTGAACACGTTGCTCAACGCTTTGGTTCAGAACAATCGGTTTTCTTTTGTTCATTTGTTGTTTAAGTATTCCAGAT
CCAAATTTGGGGTTGTGCCTAATGTGTTTACTTGTAACATTTTGATCAAAGCGCTTTGCAAAAAGAATGATGTTGAGGGTGCACGGAAGGTGTTTGACGAAATGCCTGCC
ATGGGTATGGTTCCAAATGTGGTTACTTATACTACAATCTTAGGTGGTTATGTTTCAAGACGCGATATGGCTGGTGCCAAGAGAGTTTTTGGTGAGCTTTTTGATCATGG
GTGGCTTCCTGATGCGACCACTTACACAATTTTAATGGATGGATACATTAAGCAAGGTAGGTTTACTGATGCTGTAAAGGTGATGGATGAGATGGAGGAAAATGGGGTCG
AGCCAAATGATGTTACTTATGGAGTCATTATTGAAGCTTATAGTAGGGAGAAAAAGTCTGGCGAAGCACTTAACCTGCTTAGTGATATGCTTGAAAAGAAGTATATTCCA
AGCTCAGCGCTTTGCTGTAAGGTGATTGATGTTTTGTGCGGTGAAGGGAGGGTGAAGGAAGCTTGTAAGCTGTGGGAGAAGCTTTTGAAGAAAAACTGTACCCCGGATAA
TGCTATTACAAGTACCCTTATTCATTGGCTTTGTAAGGAGGGGAATATATGGGAAGCAAGAAAATTATTTAATGAGTTTGAGAGGGGCTCAATTTCAAGTTTATTAACAT
ATAACACGCTCATTGCAGGAATGTGTGAAATGGGGGAGTTGTGTGAAGCCGCTAGGTTGTGGGATGACATGTTGGAAAAGGGTTGTGTGCCTAATGAATTTACTTATAAC
ATGCTGATAAAAGGATTTCTTAAAGTTGGTAACGCTGAAGAAGTGATTAAAGTAGCAGAGGAGATGTTGGATAAGGGATGTTTGCCAAATGAGTCTACTTACTCAATATT
GGCTGAAGGGCTCCTCAAATTGGGAAAAGGAGGAGAATTTTTTAATATTCTTTCGATGTTTATCTCAAGCGGAGCTGCTGACTTTAAAGCCTGGCATCTATTTGTACCGC
AGTTTGTTACTAATATGGATGAACAAGCAAATATGCTTGACAAAATATTGATTGAAACTTATAGGTCTTTGCTTACTGGCTATGAGAGTAGCACAGTGTCGAAGGGTGAG
TCTGAGATCTCTCTAGAGCCTAGATCACTCCTTCCTCTTTCACACGATGGAACACTGTTGAGTTTGGCTGCTGCGAGAGAGTGCATGAAAGCTTTGCCATCAACAAGAAT
CCATGACCACCGAGCTGCTCAGTTGCAATGCAACAAGGAAAGGCCAACTCGTCGCCGGAATGTTTGTTCTGCTTCAAGCTCTGATACCCTTGTTGCTGGGTCGCGTAAGG
AGGATGGCAAGACTGGAGAACCTCTAACTAAGAAGGAAGAGGATGAGTTTGGAGATTTGAAGGCTTGGATGCACAAAAATGGGCTTCCTCCTTGCAAGGTTGTTCTTGAG
GAAAAGGCTTCGCATGATAAGAATCATAGGCCTATACATTACGTGGCTGCCAGTGAAGATCTTGAGGTGGGTGATGTTGCATTTTCGGTTCCGAATTCCTTGGTCGTGAC
GCTTGAGAGAGTTCTAGGAAATGAGACCGTCGCTGAATTGTTAACTACCAATAAGTTGTCAGAGTTGGCGTGCTTAGCTTTGTATTTGATGTATGAGAAGAAACAAGGAA
AGAAGTCTTTCTGGTACCCTTATATTAGAGAGCTTGATCGCCAACGTGGGAGGGGCCAGCTAGCTGTAGAGTCACCTCTTCTATGGTCAGAAGACGAACTTGATTACCTC
GCAGGCAGTCCAACAAAGAAAGAAGTTCTTGAAAGGGCTGAAGGAATCAAGAAGGAGTATAATGAGCTCGACACTGTCTGGTTTATGGCTGGCTCCCTGTTTCAGCAATA
CCCATATGACATTCCAACTGAAGCATTTTCCTTTGAGATTTTCAAACAAGCCTTTGTTGCAGTTCAGTCATGTGTGGTGCATTTACAGAAAGTAAGTTTGGCTCGGAGAT
TTGCTTTGGTTCCTCTTGGCCCTCCACTATTGGCTTATAGAAGCAATTGCAAGGCAATGTTAACTGCTATTGATGGTGCTGTGGAACTGGTGGTTGATCGTCCATACAAG
GCTGGGGAATCAATAGCTGTTTGGTGTGGGCCACAGCCTAATTCAAAATTACTCCTGAATTATGGATTTGTTGATGAAGATAATCCCTATGATCGCTTAGTAGTTGAGGC
AGCTTTGAACACTGAAGATCCTCAATATCAGGATAAAAGAATGGTTGCTCAAAGAAATGGCAGATTATCAATACAAGCTTTTTATGTTTATGCCGGAAAGGAGAAAGAAG
CTGTCTTAGACATGCTTCCTTATCTTCGACTTGGCTACGTTACTGATCCTTCAGAAATGCAGTCTGTTATTTCTTCTCAAGGTCCAGTGAGCCCCTGCATGGAAAGAGCA
ATGTTGGAACAAGTTGCTGACTATTTCAAGAGACGACTGGCTGGCTACCCAACTACCTTGAGTGAAGACGAGTCTCTGTTCGATTTCAAGATCTGGGGTTTAGTGGCGGC
TTCTGATCTACGTTGTTCGATTGGGTTTTTTCTATGTTCTTCAGTTGTTGTGGGGTTTTTCTTGCGATTCAAGCCAGTGCGAGAGGTTTTGTTTCGTGGAGCTGTTGGGA
TTTACTGGAAAGTTTGTCAAGCTACTGTGGAGGTGCATGATCACGTCTGTGTTCAGTTAGTGACTGCATTCTTGAAGAAGCAGAATTGA
mRNA sequence
ATGTGGTGCTACTTCCGCTCCAACAAATTCAAACCCATTTCACTCCGCACCCCAATTTCCATTGTCCCTCTTCGTTTCATCTTCGCCGTTGATTCTCCTATTCAGTCCTA
CACCGTCACGCCCCCCATCAAACCCTGGCCGCAGCGTCTCTATCCCAAACGCCTCGTCGCTATGATCATTCGCCAACAGAATCTCGACCTCGCCCTTCAAATCTTCCACT
ACGCCGGCAAATTTCATCCCGGCTTTTCCCACAACTACGATACCTATCATGCAATCATTCATCGTCTCTCTCGCGCTCGAGCTTTTGAACCTGTCGAGTCTTTGCTCCTG
GAATTGCAGAATTCTGGTATCAATTGCGGTGAGGATTTGTTCATTACTGTGATTAGAAGCTATGGGCTTGCGGGTCGCCCGAAATTGGCTGTGAAAACATTTATACGCAT
TCAAACCTTCGGTGTTCGACGCTCGGTGAGGTCGTTGAACACGTTGCTCAACGCTTTGGTTCAGAACAATCGGTTTTCTTTTGTTCATTTGTTGTTTAAGTATTCCAGAT
CCAAATTTGGGGTTGTGCCTAATGTGTTTACTTGTAACATTTTGATCAAAGCGCTTTGCAAAAAGAATGATGTTGAGGGTGCACGGAAGGTGTTTGACGAAATGCCTGCC
ATGGGTATGGTTCCAAATGTGGTTACTTATACTACAATCTTAGGTGGTTATGTTTCAAGACGCGATATGGCTGGTGCCAAGAGAGTTTTTGGTGAGCTTTTTGATCATGG
GTGGCTTCCTGATGCGACCACTTACACAATTTTAATGGATGGATACATTAAGCAAGGTAGGTTTACTGATGCTGTAAAGGTGATGGATGAGATGGAGGAAAATGGGGTCG
AGCCAAATGATGTTACTTATGGAGTCATTATTGAAGCTTATAGTAGGGAGAAAAAGTCTGGCGAAGCACTTAACCTGCTTAGTGATATGCTTGAAAAGAAGTATATTCCA
AGCTCAGCGCTTTGCTGTAAGGTGATTGATGTTTTGTGCGGTGAAGGGAGGGTGAAGGAAGCTTGTAAGCTGTGGGAGAAGCTTTTGAAGAAAAACTGTACCCCGGATAA
TGCTATTACAAGTACCCTTATTCATTGGCTTTGTAAGGAGGGGAATATATGGGAAGCAAGAAAATTATTTAATGAGTTTGAGAGGGGCTCAATTTCAAGTTTATTAACAT
ATAACACGCTCATTGCAGGAATGTGTGAAATGGGGGAGTTGTGTGAAGCCGCTAGGTTGTGGGATGACATGTTGGAAAAGGGTTGTGTGCCTAATGAATTTACTTATAAC
ATGCTGATAAAAGGATTTCTTAAAGTTGGTAACGCTGAAGAAGTGATTAAAGTAGCAGAGGAGATGTTGGATAAGGGATGTTTGCCAAATGAGTCTACTTACTCAATATT
GGCTGAAGGGCTCCTCAAATTGGGAAAAGGAGGAGAATTTTTTAATATTCTTTCGATGTTTATCTCAAGCGGAGCTGCTGACTTTAAAGCCTGGCATCTATTTGTACCGC
AGTTTGTTACTAATATGGATGAACAAGCAAATATGCTTGACAAAATATTGATTGAAACTTATAGGTCTTTGCTTACTGGCTATGAGAGTAGCACAGTGTCGAAGGGTGAG
TCTGAGATCTCTCTAGAGCCTAGATCACTCCTTCCTCTTTCACACGATGGAACACTGTTGAGTTTGGCTGCTGCGAGAGAGTGCATGAAAGCTTTGCCATCAACAAGAAT
CCATGACCACCGAGCTGCTCAGTTGCAATGCAACAAGGAAAGGCCAACTCGTCGCCGGAATGTTTGTTCTGCTTCAAGCTCTGATACCCTTGTTGCTGGGTCGCGTAAGG
AGGATGGCAAGACTGGAGAACCTCTAACTAAGAAGGAAGAGGATGAGTTTGGAGATTTGAAGGCTTGGATGCACAAAAATGGGCTTCCTCCTTGCAAGGTTGTTCTTGAG
GAAAAGGCTTCGCATGATAAGAATCATAGGCCTATACATTACGTGGCTGCCAGTGAAGATCTTGAGGTGGGTGATGTTGCATTTTCGGTTCCGAATTCCTTGGTCGTGAC
GCTTGAGAGAGTTCTAGGAAATGAGACCGTCGCTGAATTGTTAACTACCAATAAGTTGTCAGAGTTGGCGTGCTTAGCTTTGTATTTGATGTATGAGAAGAAACAAGGAA
AGAAGTCTTTCTGGTACCCTTATATTAGAGAGCTTGATCGCCAACGTGGGAGGGGCCAGCTAGCTGTAGAGTCACCTCTTCTATGGTCAGAAGACGAACTTGATTACCTC
GCAGGCAGTCCAACAAAGAAAGAAGTTCTTGAAAGGGCTGAAGGAATCAAGAAGGAGTATAATGAGCTCGACACTGTCTGGTTTATGGCTGGCTCCCTGTTTCAGCAATA
CCCATATGACATTCCAACTGAAGCATTTTCCTTTGAGATTTTCAAACAAGCCTTTGTTGCAGTTCAGTCATGTGTGGTGCATTTACAGAAAGTAAGTTTGGCTCGGAGAT
TTGCTTTGGTTCCTCTTGGCCCTCCACTATTGGCTTATAGAAGCAATTGCAAGGCAATGTTAACTGCTATTGATGGTGCTGTGGAACTGGTGGTTGATCGTCCATACAAG
GCTGGGGAATCAATAGCTGTTTGGTGTGGGCCACAGCCTAATTCAAAATTACTCCTGAATTATGGATTTGTTGATGAAGATAATCCCTATGATCGCTTAGTAGTTGAGGC
AGCTTTGAACACTGAAGATCCTCAATATCAGGATAAAAGAATGGTTGCTCAAAGAAATGGCAGATTATCAATACAAGCTTTTTATGTTTATGCCGGAAAGGAGAAAGAAG
CTGTCTTAGACATGCTTCCTTATCTTCGACTTGGCTACGTTACTGATCCTTCAGAAATGCAGTCTGTTATTTCTTCTCAAGGTCCAGTGAGCCCCTGCATGGAAAGAGCA
ATGTTGGAACAAGTTGCTGACTATTTCAAGAGACGACTGGCTGGCTACCCAACTACCTTGAGTGAAGACGAGTCTCTGTTCGATTTCAAGATCTGGGGTTTAGTGGCGGC
TTCTGATCTACGTTGTTCGATTGGGTTTTTTCTATGTTCTTCAGTTGTTGTGGGGTTTTTCTTGCGATTCAAGCCAGTGCGAGAGGTTTTGTTTCGTGGAGCTGTTGGGA
TTTACTGGAAAGTTTGTCAAGCTACTGTGGAGGTGCATGATCACGTCTGTGTTCAGTTAGTGACTGCATTCTTGAAGAAGCAGAATTGA
Protein sequence
MWCYFRSNKFKPISLRTPISIVPLRFIFAVDSPIQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHYAGKFHPGFSHNYDTYHAIIHRLSRARAFEPVESLLL
ELQNSGINCGEDLFITVIRSYGLAGRPKLAVKTFIRIQTFGVRRSVRSLNTLLNALVQNNRFSFVHLLFKYSRSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPA
MGMVPNVVTYTTILGGYVSRRDMAGAKRVFGELFDHGWLPDATTYTILMDGYIKQGRFTDAVKVMDEMEENGVEPNDVTYGVIIEAYSREKKSGEALNLLSDMLEKKYIP
SSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYN
MLIKGFLKVGNAEEVIKVAEEMLDKGCLPNESTYSILAEGLLKLGKGGEFFNILSMFISSGAADFKAWHLFVPQFVTNMDEQANMLDKILIETYRSLLTGYESSTVSKGE
SEISLEPRSLLPLSHDGTLLSLAAARECMKALPSTRIHDHRAAQLQCNKERPTRRRNVCSASSSDTLVAGSRKEDGKTGEPLTKKEEDEFGDLKAWMHKNGLPPCKVVLE
EKASHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYL
AGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAIDGAVELVVDRPYK
AGESIAVWCGPQPNSKLLLNYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTDPSEMQSVISSQGPVSPCMERA
MLEQVADYFKRRLAGYPTTLSEDESLFDFKIWGLVAASDLRCSIGFFLCSSVVVGFFLRFKPVREVLFRGAVGIYWKVCQATVEVHDHVCVQLVTAFLKKQN