; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021181 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021181
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold6:48106448..48108022
RNA-Seq ExpressionSpg021181
SyntenySpg021181
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596261.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.7e-28791.79Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+ +P FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI+LFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

XP_022942131.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita moschata]3.9e-28791.6Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+ +P FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSV+ELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI+LFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

XP_022971604.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita maxima]3.2e-28992.56Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RR S+ALVS+ QP FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKNS+KGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI+LFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

XP_023540099.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo]6.6e-28791.41Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL  RSS+ALVS+ +P FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVF+WNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI++FHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSY+IKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYE+MEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

XP_038905146.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benincasa hispida]9.9e-28389.89Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MN LCRR+ QAL S+ QPCFR LRSFFSDRSS +YHRDSYDYT LL  CRTIRSVQ LHAQIIVEG DQNGFLATKLIGKY EHGE+KMG+ARKVFD+LL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        +RDVFVWNVVIQGYAN GPF EALNLFDEMRVSG PTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV NAL+AFYAKCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT N KV++AI+LFHAMLHNQ+ACSPD+ATL+GILPACVTKSASQVGFWVHSYVIKTGMEVGA LGSCLIS+YANCGHVNIARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RIDDKNVIVWSAIIRCYGMHGFA EALNMFT LEE GLKPDGV+FLNLLSTCSHAGL+AKGHEIYEKME YGVERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIH NIELAKE+GEKLFILD +NAGRY+ILASMYEDAGQWEDAAKLRKLLRDRN+RKPVGCSSIE+DRIHHVFGKEDE+H
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLE I+E DFEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

TrEMBL top hitse value%identityAlignment
A0A0A0L101 Uncharacterized protein1.9e-26384.22Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEH--GEAKMGVARKVFDR
        MNGLCRR+ QAL  + QP       FFS+ S  +YHRDSY +  LL  CR+IRSVQELHAQI+VEG DQNGF+A KLIGKY EH  GE KMG ARKVFD 
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEH--GEAKMGVARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFD
        L+ RDVFVWNVVIQGYA+ GPF EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKG+IVHGHV+KCGLDLDLFVGNAL+AFY+KCQDVETARKVFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFD

Query:  EMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD
        +MSLRDIVSWNSMI GYT NGK DEAI+ FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVNIARD
Subjt:  EMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VFDRIDDKNVIVWSAIIRCYGMHGFA EA NMF  LEE G+KPDG+IFLNLLS CSHAGLVAKGHEIYEKMEAYG+ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAKEVGEKLFILDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE

Query:  SHPFTEQIFDTLVKLERIIEEDFEPI
        +HP TE+IFDTL KLERI+EEDFEPI
Subjt:  SHPFTEQIFDTLVKLERIIEEDFEPI

A0A5A7UCB1 Pentatricopeptide repeat-containing protein2.0e-25781.75Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAE--HGEAKMGVARKVFDR
        MNGL RR+ QAL  + QP       FFS+R S +YHRDSYD+  LL  CRTIRSVQELHAQI+VEG DQNGFLATKLIGKY E   GE+KMG ARKVFDR
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAE--HGEAKMGVARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFD
        LL+RDVF+WNVVIQGYA++GPF EALNLFDEMRVSGEPTNRYTFPFVLKACGA+KNSDKG+IVHG+V+KCGLDLDLFVGNAL++ Y+KCQDVETARKVFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFD

Query:  EMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD
        +MSLRD VSWNSMI GYT N + D+AI+ FHAMLHNQ  C PD+ATLV ILPAC TKSASQVGFWVHSY+IKTGMEVGA LGSCLISMY NCGH+NIARD
Subjt:  EMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VF+RID+KNVIVWSAIIRCYGMHG A EALNMF  LEEVG+KPDG+IFLNLLS CSHAGLVAKG EIY+KMEAYG+ER ++HYACMVDLL RAGFLEQA 
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE
        EFIE MPVQAGKDVYGAL GACRIHNN+ELAKEVGEKLFILDPENAGRY+ILASMYEDAGQWEDAAK+RKLLRDRNI+KP GCSSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE

Query:  SHPFTEQIFDTLVKLERIIEEDFEPI
        +HPFTE+IFDT+ KLER++EEDFEPI
Subjt:  SHPFTEQIFDTLVKLERIIEEDFEPI

A0A6J1CVV9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.0e-27788.36Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        M GL RR S+ LV K QP FRFLR F SDRS  EY RDSYDYTNLLQ CRTIRSVQELHAQIIVEG DQNGFLATKLIGKYAEHG++KMGVARKVFDRL+
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        ERDVFVWNVVI+GYANWGPF EALNLFDEMRV+G PTNRYTFPFVLKACGAMKN DKGK+VHGHVLK GLDLDLFVGNAL+AFYAKC D+ET RKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
         L+DIVSWNSMIAG+TSNGKVDEAI+LFHAM+HNQ ACSPDNATLVGILPACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RIDDKNVIVWSA+IRCYGMHG A EAL MFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKG +IYEKME YGVERKEEHYACMVDLLGRAGF++QAV+F
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMP+QAGKDVYGALLGACRIHNNIE+AKE  EKLF+LDPENAGRYVILASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRIHHVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PF+EQIFDTL KLERI+EEDFEP+
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

A0A6J1FMZ9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.9e-28791.6Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+ +P FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSV+ELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI+LFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

A0A6J1I919 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like1.5e-28992.56Show/hide
Query:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RR S+ALVS+ QP FRFLRS F+D S REYHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKNS+KGKIVHGHVLKCGLDLDLFVGNAL+AFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEM

Query:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIVSWNSMIAGYT NGKVDEAI+LFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA EALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLF+LDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFDTLVKLERIIEEDFEPI
        PFTEQIFDTL KLER+++E+FEPI
Subjt:  PFTEQIFDTLVKLERIIEEDFEPI

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.0e-9735.33Show/hide
Query:  SFFSDRSSREYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA
        +F    S  + + + Y +  L++    + S+   Q LH   +      + F+A  LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   
Subjt:  SFFSDRSSREYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA

Query:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKC-------------------------------QDV
        +AL LF +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y KC                               +D 
Subjt:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKC-------------------------------QDV

Query:  ETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC
        E AR+V + M  +DIV+WN++I+ Y  NGK +EA+I+FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ C
Subjt:  ETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC

Query:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLG
        G +  +R+VF+ ++ ++V VWSA+I    MHG  +EA++MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LG
Subjt:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLG

Query:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI
        R+G+LE+AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  GCSSIEID +
Subjt:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI

Query:  HHVFGKEDESHPFTEQIFDTLVK-LERIIEEDFEP
         H F   D +HP +E+++  L + +E++    +EP
Subjt:  HHVFGKEDESHPFTEQIFDTLVK-LERIIEEDFEP

P0C899 Putative pentatricopeptide repeat-containing protein At3g491424.0e-9335.03Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAM------------------LH
              G+ +HG   K GL   LFVGN LV+ Y KC  +  AR V DEMS RD+VSWNS++ GY  N + D+A+ +   M                  + 
Subjt:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAM------------------LH

Query:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS
        N T                                                   PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS

Query:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM
        MYA CG +  ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   ++ M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AG+WE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI

Query:  EIDRIHHVFGKEDESHPFTEQIF---DTLVK
        E++RI H F   D SHP +++I+   D LVK
Subjt:  EIDRIHHVFGKEDESHPFTEQIF---DTLVK

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.2e-9334.24Show/hide
Query:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF
        +SY +  +L+ C   +  +  Q++H  ++  G D + ++ T LI  Y ++G                           A  G    A+K+FD +  +DV 
Subjt:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF

Query:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G+ VH  +   G   +L + NAL+  Y+KC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDI

Query:  VSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI
        +SWN++I GYT      EA++LF  ML  ++  +P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VF+ I
Subjt:  VSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG A  + ++F+ + ++G++PD + F+ LLS CSH+G++  G  I+  M + Y +  K EHY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPF
         M ++    ++ +LL AC++H N+EL +   E L  ++PEN G YV+L+++Y  AG+W + AK R LL D+ ++K  GCSSIEID + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPF

Query:  TEQIFDTLVKLERIIEE
          +I+  L ++E ++E+
Subjt:  TEQIFDTLVKLERIIEE

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.2e-9439.78Show/hide
Query:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV
        L+  Y++ G+  +  A+ VF  + +R V  +  +I GYA  G   EA+ LF+EM   G   + YT   VL  C   +  D+GK VH  + +  L  D+FV
Subjt:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV

Query:  GNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG
         NAL+  YAKC  ++ A  VF EM ++DI+SWN++I GY+ N   +EA+ LF+ +L  +   SPD  T+  +LPAC + SA   G  +H Y+++ G    
Subjt:  GNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG

Query:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVE
          + + L+ MYA CG + +A  +FD I  K+++ W+ +I  YGMHGF  EA+ +F  + + G++ D + F++LL  CSH+GLV +G   +  M     +E
Subjt:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVE

Query:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI
           EHYAC+VD+L R G L +A  FIE MP+     ++GALL  CRIH++++LA++V EK+F L+PEN G YV++A++Y +A +WE   +LRK +  R +
Subjt:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI

Query:  RKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLE-RIIEEDFEPI
        RK  GCS IEI    ++F   D S+P TE I   L K+  R+IEE + P+
Subjt:  RKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLE-RIIEEDFEPI

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic3.0e-10137.5Show/hide
Query:  SREYHRDSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFD
        S+E       Y  L+  C    ++     +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ 
Subjt:  SREYHRDSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFD

Query:  EMRVSGEPTNRYTFPFVLKACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEA
        +M   G  ++R+T+ +VLKAC A    + +  KGK +H H+ + G    +++   LV  YA+   V+ A  VF  M +R++VSW++MIA Y  NGK  EA
Subjt:  EMRVSGEPTNRYTFPFVLKACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEA

Query:  IILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFA
        +  F  M+      SP++ T+V +L AC + +A + G  +H Y+++ G++   P+ S L++MY  CG + + + VFDR+ D++V+ W+++I  YG+HG+ 
Subjt:  IILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFA

Query:  SEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIH
         +A+ +F  +   G  P  V F+++L  CSH GLV +G  ++E M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH
Subjt:  SEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIH

Query:  NNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLERIIEE
         N+ELA+    +LF L+P+NAG YV+LA +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   LVKL   ++E
Subjt:  NNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLERIIEE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-9434.24Show/hide
Query:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF
        +SY +  +L+ C   +  +  Q++H  ++  G D + ++ T LI  Y ++G                           A  G    A+K+FD +  +DV 
Subjt:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF

Query:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G+ VH  +   G   +L + NAL+  Y+KC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDI

Query:  VSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI
        +SWN++I GYT      EA++LF  ML  ++  +P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VF+ I
Subjt:  VSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG A  + ++F+ + ++G++PD + F+ LLS CSH+G++  G  I+  M + Y +  K EHY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPF
         M ++    ++ +LL AC++H N+EL +   E L  ++PEN G YV+L+++Y  AG+W + AK R LL D+ ++K  GCSSIEID + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPF

Query:  TEQIFDTLVKLERIIEE
          +I+  L ++E ++E+
Subjt:  TEQIFDTLVKLERIIEE

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-9835.33Show/hide
Query:  SFFSDRSSREYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA
        +F    S  + + + Y +  L++    + S+   Q LH   +      + F+A  LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   
Subjt:  SFFSDRSSREYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA

Query:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKC-------------------------------QDV
        +AL LF +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y KC                               +D 
Subjt:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKC-------------------------------QDV

Query:  ETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC
        E AR+V + M  +DIV+WN++I+ Y  NGK +EA+I+FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ C
Subjt:  ETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC

Query:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLG
        G +  +R+VF+ ++ ++V VWSA+I    MHG  +EA++MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LG
Subjt:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLG

Query:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI
        R+G+LE+AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  GCSSIEID +
Subjt:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI

Query:  HHVFGKEDESHPFTEQIFDTLVK-LERIIEEDFEP
         H F   D +HP +E+++  L + +E++    +EP
Subjt:  HHVFGKEDESHPFTEQIFDTLVK-LERIIEEDFEP

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-10237.5Show/hide
Query:  SREYHRDSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFD
        S+E       Y  L+  C    ++     +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ 
Subjt:  SREYHRDSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFD

Query:  EMRVSGEPTNRYTFPFVLKACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEA
        +M   G  ++R+T+ +VLKAC A    + +  KGK +H H+ + G    +++   LV  YA+   V+ A  VF  M +R++VSW++MIA Y  NGK  EA
Subjt:  EMRVSGEPTNRYTFPFVLKACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEA

Query:  IILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFA
        +  F  M+      SP++ T+V +L AC + +A + G  +H Y+++ G++   P+ S L++MY  CG + + + VFDR+ D++V+ W+++I  YG+HG+ 
Subjt:  IILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFA

Query:  SEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIH
         +A+ +F  +   G  P  V F+++L  CSH GLV +G  ++E M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH
Subjt:  SEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIH

Query:  NNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLERIIEE
         N+ELA+    +LF L+P+NAG YV+LA +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   LVKL   ++E
Subjt:  NNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLERIIEE

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-9435.03Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAM------------------LH
              G+ +HG   K GL   LFVGN LV+ Y KC  +  AR V DEMS RD+VSWNS++ GY  N + D+A+ +   M                  + 
Subjt:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAM------------------LH

Query:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS
        N T                                                   PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS

Query:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM
        MYA CG +  ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   ++ M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AG+WE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI

Query:  EIDRIHHVFGKEDESHPFTEQIF---DTLVK
        E++RI H F   D SHP +++I+   D LVK
Subjt:  EIDRIHHVFGKEDESHPFTEQIF---DTLVK

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein8.8e-9639.78Show/hide
Query:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV
        L+  Y++ G+  +  A+ VF  + +R V  +  +I GYA  G   EA+ LF+EM   G   + YT   VL  C   +  D+GK VH  + +  L  D+FV
Subjt:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV

Query:  GNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG
         NAL+  YAKC  ++ A  VF EM ++DI+SWN++I GY+ N   +EA+ LF+ +L  +   SPD  T+  +LPAC + SA   G  +H Y+++ G    
Subjt:  GNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGKVDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG

Query:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVE
          + + L+ MYA CG + +A  +FD I  K+++ W+ +I  YGMHGF  EA+ +F  + + G++ D + F++LL  CSH+GLV +G   +  M     +E
Subjt:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVE

Query:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI
           EHYAC+VD+L R G L +A  FIE MP+     ++GALL  CRIH++++LA++V EK+F L+PEN G YV++A++Y +A +WE   +LRK +  R +
Subjt:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI

Query:  RKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLE-RIIEEDFEPI
        RK  GCS IEI    ++F   D S+P TE I   L K+  R+IEE + P+
Subjt:  RKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLE-RIIEEDFEPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGCTCTGCCGGCGATCTTCGCAAGCTCTGGTATCGAAAATTCAACCATGCTTCAGATTTCTAAGATCCTTCTTCAGCGATCGGAGTTCTAGGGAGTATCATCG
CGATTCATACGATTACACGAACTTATTACAACGTTGTAGAACGATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTAGAAGGCCACGACCAAAATGGATTCTTAG
CCACGAAGCTAATCGGCAAATACGCGGAGCATGGTGAGGCGAAAATGGGAGTTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTGGAACGTGGTT
ATTCAAGGGTATGCGAACTGGGGTCCGTTTGCCGAAGCCCTCAACCTGTTTGATGAAATGCGAGTCAGTGGCGAGCCCACTAATCGCTACACATTCCCTTTTGTGTTGAA
GGCATGTGGCGCAATGAAGAACAGTGACAAGGGGAAGATTGTTCATGGACATGTTTTGAAATGTGGGTTGGACTTGGATCTGTTCGTGGGCAATGCTCTGGTTGCGTTTT
ATGCCAAGTGCCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGAAATGTCTCTGAGAGATATTGTGAGTTGGAACTCCATGATTGCTGGGTATACTTCGAATGGTAAA
GTGGATGAAGCTATTATCCTTTTCCATGCGATGCTGCATAATCAAACTGCTTGTTCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTACAAAATCTGC
TTCCCAAGTTGGCTTCTGGGTTCATTCCTATGTTATAAAGACAGGAATGGAAGTTGGGGCCCCGTTGGGCAGTTGCCTTATTTCAATGTATGCTAACTGTGGTCATGTGA
ACATTGCGAGAGATGTTTTCGACCGAATCGACGACAAAAACGTCATCGTATGGAGTGCGATCATAAGGTGTTATGGAATGCATGGTTTTGCAAGTGAGGCATTAAACATG
TTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGACGGCGTGATCTTCCTGAATTTGTTGTCGACATGTAGTCACGCAGGGCTCGTCGCGAAAGGCCACGAGATATACGA
AAAAATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCAGTCGAGTTCATTGAAGGCA
TGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACATAACAACATAGAGCTAGCTAAAGAAGTTGGTGAGAAGTTGTTCATTTTGGAT
CCCGAAAACGCAGGGCGATACGTGATCCTGGCTAGTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCCAAACTGAGGAAGTTGCTGAGAGATAGGAATATTAGGAA
GCCAGTTGGTTGTAGCTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTGACACATTGGTGAAGCTAG
AAAGGATAATTGAGGAAGATTTTGAACCTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGGCTCTGCCGGCGATCTTCGCAAGCTCTGGTATCGAAAATTCAACCATGCTTCAGATTTCTAAGATCCTTCTTCAGCGATCGGAGTTCTAGGGAGTATCATCG
CGATTCATACGATTACACGAACTTATTACAACGTTGTAGAACGATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTAGAAGGCCACGACCAAAATGGATTCTTAG
CCACGAAGCTAATCGGCAAATACGCGGAGCATGGTGAGGCGAAAATGGGAGTTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTGGAACGTGGTT
ATTCAAGGGTATGCGAACTGGGGTCCGTTTGCCGAAGCCCTCAACCTGTTTGATGAAATGCGAGTCAGTGGCGAGCCCACTAATCGCTACACATTCCCTTTTGTGTTGAA
GGCATGTGGCGCAATGAAGAACAGTGACAAGGGGAAGATTGTTCATGGACATGTTTTGAAATGTGGGTTGGACTTGGATCTGTTCGTGGGCAATGCTCTGGTTGCGTTTT
ATGCCAAGTGCCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGAAATGTCTCTGAGAGATATTGTGAGTTGGAACTCCATGATTGCTGGGTATACTTCGAATGGTAAA
GTGGATGAAGCTATTATCCTTTTCCATGCGATGCTGCATAATCAAACTGCTTGTTCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTACAAAATCTGC
TTCCCAAGTTGGCTTCTGGGTTCATTCCTATGTTATAAAGACAGGAATGGAAGTTGGGGCCCCGTTGGGCAGTTGCCTTATTTCAATGTATGCTAACTGTGGTCATGTGA
ACATTGCGAGAGATGTTTTCGACCGAATCGACGACAAAAACGTCATCGTATGGAGTGCGATCATAAGGTGTTATGGAATGCATGGTTTTGCAAGTGAGGCATTAAACATG
TTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGACGGCGTGATCTTCCTGAATTTGTTGTCGACATGTAGTCACGCAGGGCTCGTCGCGAAAGGCCACGAGATATACGA
AAAAATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCAGTCGAGTTCATTGAAGGCA
TGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACATAACAACATAGAGCTAGCTAAAGAAGTTGGTGAGAAGTTGTTCATTTTGGAT
CCCGAAAACGCAGGGCGATACGTGATCCTGGCTAGTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCCAAACTGAGGAAGTTGCTGAGAGATAGGAATATTAGGAA
GCCAGTTGGTTGTAGCTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTGACACATTGGTGAAGCTAG
AAAGGATAATTGAGGAAGATTTTGAACCTATTTGA
Protein sequenceShow/hide protein sequence
MNGLCRRSSQALVSKIQPCFRFLRSFFSDRSSREYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVV
IQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALVAFYAKCQDVETARKVFDEMSLRDIVSWNSMIAGYTSNGK
VDEAIILFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFASEALNM
FTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFILD
PENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPFTEQIFDTLVKLERIIEEDFEPI