; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017160 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017160
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr5:642843..644417
RNA-Seq ExpressionLag0017160
SyntenyLag0017160
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596261.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.9e-28490.84Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+N+P FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI+LFHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPD VIFLNLLSTCSHAGL+ KG EIY+KMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

XP_022942131.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita moschata]2.0e-28390.65Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+N+P FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSV+ELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI+LFHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPD VIFLNLLSTCSHAGL+ KG EIY+KMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

XP_022971604.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita maxima]1.4e-28491.22Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RR S+ALVS+ QP FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN +KGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI+LFHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEEVGLKPD VIFLNLLSTCSHAGL+ KG EIY+KMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

XP_023540099.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo]3.4e-28390.46Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL  RSS+ALVS+N+P FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVF+WNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI++FHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSY+IKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPD VIFLNLLSTCSHAGL+ KG EIY++MEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

XP_038905146.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benincasa hispida]1.7e-27988.93Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MN LCRR+ QAL S+NQPCFR LRSFFSDRSS KYHRDSYDYT LL  CRTIRSVQ LHAQIIVEG DQNGFLATKLIGKY EHGE+KMG+ARKVFD+LL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        +RDVFVWNVVIQGYAN GPF EALNLFDEMRVSG PTNRYTFPFVLKACGAMKN DKGKIVHGHV KCGLDLDLFV NALIAFYAKCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT N KV++AI+LFHAMLH+Q+ACSPD+ATL+GILPACVTKSASQVGFWVHSYVIKTGMEVGA LGSCLIS+YANCGHVNIARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RIDDKNVIVWSAIIRCYGMHGFA+EALNMFT LEE GLKPD V+FLNLLSTCSHAGLIAKGHEIY+KME YGVERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIH NIELAKE+GEKLF+LD +NAGRY+ILASMYEDAGQWEDAAKLRKLLRDRN+RKPVGCSSIE+DRIHHVFGKEDE+H
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLE  +E DFEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

TrEMBL top hitse value%identityAlignment
A0A0A0L101 Uncharacterized protein1.8e-26183.46Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEH--GEAKMGVARKVFDR
        MNGLCRR+ QAL  +NQP       FFS+ S  KYHRDSY +  LL  CR+IRSVQELHAQI+VEG DQNGF+A KLIGKY EH  GE KMG ARKVFD 
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEH--GEAKMGVARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFD
        L+ RDVFVWNVVIQGYA+ GPF EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKN DKG+IVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFD

Query:  EMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD
        +MSLRDIV WNSMI GYT NGK DEAI+ FHAMLH+Q  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+EVGAPLGSCLI MY NCGHVNIARD
Subjt:  EMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VFDRIDDKNVIVWSAIIRCYGMHGFA+EA NMF  LEE G+KPD +IFLNLLS CSHAGL+AKGHEIY+KMEAYG+ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAKEVGEKLF+LDPE A RYV LA+MYEDAGQWEDAAKLRKLLRDRNIRKP GCSSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE

Query:  SHPLTEQIFDTLVKLERTVEEDFEPI
        +HPLTE+IFDTL KLER +EEDFEPI
Subjt:  SHPLTEQIFDTLVKLERTVEEDFEPI

A0A5A7UCB1 Pentatricopeptide repeat-containing protein9.4e-25580.99Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAE--HGEAKMGVARKVFDR
        MNGL RR+ QAL  ++QP       FFS+R S KYHRDSYD+  LL  CRTIRSVQELHAQI+VEG DQNGFLATKLIGKY E   GE+KMG ARKVFDR
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAE--HGEAKMGVARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFD
        LL+RDVF+WNVVIQGYA++GPF EALNLFDEMRVSGEPTNRYTFPFVLKACGA+KN DKG+IVHG+V KCGLDLDLFVGNALI+ Y+KCQDVETARKVFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFD

Query:  EMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD
        +MSLRD V WNSMI GYT N + D+AI+ FHAMLH+Q  C PD+ATLV ILPAC TKSASQVGFWVHSY+IKTGMEVGA LGSCLISMY NCGH+NIARD
Subjt:  EMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARD

Query:  VFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VF+RID+KNVIVWSAIIRCYGMHG A+EALNMF  LEEVG+KPD +IFLNLLS CSHAGL+AKG EIYKKMEAYG+ER ++HYACMVDLL RAGFLEQA 
Subjt:  VFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE
        EFIE MPVQAGKDVYGAL GACRIHNN+ELAKEVGEKLF+LDPENAGRY+ILASMYEDAGQWEDAAK+RKLLRDRNI+KP GCSSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDE

Query:  SHPLTEQIFDTLVKLERTVEEDFEPI
        +HP TE+IFDT+ KLER +EEDFEPI
Subjt:  SHPLTEQIFDTLVKLERTVEEDFEPI

A0A6J1CVV9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like6.9e-27487.21Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        M GL RR S+ LV KNQP FRFLR F SDRS  +Y RDSYDYTNLLQ CRTIRSVQELHAQIIVEG DQNGFLATKLIGKYAEHG++KMGVARKVFDRL+
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        ERDVFVWNVVI+GYANWGPF EALNLFDEMRV+G PTNRYTFPFVLKACGAMKN DKGK+VHGHV K GLDLDLFVGNALIAFYAKC D+ET RKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
         L+DIV WNSMIAG+TSNGKVDEAI+LFHAM+H+Q ACSPDNATLVGILPACV+KSA+QVGFWVHSYVIKTGM+VGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RIDDKNVIVWSA+IRCYGMHG A+EAL MFTSLEEVGLKPD VIFLNLLSTCSHAGL+AKG +IY+KME YGVERKEEHYACMVDLLGRAGF++QAV+F
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMP+QAGKDVYGALLGACRIHNNIE+AKE  EKLFVLDPENAGRYVILASM+EDAGQWEDAAKLRKLLRDR I+KPVGCSSIEIDRIHHVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P +EQIFDTL KLER +EEDFEP+
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

A0A6J1FMZ9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like9.6e-28490.65Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RRSS+ALVS+N+P FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSV+ELHAQI+VEGHDQNGFLATKLIGKYAE+GE KM +ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI+LFHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVN+ARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPD VIFLNLLSTCSHAGL+ KG EIY+KMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

A0A6J1I919 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like6.7e-28591.22Show/hide
Query:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL
        MNGL RR S+ALVS+ QP FRFLRS F+D S R+YHRDSYDYT LLQ CR+IRSVQELHAQI+VEGHDQNGFLATKLIGKYAE+GE KMG+ARKVFDRLL
Subjt:  MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM
        E+DVFVWNVVIQGYANWGPFAEALNL+DEMRV GEPTNRYTFPFVLKACGAMKN +KGKIVHGHV KCGLDLDLFVGNALIAFY+KCQDVETARKVFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEM

Query:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
        SLRDIV WNSMIAGYT NGKVDEAI+LFHAMLH+QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF
Subjt:  SLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVF

Query:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEEVGLKPD VIFLNLLSTCSHAGL+ KG EIY+KMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESH

Query:  PLTEQIFDTLVKLERTVEEDFEPI
        P TEQIFDTL KLER ++E+FEPI
Subjt:  PLTEQIFDTLVKLERTVEEDFEPI

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.0e-9735.14Show/hide
Query:  SFFSDRSSRKYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA
        +F    S  + + + Y +  L++    + S+   Q LH   +      + F+A  LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   
Subjt:  SFFSDRSSRKYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA

Query:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKC-------------------------------QDV
        +AL LF +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y KC                               +D 
Subjt:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKC-------------------------------QDV

Query:  ETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC
        E AR+V + M  +DIV WN++I+ Y  NGK +EA+I+FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ C
Subjt:  ETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC

Query:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEA-YGVERKEEHYACMVDLLG
        G +  +R+VF+ ++ ++V VWSA+I    MHG  NEA++MF  ++E  +KP+ V F N+   CSH GL+ +   ++ +ME+ YG+  +E+HYAC+VD+LG
Subjt:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEA-YGVERKEEHYACMVDLLG

Query:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI
        R+G+LE+AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  GCSSIEID +
Subjt:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI

Query:  HHVFGKEDESHPLTEQIFDTLVK-LERTVEEDFEP
         H F   D +HP++E+++  L + +E+     +EP
Subjt:  HHVFGKEDESHPLTEQIFDTLVK-LERTVEEDFEP

P0C899 Putative pentatricopeptide repeat-containing protein At3g491421.4e-9334.39Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAM-----LHDQTACS-------
              G+ +HG  +K GL   LFVGN L++ Y KC  +  AR V DEMS RD+V WNS++ GY  N + D+A+ +   M      HD    +       
Subjt:  MKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAM-----LHDQTACS-------

Query:  ------------------------------------------------------PDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS
                                                              PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  ------------------------------------------------------PDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS

Query:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACM
        MYA CG +  ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   +K M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AG+WE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI

Query:  EIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE
        E++RI H F   D SHP +++I+  L  L + ++E
Subjt:  EIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.4e-9334.24Show/hide
Query:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF
        +SY +  +L+ C   +  +  Q++H  ++  G D + ++ T LI  Y ++G                           A  G    A+K+FD +  +DV 
Subjt:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF

Query:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G+ VH  +   G   +L + NALI  Y+KC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDI

Query:  VCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI
        + WN++I GYT      EA++LF  ML  ++  +P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VF+ I
Subjt:  VCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG A+ + ++F+ + ++G++PD + F+ LLS CSH+G++  G  I++ M + Y +  K EHY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPL
         M ++    ++ +LL AC++H N+EL +   E L  ++PEN G YV+L+++Y  AG+W + AK R LL D+ ++K  GCSSIEID + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPL

Query:  TEQIFDTLVKLERTVEE
          +I+  L ++E  +E+
Subjt:  TEQIFDTLVKLERTVEE

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic7.2e-9539.11Show/hide
Query:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFV
        L+  Y++ G+  +  A+ VF  + +R V  +  +I GYA  G   EA+ LF+EM   G   + YT   VL  C   +  D+GK VH  + +  L  D+FV
Subjt:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFV

Query:  GNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG
         NAL+  YAKC  ++ A  VF EM ++DI+ WN++I GY+ N   +EA+ LF+ +L ++   SPD  T+  +LPAC + SA   G  +H Y+++ G    
Subjt:  GNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG

Query:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVE
          + + L+ MYA CG + +A  +FD I  K+++ W+ +I  YGMHGF  EA+ +F  + + G++ D + F++LL  CSH+GL+ +G   +  M     +E
Subjt:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVE

Query:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI
           EHYAC+VD+L R G L +A  FIE MP+     ++GALL  CRIH++++LA++V EK+F L+PEN G YV++A++Y +A +WE   +LRK +  R +
Subjt:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI

Query:  RKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLE-RTVEEDFEPI
        RK  GCS IEI    ++F   D S+P TE I   L K+  R +EE + P+
Subjt:  RKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLE-RTVEEDFEPI

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic8.0e-10238.12Show/hide
Query:  LHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA----MK
        +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +VLKAC A    + 
Subjt:  LHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA----MK

Query:  NCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACV
        +  KGK +H H+++ G    +++   L+  YA+   V+ A  VF  M +R++V W++MIA Y  NGK  EA+  F  M+ +    SP++ T+V +L AC 
Subjt:  NCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACV

Query:  TKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTC
        + +A + G  +H Y+++ G++   P+ S L++MY  CG + + + VFDR+ D++V+ W+++I  YG+HG+  +A+ +F  +   G  P  V F+++L  C
Subjt:  TKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTC

Query:  SHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILAS
        SH GL+ +G  +++ M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L+P+NAG YV+LA 
Subjt:  SHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILAS

Query:  MYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE
        +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +PL EQI   LVKL   ++E
Subjt:  MYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.7e-9534.24Show/hide
Query:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF
        +SY +  +L+ C   +  +  Q++H  ++  G D + ++ T LI  Y ++G                           A  G    A+K+FD +  +DV 
Subjt:  DSYDYTNLLQRC---RTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGE--------------------------AKMGV---ARKVFDRLLERDVF

Query:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDI
         WN +I GYA  G + EAL LF +M  +    +  T   V+ AC    + + G+ VH  +   G   +L + NALI  Y+KC ++ETA  +F+ +  +D+
Subjt:  VWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDI

Query:  VCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI
        + WN++I GYT      EA++LF  ML  ++  +P++ T++ ILPAC    A  +G W+H Y+ K   G+   + L + LI MYA CG +  A  VF+ I
Subjt:  VCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIK--TGMEVGAPLGSCLISMYANCGHVNIARDVFDRI

Query:  DDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE
          K++  W+A+I  + MHG A+ + ++F+ + ++G++PD + F+ LLS CSH+G++  G  I++ M + Y +  K EHY CM+DLLG +G  ++A E I 
Subjt:  DDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIE

Query:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPL
         M ++    ++ +LL AC++H N+EL +   E L  ++PEN G YV+L+++Y  AG+W + AK R LL D+ ++K  GCSSIEID + H F   D+ HP 
Subjt:  GMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPL

Query:  TEQIFDTLVKLERTVEE
          +I+  L ++E  +E+
Subjt:  TEQIFDTLVKLERTVEE

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-9835.14Show/hide
Query:  SFFSDRSSRKYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA
        +F    S  + + + Y +  L++    + S+   Q LH   +      + F+A  LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   
Subjt:  SFFSDRSSRKYHRDSYDYTNLLQRCRTIRSV---QELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFA

Query:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKC-------------------------------QDV
        +AL LF +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y KC                               +D 
Subjt:  EALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKC-------------------------------QDV

Query:  ETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC
        E AR+V + M  +DIV WN++I+ Y  NGK +EA+I+FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ C
Subjt:  ETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANC

Query:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEA-YGVERKEEHYACMVDLLG
        G +  +R+VF+ ++ ++V VWSA+I    MHG  NEA++MF  ++E  +KP+ V F N+   CSH GL+ +   ++ +ME+ YG+  +E+HYAC+VD+LG
Subjt:  GHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEA-YGVERKEEHYACMVDLLG

Query:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI
        R+G+LE+AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  GCSSIEID +
Subjt:  RAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRI

Query:  HHVFGKEDESHPLTEQIFDTLVK-LERTVEEDFEP
         H F   D +HP++E+++  L + +E+     +EP
Subjt:  HHVFGKEDESHPLTEQIFDTLVK-LERTVEEDFEP

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-10338.12Show/hide
Query:  LHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA----MK
        +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +VLKAC A    + 
Subjt:  LHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA----MK

Query:  NCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACV
        +  KGK +H H+++ G    +++   L+  YA+   V+ A  VF  M +R++V W++MIA Y  NGK  EA+  F  M+ +    SP++ T+V +L AC 
Subjt:  NCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACV

Query:  TKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTC
        + +A + G  +H Y+++ G++   P+ S L++MY  CG + + + VFDR+ D++V+ W+++I  YG+HG+  +A+ +F  +   G  P  V F+++L  C
Subjt:  TKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTC

Query:  SHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILAS
        SH GL+ +G  +++ M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L+P+NAG YV+LA 
Subjt:  SHAGLIAKGHEIYKKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILAS

Query:  MYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE
        +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +PL EQI   LVKL   ++E
Subjt:  MYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.7e-9534.39Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAM-----LHDQTACS-------
              G+ +HG  +K GL   LFVGN L++ Y KC  +  AR V DEMS RD+V WNS++ GY  N + D+A+ +   M      HD    +       
Subjt:  MKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAM-----LHDQTACS-------

Query:  ------------------------------------------------------PDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS
                                                              PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  ------------------------------------------------------PDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLIS

Query:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACM
        MYA CG +  ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   +K M + Y +  + EH ACM
Subjt:  MYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AG+WE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSI

Query:  EIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE
        E++RI H F   D SHP +++I+  L  L + ++E
Subjt:  EIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein5.1e-9639.11Show/hide
Query:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFV
        L+  Y++ G+  +  A+ VF  + +R V  +  +I GYA  G   EA+ LF+EM   G   + YT   VL  C   +  D+GK VH  + +  L  D+FV
Subjt:  LIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVVIQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFV

Query:  GNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG
         NAL+  YAKC  ++ A  VF EM ++DI+ WN++I GY+ N   +EA+ LF+ +L ++   SPD  T+  +LPAC + SA   G  +H Y+++ G    
Subjt:  GNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGKVDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVG

Query:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVE
          + + L+ MYA CG + +A  +FD I  K+++ W+ +I  YGMHGF  EA+ +F  + + G++ D + F++LL  CSH+GL+ +G   +  M     +E
Subjt:  APLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNMFTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKM-EAYGVE

Query:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI
           EHYAC+VD+L R G L +A  FIE MP+     ++GALL  CRIH++++LA++V EK+F L+PEN G YV++A++Y +A +WE   +LRK +  R +
Subjt:  RKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNI

Query:  RKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLE-RTVEEDFEPI
        RK  GCS IEI    ++F   D S+P TE I   L K+  R +EE + P+
Subjt:  RKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLE-RTVEEDFEPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGCTCTGCCGGCGATCTTCGCAAGCTCTGGTATCGAAAAATCAACCATGTTTCAGATTTCTAAGATCCTTCTTCAGTGATCGGAGTTCTAGGAAGTATCATCG
CGATTCATACGATTACACGAACTTATTACAACGTTGTAGAACGATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTTGAAGGCCACGACCAAAATGGATTCTTAG
CCACGAAGCTAATCGGCAAATACGCCGAGCATGGTGAGGCGAAAATGGGAGTTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTGGAATGTGGTT
ATTCAAGGGTATGCGAACTGGGGTCCGTTTGCCGAAGCCCTCAACCTGTTTGATGAAATGCGAGTCAGTGGCGAACCCACTAATCGCTATACATTCCCTTTTGTGTTGAA
GGCATGTGGCGCAATGAAGAACTGTGACAAGGGGAAGATTGTTCATGGACATGTTTCGAAATGTGGGTTGGACTTGGATCTGTTCGTGGGCAATGCTCTGATTGCGTTTT
ATGCCAAGTGCCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGAAATGTCTCTGAGAGATATTGTGTGTTGGAACTCCATGATTGCTGGGTATACTTCGAATGGCAAA
GTGGATGAAGCTATTATCCTTTTCCATGCGATGCTGCATGACCAAACTGCTTGTTCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTACAAAATCTGC
TTCCCAAGTTGGCTTCTGGGTTCATTCCTATGTTATAAAAACAGGAATGGAAGTTGGGGCCCCGTTGGGCAGTTGCCTTATTTCAATGTATGCTAACTGTGGTCATGTGA
ACATTGCAAGAGATGTTTTCGACCGAATCGACGACAAAAACGTCATCGTATGGAGTGCGATCATAAGGTGTTATGGAATGCATGGTTTTGCAAATGAGGCATTAAACATG
TTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGACCGCGTGATCTTCCTGAATTTGTTGTCGACATGTAGTCACGCAGGGCTCATTGCGAAAGGCCACGAGATATACAA
AAAAATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCAGTCGAGTTCATTGAAGGCA
TGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACACAACAACATCGAGCTAGCTAAAGAAGTTGGCGAGAAGTTGTTCGTCTTGGAT
CCCGAAAACGCGGGGCGATACGTGATCCTGGCTAGTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCCAAACTGAGGAAGCTGCTGAGAGATAGGAATATTAGGAA
GCCAGTTGGTTGTAGCTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTAACAGAACAAATTTTTGACACATTGGTGAAGCTAG
AAAGGACAGTTGAGGAAGATTTTGAACCTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGGCTCTGCCGGCGATCTTCGCAAGCTCTGGTATCGAAAAATCAACCATGTTTCAGATTTCTAAGATCCTTCTTCAGTGATCGGAGTTCTAGGAAGTATCATCG
CGATTCATACGATTACACGAACTTATTACAACGTTGTAGAACGATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTTGAAGGCCACGACCAAAATGGATTCTTAG
CCACGAAGCTAATCGGCAAATACGCCGAGCATGGTGAGGCGAAAATGGGAGTTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTGGAATGTGGTT
ATTCAAGGGTATGCGAACTGGGGTCCGTTTGCCGAAGCCCTCAACCTGTTTGATGAAATGCGAGTCAGTGGCGAACCCACTAATCGCTATACATTCCCTTTTGTGTTGAA
GGCATGTGGCGCAATGAAGAACTGTGACAAGGGGAAGATTGTTCATGGACATGTTTCGAAATGTGGGTTGGACTTGGATCTGTTCGTGGGCAATGCTCTGATTGCGTTTT
ATGCCAAGTGCCAGGACGTTGAAACTGCTCGTAAAGTGTTTGATGAAATGTCTCTGAGAGATATTGTGTGTTGGAACTCCATGATTGCTGGGTATACTTCGAATGGCAAA
GTGGATGAAGCTATTATCCTTTTCCATGCGATGCTGCATGACCAAACTGCTTGTTCACCTGATAATGCTACTCTTGTTGGGATTCTGCCTGCTTGTGTTACAAAATCTGC
TTCCCAAGTTGGCTTCTGGGTTCATTCCTATGTTATAAAAACAGGAATGGAAGTTGGGGCCCCGTTGGGCAGTTGCCTTATTTCAATGTATGCTAACTGTGGTCATGTGA
ACATTGCAAGAGATGTTTTCGACCGAATCGACGACAAAAACGTCATCGTATGGAGTGCGATCATAAGGTGTTATGGAATGCATGGTTTTGCAAATGAGGCATTAAACATG
TTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGACCGCGTGATCTTCCTGAATTTGTTGTCGACATGTAGTCACGCAGGGCTCATTGCGAAAGGCCACGAGATATACAA
AAAAATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCAGTCGAGTTCATTGAAGGCA
TGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACACAACAACATCGAGCTAGCTAAAGAAGTTGGCGAGAAGTTGTTCGTCTTGGAT
CCCGAAAACGCGGGGCGATACGTGATCCTGGCTAGTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCCAAACTGAGGAAGCTGCTGAGAGATAGGAATATTAGGAA
GCCAGTTGGTTGTAGCTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTAACAGAACAAATTTTTGACACATTGGTGAAGCTAG
AAAGGACAGTTGAGGAAGATTTTGAACCTATTTGA
Protein sequenceShow/hide protein sequence
MNGLCRRSSQALVSKNQPCFRFLRSFFSDRSSRKYHRDSYDYTNLLQRCRTIRSVQELHAQIIVEGHDQNGFLATKLIGKYAEHGEAKMGVARKVFDRLLERDVFVWNVV
IQGYANWGPFAEALNLFDEMRVSGEPTNRYTFPFVLKACGAMKNCDKGKIVHGHVSKCGLDLDLFVGNALIAFYAKCQDVETARKVFDEMSLRDIVCWNSMIAGYTSNGK
VDEAIILFHAMLHDQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEVGAPLGSCLISMYANCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFANEALNM
FTSLEEVGLKPDRVIFLNLLSTCSHAGLIAKGHEIYKKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLD
PENAGRYVILASMYEDAGQWEDAAKLRKLLRDRNIRKPVGCSSIEIDRIHHVFGKEDESHPLTEQIFDTLVKLERTVEEDFEPI