; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015675 (gene) of Snake gourd v1 genome

Gene IDTan0015675
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG01:28312778..28314550
RNA-Seq ExpressionTan0015675
SyntenyTan0015675
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596261.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.7e-28591.41Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG  RRSSRAL+S+N+PFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSVQELHAQI+VEGHDQNGFL TKLIGKYAE+GE KM IARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVFVWNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGME+GAPLGSCLISMYANCGH+ VARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

XP_022942131.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita moschata]1.1e-28491.22Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG  RRSSRAL+S+N+PFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSV+ELHAQI+VEGHDQNGFL TKLIGKYAE+GE KM IARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVFVWNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGME+GAPLGSCLISMYANCGH+ VARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

XP_022971604.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita maxima]7.3e-28691.6Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG  RR SRAL+S+ QPFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSVQELHAQI+VEGHDQNGFL TKLIGKYAE+GE KMGIARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVFVWNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKNS+KGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGME+GAPLGSCLISMYANCGH+ +ARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

XP_023540099.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo]3.1e-28490.65Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG   RSSRAL+S+N+PFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSVQELHAQI+VEGHDQNGFL TKLIGKYAE+GE KMGIARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVF+WNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIM+FHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSY+IKTGME+GAPLGSCLISMYANCGH+ +ARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYE+MEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

XP_038905146.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benincasa hispida]1.3e-27487.4Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MN  CRR+ +AL S+NQP FR LRS FSDRS  +YHRDSYDYT LL HCRTIRSVQ LHAQIIVEG DQNGFL TKLIGKY EHGE+KMGIARKVFD+LL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        +RDVFVWNVVIQGYAN GPF EAL LFDEMRVSG PTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFV NALI+FYAK QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLN KV++AIMLFHAMLHNQ+ACSPD+ATL+GILPACVTKSASQVGFWVHSYVIKTGME+GA LGSCLIS+YANCGH+ +ARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RI+DKNVIVWSAIIRCYGMHGFA+EALNMFT LEE GLKPDGV+FLNLLSTCSHAGL+AKGHEIYEKME YGVERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIH NIELAKE+GEKLF+LD +NAGRY+ILASMYEDAG+WEDAAKLRKLLRDR +RKP+G SSIE+DRIHHVFGKEDE+H
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLE IME DFEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

TrEMBL top hitse value%identityAlignment
A0A0A0L101 Uncharacterized protein2.6e-25782.32Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEH--GEAKMGIARKVFDR
        MNG CRR+ +AL  +NQPF       FS+ S  +YHRDSY +  LL HCR+IRSVQELHAQI+VEG DQNGF+  KLIGKY EH  GE KMG ARKVFD 
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEH--GEAKMGIARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFD
        L+ RDVFVWNVVIQGYA+ GPF EAL LFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKG+IVHGHV+KCGLDLDLFVGNALI+FY+K QDVETAR+VFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFD

Query:  EMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARD
        +MSLRDIVSWNSMI GYTLNGK DEAIM FHAMLHNQ  C+PD+ATLV ILPAC TKSASQVGFWVHSYVIKTG+E+GAPLGSCLI MY NCGH+ +ARD
Subjt:  EMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARD

Query:  VFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VFDRI+DKNVIVWSAIIRCYGMHGFA+EA NMF  LEE G+KPDG+IFLNLLS CSHAGLVAKGHEIYEKMEAYG+ERK+ HYACMVDLLGRAGFLEQAV
Subjt:  VFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDE
        EFIEGMPVQAGKDVYGALLGACRIHNN+ELAKEVGEKLF+LDPE A RYV LA+MYEDAG+WEDAAKLRKLLRDR IRKP G SSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDE

Query:  SHPFTEQIFYTLEKLERIMEEDFEPI
        +HP TE+IF TLEKLERIMEEDFEPI
Subjt:  SHPFTEQIFYTLEKLERIMEEDFEPI

A0A5A7UCB1 Pentatricopeptide repeat-containing protein3.7e-25180.04Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAE--HGEAKMGIARKVFDR
        MNG  RR+ +AL  ++QPF       FS+R   +YHRDSYD+  LL HCRTIRSVQELHAQI+VEG DQNGFL TKLIGKY E   GE+KMG ARKVFDR
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAE--HGEAKMGIARKVFDR

Query:  LLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFD
        LL+RDVF+WNVVIQGYA++GPF EAL LFDEMRVSGEPTNRYTFPFVLKACGA+KNSDKG+IVHG+V+KCGLDLDLFVGNALIS Y+K QDVETAR+VFD
Subjt:  LLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFD

Query:  EMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARD
        +MSLRD VSWNSMI GYTLN + D+AIM FHAMLHNQ  C PD+ATLV ILPAC TKSASQVGFWVHSY+IKTGME+GA LGSCLISMY NCGH+ +ARD
Subjt:  EMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARD

Query:  VFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV
        VF+RI++KNVIVWSAIIRCYGMHG A+EALNMF  LEEVG+KPDG+IFLNLLS CSHAGLVAKG EIY+KMEAYG+ER ++HYACMVDLL RAGFLEQA 
Subjt:  VFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAV

Query:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDE
        EFIE MPVQAGKDVYGAL GACRIHNN+ELAKEVGEKLF+LDPENAGRY+ILASMYEDAG+WEDAAK+RKLLRDR I+KP G SSIE+DRIHHVFGK+DE
Subjt:  EFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDE

Query:  SHPFTEQIFYTLEKLERIMEEDFEPI
        +HPFTE+IF T+EKLER+MEEDFEPI
Subjt:  SHPFTEQIFYTLEKLERIMEEDFEPI

A0A6J1CVV9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like6.9e-27486.83Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        M G  RR SR L+ KNQPFFRFLR   SDRSP EY RDSYDYTNLLQHCRTIRSVQELHAQIIVEG DQNGFL TKLIGKYAEHG++KMG+ARKVFDRL+
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        ERDVFVWNVVI+GYANWGPF EAL LFDEMRV+G PTNRYTFPFVLKACGAMKN DKGK+VHGHVLK GLDLDLFVGNALI+FYAK  D+ET R+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
         L+DIVSWNSMIAG+T NGKVDEAIMLFHAM+HNQ ACSPDNATLVGILPACV+KSA+QVGFWVHSYVIKTGM++GAPLGSCLISMYANCGH+ +ARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        +RI+DKNVIVWSA+IRCYGMHG A+EAL MFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKG +IYEKME YGVERKEEHYACMVDLLGRAGF++QAV+F
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMP+QAGKDVYGALLGACRIHNNIE+AKE  EKLFVLDPENAGRYVILASM+EDAG+WEDAAKLRKLLRDRKI+KP+G SSIEIDRIHHVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PF+EQIF TLEKLERIMEEDFEP+
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

A0A6J1FMZ9 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like5.1e-28591.22Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG  RRSSRAL+S+N+PFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSV+ELHAQI+VEGHDQNGFL TKLIGKYAE+GE KM IARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVFVWNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKN DKGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGME+GAPLGSCLISMYANCGH+ VARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEE GLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAK+VG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

A0A6J1I919 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like3.5e-28691.6Show/hide
Query:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL
        MNG  RR SRAL+S+ QPFFRFLRSIF+D S REYHRDSYDYT LLQHCR+IRSVQELHAQI+VEGHDQNGFL TKLIGKYAE+GE KMGIARKVFDRLL
Subjt:  MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLL

Query:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM
        E+DVFVWNVVIQGYANWGPFAEAL L+DEMRV GEPTNRYTFPFVLKACGAMKNS+KGKIVHGHVLKCGLDLDLFVGNALI+FY+K QDVETAR+VFDEM
Subjt:  ERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEM

Query:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF
        SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGME+GAPLGSCLISMYANCGH+ +ARDVF
Subjt:  SLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVF

Query:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF
        DRI DKNVIVWSAIIR YGMHGFA+EALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLV KG EIYEKMEAYG ERKEEHYACMVDLLGRAGFLEQAVEF
Subjt:  DRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEF

Query:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH
        IEGMPVQAGKDVYGALLGACRIHNNIELAKEVG+KLFVLDPENAGRYVILASMYEDAG+WEDAAKLRKLLRDR IRKP+G SSIEIDRI+HVFGKEDESH
Subjt:  IEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESH

Query:  PFTEQIFYTLEKLERIMEEDFEPI
        PFTEQIF TLEKLER+M+E+FEPI
Subjt:  PFTEQIFYTLEKLERIMEEDFEPI

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.5e-9535.54Show/hide
Query:  SPREYHRDSYDYTNLLQHCRTIRSV---QELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLF
        S  + + + Y +  L++    + S+   Q LH   +      + F+   LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   +AL LF
Subjt:  SPREYHRDSYDYTNLLQHCRTIRSV---QELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLF

Query:  DEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAK-------------------------------SQDVETAREV
         +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y K                               S+D E AREV
Subjt:  DEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAK-------------------------------SQDVETAREV

Query:  FDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVA
         + M  +DIV+WN++I+ Y  NGK +EA+++FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ CG L+ +
Subjt:  FDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVA

Query:  RDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLE
        R+VF+ +  ++V VWSA+I    MHG   EA++MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LGR+G+LE
Subjt:  RDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLE

Query:  QAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGK
        +AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  G SSIEID + H F  
Subjt:  QAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGK

Query:  EDESHPFTEQIFYTL-EKLERIMEEDFEP
         D +HP +E+++  L E +E++    +EP
Subjt:  EDESHPFTEQIFYTL-EKLERIMEEDFEP

P0C899 Putative pentatricopeptide repeat-containing protein At3g491424.4e-9234.95Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAM------------------LH
              G+ +HG   K GL   LFVGN L+S Y K   +  AR V DEMS RD+VSWNS++ GY  N + D+A+ +   M                  + 
Subjt:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAM------------------LH

Query:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLIS
        N T                                                   PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLIS

Query:  MYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM
        MYA CG L+ ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   ++ M + Y +  + EH ACM
Subjt:  MYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AGRWE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSI

Query:  EIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        E++RI H F   D SHP +++I+  L+ L + M+E
Subjt:  EIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic4.4e-9237.55Show/hide
Query:  HCRTIRSVQELHAQIIVEGH-DQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVS-GEPTNRYTFPFV
        H   +R+ +ELHA  +  G  D+N F+ + L+  Y    +   G  R+VFD + +R + +WN +I GY+      EAL LF  M  S G   N  T   V
Subjt:  HCRTIRSVQELHAQIIVEGH-DQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVS-GEPTNRYTFPFV

Query:  LKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLH---------NQT
        + AC       + + +HG V+K GLD D FV N L+  Y++   ++ A  +F +M  RD+V+WN+MI GY  +   ++A++L H M +         ++ 
Subjt:  LKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLH---------NQT

Query:  ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEE
        +  P++ TL+ ILP+C   SA   G  +H+Y IK  +     +GS L+ MYA CG L+++R VFD+I  KNVI W+ II  YGMHG  +EA+++   +  
Subjt:  ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEE

Query:  VGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNIELAKEVGE
         G+KP+ V F+++ + CSH+G+V +G  I+  M+  YGVE   +HYAC+VDLLGRAG +++A + +  MP    K   + +LLGA RIHNN+E+ +   +
Subjt:  VGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNIELAKEVGE

Query:  KLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKL-ERIMEEDFEP
         L  L+P  A  YV+LA++Y  AG W+ A ++R+ ++++ +RK  G S IE     H F   D SHP +E++   LE L ER+ +E + P
Subjt:  KLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKL-ERIMEEDFEP

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic5.2e-10138.16Show/hide
Query:  HCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLK
        H  ++     +H  I+  G DQ+ FL TKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +VLK
Subjt:  HCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLK

Query:  ACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNA
        AC A    + +  KGK +H H+ + G    +++   L+  YA+   V+ A  VF  M +R++VSW++MIA Y  NGK  EA+  F  M+      SP++ 
Subjt:  ACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNA

Query:  TLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDG
        T+V +L AC + +A + G  +H Y+++ G++   P+ S L++MY  CG L+V + VFDR++D++V+ W+++I  YG+HG+ ++A+ +F  +   G  P  
Subjt:  TLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDG

Query:  VIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPE
        V F+++L  CSH GLV +G  ++E M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L+P+
Subjt:  VIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPE

Query:  NAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        NAG YV+LA +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   L KL   M+E
Subjt:  NAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307002.0e-9238.68Show/hide
Query:  DYTNLLQHCRTIRSVQEL------HAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSG
        D T LL     +  +QEL      H+     G   + ++ T  I  Y++ G+ KMG A  +F    + D+  +N +I GY + G    +L LF E+ +SG
Subjt:  DYTNLLQHCRTIRSVQEL------HAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSG

Query:  EPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHN
              T   ++   G +        +HG+ LK        V  AL + Y+K  ++E+AR++FDE   + + SWN+MI+GYT NG  ++AI LF  M   
Subjt:  EPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHN

Query:  QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSL
        ++  SP+  T+  IL AC    A  +G WVH  V  T  E    + + LI MYA CG +  AR +FD +  KN + W+ +I  YG+HG  +EALN+F  +
Subjt:  QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSL

Query:  EEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVG
           G+ P  V FL +L  CSHAGLV +G EI+  M   YG E   +HYACMVD+LGRAG L++A++FIE M ++ G  V+  LLGACRIH +  LA+ V 
Subjt:  EEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVG

Query:  EKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        EKLF LDP+N G +V+L++++     +  AA +R+  + RK+ K  G++ IEI    HVF   D+SHP  ++I+  LEKLE  M E
Subjt:  EKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE

Arabidopsis top hitse value%identityAlignment
AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-9635.54Show/hide
Query:  SPREYHRDSYDYTNLLQHCRTIRSV---QELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLF
        S  + + + Y +  L++    + S+   Q LH   +      + F+   LI  Y   G+  +  A KVF  + E+DV  WN +I G+   G   +AL LF
Subjt:  SPREYHRDSYDYTNLLQHCRTIRSV---QELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLF

Query:  DEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAK-------------------------------SQDVETAREV
         +M       +  T   VL AC  ++N + G+ V  ++ +  ++++L + NA++  Y K                               S+D E AREV
Subjt:  DEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAK-------------------------------SQDVETAREV

Query:  FDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVA
         + M  +DIV+WN++I+ Y  NGK +EA+++FH  L  Q     +  TLV  L AC    A ++G W+HSY+ K G+ +   + S LI MY+ CG L+ +
Subjt:  FDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVA

Query:  RDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLE
        R+VF+ +  ++V VWSA+I    MHG   EA++MF  ++E  +KP+GV F N+   CSH GLV +   ++ +ME+ YG+  +E+HYAC+VD+LGR+G+LE
Subjt:  RDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLE

Query:  QAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGK
        +AV+FIE MP+     V+GALLGAC+IH N+ LA+    +L  L+P N G +V+L+++Y   G+WE+ ++LRK +R   ++K  G SSIEID + H F  
Subjt:  QAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGK

Query:  EDESHPFTEQIFYTL-EKLERIMEEDFEP
         D +HP +E+++  L E +E++    +EP
Subjt:  EDESHPFTEQIFYTL-EKLERIMEEDFEP

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-10238.16Show/hide
Query:  HCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLK
        H  ++     +H  I+  G DQ+ FL TKLIG Y++ G   +  ARKVFD+  +R ++VWN + +     G   E L L+ +M   G  ++R+T+ +VLK
Subjt:  HCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLK

Query:  ACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNA
        AC A    + +  KGK +H H+ + G    +++   L+  YA+   V+ A  VF  M +R++VSW++MIA Y  NGK  EA+  F  M+      SP++ 
Subjt:  ACGA----MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHNQTACSPDNA

Query:  TLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDG
        T+V +L AC + +A + G  +H Y+++ G++   P+ S L++MY  CG L+V + VFDR++D++V+ W+++I  YG+HG+ ++A+ +F  +   G  P  
Subjt:  TLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDG

Query:  VIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPE
        V F+++L  CSH GLV +G  ++E M   +G++ + EHYACMVDLLGRA  L++A + ++ M  + G  V+G+LLG+CRIH N+ELA+    +LF L+P+
Subjt:  VIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPE

Query:  NAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        NAG YV+LA +Y +A  W++  +++KLL  R ++K  G   +E+ R  + F   DE +P  EQI   L KL   M+E
Subjt:  NAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-9334.95Show/hide
Query:  IRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGA
        IR+++ +H++II+E    N  L  KL+  YA   +  +  ARKVFD + ER+V + NV+I+ Y N G + E + +F  M       + YTFP VLKAC  
Subjt:  IRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGA

Query:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAM------------------LH
              G+ +HG   K GL   LFVGN L+S Y K   +  AR V DEMS RD+VSWNS++ GY  N + D+A+ +   M                  + 
Subjt:  MKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAM------------------LH

Query:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLIS
        N T                                                   PD  ++  +LPAC   SA  +G  +H Y+ +  +     L + LI 
Subjt:  NQT------------------------------------------------ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLIS

Query:  MYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM
        MYA CG L+ ARDVF+ +  ++V+ W+A+I  YG  G   +A+ +F+ L++ GL PD + F+  L+ CSHAGL+ +G   ++ M + Y +  + EH ACM
Subjt:  MYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACM

Query:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSI
        VDLLGRAG +++A  FI+ M ++  + V+GALLGACR+H++ ++     +KLF L PE +G YV+L+++Y  AGRWE+   +R +++ + ++K  G S++
Subjt:  VDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSI

Query:  EIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        E++RI H F   D SHP +++I+  L+ L + M+E
Subjt:  EIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-9337.55Show/hide
Query:  HCRTIRSVQELHAQIIVEGH-DQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVS-GEPTNRYTFPFV
        H   +R+ +ELHA  +  G  D+N F+ + L+  Y    +   G  R+VFD + +R + +WN +I GY+      EAL LF  M  S G   N  T   V
Subjt:  HCRTIRSVQELHAQIIVEGH-DQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVS-GEPTNRYTFPFV

Query:  LKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLH---------NQT
        + AC       + + +HG V+K GLD D FV N L+  Y++   ++ A  +F +M  RD+V+WN+MI GY  +   ++A++L H M +         ++ 
Subjt:  LKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLH---------NQT

Query:  ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEE
        +  P++ TL+ ILP+C   SA   G  +H+Y IK  +     +GS L+ MYA CG L+++R VFD+I  KNVI W+ II  YGMHG  +EA+++   +  
Subjt:  ACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSLEE

Query:  VGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNIELAKEVGE
         G+KP+ V F+++ + CSH+G+V +G  I+  M+  YGVE   +HYAC+VDLLGRAG +++A + +  MP    K   + +LLGA RIHNN+E+ +   +
Subjt:  VGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEA-YGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGK-DVYGALLGACRIHNNIELAKEVGE

Query:  KLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKL-ERIMEEDFEP
         L  L+P  A  YV+LA++Y  AG W+ A ++R+ ++++ +RK  G S IE     H F   D SHP +E++   LE L ER+ +E + P
Subjt:  KLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKL-ERIMEEDFEP

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-9338.68Show/hide
Query:  DYTNLLQHCRTIRSVQEL------HAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSG
        D T LL     +  +QEL      H+     G   + ++ T  I  Y++ G+ KMG A  +F    + D+  +N +I GY + G    +L LF E+ +SG
Subjt:  DYTNLLQHCRTIRSVQEL------HAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVVIQGYANWGPFAEALYLFDEMRVSG

Query:  EPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHN
              T   ++   G +        +HG+ LK        V  AL + Y+K  ++E+AR++FDE   + + SWN+MI+GYT NG  ++AI LF  M   
Subjt:  EPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGKVDEAIMLFHAMLHN

Query:  QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSL
        ++  SP+  T+  IL AC    A  +G WVH  V  T  E    + + LI MYA CG +  AR +FD +  KN + W+ +I  YG+HG  +EALN+F  +
Subjt:  QTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNMFTSL

Query:  EEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVG
           G+ P  V FL +L  CSHAGLV +G EI+  M   YG E   +HYACMVD+LGRAG L++A++FIE M ++ G  V+  LLGACRIH +  LA+ V 
Subjt:  EEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKM-EAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVG

Query:  EKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE
        EKLF LDP+N G +V+L++++     +  AA +R+  + RK+ K  G++ IEI    HVF   D+SHP  ++I+  LEKLE  M E
Subjt:  EKLFVLDPENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGTTCTGCCGGCGATCTTCGCGAGCTCTGTTATCCAAAAACCAACCATTCTTCAGGTTTCTAAGATCCATCTTTAGCGATCGGAGTCCTAGGGAGTATCATCG
CGATTCGTACGATTACACGAACTTATTACAACATTGCAGAACCATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTTGAGGGTCACGACCAAAATGGATTCTTAA
CCACGAAGCTAATCGGCAAATACGCCGAACATGGCGAGGCAAAAATGGGAATTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTGGAATGTGGTT
ATTCAAGGGTATGCGAATTGGGGTCCGTTTGCTGAAGCCCTATACCTGTTTGATGAAATGCGAGTCAGTGGCGAACCCACCAATCGCTACACATTTCCTTTTGTGTTGAA
GGCATGTGGCGCAATGAAGAACAGTGACAAGGGGAAGATTGTTCATGGGCACGTTTTGAAATGTGGGTTGGACTTGGATTTGTTCGTGGGCAATGCTCTGATTTCGTTCT
ATGCGAAGTCCCAGGATGTTGAAACAGCTCGTGAAGTGTTTGATGAAATGTCTCTGAGAGACATTGTGAGTTGGAACTCCATGATTGCTGGGTATACTTTGAATGGGAAA
GTGGATGAAGCTATTATGCTTTTCCATGCCATGCTGCATAATCAAACTGCTTGCTCACCTGATAATGCAACTCTTGTTGGGATTCTGCCTGCTTGTGTTACAAAATCAGC
TTCCCAAGTAGGCTTCTGGGTTCATTCCTATGTTATAAAGACAGGAATGGAAATTGGGGCTCCGTTGGGCAGTTGCCTTATCTCAATGTATGCTAACTGTGGTCACTTGA
AAGTTGCGAGAGACGTTTTCGACCGAATCAACGACAAAAACGTCATCGTATGGAGTGCAATCATAAGGTGTTATGGAATGCATGGTTTTGCAGAAGAGGCATTAAACATG
TTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGATGGTGTGATCTTCCTGAATTTGTTGTCGACGTGTAGTCACGCAGGGCTCGTAGCGAAAGGCCACGAGATATACGA
AAAGATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGAGCTGGTTTCTTAGAACAAGCAGTAGAGTTCATTGAAGGCA
TGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACATAACAACATAGAGCTAGCTAAAGAAGTTGGGGAGAAGTTGTTCGTTTTGGAT
CCCGAAAACGCAGGGCGATACGTGATCTTAGCTAGTATGTATGAAGATGCAGGACGGTGGGAAGATGCTGCTAAACTAAGGAAGTTGCTGAGAGATAGGAAGATTAGGAA
GCCAATTGGTTTCAGTTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTTACACATTGGAGAAGCTAG
AAAGGATAATGGAGGAAGATTTTGAACCTATTTAA
mRNA sequenceShow/hide mRNA sequence
CACAAATCGGACACGTCGTCCCGTACCGGAAAACGGATCGGCGTTTGAACGCTGCCGCAATTACATTTACAGTAATCTTTTCCGCGATTTCCGTTTCAAGTTTTTGCTTT
GAAGAATCCAATGAATGGGTTCTGCCGGCGATCTTCGCGAGCTCTGTTATCCAAAAACCAACCATTCTTCAGGTTTCTAAGATCCATCTTTAGCGATCGGAGTCCTAGGG
AGTATCATCGCGATTCGTACGATTACACGAACTTATTACAACATTGCAGAACCATCAGAAGCGTTCAAGAACTACATGCCCAGATCATCGTTGAGGGTCACGACCAAAAT
GGATTCTTAACCACGAAGCTAATCGGCAAATACGCCGAACATGGCGAGGCAAAAATGGGAATTGCACGGAAGGTGTTCGATAGATTGCTTGAAAGAGATGTGTTCGTGTG
GAATGTGGTTATTCAAGGGTATGCGAATTGGGGTCCGTTTGCTGAAGCCCTATACCTGTTTGATGAAATGCGAGTCAGTGGCGAACCCACCAATCGCTACACATTTCCTT
TTGTGTTGAAGGCATGTGGCGCAATGAAGAACAGTGACAAGGGGAAGATTGTTCATGGGCACGTTTTGAAATGTGGGTTGGACTTGGATTTGTTCGTGGGCAATGCTCTG
ATTTCGTTCTATGCGAAGTCCCAGGATGTTGAAACAGCTCGTGAAGTGTTTGATGAAATGTCTCTGAGAGACATTGTGAGTTGGAACTCCATGATTGCTGGGTATACTTT
GAATGGGAAAGTGGATGAAGCTATTATGCTTTTCCATGCCATGCTGCATAATCAAACTGCTTGCTCACCTGATAATGCAACTCTTGTTGGGATTCTGCCTGCTTGTGTTA
CAAAATCAGCTTCCCAAGTAGGCTTCTGGGTTCATTCCTATGTTATAAAGACAGGAATGGAAATTGGGGCTCCGTTGGGCAGTTGCCTTATCTCAATGTATGCTAACTGT
GGTCACTTGAAAGTTGCGAGAGACGTTTTCGACCGAATCAACGACAAAAACGTCATCGTATGGAGTGCAATCATAAGGTGTTATGGAATGCATGGTTTTGCAGAAGAGGC
ATTAAACATGTTCACAAGTTTGGAAGAAGTTGGTCTAAAACCAGATGGTGTGATCTTCCTGAATTTGTTGTCGACGTGTAGTCACGCAGGGCTCGTAGCGAAAGGCCACG
AGATATACGAAAAGATGGAGGCTTATGGTGTGGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGAGCTGGTTTCTTAGAACAAGCAGTAGAGTTC
ATTGAAGGCATGCCAGTGCAGGCAGGAAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACATAACAACATAGAGCTAGCTAAAGAAGTTGGGGAGAAGTTGTT
CGTTTTGGATCCCGAAAACGCAGGGCGATACGTGATCTTAGCTAGTATGTATGAAGATGCAGGACGGTGGGAAGATGCTGCTAAACTAAGGAAGTTGCTGAGAGATAGGA
AGATTAGGAAGCCAATTGGTTTCAGTTCAATAGAGATAGATAGGATTCATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTTACACATTG
GAGAAGCTAGAAAGGATAATGGAGGAAGATTTTGAACCTATTTAATGGAATGCAGTTATTTTCCCTTCAATTGTTCTTTTATTTTTATTTTTCTTTTTCTTTTCCCCCTC
CATTGGAAGCTCC
Protein sequenceShow/hide protein sequence
MNGFCRRSSRALLSKNQPFFRFLRSIFSDRSPREYHRDSYDYTNLLQHCRTIRSVQELHAQIIVEGHDQNGFLTTKLIGKYAEHGEAKMGIARKVFDRLLERDVFVWNVV
IQGYANWGPFAEALYLFDEMRVSGEPTNRYTFPFVLKACGAMKNSDKGKIVHGHVLKCGLDLDLFVGNALISFYAKSQDVETAREVFDEMSLRDIVSWNSMIAGYTLNGK
VDEAIMLFHAMLHNQTACSPDNATLVGILPACVTKSASQVGFWVHSYVIKTGMEIGAPLGSCLISMYANCGHLKVARDVFDRINDKNVIVWSAIIRCYGMHGFAEEALNM
FTSLEEVGLKPDGVIFLNLLSTCSHAGLVAKGHEIYEKMEAYGVERKEEHYACMVDLLGRAGFLEQAVEFIEGMPVQAGKDVYGALLGACRIHNNIELAKEVGEKLFVLD
PENAGRYVILASMYEDAGRWEDAAKLRKLLRDRKIRKPIGFSSIEIDRIHHVFGKEDESHPFTEQIFYTLEKLERIMEEDFEPI