CuGenDBv2

MC05g0268 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0268
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: glycine-rich protein A3
Genome location: MC05:1968122..1971911
RNA-Seq Expression: MC05g0268
Synteny: MC05g0268
Gene Ontology terms: NA
InterPro domains: NA
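
The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of splitting it into its parts is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name rather than a CuGenDB function.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC05:1968122..1971911")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region covers 3790 bp, which includes untranslated regions and any introns.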


Homology
GenBank top hits | e value | %identity | Alignment
XP_004133933.1 glycine-rich protein A3 [Cucumis sativus] | 2.75e-77 | 75
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---
        MGGGKE +  ++GLFS++ AFAAGHHYPH+HGYPPPP  Y GA YPPPGGYPP GYP      P GY P+GGHP +AYP PGGYPPAGYPGPHH+PG   
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---

Query:  ---HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
           HGHG+G +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKFKHGKFGKRWKHGG  M+FK+WK
Subjt:  ---HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

XP_008438206.1 PREDICTED: glycine-rich protein A3 [Cucumis melo] | 3.85e-76 | 75.3
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---
        MGGGKE +  ++GLFS++ AFAAGHHYPH+HGYPPPP  Y GAGYPPPGGYPPAGYP      PAGY P+GGHP +AYPY GGYPPAGYPGPHH+PG   
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---

Query:  -HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
         HGHG+G +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKF     GKRWKHGGKF +FK+WK
Subjt:  -HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

XP_022147022.1 glycine-rich protein A3 [Momordica charantia] | 4.06e-114 | 100
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH
        MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH

Query:  GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
        GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
Subjt:  GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

XP_022980434.1 glycine-rich protein A3-like [Cucurbita maxima] | 8.19e-72 | 72.16
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------
        MGGGKE++S ++GLFS + +FAAG HY H HGYPPPPP Y GAGYPPPGGYPPAGYPPPGGYPPA Y P GG+PP+AYP PGGYPPAGYP          
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------

Query:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
          PHH+PGHG    GMG +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKF     GKRWKHGG   KFK+WK
Subjt:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

XP_038883140.1 glycine-rich protein A3 [Benincasa hispida] | 6.62e-72 | 76.22
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHG-
        MGGGKE DS+++GLFS++   AAG HY   HGYPPP   Y GAGYPPPGGYPPAGYP      PAGY P+GGHP +AYPY GGYPPAGY GPHH+PGHG 
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHG-

Query:  -HGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
         HGMG +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKFKHGKFGKRW HGGK   FKRWK
Subjt:  -HGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

TrEMBL top hits | e value | %identity | Alignment
A0A0A0L4F2 Uncharacterized protein | 1.33e-77 | 75
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---
        MGGGKE +  ++GLFS++ AFAAGHHYPH+HGYPPPP  Y GA YPPPGGYPP GYP      P GY P+GGHP +AYP PGGYPPAGYPGPHH+PG   
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---

Query:  ---HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
           HGHG+G +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKFKHGKFGKRWKHGG  M+FK+WK
Subjt:  ---HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

A0A1S3AWG9 glycine-rich protein A3 | 1.86e-76 | 75.3
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---
        MGGGKE +  ++GLFS++ AFAAGHHYPH+HGYPPPP  Y GAGYPPPGGYPPAGYP      PAGY P+GGHP +AYPY GGYPPAGYPGPHH+PG   
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPG---

Query:  -HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
         HGHG+G +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKF     GKRWKHGGKF +FK+WK
Subjt:  -HGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

A0A6J1D180 glycine-rich protein A3 | 1.97e-114 | 100
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH
        MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGH

Query:  GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
        GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
Subjt:  GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

A0A6J1E983 glycine-rich protein A3-like | 7.61e-70 | 71.02
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------
        MGGGKE++S ++GLFS + +FAAG HY H HGYPPPPP Y GAGYPPPGGYPPAGYPPPGGYPPA Y P GG+PP+AY  PGGYPPAGYP          
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------

Query:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
          PHH+PGHG    GMG +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKF     GKRWKHG    KFK+WK
Subjt:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

A0A6J1IZ97 glycine-rich protein A3-like | 3.96e-72 | 72.16
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------
        MGGGKE++S ++GLFS + +FAAG HY H HGYPPPPP Y GAGYPPPGGYPPAGYPPPGGYPPA Y P GG+PP+AYP PGGYPPAGYP          
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYP----------

Query:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK
          PHH+PGHG    GMG +LAGGAAAAAAAYGAHHL HARPFGFGHGKFKHGKF     GKRWKHGG   KFK+WK
Subjt:  -GPHHHPGHGH---GMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK

SwissProt top hits | e value | %identity | Alignment
O16005 Rhodopsin | 1.6e-05 | 58.06
Query:  PGAGYPPPGGYPPAGYPPP---GGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHG
        P   YPP GGYPP GYPPP   GGYPP GY P    PP  YP   GYPP GYP P   P  G
Subjt:  PGAGYPPPGGYPPAGYPPP---GGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHG

P37705 Glycine-rich protein A3 | 5.8e-19 | 53.47
Query:  MGGGKEKDSDERGLFSHL-GAFAAGHHYPHNHGYPPPPPPYPGAGYPPP-GGYPPAGYPPP-GGYPPAGYHPHGGHPPSAYPYPGGYPPAGY---PGPHH
        MGGG   +  ++GLFS+L G  A G HYP    YPP    YP  GYPP  GGYPP GYPP  GGYPP GY P GG     YP P GYPPAG+       H
Subjt:  MGGGKEKDSDERGLFSHL-GAFAAGHHYPHNHGYPPPPPPYPGAGYPPP-GGYPPAGYPPP-GGYPPAGYHPHGGHPPSAYPYPGGYPPAGY---PGPHH

Query:  HPGHGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHG
        H GHG G+  M+AGG AAAAAAYG HH+        GHG + HG
Subjt:  HPGHGHGMGAMLAGGAAAAAAAYGAHHLVHARPFGFGHGKFKHG
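
The %identity column can be cross-checked against the alignment midline, where a letter marks an identical residue and a space or `+` marks a mismatch, gap, or conservative substitution. The sketch below assumes identity is computed as identical columns divided by total alignment columns. That definition is an assumption rather than something the page states, and `percent_identity` is a hypothetical helper name. The midline used is the one from the O16005 alignment above, with its leading indent removed.

```python
def percent_identity(midline):
    """Percentage of alignment columns whose midline holds a residue letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the O16005 (Rhodopsin) alignment: 62 columns, indent removed.
o16005_midline = "P   YPP GGYPP GYPPP   GGYPP GY P    PP  YP   GYPP GYP P   P  G"
print(round(percent_identity(o16005_midline), 2))  # 36 of 62 columns -> 58.06
```

For this hit the result agrees with the listed 58.06. For alignments split over several blocks, the midlines would need to be concatenated with their internal and trailing spaces preserved before counting.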

Arabidopsis top hits | e value | %identity | Alignment
AT1G31750.1 proline-rich family protein | 3.7e-21 | 55.24
Query:  HHYPHNHGYPPPP-PPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAY-PYPGGYPPAGYPGPH-HHPGHGHGMGAMLAGGAAAAAAAYGAHH
        HH  H HGYPP   PP P   YPPPGGYPP GYPPP         PH G+PP+AY P PG YPPAGYPGP    PG G G+G ++AG A AAAAA G HH
Subjt:  HHYPHNHGYPPPP-PPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAY-PYPGGYPPAGYPGPH-HHPGHGHGMGAMLAGGAAAAAAAYGAHH

Query:  LVHARPFG-FGHGKFKHGKFKHGKFGKRWKH----GGKFMKFK
          H   +G  GHGK+K G F  GK+ KR KH    GGK+ + K
Subjt:  LVHARPFG-FGHGKFKHGKFKHGKFGKRWKH----GGKFMKFK

AT4G19200.1 proline-rich family protein | 7.1e-28 | 53.68
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGP-HHHPGH-
        MGGGK+K  DE+    H G    GH+ P   GYPP   P P  GYPP GGYPPAGY PPG YP A     GG+PP+    PGGYPPAGYP P  HH GH 
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGP-HHHPGH-

Query:  GHGMGAMLAGGAAAAAAAYGAHHLVHA--RPFGF------------------GHGKFKH----GKFKHGKFGKRWKHG--GKFMKFKRWK
        G G+G M+AG A AAAAAYGAHH+ HA   P+G                   GHGKFKH    GKFKHGK GK  KHG  G   KFK+WK
Subjt:  GHGMGAMLAGGAAAAAAAYGAHHLVHA--RPFGF------------------GHGKFKH----GKFKHGKFGKRWKHG--GKFMKFKRWK

AT5G17650.1 glycine/proline-rich protein | 7.8e-27 | 50.82
Query:  GGKEKDSDERGLFSHLGAFAAGHHYPHNHGYP----------PPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGP
        G  + +  +RG F +L  FA G + PH HGY           P PPP      PPP GYPP  YPP GGYPPAGY P  G+PP+ YP   GYP  GYP P
Subjt:  GGKEKDSDERGLFSHLGAFAAGHHYPHNHGYP----------PPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGP

Query:  HHHPGHGHGMGAMLAGGAAAAAAA---------YGAHHLVHARPFGF-GHGKFKHGKFKHGKFGKR---WKHGGKFMKFKRWK
         H   H  G+GA++AGG AAAA A         YG HH  H   +G+ GHGKFKHGKFKHGKFGK     KH GKF  FK+WK
Subjt:  HHHPGHGHGMGAMLAGGAAAAAAA---------YGAHHLVHARPFGF-GHGKFKHGKFKHGKFGKR---WKHGGKFMKFKRWK

AT5G45350.1 proline-rich family protein | 1.7e-21 | 49.49
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPP----PPPPYPGAGY-PPPGGYPPAGYPP------PGGYPPA----GYHP---HGGHPPSAYPYPG
        MGG  + D D+             H YP   GYPP    PP  YP  GY PPPG YPPAGYPP      PGGYPPA    GY P   +GG+PP+  P  G
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPP----PPPPYPGAGY-PPPGGYPPAGYPP------PGGYPPA----GYHP---HGGHPPSAYPYPG

Query:  GYPPAGYPGPHHHPGHGHGMGAMLAGGAAAAAAAYGAHHLVH---------ARPFGFGHGK-----FKHGKFKHGKFGKRWKHGGKFM----KFKRWK
        GYPPAGYP   HH GH  G+G M+AG    AAAAYGAHH+ H         A   GFGHG        HGKFKHGK GK +KHG   M    KFK+WK
Subjt:  GYPPAGYPGPHHHPGHGHGMGAMLAGGAAAAAAAYGAHHLVH---------ARPFGFGHGK-----FKHGKFKHGKFGKRWKHGGKFM----KFKRWK

AT5G45350.2 proline-rich family protein | 1.7e-21 | 49.49
Query:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPP----PPPPYPGAGY-PPPGGYPPAGYPP------PGGYPPA----GYHP---HGGHPPSAYPYPG
        MGG  + D D+             H YP   GYPP    PP  YP  GY PPPG YPPAGYPP      PGGYPPA    GY P   +GG+PP+  P  G
Subjt:  MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPP----PPPPYPGAGY-PPPGGYPPAGYPP------PGGYPPA----GYHP---HGGHPPSAYPYPG

Query:  GYPPAGYPGPHHHPGHGHGMGAMLAGGAAAAAAAYGAHHLVH---------ARPFGFGHGK-----FKHGKFKHGKFGKRWKHGGKFM----KFKRWK
        GYPPAGYP   HH GH  G+G M+AG    AAAAYGAHH+ H         A   GFGHG        HGKFKHGK GK +KHG   M    KFK+WK
Subjt:  GYPPAGYPGPHHHPGHGHGMGAMLAGGAAAAAAAYGAHHLVH---------ARPFGFGHGK-----FKHGKFKHGKFGKRWKHGGKFM----KFKRWK


Sequences
CDS sequence
ATGGGAGGTGGGAAGGAGAAGGACAGTGATGAGAGAGGCCTGTTTTCACATTTGGGGGCGTTTGCTGCAGGGCACCACTACCCTCATAACCATGGATATCCGCCACCGCC
ACCGCCCTATCCCGGAGCTGGATATCCTCCTCCGGGAGGGTATCCTCCGGCTGGATATCCCCCTCCCGGCGGATATCCTCCGGCTGGCTATCATCCTCACGGTGGACACC
CACCTTCAGCGTATCCCTATCCTGGCGGATACCCTCCCGCTGGCTACCCTGGCCCTCATCATCACCCAGGCCACGGACACGGCATGGGGGCAATGTTGGCTGGTGGAGCA
GCTGCTGCAGCCGCTGCCTACGGTGCTCATCATCTAGTTCATGCACGCCCATTCGGATTCGGTCACGGCAAGTTCAAACATGGCAAGTTTAAGCATGGGAAATTTGGCAA
GCGCTGGAAGCATGGAGGCAAGTTCATGAAATTCAAGAGATGGAAGTGA
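
The CDS above can be checked against the listed protein sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper, not part of CuGenDB, and only the first and last codons of the CDS are used in the usage lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGAGGTGGGAAGGAGAAGGACAGTGATGAG"))  # first 11 codons: MGGGKEKDSDE
print(translate("AGATGGAAGTGA"))  # last 4 codons, ending in stop: RWK
```

Because the function strips whitespace first, the wrapped CDS lines can be passed in as a single multi-line string.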
mRNA sequence
GCGCGAGATGCTTCAATCAAATGGTATAAAAATGAATGAGAGAAGGGCAATGTGTTATTCATAATTCATATTTCTCCTAAATCGCCGTTTCTACGGAGCTCGCCAATAAT
CTTCTTCGTTTCTCTGGATTAGTCGTCGAAACTTGAAGAAAAATGGGAGGTGGGAAGGAGAAGGACAGTGATGAGAGAGGCCTGTTTTCACATTTGGGGGCGTTTGCTGC
AGGGCACCACTACCCTCATAACCATGGATATCCGCCACCGCCACCGCCCTATCCCGGAGCTGGATATCCTCCTCCGGGAGGGTATCCTCCGGCTGGATATCCCCCTCCCG
GCGGATATCCTCCGGCTGGCTATCATCCTCACGGTGGACACCCACCTTCAGCGTATCCCTATCCTGGCGGATACCCTCCCGCTGGCTACCCTGGCCCTCATCATCACCCA
GGCCACGGACACGGCATGGGGGCAATGTTGGCTGGTGGAGCAGCTGCTGCAGCCGCTGCCTACGGTGCTCATCATCTAGTTCATGCACGCCCATTCGGATTCGGTCACGG
CAAGTTCAAACATGGCAAGTTTAAGCATGGGAAATTTGGCAAGCGCTGGAAGCATGGAGGCAAGTTCATGAAATTCAAGAGATGGAAGTGATGAAATCGCTACCGAACAC
ATCGGATTTATATATAGTTATCGTTACACAAGGAAGCTTTCCATCTTGATCCCACGTGATGATTTATGCCACATATGATGAGAGCTTCTGGACAATGAATAGAGCAGTCC
TCAAGAAATGATGTTCCATCGACACTCTCGTTCTCCTTCCATTGATAGTTCACGCACGACAATAGCTTACTCCTGCCAAACACGAGCCCCACCCGCTCTATTAAGAATGA
GGATCTGTCTAGTTCAACTCAATCTAGTTAGTCTAGATTGATCGCCTGAAATAGATCCTCTTTTAACATCCTAATCTATAAACTTGAATTCGCCCCACAGTACAGATGGT
AAAAACTTAGAATAGTTCTTGAAACATAAAATATACAACAGCATTGAAGTTTACAATTATGTATGGCGAGCTATTAGAAACGAACCTGTGGAACGCTCCGACTGCCACGT
CATTAAACTGTAGAGATCCAGCAGCCCGCTGCCTCGAAACACCGTTGATAGTCGATCTACTAATCTCAAGCCGTTGGTCATGAATGTTTTCTTTCCCACTTGCCTTCACG
TCTGCCACCATCCCTTTCCCTTCTTTCTATTACTTGCCTTGACTTGCTGCCTCTAGATCCATGCCTTTGAGCAGACCCAAGTGGAGCTGAATGTCCATCAAATATTGAAA
CTAATTCTGGATCACCAACACTTCCAGTGTCACCGGAATCTCTGCTCCCGTCTACATTTTCTTCGTAACTACTACCGCATTCGTCTGGACGCTCACGAGATTGTAGGATC
GTAGGCAACTGTCCTGAATATTGCCTCGCCCCGGAAAGCACAATGGTCGGTACCCCTGACATGGAGGAACTGGTGGTAGTAGAAGTTTCGGGCTGTAGGCCACTGGCACT
TGTTCTGGGATTCCCTTCATTGCTGGGTCTGCCATTGTCTCGTTCGCGATATCTATCCCGAGGTCGACTGCGAGGAAAAAGGCCACCTCATGGATTCAGTATATTGGGCT
GGACATCTTCAAAATGCGTAATTTGTCCAGATATGAGAATTGAGTTATATTACTAATTTAGTCTTTGAACTTGGATAGTTTTTGCTCTTTTGATGATGTATTGGACTAAA
CATGTTCCGGAGAACGTTCTATACAACGCTATATTTGAACTCT
Protein sequence
MGGGKEKDSDERGLFSHLGAFAAGHHYPHNHGYPPPPPPYPGAGYPPPGGYPPAGYPPPGGYPPAGYHPHGGHPPSAYPYPGGYPPAGYPGPHHHPGHGHGMGAMLAGGA
AAAAAAYGAHHLVHARPFGFGHGKFKHGKFKHGKFGKRWKHGGKFMKFKRWK