CuGenDBv2

Clc05G03260 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G03260
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: glycine-rich protein A3
Genome location: ClcChr05:2240746..2242974
RNA-Seq Expression: Clc05G03260
Synteny: Clc05G03260
Gene Ontology terms: NA
InterPro domains: NA
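
The genome location field packs the chromosome name and both coordinates into one string. A minimal sketch of splitting it and computing the gene span follows. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr05:2240746..2242974")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2229 bp on ClcChr05, which is longer than the 480 nt CDS listed in the Sequences section because the span also covers the untranslated regions and any introns.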


Homology
GenBank top hits | e value | % identity | Alignment
XP_004133933.1 glycine-rich protein A3 [Cucumis sativus] | 1.1e-59 | 79.29
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH
        MGGGKEN+  DKDKGLFSNMAAFAAGHHYPH HGYPPPPY G  YPPPGGYPP GYPP+      GYPPYGGHPHTAYP  GGYPPAGYPGPHHYPG+GH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH

Query:  GHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK
        G+GHG  G+GGLLAG    AAAAYGAHHL HARPFG+G     HGKFKHGKFGKRWKHGG  MRFKKWK
Subjt:  GHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK

XP_008438206.1 PREDICTED: glycine-rich protein A3 [Cucumis melo] | 4.2e-67 | 85.09
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH
        MGGGKEN+  DKDKGLFSNMAAFAAGHHYPH HGYPPPPY G GYPPPGGYPPA      GYPPAGYPPYGGHPHTAYPY GGYPPAGYPGPHHYPG+GH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH

Query:  GHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        GHGHG+GG + G  A AAAAYGAHHL HARPFG+GHGKFKHGKFGKRWKHGGKF RFKKWK
Subjt:  GHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

XP_022147022.1 glycine-rich protein A3 [Momordica charantia] | 4.6e-58 | 76.05
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY--PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGH
        MGGGKE DS   ++GLFS++ AFAAGHHYPH+HGY  PPPPY G GYPPPGGYPPAGYPP GGYPPAGY P+GGHP +AYPY GGYPPAGYPGPHH+PGH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY--PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGH

Query:  GHGHGHGMGGMGGLLAGAAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK
        GHG G     + G  A AAAAYGAHHLVHARPFG+G     HGKFKHGKFGKRWKHGGKFM+FK+WK
Subjt:  GHGHGHGMGGMGGLLAGAAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK

XP_022980434.1 glycine-rich protein A3-like [Cucurbita maxima] | 1.5e-53 | 72.83
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG------------GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG
        MGGGKE +S  KDKGLFS+MA+FAAG HY H HGY PPPPYAG            GYPPPGGYPPA YPP GGYPPA YPP GG+P   YP HG YPPAG
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG------------GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG

Query:  YPGPHHYPGHGHGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        YP PHHYPGHG GHG GMGG + G  A AAAAYGAHHL HARPFG+GHGKFKHGKFGKRWKHGG   +FKKWK
Subjt:  YPGPHHYPGHGHGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

XP_023527127.1 glycine-rich protein A3-like [Cucurbita pepo subsp. pepo] | 1.5e-53 | 73.18
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTA-----------YPYHGGYPPAG
        MGGGKE +S  KDKGLFS+MA+FAAG HY H HGY PPPPYAG GYPPPGGYPPA YPP GGYPPA YPP GG+P  A           YP HG YPPAG
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTA-----------YPYHGGYPPAG

Query:  YPGPHHYPGHGHGHGHGMG---GMGGLLAG----AAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        YP PHHYPGHG GHG G G   GMGGLLAG    AAAAYGAHHL HARPFG+GHGKFKHGKFGKRWKHGG   +FKKWK
Subjt:  YPGPHHYPGHGHGHGHGMG---GMGGLLAG----AAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L4F2 Uncharacterized protein | 5.3e-60 | 79.29
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH
        MGGGKEN+  DKDKGLFSNMAAFAAGHHYPH HGYPPPPY G  YPPPGGYPP GYPP+      GYPPYGGHPHTAYP  GGYPPAGYPGPHHYPG+GH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH

Query:  GHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK
        G+GHG  G+GGLLAG    AAAAYGAHHL HARPFG+G     HGKFKHGKFGKRWKHGG  MRFKKWK
Subjt:  GHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK

A0A1S3AWG9 glycine-rich protein A3 | 2.0e-67 | 85.09
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH
        MGGGKEN+  DKDKGLFSNMAAFAAGHHYPH HGYPPPPY G GYPPPGGYPPA      GYPPAGYPPYGGHPHTAYPY GGYPPAGYPGPHHYPG+GH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGH

Query:  GHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        GHGHG+GG + G  A AAAAYGAHHL HARPFG+GHGKFKHGKFGKRWKHGGKF RFKKWK
Subjt:  GHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

A0A6J1D180 glycine-rich protein A3 | 2.2e-58 | 76.05
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY--PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGH
        MGGGKE DS   ++GLFS++ AFAAGHHYPH+HGY  PPPPY G GYPPPGGYPPAGYPP GGYPPAGY P+GGHP +AYPY GGYPPAGYPGPHH+PGH
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY--PPPPYAG-GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGH

Query:  GHGHGHGMGGMGGLLAGAAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK
        GHG G     + G  A AAAAYGAHHLVHARPFG+G     HGKFKHGKFGKRWKHGGKFM+FK+WK
Subjt:  GHGHGHGMGGMGGLLAGAAAAYGAHHLVHARPFGYG-----HGKFKHGKFGKRWKHGGKFMRFKKWK

A0A6J1IGV8 glycine-rich protein A3-like isoform X1 | 3.7e-53 | 72.84
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPP--YAGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHG
        MGGGK+ D++  D+GL S+MAAFAAG+HY   HGYPPPP     GY PP GYPPAGYPP GGYPPAG           YPYHGGYPPAGYPG HHYPGHG
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPP--YAGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHG

Query:  HGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        HGHGHGMGG + G  A AAAAYGAHH+VHA PFGYG+G+FKHGKFGKRWKHGGKFM+FKKWK
Subjt:  HGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

A0A6J1IZ97 glycine-rich protein A3-like | 7.4e-54 | 72.83
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG------------GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG
        MGGGKE +S  KDKGLFS+MA+FAAG HY H HGY PPPPYAG            GYPPPGGYPPA YPP GGYPPA YPP GG+P   YP HG YPPAG
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGY-PPPPYAG------------GYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG

Query:  YPGPHHYPGHGHGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
        YP PHHYPGHG GHG GMGG + G  A AAAAYGAHHL HARPFG+GHGKFKHGKFGKRWKHGG   +FKKWK
Subjt:  YPGPHHYPGHGHGHGHGMGG-MGGLLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK

SwissProt top hits | e value | % identity | Alignment
O16005 Rhodopsin | 9.5e-06 | 56.72
Query:  YPPPPYAGGYPPPGGYPPAGYPP---SGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHG
        YPP    G YPP GGYPP GYPP    GGYPP GYPP    P   YP   GYPP GYP P   P  G
Subjt:  YPPPPYAGGYPPPGGYPPAGYPP---SGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHG

P09241 Rhodopsin | 6.8e-04 | 44.44
Query:  GGKENDSSDKDK--GLFSNMAAFAAGHH-YPHSHGYPPPPY--AGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG
        GG+  D++   +   +   M A  A +   P   GYPP  Y   G YPPP GYPP GYPP  GYPP GYPP G  P    P   G PP G
Subjt:  GGKENDSSDKDK--GLFSNMAAFAAGHH-YPHSHGYPPPPY--AGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAG

P24639 Annexin A7 | 4.0e-04 | 44.55
Query:  YPHSHGYPP----PPYAG-----GYPPPGGYPP-AGYPPSGGYPP-AGYPPYGGH-PHTAYPYHGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAA
        YP   GYPP    PP  G     GYPP  GYPP  GYPP  GYPP  GYPP  G+ P   YP   GYPP GYP    YP  G   G  +G   G++ G  
Subjt:  YPHSHGYPP----PPYAG-----GYPPPGGYPP-AGYPPSGGYPP-AGYPPYGGH-PHTAYPYHGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAA

Query:  AAYGAHHLVH
          Y    + H
Subjt:  AAYGAHHLVH

P37705 Glycine-rich protein A3 | 4.4e-19 | 49.03
Query:  MGGGKENDSSDKDKGLFSNMA-AFAAGHHYPHSHGYPPPPYAGGYPPP------GGYPPAGYPPS-GGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPH
        MGGG  +  +D+DKGLFSN+A   A G HYP       PP AGGYPP       GGYPP GYPP+ GGYPP GYPP G          GGYPP GYP   
Subjt:  MGGGKENDSSDKDKGLFSNMA-AFAAGHHYPHSHGYPPPPYAGGYPPP------GGYPPAGYPPS-GGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPH

Query:  HYPGHGHGHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYGHGKFKHGKFG
        H+ G    H  G GG+ G++AG    AAAAYG HH+        GHG + HG  G
Subjt:  HYPGHGHGHGHGMGGMGGLLAG----AAAAYGAHHLVHARPFGYGHGKFKHGKFG

Arabidopsis top hits | e value | % identity | Alignment
AT1G31750.1 proline-rich family protein | 2.6e-19 | 49.11
Query:  ENDSSDKDKGLFSNMAAFAAGHHYPHSHGYP----PPPYAGGYPPPGGYPPAGY-PPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGHG
        ++D  ++DK  FS        HH  H HGYP    PPP  G YPPPGGYPP GY PP  GYPPA YP          P  G YPPAGYPGP   P  G G
Subjt:  ENDSSDKDKGLFSNMAAFAAGHHYPHSHGYP----PPPYAGGYPPPGGYPPAGY-PPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGHG

Query:  HGHGMGGMGGLLAG----AAAAYGAHHLVHARPFG-YGHGKFKHGKFG----KRWKH----GGKFMRFK
             GG+GGL+AG    AAAA G HH  H   +G +GHGK+K G FG    KR KH    GGK+ R K
Subjt:  HGHGMGGMGGLLAG----AAAAYGAHHLVHARPFG-YGHGKFKHGKFG----KRWKH----GGKFMRFK

AT4G19200.1 proline-rich family protein | 3.0e-23 | 44.9
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAGGYPPPGGYPPAGYPPSGGYPPAGYPP--YGGHPHTAYPYHGGYPPAGYPGPHHYPGHG
        MGGGK+    +++KG       F  G HY        PP  GGYPP G  P  GYPP+GGYPPAGYPP  Y   P    P  GGYPPAGYP P       
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAGGYPPPGGYPPAGYPPSGGYPPAGYPP--YGGHPHTAYPYHGGYPPAGYPGPHHYPGHG

Query:  HGHGHGMGGMGGLLAG----AAAAYGAHHLVHA--RPFGY------------------GHGKFKHGKFGKRWKHG-----------GKFMRFKKWK
        H  GH  GG+GG++AG    AAAAYGAHH+ HA   P+G+                  GHGKFKHGK G ++KHG           G   +FKKWK
Subjt:  HGHGHGMGGMGGLLAG----AAAAYGAHHLVHA--RPFGY------------------GHGKFKHGKFGKRWKHG-----------GKFMRFKKWK

AT5G17650.1 glycine/proline-rich protein | 3.2e-25 | 47.92
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYP------------PPPYAGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGY
        MG  + N S   D+G F N+A FA G + PH HGY             PPP     PPP GYPP  YPP GGYPPAGYPP  G+P   YP H GYP  GY
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYP------------PPPYAGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGY

Query:  PGPHHYPGHGHGHGHGMGGMGGLLA-GAAAAYGAHHLV---------HARPFGYG--------HGKFKHGKFGKR---WKHGGKFMRFKKWK
        P P H        GH  GG+G ++A G AAA GAHH+          H   +GYG        HGKFKHGKFGK     KH GKF  FKKWK
Subjt:  PGPHHYPGHGHGHGHGMGGMGGLLA-GAAAAYGAHHLV---------HARPFGYG--------HGKFKHGKFGKR---WKHGGKFMRFKKWK

AT5G45350.1 proline-rich family protein | 6.1e-24 | 47.29
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPP--------PYAGGYPPPGGYPPAGYPPS------GGYPPA----GYPP---YGGHPHTAYPY
        MGG  +N   DKDKG           H YP + GYPPP        P  G  PPPG YPPAGYPP       GGYPPA    GYPP   YGG+P    P 
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPP--------PYAGGYPPPGGYPPAGYPPS------GGYPPA----GYPP---YGGHPHTAYPY

Query:  HGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAAAAYGAHHLVHAR-------------------PFGYGHGKFKHGKFGKRWKHGGKFM----RFK
        HGGYPPAGYP   H+ GH        GG+GG++AGAAAAYGAHH+ H+                     +G+GHGKFKHGK GK +KHG   M    +FK
Subjt:  HGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAAAAYGAHHLVHAR-------------------PFGYGHGKFKHGKFGKRWKHGGKFM----RFK

Query:  KWK
        KWK
Subjt:  KWK

AT5G45350.2 proline-rich family protein | 6.1e-24 | 47.29
Query:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPP--------PYAGGYPPPGGYPPAGYPPS------GGYPPA----GYPP---YGGHPHTAYPY
        MGG  +N   DKDKG           H YP + GYPPP        P  G  PPPG YPPAGYPP       GGYPPA    GYPP   YGG+P    P 
Subjt:  MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPP--------PYAGGYPPPGGYPPAGYPPS------GGYPPA----GYPP---YGGHPHTAYPY

Query:  HGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAAAAYGAHHLVHAR-------------------PFGYGHGKFKHGKFGKRWKHGGKFM----RFK
        HGGYPPAGYP   H+ GH        GG+GG++AGAAAAYGAHH+ H+                     +G+GHGKFKHGK GK +KHG   M    +FK
Subjt:  HGGYPPAGYPGPHHYPGHGHGHGHGMGGMGGLLAGAAAAYGAHHLVHAR-------------------PFGYGHGKFKHGKFGKRWKHGGKFM----RFK

Query:  KWK
        KWK
Subjt:  KWK


Sequences
CDS sequence
ATGGGAGGTGGGAAGGAAAATGACAGCAGTGACAAAGACAAAGGCCTGTTTTCAAATATGGCGGCGTTTGCTGCGGGGCACCACTATCCTCATTCTCATGGATATCCACC
ACCACCATACGCTGGAGGATATCCCCCCCCGGGAGGGTACCCTCCGGCTGGGTATCCCCCTTCCGGTGGATATCCTCCGGCTGGCTATCCTCCTTACGGTGGACACCCTC
ATACAGCCTATCCATATCACGGCGGATACCCCCCTGCTGGCTATCCTGGCCCCCATCATTACCCTGGCCATGGACACGGACACGGACACGGTATGGGGGGTATGGGGGGA
TTGTTGGCTGGTGCAGCCGCTGCTTACGGCGCTCATCATCTTGTTCATGCACGCCCATTTGGCTACGGTCACGGAAAGTTCAAACATGGGAAATTTGGCAAGCGTTGGAA
GCATGGAGGCAAGTTCATGAGATTCAAGAAGTGGAAGTGA
mRNA sequence
GAAACTAGTCATGCTTTTTATATACCAAAAATGAAAAATGAAAATTTGGTCCATTCGTAGTCGGGGTTGGTTCACGAAAAACAGAGACCGCAAATCGTAATAATAATAAT
AAAAAAATATTGATTATATACACATTATATTTTCAAATATGGAATAATTAAAATTCGAAGAACATCACGTGCTGAGTTGGACGCGTCGAAGTAACGCCAAGGCGGTTGAT
ACTCTTTGATCGTACAAAACAAAAAGGCAGAGAGATTGGCTTTCACAAAGACATCAATCTCTGATTTGTAGGTTTCTGTTTCTCTTATTCGCCATTTTACGGAGCACACC
GGAAACTCTCTTCTTTTTTGGTTTACTACTCTTGGAAACTCGAAGGAAAATGGGAGGTGGGAAGGAAAATGACAGCAGTGACAAAGACAAAGGCCTGTTTTCAAATATGG
CGGCGTTTGCTGCGGGGCACCACTATCCTCATTCTCATGGATATCCACCACCACCATACGCTGGAGGATATCCCCCCCCGGGAGGGTACCCTCCGGCTGGGTATCCCCCT
TCCGGTGGATATCCTCCGGCTGGCTATCCTCCTTACGGTGGACACCCTCATACAGCCTATCCATATCACGGCGGATACCCCCCTGCTGGCTATCCTGGCCCCCATCATTA
CCCTGGCCATGGACACGGACACGGACACGGTATGGGGGGTATGGGGGGATTGTTGGCTGGTGCAGCCGCTGCTTACGGCGCTCATCATCTTGTTCATGCACGCCCATTTG
GCTACGGTCACGGAAAGTTCAAACATGGGAAATTTGGCAAGCGTTGGAAGCATGGAGGCAAGTTCATGAGATTCAAGAAGTGGAAGTGATGAATTCGCTAGAACATTGTC
TGATGTATATCACCGTGACTTTTGATTCCAAAACCTAGAAAAATAAATATGGTATCCTAATGTTTTTTTTTTTT
Protein sequence
MGGGKENDSSDKDKGLFSNMAAFAAGHHYPHSHGYPPPPYAGGYPPPGGYPPAGYPPSGGYPPAGYPPYGGHPHTAYPYHGGYPPAGYPGPHHYPGHGHGHGHGMGGMGG
LLAGAAAAYGAHHLVHARPFGYGHGKFKHGKFGKRWKHGGKFMRFKKWK
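
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is illustrative only: `translate` is a hypothetical helper, it assumes the standard genetic code (the record does not name a translation table), and it uses just the first 45 and last 18 nucleotides of the CDS rather than the full sequence.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 45 nt and last 18 nt of the CDS listed above.
cds_start = "ATGGGAGGTGGGAAGGAAAATGACAGCAGTGACAAAGACAAAGGC"
cds_end = "TTCAAGAAGTGGAAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MGGGKENDSSDKDKG and FKKWK, matching the first 15 and last 5 residues of the protein sequence. Passing the full CDS (with its line breaks) to the same function would extend the check to the whole protein.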