CuGenDBv2

Lsi07G007770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G007770
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: glycine-rich protein A3-like
Genome location: chr07:8524045..8525270
RNA-Seq Expression: Lsi07G007770
Synteny: Lsi07G007770
Gene Ontology terms: NA
InterPro domains: NA
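
The genome location field above can be split into chromosome, start, and end, which also gives the length of the gene span. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive.
    return chrom, start, end, end - start + 1


print(parse_location("chr07:8524045..8525270"))
# ('chr07', 8524045, 8525270, 1226)
```

Under that assumption the locus spans 1226 bp.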


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451874.1 PREDICTED: glycine-rich protein A3-like [Cucumis melo] (e value: 7.3e-69; % identity: 88.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG
        MGGGKDKHDESDKGLFSHLAHGVAHGAGYP H GYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPA GYPP  YPP            GHGVAGMLAGG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG

Query:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        AAAAAAAYGAHHVAH AGHYPHGVAHYGHGHGGKFKHHGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK

XP_022931474.1 glycine-rich protein A3-like [Cucurbita moschata] (e value: 2.8e-68; % identity: 84.97)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG
        MGGGKDKHDESDKGLFSHLAHGVAHG GYP  GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPP HGYPPAGYPP                 GHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYG HHVAHGAGHYPHGVAHYGH HGGKFKH  HGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

XP_022983060.1 glycine-rich protein A3-like [Cucurbita maxima] (e value: 1.2e-66; % identity: 85.88)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG
        MGGGKDKHDESDKGLFSHLAHGVAHGAGYPP GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPA GYPPAGYPP            GHGVAGMLAGG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG

Query:  AAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        AAAAAAAYGAHHVAHGAGHYPHGVA Y    GHGHGGKFKHHGKFK GK GKHKHGKH   GGGKFKKWK
Subjt:  AAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK

XP_022984947.1 glycine-rich protein A3-like [Cucurbita maxima] (e value: 9.6e-69; % identity: 85.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG
        MGGGKDKHDESDKGLFSHLAHGVAHG GYP  GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPP HGYPPAGYPP                 GHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGH HGGKFKH  HGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

XP_023553494.1 glycine-rich protein A3-like [Cucurbita pepo subsp. pepo] (e value: 9.6e-69; % identity: 85.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG
        MGGGKDKHDESDKGLFSHLAHGVAHG GYP  GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPP HGYPPAGYPP                 GHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGH HGGKFKH  HGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX9 glycine-rich protein A3-like (e value: 3.6e-69; % identity: 88.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG
        MGGGKDKHDESDKGLFSHLAHGVAHGAGYP H GYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPA GYPP  YPP            GHGVAGMLAGG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG

Query:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        AAAAAAAYGAHHVAH AGHYPHGVAHYGHGHGGKFKHHGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK

A0A5A7TDL9 Glycine-rich protein A3-like (e value: 3.6e-69; % identity: 88.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG
        MGGGKDKHDESDKGLFSHLAHGVAHGAGYP H GYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPA GYPP  YPP            GHGVAGMLAGG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG

Query:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        AAAAAAAYGAHHVAH AGHYPHGVAHYGHGHGGKFKHHGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  AAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK

A0A6J1ETR5 glycine-rich protein A3-like (e value: 1.4e-68; % identity: 84.97)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG
        MGGGKDKHDESDKGLFSHLAHGVAHG GYP  GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPP HGYPPAGYPP                 GHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYG HHVAHGAGHYPHGVAHYGH HGGKFKH  HGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

A0A6J1J3I6 glycine-rich protein A3-like (e value: 4.6e-69; % identity: 85.55)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG
        MGGGKDKHDESDKGLFSHLAHGVAHG GYP  GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPP HGYPPAGYPP                 GHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP-----------------GHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGH HGGKFKH  HGKFKHGK GKHKHGKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKH--HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

A0A6J1J4M6 glycine-rich protein A3-like (e value: 5.7e-67; % identity: 85.88)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG
        MGGGKDKHDESDKGLFSHLAHGVAHGAGYPP GGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPA GYPPAGYPP            GHGVAGMLAGG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP------------GHGVAGMLAGG

Query:  AAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        AAAAAAAYGAHHVAHGAGHYPHGVA Y    GHGHGGKFKHHGKFK GK GKHKHGKH   GGGKFKKWK
Subjt:  AAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK

SwissProt top hits (e value, % identity, alignment)
O16005 Rhodopsin (e value: 2.4e-06; % identity: 46.73)
Query:  GGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPP--GGYPPSHGY--PPQGYPPAHGYPPAGYPPGHGVAGMLAGGAAAAAAAYGA
        GG+       K + + +       A YPP G YPP GGYPP G  PPP  GGYPP  GY  PPQGYPPA GYPP GYPP  G      G    AA   G 
Subjt:  GGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPP--GGYPPSHGY--PPQGYPPAHGYPPAGYPPGHGVAGMLAGGAAAAAAAYGA

Query:  HHVAHGA
         + A+ A
Subjt:  HHVAHGA

P09241 Rhodopsin (e value: 1.8e-06; % identity: 52)
Query:  GGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP
        GG+ +     K + + +    A  A Y P    PPP GYPP  GYPP G YPP  GYPPQGYPP  GYPP GYPP
Subjt:  GGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPP

P24639 Annexin A7 (e value: 3.7e-07; % identity: 48.94)
Query:  GYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPP-QGYPPAHGYPPAGYPPGHGV------AGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAH
        GYPP  GYPP  GYPP  GYPP  GYPP  GYPP QGYPP  GYPP GYPP  G        G+  G A      Y   H  +  G   H   H
Subjt:  GYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPP-QGYPPAHGYPPAGYPPGHGV------AGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAH

P37705 Glycine-rich protein A3 (e value: 1.9e-19; % identity: 50.89)
Query:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYP------PPGGYPPPGGYPPPGGYPPS-HGYPPQGYPPA-HGYPPAGYPP-------------GH
        MGGG D H++ DKGLFS+LA G+A G  YPP G YP      PP GYPP GG  PP GYPP+  GYPPQGYPPA  GYPP GYPP             GH
Subjt:  MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYP------PPGGYPPPGGYPPPGGYPPS-HGYPPQGYPPA-HGYPPAGYPP-------------GH

Query:  -GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGG
         GVAGM+AGG AAAAAAYG HH+  G G                        HG HG + HG  GM  G
Subjt:  -GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGG

Arabidopsis top hits (e value, % identity, alignment)
AT1G31750.1 proline-rich family protein (e value: 1.4e-25; % identity: 55.49)
Query:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPP--GGYPPPGGYPPPGGYPPSHGYPPQGYPPAHG-YPPAGYPP--------GHGVAGMLAGGAAAA
        GKD H E DK  FSH  H   HG GYPP G YPPP  G YPPPGGYPP G  PP HGYPP  YPP  G YPPAGYP         G GV G++AG A AA
Subjt:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPP--GGYPPPGGYPPPGGYPPSHGYPPQGYPPAHG-YPPAGYPP--------GHGVAGMLAGGAAAA

Query:  AAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKH-----GKHKHGKHGMFGGGKFK
        AAA G HH  H  G+  HG   Y  G  G     GK+K GKH     GK+K GKHGMFGG + K
Subjt:  AAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKH-----GKHKHGKHGMFGGGKFK

AT4G19200.1 proline-rich family protein (e value: 2.9e-31; % identity: 58.82)
Query:  MGGGKDK-HDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYP-------PQGYPPA-HGYPPAGYP---------PGHGVA
        MGGGKDK HDE +KG      HG   G  YPP  G  PP GYPP  GYPP GGYPP+ GYP       P GYPPA  GYPPAGYP          G G+ 
Subjt:  MGGGKDK-HDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYP-------PQGYPPA-HGYPPAGYP---------PGHGVA

Query:  GMLAGGAAAAAAAYGAHHVAH----------GAGHYPHGVAH-YGHGHGGKF---KHHGKFKHGKHGKHKHGKHGMF-GGGKFKKWK
        GM+AG A AAAAAYGAHHV H          G G Y H  AH +GHG  GKF   KH GKFKHGKHG  KHGKHGMF GGGKFKKWK
Subjt:  GMLAGGAAAAAAAYGAHHVAH----------GAGHYPHGVAH-YGHGHGGKF---KHHGKFKHGKHGKHKHGKHGMF-GGGKFKKWK

AT5G17650.1 glycine/proline-rich protein (e value: 2.6e-24; % identity: 52.75)
Query:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG--------GY---------PPPGGYPPPGGYPPPGGYPPSHGYPPQGYP----PAHGYPPAGYP-PGH--
        G D+H+ SD+G F +LA G A G  YPPHG        GY         PPP GYPP   YPP GGYPP+ GYPP GYP    PAHGYP  GYP P H  
Subjt:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG--------GY---------PPPGGYPPPGGYPPPGGYPPSHGYPPQGYP----PAHGYPPAGYP-PGH--

Query:  ----GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFG---GGKFKKWK
            G+  ++AGG AAAA   GAHH++H  GHY H   H+GHG+G  +  HGKFKHGK    K GKHGMFG   G  FKKWK
Subjt:  ----GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFG---GGKFKKWK

AT5G45350.1 proline-rich family protein (e value: 9.4e-30; % identity: 57.63)
Query:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG----GYPPPGGYPPPGGYPP------PGGYPPS---HGYPPQ----GYPPA---HGYPPAGYPPGH-GVA
        G D  ++ DKG   +   G      YPP G    GYPPP G  PP GYPP      PGGYPP+    GYPP     GYPPA    GYPPAGYP  H G A
Subjt:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG----GYPPPGGYPPPGGYPP------PGGYPPS---HGYPPQ----GYPPA---HGYPPAGYPPGH-GVA

Query:  GMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKH-HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        G + G  A AAAAYGAHHVAH + H P+G A Y    GHGHG  + H HGKFKHGKHGK KHGKHGMFGGGKFKKWK
Subjt:  GMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKH-HGKFKHGKHGKHKHGKHGMFGGGKFKKWK

AT5G45350.2 proline-rich family protein (e value: 9.4e-30; % identity: 57.63)
Query:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG----GYPPPGGYPPPGGYPP------PGGYPPS---HGYPPQ----GYPPA---HGYPPAGYPPGH-GVA
        G D  ++ DKG   +   G      YPP G    GYPPP G  PP GYPP      PGGYPP+    GYPP     GYPPA    GYPPAGYP  H G A
Subjt:  GKDKHDESDKGLFSHLAHGVAHGAGYPPHG----GYPPPGGYPPPGGYPP------PGGYPPS---HGYPPQ----GYPPA---HGYPPAGYPPGH-GVA

Query:  GMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKH-HGKFKHGKHGKHKHGKHGMFGGGKFKKWK
        G + G  A AAAAYGAHHVAH + H P+G A Y    GHGHG  + H HGKFKHGKHGK KHGKHGMFGGGKFKKWK
Subjt:  GMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHY----GHGHGGKFKH-HGKFKHGKHGKHKHGKHGMFGGGKFKKWK


Sequences
CDS sequence
ATGGGGGGTGGAAAAGATAAACATGATGAATCTGACAAAGGTTTATTCTCGCATCTGGCTCATGGTGTTGCTCATGGAGCTGGATATCCTCCCCATGGAGGATATCCACC
CCCTGGAGGATATCCCCCACCTGGAGGGTATCCCCCACCTGGAGGATATCCTCCATCTCATGGCTATCCACCCCAAGGATATCCCCCGGCACACGGTTACCCCCCGGCTG
GTTACCCTCCTGGGCACGGTGTAGCAGGAATGCTCGCCGGGGGTGCTGCCGCTGCAGCAGCAGCATATGGAGCTCATCATGTGGCCCATGGCGCTGGCCATTACCCCCAT
GGCGTTGCTCATTATGGTCATGGTCATGGTGGAAAATTCAAGCACCATGGGAAGTTCAAGCATGGGAAACATGGGAAGCACAAACATGGGAAGCACGGCATGTTTGGTGG
TGGAAAATTCAAGAAGTGGAAGTGA
mRNA sequence
ATGGGGGGTGGAAAAGATAAACATGATGAATCTGACAAAGGTTTATTCTCGCATCTGGCTCATGGTGTTGCTCATGGAGCTGGATATCCTCCCCATGGAGGATATCCACC
CCCTGGAGGATATCCCCCACCTGGAGGGTATCCCCCACCTGGAGGATATCCTCCATCTCATGGCTATCCACCCCAAGGATATCCCCCGGCACACGGTTACCCCCCGGCTG
GTTACCCTCCTGGGCACGGTGTAGCAGGAATGCTCGCCGGGGGTGCTGCCGCTGCAGCAGCAGCATATGGAGCTCATCATGTGGCCCATGGCGCTGGCCATTACCCCCAT
GGCGTTGCTCATTATGGTCATGGTCATGGTGGAAAATTCAAGCACCATGGGAAGTTCAAGCATGGGAAACATGGGAAGCACAAACATGGGAAGCACGGCATGTTTGGTGG
TGGAAAATTCAAGAAGTGGAAGTGATTTTTCTGCCATCTCTTTTCACCGAGCCTCCTTCTTCGGTCGCTTTCAAGAATGGCCTCGGTACAGATAGTCAGTCGCCTCTACT
ATCTTCTTCTCTAATATCTGTAATGTCTTAAGACTCTGGATTTTCGTATCTACTCTTATATTGTCTTAACTACCGATTATGGCTTCTGTACTGTTGCTGCTGGACATCAA
TAATCTGGATCTTTCCAGTTTCCACCTTATTTTCTTCTTATTGTAAATTTTTTTTTTTTTTTTTT
Protein sequence
MGGGKDKHDESDKGLFSHLAHGVAHGAGYPPHGGYPPPGGYPPPGGYPPPGGYPPSHGYPPQGYPPAHGYPPAGYPPGHGVAGMLAGGAAAAAAAYGAHHVAHGAGHYPH
GVAHYGHGHGGKFKHHGKFKHGKHGKHKHGKHGMFGGGKFKKWK
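
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not the database's own pipeline: the helper name `translate` is hypothetical, and it assumes the standard nuclear genetic code and a CDS that starts in frame at its first base. Only the first 42 bases and the last 21 bases of the CDS are used here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 42 bases of the CDS sequence listed above.
print(translate("ATGGGGGGTGGAAAAGATAAACATGATGAATCTGACAAAGGT"))
# MGGGKDKHDESDKG

# Last 21 bases of the CDS, including the TGA stop codon.
print(translate("AAATTCAAGAAGTGGAAGTGA"))
# KFKKWK
```

Both fragments agree with the start and the end of the protein sequence in this record. Passing the full wrapped CDS works the same way, since whitespace is removed before translation.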