; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002613 (gene) of Snake gourd v1 genome

Gene IDTan0002613
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglycine-rich protein A3-like
Genome locationLG01:9917011..9919529
RNA-Seq ExpressionTan0002613
SyntenyTan0002613
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451874.1 PREDICTED: glycine-rich protein A3-like [Cucumis melo]2.2e-6889.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
        MGGGKDKHDESDKGLFSHLAHGV HGA     GYPPPGGYPPPGGYPPP GYPP  GYPP  GYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA

Query:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAAAAAYGAHHVAH AGHYPHGVAHY  GHGGKFKHHGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

XP_022136842.1 glycine-rich protein A3-like [Momordica charantia]2.5e-7290.36Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAG-----YPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
        MGGGKDKHDESDKGLFSHLAHGV  HGAGYPP  GYPPPGGYPPPHGYPPQGYPPAHGYPPAG     YPPGAYPPGAYPGPSAPHH+GHGVAGMLAGGA
Subjt:  MGGGKDKHDESDKGLFSHLAHGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAG-----YPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA

Query:  AAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAAAAAYGAHHVAHGAGHYPHG AHYGHGGKFKHHGKFKHGKF   GKHK GKHGM+GGGKFKKWK
Subjt:  AAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

XP_022931474.1 glycine-rich protein A3-like [Cucurbita moschata]1.4e-7086.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
        MGGGKDKHDESDKGLFSHLAHGV HG             GYPPPGGYPPPGGYPP HGYPPQGYPP HGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYG HHVAHGAGHYPHGVAHYG HGGKFKH  HGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

XP_022984947.1 glycine-rich protein A3-like [Cucurbita maxima]1.8e-7086.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
        MGGGKDKHDESDKGLFSHLAHGV HG             GYPPPGGYPPPGGYPP HGYPPQGYPP HGYPPAGYPPGAYPPG YPGPSAPHHSGHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG HGGKFKH  HGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

XP_023553494.1 glycine-rich protein A3-like [Cucurbita pepo subsp. pepo]4.8e-7186.86Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
        MGGGKDKHDESDKGLFSHLAHGV HG             GYPPPGGYPPPGGYPP HGYPPQGYPP HGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG HGGKFKH  HGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

TrEMBL top hitse value%identityAlignment
A0A1S3BRX9 glycine-rich protein A3-like1.1e-6889.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
        MGGGKDKHDESDKGLFSHLAHGV HGA     GYPPPGGYPPPGGYPPP GYPP  GYPP  GYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA

Query:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAAAAAYGAHHVAH AGHYPHGVAHY  GHGGKFKHHGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

A0A5A7TDL9 Glycine-rich protein A3-like1.1e-6889.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
        MGGGKDKHDESDKGLFSHLAHGV HGA     GYPPPGGYPPPGGYPPP GYPP  GYPP  GYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHGA-----GYPPPGGYPPPGGYPPPHGYPP-QGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA

Query:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAAAAAYGAHHVAH AGHYPHGVAHY  GHGGKFKHHGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  AAAAAAYGAHHVAHGAGHYPHGVAHY--GHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

A0A6J1C8M2 glycine-rich protein A3-like1.2e-7290.36Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAG-----YPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA
        MGGGKDKHDESDKGLFSHLAHGV  HGAGYPP  GYPPPGGYPPPHGYPPQGYPPAHGYPPAG     YPPGAYPPGAYPGPSAPHH+GHGVAGMLAGGA
Subjt:  MGGGKDKHDESDKGLFSHLAHGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAG-----YPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGA

Query:  AAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAAAAAYGAHHVAHGAGHYPHG AHYGHGGKFKHHGKFKHGKF   GKHK GKHGM+GGGKFKKWK
Subjt:  AAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

A0A6J1ETR5 glycine-rich protein A3-like6.7e-7186.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
        MGGGKDKHDESDKGLFSHLAHGV HG             GYPPPGGYPPPGGYPP HGYPPQGYPP HGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYG HHVAHGAGHYPHGVAHYG HGGKFKH  HGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

A0A6J1J3I6 glycine-rich protein A3-like8.8e-7186.29Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG
        MGGGKDKHDESDKGLFSHLAHGV HG             GYPPPGGYPPPGGYPP HGYPPQGYPP HGYPPAGYPPGAYPPG YPGPSAPHHSGHGVAG
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHG------------AGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG HGGKFKH  HGKFKHGKF   GKHK GKHGMFGGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYG-HGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

SwissProt top hitse value%identityAlignment
O16005 Rhodopsin1.2e-0555.71Show/hide
Query:  AGYPPPGGYPPPGGYP-------------PPHGY--PPQGYPPAHGYPPAGYPP--GAYPPGAYPGPSAP
        A YPP G YPP GGYP             PP GY  PPQGYPPA GYPP GYPP  GA P GA P  + P
Subjt:  AGYPPPGGYPPPGGYP-------------PPHGY--PPQGYPPAHGYPPAGYPP--GAYPPGAYPGPSAP

P09241 Rhodopsin1.2e-0548.72Show/hide
Query:  GGKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPGGYPPPHGYPPQG-YPPAHGYPPAGYPPGAYPPGAYPGPSAP
        GG+ +     K + + +       A Y PP   PPP GY PP GYPPQG YPP  GYPP GYPP  YPP  YP   AP
Subjt:  GGKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPGGYPPPHGYPPQG-YPPAHGYPPAGYPPGAYPPGAYPGPSAP

P24639 Annexin A73.3e-0659.32Show/hide
Query:  GYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGV
        GYPP  GYPP  GYPP  GYPPQGYPP  GYPP G P G  P G  PG    +H G+ V
Subjt:  GYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGV

P37705 Glycine-rich protein A35.0e-2356.49Show/hide
Query:  MGGGKDKHDESDKGLFSHLAHGVGHGAGYPP------PGGYPPPGGYPPPHGYPPQGYPPA-HGYPPAGYPP--GAYPPGAYP------GPSAPHHSGH-
        MGGG D H++ DKGLFS+LA G+  G  YPP       GGYPP G  P   GYPPQGYPPA  GYPP GYPP  G YPP  YP      G SAPHHSGH 
Subjt:  MGGGKDKHDESDKGLFSHLAHGVGHGAGYPP------PGGYPPPGGYPPPHGYPPQGYPPA-HGYPPAGYPP--GAYPPGAYP------GPSAPHHSGH-

Query:  GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFG
        GVAGM+AGG AAAAAAYG HH+  G         H  HGG    HG + HG  G
Subjt:  GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFG

Q8S8M0 Cysteine-rich and transmembrane domain-containing protein WIH24.0e-0451.25Show/hide
Query:  GYPPPGGYPPPGGYP----PPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAP-------HHSGHGVAGMLAGGAAA
        G PPP GYPP  GYP    PP GYPPQGY P  GYPP GYP   YP   YP P AP       H       G L G  AA
Subjt:  GYPPPGGYPPPGGYP----PPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAP-------HHSGHGVAGMLAGGAAA

Arabidopsis top hitse value%identityAlignment
AT1G31750.1 proline-rich family protein3.6e-3260.74Show/hide
Query:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPP--GGYPPPHGYPPQGY-PPAHGYPPAGY--PPGAYPPGAYPGPSAPHHS-GHGVAGMLAGGAAAA
        GKD H E DK  FSH  H   HG GY PPG YPPP  G YPPP GYPPQGY PP HGYPPA Y  PPGAYPP  YPGPS P    G GV G++AG A AA
Subjt:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPP--GGYPPPHGYPPQGY-PPAHGYPPAGY--PPGAYPPGAYPGPSAPHHS-GHGVAGMLAGGAAAA

Query:  AAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        AAA G HH  H  G        YGH G    HGK+K G FG  GK+KRGKH MFGGGK+K+ K
Subjt:  AAAYGAHHVAHGAGHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

AT4G19200.1 proline-rich family protein5.5e-3359.04Show/hide
Query:  MGGGKDK-HDESDKGLFSHLAHGVGHGAGYP------PPGGYPPPGGYPPPHGYPPQGYPP-AHGYPPAGYP--PGAYPPGAYPGPSAPH--HSGHGVAG
        MGGGKDK HDE +KG      HG   G  YP      PP GYPP  GYPP  GYPP GYPP A+   P GYP  PG YPP  YP P A H  HSG G+ G
Subjt:  MGGGKDK-HDESDKGLFSHLAHGVGHGAGYP------PPGGYPPPGGYPPPHGYPPQGYPP-AHGYPPAGYP--PGAYPPGAYPGPSAPH--HSGHGVAG

Query:  MLAGGAAAAAAAYGAHHVAH----------GAGHYPHGVAH-YGHGGKFKHHGKFKHGKFG---KHGKH-KRGKHGMF-GGGKFKKWK
        M+AG A AAAAAYGAHHV H          G G Y H  AH +GHGG    HGKFKHGK G   KHGKH K GKHGMF GGGKFKKWK
Subjt:  MLAGGAAAAAAAYGAHHVAH----------GAGHYPHGVAH-YGHGGKFKHHGKFKHGKFG---KHGKH-KRGKHGMF-GGGKFKKWK

AT5G17650.1 glycine/proline-rich protein7.0e-2853.26Show/hide
Query:  GKDKHDESDKGLFSHLA-----------HGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYP-------GPSAPHHSGH
        G D+H+ SD+G F +LA           HG G HG GY     YPPP   PPPHGYPP  YPP  GYPPAGYPP  YPP  YP       G   P HSGH
Subjt:  GKDKHDESDKGLFSHLA-----------HGVG-HGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYP-------GPSAPHHSGH

Query:  ---GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFG---GGKFKKWK
           G+  ++AGG AAAA   GAHH++H  GHY H   H+GHG  + +  HGKFKHGKF KHGK   GKHGMFG   G  FKKWK
Subjt:  ---GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH--HGKFKHGKFGKHGKHKRGKHGMFG---GGKFKKWK

AT5G45350.1 proline-rich family protein1.2e-2755.79Show/hide
Query:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPG----GYPPPHG-YPPQGYPPA------HGYPPA----GYPP----GAYPP----GAYPGPSAP-
        G D  ++ DKG      HG    AGYPPPG YPP G    GYPPP G YPP GYPP        GYPPA    GYPP    G YPP    G YP    P 
Subjt:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPG----GYPPPHG-YPPQGYPPA------HGYPPA----GYPP----GAYPP----GAYPGPSAP-

Query:  HHSGH--GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH-------HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        HHSGH  G+ GM+AG    AAAAYGAHHVAH + H P+G A YGHG    H       HGKFKH   GKHGK K GKHGMFGGGKFKKWK
Subjt:  HHSGH--GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH-------HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK

AT5G45350.2 proline-rich family protein1.2e-2755.79Show/hide
Query:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPG----GYPPPHG-YPPQGYPPA------HGYPPA----GYPP----GAYPP----GAYPGPSAP-
        G D  ++ DKG      HG    AGYPPPG YPP G    GYPPP G YPP GYPP        GYPPA    GYPP    G YPP    G YP    P 
Subjt:  GKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPG----GYPPPHG-YPPQGYPPA------HGYPPA----GYPP----GAYPP----GAYPGPSAP-

Query:  HHSGH--GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH-------HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK
        HHSGH  G+ GM+AG    AAAAYGAHHVAH + H P+G A YGHG    H       HGKFKH   GKHGK K GKHGMFGGGKFKKWK
Subjt:  HHSGH--GVAGMLAGGAAAAAAAYGAHHVAHGAGHYPHGVAHYGHGGKFKH-------HGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGGTGGAAAAGATAAACATGATGAATCTGACAAAGGGTTATTCTCGCATCTGGCTCATGGTGTTGGTCATGGAGCTGGATATCCTCCTCCTGGAGGATATCCCCC
ACCTGGAGGGTATCCTCCACCTCATGGCTATCCACCCCAAGGATATCCCCCCGCACATGGCTACCCCCCGGCTGGCTATCCTCCGGGCGCTTATCCTCCTGGTGCATATC
CTGGACCCTCTGCCCCACACCATTCAGGGCACGGTGTAGCAGGAATGCTTGCCGGTGGTGCTGCCGCTGCAGCAGCTGCATATGGAGCTCACCATGTCGCCCATGGCGCT
GGTCACTACCCCCACGGCGTTGCTCATTATGGTCATGGTGGAAAATTCAAGCACCATGGCAAGTTCAAGCATGGGAAGTTTGGGAAGCATGGGAAACACAAACGTGGGAA
GCACGGCATGTTTGGTGGTGGTAAATTCAAGAAGTGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAACAAAAAACAAAAAACAAAAAACAAAAGATATATCTTATTCCCTTCCCCTTGTTTCTCTCTCTAGTCTCTATATATAACGAATTCTTCAAGATTTCAATTCTCCT
TGCCTCTCTCTCTCTTTGTCTCTCTTAAGAACCAAATCGTCCATTTGATCGTTATTATAAGCTGATAAACCATGGGGGGTGGAAAAGATAAACATGATGAATCTGACAAA
GGGTTATTCTCGCATCTGGCTCATGGTGTTGGTCATGGAGCTGGATATCCTCCTCCTGGAGGATATCCCCCACCTGGAGGGTATCCTCCACCTCATGGCTATCCACCCCA
AGGATATCCCCCCGCACATGGCTACCCCCCGGCTGGCTATCCTCCGGGCGCTTATCCTCCTGGTGCATATCCTGGACCCTCTGCCCCACACCATTCAGGGCACGGTGTAG
CAGGAATGCTTGCCGGTGGTGCTGCCGCTGCAGCAGCTGCATATGGAGCTCACCATGTCGCCCATGGCGCTGGTCACTACCCCCACGGCGTTGCTCATTATGGTCATGGT
GGAAAATTCAAGCACCATGGCAAGTTCAAGCATGGGAAGTTTGGGAAGCATGGGAAACACAAACGTGGGAAGCACGGCATGTTTGGTGGTGGTAAATTCAAGAAGTGGAA
GTGATTTTTCTGCCATCACTTTTCACCACCGAGCCTTCTTCTTCAGTGGCTTTCAAGAATGGCCTCGGTACAGATAGTGGGGGTACCATTTTCTTCTTCTCTAATCTCTG
TTGTTATGTCTTAAGAACTTTGGATTTGGTATTTACTCTTATTATCTTAACTACTGTTTAATGACTTCTGTACTCTGCTGCTGGACATCAATAATCTGGATCTTCTCCAG
TTTCACCTTTATTTTCTTTTTCTTGTAAATCCCTTTCTTTCTAGTTTCTACCTAATTTAAAATGGTTGTTCAATTACAGTGTTTTGTGCTTCACTTAGGATGAACCAATG
TACTATTTGATTTTTTTTTTTGGTAAAAGGAAGCAATTATCAAAATTGTGAGCAGGCTAACCAAAAAATGTGTATCTGGGGTCTTATGTTACTAGTTCCTTAAAAGAATT
TGTATGAAATTTTACCCTTTTTTTGGGATCTGTATA
Protein sequenceShow/hide protein sequence
MGGGKDKHDESDKGLFSHLAHGVGHGAGYPPPGGYPPPGGYPPPHGYPPQGYPPAHGYPPAGYPPGAYPPGAYPGPSAPHHSGHGVAGMLAGGAAAAAAAYGAHHVAHGA
GHYPHGVAHYGHGGKFKHHGKFKHGKFGKHGKHKRGKHGMFGGGKFKKWK