CuGenDBv2

ClCG09G014940 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G014940
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Glycine-rich RNA-binding protein like
Genome location: CG_Chr09:27945134..27949566
RNA-Seq Expression: ClCG09G014940
Synteny: ClCG09G014940
Gene Ontology terms:
  GO:0003723 - RNA binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000504 - RNA recognition motif domain
  IPR001878 - Zinc finger, CCHC-type
  IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
  IPR035979 - RNA-binding domain superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
XP_004136860.1 glycine-rich RNA-binding protein RZ1A [Cucumis sativus] | e-value: 4.0e-71 | % identity: 96.53
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    RDRGRD GGARGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

XP_008455237.1 PREDICTED: glycine-rich RNA-binding protein RZ1A [Cucumis melo] | e-value: 4.0e-71 | % identity: 96.53
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    RDRGRD GGARGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

XP_022154392.1 glycine-rich RNA-binding protein RZ1A [Momordica charantia] | e-value: 9.1e-68 | % identity: 93.01
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA+EVEYRCFIGGLSWSTSDR LKEAFEKFGHL+EAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  ---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
            RDR RD GG RGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  ---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

XP_023553985.1 glycine-rich RNA-binding protein RZ1A-like [Cucurbita pepo subsp. pepo] | e-value: 3.5e-67 | % identity: 92.14
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA+EVEYRCFIGGLSWSTSDRGLK+AFEKFGHL+EAKVVVDKFSGRSRGFGFVTFDEKKAM+EAIK MNG+DLDGR+ITVDKAQPNQGSGRDHDGDRPR 
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        GRDRGRD GG RGSN GDCFKCGKPGHFARECPSEGGRGG
Subjt:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

XP_038888155.1 glycine-rich RNA-binding protein RZ1A [Benincasa hispida] | e-value: 8.0e-72 | % identity: 97.22
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0K5W4 Uncharacterized protein | e-value: 1.9e-71 | % identity: 96.53
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    RDRGRD GGARGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

A0A1S3C164 glycine-rich RNA-binding protein RZ1A | e-value: 1.9e-71 | % identity: 96.53
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    RDRGRD GGARGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  G----RDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

A0A6J1DLZ2 glycine-rich RNA-binding protein RZ1A | e-value: 4.4e-68 | % identity: 93.01
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA+EVEYRCFIGGLSWSTSDR LKEAFEKFGHL+EAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  ---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
            RDR RD GG RGSNSGDCFKCGKPGHFARECPSEGGRGG
Subjt:  ---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

A0A6J1HH58 glycine-rich RNA-binding protein RZ1A-like | e-value: 3.7e-67 | % identity: 91.43
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA+EVEYRCFIGGLSWSTSDRGLK+AFEKFGHL+EAKVVVDKFSGRSRGFGFVTFDEKKAM+EAIK MNG+DLDGR+ITVDKAQPNQGSG+DHDGDRPR 
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        GRDRGRD GG RGSN GDCFKCGKPGHFARECPSEGGRGG
Subjt:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

A0A6J1HVL9 glycine-rich RNA-binding protein RZ1A-like | e-value: 8.3e-67 | % identity: 91.43
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA+EVEYRCFIGGLSWSTSDRGLK+AFEKFGHL+EAKVVVDKFSGRSRGFGFVTFDEKKAM+EAIK MNG+DLDGR+ITVDKAQPNQGSGRDHDGDRPR 
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
        GRDRGRD G  RGSN GDCFKCGKPGHFARECPSEGGRGG
Subjt:  GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

SwissProt top hits (columns: hit, e-value, % identity, alignment)
P49311 Glycine-rich RNA-binding protein GRP2A | e-value: 1.5e-25 | % identity: 48.91
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD
        +VEYRCF+GGL+W+T +R L+ AF +FG LV++K++ D+ +GRSRGFGFVTF ++K+M +AI+ MNG DLDGRSITV++AQ ++GSG    G    GG  
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD

Query:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
         G   GG  G   G   + G          S GG GG
Subjt:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

Q03250 Glycine-rich RNA-binding protein 7 | e-value: 6.9e-26 | % identity: 49.64
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD
        +VEYRCF+GGL+W+T DR L+ AF ++G ++++K++ D+ +GRSRGFGFVTF ++KAM +AI+ MNG DLDGRSITV++AQ ++GSG    G   RGG  
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD

Query:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
         G  SGG  G + G     G  G         GG GG
Subjt:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

Q03878 Glycine-rich RNA-binding protein | e-value: 2.0e-25 | % identity: 43.92
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQP---------NQGSGRDHD
        EVEYRCF+GGL+W+T+D  L++AF +FG + ++K++ D+ +GRSRGFGFVTF ++K+M +AI+ MNG +LDGR+ITV++AQ           +G G  + 
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQP---------NQGSGRDHD

Query:  GDRPRGGRDRGRDSG--GARGSNSGDCFKCGKPGHFARECPSEGGRGG
        G    GGR  G   G  G R    G  +  G  G+  R    +GG GG
Subjt:  GDRPRGGRDRGRDSG--GARGSNSGDCFKCGKPGHFARECPSEGGRGG

Q8RWN5 Glycine-rich RNA-binding protein RZ1C | e-value: 1.8e-26 | % identity: 44.44
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA +   R F+GGLS   +DR L+ AF +FG +++ ++++++ +GRSRGFGF+TF +++AMDE+I+ M+G D   R I+V++A+P  G   D +    RG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  GRDRGRD--------SGGARGSNSG--DCFKCGKPGHFARECPSE-GGRGGVV
        GRD G           GG  G   G  +CFKCG+ GH+AR+CPS  GGRGG V
Subjt:  GRDRGRD--------SGGARGSNSG--DCFKCGKPGHFARECPSE-GGRGGVV

Q9LIN3 Glycine-rich RNA-binding protein RZ1A | e-value: 1.1e-55 | % identity: 77.08
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQ-GSGRDHDGDRPR
        M+++ EYRCFIGGL+W+TSDRGL++AFEK+GHLVEAKVV+DKFSGRSRGFGF+TFDEKKAMDEAI AMNGMDLDGR+ITVDKAQP+Q G+GRD+DGDR R
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQ-GSGRDHDGDRPR

Query:  G---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
             RDR R SGG  G   GDCFKCGKPGHFARECPSE  R G
Subjt:  G---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT2G21660.1 cold, circadian rhythm, and rna binding 2 | e-value: 4.9e-27 | % identity: 49.64
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD
        +VEYRCF+GGL+W+T DR L+ AF ++G ++++K++ D+ +GRSRGFGFVTF ++KAM +AI+ MNG DLDGRSITV++AQ ++GSG    G   RGG  
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD

Query:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
         G  SGG  G + G     G  G         GG GG
Subjt:  RGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

AT2G21660.2 cold, circadian rhythm, and rna binding 2 | e-value: 6.4e-27 | % identity: 47.45
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQP-NQGSGRDHDGDRPRGGR
        +VEYRCF+GGL+W+T DR L+ AF ++G ++++K++ D+ +GRSRGFGFVTF ++KAM +AI+ MNG DLDGRSITV++AQ    G G  H G    GG 
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQP-NQGSGRDHDGDRPRGGR

Query:  DRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRG
           R+ GG      G     G  G        EGG G
Subjt:  DRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRG

AT3G26420.1 RNA-binding (RRM/RBD/RNP motifs) family protein with retrovirus zinc finger-like domain | e-value: 7.7e-57 | % identity: 77.08
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQ-GSGRDHDGDRPR
        M+++ EYRCFIGGL+W+TSDRGL++AFEK+GHLVEAKVV+DKFSGRSRGFGF+TFDEKKAMDEAI AMNGMDLDGR+ITVDKAQP+Q G+GRD+DGDR R
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQ-GSGRDHDGDRPR

Query:  G---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG
             RDR R SGG  G   GDCFKCGKPGHFARECPSE  R G
Subjt:  G---GRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGG

AT4G39260.1 cold, circadian rhythm, and RNA binding 1 | e-value: 2.4e-26 | % identity: 48.2
Query:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD
        EVEYRCF+GGL+W+T+D  L+  F +FG ++++K++ D+ SGRSRGFGFVTF ++KAM +AI+ MNG +LDGR ITV++AQ ++GSG    G   RGG  
Subjt:  EVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRD

Query:  RGRDSGGARG--SNSGDCFKCGKPGHFARECPSEGGRGG
         G  SGG  G     G  +  G  G + R     G  GG
Subjt:  RGRDSGGARG--SNSGDCFKCGKPGHFARECPSEGGRGG

AT5G04280.1 RNA-binding (RRM/RBD/RNP motifs) family protein with retrovirus zinc finger-like domain | e-value: 1.3e-27 | % identity: 44.44
Query:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG
        MA +   R F+GGLS   +DR L+ AF +FG +++ ++++++ +GRSRGFGF+TF +++AMDE+I+ M+G D   R I+V++A+P  G   D +    RG
Subjt:  MADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVDKFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRG

Query:  GRDRGRD--------SGGARGSNSG--DCFKCGKPGHFARECPSE-GGRGGVV
        GRD G           GG  G   G  +CFKCG+ GH+AR+CPS  GGRGG V
Subjt:  GRDRGRD--------SGGARGSNSG--DCFKCGKPGHFARECPSE-GGRGGVV
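
The % identity values listed for the hits can be related to the alignments shown above. In a BLAST-style alignment, the middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch or gap. The Python sketch below assumes that convention and assumes identity is computed over all alignment columns, gaps included. The function name `percent_identity` is hypothetical and is not part of CuGenDB. For the first GenBank hit (XP_004136860.1), the displayed midline has 139 letters over 144 columns, which agrees with the listed 96.53.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment, from its query line and midline.

    Assumption: a letter in the midline marks an identical pair, '+' marks a
    conservative substitution, and a space marks a mismatch or a gap.
    The alignment length is taken from the query line (gaps included).
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often lost in copied text,
    # so pad (or trim) it to the alignment length before counting.
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / length, 2)


if __name__ == "__main__":
    # Start of the second alignment segment of the first GenBank hit.
    print(percent_identity("G----RDRGRDSGG", "G    RDRGRD GG"))
```

To score a full hit, concatenate the query segments and the midline segments of that hit in order before calling the function.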


Sequences
CDS sequence
ATGACCCAATCGATCATGCCAAACAGTGAATATATCTGGATTCATGAACTTCAGGTCTTCGGTTCGTTGACTGGCTCTTTGCCCTACAACCAGTTTAACCATCTC
ACCCTCACTTTGCCTTCTCCATCTTTTAGGTTCTCTCGCTACGGAATCCAATTCCTAGGGCTTCCCTTCTTCATAAATTCTCTCATAATGGCTGACGAGGTGGAG
TATCGCTGCTTTATTGGCGGCCTTTCTTGGTCAACATCTGACAGAGGTCTCAAAGAGGCGTTTGAAAAGTTTGGCCATCTTGTCGAGGCCAAGGTAGTTGTTGAC
AAATTTTCTGGGCGCTCCCGTGGTTTTGGATTTGTCACTTTTGATGAGAAAAAAGCTATGGATGAGGCAATCAAAGCCATGAATGGAATGGATTTAGATGGCCGG
AGTATTACAGTTGATAAGGCTCAACCAAACCAAGGTTCAGGTAGAGATCATGATGGTGACCGTCCACGTGGTGGCCGTGATCGAGGTCGAGATTCTGGTGGTGCA
CGTGGATCTAACAGTGGAGATTGCTTTAAATGTGGGAAACCTGGACATTTTGCTAGGGAATGTCCCAGTGAAGGTGGTAGAGGAGGGGTAGTGGTGACCGAGGAG
GTTCGGGAAGTGATCGATATAGCCGTGACCGCTCTGGGCCGTATGAACGACGCGGTTCTGGTGGTTTTCGTTCTGGATAGAATTATGACTTATCTTGTAGCTTAC
ATGGTGGGATCTTGCTGTTATGTCCTGCAGAATGTTAAATGTTCCTTTCTACAGGTTGTCTCACGTACAGTTCAGAGTTCTGAAGTGCAGAACTTGACGTTAATA
TTGAGATGTTTGTGTTCCTATGTGATGAATTTGGTATCTTGGTACAGTGGTGTGGTGTCTAATGTGTTCTTGACATATTCATTCGAATCTGGTTTTATGATGTTT
GAGTGCCCTAGTAGCTGCTGCATTACTTGTGACACACATACACAGGGAATTTCAATAACTGCTGCACCATTTCATTTTGAAAATTTTCTTTTGAATACAACTAGA
CCTGAACGTTGTTCCCATTGTCTTATAAATTCCACAGGAGTTTACAGCATCCTTCCTTTTTATGGTCTTATTGGGTGCATAATACAGTTTCTTGTTAATTGGGCG
CTTTAA
mRNA sequence
ATGACCCAATCGATCATGCCAAACAGTGAATATATCTGGATTCATGAACTTCAGGTCTTCGGTTCGTTGACTGGCTCTTTGCCCTACAACCAGTTTAACCATCTC
ACCCTCACTTTGCCTTCTCCATCTTTTAGGTTCTCTCGCTACGGAATCCAATTCCTAGGGCTTCCCTTCTTCATAAATTCTCTCATAATGGCTGACGAGGTGGAG
TATCGCTGCTTTATTGGCGGCCTTTCTTGGTCAACATCTGACAGAGGTCTCAAAGAGGCGTTTGAAAAGTTTGGCCATCTTGTCGAGGCCAAGGTAGTTGTTGAC
AAATTTTCTGGGCGCTCCCGTGGTTTTGGATTTGTCACTTTTGATGAGAAAAAAGCTATGGATGAGGCAATCAAAGCCATGAATGGAATGGATTTAGATGGCCGG
AGTATTACAGTTGATAAGGCTCAACCAAACCAAGGTTCAGGTAGAGATCATGATGGTGACCGTCCACGTGGTGGCCGTGATCGAGGTCGAGATTCTGGTGGTGCA
CGTGGATCTAACAGTGGAGATTGCTTTAAATGTGGGAAACCTGGACATTTTGCTAGGGAATGTCCCAGTGAAGGTGGTAGAGGAGGGGTAGTGGTGACCGAGGAG
GTTCGGGAAGTGATCGATATAGCCGTGACCGCTCTGGGCCGTATGAACGACGCGGTTCTGGTGGTTTTCGTTCTGGATAGAATTATGACTTATCTTGTAGCTTAC
ATGGTGGGATCTTGCTGTTATGTCCTGCAGAATGTTAAATGTTCCTTTCTACAGGTTGTCTCACGTACAGTTCAGAGTTCTGAAGTGCAGAACTTGACGTTAATA
TTGAGATGTTTGTGTTCCTATGTGATGAATTTGGTATCTTGGTACAGTGGTGTGGTGTCTAATGTGTTCTTGACATATTCATTCGAATCTGGTTTTATGATGTTT
GAGTGCCCTAGTAGCTGCTGCATTACTTGTGACACACATACACAGGGAATTTCAATAACTGCTGCACCATTTCATTTTGAAAATTTTCTTTTGAATACAACTAGA
CCTGAACGTTGTTCCCATTGTCTTATAAATTCCACAGGAGTTTACAGCATCCTTCCTTTTTATGGTCTTATTGGGTGCATAATACAGTTTCTTGTTAATTGGGCG
CTTTAA
Protein sequence
MTQSIMPNSEYIWIHELQVFGSLTGSLPYNQFNHLTLTLPSPSFRFSRYGIQFLGLPFFINSLIMADEVEYRCFIGGLSWSTSDRGLKEAFEKFGHLVEAKVVVD
KFSGRSRGFGFVTFDEKKAMDEAIKAMNGMDLDGRSITVDKAQPNQGSGRDHDGDRPRGGRDRGRDSGGARGSNSGDCFKCGKPGHFARECPSEGGRGGVVVTEE
VREVIDIAVTALGRMNDAVLVVFVLDRIMTYLVAYMVGSCCYVLQNVKCSFLQVVSRTVQSSEVQNLTLILRCLCSYVMNLVSWYSGVVSNVFLTYSFESGFMMF
ECPSSCCITCDTHTQGISITAAPFHFENFLLNTTRPERCSHCLINSTGVYSILPFYGLIGCIIQFLVNWAL
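
The protein sequence above appears to be the in-frame translation of the listed CDS under the standard genetic code: the first ten codons, ATG ACC CAA TCG ATC ATG CCA AAC AGT GAA, give MTQSIMPNSE. A minimal Python sketch for checking this is given below. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide sequence in frame 1.

    Whitespace and line breaks are ignored, stop codons are written as '*',
    codons containing unexpected characters are written as 'X', and any
    trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - 2, 3)
    )


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGACCCAATCGATCATGCCAAACAGTGAA"))
```

Pasting the full CDS block into `translate`, line breaks included, and comparing the result with the protein sequence (ignoring the final `*`) is one way to confirm that the two records correspond.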