CuGenDBv2

HG10019286 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019286
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr04:19778161..19780837
RNA-Seq Expression: HG10019286
Synteny: HG10019286
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
XP_004152251.1 uncharacterized protein LOC101206482 [Cucumis sativus] | e value: 9.7e-131 | %identity: 86.6
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI
        MSRR LDSR SIDSCTLK HGW+PFHLPKTLDSD H +SAPTN+KPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSI       RSFRLI
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI

Query:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY
        ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGF  SNGGFDAQGNESGY
Subjt:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY

Query:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_008454343.1 PREDICTED: uncharacterized protein LOC103494772 [Cucumis melo] | e value: 9.7e-131 | %identity: 86.6
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI
        MSRR LDSR SIDSCTLK HGW+PFHLPKTLDSD H +SAPTN+KPYYSSTP+HTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSI       RSFRLI
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI

Query:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY
        ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGF  SNGGFDAQGNESGY
Subjt:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY

Query:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022155253.1 uncharacterized protein LOC111022393 [Momordica charantia] | e value: 5.9e-128 | %identity: 84.64
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHL---PKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFR
        MSRRALDSRQSI+SCTLKLH W PF L   PKTLDSD H+SAPTN+KPYYSST LHTKRPCLSDR TSF+VDAIDMS LSLIDDDKPSIAAGCYTRRSFR
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHL---PKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFR

Query:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNES
        L+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD  G     SNGGFDAQGNES
Subjt:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNES

Query:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022949469.1 uncharacterized protein LOC111452804 isoform X2 [Cucurbita moschata] | e value: 7.7e-128 | %identity: 85.08
Query:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF
        MSRR LDSRQSIDSCTLKLH W PFH     PKTLDSDTH S PT +KPYYSST LHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSIAAG YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLG GFG SNGG DAQGN
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_038906083.1 uncharacterized protein LOC120091971 [Benincasa hispida] | e value: 5.5e-142 | %identity: 91.38
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLIA
        MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTN+KPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRS RLIA
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLIA

Query:  RKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGYG
        RKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFG SNGGFDAQGNESGYG
Subjt:  RKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGYG

Query:  SEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DS+MEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  SEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KT54 Uncharacterized protein | e value: 4.7e-131 | %identity: 86.6
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI
        MSRR LDSR SIDSCTLK HGW+PFHLPKTLDSD H +SAPTN+KPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSI       RSFRLI
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI

Query:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY
        ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGF  SNGGFDAQGNESGY
Subjt:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY

Query:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A1S3BYD4 uncharacterized protein LOC103494772 | e value: 4.7e-131 | %identity: 86.6
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI
        MSRR LDSR SIDSCTLK HGW+PFHLPKTLDSD H +SAPTN+KPYYSSTP+HTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSI       RSFRLI
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH-SSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLI

Query:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY
        ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGF  SNGGFDAQGNESGY
Subjt:  ARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGY

Query:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1DMG8 uncharacterized protein LOC111022393 | e value: 2.9e-128 | %identity: 84.64
Query:  MSRRALDSRQSIDSCTLKLHGWSPFHL---PKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFR
        MSRRALDSRQSI+SCTLKLH W PF L   PKTLDSD H+SAPTN+KPYYSST LHTKRPCLSDR TSF+VDAIDMS LSLIDDDKPSIAAGCYTRRSFR
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFHL---PKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFR

Query:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNES
        L+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD  G     SNGGFDAQGNES
Subjt:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNES

Query:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GC64 uncharacterized protein LOC111452804 isoform X1 | e value: 2.4e-127 | %identity: 84.56
Query:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF
        MSRR LDSRQSIDSCTLKLH W PFH     PKTLDSDTH S PT +KPYYSST LHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSIAAG YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLG GFG SNGG DAQGN
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG-VPFFWFKVQVKEMIDMFT--FYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG +        + E+   F   +   SRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG-VPFFWFKVQVKEMIDMFT--FYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GCW8 uncharacterized protein LOC111452804 isoform X2 | e value: 3.7e-128 | %identity: 85.08
Query:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF
        MSRR LDSRQSIDSCTLKLH W PFH     PKTLDSDTH S PT +KPYYSST LHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSIAAG YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHGWSPFH----LPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLG GFG SNGG DAQGN
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGSGFGPSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                     DSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT4G02425.1 unknown protein | e value: 8.6e-69 | %identity: 54.85
Query:  MSRRALD-SRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDK--PSIAAGCYTRR-SF
        MS + L+ SR SI+SCT +L  W PFH  KTLDS   S  P     ++S TP   KRPC SDR+TSF ++A  MS LSL DDD    +++A  Y+ R SF
Subjt:  MSRRALD-SRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDK--PSIAAGCYTRR-SF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSE-AKNSRREREE---KDHLGSGFGPSNGGFDA
        RL+ARKRRRR SRSVSGRSSDRSGTRRCCS+G   AHGTCSD P AVGTDSSGELF  G+ANW+SDVSE A+NSRRER +   +     GFG +N G D 
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSE-AKNSRREREE---KDHLGSGFGPSNGGFDA

Query:  QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR
         GNESGYGSEPGYRGD EFGYGDE D+E+ED + L WG+                        DS M + GE  F+D K   RCRR++ H+ + VD++R
Subjt:  QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR


Sequences
CDS sequence
ATGTCTCGCAGAGCCCTAGATTCCCGCCAATCCATTGACTCTTGTACTCTCAAGCTCCATGGTTGGAGCCCTTTTCACCTCCCCAAAACCCTAGATTCCGACACCCATTC
CTCAGCTCCCACTAACGCTAAACCCTACTACTCTTCCACTCCCCTCCACACCAAGCGCCCTTGTCTCTCCGATCGAACTACCTCTTTCAATGTCGACGCCATTGACATGT
CCGCCCTCAGTTTGATTGACGACGACAAGCCTTCTATTGCCGCTGGCTGTTACACGCGTCGGAGCTTCCGATTGATTGCTAGGAAGCGGCGGCGTCGTGGATCTAGGTCT
GTTTCTGGGCGGAGTAGTGATCGGAGTGGGACGAGACGGTGCTGCTCTGTTGGGGCTTCCGCGGCTCATGGGACTTGCTCCGATTTCCCTCTGGCGGTTGGGACTGATTC
CAGTGGGGAGTTGTTTGTGAATGGAGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGATCATTTGGGTTCTGGGTTTG
GTCCTTCTAATGGGGGTTTTGATGCTCAGGGGAATGAGTCTGGATATGGTAGTGAGCCTGGTTATCGTGGAGATGGTGAATTTGGATATGGTGATGAGATCGATGAGGAG
GATGAGGATGCCAGATTGCTGTTGTGGGGTGAACGACTAGGAGTCCCCTTTTTCTGGTTCAAAGTCCAAGTTAAGGAGATGATTGATATGTTCACATTCTATGCAGATTC
TAGAATGGAAATTGTAGGAGAGAATACATTTGCAGATCAGAAATCACACCATAGATGCCGTCGTAAGAAGCACGAATGTAGAATGGTTGATGCCCTGAGGTGA
mRNA sequence
ATGTCTCGCAGAGCCCTAGATTCCCGCCAATCCATTGACTCTTGTACTCTCAAGCTCCATGGTTGGAGCCCTTTTCACCTCCCCAAAACCCTAGATTCCGACACCCATTC
CTCAGCTCCCACTAACGCTAAACCCTACTACTCTTCCACTCCCCTCCACACCAAGCGCCCTTGTCTCTCCGATCGAACTACCTCTTTCAATGTCGACGCCATTGACATGT
CCGCCCTCAGTTTGATTGACGACGACAAGCCTTCTATTGCCGCTGGCTGTTACACGCGTCGGAGCTTCCGATTGATTGCTAGGAAGCGGCGGCGTCGTGGATCTAGGTCT
GTTTCTGGGCGGAGTAGTGATCGGAGTGGGACGAGACGGTGCTGCTCTGTTGGGGCTTCCGCGGCTCATGGGACTTGCTCCGATTTCCCTCTGGCGGTTGGGACTGATTC
CAGTGGGGAGTTGTTTGTGAATGGAGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGATCATTTGGGTTCTGGGTTTG
GTCCTTCTAATGGGGGTTTTGATGCTCAGGGGAATGAGTCTGGATATGGTAGTGAGCCTGGTTATCGTGGAGATGGTGAATTTGGATATGGTGATGAGATCGATGAGGAG
GATGAGGATGCCAGATTGCTGTTGTGGGGTGAACGACTAGGAGTCCCCTTTTTCTGGTTCAAAGTCCAAGTTAAGGAGATGATTGATATGTTCACATTCTATGCAGATTC
TAGAATGGAAATTGTAGGAGAGAATACATTTGCAGATCAGAAATCACACCATAGATGCCGTCGTAAGAAGCACGAATGTAGAATGGTTGATGCCCTGAGGTGA
Protein sequence
MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTHSSAPTNAKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSFRLIARKRRRRGSRS
VSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGSGFGPSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEE
DEDARLLLWGERLGVPFFWFKVQVKEMIDMFTFYADSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR