; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g38960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g38960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein CDI
Genome locationchr6:30224831..30226335
RNA-Seq ExpressionMoc06g38960
SyntenyMoc06g38960
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR029044 - Nucleotide-diphospho-sugar transferases


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011466.1 Protein CDI, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-13978.93Show/hide
Query:  LSRRFYRTLPTFFGPRFSPNLQIREGTANQFDCFQFLFFEIYVFVLLALENIISEDLGHPLPMGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVC
        LS R +RTL TF  PRF P +  +      FD   F       F              H LPMGSC     PAV  G+VEQPFKIFVGYDVREDLAYEVC
Subjt:  LSRRFYRTLPTFFGPRFSPNLQIREGTANQFDCFQFLFFEIYVFVLLALENIISEDLGHPLPMGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVC

Query:  RHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDG
        RHSILKRSSIPVEIIPIKQ+DLR  GVYWRERGQ ESTEFSFSRFLTPYLANY+GWAMFVDCDFLYLADIKELRDL+DNK+AVMCVHHDYTPKETTKMDG
Subjt:  RHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDG

Query:  AVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLW
        AVQTVYPRKNWSSMVLYNCGH KNK+LTPE VNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLW
Subjt:  AVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLW

Query:  LKEKEEYLMNEAKKKSEE
        LKE EEY   EA KKS E
Subjt:  LKEKEEYLMNEAKKKSEE

XP_022131238.1 protein CDI [Momordica charantia]2.3e-152100Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

XP_022958670.1 protein CDI-like [Cucurbita moschata]1.0e-13690.62Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC G TH AV    +EQPF+IFVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQ+DLRK GVYWRERGQ ESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNKYA+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

XP_023533470.1 protein CDI-like [Cucurbita pepo subsp. pepo]3.0e-13690.23Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC G TH AV    +EQPF+IFVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQ+DLRK G YWRERGQ ESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNKYA+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

XP_038886869.1 protein CDI [Benincasa hispida]2.6e-14093.36Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC G  HPAV  GDVEQPF+IFVGYDVREDLAYEVCR+SI+KRSSIPVEIIPIKQ+DLRK GVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNK+AVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

TrEMBL top hitse value%identityAlignment
A0A0A0LKJ4 Uncharacterized protein8.0e-13591.02Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGS  G   PAV  GD EQPF+IFVGYDVREDLAY+VCRHSILKRSSIPVEIIPIKQ+DLRK GVYWRERGQ ESTEFSFSRFLTPYLAN+KGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNK+AVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPEIVNTQTGAFLHRFQWLED+EIGSVPFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EA+KKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

A0A6J1BSS3 protein CDI1.1e-152100Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

A0A6J1EY80 protein CDI-like3.6e-13590.62Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC     PAV  G+VEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQ+DLR  GVYWRERGQ ESTEFSFSRFLTPYLANY+GWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNK+AVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNK+LTPE VNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EA KKS E
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

A0A6J1H454 protein CDI-like5.0e-13790.62Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC G TH AV    +EQPF+IFVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQ+DLRK GVYWRERGQ ESTEFSFSRFLTPYLANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNKYA+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EAKKKSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

A0A6J1K1D2 protein CDI-like2.1e-13589.84Show/hide
Query:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        MGSC G TH AV    +EQPF+IFVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQ+DLRK GVYWRERGQ ESTEFSFSRFLTP LANYKGWAMFVDC
Subjt:  MGSCTGVTHPAVSNGDVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKELRDL+DNKYA+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHNKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKE EEY   EAK KSEE
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

SwissProt top hitse value%identityAlignment
Q9XIP8 Protein CDI4.9e-12178.52Show/hide
Query:  VSNGDV-----------EQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        +SNGDV           ++PF+IFVGYD REDLAY+VC HSI KRSSIPVEI PI QSDLRKKG+YWRERGQ ESTEFSFSRFLTP+L++Y+GWAMFVDC
Subjt:  VSNGDV-----------EQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKEL DL+D+KYA+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNK L+PEIVNTQTGAFLHRFQWLED+EIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHN+ VE D TT PKA+HYTRGGPWF+AWK+CEFADLWL E EEY  N+  KK  +
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE

Arabidopsis top hitse value%identityAlignment
AT1G64980.1 Nucleotide-diphospho-sugar transferases superfamily protein3.5e-12278.52Show/hide
Query:  VSNGDV-----------EQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC
        +SNGDV           ++PF+IFVGYD REDLAY+VC HSI KRSSIPVEI PI QSDLRKKG+YWRERGQ ESTEFSFSRFLTP+L++Y+GWAMFVDC
Subjt:  VSNGDV-----------EQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDC

Query:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE
        DFLYLADIKEL DL+D+KYA+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGH KNK L+PEIVNTQTGAFLHRFQWLED+EIGS+PFVWNFLE
Subjt:  DFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLE

Query:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE
        GHN+ VE D TT PKA+HYTRGGPWF+AWK+CEFADLWL E EEY  N+  KK  +
Subjt:  GHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEKEEYLMNEAKKKSEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCATCCCAAGGAGCGTACGCGCTTATTTCTCTCTTTCTCGCAGATTTTATAGAACGCTCCCTACATTTTTTGGACCAAGATTTTCCCCAAATCTACAA
ATTAGAGAAGGAACTGCGAACCAATTTGACTGTTTTCAATTCCTCTTCTTCGAGATCTACGTTTTCGTCCTTCTTGCATTGGAGAATATTATATCTGAAGATTTG
GGGCATCCACTTCCAATGGGTTCTTGTACTGGGGTGACTCATCCTGCGGTTAGCAATGGAGATGTGGAGCAGCCATTCAAGATCTTTGTGGGCTACGATGTTCGT
GAAGATCTTGCTTATGAGGTGTGTCGCCATTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATTCCAATCAAGCAGTCAGATCTGAGAAAGAAGGGTGTC
TATTGGCGTGAGAGAGGACAATTTGAGAGCACCGAGTTCTCGTTCTCTCGGTTCTTAACCCCGTACCTGGCGAATTACAAAGGATGGGCAATGTTTGTCGACTGT
GATTTTTTGTACCTAGCAGACATTAAGGAACTGAGAGACTTGGTTGACAATAAGTATGCAGTTATGTGTGTGCACCATGATTACACACCAAAAGAAACTACAAAA
ATGGATGGTGCAGTGCAAACTGTGTACCCAAGGAAGAACTGGTCTTCCATGGTTTTGTACAACTGTGGGCACTCAAAGAACAAAGTCTTGACACCTGAGATTGTG
AACACCCAAACTGGTGCTTTTCTTCACAGGTTCCAATGGCTTGAGGATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTCCTTGAGGGCCATAACAAGAGT
GTAGAGGGTGATCTCACCACTCTCCCAAAAGCAATCCATTACACTCGCGGTGGGCCATGGTTCGAAGCTTGGAAGAACTGTGAATTTGCAGATCTCTGGCTGAAA
GAGAAGGAGGAGTATCTGATGAATGAGGCTAAGAAGAAATCTGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTCCATCCCAAGGAGCGTACGCGCTTATTTCTCTCTTTCTCGCAGATTTTATAGAACGCTCCCTACATTTTTTGGACCAAGATTTTCCCCAAATCTACAA
ATTAGAGAAGGAACTGCGAACCAATTTGACTGTTTTCAATTCCTCTTCTTCGAGATCTACGTTTTCGTCCTTCTTGCATTGGAGAATATTATATCTGAAGATTTG
GGGCATCCACTTCCAATGGGTTCTTGTACTGGGGTGACTCATCCTGCGGTTAGCAATGGAGATGTGGAGCAGCCATTCAAGATCTTTGTGGGCTACGATGTTCGT
GAAGATCTTGCTTATGAGGTGTGTCGCCATTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATTCCAATCAAGCAGTCAGATCTGAGAAAGAAGGGTGTC
TATTGGCGTGAGAGAGGACAATTTGAGAGCACCGAGTTCTCGTTCTCTCGGTTCTTAACCCCGTACCTGGCGAATTACAAAGGATGGGCAATGTTTGTCGACTGT
GATTTTTTGTACCTAGCAGACATTAAGGAACTGAGAGACTTGGTTGACAATAAGTATGCAGTTATGTGTGTGCACCATGATTACACACCAAAAGAAACTACAAAA
ATGGATGGTGCAGTGCAAACTGTGTACCCAAGGAAGAACTGGTCTTCCATGGTTTTGTACAACTGTGGGCACTCAAAGAACAAAGTCTTGACACCTGAGATTGTG
AACACCCAAACTGGTGCTTTTCTTCACAGGTTCCAATGGCTTGAGGATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTCCTTGAGGGCCATAACAAGAGT
GTAGAGGGTGATCTCACCACTCTCCCAAAAGCAATCCATTACACTCGCGGTGGGCCATGGTTCGAAGCTTGGAAGAACTGTGAATTTGCAGATCTCTGGCTGAAA
GAGAAGGAGGAGTATCTGATGAATGAGGCTAAGAAGAAATCTGAAGAATAG
Protein sequenceShow/hide protein sequence
MNSIPRSVRAYFSLSRRFYRTLPTFFGPRFSPNLQIREGTANQFDCFQFLFFEIYVFVLLALENIISEDLGHPLPMGSCTGVTHPAVSNGDVEQPFKIFVGYDVR
EDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKKGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLVDNKYAVMCVHHDYTPKETTK
MDGAVQTVYPRKNWSSMVLYNCGHSKNKVLTPEIVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLK
EKEEYLMNEAKKKSEE