; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G5300 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G5300
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein CDI
Genome locationctg1251:10854..12930
RNA-Seq ExpressionCucsat.G5300
SyntenyCucsat.G5300
Gene Ontology termsNA
InterPro domainsIPR029044 - Nucleotide-diphospho-sugar transferases


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148344.1 protein CDI [Cucumis sativus]6.13e-192100Show/hide
Query:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
        MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
Subjt:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL

Query:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
Subjt:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
Subjt:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

XP_008465853.1 PREDICTED: protein CDI [Cucumis melo]2.49e-18496.43Show/hide
Query:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
        MGSSNGEN PAVGDEQPFRIFVGYDVREDLA++VCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLAN+KGWAMFVDCDFL
Subjt:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL

Query:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
Subjt:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        K VEGDLTTLPKAIHYTRGGPWFEAWKNCEF DLW+KEMEEY K AEKKSEE
Subjt:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

XP_022932955.1 protein CDI-like [Cucurbita moschata]9.64e-17893.63Show/hide
Query:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF
        MGS N EN PAVG+ EQPF+IFVGYDVREDLAY+VCRHSILKRSSIPVEIIPIKQADLR NGVYWRERGQ ESTEFSFSRFLTPYLAN++GWAMFVDCDF
Subjt:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF

Query:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE VNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKS
        NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA KKS
Subjt:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKS

XP_022958670.1 protein CDI-like [Cucurbita moschata]5.58e-17792.49Show/hide
Query:  MGSSNGENRPAV-GDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF
        MGS NGE   AV G EQPFRIFVGYDV EDLAY+VCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTPYLAN+KGWAMFVDCDF
Subjt:  MGSSNGENRPAV-GDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF

Query:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELRDLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        NKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA+KKSEE
Subjt:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

XP_038886869.1 protein CDI [Benincasa hispida]1.67e-18095.26Show/hide
Query:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF
        MGS NGEN PAVGD EQPFRIFVGYDVREDLAY+VCR+SI+KRSSIPVEIIPIKQADLRK+GVYWRERGQ ESTEFSFSRFLTPYLAN+KGWAMFVDCDF
Subjt:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF

Query:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA+KKSEE
Subjt:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

TrEMBL top hitse value%identityAlignment
A0A0A0LKJ4 Uncharacterized protein2.97e-192100Show/hide
Query:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
        MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
Subjt:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL

Query:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
Subjt:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
Subjt:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

A0A1S3CPV2 protein CDI1.20e-18496.43Show/hide
Query:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
        MGSSNGEN PAVGDEQPFRIFVGYDVREDLA++VCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLAN+KGWAMFVDCDFL
Subjt:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL

Query:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
Subjt:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        K VEGDLTTLPKAIHYTRGGPWFEAWKNCEF DLW+KEMEEY K AEKKSEE
Subjt:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

A0A5D3BW54 Protein CDI1.20e-18496.43Show/hide
Query:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL
        MGSSNGEN PAVGDEQPFRIFVGYDVREDLA++VCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLAN+KGWAMFVDCDFL
Subjt:  MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFL

Query:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
Subjt:  YLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        K VEGDLTTLPKAIHYTRGGPWFEAWKNCEF DLW+KEMEEY K AEKKSEE
Subjt:  KSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

A0A6J1EY80 protein CDI-like4.67e-17893.63Show/hide
Query:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF
        MGS N EN PAVG+ EQPF+IFVGYDVREDLAY+VCRHSILKRSSIPVEIIPIKQADLR NGVYWRERGQ ESTEFSFSRFLTPYLAN++GWAMFVDCDF
Subjt:  MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF

Query:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE VNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKS
        NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA KKS
Subjt:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKS

A0A6J1H454 protein CDI-like2.70e-17792.49Show/hide
Query:  MGSSNGENRPAV-GDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF
        MGS NGE   AV G EQPFRIFVGYDV EDLAY+VCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTPYLAN+KGWAMFVDCDF
Subjt:  MGSSNGENRPAV-GDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDF

Query:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELRDLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE
        NKSVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA+KKSEE
Subjt:  NKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE

SwissProt top hitse value%identityAlignment
Q9XIP8 Protein CDI2.3e-12383.19Show/hide
Query:  DEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLID
        +++PFRIFVGYD REDLAYQVC HSI KRSSIPVEI PI Q+DLRK G+YWRERGQ ESTEFSFSRFLTP+L++++GWAMFVDCDFLYLADIKEL DLID
Subjt:  DEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLID

Query:  NKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKA
        +K+A+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PEIVNTQTGAFLHRFQWLED EIGS+PFVWNFLEGHN+ VE D TT PKA
Subjt:  NKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKA

Query:  IHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSE
        +HYTRGGPWF+AWK+CEFADLWL EMEEYNKE +K+++
Subjt:  IHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSE

Arabidopsis top hitse value%identityAlignment
AT1G64980.1 Nucleotide-diphospho-sugar transferases superfamily protein1.6e-12483.19Show/hide
Query:  DEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLID
        +++PFRIFVGYD REDLAYQVC HSI KRSSIPVEI PI Q+DLRK G+YWRERGQ ESTEFSFSRFLTP+L++++GWAMFVDCDFLYLADIKEL DLID
Subjt:  DEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLID

Query:  NKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKA
        +K+A+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PEIVNTQTGAFLHRFQWLED EIGS+PFVWNFLEGHN+ VE D TT PKA
Subjt:  NKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKA

Query:  IHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSE
        +HYTRGGPWF+AWK+CEFADLWL EMEEYNKE +K+++
Subjt:  IHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTTCCAATGGAGAGAATCGTCCTGCTGTTGGAGATGAGCAACCATTTAGGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCGTATCAGGTCTGTCGCCA
TTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATACCAATCAAGCAAGCAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAAACTGAAAGCACAG
AATTCTCGTTTTCACGGTTCTTAACTCCTTATTTGGCAAATTTTAAAGGATGGGCAATGTTTGTGGATTGTGATTTCCTGTATCTAGCTGATATTAAGGAACTGAGGGAC
TTAATTGACAATAAGTTTGCAGTTATGTGTGTCCACCATGATTATACTCCAAAAGAAACTACAAAAATGGATGGTGCAGTTCAAACTGTTTACCCAAGGAAAAATTGGTC
TTCAATGGTTTTGTACAATTGTGGGCATCCAAAGAACAAAGTGTTGACACCCGAGATTGTCAACACTCAAACTGGTGCATTTCTTCATAGATTTCAATGGCTTGAGGATA
ATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAAGGCCATAACAAGAGTGTGGAAGGTGATTTAACCACTCTCCCTAAAGCAATTCATTACACTCGTGGTGGA
CCATGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATAATAAGGAGGCTGAGAAGAAATCTGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTTCCAATGGAGAGAATCGTCCTGCTGTTGGAGATGAGCAACCATTTAGGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCGTATCAGGTCTGTCGCCA
TTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATACCAATCAAGCAAGCAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAAACTGAAAGCACAG
AATTCTCGTTTTCACGGTTCTTAACTCCTTATTTGGCAAATTTTAAAGGATGGGCAATGTTTGTGGATTGTGATTTCCTGTATCTAGCTGATATTAAGGAACTGAGGGAC
TTAATTGACAATAAGTTTGCAGTTATGTGTGTCCACCATGATTATACTCCAAAAGAAACTACAAAAATGGATGGTGCAGTTCAAACTGTTTACCCAAGGAAAAATTGGTC
TTCAATGGTTTTGTACAATTGTGGGCATCCAAAGAACAAAGTGTTGACACCCGAGATTGTCAACACTCAAACTGGTGCATTTCTTCATAGATTTCAATGGCTTGAGGATA
ATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAAGGCCATAACAAGAGTGTGGAAGGTGATTTAACCACTCTCCCTAAAGCAATTCATTACACTCGTGGTGGA
CCATGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATAATAAGGAGGCTGAGAAGAAATCTGAAGAATAG
Protein sequenceShow/hide protein sequence
MGSSNGENRPAVGDEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRD
LIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGG
PWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE