; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G008910 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G008910
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein CDI-like
Genome locationchr05:15519542..15520303
RNA-Seq ExpressionLsi05G008910
SyntenyLsi05G008910
Gene Ontology termsNA
InterPro domainsIPR029044 - Nucleotide-diphospho-sugar transferases


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011466.1 Protein CDI, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-13992.49Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCN E  PAVG VEQPF+IFVGYDVREDLAYEVCR+SI+KRSS PVE+IPIKQADLR NGVYWRERGQ ESTEFSFSRFLTPYLA+Y+GWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEA KKS E
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

XP_022958670.1 protein CDI-like [Cucurbita moschata]3.1e-14192.09Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE H AV G+EQPFRIFVGYDV EDLAYEVCR+SI+KRSS PVE+IPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEAKKKSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

XP_022995211.1 protein CDI-like [Cucurbita maxima]1.3e-13991.3Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE H AV G+EQPFRIFVGYDV EDLAYEVCR+SI+KRSS PVE+IPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTP LA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEAK KSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

XP_023533470.1 protein CDI-like [Cucurbita pepo subsp. pepo]9.0e-14191.7Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE H AV G+EQPFRIFVGYDV EDLAYEVCR+SI+KRSS PVE+IPIKQADLRKNG YWRERGQ ESTEFSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEAKKKSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

XP_038886869.1 protein CDI [Benincasa hispida]2.3e-14496.05Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE HPAVG VEQPFRIFVGYDVREDLAYEVCRYSIMKRSS PVE+IPIKQADLRK+GVYWRERGQFESTEFSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEAKKKSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

TrEMBL top hitse value%identityAlignment
A0A0A0LKJ4 Uncharacterized protein9.1e-13992.49Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGS NGE  PAVG  EQPFRIFVGYDVREDLAY+VCR+SI+KRSS PVE+IPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTPYLA++KGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY KEA+KKSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

A0A6J1EY80 protein CDI-like6.3e-14092.49Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCN E  PAVG VEQPF+IFVGYDVREDLAYEVCR+SI+KRSS PVE+IPIKQADLR NGVYWRERGQ ESTEFSFSRFLTPYLA+Y+GWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEA KKS E
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

A0A6J1H454 protein CDI-like1.5e-14192.09Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE H AV G+EQPFRIFVGYDV EDLAYEVCR+SI+KRSS PVE+IPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEAKKKSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

A0A6J1I857 protein CDI-like3.1e-13992.09Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCN E  PAVG VEQPF+IFVGYDVREDLA+EVCR+SI+KRSS PVE+IPIKQADLR NGVYWRERGQ ESTEFSFSRFLTPYLA+Y+GWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N SVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEA KKS E
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

A0A6J1K1D2 protein CDI-like6.3e-14091.3Show/hide
Query:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE H AV G+EQPFRIFVGYDV EDLAYEVCR+SI+KRSS PVE+IPIKQADLRKNGVYWRERGQ ESTEFSFSRFLTP LA+YKGWAMFVDCDF
Subjt:  MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH
        LYLADIKEL+DLIDNK+A+MCVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQWLEDDEIGS+PFVWNFLEGH
Subjt:  LYLADIKELKDLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGH

Query:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE
        N+SVEGDL+TLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY+KEAK KSEE
Subjt:  NRSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE

SwissProt top hitse value%identityAlignment
Q9XIP8 Protein CDI5.7e-12282.7Show/hide
Query:  EQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELKDLIDN
        ++PFRIFVGYD REDLAY+VC +SI KRSS PVE+ PI Q+DLRK G+YWRERGQ ESTEFSFSRFLTP+L+DY+GWAMFVDCDFLYLADIKEL DLID+
Subjt:  EQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELKDLIDN

Query:  KFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRSVEGDLTTLPKAI
        K+A+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGHNR VE D TT PKA+
Subjt:  KFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRSVEGDLTTLPKAI

Query:  HYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSE
        HYTRGGPWF+AWK+CEFADLWL EMEEY KE KK+++
Subjt:  HYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSE

Arabidopsis top hitse value%identityAlignment
AT1G64980.1 Nucleotide-diphospho-sugar transferases superfamily protein4.1e-12382.7Show/hide
Query:  EQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELKDLIDN
        ++PFRIFVGYD REDLAY+VC +SI KRSS PVE+ PI Q+DLRK G+YWRERGQ ESTEFSFSRFLTP+L+DY+GWAMFVDCDFLYLADIKEL DLID+
Subjt:  EQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELKDLIDN

Query:  KFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRSVEGDLTTLPKAI
        K+A+MCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGHNR VE D TT PKA+
Subjt:  KFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRSVEGDLTTLPKAI

Query:  HYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSE
        HYTRGGPWF+AWK+CEFADLWL EMEEY KE KK+++
Subjt:  HYTRGGPWFEAWKNCEFADLWLKEMEEYKKEAKKKSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTTGTAATGGAGAGATTCATCCTGCTGTTGGAGGTGTGGAGCAACCATTTAGGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCCTATGAGGTCTGTCG
CTATTCCATCATGAAGCGATCTTCAACCCCTGTGGAGGTCATACCAATCAAACAGGCAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAATTTGAAAGCA
CCGAGTTCTCGTTTTCGCGGTTCTTGACTCCATATTTGGCGGATTATAAAGGATGGGCAATGTTTGTTGATTGTGATTTTCTGTATCTTGCTGATATTAAGGAACTGAAG
GACTTAATTGACAATAAGTTTGCTGTTATGTGTGTTCATCATGATTATACACCAAAAGAAACTACAAAAATGGATGGGGCAGTTCAAACTGTGTACCCAAGGAAGAATTG
GTCTTCAATGGTTTTATACAATTGTGGGCATCCAAAGAACAAAGTGTTGACACCTGAGACTGTCAATACCCAAACTGGTGCATTTCTTCATAGGTTCCAATGGCTTGAGG
ATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTCCTTGAGGGCCATAACAGGAGTGTGGAGGGTGATTTAACCACTCTCCCTAAAGCAATTCATTACACTCGTGGT
GGGCCATGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATAAGAAGGAGGCCAAGAAGAAATCTGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTTGTAATGGAGAGATTCATCCTGCTGTTGGAGGTGTGGAGCAACCATTTAGGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCCTATGAGGTCTGTCG
CTATTCCATCATGAAGCGATCTTCAACCCCTGTGGAGGTCATACCAATCAAACAGGCAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAATTTGAAAGCA
CCGAGTTCTCGTTTTCGCGGTTCTTGACTCCATATTTGGCGGATTATAAAGGATGGGCAATGTTTGTTGATTGTGATTTTCTGTATCTTGCTGATATTAAGGAACTGAAG
GACTTAATTGACAATAAGTTTGCTGTTATGTGTGTTCATCATGATTATACACCAAAAGAAACTACAAAAATGGATGGGGCAGTTCAAACTGTGTACCCAAGGAAGAATTG
GTCTTCAATGGTTTTATACAATTGTGGGCATCCAAAGAACAAAGTGTTGACACCTGAGACTGTCAATACCCAAACTGGTGCATTTCTTCATAGGTTCCAATGGCTTGAGG
ATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTCCTTGAGGGCCATAACAGGAGTGTGGAGGGTGATTTAACCACTCTCCCTAAAGCAATTCATTACACTCGTGGT
GGGCCATGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATAAGAAGGAGGCCAAGAAGAAATCTGAAGAATAG
Protein sequenceShow/hide protein sequence
MGSCNGEIHPAVGGVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSTPVEVIPIKQADLRKNGVYWRERGQFESTEFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELK
DLIDNKFAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPETVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRSVEGDLTTLPKAIHYTRG
GPWFEAWKNCEFADLWLKEMEEYKKEAKKKSEE