CuGenDBv2

Lag0031831 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031831
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein CDI-like
Genome location: chr11:15702362..15703120
RNA-Seq Expression: Lag0031831
Synteny: Lag0031831
Gene Ontology terms: NA
InterPro domains: IPR029044 - Nucleotide-diphospho-sugar transferases
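
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive at both ends (the usual convention for `start..end` notation, though the page does not state it); the function names `parse_location` and `span_length` are illustrative and not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumption: both ends are included)."""
    return end - start + 1


chrom, start, end = parse_location("chr11:15702362..15703120")
length = span_length(start, end)
print(chrom, length, length % 3 == 0)  # chr11 759 True
```

Under that assumption the gene spans 759 bp, which is a multiple of three. That is what one would expect if the span holds a single uninterrupted reading frame, and it can be checked against the CDS listed under Sequences.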


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7011466.1 Protein CDI, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.6e-134; identity 90.04%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCN E  PA+G+VE PFKIFVGYDVREDLAYEVCR+SILKRSSIPVEIIPIK  DLR NGVYWRERGQLEST+FSFSRFLTPYLA+Y+GWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+L+PETVNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKS
        NK VEGD  TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EA KKS
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKS

XP_008465853.1 PREDICTED: protein CDI [Cucumis melo]; e-value 3.3e-135; identity 89.29%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGS NGE HPA+GD E PF+IFVGYDVREDLA+EVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQ EST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK
        NKIVEGD  TLPKAIHYTRGGPWFEAWK CEF DLW+KEMEEY++A+KKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK

XP_022958670.1 protein CDI-like [Cucurbita moschata]; e-value 1.5e-135; identity 89.33%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGETH A+  +E PF+IFVGYDV EDLAYEVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQLEST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNKYA+MCVHHDY PKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK
        NK VEGD +TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EAKKKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK

XP_023533470.1 protein CDI-like [Cucurbita pepo subsp. pepo]; e-value 4.3e-135; identity 88.93%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGETH A+  +E PF+IFVGYDV EDLAYEVCR+SILKRSSIPVEIIPIK  DLRKNG YWRERGQLEST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNKYA+MCVHHDY PKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK
        NK VEGD +TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EAKKKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK

XP_038886869.1 protein CDI [Benincasa hispida]; e-value 4.9e-139; identity 90.87%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGE HPA+GDVE PF+IFVGYDVREDLAYEVCRYSI+KRSSIPVEIIPIK  DLRK+GVYWRERGQ EST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK
        NK VEGD  TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEY+EAKKKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CPV2 protein CDI; e-value 1.6e-135; identity 89.29%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGS NGE HPA+GD E PF+IFVGYDVREDLA+EVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQ EST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK
        NKIVEGD  TLPKAIHYTRGGPWFEAWK CEF DLW+KEMEEY++A+KKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK

A0A5D3BW54 Protein CDI; e-value 1.6e-135; identity 89.29%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGS NGE HPA+GD E PF+IFVGYDVREDLA+EVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQ EST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK
        NKIVEGD  TLPKAIHYTRGGPWFEAWK CEF DLW+KEMEEY++A+KKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK

A0A6J1EY80 protein CDI-like; e-value 8.0e-135; identity 90.04%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCN E  PA+G+VE PFKIFVGYDVREDLAYEVCR+SILKRSSIPVEIIPIK  DLR NGVYWRERGQLEST+FSFSRFLTPYLA+Y+GWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNK+AVMCVHHDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+L+PETVNTQTGAFLHRFQWLED+EIGSVPFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKS
        NK VEGD  TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EA KKS
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKS

A0A6J1H454 protein CDI-like; e-value 7.2e-136; identity 89.33%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGETH A+  +E PF+IFVGYDV EDLAYEVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQLEST+FSFSRFLTPYLA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNKYA+MCVHHDY PKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK
        NK VEGD +TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EAKKKSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK

A0A6J1K1D2 protein CDI-like; e-value 3.0e-134; identity 88.54%
Query:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF
        MGSCNGETH A+  +E PF+IFVGYDV EDLAYEVCR+SILKRSSIPVEIIPIK  DLRKNGVYWRERGQLEST+FSFSRFLTP LA+YKGWAMFVDCDF
Subjt:  MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDF

Query:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH
        LYLADIKELR+LIDNKYA+MCVHHDY PKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVL+PE VNTQTGAFLHRFQWLED+EIGS+PFVWNFLEGH
Subjt:  LYLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGH

Query:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK
        NK VEGD +TLPKAIHYTRGGPWFEAWK CEFADLWLKEMEEYQ EAK KSE+
Subjt:  NKIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQ-EAKKKSEK

SwissProt top hits (e-value, % identity, alignment)
Q9XIP8 Protein CDI; e-value 3.1e-120; identity 78.63%
Query:  GSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDFL
        G    ET       + PF+IFVGYD REDLAY+VC +SI KRSSIPVEI PI   DLRK G+YWRERGQLEST+FSFSRFLTP+L+DY+GWAMFVDCDFL
Subjt:  GSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDFL

Query:  YLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKEL +LID+KYA+MCV HDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNK LSPE VNTQTGAFLHRFQWLED EIGS+PFVWNFLEGHN
Subjt:  YLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKK
        ++VE D  T PKA+HYTRGGPWF+AWK CEFADLWL EMEEY +  KK
Subjt:  KIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G64980.1 Nucleotide-diphospho-sugar transferases superfamily protein; e-value 2.2e-121; identity 78.63%
Query:  GSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDFL
        G    ET       + PF+IFVGYD REDLAY+VC +SI KRSSIPVEI PI   DLRK G+YWRERGQLEST+FSFSRFLTP+L+DY+GWAMFVDCDFL
Subjt:  GSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDFL

Query:  YLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN
        YLADIKEL +LID+KYA+MCV HDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNK LSPE VNTQTGAFLHRFQWLED EIGS+PFVWNFLEGHN
Subjt:  YLADIKELRNLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHN

Query:  KIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKK
        ++VE D  T PKA+HYTRGGPWF+AWK CEFADLWL EMEEY +  KK
Subjt:  KIVEGDSNTLPKAIHYTRGGPWFEAWKTCEFADLWLKEMEEYQEAKKK
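
The % identity figures reported for each hit can in principle be recomputed from the unlabelled middle row of an alignment. In BLAST pairwise output that row carries a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below assumes the `Query:` prefix has been stripped and that the middle row is column-aligned with the query row; the helper name `midline_stats` is hypothetical.

```python
def midline_stats(query_row: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumption: `midline` is column-aligned with `query_row`; trailing
    spaces lost during extraction are restored by padding.
    """
    width = len(query_row)
    if width == 0:
        raise ValueError("empty alignment row")
    midline = midline.ljust(width)[:width]
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative substitutions
    return {
        "length": width,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / width, 2),
    }


# Toy block, not taken from the page: 3 identities and 1 '+' over 5 columns.
print(midline_stats("ACDEF", "A+D F"))
```

The reported value covers the whole alignment, so the counts from every block of a hit would need to be summed before dividing by the total alignment length.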


Sequences
CDS sequence
ATGGGTTCTTGTAATGGAGAGACTCATCCTGCTATTGGAGATGTGGAGCACCCATTCAAGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCTTATGAGGTCTGTCG
CTATTCCATCTTGAAGCGATCTTCGATCCCTGTTGAGATCATACCAATCAAGCTGGGAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAATTGGAAAGCA
CCGACTTCTCATTTTCCCGGTTCTTAACTCCGTACCTGGCGGATTATAAAGGATGGGCAATGTTTGTTGACTGTGATTTTCTGTATCTAGCTGACATTAAGGAGCTGAGG
AACCTGATTGACAATAAGTATGCAGTCATGTGTGTCCACCATGATTACACACCTAAAGAAGCTACAAAAATGGATGGTGCAGTGCAAACTGTGTACCCAAGGAAGAACTG
GTCTTCAATGGTTCTGTACAACTGTGGGCATCCAAAGAACAAGGTCTTGTCGCCTGAGACTGTCAACACACAGACTGGTGCCTTTCTTCATAGGTTCCAATGGCTTGAGG
ATAATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAGGGCCACAACAAGATTGTGGAGGGTGATTCGAACACTCTCCCAAAAGCAATCCATTACACTCGTGGT
GGGCCGTGGTTTGAAGCTTGGAAGACTTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATCAGGAGGCTAAGAAGAAATCTGAAAAATAG
mRNA sequence
ATGGGTTCTTGTAATGGAGAGACTCATCCTGCTATTGGAGATGTGGAGCACCCATTCAAGATCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCTTATGAGGTCTGTCG
CTATTCCATCTTGAAGCGATCTTCGATCCCTGTTGAGATCATACCAATCAAGCTGGGAGATCTGAGAAAGAATGGTGTCTATTGGCGTGAGAGAGGACAATTGGAAAGCA
CCGACTTCTCATTTTCCCGGTTCTTAACTCCGTACCTGGCGGATTATAAAGGATGGGCAATGTTTGTTGACTGTGATTTTCTGTATCTAGCTGACATTAAGGAGCTGAGG
AACCTGATTGACAATAAGTATGCAGTCATGTGTGTCCACCATGATTACACACCTAAAGAAGCTACAAAAATGGATGGTGCAGTGCAAACTGTGTACCCAAGGAAGAACTG
GTCTTCAATGGTTCTGTACAACTGTGGGCATCCAAAGAACAAGGTCTTGTCGCCTGAGACTGTCAACACACAGACTGGTGCCTTTCTTCATAGGTTCCAATGGCTTGAGG
ATAATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAGGGCCACAACAAGATTGTGGAGGGTGATTCGAACACTCTCCCAAAAGCAATCCATTACACTCGTGGT
GGGCCGTGGTTTGAAGCTTGGAAGACTTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATCAGGAGGCTAAGAAGAAATCTGAAAAATAG
Protein sequence
MGSCNGETHPAIGDVEHPFKIFVGYDVREDLAYEVCRYSILKRSSIPVEIIPIKLGDLRKNGVYWRERGQLESTDFSFSRFLTPYLADYKGWAMFVDCDFLYLADIKELR
NLIDNKYAVMCVHHDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLSPETVNTQTGAFLHRFQWLEDNEIGSVPFVWNFLEGHNKIVEGDSNTLPKAIHYTRG
GPWFEAWKTCEFADLWLKEMEEYQEAKKKSEK
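
The relationship between the CDS and the protein sequence can be checked by translating the nucleotides with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering and is an illustration only: it assumes the standard nuclear code applies and that the sequence contains only A, C, G and T. The name `translate` is chosen for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a codon containing anything
    other than A/C/G/T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGTTCTTGTAATGGAGAG"))     # first 21 bases of the CDS -> MGSCNGE
print(translate("GCTAAGAAGAAATCTGAAAAATAG"))  # final 24 bases of the CDS -> AKKKSEK
```

Both fragments agree with the start (`MGSCNGE...`) and end (`...AKKKSEK`) of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` should reproduce the whole protein if the record is internally consistent.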