; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G7002 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G7002
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionSASA domain-containing protein
Genome locationctg1522:1274103..1278716
RNA-Seq ExpressionCucsat.G7002
SyntenyCucsat.G7002
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646868.1 hypothetical protein Csa_020851 [Cucumis sativus]8.65e-15090.13Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLLRLSIIL VMLY P LSGA SPKNIFI AGQSNMAGRGGVENN +GNL WDGLVPPECQ +PSILRLNP  QWEIAREPLHLGIDI RTPGIGPG+ 
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAHELL K GPNAGAVGLVPCARGGTLI QW+KNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIR+DIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        LPIIVVKIALYDF  QHDTHNLPAVREAQ+AVS
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

KAE8652070.1 hypothetical protein Csa_018719 [Cucumis sativus]7.97e-16296.14Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLLRLSIILCVMLYGPSLSGA SPKNIFILAGQSNMAGRGGVENNAQG LQWDGLVPPECQPQPSILRLNP  QWEIAREPLHLGIDIKRTPGIGPGIA
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAHELL KAGPNAGAVGLVPCARGGTLIE+W+KNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPR+
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        LPIIVVKIALYDFFR HDTHNLPAVREAQEAVS
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

KAE8652071.1 hypothetical protein Csa_018776 [Cucumis sativus]4.56e-160100Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

XP_031736508.1 probable carbohydrate esterase At4g34215 [Cucumis sativus]3.70e-16799.57Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDI RTPGIGPGIA
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

XP_031737051.1 probable carbohydrate esterase At4g34215 [Cucumis sativus]5.22e-13895.94Show/hide
Query:  MAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPS
        MAGRGGVENNAQG LQWDGLVPPECQPQPSILRLNP  QWEIAREPLHLGIDIKRTPGIGPGIAFAHELL KAGPNAGAVGLVPCARGGTLIE+W+KNPS
Subjt:  MAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPS

Query:  NPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        NPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPR+LPIIVVKIALYDFFR HDTHNLPAVREAQEAVS
Subjt:  NPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

TrEMBL top hitse value%identityAlignment
A0A0A0LNC5 SASA domain-containing protein1.26e-16799.57Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        LPIIVVKIALYDFFRQHDTHNLPAVREA+EAVS
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

A0A5A7V246 Putative carbohydrate esterase2.12e-13384.62Show/hide
Query:  MLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPN
        MLYG  LSGA  PKNIFILAGQSNMAGRGGVE   +  L WDGLVPPEC P+PSILRLNP  QWE+AREPLHLGIDI RTPGIGPGI FA ELL KAGP 
Subjt:  MLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPN

Query:  AGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYD
        AGAVGLVPCARGGTLI QW+KNPSNPSATFYQNFIERI+ SDKDGGVVRALFWFQGESDAAMNDTA+RYKDNL KFFTDIR+DIKPRFLPI+VVKIALYD
Subjt:  AGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYD

Query:  FFRQHDTHNLPAVREAQEAVS
        FF +HDTHNLPAVR AQ+AV+
Subjt:  FFRQHDTHNLPAVREAQEAVS

A0A5A7VGP3 Putative carbohydrate esterase2.91e-13379.92Show/hide
Query:  MVLLRLSIILCVMLYG------PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPG
        M LL+LSI+LC ML+        SLSGAASPKNIFILAGQSNMAGRGGVE +  GNL WD LVPPEC+PQPSILRLNP  +WE AREPLH+GIDI RT G
Subjt:  MVLLRLSIILCVMLYG------PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPG

Query:  IGPGIAFAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRD
        IGPG+ FAH LL KAGPNAG VGLVPCARGGTLIEQWIKNPSNP+ATFY+NFIERIKASDKDGGVVRALFWFQGESDAAM+DTA RYKDNLK+FFTDIR+
Subjt:  IGPGIAFAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRD

Query:  DIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS
        DIKPRFLPII+ KIA+YD F +HDTH+L AVR AQE VS
Subjt:  DIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVS

A0A6J1GJF6 probable carbohydrate esterase At4g342155.52e-13280.17Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        MVLL+LSI+LC++L+ PSLSGA SP NIFILAGQSNMAGRGGVE    G L WDG VP ECQ  PSILRLNP  QWEIA EPLHLGIDI  TPGIGPGI 
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAH+   KAG  AG VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIK S+K+GGVVRALFW+QGESDAAMNDTA RYKDNLKKF TDIR+DIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAV
        LP+I+VKI+LYDFF +HDTHNLPAVR A++AV
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAV

A0A6J1KIR8 probable carbohydrate esterase At4g342155.52e-13278.45Show/hide
Query:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA
        M+LL+LS +LC++L+ PSLS A SP NIFILAGQSNMAGRGGVENN +G L+WDG VP ECQ  PSILRLNP  QWEIA+EPLHLGIDI +TPGIGPGI 
Subjt:  MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIA

Query:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF
        FAH+   KAG  AG VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIK S+K+GGVVRALFW+QGESDAAM+DTA RYKDNLKKF TDIR+DIKPRF
Subjt:  FAHELLVKAGPNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRF

Query:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAV
        LP+I+VKI++YDFF +HDTH+LPAVR A++AV
Subjt:  LPIIVVKIALYDFFRQHDTHNLPAVREAQEAV

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342153.9e-4042.78Show/hide
Query:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA
        P +     P  IFIL+GQSNMAGRGGV  +   N   WD ++PPEC P  SILRL+  L+WE A EPLH+ ID  +  G+GPG+AFA+ +  +   ++  
Subjt:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA

Query:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA
        +GLVPCA GGT I++W +      +  Y+  ++R + S K GG ++A+ W+QGESD      A  Y +N+ +   ++R D+    LPII V IA
Subjt:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)6.5e-4348.92Show/hide
Query:  NIFILAGQSNMAGRGGVENNAQGNLQ-WDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGG
        +IFILAGQSNMAGRGGV N+   N   WDG++PPEC+  PSILRL   L+W+ A+EPLH+ IDI +T G+GPG+ FA+ ++ +     G VGLVPC+ GG
Subjt:  NIFILAGQSNMAGRGGVENNAQGNLQ-WDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGG

Query:  TLIEQWIKNPSNPSATFYQNFIERIKA--SDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA
        T + QW K         Y+  ++R KA  +   GG  RA+ W+QGESD      A  YK  L KFF+D+R+D++   LPII V +A
Subjt:  TLIEQWIKNPSNPSATFYQNFIERIKA--SDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA

AT4G34215.1 Domain of unknown function (DUF303)2.7e-4142.78Show/hide
Query:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA
        P +     P  IFIL+GQSNMAGRGGV  +   N   WD ++PPEC P  SILRL+  L+WE A EPLH+ ID  +  G+GPG+AFA+ +  +   ++  
Subjt:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA

Query:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA
        +GLVPCA GGT I++W +      +  Y+  ++R + S K GG ++A+ W+QGESD      A  Y +N+ +   ++R D+    LPII V IA
Subjt:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA

AT4G34215.2 Domain of unknown function (DUF303)2.7e-4142.78Show/hide
Query:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA
        P +     P  IFIL+GQSNMAGRGGV  +   N   WD ++PPEC P  SILRL+  L+WE A EPLH+ ID  +  G+GPG+AFA+ +  +   ++  
Subjt:  PSLSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGA

Query:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA
        +GLVPCA GGT I++W +      +  Y+  ++R + S K GG ++A+ W+QGESD      A  Y +N+ +   ++R D+    LPII V IA
Subjt:  VGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGTTGAGATTGTCAATCATATTATGTGTGATGTTATATGGTCCTTCTCTTTCAGGAGCTGCTTCTCCTAAGAACATATTCATCCTCGCAGGTCAGAGTAACAT
GGCTGGTCGAGGTGGAGTTGAGAACAATGCTCAAGGAAATCTCCAGTGGGATGGGTTAGTCCCACCAGAATGTCAACCTCAACCATCCATCCTACGATTGAACCCTGGTC
TCCAATGGGAGATAGCACGAGAGCCACTCCATTTGGGAATTGACATCAAAAGGACCCCTGGAATTGGTCCCGGTATCGCATTTGCTCATGAATTGCTAGTCAAAGCTGGA
CCAAATGCTGGCGCTGTGGGTTTAGTTCCATGTGCTAGAGGTGGCACTTTAATTGAACAATGGATTAAAAATCCTAGCAATCCTAGTGCAACCTTTTACCAAAACTTCAT
TGAACGAATCAAAGCATCGGATAAAGATGGTGGGGTTGTGCGTGCTCTTTTCTGGTTCCAAGGAGAAAGTGATGCAGCTATGAATGACACCGCCATTAGATACAAAGACA
ACCTAAAGAAATTCTTCACTGACATTCGTGATGACATAAAACCTAGATTTTTGCCCATCATTGTTGTTAAAATAGCTCTCTACGACTTTTTTAGGCAGCACGATACTCAT
AACCTCCCAGCAGTGAGGGAAGCACAAGAAGCAGTCAGC
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGTTGAGATTGTCAATCATATTATGTGTGATGTTATATGGTCCTTCTCTTTCAGGAGCTGCTTCTCCTAAGAACATATTCATCCTCGCAGGTCAGAGTAACAT
GGCTGGTCGAGGTGGAGTTGAGAACAATGCTCAAGGAAATCTCCAGTGGGATGGGTTAGTCCCACCAGAATGTCAACCTCAACCATCCATCCTACGATTGAACCCTGGTC
TCCAATGGGAGATAGCACGAGAGCCACTCCATTTGGGAATTGACATCAAAAGGACCCCTGGAATTGGTCCCGGTATCGCATTTGCTCATGAATTGCTAGTCAAAGCTGGA
CCAAATGCTGGCGCTGTGGGTTTAGTTCCATGTGCTAGAGGTGGCACTTTAATTGAACAATGGATTAAAAATCCTAGCAATCCTAGTGCAACCTTTTACCAAAACTTCAT
TGAACGAATCAAAGCATCGGATAAAGATGGTGGGGTTGTGCGTGCTCTTTTCTGGTTCCAAGGAGAAAGTGATGCAGCTATGAATGACACCGCCATTAGATACAAAGACA
ACCTAAAGAAATTCTTCACTGACATTCGTGATGACATAAAACCTAGATTTTTGCCCATCATTGTTGTTAAAATAGCTCTCTACGACTTTTTTAGGCAGCACGATACTCAT
AACCTCCCAGCAGTGAGGGAAGCACAAGAAGCAGTCAGC
Protein sequenceShow/hide protein sequence
MVLLRLSIILCVMLYGPSLSGAASPKNIFILAGQSNMAGRGGVENNAQGNLQWDGLVPPECQPQPSILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAG
PNAGAVGLVPCARGGTLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTH
NLPAVREAQEAVS