CuGenDBv2

Sgr028506 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028506
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein YeeZ isoform X2
Genome location: tig00153204:1723554..1726917
RNA-Seq Expression: Sgr028506
Synteny: Sgr028506
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR036291 - NAD(P)-binding domain superfamily
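
The genome location above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown here. The names `Location` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a single contig (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153204:1723554..1726917")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3364 bp on the contig, which is longer than the 1083-nt CDS listed in the Sequences section.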


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152681.1 uncharacterized protein LOC101213235 isoform X2 [Cucumis sativus] (e value: 7.9e-106; % identity: 74.91)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS  ESKPSPL+ QNRMFILG+GFVGQFFAQELK  GWAVSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED P N  SQSGKLRIEAEERW+NLG+DLGLS QVFRLGGIYGPGRSAIDTIIKQ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK
        ARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK

XP_008444750.1 PREDICTED: protein YeeZ isoform X2 [Cucumis melo] (e value: 4.3e-104; % identity: 67.43)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS   SKPSPL+ QNRMFILG+GFVGQFFAQELK  GW VSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED PAN  S+SGKLRIEAEERW+NLG+DLG S QVFRLGGIYGPGRSAIDTIIKQ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV
        ARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K       +  +L P+    V  G V     +C 
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV

Query:  SRME
        +RM+
Subjt:  SRME

XP_022144383.1 uncharacterized protein LOC111014076 isoform X1 [Momordica charantia] (e value: 3.2e-107; % identity: 76.28)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDV----DMLQ
        LKSS TES PS  + QNRMFILGIGFVGQFFAQELKNQGWAVSGTCRN GKKMELEGRGFDVY FDANDPEQSTLRAM++HTHLLVSIPPDV     +LQ
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDV----DMLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HE+LLR TLQ GDL WLCYLSSTSVYGDY GAWVDED PAN SSQSGK RIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIK+ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRR
         RRRYTS        + L    ++  P +      +RVYNIVDDDPAPREEVFSYARDLVE+  P K    R++
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRR

XP_031736669.1 uncharacterized protein LOC101213235 isoform X1 [Cucumis sativus] (e value: 1.6e-103; % identity: 73.43)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS  ESKPSPL+ QNRMFILG+GFVGQFFAQELK  GWAVSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPG----RSAIDTIIKQSSLSEG
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED P N  SQSGKLRIEAEERW+NLG+DLGLS QVFRLGGIYGPG     SAIDTIIKQ SLSE 
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPG----RSAIDTIIKQSSLSEG

Query:  QQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK
        QQRRARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K
Subjt:  QQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK

XP_038886458.1 protein YeeZ isoform X1 [Benincasa hispida] (e value: 1.7e-108; % identity: 72.63)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS +ESKPSPL+ QNRMFILG+GFVGQFFAQELK  GWAVSGTCRN G+KM+LEGRGFDVY FDANDPEQ TL+AMKYHTHLLVSIPPDVD    +LQ
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWV+ED P N SSQSGKLRIEAEERWLNLG+DLGLS+QVFRLGGIYGPGRSAIDTIIKQ SLSE QQ R
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVE
        ARR+YTS    +   + L    +   P +      +RVYNIVDDDPAPREEVFSYARDLVER  P K  +  ++    ++  G E
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPC5 Uncharacterized protein (e value: 3.8e-106; % identity: 74.91)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS  ESKPSPL+ QNRMFILG+GFVGQFFAQELK  GWAVSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED P N  SQSGKLRIEAEERW+NLG+DLGLS QVFRLGGIYGPGRSAIDTIIKQ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK
        ARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVK

A0A1S3BBU1 protein YeeZ isoform X1 (e value: 2.0e-102; % identity: 66.13)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS   SKPSPL+ QNRMFILG+GFVGQFFAQELK  GW VSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGR------SAIDTIIKQSSLS
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED PAN  S+SGKLRIEAEERW+NLG+DLG S QVFRLGGIYGPGR      SAIDTIIKQ SLS
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGR------SAIDTIIKQSSLS

Query:  EGQQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV--
        E QQRRARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K       +  +L P+    V  G V  
Subjt:  EGQQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV--

Query:  ---MCVSRME
           +C +RM+
Subjt:  ---MCVSRME

A0A1S3BBX8 protein YeeZ isoform X2 (e value: 2.1e-104; % identity: 67.43)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS   SKPSPL+ QNRMFILG+GFVGQFFAQELK  GW VSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED PAN  S+SGKLRIEAEERW+NLG+DLG S QVFRLGGIYGPGRSAIDTIIKQ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV
        ARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K       +  +L P+    V  G V     +C 
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV

Query:  SRME
        +RM+
Subjt:  SRME

A0A5A7VC56 Protein YeeZ isoform X2 (e value: 2.1e-104; % identity: 67.43)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ
        + SS   SKPSPL+ QNRMFILG+GFVGQFFAQELK  GW VSGTCRNLG+KM+LEGRGFDVY FDANDP Q TL+AMKYHTHLL+SIPPDVD    +L 
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD----MLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HEKLLRTTLQGGDL+WLCYLSSTSVYGDY GAWVDED PAN  S+SGKLRIEAEERW+NLG+DLG S QVFRLGGIYGPGRSAIDTIIKQ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV
        ARR++TS    +   + L    +   P +      +R YNIVDDDPAPREEVFSYARDLVE+  P K       +  +L P+    V  G V     +C 
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCV-----MCV

Query:  SRME
        +RM+
Subjt:  SRME

A0A6J1CT45 uncharacterized protein LOC111014076 isoform X1 (e value: 1.5e-107; % identity: 76.28)
Query:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDV----DMLQ
        LKSS TES PS  + QNRMFILGIGFVGQFFAQELKNQGWAVSGTCRN GKKMELEGRGFDVY FDANDPEQSTLRAM++HTHLLVSIPPDV     +LQ
Subjt:  LKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDV----DMLQ

Query:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR
        HE+LLR TLQ GDL WLCYLSSTSVYGDY GAWVDED PAN SSQSGK RIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIK+ SLSE QQRR
Subjt:  HEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRR

Query:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRR
         RRRYTS        + L    ++  P +      +RVYNIVDDDPAPREEVFSYARDLVE+  P K    R++
Subjt:  ARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G19690.1 NAD(P)-binding Rossmann-fold superfamily protein (e value: 1.4e-71; % identity: 51.84)
Query:  IPFLPPLKSSITESKPSP-LQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD
        I F  PL ++      +P  +S N+MFILG+GFVG FFAQ+LK   W VSGTCR+  KK E E RG +++ F A+ PE S L ++K +THLL+SIPP  D
Subjt:  IPFLPPLKSSITESKPSP-LQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPPDVD

Query:  ----MLQHEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSL
            ML++ +L+R  L  G+L+WLCYLSSTSVYGD  GAWV+E+   N  +QS K+R+ AE+ WL+LG DLG+S Q+ RLGGIYGPGRSAIDT++KQ  L
Subjt:  ----MLQHEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSL

Query:  SEGQQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGP
        SEGQ+RRA R++TS        ++L    +    G         +YNIVDDDPA REEVF YA +L+E+  P
Subjt:  SEGQQRRARRRYTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGP
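
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, the identical and positive columns of one block could be tallied as follows. The function name `midline_stats` and the toy strings are hypothetical, and the result is not expected to reproduce the % identity reported for a hit, which may be computed over a different span than the blocks shown.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns in one BLAST-style alignment block.

    `query` is the aligned query row (gaps as '-') and `midline` is the row
    printed between query and subject. Trailing spaces of the midline are
    often lost in copied text, so it is padded to the query length.
    """
    columns = len(query)
    padded = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in padded)  # letters mark identities
    positives = identical + padded.count("+")       # '+' marks similar residues
    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / columns if columns else 0.0,
    }


# Toy strings for illustration only, not taken from the record.
stats = midline_stats("MKV-LA", "MK  +A")
print(stats)
```

Because the Subjt rows on this page repeat the Query rows verbatim, the midline is the only row that carries the per-column comparison.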


Sequences
CDS sequence
ATGCTGAAATTCACTCCAGTAGTATCGCAGATCGTCGTCGCAATTCCATTCCTGCCGCCGTTGAAGAGTTCAATTACAGAATCGAAACCGTCGCCACTGCAAAGCCAGAA
TCGCATGTTTATTTTAGGCATAGGTTTCGTCGGACAGTTCTTTGCGCAAGAGCTAAAGAACCAAGGATGGGCTGTTTCTGGGACTTGCAGAAACCTTGGGAAGAAGATGG
AACTGGAGGGAAGGGGTTTTGATGTTTATTTTTTTGATGCAAATGATCCAGAACAGAGCACTCTAAGAGCAATGAAGTATCATACTCACCTTCTTGTTTCCATTCCACCA
GATGTGGATATGCTCCAGCATGAAAAACTTTTAAGGACTACTTTACAGGGTGGAGATCTTCAATGGCTTTGCTATTTGTCATCAACAAGTGTTTATGGAGATTATGCTGG
TGCTTGGGTAGATGAAGATACCCCGGCGAACCTATCAAGTCAGTCGGGCAAGTTGAGGATTGAAGCTGAGGAGAGATGGTTAAATTTGGGTAGTGATCTTGGCCTCTCAG
CTCAAGTATTTCGGCTTGGAGGTATCTATGGTCCTGGTAGAAGTGCTATTGATACAATAATCAAGCAGAGTTCCTTATCCGAGGGTCAACAACGTAGAGCACGCAGGCGA
TACACATCAGAGTTCATGTTCAGGACATCTGCCAAGCTCTTAATGCCAGTATTCAAAAGCCTTCTCCCAGGTAACTTAACATTATATCAGCGACAGAGAGTATACAACAT
AGTCGATGACGATCCAGCTCCAAGGGAAGAAGTATTCTCGTATGCTCGGGACTTGGTCGAGAGAAGTGGCCCGGTAAAGTCGAGCAATTGCCGAAGAAGGTGGAGGCAAG
TGTTGTTACCAATGGGAGTGGAAGGGGTGATAAAAGGGTGTGTAATGTGCGTAAGCAGAATGGAAGGAATGACTGTCTTTGGAAAAAATAGCTATATAATAAAATATCCC
TTGAAACACCACGTCAGGATGGCCGAGTGGTCTAAGGCGCCAGACTCAAGTTCTGGTCCTGTGAGAGGGCGTGGGTTCAAATCCCACTTCTGA
mRNA sequence
ATGCTGAAATTCACTCCAGTAGTATCGCAGATCGTCGTCGCAATTCCATTCCTGCCGCCGTTGAAGAGTTCAATTACAGAATCGAAACCGTCGCCACTGCAAAGCCAGAA
TCGCATGTTTATTTTAGGCATAGGTTTCGTCGGACAGTTCTTTGCGCAAGAGCTAAAGAACCAAGGATGGGCTGTTTCTGGGACTTGCAGAAACCTTGGGAAGAAGATGG
AACTGGAGGGAAGGGGTTTTGATGTTTATTTTTTTGATGCAAATGATCCAGAACAGAGCACTCTAAGAGCAATGAAGTATCATACTCACCTTCTTGTTTCCATTCCACCA
GATGTGGATATGCTCCAGCATGAAAAACTTTTAAGGACTACTTTACAGGGTGGAGATCTTCAATGGCTTTGCTATTTGTCATCAACAAGTGTTTATGGAGATTATGCTGG
TGCTTGGGTAGATGAAGATACCCCGGCGAACCTATCAAGTCAGTCGGGCAAGTTGAGGATTGAAGCTGAGGAGAGATGGTTAAATTTGGGTAGTGATCTTGGCCTCTCAG
CTCAAGTATTTCGGCTTGGAGGTATCTATGGTCCTGGTAGAAGTGCTATTGATACAATAATCAAGCAGAGTTCCTTATCCGAGGGTCAACAACGTAGAGCACGCAGGCGA
TACACATCAGAGTTCATGTTCAGGACATCTGCCAAGCTCTTAATGCCAGTATTCAAAAGCCTTCTCCCAGGTAACTTAACATTATATCAGCGACAGAGAGTATACAACAT
AGTCGATGACGATCCAGCTCCAAGGGAAGAAGTATTCTCGTATGCTCGGGACTTGGTCGAGAGAAGTGGCCCGGTAAAGTCGAGCAATTGCCGAAGAAGGTGGAGGCAAG
TGTTGTTACCAATGGGAGTGGAAGGGGTGATAAAAGGGTGTGTAATGTGCGTAAGCAGAATGGAAGGAATGACTGTCTTTGGAAAAAATAGCTATATAATAAAATATCCC
TTGAAACACCACGTCAGGATGGCCGAGTGGTCTAAGGCGCCAGACTCAAGTTCTGGTCCTGTGAGAGGGCGTGGGTTCAAATCCCACTTCTGA
Protein sequence
MLKFTPVVSQIVVAIPFLPPLKSSITESKPSPLQSQNRMFILGIGFVGQFFAQELKNQGWAVSGTCRNLGKKMELEGRGFDVYFFDANDPEQSTLRAMKYHTHLLVSIPP
DVDMLQHEKLLRTTLQGGDLQWLCYLSSTSVYGDYAGAWVDEDTPANLSSQSGKLRIEAEERWLNLGSDLGLSAQVFRLGGIYGPGRSAIDTIIKQSSLSEGQQRRARRR
YTSEFMFRTSAKLLMPVFKSLLPGNLTLYQRQRVYNIVDDDPAPREEVFSYARDLVERSGPVKSSNCRRRWRQVLLPMGVEGVIKGCVMCVSRMEGMTVFGKNSYIIKYP
LKHHVRMAEWSKAPDSSSGPVRGRGFKSHF
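
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code; the record does not say which translation table the database used, and the `translate` helper is hypothetical. It shows a plain codon-by-codon translation that stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nt of the CDS listed above.
print(translate("ATGCTGAAATTCACTCCAGTAGTATCGCAGATCGTCGTCGCA"))
```

The first 42 nucleotides of the CDS translate to MLKFTPVVSQIVVA, which matches the start of the protein sequence. Passing the full CDS with its line breaks joined allows the same comparison over the whole sequence.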