; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004546 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004546
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionalpha-glucosidase 2
Genome locationtig00003038:50753..52811
RNA-Seq ExpressionSgr004546
SyntenySgr004546
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR011013 - Galactose mutarotase-like domain superfamily
IPR025887 - Glycoside hydrolase family 31, N-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577761.1 hypothetical protein SDJN03_25335, partial [Cucurbita argyrosperma subsp. sororia]1.2e-11179.32Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTPN  + IC  RISG+ + DS KL        FSRRK  +KKLIS +F CKM +TKEKG T D +SGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALGVLADT+LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

XP_022145307.1 uncharacterized protein LOC111014792 [Momordica charantia]6.2e-11681.51Show/hide
Query:  MEEASVLSGLVGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELIL
        MEEASV SGLV  GGAWGYIPRLNRTPNF  +     +SG P  D KKL L        RRK+I++KLIS  F CKM D KE+G TTD ISGNMIFE IL
Subjt:  MEEASVLSGLVGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELIL

Query:  EDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLYQ
        EDGVFRFDCSS DRAAA+PSFSFL+S+DRDTPISSQKLPTY+PVFEC LGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYG+GTTSLYQ
Subjt:  EDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLYQ

Query:  THPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
         HPWVLAILPNG+ALGVLADT+LRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLK+F SAV
Subjt:  THPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

XP_022923247.1 uncharacterized protein LOC111430993 isoform X1 [Cucurbita moschata]1.2e-11179.32Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTPN  + IC  RISG+ + DS KL        FSRRK  +KKLIS +F CKM +TKEKG T D +SGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALGVLADT+LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

XP_022923248.1 uncharacterized protein LOC111430993 isoform X2 [Cucurbita moschata]1.2e-11179.32Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTPN  + IC  RISG+ + DS KL        FSRRK  +KKLIS +F CKM +TKEKG T D +SGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALGVLADT+LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

XP_038876569.1 alpha-glucosidase 2 [Benincasa hispida]9.3e-11280.22Show/hide
Query:  MMEEASVLSGL-VGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTD-AISGNMIFE
        M+EEASVLSGL V  GG+ GYIP L+RTPN  + I    ISGA + DSKKLD       F RRKR +KKLISE F CKM + KEKG T D  ISGNMIFE
Subjt:  MMEEASVLSGL-VGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTD-AISGNMIFE

Query:  LILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTS
         ILEDGVFRFDCS+ DRAAA+PSFSFLKSKDRDTPISSQKLPTYIPVFEC LGQQIVKLELPAGTS YGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTS
Subjt:  LILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTS

Query:  LYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        LYQ+HPWVLAILPNGEALGVLADT+LRCEIDLR+DS+IQF+APSSYPVITFGPFSSP A LKSF  AV
Subjt:  LYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

TrEMBL top hitse value%identityAlignment
A0A6J1CW78 uncharacterized protein LOC1110147923.0e-11681.51Show/hide
Query:  MEEASVLSGLVGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELIL
        MEEASV SGLV  GGAWGYIPRLNRTPNF  +     +SG P  D KKL L        RRK+I++KLIS  F CKM D KE+G TTD ISGNMIFE IL
Subjt:  MEEASVLSGLVGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELIL

Query:  EDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLYQ
        EDGVFRFDCSS DRAAA+PSFSFL+S+DRDTPISSQKLPTY+PVFEC LGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYG+GTTSLYQ
Subjt:  EDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLYQ

Query:  THPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
         HPWVLAILPNG+ALGVLADT+LRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLK+F SAV
Subjt:  THPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

A0A6J1E5M6 uncharacterized protein LOC111430993 isoform X15.9e-11279.32Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTPN  + IC  RISG+ + DS KL        FSRRK  +KKLIS +F CKM +TKEKG T D +SGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALGVLADT+LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

A0A6J1E909 uncharacterized protein LOC111430993 isoform X25.9e-11279.32Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTPN  + IC  RISG+ + DS KL        FSRRK  +KKLIS +F CKM +TKEKG T D +SGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALGVLADT+LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

A0A6J1HNP7 uncharacterized protein LOC111465248 isoform X17.7e-11279.7Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTP+F   IC  R SGA + DSKKL        FSRRK  +KKLIS +F CKM +TKEKG T D ISGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALG+LADT LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

A0A6J1HQT7 uncharacterized protein LOC111465248 isoform X27.7e-11279.7Show/hide
Query:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI
        MEE+SVLSGLV   G + GYIP LNRTP+F   IC  R SGA + DSKKL        FSRRK  +KKLIS +F CKM +TKEKG T D ISGNMIFE I
Subjt:  MEEASVLSGLVGG-GGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELI

Query:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
        LEDGVFRFDCS+ DRAAA+PSFSF+KSKDRDTPISSQKL TY+PVFEC LGQQIVKL+LPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY
Subjt:  LEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLY

Query:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        Q+HPWVLAILPNGEALG+LADT LRCEIDLR+DSIIQF+APSSYPVITFGPFSSP+AVLKSF  AV
Subjt:  QTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

SwissProt top hitse value%identityAlignment
Q9F234 Alpha-glucosidase 23.2e-0632.26Show/hide
Query:  FYGTGEVSGQLERTGKRIFTWNTDAYG-YGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPT
        FYG GE +G L++ G+ +  WNTD Y  +   T  LYQ+HP+ + +  NG A G+  D T +   D  + +  ++   +    I +  F+ PT
Subjt:  FYGTGEVSGQLERTGKRIFTWNTDAYG-YGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPT

Arabidopsis top hitse value%identityAlignment
AT3G23640.1 heteroglycan glucosidase 11.8e-6866.85Show/hide
Query:  EKGATTDAISGNMIFELILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRI
        +   T +  S +MIFE ILE GVFRFDCS   R AA PS SF  SKDR+ PI S  +P YIP   C   QQ+V  E   GTSFYGTGEVSGQLERTGKR+
Subjt:  EKGATTDAISGNMIFELILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRI

Query:  FTWNTDAYGYGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        FTWNTDA+GYGSGTTSLYQ+HPWVL +LP GE LGVLADTT +CEIDLRK+ II+ ++P+SYP+ITFGPFSSPTAVL+S   A+
Subjt:  FTWNTDAYGYGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV

AT3G23640.2 heteroglycan glucosidase 11.8e-6866.85Show/hide
Query:  EKGATTDAISGNMIFELILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRI
        +   T +  S +MIFE ILE GVFRFDCS   R AA PS SF  SKDR+ PI S  +P YIP   C   QQ+V  E   GTSFYGTGEVSGQLERTGKR+
Subjt:  EKGATTDAISGNMIFELILEDGVFRFDCSSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRI

Query:  FTWNTDAYGYGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV
        FTWNTDA+GYGSGTTSLYQ+HPWVL +LP GE LGVLADTT +CEIDLRK+ II+ ++P+SYP+ITFGPFSSPTAVL+S   A+
Subjt:  FTWNTDAYGYGSGTTSLYQTHPWVLAILPNGEALGVLADTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAAGAAGCATCAGTGTTGAGTGGGCTAGTGGGTGGAGGTGGAGCATGGGGATATATTCCTCGGCTTAATCGGACTCCAAATTTTAAATACGACATTTGTAGAAG
AAGAATTTCGGGTGCTCCTGCTGGGGATTCGAAGAAGCTTGATCTTGTGGCTTCTCTCTTTTGCTTTTCCAGGAGAAAGAGGATCAGCAAGAAGTTGATTTCTGAAAGGT
TTATATGTAAAATGGTGGATACCAAAGAGAAAGGAGCCACAACAGATGCTATCTCAGGGAATATGATTTTTGAGCTTATACTGGAGGATGGGGTTTTTCGATTTGATTGT
TCTTCACTTGATAGAGCTGCAGCTCATCCAAGTTTTTCTTTCTTAAAATCCAAGGACAGAGACACACCAATTTCTAGCCAGAAGCTTCCTACATATATTCCTGTGTTTGA
GTGTCGTCTTGGCCAGCAGATTGTTAAACTGGAGCTTCCTGCTGGTACCTCCTTTTATGGAACTGGGGAAGTTAGTGGACAGCTTGAGCGAACCGGGAAAAGAATTTTCA
CTTGGAATACAGATGCATATGGATATGGTTCCGGAACTACATCCTTGTACCAAACACATCCATGGGTGTTAGCCATTCTTCCAAATGGGGAGGCACTAGGTGTTCTTGCT
GACACAACCCTGCGTTGTGAGATTGATCTGAGAAAAGATTCAATAATACAGTTTGTTGCTCCTTCCTCATATCCTGTCATTACGTTCGGTCCATTTTCCTCACCAACTGC
AGTTTTAAAGTCCTTCTGTAGTGCAGTTG
mRNA sequenceShow/hide mRNA sequence
ATGATGGAAGAAGCATCAGTGTTGAGTGGGCTAGTGGGTGGAGGTGGAGCATGGGGATATATTCCTCGGCTTAATCGGACTCCAAATTTTAAATACGACATTTGTAGAAG
AAGAATTTCGGGTGCTCCTGCTGGGGATTCGAAGAAGCTTGATCTTGTGGCTTCTCTCTTTTGCTTTTCCAGGAGAAAGAGGATCAGCAAGAAGTTGATTTCTGAAAGGT
TTATATGTAAAATGGTGGATACCAAAGAGAAAGGAGCCACAACAGATGCTATCTCAGGGAATATGATTTTTGAGCTTATACTGGAGGATGGGGTTTTTCGATTTGATTGT
TCTTCACTTGATAGAGCTGCAGCTCATCCAAGTTTTTCTTTCTTAAAATCCAAGGACAGAGACACACCAATTTCTAGCCAGAAGCTTCCTACATATATTCCTGTGTTTGA
GTGTCGTCTTGGCCAGCAGATTGTTAAACTGGAGCTTCCTGCTGGTACCTCCTTTTATGGAACTGGGGAAGTTAGTGGACAGCTTGAGCGAACCGGGAAAAGAATTTTCA
CTTGGAATACAGATGCATATGGATATGGTTCCGGAACTACATCCTTGTACCAAACACATCCATGGGTGTTAGCCATTCTTCCAAATGGGGAGGCACTAGGTGTTCTTGCT
GACACAACCCTGCGTTGTGAGATTGATCTGAGAAAAGATTCAATAATACAGTTTGTTGCTCCTTCCTCATATCCTGTCATTACGTTCGGTCCATTTTCCTCACCAACTGC
AGTTTTAAAGTCCTTCTGTAGTGCAGTTG
Protein sequenceShow/hide protein sequence
MMEEASVLSGLVGGGGAWGYIPRLNRTPNFKYDICRRRISGAPAGDSKKLDLVASLFCFSRRKRISKKLISERFICKMVDTKEKGATTDAISGNMIFELILEDGVFRFDC
SSLDRAAAHPSFSFLKSKDRDTPISSQKLPTYIPVFECRLGQQIVKLELPAGTSFYGTGEVSGQLERTGKRIFTWNTDAYGYGSGTTSLYQTHPWVLAILPNGEALGVLA
DTTLRCEIDLRKDSIIQFVAPSSYPVITFGPFSSPTAVLKSFCSAVX