CuGenDBv2

Sgr028167 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028167
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: E3 SUMO-protein ligase MMS21
Genome location: tig00153056:4156806..4161477
RNA-Seq Expression: Sgr028167
Synteny: Sgr028167
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR004181 - Zinc finger, MIZ-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR026846 - E3 SUMO-protein ligase Nse2 (Mms21)
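
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval in the form contig:start..end."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153056:4156806..4161477'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00153056:4156806..4161477")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4,672 bp on contig tig00153056.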


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607898.1 E3 SUMO-protein ligase MMS21, partial [Cucurbita argyrosperma subsp. sororia]; e value 5.3e-116; % identity 87.5
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTS+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG+DLERDNQS+MVKELE SVVELL TYE C+NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LD+EVAKVSENSSSNL NHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

XP_022139610.1 E3 SUMO-protein ligase MMS21 isoform X1 [Momordica charantia]; e value 1.5e-118; % identity 90.32
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTSD+RSS VS RIKSAATMM+SENQSLLAE+RK LIMMKEIGMDLERDNQS MVK+LE +VVELLSTYENC+NFSSAIQSVGNIYEPREELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQP+PGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELA+PVRSMECKH+YEK AVMQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV  DPFLLIEIDELRK +RHSVRIQDFTELD E
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata]; e value 1.1e-116; % identity 87.9
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTS+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG++LERDNQS+MVKELE SVVELL TYE C+NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

XP_022981951.1 E3 SUMO-protein ligase MMS21 [Cucurbita maxima]; e value 1.5e-115; % identity 87.6
Query:  MAST--SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFE
        MAST  S+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG+DLERDNQS+MVKELE SVVELLSTYE C+NFSSAIQSVGNIYEP+EELTDFE
Subjt:  MAST--SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFE

Query:  KLLDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKS
        KLLD+EVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+S
Subjt:  KLLDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKS

Query:  RAQCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        RAQCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  RAQCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo]; e value 1.4e-116; % identity 87.9
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTS+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG+DLERDNQS+MVKELE SVVELL TYE C+NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LD+EVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CLF9 E3 SUMO-protein ligase MMS21; e value 2.0e-113; % identity 85.89
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MAS SDSRS+GV+ RIKSAAT+M+SENQSLLAELRK+LIMMKEIG+DLE++NQ+KMVKELEKS+VELLS YENCNNFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVS NSSSN  NH IIRQFREAIWNVHHAGQP+ GEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAA+MQYL SKKSRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQP+KV+ DPFL IEIDELRK++RHS RIQDFTELD +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

A0A5D3DWX9 E3 SUMO-protein ligase MMS21; e value 2.0e-113; % identity 85.89
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MAS SDSRS+GV+ RIKSAAT+M+SENQSLLAELRK+LIMMKEIG+DLE++NQ+KMVKELEKS+VELLS YENCNNFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVS NSSSN  NH IIRQFREAIWNVHHAGQP+ GEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAA+MQYL SKKSRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQP+KV+ DPFL IEIDELRK++RHS RIQDFTELD +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

A0A6J1CCS6 E3 SUMO-protein ligase MMS21 isoform X1; e value 7.2e-119; % identity 90.32
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTSD+RSS VS RIKSAATMM+SENQSLLAE+RK LIMMKEIGMDLERDNQS MVK+LE +VVELLSTYENC+NFSSAIQSVGNIYEPREELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQP+PGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELA+PVRSMECKH+YEK AVMQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV  DPFLLIEIDELRK +RHSVRIQDFTELD E
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

A0A6J1FRP6 E3 SUMO-protein ligase MMS21; e value 5.1e-117; % identity 87.9
Query:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL
        MASTS+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG++LERDNQS+MVKELE SVVELL TYE C+NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA
        LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRA

Query:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        QCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  QCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

A0A6J1IXZ2 E3 SUMO-protein ligase MMS21; e value 7.4e-116; % identity 87.6
Query:  MAST--SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFE
        MAST  S+S S+GVS+RIKSAAT+M SENQSLLAELRKSLIMMKEIG+DLERDNQS+MVKELE SVVELLSTYE C+NFSSAIQSVGNIYEP+EELTDFE
Subjt:  MAST--SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFE

Query:  KLLDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKS
        KLLD+EVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQP+PGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK A+MQYLKSK+S
Subjt:  KLLDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKS

Query:  RAQCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
        RAQCPVAACPK+LQPEKV+SDPFL IEIDELRK ++HSVRIQDFTE+D +
Subjt:  RAQCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE

SwissProt top hits (e value, % identity, alignment)
Q8GYH7 E3 SUMO-protein ligase MMS21; e value 1.3e-69; % identity 55.56
Query:  SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKLLDDE
        S S S GV+ RI++A+ ++ S+N S LA++RK++ MMK I + LE++NQ+  VK+LE SV ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDDE
Subjt:  SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKLLDDE

Query:  VAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRAQCPV
          K+    SS  QN  ++RQFREA+WNVHHAG+P+PG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+PVRSM+C+H+YEK+ ++ Y+ +  + A CPV
Subjt:  VAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRAQCPV

Query:  AACPKILQPEKVLSDPFLLIEIDELRKVTRHSVR---IQDFTE
        A C   LQ  KV+ D  L  EI+E+R + + S R   I+DFTE
Subjt:  AACPKILQPEKVLSDPFLLIEIDELRKVTRHSVR---IQDFTE

Arabidopsis top hits (e value, % identity, alignment)
AT3G15150.1 RING/U-box superfamily protein; e value 9.4e-71; % identity 55.56
Query:  SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKLLDDE
        S S S GV+ RI++A+ ++ S+N S LA++RK++ MMK I + LE++NQ+  VK+LE SV ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDDE
Subjt:  SDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSVGNIYEPREELTDFEKLLDDE

Query:  VAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRAQCPV
          K+    SS  QN  ++RQFREA+WNVHHAG+P+PG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+PVRSM+C+H+YEK+ ++ Y+ +  + A CPV
Subjt:  VAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLKSKKSRAQCPV

Query:  AACPKILQPEKVLSDPFLLIEIDELRKVTRHSVR---IQDFTE
        A C   LQ  KV+ D  L  EI+E+R + + S R   I+DFTE
Subjt:  AACPKILQPEKVLSDPFLLIEIDELRKVTRHSVR---IQDFTE


Sequences
CDS sequence
ATGGCGATGAGTGCGAGTCCACGGCCGTCGGCGCTCTGGCTGCTTGAGTCGGATTCCAGGCAGTCGGAACAAACCACCGGCTGCTCGGTCTTGCCGCACAGATTTCTGAT
CAACGCGTCGTCCGGTTTCGCCGGGAAGAAGAGCGCGAAGAGGAAGAGTACTGGAAACATGGAATTGAAGAAACTCATCTGTGCTTTTTTCTTCGGGATCAGCGTCTCCG
AGAACGATCGATCTTCCAGGTTTACCAAACTAAAACATGTTTGGGCCTGGGCGTCCGGCGTGCCACGGGATTCTCCAAAATTCGAACCATTTCCACCAAACGGATATTCT
CTTTTCGCGCTTAAACCAACCATCGCCGGAGGAGTTCCGGCGATTTCTTCTTCCGAGACTACCGATCCTAGTACCAGAATGGCATCGACTTCTGATTCGCGTTCAAGTGG
CGTTTCCGATAGAATCAAATCCGCTGCCACTATGATGTACTCCGAGAATCAATCCCTGCTTGCCGAATTACGGAAGTCGCTGATCATGATGAAGGAAATTGGAATGGATT
TAGAGAGGGACAATCAATCGAAGATGGTCAAGGAGCTTGAAAAATCTGTTGTTGAGCTGTTGAGCACTTATGAAAACTGTAACAATTTTTCATCTGCAATTCAGTCGGTT
GGAAATATATATGAACCAAGAGAAGAGTTAACAGATTTTGAGAAACTACTTGACGATGAAGTTGCAAAAGTCAGTGAAAATTCATCTTCAAATTTGCAGAACCATTCAAT
AATTCGGCAGTTTAGAGAAGCTATTTGGAATGTTCATCATGCAGGACAACCGTTGCCAGGTGAGGAGCAGGAGGACATTGTGATGACCAGTACGCAGTGTAATCTATTGA
ATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTGGCAGAACCCGTTCGCAGTATGGAATGCAAGCACATTTACGAAAAGGCGGCCGTGATGCAGTACCTAAAA
TCGAAGAAATCTCGAGCTCAATGCCCTGTAGCAGCCTGTCCTAAGATTTTGCAGCCCGAAAAGGTTCTCTCCGATCCGTTCTTACTGATTGAAATTGATGAACTGAGAAA
AGTGACTAGGCACTCTGTGAGGATACAAGATTTCACAGAGCTTGATGGAGAGTAA
mRNA sequence
ATGGCGATGAGTGCGAGTCCACGGCCGTCGGCGCTCTGGCTGCTTGAGTCGGATTCCAGGCAGTCGGAACAAACCACCGGCTGCTCGGTCTTGCCGCACAGATTTCTGAT
CAACGCGTCGTCCGGTTTCGCCGGGAAGAAGAGCGCGAAGAGGAAGAGTACTGGAAACATGGAATTGAAGAAACTCATCTGTGCTTTTTTCTTCGGGATCAGCGTCTCCG
AGAACGATCGATCTTCCAGGTTTACCAAACTAAAACATGTTTGGGCCTGGGCGTCCGGCGTGCCACGGGATTCTCCAAAATTCGAACCATTTCCACCAAACGGATATTCT
CTTTTCGCGCTTAAACCAACCATCGCCGGAGGAGTTCCGGCGATTTCTTCTTCCGAGACTACCGATCCTAGTACCAGAATGGCATCGACTTCTGATTCGCGTTCAAGTGG
CGTTTCCGATAGAATCAAATCCGCTGCCACTATGATGTACTCCGAGAATCAATCCCTGCTTGCCGAATTACGGAAGTCGCTGATCATGATGAAGGAAATTGGAATGGATT
TAGAGAGGGACAATCAATCGAAGATGGTCAAGGAGCTTGAAAAATCTGTTGTTGAGCTGTTGAGCACTTATGAAAACTGTAACAATTTTTCATCTGCAATTCAGTCGGTT
GGAAATATATATGAACCAAGAGAAGAGTTAACAGATTTTGAGAAACTACTTGACGATGAAGTTGCAAAAGTCAGTGAAAATTCATCTTCAAATTTGCAGAACCATTCAAT
AATTCGGCAGTTTAGAGAAGCTATTTGGAATGTTCATCATGCAGGACAACCGTTGCCAGGTGAGGAGCAGGAGGACATTGTGATGACCAGTACGCAGTGTAATCTATTGA
ATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTGGCAGAACCCGTTCGCAGTATGGAATGCAAGCACATTTACGAAAAGGCGGCCGTGATGCAGTACCTAAAA
TCGAAGAAATCTCGAGCTCAATGCCCTGTAGCAGCCTGTCCTAAGATTTTGCAGCCCGAAAAGGTTCTCTCCGATCCGTTCTTACTGATTGAAATTGATGAACTGAGAAA
AGTGACTAGGCACTCTGTGAGGATACAAGATTTCACAGAGCTTGATGGAGAGTAA
Protein sequence
MAMSASPRPSALWLLESDSRQSEQTTGCSVLPHRFLINASSGFAGKKSAKRKSTGNMELKKLICAFFFGISVSENDRSSRFTKLKHVWAWASGVPRDSPKFEPFPPNGYS
LFALKPTIAGGVPAISSSETTDPSTRMASTSDSRSSGVSDRIKSAATMMYSENQSLLAELRKSLIMMKEIGMDLERDNQSKMVKELEKSVVELLSTYENCNNFSSAIQSV
GNIYEPREELTDFEKLLDDEVAKVSENSSSNLQNHSIIRQFREAIWNVHHAGQPLPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHIYEKAAVMQYLK
SKKSRAQCPVAACPKILQPEKVLSDPFLLIEIDELRKVTRHSVRIQDFTELDGE
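
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated in the record. `translate` is a hypothetical helper written for this example, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line-wrapped text is accepted) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the open reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last five codons of the CDS shown above.
print(translate("ATGGCGATGAGTGCGAGTCCACGGCCG"))
print(translate("CTTGATGGAGAGTAA"))
```

The first call yields `MAMSASPRP` and the second `LDGE`, matching the start and end of the listed protein sequence. Passing the full CDS text to `translate` should reproduce the full protein in the same way.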