CuGenDBv2

Lag0037862 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037862
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: E3 SUMO-protein ligase MMS21
Genome location: chr2:9952912..9955602
RNA-Seq Expression: Lag0037862
Synteny: Lag0037862
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR004181 - Zinc finger, MIZ-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR026846 - E3 SUMO-protein ligase Nse2 (Mms21)
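The genome location above is given as a `chrom:start..end` string. As a minimal sketch, the snippet below splits that string and computes the length of the locus. The function names are hypothetical, and the record does not state its coordinate convention, so 1-based inclusive coordinates are assumed here.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr2:9952912..9955602")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 2691 bp, which is longer than the CDS listed under Sequences and would be consistent with the gene containing introns.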


Homology
GenBank top hits | e value | % identity
KAG6607898.1 E3 SUMO-protein ligase MMS21, partial [Cucurbita argyrosperma subsp. sororia] | 5.1e-120 | 90.73
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTSNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGVDLE+DNQSRMVKELENSVVELLGTYE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LD+EVAK+SENSSSNL NHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

XP_022139610.1 E3 SUMO-protein ligase MMS21 isoform X1 [Momordica charantia] | 5.8e-116 | 87.5
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTS++RSS VS RIKSAAT+MHSENQSLLAE+RK LIMMKEIG+DLE+DNQS MVK+LEN+VVELL TYE+C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELA+PVRSMECKHVYEK  +MQYL+SK SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKV  DPFL IEIDELRK SRHSVRIQDFTELDA+
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata] | 1.0e-120 | 91.13
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTSNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGV+LE+DNQSRMVKELENSVVELLGTYE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

XP_022981951.1 E3 SUMO-protein ligase MMS21 [Cucurbita maxima] | 2.1e-118 | 90
Query:  MAST--SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFE
        MAST  SNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGVDLE+DNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN YEP+EELTDFE
Subjt:  MAST--SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFE

Query:  KLLDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKS
        KLLD+EVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+S
Subjt:  KLLDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKS

Query:  RAQCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        RAQCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  RAQCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo] | 1.3e-120 | 91.13
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTSNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGVDLE+DNQSRMVKELENSVVELLGTYE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LD+EVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

TrEMBL top hits | e value | % identity
A0A1S3CLF9 E3 SUMO-protein ligase MMS21 | 2.4e-115 | 86.69
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+SRS+GV+ RIKSAATIMHSENQSLLAELRK+LIMMKEIGVDLEK+NQ++MVKELE S+VELL  YE+CNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+S NSSSN  NH IIR FREAIWNVHHAGQPM GEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEKA IMQYL SKKSRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQP+KV+ DPFL+IEIDELRK+SRHS RIQDFTELDAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

A0A5D3DWX9 E3 SUMO-protein ligase MMS21 | 2.4e-115 | 86.69
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+SRS+GV+ RIKSAATIMHSENQSLLAELRK+LIMMKEIGVDLEK+NQ++MVKELE S+VELL  YE+CNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+S NSSSN  NH IIR FREAIWNVHHAGQPM GEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEKA IMQYL SKKSRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQP+KV+ DPFL+IEIDELRK+SRHS RIQDFTELDAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

A0A6J1CCS6 E3 SUMO-protein ligase MMS21 isoform X1 | 2.8e-116 | 87.5
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTS++RSS VS RIKSAAT+MHSENQSLLAE+RK LIMMKEIG+DLE+DNQS MVK+LEN+VVELL TYE+C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELA+PVRSMECKHVYEK  +MQYL+SK SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKV  DPFL IEIDELRK SRHSVRIQDFTELDA+
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

A0A6J1FRP6 E3 SUMO-protein ligase MMS21 | 4.9e-121 | 91.13
Query:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASTSNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGV+LE+DNQSRMVKELENSVVELLGTYE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA
        LDDEVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+SRA
Subjt:  LDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRA

Query:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        QCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

A0A6J1IXZ2 E3 SUMO-protein ligase MMS21 | 1.0e-118 | 90
Query:  MAST--SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFE
        MAST  SNS S+GVS+RIKSAATIM+SENQSLLAELRKSLIMMKEIGVDLE+DNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN YEP+EELTDFE
Subjt:  MAST--SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFE

Query:  KLLDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKS
        KLLD+EVAK+SENSSSNLQNHSIIR FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHVYEK  IMQYLKSK+S
Subjt:  KLLDDEVAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKS

Query:  RAQCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD
        RAQCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD
Subjt:  RAQCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD

SwissProt top hits | e value | % identity
Q8GYH7 E3 SUMO-protein ligase MMS21 | 6.3e-73 | 58.44
Query:  SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE
        S S S GV+ RI++A+ ++ S+N S LA++RK++ MMK I V LEK+NQ+  VK+LENSV ELL  +  CN+ S+AIQSV N Y+P E+LTDF+KLLDDE
Subjt:  SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE

Query:  VAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRAQCPV
          K+    SS  QN  ++R FREA+WNVHHAG+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+PVRSM+C+HVYEK+VI+ Y+ +  + A CPV
Subjt:  VAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRAQCPV

Query:  AACPRMLQPEKVISDPFLQIEIDELRKVSRHSVR---IQDFTE
        A C   LQ  KVI D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  AACPRMLQPEKVISDPFLQIEIDELRKVSRHSVR---IQDFTE

Arabidopsis top hits | e value | % identity
AT3G15150.1 RING/U-box superfamily protein | 4.5e-74 | 58.44
Query:  SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE
        S S S GV+ RI++A+ ++ S+N S LA++RK++ MMK I V LEK+NQ+  VK+LENSV ELL  +  CN+ S+AIQSV N Y+P E+LTDF+KLLDDE
Subjt:  SNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE

Query:  VAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRAQCPV
          K+    SS  QN  ++R FREA+WNVHHAG+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+PVRSM+C+HVYEK+VI+ Y+ +  + A CPV
Subjt:  VAKISENSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRAQCPV

Query:  AACPRMLQPEKVISDPFLQIEIDELRKVSRHSVR---IQDFTE
        A C   LQ  KVI D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  AACPRMLQPEKVISDPFLQIEIDELRKVSRHSVR---IQDFTE
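In the alignments above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how a % identity figure relates to that middle line, the snippet below counts letters in the middle line and divides by the alignment length. The function names are hypothetical, and the example uses only the last block of the first GenBank alignment, so its result is not the whole-alignment value reported in the table.

```python
def identity_counts(query, midline):
    """Return (identical positions, alignment length) for one block.

    A position counts as identical when the midline shows a letter;
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    The midline is padded in case trailing spaces were stripped.
    """
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)


def percent_identity(blocks):
    """Percent identity over a list of (query, midline) blocks."""
    identical = 0
    length = 0
    for query, midline in blocks:
        block_identical, block_length = identity_counts(query, midline)
        identical += block_identical
        length += block_length
    return 100.0 * identical / length


# Last block of the KAG6607898.1 alignment, copied from the record.
QUERY = "QCPVAACPRMLQPEKVISDPFLQIEIDELRKVSRHSVRIQDFTELDAD"
MIDLINE = "QCPVAACP+MLQPEKVISDPFL IEIDELRK S+HSVRIQDFTE+DAD"

print(identity_counts(QUERY, MIDLINE))
print(round(percent_identity([(QUERY, MIDLINE)]), 2))
```

Passing all blocks of one hit to `percent_identity` gives the figure for the full alignment. Gap columns shown as `-` in the query are counted in the length, which is an assumption about how the reported percentages were derived.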


Sequences
CDS sequence
ATGGCTTCCACTTCCAATTCCCGTTCAAGTGGCGTTTCCGATAGAATCAAATCTGCTGCCACTATTATGCACTCCGAGAATCAATCCCTTCTTGCCGAATTACGGAAGTC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGACAACCAGTCGAGGATGGTCAAGGAGCTTGAAAATTCTGTTGTTGAGCTGCTGGGTACTTATGAATCCT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAACACATATGAACCAAAAGAAGAGTTAACAGATTTTGAGAAACTACTCGACGACGAAGTTGCAAAAATCAGTGAA
AATTCATCTTCAAATTTGCAGAACCATTCAATAATTCGGAATTTTAGAGAAGCTATTTGGAATGTTCATCATGCAGGACAGCCGATGCCAGGTGAGGAGCAGGAGGACAT
TGTGATGACCAGTACTCAGTGTAATCTATTGAATGTCACTTGCCCATTGAGTGGAAAGCCTGTCACTGAATTAGCGGAACCGGTTCGCAGTATGGAATGCAAGCACGTAT
ACGAAAAGGCGGTCATAATGCAGTACCTAAAATCGAAGAAATCTCGAGCTCAATGCCCTGTTGCTGCCTGTCCCAGGATGTTGCAGCCCGAAAAGGTCATTTCTGATCCA
TTCTTACAGATTGAAATTGATGAACTGCGAAAAGTGTCTAGGCATTCTGTGAGGATACAGGATTTCACAGAGCTTGATGCAGATTAG
mRNA sequence
ATGGCTTCCACTTCCAATTCCCGTTCAAGTGGCGTTTCCGATAGAATCAAATCTGCTGCCACTATTATGCACTCCGAGAATCAATCCCTTCTTGCCGAATTACGGAAGTC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGACAACCAGTCGAGGATGGTCAAGGAGCTTGAAAATTCTGTTGTTGAGCTGCTGGGTACTTATGAATCCT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAACACATATGAACCAAAAGAAGAGTTAACAGATTTTGAGAAACTACTCGACGACGAAGTTGCAAAAATCAGTGAA
AATTCATCTTCAAATTTGCAGAACCATTCAATAATTCGGAATTTTAGAGAAGCTATTTGGAATGTTCATCATGCAGGACAGCCGATGCCAGGTGAGGAGCAGGAGGACAT
TGTGATGACCAGTACTCAGTGTAATCTATTGAATGTCACTTGCCCATTGAGTGGAAAGCCTGTCACTGAATTAGCGGAACCGGTTCGCAGTATGGAATGCAAGCACGTAT
ACGAAAAGGCGGTCATAATGCAGTACCTAAAATCGAAGAAATCTCGAGCTCAATGCCCTGTTGCTGCCTGTCCCAGGATGTTGCAGCCCGAAAAGGTCATTTCTGATCCA
TTCTTACAGATTGAAATTGATGAACTGCGAAAAGTGTCTAGGCATTCTGTGAGGATACAGGATTTCACAGAGCTTGATGCAGATTAG
Protein sequence
MASTSNSRSSGVSDRIKSAATIMHSENQSLLAELRKSLIMMKEIGVDLEKDNQSRMVKELENSVVELLGTYESCNNFSSAIQSVGNTYEPKEELTDFEKLLDDEVAKISE
NSSSNLQNHSIIRNFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAEPVRSMECKHVYEKAVIMQYLKSKKSRAQCPVAACPRMLQPEKVISDP
FLQIEIDELRKVSRHSVRIQDFTELDAD
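The protein sequence above should correspond to a translation of the CDS. The following sketch checks that for the first and last stretches of the CDS using the standard genetic code; the helper name `translate` is hypothetical, and the standard nuclear code (NCBI translation table 1) is assumed to apply to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon, keeping stops as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and final line of the CDS, copied from the record.
CDS_START = "ATGGCTTCCACTTCCAATTCCCGTTCAAGT"
CDS_END = (
    "TTCTTACAGATTGAAATTGATGAACTGCGAAAAGTGTCTAGGCATTCTGTGAGGATACAGG"
    "ATTTCACAGAGCTTGATGCAGATTAG"
)

print(translate(CDS_START))
print(translate(CDS_END))
```

Joining the seven CDS lines into one string and passing it to `translate` extends the same check to the full-length protein.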