CuGenDBv2

MC11g0848 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0848
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: E3 SUMO-protein ligase MMS21
Genome location: MC11:7219368..7222913
RNA-Seq Expression: MC11g0848
Synteny: MC11g0848
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR004181 - Zinc finger, MIZ-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR026846 - E3 SUMO-protein ligase Nse2 (Mms21)
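The genome location in this record follows a `sequence:start..end` pattern. The sketch below shows one way to split such a string and compute the span it covers. The function names `parse_location` and `span_length` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption that this page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("MC11:7219368..7222913")
print(seqid, start, end, span_length(start, end))
# prints: MC11 7219368 7222913 3546
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.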


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607898.1 E3 SUMO-protein ligase MMS21, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.52e-146; % identity: 86.29
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTS++ S+ VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG+DLERDNQS MVK+LEN+VVELL TYE C NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LD+EVAKVSENSSSNL NHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

XP_022139610.1 E3 SUMO-protein ligase MMS21 isoform X1 [Momordica charantia]; e value: 6.08e-170; % identity: 100
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata]; e value: 4.30e-147; % identity: 86.69
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTS++ S+ VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG++LERDNQS MVK+LEN+VVELL TYE C NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDEVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

XP_022981951.1 E3 SUMO-protein ligase MMS21 [Cucurbita maxima]; e value: 1.08e-145; % identity: 86.4
Query:  MASTSDTRSSA--VSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFE
        MASTS++ S++  VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG+DLERDNQS MVK+LEN+VVELLSTYE C NFSSAIQSVGNIYEP+EELTDFE
Subjt:  MASTSDTRSSA--VSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFE

Query:  KLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNS
        KLLD+EVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK S
Subjt:  KLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNS

Query:  RAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        RAQCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  RAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo]; e value: 6.10e-147; % identity: 86.69
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTS++ S+ VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG+DLERDNQS MVK+LEN+VVELL TYE C NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LD+EVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

TrEMBL top hits (e value, % identity, alignment)
A0A5D3DWX9 E3 SUMO-protein ligase MMS21; e value: 8.95e-142; % identity: 83.06
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MAS SD+RS+ V+GRIKSAAT+MHSENQSLLAE+RK LIMMKEIG+DLE++NQ+ MVK+LE ++VELLS YENC+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDEVAKVS NSSSN  NH IIR+FREAIWNVHHAGQPM GEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKH+YEK A+MQYL SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQP+KV  DPFL IEIDELRK SRHS RIQDFTELDA+
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

A0A6J1CCS6 E3 SUMO-protein ligase MMS21 isoform X1; e value: 2.94e-170; % identity: 100
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

A0A6J1CD70 E3 SUMO-protein ligase MMS21 isoform X2; e value: 2.82e-145; % identity: 100
Query:  MMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPM
        MMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPM
Subjt:  MMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPM

Query:  PGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRI
        PGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRI
Subjt:  PGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRI

Query:  QDFTELDAE
        QDFTELDAE
Subjt:  QDFTELDAE

A0A6J1FRP6 E3 SUMO-protein ligase MMS21; e value: 2.08e-147; % identity: 86.69
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MASTS++ S+ VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG++LERDNQS MVK+LEN+VVELL TYE C NFSSAIQSVGNIYEP+EELTDFEKL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDEVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK SRA
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        QCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

A0A6J1IXZ2 E3 SUMO-protein ligase MMS21; e value: 5.25e-146; % identity: 86.4
Query:  MASTSDTRSSA--VSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFE
        MASTS++ S++  VS RIKSAAT+M+SENQSLLAE+RK LIMMKEIG+DLERDNQS MVK+LEN+VVELLSTYE C NFSSAIQSVGNIYEP+EELTDFE
Subjt:  MASTSDTRSSA--VSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFE

Query:  KLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNS
        KLLD+EVAKVSENSSSNLQNHSIIR+FREAIWNVHHAGQPMPGEEQED+VMTSTQCNLLNVTCPLSGKPVTELA+PVRS ECKHVYEK A+MQYL+SK S
Subjt:  KLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNS

Query:  RAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
        RAQCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+
Subjt:  RAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE

SwissProt top hits (e value, % identity, alignment)
Q8GYH7 E3 SUMO-protein ligase MMS21; e value: 5.0e-69; % identity: 56.28
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MAS S   S  V+GRI++A+ ++ S+N S LA+IRK + MMK I + LE++NQ+  VK LEN+V ELL  + +C++ S+AIQSV N Y+P E+LTDF+KL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDE  K+    SS  QN  ++R+FREA+WNVHHAG+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA PVRSM+C+HVYEK  ++ Y+   N  A
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVR---IQDFTE
         CPVA C   LQ  KV  D  L  EI+E+R  ++ S R   I+DFTE
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVR---IQDFTE

Arabidopsis top hits (e value, % identity, alignment)
AT3G15150.1 RING/U-box superfamily protein; e value: 3.5e-70; % identity: 56.28
Query:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL
        MAS S   S  V+GRI++A+ ++ S+N S LA+IRK + MMK I + LE++NQ+  VK LEN+V ELL  + +C++ S+AIQSV N Y+P E+LTDF+KL
Subjt:  MASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVVELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKL

Query:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA
        LDDE  K+    SS  QN  ++R+FREA+WNVHHAG+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA PVRSM+C+HVYEK  ++ Y+   N  A
Subjt:  LDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPVRSMECKHVYEKEAVMQYLRSKNSRA

Query:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVR---IQDFTE
         CPVA C   LQ  KV  D  L  EI+E+R  ++ S R   I+DFTE
Subjt:  QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVR---IQDFTE
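In the alignments above, the unlabeled middle line of each block repeats the residue where query and subject are identical, shows `+` for a similar residue, and is blank otherwise. As a rough sketch, identities in one block can be counted from the query line and that middle line alone. The helper name `block_identity` is hypothetical, and the definition used here (identical columns divided by aligned columns in the block) is an assumption. It is not necessarily how the reported % identity values were computed.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_columns, total_columns) for one alignment block.

    A column counts as identical when the midline repeats the query
    residue; '+' (similar) and ' ' (mismatch) do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    return identical, len(query)


# Final block of the KAG6607898.1 alignment, copied from the listing above.
query = "QCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE"
midline = "QCPVAACPKMLQPEKV  DPFL IEIDELRKTS+HSVRIQDFTE+DA+"

identical, total = block_identity(query, midline)
print(identical, total, round(100 * identical / total, 2))
# prints: 42 48 87.5
```

Summing the two counts over all blocks of a hit gives a whole-alignment estimate under the same assumption.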


Sequences
CDS sequence
GGTTACGACCGAGTTCTCCAAAATTCGAACCATTTTCTTTCACCAAGTGGGTACTCTCTTTTCGCGCTTAAACTAACCATCGCCGGAGGAATTTCGGCGATTTCTTCTTC
CGAGACTACTGATCCTCGTACGGCAATGGCGTCGACTTCTGATACCCGTTCAAGTGCCGTTTCGGGCAGAATCAAATCCGCTGCCACTATGATGCACTCCGAGAATCAGT
CCCTGCTTGCTGAAATACGCAAGTTGTTGATTATGATGAAGGAAATTGGGATGGATTTGGAGAGGGACAATCAGTCGACGATGGTCAAGAAGCTTGAAAATGCTGTTGTT
GAGCTGTTGAGTACTTACGAAAACTGTGACAACTTTTCATCTGCGATTCAGTCGGTTGGAAATATATATGAACCAAGAGAAGAGTTAACAGATTTTGAGAAACTACTTGA
CGATGAAGTTGCAAAAGTCAGTGAAAATTCATCTTCAAATTTGCAGAACCATTCAATAATTCGGAAGTTTAGAGAAGCTATTTGGAATGTACATCATGCAGGGCAACCAA
TGCCAGGTGAAGAGCAGGAGGACATTGTGATGACCAGTACTCAGTGTAATTTATTGAATGTCACTTGCCCATTAAGTGGAAAGCCTGTCACTGAATTGGCACAACCTGTT
CGCAGTATGGAATGCAAGCATGTGTATGAAAAGGAGGCTGTGATGCAGTACCTAAGATCTAAGAATTCTCGAGCTCAATGCCCTGTAGCAGCCTGTCCTAAGATGTTGCA
GCCAGAAAAGGTCACGCCTGATCCGTTCTTACTGATTGAAATTGATGAACTGAGAAAAACGTCTAGGCACTCTGTGAGAATACAAGATTTTACAGAGCTTGATGCAGAAT
AG
mRNA sequence
TAGGTTACGACCGAGTTCTCCAAAATTCGAACCATTTTCTTTCACCAAGTGGGTACTCTCTTTTCGCGCTTAAACTAACCATCGCCGGAGGAATTTCGGCGATTTCTTCT
TCCGAGACTACTGATCCTCGTACGGCAATGGCGTCGACTTCTGATACCCGTTCAAGTGCCGTTTCGGGCAGAATCAAATCCGCTGCCACTATGATGCACTCCGAGAATCA
GTCCCTGCTTGCTGAAATACGCAAGTTGTTGATTATGATGAAGGAAATTGGGATGGATTTGGAGAGGGACAATCAGTCGACGATGGTCAAGAAGCTTGAAAATGCTGTTG
TTGAGCTGTTGAGTACTTACGAAAACTGTGACAACTTTTCATCTGCGATTCAGTCGGTTGGAAATATATATGAACCAAGAGAAGAGTTAACAGATTTTGAGAAACTACTT
GACGATGAAGTTGCAAAAGTCAGTGAAAATTCATCTTCAAATTTGCAGAACCATTCAATAATTCGGAAGTTTAGAGAAGCTATTTGGAATGTACATCATGCAGGGCAACC
AATGCCAGGTGAAGAGCAGGAGGACATTGTGATGACCAGTACTCAGTGTAATTTATTGAATGTCACTTGCCCATTAAGTGGAAAGCCTGTCACTGAATTGGCACAACCTG
TTCGCAGTATGGAATGCAAGCATGTGTATGAAAAGGAGGCTGTGATGCAGTACCTAAGATCTAAGAATTCTCGAGCTCAATGCCCTGTAGCAGCCTGTCCTAAGATGTTG
CAGCCAGAAAAGGTCACGCCTGATCCGTTCTTACTGATTGAAATTGATGAACTGAGAAAAACGTCTAGGCACTCTGTGAGAATACAAGATTTTACAGAGCTTGATGCAGA
ATAGCAACTCTGAAGTAGTTTCTTTGCCTTTTTCTCTTGAAGTCATCGTCGTAGTCGTCGTTACCGTAGTCGTTAAATTGTCATCATTTCATCAAATGCTTTGTTTCGTA
AGTTGAGCTGGTTGGATTTAAATGGTATCTTTTTGTCCATGTTATTGTGAACATCGTGTTGAATGGAGAAGTAGGACCTCAGCTACAAACTGTAATTGCCCTTTGTATTG
GAGCTGTTCTTTTTCTCCTCTGGTATGTAAATATCATTTCTATTTACAAAAAATTTTACTCTAAAATGTTCATTTGGCA
Protein sequence
GYDRVLQNSNHFLSPSGYSLFALKLTIAGGISAISSSETTDPRTAMASTSDTRSSAVSGRIKSAATMMHSENQSLLAEIRKLLIMMKEIGMDLERDNQSTMVKKLENAVV
ELLSTYENCDNFSSAIQSVGNIYEPREELTDFEKLLDDEVAKVSENSSSNLQNHSIIRKFREAIWNVHHAGQPMPGEEQEDIVMTSTQCNLLNVTCPLSGKPVTELAQPV
RSMECKHVYEKEAVMQYLRSKNSRAQCPVAACPKMLQPEKVTPDPFLLIEIDELRKTSRHSVRIQDFTELDAE
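The listed protein sequence begins `GYDRVLQNSN`, which is what the first 30 bases of the listed CDS sequence give when read from the first base in frame 1. The homology alignments, by contrast, start at `MASTSD`, which is further downstream. The sketch below checks both observations with the standard genetic code. The function name `translate` is hypothetical, and nothing here is claimed to be how the database derived its protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, 'X' an unknown codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the CDS sequence as listed on this page.
print(translate("GGTTACGACCGAGTTCTCCAAAATTCGAAC"))  # prints: GYDRVLQNSN

# Bases starting at the first in-frame ATG of the listing.
print(translate("ATGGCGTCGACTTCTGATACC"))  # prints: MASTSDT
```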