; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G19120 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G19120
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionE3 SUMO-protein ligase MMS21
Genome locationClcChr11:29601303..29603862
RNA-Seq ExpressionClc11G19120
SyntenyClc11G19120
Gene Ontology termsGO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004181 - Zinc finger, MIZ-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR026846 - E3 SUMO-protein ligase Nse2 (Mms21)


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607898.1 E3 SUMO-protein ligase MMS21, partial [Cucurbita argyrosperma subsp. sororia]3.3e-11484.71Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MAS S+S S+GVS RIKSAATIM+SENQSLLAELRK+LIMMKEIGVDLERDNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN+YEP+EELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LD+EVAK+SENS+SNL NHSIIRQFREAIW       NVHH+GQPMPGEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKH+YEK AIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        KSK+SRAQCPVAACPKMLQPEKV++DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

XP_008463706.1 PREDICTED: E3 SUMO-protein ligase MMS21 [Cucumis melo]5.0e-11585.49Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MASASDSRS+GV+GRIKSAATIMHSENQSLLAELRK LIMMKEIGVDLE++NQ++MVKELE S+VELLS YE+CNNFSSAIQSVGN YEPKEELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+S NS+SN  NH IIRQFREAIW       NVHH+GQPM GEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKHIYEKAAIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
         SKKSRAQCPVAACPKMLQP+KVV DPFL+IEIDELRKMSRHS RIQDFTELDAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata]1.9e-11484.71Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MAS S+S S+GVS RIKSAATIM+SENQSLLAELRK+LIMMKEIGV+LERDNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN+YEP+EELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+SENS+SNL NHSIIRQFREAIW       NVHH+GQPMPGEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKH+YEK AIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        KSK+SRAQCPVAACPKMLQPEKV++DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo]2.5e-11484.71Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MAS S+S S+GVS RIKSAATIM+SENQSLLAELRK+LIMMKEIGVDLERDNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN+YEP+EELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LD+EVAK+SENS+SNL NHSIIRQFREAIW       NVHH+GQPMPGEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKH+YEK AIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        KSK+SRAQCPVAACPKMLQPEKV++DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

XP_038897059.1 E3 SUMO-protein ligase MMS21 isoform X1 [Benincasa hispida]7.5e-11987.84Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MASASDSRS+GVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGV+LERDNQSRMVKELENSVVELLSTYE+CNNFS AIQSVGN+YEPKEELTDFGKL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+S++S+SNL+NHSIIRQFREA+W       NVHH+GQPMPGEE+EDIVMTSTQCNLLN+TCPLSGKPV EL EP+RS ECKH+YEKAAIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        KSKKSRAQCPVAACPKMLQ +KVV DPFLQIEIDELRK+SRHS RIQDFTELDAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

TrEMBL top hitse value%identityAlignment
A0A1S3CLF9 E3 SUMO-protein ligase MMS212.4e-11585.49Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MASASDSRS+GV+GRIKSAATIMHSENQSLLAELRK LIMMKEIGVDLE++NQ++MVKELE S+VELLS YE+CNNFSSAIQSVGN YEPKEELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+S NS+SN  NH IIRQFREAIW       NVHH+GQPM GEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKHIYEKAAIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
         SKKSRAQCPVAACPKMLQP+KVV DPFL+IEIDELRKMSRHS RIQDFTELDAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

A0A5D3DWX9 E3 SUMO-protein ligase MMS212.4e-11585.49Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MASASDSRS+GV+GRIKSAATIMHSENQSLLAELRK LIMMKEIGVDLE++NQ++MVKELE S+VELLS YE+CNNFSSAIQSVGN YEPKEELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+S NS+SN  NH IIRQFREAIW       NVHH+GQPM GEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKHIYEKAAIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
         SKKSRAQCPVAACPKMLQP+KVV DPFL+IEIDELRKMSRHS RIQDFTELDAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

A0A6J1CCS6 E3 SUMO-protein ligase MMS21 isoform X12.7e-11483.92Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MAS SD+RSS VSGRIKSAAT+MHSENQSLLAE+RK LIMMKEIG+DLERDNQS MVK+LEN+VVELLSTYE+C+NFSSAIQSVGN+YEP+EELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+SENS+SNL NHSIIR+FREAIW       NVHH+GQPMPGEEQEDIVMTSTQCNLLN+TCPLSGKPVTELA+P+RSMECKH+YEK A+MQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        +SK SRAQCPVAACPKMLQPEKV  DPFL IEIDELRK SRHS RIQDFTELDA+
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

A0A6J1FRP6 E3 SUMO-protein ligase MMS219.3e-11584.71Show/hide
Query:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL
        MAS S+S S+GVS RIKSAATIM+SENQSLLAELRK+LIMMKEIGV+LERDNQSRMVKELENSVVELL TYE C+NFSSAIQSVGN+YEP+EELTDF KL
Subjt:  MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKL

Query:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL
        LDDEVAK+SENS+SNL NHSIIRQFREAIW       NVHH+GQPMPGEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKH+YEK AIMQYL
Subjt:  LDDEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYL

Query:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        KSK+SRAQCPVAACPKMLQPEKV++DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  KSKKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

A0A6J1IXZ2 E3 SUMO-protein ligase MMS217.8e-11484.98Show/hide
Query:  SASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLD
        S S+S S+GVS RIKSAATIM+SENQSLLAELRK+LIMMKEIGVDLERDNQSRMVKELENSVVELLSTYE C+NFSSAIQSVGN+YEP+EELTDF KLLD
Subjt:  SASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLD

Query:  DEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKS
        +EVAK+SENS+SNL NHSIIRQFREAIW       NVHH+GQPMPGEEQED+VMTSTQCNLLN+TCPLSGKPVTELAEP+RS ECKH+YEK AIMQYLKS
Subjt:  DEVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKS

Query:  KKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD
        K+SRAQCPVAACPKMLQPEKV++DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  KKSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD

SwissProt top hitse value%identityAlignment
Q8GYH7 E3 SUMO-protein ligase MMS217.2e-7255.78Show/hide
Query:  ASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLDD
        AS S S GV+GRI++A+ ++ S+N S LA++RKA+ MMK I V LE++NQ+  VK+LENSV ELL  + DCN+ S+AIQSV N Y+P E+LTDF KLLDD
Subjt:  ASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLDD

Query:  EVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKSK
        E  KL    +S   N  ++RQFREA+W       NVHH+G+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+P+RSM+C+H+YEK+ I+ Y+ + 
Subjt:  EVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKSK

Query:  KSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSAR---IQDFTE
         + A CPVA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  KSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSAR---IQDFTE

Arabidopsis top hitse value%identityAlignment
AT3G15150.1 RING/U-box superfamily protein5.1e-7355.78Show/hide
Query:  ASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLDD
        AS S S GV+GRI++A+ ++ S+N S LA++RKA+ MMK I V LE++NQ+  VK+LENSV ELL  + DCN+ S+AIQSV N Y+P E+LTDF KLLDD
Subjt:  ASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLDD

Query:  EVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKSK
        E  KL    +S   N  ++RQFREA+W       NVHH+G+PMPG++ EDIVMTSTQC LLN+TCPLSGKPVTELA+P+RSM+C+H+YEK+ I+ Y+ + 
Subjt:  EVAKLSENSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKSK

Query:  KSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSAR---IQDFTE
         + A CPVA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  KSRAQCPVAACPKMLQPEKVVADPFLQIEIDELRKMSRHSAR---IQDFTE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCGCTTCCGATTCCCGTTCAAGCGGCGTTTCCGGTAGAATCAAATCCGCCGCCACTATTATGCACTCCGAGAATCAATCCCTCCTTGCCGAATTGCGGAAGGC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAAAGGGATAACCAATCCAGGATGGTCAAGGAGCTTGAAAATTCTGTTGTCGAGCTGTTGAGTACTTATGAAGACT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATATGTATGAACCAAAAGAAGAGTTAACAGATTTTGGGAAACTACTTGACGATGAAGTTGCAAAACTCAGTGAA
AATTCCACTTCAAATTTGCTGAACCATTCAATAATTCGGCAATTTAGAGAAGCTATTTGGCCGATCTCAGTATTCCTGCAGAATGTTCATCATTCAGGACAACCAATGCC
AGGTGAGGAGCAGGAGGACATTGTGATGACCAGTACTCAGTGTAATCTATTGAATATCACTTGCCCGTTAAGTGGAAAGCCTGTCACCGAATTAGCAGAACCCATTCGCA
GTATGGAATGCAAGCACATATATGAAAAGGCAGCCATAATGCAGTATCTTAAATCCAAGAAATCTCGCGCGCAATGCCCGGTTGCAGCCTGTCCTAAGATGTTGCAGCCC
GAAAAGGTTGTAGCTGATCCGTTCTTACAGATTGAAATTGATGAACTCCGAAAAATGTCTAGGCATTCTGCGAGGATACAAGACTTCACAGAGCTTGATGCAGATTAG
mRNA sequenceShow/hide mRNA sequence
CCTAAATAACCTCCTACAAAGGCAACCGTAATTTAGGATTTAAACCTTAATTTCGAGGGGTTCTCCAAAACTCGGACCATGTTGTTTCCTCAAGCGGGTATTCTATTTTT
CGCGCTTTAAAAGTTAAAATTCGAACCACCGTCTGAGGAATTTCTGTTTCGGCAATCTCTTCTTCAAAGGAATGGCATCCGCTTCCGATTCCCGTTCAAGCGGCGTTTCC
GGTAGAATCAAATCCGCCGCCACTATTATGCACTCCGAGAATCAATCCCTCCTTGCCGAATTGCGGAAGGCGCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAAAG
GGATAACCAATCCAGGATGGTCAAGGAGCTTGAAAATTCTGTTGTCGAGCTGTTGAGTACTTATGAAGACTGTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATA
TGTATGAACCAAAAGAAGAGTTAACAGATTTTGGGAAACTACTTGACGATGAAGTTGCAAAACTCAGTGAAAATTCCACTTCAAATTTGCTGAACCATTCAATAATTCGG
CAATTTAGAGAAGCTATTTGGCCGATCTCAGTATTCCTGCAGAATGTTCATCATTCAGGACAACCAATGCCAGGTGAGGAGCAGGAGGACATTGTGATGACCAGTACTCA
GTGTAATCTATTGAATATCACTTGCCCGTTAAGTGGAAAGCCTGTCACCGAATTAGCAGAACCCATTCGCAGTATGGAATGCAAGCACATATATGAAAAGGCAGCCATAA
TGCAGTATCTTAAATCCAAGAAATCTCGCGCGCAATGCCCGGTTGCAGCCTGTCCTAAGATGTTGCAGCCCGAAAAGGTTGTAGCTGATCCGTTCTTACAGATTGAAATT
GATGAACTCCGAAAAATGTCTAGGCATTCTGCGAGGATACAAGACTTCACAGAGCTTGATGCAGATTAG
Protein sequenceShow/hide protein sequence
MASASDSRSSGVSGRIKSAATIMHSENQSLLAELRKALIMMKEIGVDLERDNQSRMVKELENSVVELLSTYEDCNNFSSAIQSVGNMYEPKEELTDFGKLLDDEVAKLSE
NSTSNLLNHSIIRQFREAIWPISVFLQNVHHSGQPMPGEEQEDIVMTSTQCNLLNITCPLSGKPVTELAEPIRSMECKHIYEKAAIMQYLKSKKSRAQCPVAACPKMLQP
EKVVADPFLQIEIDELRKMSRHSARIQDFTELDAD