CuGenDBv2

Cucsat.G8935 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G8935
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: E3 SUMO-protein ligase MMS21
Genome location: ctg1635:2115483..2118402
RNA-Seq Expression: Cucsat.G8935
Synteny: Cucsat.G8935
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146042.1 E3 SUMO-protein ligase MMS21 [Cucumis sativus] (e value: 2.02e-170; % identity: 100)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

XP_008463706.1 PREDICTED: E3 SUMO-protein ligase MMS21 [Cucumis melo] (e value: 4.19e-163; % identity: 96.37)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHS+NQSLLAELRKTLIMMKEIGVDLEKE Q KMVKELEKS+VELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSNFANHPIIRQFREAIWNVHHAGQ MAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata] (e value: 2.58e-141; % identity: 83.47)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+S+NQSLLAELRK+LIMMKEIGV+LE++ Q +MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSE+SSSN  NH IIRQFREAIWNVHHAGQ M GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo] (e value: 3.67e-141; % identity: 83.47)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+S+NQSLLAELRK+LIMMKEIGVDLE++ Q +MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LD+EVAKVSE+SSSN  NH IIRQFREAIWNVHHAGQ M GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

XP_038897059.1 E3 SUMO-protein ligase MMS21 isoform X1 [Benincasa hispida] (e value: 4.02e-145; % identity: 85.48)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRS GV+GRIKSAATIMHS+NQSLLAELRK LIMMKEIGV+LE++ Q +MVKELE S+VELLS YENCNNFS AIQSVGN YEPKEELTDF KL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS+ SSSN  NH IIRQFREA+WNVHHAGQ M GEE+ED+VMTSTQCNLLNVTCPLSGKPV EL EPVRS ECKH+YEKAAIMQYL SKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQ DKVV DPFL+IEIDELRK+SRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3C3 SP-RING-type domain-containing protein (e value: 9.76e-171; % identity: 100)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

A0A1S3CLF9 E3 SUMO-protein ligase MMS21 (e value: 2.03e-163; % identity: 96.37)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHS+NQSLLAELRKTLIMMKEIGVDLEKE Q KMVKELEKS+VELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSNFANHPIIRQFREAIWNVHHAGQ MAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

A0A5D3DWX9 E3 SUMO-protein ligase MMS21 (e value: 2.03e-163; % identity: 96.37)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHS+NQSLLAELRKTLIMMKEIGVDLEKE Q KMVKELEKS+VELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSNFANHPIIRQFREAIWNVHHAGQ MAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

A0A6J1FRP6 E3 SUMO-protein ligase MMS21 (e value: 1.25e-141; % identity: 83.47)
Query:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+S+NQSLLAELRK+LIMMKEIGV+LE++ Q +MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSE+SSSN  NH IIRQFREAIWNVHHAGQ M GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

A0A6J1IXZ2 E3 SUMO-protein ligase MMS21 (e value: 1.57e-140; % identity: 84.02)
Query:  SDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE
        S+S STGV+ RIKSAATIM+S+NQSLLAELRK+LIMMKEIGVDLE++ Q +MVKELE S+VELLS YE C+NFSSAIQSVGN YEP+EELTDFEKLLD+E
Subjt:  SDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDDE

Query:  VAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCPV
        VAKVSE+SSSN  NH IIRQFREAIWNVHHAGQ M GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRAQCPV
Subjt:  VAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCPV

Query:  AACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD
        AACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  AACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGRIQDFTELDAD

SwissProt top hits (e value, % identity, alignment)
Q8GYH7 E3 SUMO-protein ligase MMS21 (e value: 3.8e-70; % identity: 55.74)
Query:  ASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD
        AS S S GV GRI++A+ ++ SDN S LA++RK + MMK I V LEKE Q   VK+LE S+ ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDD
Subjt:  ASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD

Query:  EVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCP
        E  K+  + SS   N  ++RQFREA+WNVHHAG+ M G++ ED+VMTSTQC LLN+TCPLSGKPVTELA+PVRS++C+H+YEK+ I+ Y+ +  + A CP
Subjt:  EVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCP

Query:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGR---IQDFTE
        VA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGR---IQDFTE

Arabidopsis top hits (e value, % identity, alignment)
AT3G15150.1 RING/U-box superfamily protein (e value: 2.7e-71; % identity: 55.74)
Query:  ASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD
        AS S S GV GRI++A+ ++ SDN S LA++RK + MMK I V LEKE Q   VK+LE S+ ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDD
Subjt:  ASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD

Query:  EVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCP
        E  K+  + SS   N  ++RQFREA+WNVHHAG+ M G++ ED+VMTSTQC LLN+TCPLSGKPVTELA+PVRS++C+H+YEK+ I+ Y+ +  + A CP
Subjt:  EVAKVSESSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCP

Query:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGR---IQDFTE
        VA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSGR---IQDFTE


Sequences
CDS sequence
ATGGCATCCGCTTCCGATTCCCGATCAACCGGCGTCACCGGCAGAATCAAATCCGCCGCCACCATTATGCACTCCGACAATCAATCCCTCCTTGCAGAATTGCGTAAGAC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGAAAAACAGTACAAGATGGTCAAGGAGCTTGAAAAATCTATTGTTGAGCTGTTGAGTGCCTATGAAAACT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATACATATGAACCAAAAGAAGAGTTGACAGATTTTGAGAAACTACTTGATGATGAAGTTGCAAAAGTCAGCGAA
AGTTCATCTTCAAATTTTGCGAACCATCCTATAATTCGGCAATTTAGAGAAGCTATTTGGAATGTTCACCATGCAGGACAAGCTATGGCAGGTGAGGAGCAGGAAGACGT
TGTGATGACCAGTACTCAGTGTAATCTATTGAATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTAGCAGAACCCGTTCGCAGCGTGGAATGCAAGCACATAT
ACGAAAAGGCAGCCATAATGCAGTACCTTAATTCCAAGAAATCTCGCGCTCAATGCCCGGTTGCAGCCTGTCCTAAGATGTTGCAGCCTGATAAGGTTGTGCTTGATCCA
TTCTTAGAGATTGAAATCGATGAACTACGAAAGATGTCTAGGCATTCTGGGAGAATACAGGACTTCACAGAGCTTGATGCAGATTAG
mRNA sequence
ATGGCATCCGCTTCCGATTCCCGATCAACCGGCGTCACCGGCAGAATCAAATCCGCCGCCACCATTATGCACTCCGACAATCAATCCCTCCTTGCAGAATTGCGTAAGAC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGAAAAACAGTACAAGATGGTCAAGGAGCTTGAAAAATCTATTGTTGAGCTGTTGAGTGCCTATGAAAACT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATACATATGAACCAAAAGAAGAGTTGACAGATTTTGAGAAACTACTTGATGATGAAGTTGCAAAAGTCAGCGAA
AGTTCATCTTCAAATTTTGCGAACCATCCTATAATTCGGCAATTTAGAGAAGCTATTTGGAATGTTCACCATGCAGGACAAGCTATGGCAGGTGAGGAGCAGGAAGACGT
TGTGATGACCAGTACTCAGTGTAATCTATTGAATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTAGCAGAACCCGTTCGCAGCGTGGAATGCAAGCACATAT
ACGAAAAGGCAGCCATAATGCAGTACCTTAATTCCAAGAAATCTCGCGCTCAATGCCCGGTTGCAGCCTGTCCTAAGATGTTGCAGCCTGATAAGGTTGTGCTTGATCCA
TTCTTAGAGATTGAAATCGATGAACTACGAAAGATGTCTAGGCATTCTGGGAGAATACAGGACTTCACAGAGCTTGATGCAGATTAG
Protein sequence
MASASDSRSTGVTGRIKSAATIMHSDNQSLLAELRKTLIMMKEIGVDLEKEKQYKMVKELEKSIVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDDEVAKVSE
SSSSNFANHPIIRQFREAIWNVHHAGQAMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSVECKHIYEKAAIMQYLNSKKSRAQCPVAACPKMLQPDKVVLDP
FLEIEIDELRKMSRHSGRIQDFTELDAD