; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0003801 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0003801
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionE3 SUMO-protein ligase MMS21
Genome locationchr07:3792093..3794958
RNA-Seq ExpressionIVF0003801
SyntenyIVF0003801
Gene Ontology termsGO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0080038 - positive regulation of cytokinin-activated signaling pathway (biological process)
GO:0008284 - positive regulation of cell population proliferation (biological process)
GO:0010082 - regulation of root meristem growth (biological process)
GO:0060250 - germ-line stem-cell niche homeostasis (biological process)
GO:0016925 - protein sumoylation (biological process)
GO:0048509 - regulation of meristem development (biological process)
GO:0032876 - negative regulation of DNA endoreduplication (biological process)
GO:0045931 - positive regulation of mitotic cell cycle (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030915 - Smc5-Smc6 complex (cellular component)
GO:0016874 - ligase activity (molecular function)
GO:0061665 - SUMO ligase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004181 - Zinc finger, MIZ-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR026846 - E3 SUMO-protein ligase Nse2 (Mms21)


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146042.1 E3 SUMO-protein ligase MMS21 [Cucumis sativus]8.45e-16396.37Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHS+NQSLLAELRKTLIMMKEIGVDLEKE Q KMVKELEKS+VELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSNFANHPIIRQFREAIWNVHHAGQ MAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

XP_008463706.1 PREDICTED: E3 SUMO-protein ligase MMS21 [Cucumis melo]1.42e-170100Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

XP_022941288.1 E3 SUMO-protein ligase MMS21 [Cucurbita moschata]1.34e-14384.68Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+SENQSLLAELRK+LIMMKEIGV+LE++NQ++MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS NSSSN  NH IIRQFREAIWNVHHAGQPM GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

XP_023525588.1 E3 SUMO-protein ligase MMS21 [Cucurbita pepo subsp. pepo]1.90e-14384.68Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+SENQSLLAELRK+LIMMKEIGVDLE++NQ++MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LD+EVAKVS NSSSN  NH IIRQFREAIWNVHHAGQPM GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

XP_038897059.1 E3 SUMO-protein ligase MMS21 isoform X1 [Benincasa hispida]3.60e-14887.1Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRS GV+GRIKSAATIMHSENQSLLAELRK LIMMKEIGV+LE++NQ++MVKELE S+VELLS YENCNNFS AIQSVGN YEPKEELTDF KL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSN  NH IIRQFREA+WNVHHAGQPM GEE+ED+VMTSTQCNLLNVTCPLSGKPV EL EPVRSAECKH+YEKAAIMQYL SKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQ DKVV DPFL+IEIDELRK+SRHS+RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

TrEMBL top hitse value%identityAlignment
A0A0A0L3C3 SP-RING-type domain-containing protein3.9e-12696.37Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHS+NQSLLAELRKTLIMMKEIGVDLEKE Q KMVKELEKS+VELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS +SSSNFANHPIIRQFREAIWNVHHAGQ MAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHS RIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

A0A1S3CLF9 E3 SUMO-protein ligase MMS214.8e-132100Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

A0A5D3DWX9 E3 SUMO-protein ligase MMS214.8e-132100Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

A0A6J1FRP6 E3 SUMO-protein ligase MMS211.6e-11184.68Show/hide
Query:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL
        MAS S+S STGV+ RIKSAATIM+SENQSLLAELRK+LIMMKEIGV+LE++NQ++MVKELE S+VELL  YE C+NFSSAIQSVGN YEP+EELTDFEKL
Subjt:  MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKL

Query:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA
        LDDEVAKVS NSSSN  NH IIRQFREAIWNVHHAGQPM GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRA
Subjt:  LDDEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRA

Query:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        QCPVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  QCPVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

A0A6J1IXZ2 E3 SUMO-protein ligase MMS211.0e-11084.96Show/hide
Query:  SASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLD
        S S+S STGV+ RIKSAATIM+SENQSLLAELRK+LIMMKEIGVDLE++NQ++MVKELE S+VELLS YE C+NFSSAIQSVGN YEP+EELTDFEKLLD
Subjt:  SASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLD

Query:  DEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQC
        +EVAKVS NSSSN  NH IIRQFREAIWNVHHAGQPM GEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRS ECKH+YEK AIMQYL SK+SRAQC
Subjt:  DEVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQC

Query:  PVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD
        PVAACPKMLQP+KV+ DPFL IEIDELRK S+HS RIQDFTE+DAD
Subjt:  PVAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSERIQDFTELDAD

SwissProt top hitse value%identityAlignment
Q8GYH7 E3 SUMO-protein ligase MMS214.5e-7156.15Show/hide
Query:  ASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD
        AS S S GV GRI++A+ ++ S+N S LA++RK + MMK I V LEKENQ   VK+LE S+ ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDD
Subjt:  ASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD

Query:  EVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQCP
        E  K+    SS   N  ++RQFREA+WNVHHAG+PM G++ ED+VMTSTQC LLN+TCPLSGKPVTELA+PVRS +C+H+YEK+ I+ Y+ +  + A CP
Subjt:  EVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQCP

Query:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSER---IQDFTE
        VA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSER---IQDFTE

Arabidopsis top hitse value%identityAlignment
AT3G15150.1 RING/U-box superfamily protein3.2e-7256.15Show/hide
Query:  ASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD
        AS S S GV GRI++A+ ++ S+N S LA++RK + MMK I V LEKENQ   VK+LE S+ ELL  + +CN+ S+AIQSV N Y+P E+LTDF+KLLDD
Subjt:  ASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDD

Query:  EVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQCP
        E  K+    SS   N  ++RQFREA+WNVHHAG+PM G++ ED+VMTSTQC LLN+TCPLSGKPVTELA+PVRS +C+H+YEK+ I+ Y+ +  + A CP
Subjt:  EVAKVSVNSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQCP

Query:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSER---IQDFTE
        VA C   LQ  KV+ D  L+ EI+E+R +++ S R   I+DFTE
Subjt:  VAACPKMLQPDKVVLDPFLEIEIDELRKMSRHSER---IQDFTE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCGCTTCCGATTCTCGATCTACCGGCGTCACCGGCAGAATCAAATCCGCCGCCACCATTATGCACTCCGAGAACCAATCCCTCCTTGCAGAATTGCGGAAGAC
GCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGAAAATCAGAACAAGATGGTCAAGGAGCTTGAAAAATCTATGGTTGAGCTGTTGAGTGCTTATGAAAACT
GTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATACATATGAACCAAAAGAAGAGTTGACAGATTTTGAGAAACTACTTGATGATGAAGTTGCAAAAGTGAGCGTA
AATTCATCTTCAAATTTTGCGAACCATCCTATAATTCGGCAATTTAGAGAAGCTATTTGGAATGTTCACCATGCAGGACAACCAATGGCAGGTGAGGAGCAGGAAGACGT
TGTGATGACCAGTACGCAGTGTAATCTATTGAATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTAGCAGAACCCGTTCGCAGTGCGGAATGCAAGCACATAT
ACGAAAAGGCAGCCATAATGCAGTACCTTAATTCCAAGAAATCTCGCGCTCAATGCCCGGTTGCAGCCTGTCCTAAGATGTTGCAGCCCGATAAGGTTGTGCTTGATCCA
TTCTTAGAGATTGAAATCGATGAACTACGAAAGATGTCTAGGCATTCTGAGAGAATACAGGACTTCACAGAGCTTGATGCAGATTAA
mRNA sequenceShow/hide mRNA sequence
GGAAATATTATCTAAAATAGGCGTGCCTTTCCCCTCTATCGACGGATTTTCAAAACTCCGGGCGATTTTGTTTCCTCAAGCGGGTATTCTATATTTTCCCGCTCTAGCAA
TAGTTTCTGTTTCGAGAATCTCTTTTTCGAACGAATGGCATCCGCTTCCGATTCTCGATCTACCGGCGTCACCGGCAGAATCAAATCCGCCGCCACCATTATGCACTCCG
AGAACCAATCCCTCCTTGCAGAATTGCGGAAGACGCTGATTATGATGAAGGAAATTGGGGTGGATTTGGAGAAGGAAAATCAGAACAAGATGGTCAAGGAGCTTGAAAAA
TCTATGGTTGAGCTGTTGAGTGCTTATGAAAACTGTAACAACTTTTCATCTGCAATTCAGTCGGTTGGAAATACATATGAACCAAAAGAAGAGTTGACAGATTTTGAGAA
ACTACTTGATGATGAAGTTGCAAAAGTGAGCGTAAATTCATCTTCAAATTTTGCGAACCATCCTATAATTCGGCAATTTAGAGAAGCTATTTGGAATGTTCACCATGCAG
GACAACCAATGGCAGGTGAGGAGCAGGAAGACGTTGTGATGACCAGTACGCAGTGTAATCTATTGAATGTCACTTGCCCGTTAAGTGGAAAGCCTGTCACTGAATTAGCA
GAACCCGTTCGCAGTGCGGAATGCAAGCACATATACGAAAAGGCAGCCATAATGCAGTACCTTAATTCCAAGAAATCTCGCGCTCAATGCCCGGTTGCAGCCTGTCCTAA
GATGTTGCAGCCCGATAAGGTTGTGCTTGATCCATTCTTAGAGATTGAAATCGATGAACTACGAAAGATGTCTAGGCATTCTGAGAGAATACAGGACTTCACAGAGCTTG
ATGCAGATTAACCACTGAATTACTTTCTTTTCCTTTTTTATATTTTGTTGTGTTTATTGGCATTGTCGCCGTCGGTCGTAGGGGTCAGTCTCTGGATCAAATGTTTTGGT
TGGTAAGTTGAGGTAGTTGAGATGTATATGGTGGTATCTATAGTTTGTCTATATCGTCTCGAGTTGAATGGAGAAGTGTGGCTTCATTGACAAATTGTCATTGCCCTTTA
TATGTTGAATAAAAAATGGTGGTATGTACATTAAATTTGTG
Protein sequenceShow/hide protein sequence
MASASDSRSTGVTGRIKSAATIMHSENQSLLAELRKTLIMMKEIGVDLEKENQNKMVKELEKSMVELLSAYENCNNFSSAIQSVGNTYEPKEELTDFEKLLDDEVAKVSV
NSSSNFANHPIIRQFREAIWNVHHAGQPMAGEEQEDVVMTSTQCNLLNVTCPLSGKPVTELAEPVRSAECKHIYEKAAIMQYLNSKKSRAQCPVAACPKMLQPDKVVLDP
FLEIEIDELRKMSRHSERIQDFTELDAD