; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007650 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007650
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSH3 domain-containing protein
Genome locationChr10:9275846..9286264
RNA-Seq ExpressionHG10007650
SyntenyHG10007650
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001509 - NAD-dependent epimerase/dehydratase
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588255.1 hypothetical protein SDJN03_16820, partial [Cucurbita argyrosperma subsp. sororia]6.1e-6281.37Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSD  AQGVSTGGGIPTPNWDALADIDAVGGVTRADVVP IVNQLV EASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAP KRKKGVLG KGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

XP_008442260.1 PREDICTED: uncharacterized protein LOC103486168 [Cucumis melo]3.1e-6685.09Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

XP_011653942.1 uncharacterized protein LOC101209457 [Cucumis sativus]3.1e-6685.09Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

XP_022926173.1 uncharacterized protein LOC111433357 isoform X2 [Cucurbita moschata]6.1e-6281.37Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSD  AQGVSTGGGIPTPNWDALADIDAVGGVTRADVVP IVNQLV EASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAP KRKKGVLG KGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

XP_038881318.1 uncharacterized protein LOC120072865 [Benincasa hispida]2.0e-6584.47Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDA GGVTRADVVPRIVNQLVKEASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

TrEMBL top hitse value%identityAlignment
A0A0A0L427 Uncharacterized protein8.9e-6784.57Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKED
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE+
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKED

A0A1S3B5A4 uncharacterized protein LOC1034861681.5e-6685.09Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

A0A6J1KG15 uncharacterized protein LOC111495436 isoform X23.0e-6281.37Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSD  AQGVSTGGGIPTPNWDALADIDAVGGVTRADVVP IVNQLV EASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAP KRKKGVLG KGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

A0A6J1KI53 uncharacterized protein LOC111495436 isoform X33.0e-6281.37Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSD  AQGVSTGGGIPTPNWDALADIDAVGGVTRADVVP IVNQLV EASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAP KRKKGVLG KGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

A0A6J1KKN6 uncharacterized protein LOC111495436 isoform X13.0e-6281.37Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDRSVLRYVYYYLARILSD  AQGVSTGGGIPTPNWDALADIDAVGGVTRADVVP IVNQLV EASN
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
        PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAP KRKKGVLG KGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

SwissProt top hitse value%identityAlignment
O48917 UDP-sulfoquinovose synthase, chloroplastic5.4e-5382.61Show/hide
Query:  QRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQR
        +RVMVIGGDGYCGWATALHLSKK YEV IVDNLVRRLFDHQLGL+SLTPI+SIH+RI  WK++TGK+IEL++GDICDFEFL E+FKSF+PD+VVHFGEQR
Subjt:  QRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQR

Query:  SAPYSMIDRLRVVFT
        SAPYSMIDR R V+T
Subjt:  SAPYSMIDRLRVVFT

Q05026 UDP-glucose 4-epimerase2.9e-0628.3Show/hide
Query:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA
        V++ GG G+ G  TA+ L + GY+  I+DNL                 +++  R+R    ITG+ I  + GDI D + L + F   + ++V+HF   ++ 
Subjt:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA

Query:  PYSMID
          S+ +
Subjt:  PYSMID

Q4JBJ3 UDP-sulfoquinovose synthase2.6e-2344.74Show/hide
Query:  RVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRS
        R++V+G DG+ GW  AL L+K+G+EV  +DNL  R F  ++G DS  P+     R+   K   G  I  ++GDI ++ F  +  + +KPDA+VHF EQRS
Subjt:  RVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRS

Query:  APYSMIDRLRVVFT
        APYSMID    V+T
Subjt:  APYSMIDRLRVVFT

Q84KI6 UDP-sulfoquinovose synthase, chloroplastic1.4e-5383.62Show/hide
Query:  RQRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQ
        R+RVMVIGGDGYCGWATALHLSKK Y+V IVDNLVRRLFDHQLGLDSLTPI+SI NRIR W+ +TGKTI+L +GDICDFEFL ETFKSF+PD VVHFGEQ
Subjt:  RQRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQ

Query:  RSAPYSMIDRLRVVFT
        RSAPYSMIDR R V+T
Subjt:  RSAPYSMIDRLRVVFT

Q9HDU3 Bifunctional protein gal102.2e-0631.73Show/hide
Query:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA
        ++V GG GY G  T + L   GY+V IVDNL                 +S ++ +   + I  K+I+ F  D+ D E L + F +FK   V+HF   ++ 
Subjt:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA

Query:  PYSM
          SM
Subjt:  PYSM

Arabidopsis top hitse value%identityAlignment
AT2G07360.1 SH3 domain-containing protein1.6e-5568.94Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDR+VLRYVYYYLARILSD    G++ GGGIPTPNWDALADIDA GGVTRADVVPRIVNQL  EA+N
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
         + EFHARRLQALKALTY+PS +SE+LS+LYEIVF IL+KV D P KRKKGV GTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

AT2G07360.2 SH3 domain-containing protein1.6e-5568.94Show/hide
Query:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN
        QK  +    P   + ++    L  ++  LNQQCEDR+VLRYVYYYLARILSD    G++ GGGIPTPNWDALADIDA GGVTRADVVPRIVNQL  EA+N
Subjt:  QKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVYYYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASN

Query:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE
         + EFHARRLQALKALTY+PS +SE+LS+LYEIVF IL+KV D P KRKKGV GTKGGDKE
Subjt:  PDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVLGTKGGDKE

AT4G20460.1 NAD(P)-binding Rossmann-fold superfamily protein1.3e-0430.28Show/hide
Query:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA
        V+V GG GY G   AL L K  Y V IVDNL R        L  L P                  ++    D+ D + + + F     DAV+HF      
Subjt:  VMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSA

Query:  PYSMIDRLR
          S +D L+
Subjt:  PYSMIDRLR

AT4G33030.1 sulfoquinovosyldiacylglycerol 13.8e-5482.61Show/hide
Query:  QRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQR
        +RVMVIGGDGYCGWATALHLSKK YEV IVDNLVRRLFDHQLGL+SLTPI+SIH+RI  WK++TGK+IEL++GDICDFEFL E+FKSF+PD+VVHFGEQR
Subjt:  QRVMVIGGDGYCGWATALHLSKKGYEVAIVDNLVRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQR

Query:  SAPYSMIDRLRVVFT
        SAPYSMIDR R V+T
Subjt:  SAPYSMIDRLRVVFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATGACGGTTAATGACATGACAGTAAAGGAGGCTTTGTGGAGGAAAGTGATTGTTGCTAAGTATCATTGGTATGATCATGATAGGTGGCCCTCAAATGAACGGCT
CTCGCACATGTACGGGCCGAAGGAAAAAATTATGAAGAATATCCAATCATTTGAAGGTTGGTGCTGCATTAAACTTGGCAAAGGGAGGATGCAAAAGCACTTCCTCCAAC
AAGGACTGCCCATCTCTATTATGAGCTCCACTAACACCAGAAAACTTTGGATAACGATCCCTAAGTTAAATCAGCAGTGTGAAGATAGAAGTGTCCTCCGTTATGTGTAC
TATTATTTAGCCAGAATTTTATCAGATAATGGTGCACAAGGTGTAAGTACAGGTGGTGGGATCCCGACCCCTAATTGGGATGCTTTAGCTGATATTGATGCCGTTGGGGG
GGTGACTCGAGCTGATGTTGTACCAAGAATAGTCAATCAGCTTGTAAAAGAGGCCTCTAATCCTGATGTTGAATTTCATGCTAGAAGACTACAAGCACTAAAGGCTCTTA
CCTATGCTCCTTCAAGCAGCTCTGAGATTTTGTCCCAACTATATGAAATTGTTTTTTCAATTCTCGATAAGGTTGCTGATGCTCCTCAAAAACGCAAGAAAGGGGTACTT
GGGACTAAAGGTGGTGATAAGGAGGATGCTCGTAACAAAGTTGGTGGTATCTTAACACTATTGAACGACCCAATGCAAGTCACCGAGGTCATTGAAGATAATCACTCCTT
CACTCTCCAAATTGTTCTCATGGATGGCTTTTCCTTTTGGTTGACAGGGGTTTATACCAAGAAGCCCCTGCCACTCAATCCAGCTCTGCATCTCTTGATGCCTCAAGTGG
ATTGTCTAAGACAGCGAGTAATGGTCATTGGTGGAGATGGTTACTGTGGCTGGGCTACTGCTCTCCACCTCTCCAAGAAAGGTTATGAGGTTGCCATTGTTGATAACCTT
GTCCGCCGCCTCTTTGACCATCAGCTCGGTCTGGACTCATTGACTCCCATTTCCTCCATCCATAATCGCATTCGCTGCTGGAAATCTATAACTGGGAAGACTATTGAACT
CTTCATGGGTGATATTTGTGACTTTGAGTTCTTAACAGAGACCTTCAAATCATTTAAACCTGATGCTGTCGTCCATTTCGGTGAGCAACGCTCTGCTCCATATTCCATGA
TTGATCGATTGAGAGTTGTATTTACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAATGACGGTTAATGACATGACAGTAAAGGAGGCTTTGTGGAGGAAAGTGATTGTTGCTAAGTATCATTGGTATGATCATGATAGGTGGCCCTCAAATGAACGGCT
CTCGCACATGTACGGGCCGAAGGAAAAAATTATGAAGAATATCCAATCATTTGAAGGTTGGTGCTGCATTAAACTTGGCAAAGGGAGGATGCAAAAGCACTTCCTCCAAC
AAGGACTGCCCATCTCTATTATGAGCTCCACTAACACCAGAAAACTTTGGATAACGATCCCTAAGTTAAATCAGCAGTGTGAAGATAGAAGTGTCCTCCGTTATGTGTAC
TATTATTTAGCCAGAATTTTATCAGATAATGGTGCACAAGGTGTAAGTACAGGTGGTGGGATCCCGACCCCTAATTGGGATGCTTTAGCTGATATTGATGCCGTTGGGGG
GGTGACTCGAGCTGATGTTGTACCAAGAATAGTCAATCAGCTTGTAAAAGAGGCCTCTAATCCTGATGTTGAATTTCATGCTAGAAGACTACAAGCACTAAAGGCTCTTA
CCTATGCTCCTTCAAGCAGCTCTGAGATTTTGTCCCAACTATATGAAATTGTTTTTTCAATTCTCGATAAGGTTGCTGATGCTCCTCAAAAACGCAAGAAAGGGGTACTT
GGGACTAAAGGTGGTGATAAGGAGGATGCTCGTAACAAAGTTGGTGGTATCTTAACACTATTGAACGACCCAATGCAAGTCACCGAGGTCATTGAAGATAATCACTCCTT
CACTCTCCAAATTGTTCTCATGGATGGCTTTTCCTTTTGGTTGACAGGGGTTTATACCAAGAAGCCCCTGCCACTCAATCCAGCTCTGCATCTCTTGATGCCTCAAGTGG
ATTGTCTAAGACAGCGAGTAATGGTCATTGGTGGAGATGGTTACTGTGGCTGGGCTACTGCTCTCCACCTCTCCAAGAAAGGTTATGAGGTTGCCATTGTTGATAACCTT
GTCCGCCGCCTCTTTGACCATCAGCTCGGTCTGGACTCATTGACTCCCATTTCCTCCATCCATAATCGCATTCGCTGCTGGAAATCTATAACTGGGAAGACTATTGAACT
CTTCATGGGTGATATTTGTGACTTTGAGTTCTTAACAGAGACCTTCAAATCATTTAAACCTGATGCTGTCGTCCATTTCGGTGAGCAACGCTCTGCTCCATATTCCATGA
TTGATCGATTGAGAGTTGTATTTACTTAG
Protein sequenceShow/hide protein sequence
MGMTVNDMTVKEALWRKVIVAKYHWYDHDRWPSNERLSHMYGPKEKIMKNIQSFEGWCCIKLGKGRMQKHFLQQGLPISIMSSTNTRKLWITIPKLNQQCEDRSVLRYVY
YYLARILSDNGAQGVSTGGGIPTPNWDALADIDAVGGVTRADVVPRIVNQLVKEASNPDVEFHARRLQALKALTYAPSSSSEILSQLYEIVFSILDKVADAPQKRKKGVL
GTKGGDKEDARNKVGGILTLLNDPMQVTEVIEDNHSFTLQIVLMDGFSFWLTGVYTKKPLPLNPALHLLMPQVDCLRQRVMVIGGDGYCGWATALHLSKKGYEVAIVDNL
VRRLFDHQLGLDSLTPISSIHNRIRCWKSITGKTIELFMGDICDFEFLTETFKSFKPDAVVHFGEQRSAPYSMIDRLRVVFT