CuGenDBv2

CmoCh20G003030 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G003030
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: S-adenosylmethionine-dependent methyltransferase, putative
Genome location: Cmo_Chr20:1478316..1481422
RNA-Seq Expression: CmoCh20G003030
Synteny: CmoCh20G003030
Gene Ontology terms: NA
InterPro domains: IPR010719 - Putative rRNA methylase
                  IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
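
The genome location string can be split into its parts for downstream use. The following is a minimal sketch, not part of the database record: the names `Region` and `parse_region` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' location string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Region(match.group(1), int(match.group(2)), int(match.group(3)))


region = parse_region("Cmo_Chr20:1478316..1481422")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene spans 3107 bp, longer than the 1027-base mRNA sequence listed at the end of this record.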


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.0e-142 | identity: 97.79%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSP-SSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGD
        MLSLKFGYKWVAVAAASEPVVGRQRHLRN CC SNRI+SNGISSEYQSDFSSP SSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGD
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSP-SSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGD

Query:  TVVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE
        TVVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALES SALLDESL EKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE
Subjt:  TVVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE

Query:  TTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

XP_022943494.1 uncharacterized protein LOC111448248 [Cucurbita moschata] | e-value: 2.4e-147 | identity: 100%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

XP_022986072.1 uncharacterized protein LOC111483929 [Cucurbita maxima] | e-value: 9.7e-141 | identity: 96.67%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFGYKWVAVAAASEPVVGRQRHLRN CCFSNRIHSNGISSEYQ+ FSSP   SSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALES SALLDESL EKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAAYRILKPGGLISLVVYVGHPGGLEEL+TIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

XP_023511899.1 uncharacterized protein LOC111776775 [Cucurbita pepo subsp. pepo] | e-value: 2.3e-142 | identity: 97.04%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFGYKWVAVAAASEPVVGRQRHLRN CCFSNRIHSNGISSEYQSDFSSP   SSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALES SALLDESLC+KERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAAYR+LKPGGLISLVVYVGHPGGLEELETIQKF GELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida] | e-value: 4.5e-122 | identity: 85.56%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFG KWVAV AA +PVVG QRH+RN C  +NRI SNG+SSEYQ DF+SP     S KDFSSLEGLEDVMVGY FGKKRATEVAHSVWK V+R GDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDT AM+KMVADE+GSARVYAMDVQKEALE+ SA LDESL EKE+KLVKLSSICHSRMEDVIPE SPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TL+AL+AA+RILKPGGLISLVVYVGHPGGLEELETIQKFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KEF8 Uncharacterized protein | e-value: 9.2e-121 | identity: 84.44%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFG KWVAV A SE VVGRQRHLR+ CCFSNRI SNG+SS+YQ DF+SP SS  S     SLEGLEDVMVGY FGKKRATEVAHSVWK +++ GDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALES SALLDESL EKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        T QAL+AA+RILKPGGLISLVVYVGHPGG+EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X2 | e-value: 1.4e-113 | identity: 82.59%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFG K VAV    +PVV RQRH RN    SN I SNG+S EYQ++FSSP    SSSK+FSSLEGLEDVMVGY+ GKKRATEVAHSVWK +IR GDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDAT GNGYDTLAM+KMVADE+GS  VYAMDVQKEAL   SALL+ESL E+E KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAA RILKPGGLISLVVYVGHPGG+EELETIQKF+  L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X1 | e-value: 3.4e-115 | identity: 82.59%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFG K VAV    +PVV RQRH RN    SN I SNG+S EYQ++FSSP    SSSK+FSSLEGLEDVMVGY+ GKKRATEVAHSVWK +IR GDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDAT GNGYDTLAM+KMVADE+GS  VYAMDVQKEAL   SALL+ESL E+E+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAA RILKPGGLISLVVYVGHPGG+EELETIQKF+  L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC111448248 | e-value: 1.1e-147 | identity: 100%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC111483929 | e-value: 4.7e-141 | identity: 96.67%
Query:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
        MLSLKFGYKWVAVAAASEPVVGRQRHLRN CCFSNRIHSNGISSEYQ+ FSSP   SSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT
Subjt:  MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDT

Query:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALES SALLDESL EKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
        TLQALEAAYRILKPGGLISLVVYVGHPGGLEEL+TIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR

SwissProt top hits (e-value, % identity, alignment)
O34614 Putative rRNA methylase YtqB | e-value: 3.0e-20 | identity: 37.58%
Query:  KRATEVAHSVWKRVIRTGDTVVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLV
        K+    +  + K     GD VVDAT GNG+DT  + ++V +   +  VYA D+Q    ESA A   E L +  +    L    H ++ + +P  +  ++ 
Subjt:  KRATEVAHSVWKRVIRTGDTVVDATCGNGYDTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLV

Query:  A--FNLGYLPGGNKAITTKSETTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGEL
        A  FNLGYLPGG+K+ITT   +T++A+E    I+K  GLI LVVY GHP G  E   + +F  +L
Subjt:  A--FNLGYLPGGNKAITTKSETTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGEL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein | e-value: 2.5e-78 | identity: 66.21%
Query:  SSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDTVVDATCGNGYDTLAMLKMVA-DEAG-SARVYAMDVQKEALESASALLDESLCE
        SS+ S S++F  + GLEDV VGYLFG+K+ATEVAH VW++VI+ GDTV+DATCGNG DTLAMLKMV  D  G    VYAMD+QK+A+ES S+LLD+++  
Subjt:  SSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDTVVDATCGNGYDTLAMLKMVA-DEAG-SARVYAMDVQKEALESASALLDESLCE

Query:  KERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICC
        KE++ VKL ++CHS+M +++PE + VR+VAFNLGYLPGGNK+I T S+TTL AL+AA RILKPGGLISLVVY+GHPGG EELE ++ F   L V +WICC
Subjt:  KERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAAYRILKPGGLISLVVYVGHPGGLEELETIQKFSGELGVENWICC

Query:  KLQMLNRPLAPVPVFLFKR
        K QMLNRPLAPV VF+FKR
Subjt:  KLQMLNRPLAPVPVFLFKR
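
The % identity values above can be related to the alignment blocks through the unlabeled middle line of each block. In BLAST-style output a letter on that line marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The following is a minimal sketch under that convention; the function name `identity_from_midline` is hypothetical, and the example uses only the final block of the Arabidopsis hit, so its result is the identity of that 19-column segment rather than the 66.21% reported for the whole hit.

```python
def identity_from_midline(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity).

    The alignment length is taken from the query line (gap characters
    included), because trailing blanks on the midline are often stripped.
    """
    length = len(query)
    # Pad or trim the midline so it covers exactly the aligned columns.
    midline = midline.ljust(length)[:length]
    # Letters are identities; '+' and blanks are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, length, 100.0 * identical / length


# Last block of the AT1G16445.1 alignment shown above.
query = "KLQMLNRPLAPVPVFLFKR"
midline = "K QMLNRPLAPV VF+FKR"
identical, length, pct = identity_from_midline(query, midline)
print(f"{identical}/{length} = {pct:.2f}%")  # prints "16/19 = 84.21%"
```

To reproduce a whole-hit figure, the query and middle lines of all blocks of that hit would be concatenated before calling the function.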


Sequences
CDS sequence
ATGTTATCTTTGAAATTTGGGTACAAATGGGTAGCGGTTGCTGCTGCCTCCGAACCAGTCGTAGGACGTCAAAGACATTTGAGAAATTTTTGCTGCTTTTCTAATCGTAT
TCACTCAAATGGGATATCGTCCGAGTATCAGAGTGATTTCTCTTCACCGTCGTCGTCGTCGTCGTCGTCGAAAGATTTTTCGTCTCTAGAAGGATTGGAGGATGTCATGG
TTGGCTACCTTTTTGGGAAGAAGAGAGCGACAGAAGTTGCTCACTCTGTATGGAAACGTGTCATCAGAACAGGGGATACAGTGGTAGATGCTACTTGTGGAAATGGGTAT
GATACTCTAGCTATGCTCAAAATGGTTGCTGATGAAGCTGGTTCTGCGCGTGTTTATGCAATGGACGTTCAGAAAGAGGCTTTAGAAAGTGCTTCTGCTTTACTGGACGA
ATCACTCTGTGAAAAAGAGAGGAAACTTGTTAAACTCTCCTCCATTTGCCACAGCAGAATGGAAGATGTCATTCCAGAGGGTTCTCCCGTTAGGCTTGTTGCATTTAACC
TTGGCTACCTACCTGGTGGAAACAAAGCAATCACTACAAAGTCAGAAACAACATTACAAGCACTTGAAGCTGCCTATAGAATTCTGAAGCCTGGAGGGCTCATCAGCCTT
GTGGTTTATGTGGGGCATCCTGGTGGACTGGAAGAATTGGAGACTATCCAAAAATTTTCTGGCGAGCTGGGTGTCGAGAATTGGATTTGTTGTAAGCTTCAGATGTTGAA
CCGGCCACTAGCTCCAGTGCCTGTGTTCTTATTCAAGAGATGA
mRNA sequence
CTCATTGTAACCCGGCCCGGTCGTATTGTAATCGTCATTCCGTCCCTTCACCGACGACAGTACAGGCGGAATCGGCGGTGGAGGAAATCCAGAGAGTTGGGAGAGATGTT
ATCTTTGAAATTTGGGTACAAATGGGTAGCGGTTGCTGCTGCCTCCGAACCAGTCGTAGGACGTCAAAGACATTTGAGAAATTTTTGCTGCTTTTCTAATCGTATTCACT
CAAATGGGATATCGTCCGAGTATCAGAGTGATTTCTCTTCACCGTCGTCGTCGTCGTCGTCGTCGAAAGATTTTTCGTCTCTAGAAGGATTGGAGGATGTCATGGTTGGC
TACCTTTTTGGGAAGAAGAGAGCGACAGAAGTTGCTCACTCTGTATGGAAACGTGTCATCAGAACAGGGGATACAGTGGTAGATGCTACTTGTGGAAATGGGTATGATAC
TCTAGCTATGCTCAAAATGGTTGCTGATGAAGCTGGTTCTGCGCGTGTTTATGCAATGGACGTTCAGAAAGAGGCTTTAGAAAGTGCTTCTGCTTTACTGGACGAATCAC
TCTGTGAAAAAGAGAGGAAACTTGTTAAACTCTCCTCCATTTGCCACAGCAGAATGGAAGATGTCATTCCAGAGGGTTCTCCCGTTAGGCTTGTTGCATTTAACCTTGGC
TACCTACCTGGTGGAAACAAAGCAATCACTACAAAGTCAGAAACAACATTACAAGCACTTGAAGCTGCCTATAGAATTCTGAAGCCTGGAGGGCTCATCAGCCTTGTGGT
TTATGTGGGGCATCCTGGTGGACTGGAAGAATTGGAGACTATCCAAAAATTTTCTGGCGAGCTGGGTGTCGAGAATTGGATTTGTTGTAAGCTTCAGATGTTGAACCGGC
CACTAGCTCCAGTGCCTGTGTTCTTATTCAAGAGATGAAAGTGGTAAACGCTTCTTTTCTTTCCACTCCTACGATTCCTAGCTTCTTTGTTTGATTATGGTAGATGGACT
CGTAATAGAATGTTGGTTATTATCATAATTTCATGTCAATTCCGTAATGAAGTGAGCTTAGTTCTGCAGTTTCATGTCAATAAATAAATAATTAATGCATATATTTGTTT
GAGTAAGTAAAGTTTGGTGTAGATTTAAAGGCAAAGCTACTGACTGG
Protein sequence
MLSLKFGYKWVAVAAASEPVVGRQRHLRNFCCFSNRIHSNGISSEYQSDFSSPSSSSSSSKDFSSLEGLEDVMVGYLFGKKRATEVAHSVWKRVIRTGDTVVDATCGNGY
DTLAMLKMVADEAGSARVYAMDVQKEALESASALLDESLCEKERKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAAYRILKPGGLISL
VVYVGHPGGLEELETIQKFSGELGVENWICCKLQMLNRPLAPVPVFLFKR