; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036040 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036040
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionS-adenosylmethionine-dependent methyltransferase, putative
Genome locationCicolChr02:31892378..31899237
RNA-Seq ExpressionCcUC02G036040
SyntenyCcUC02G036040
Gene Ontology termsNA
InterPro domainsIPR010719 - Putative rRNA methylase
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]5.4e-12081.75Show/hide
Query:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP-----LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATE
        MLSLKFG K VAV AA  PV+GRQRHLRNLCC SNRI SNG+SSE Q DF+SP       S+DFSSLE               LEDVMVGY FGKKRATE
Subjt:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP-----LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATE

Query:  VAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLG
        VAHS+WKRV+R GDTVVDATCGNGYDTLAM+KMVADE+GS  VYAMDVQKEALESTS LLDESLSEKE+KLVKLSSICHSRMEDVIPE SPVRLVAFNLG
Subjt:  VAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLG

Query:  YLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        YLPGGNKAITTKSETTLQAL+AA RILKPGGLISLVVYVGH GGLEELETI+KFSG+L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  YLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_004139901.1 uncharacterized protein LOC101214958 isoform X1 [Cucumis sativus]3.4e-12283.87Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFGSK VAVA    V+GRQRHLR+LCC SNRIQSNGLSS+ QIDFNSPL S+   SLE               LEDVMVGYFFGKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K +V+KGDTVVDATCGNGYDTLAMVKMVADESGS  VYAMDVQ EALESTS LLDESLSEKEKKLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETT QALKAA+RILKPGGLISLVVYVGH GG+EELETIEKFS DLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_011656977.1 uncharacterized protein LOC101214958 isoform X2 [Cucumis sativus]3.2e-12083.51Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFGSK VAVA    V+GRQRHLR+LCC SNRIQSNGLSS+ QIDFNSPL S+   SLE               LEDVMVGYFFGKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K +V+KGDTVVDATCGNGYDTLAMVKMVADESGS  VYAMDVQ EALESTS LLDESLSEKE KLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETT QALKAA+RILKPGGLISLVVYVGH GG+EELETIEKFS DLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]8.7e-12685.3Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFGSK VAVAAF PV+G QRH+RN+C  +NRIQSNGLSSE QIDFNSPL  +DFSSLE               LEDVMVGYFFGKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K VVRKGDTVVDATCGNGYDT AMVKMVADESGS  VYAMDVQKEALE+TS  LDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETTL+ALKAA+RILKPGGLISLVVYVGH GGLEELETI+KFS DLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902809.1 putative rRNA methylase YtqB isoform X2 [Benincasa hispida]8.1e-12484.95Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFGSK VAVAAF PV+G QRH+RN+C  +NRIQSNGLSSE QIDFNSPL  +DFSSLE               LEDVMVGYFFGKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K VVRKGDTVVDATCGNGYDT AMVKMVADESGS  VYAMDVQKEALE+TS  LDESLSEKE KLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETTL+ALKAA+RILKPGGLISLVVYVGH GGLEELETI+KFS DLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

TrEMBL top hitse value%identityAlignment
A0A0A0KEF8 Uncharacterized protein1.6e-12283.87Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFGSK VAVA    V+GRQRHLR+LCC SNRIQSNGLSS+ QIDFNSPL S+   SLE               LEDVMVGYFFGKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K +V+KGDTVVDATCGNGYDTLAMVKMVADESGS  VYAMDVQ EALESTS LLDESLSEKEKKLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETT QALKAA+RILKPGGLISLVVYVGH GG+EELETIEKFS DLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X22.9e-11178.14Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFG K VAV    PV+ RQRH RNL   SN IQSNGLS E Q +F+SP  S++FSSLE               LEDVMVGY  GKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K ++R+GDTVVDAT GNGYDTLAMVKMVADESGSG VYAMDVQKEAL  TS LL+ESL E+E KLVKLSSICHSRMEDVIPE SPVRLVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETTLQAL+AANRILKPGGLISLVVYVGH GG+EELETI+KF+ +LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X13.1e-11378.49Show/hide
Query:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW
        MLSLKFG K VAV    PV+ RQRH RNL   SN IQSNGLS E Q +F+SP  S++FSSLE               LEDVMVGY  GKKRATEVAHS+W
Subjt:  MLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIW

Query:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN
        K ++R+GDTVVDAT GNGYDTLAMVKMVADESGSG VYAMDVQKEAL  TS LL+ESL E+EKKLVKLSSICHSRMEDVIPE SPVRLVAFNLGYLPGGN
Subjt:  KRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGN

Query:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAITTKSETTLQAL+AANRILKPGGLISLVVYVGH GG+EELETI+KF+ +LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC1114482488.5e-11980.99Show/hide
Query:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP----LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEV
        MLSLKFG K VAV AA  PV+GRQRHLRN CC SNRI SNG+SSE Q DF+SP      S+DFSSLE               LEDVMVGY FGKKRATEV
Subjt:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP----LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEV

Query:  AHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGY
        AHS+WKRV+R GDTVVDATCGNGYDTLAM+KMVADE+GS  VYAMDVQKEALES S LLDESL EKE+KLVKLSSICHSRMEDVIPE SPVRLVAFNLGY
Subjt:  AHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGY

Query:  LPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        LPGGNKAITTKSETTLQAL+AA RILKPGGLISLVVYVGH GGLEELETI+KFSG+L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  LPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC1114839291.7e-11982.21Show/hide
Query:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP-LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHS
        MLSLKFG K VAV AA  PV+GRQRHLRNLCC SNRI SNG+SSE Q  F+SP   S+DFSSLE               LEDVMVGY FGKKRATEVAHS
Subjt:  MLSLKFGSKLVAV-AAFPPVLGRQRHLRNLCCCSNRIQSNGLSSECQIDFNSP-LPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHS

Query:  IWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPG
        +WKRV+R GDTVVDATCGNGYDTLAM+KMVADE+GS  VYAMDVQKEALESTS LLDESLSEKE+KLVKLSSICHSRMEDVIPE SPVRLVAFNLGYLPG
Subjt:  IWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPG

Query:  GNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR
        GNKAITTKSETTLQAL+AA RILKPGGLISLVVYVGH GGLEEL+TI+KFSG+L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  GNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVFLFKR

SwissProt top hitse value%identityAlignment
O34614 Putative rRNA methylase YtqB9.3e-2236.97Show/hide
Query:  KRATEVAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLV
        K+    +  + K    +GD VVDAT GNG+DT  + ++V +   +GHVYA D+Q+ A+ +T     E L +  +    L    H ++ + +P ++  ++ 
Subjt:  KRATEVAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLV

Query:  A--FNLGYLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDL
        A  FNLGYLPGG+K+ITT   +T++A++    I+K  GLI LVVY GH  G  E   + +F  DL
Subjt:  A--FNLGYLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDL

Arabidopsis top hitse value%identityAlignment
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein5.0e-7968.14Show/hide
Query:  LEDVMVGYFFGKKRATEVAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADES--GSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSR
        LEDV VGY FG+K+ATEVAH +W++V++KGDTV+DATCGNG DTLAM+KMV  +S    G+VYAMD+QK+A+ESTS LLD+++  KEK+ VKL ++CHS+
Subjt:  LEDVMVGYFFGKKRATEVAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADES--GSGHVYAMDVQKEALESTSVLLDESLSEKEKKLVKLSSICHSR

Query:  MEDVIPEDSPVRLVAFNLGYLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVF
        M +++PE++ VR+VAFNLGYLPGGNK+I T S+TTL ALKAA RILKPGGLISLVVY+GH GG EELE +E F   L V +WICCK QMLNRPLAPV VF
Subjt:  MEDVIPEDSPVRLVAFNLGYLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCKLQMLNRPLAPVPVF

Query:  LFKR
        +FKR
Subjt:  LFKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGTCTCGACGCCACGTGTCGGTTCATGCACAGAGAAAATGACAATGCCAACAAAACAGCTACAAAACGTTGTGATTGTGGATATCGGAAATCGTGATGTTTGCTG
GTGTTGGAGCCGACGCAATAAACCTCTTTTCAGTTTGGGGTACGTGTTGAAATCTGATTGGAATCTCCGGGTAAGGACATCCATAGAGTTGGGAGAAATGTTGTCTTTGA
AATTTGGGTCAAAATTAGTGGCGGTTGCTGCCTTCCCACCAGTCTTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGCTGTTCTAACCGTATTCAGTCGAATGGTTTA
TCTTCTGAATGTCAGATTGATTTCAATTCACCATTGCCGTCGAGAGATTTTTCGTCTTTGGAAGATAATTTGGTGTATGCTAATTGCAAGTTTGATGGGGGGTTTAGACT
CGAGGATGTCATGGTCGGCTATTTTTTTGGGAAGAAGAGAGCTACAGAAGTTGCTCACTCTATATGGAAACGTGTCGTCAGAAAAGGGGATACAGTGGTAGATGCTACTT
GTGGAAATGGGTATGATACGCTGGCTATGGTCAAAATGGTTGCAGATGAATCTGGTTCTGGGCATGTTTATGCAATGGACGTTCAGAAAGAGGCTTTAGAAAGTACTTCT
GTGTTGCTGGACGAGTCACTCAGTGAAAAAGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGATTCTCCCGTTAGGCT
TGTTGCATTTAACCTAGGCTACCTACCTGGTGGTAACAAAGCAATTACTACAAAGTCAGAAACAACATTACAAGCACTTAAAGCTGCCAATAGAATTCTGAAACCTGGAG
GGCTTATCAGCCTTGTGGTTTATGTGGGGCATACTGGTGGACTGGAAGAATTGGAGACTATTGAAAAATTTTCTGGTGATCTGGCTGTTGAGAATTGGATTTGTTGTAAG
CTTCAGATGTTAAACCGGCCATTAGCTCCAGTGCCTGTGTTTTTATTCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGTCTCGACGCCACGTGTCGGTTCATGCACAGAGAAAATGACAATGCCAACAAAACAGCTACAAAACGTTGTGATTGTGGATATCGGAAATCGTGATGTTTGCTG
GTGTTGGAGCCGACGCAATAAACCTCTTTTCAGTTTGGGGTACGTGTTGAAATCTGATTGGAATCTCCGGGTAAGGACATCCATAGAGTTGGGAGAAATGTTGTCTTTGA
AATTTGGGTCAAAATTAGTGGCGGTTGCTGCCTTCCCACCAGTCTTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGCTGTTCTAACCGTATTCAGTCGAATGGTTTA
TCTTCTGAATGTCAGATTGATTTCAATTCACCATTGCCGTCGAGAGATTTTTCGTCTTTGGAAGATAATTTGGTGTATGCTAATTGCAAGTTTGATGGGGGGTTTAGACT
CGAGGATGTCATGGTCGGCTATTTTTTTGGGAAGAAGAGAGCTACAGAAGTTGCTCACTCTATATGGAAACGTGTCGTCAGAAAAGGGGATACAGTGGTAGATGCTACTT
GTGGAAATGGGTATGATACGCTGGCTATGGTCAAAATGGTTGCAGATGAATCTGGTTCTGGGCATGTTTATGCAATGGACGTTCAGAAAGAGGCTTTAGAAAGTACTTCT
GTGTTGCTGGACGAGTCACTCAGTGAAAAAGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGATTCTCCCGTTAGGCT
TGTTGCATTTAACCTAGGCTACCTACCTGGTGGTAACAAAGCAATTACTACAAAGTCAGAAACAACATTACAAGCACTTAAAGCTGCCAATAGAATTCTGAAACCTGGAG
GGCTTATCAGCCTTGTGGTTTATGTGGGGCATACTGGTGGACTGGAAGAATTGGAGACTATTGAAAAATTTTCTGGTGATCTGGCTGTTGAGAATTGGATTTGTTGTAAG
CTTCAGATGTTAAACCGGCCATTAGCTCCAGTGCCTGTGTTTTTATTCAAGAGATGAAAGTGTTGAGAGCAAGTATCCAAAAGTTCTGCCAGATTCGAGTAGTTTGAAGT
TTTTGTTCTATCGGATTGGTCGTTGCATCCTACTGAGACTATCGTTGGAGTTTGCTTGCAGCTTGTGCAGAGCAAATAATTGTCTTAAGGACAACGCTTTGAAGCGTGTC
TCAACTATCGTTAGAATCTCCACAAGTTTTTGAGGTTTTTCTGGGTATTTCAATTTATAAGCCTTGTCATACAATCTAGCTTATTTTTGTGTTTGAACATTATTATTGTA
TACAGTATGTATTGTAATTACTATATCCAGTTGAGTGGTAAACAATTTGCTAGCGTTATTCAGTTTCTATATATAGTTTCCCCCAACATAACCAAATAAGAA
Protein sequenceShow/hide protein sequence
MLVSTPRVGSCTEKMTMPTKQLQNVVIVDIGNRDVCWCWSRRNKPLFSLGYVLKSDWNLRVRTSIELGEMLSLKFGSKLVAVAAFPPVLGRQRHLRNLCCCSNRIQSNGL
SSECQIDFNSPLPSRDFSSLEDNLVYANCKFDGGFRLEDVMVGYFFGKKRATEVAHSIWKRVVRKGDTVVDATCGNGYDTLAMVKMVADESGSGHVYAMDVQKEALESTS
VLLDESLSEKEKKLVKLSSICHSRMEDVIPEDSPVRLVAFNLGYLPGGNKAITTKSETTLQALKAANRILKPGGLISLVVYVGHTGGLEELETIEKFSGDLAVENWICCK
LQMLNRPLAPVPVFLFKR