; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G18190 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G18190
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionS-adenosyl-L-methionine-dependent methyltransferases superfamily protein
Genome locationctg3345:2323772..2335530
RNA-Seq ExpressionCucsat.G18190
SyntenyCucsat.G18190
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]4.11e-15784.87Show/hide
Query:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLS-----LEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGD
        MLSLKFG KWVAVA  SE VVGRQRHLR+LCC SNRI SNG+SS+YQ DF+SP SS  S S     LEGLEDVMVGY FGKKRATEVAHSVWK +++ GD
Subjt:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLS-----LEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGD

Query:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSE
        TVVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALESTSALLDESLSEKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSE
Subjt:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSE

Query:  TTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        TT QAL+AA+RILKPGGLISLVVYVGHPGG+EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_004139901.1 uncharacterized protein LOC101214958 isoform X1 [Cucumis sativus]1.09e-187100Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_011656977.1 uncharacterized protein LOC101214958 isoform X2 [Cucumis sativus]4.10e-18599.62Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEK LVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]2.28e-16489.06Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFGSKWVAVA  + VVG QRH+R++C  +NRIQSNGLSS+YQIDFNSPLS K   SLEGLEDVMVGYFFGKKRATEVAHSVWK +V+KGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMVADESGSARVYAMDVQ EALE+TSA LDESLSEKEKKLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG+EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902809.1 putative rRNA methylase YtqB isoform X2 [Benincasa hispida]8.57e-16288.68Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFGSKWVAVA  + VVG QRH+R++C  +NRIQSNGLSS+YQIDFNSPLS K   SLEGLEDVMVGYFFGKKRATEVAHSVWK +V+KGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMVADESGSARVYAMDVQ EALE+TSA LDESLSEKEK LVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG+EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

TrEMBL top hitse value%identityAlignment
A0A0A0KEF8 Uncharacterized protein5.28e-188100Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X27.89e-14481.89Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFG K VAV T + VV RQRH R+L   SN IQSNGLS +YQ +F+SP SSK   SLEGLEDVMVGY  GKKRATEVAHSVWK I+++GDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQ EAL  TSALL+ESL E+EK LVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG+EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X12.11e-14682.26Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT
        MLSLKFG K VAV T + VV RQRH R+L   SN IQSNGLS +YQ +F+SP SSK   SLEGLEDVMVGY  GKKRATEVAHSVWK I+++GDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQ EAL  TSALL+ESL E+EKKLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG+EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC1114482481.57e-15684.44Show/hide
Query:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSL----SLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDT
        MLSLKFG KWVAVA  SE VVGRQRHLR+ CCFSNRI SNG+SS+YQ DF+SP SS  S     SLEGLEDVMVGY FGKKRATEVAHSVWK +++ GDT
Subjt:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSL----SLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDT

Query:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALES SALLDESL EKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSET

Query:  TFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        T QAL+AA+RILKPGGLISLVVYVGHPGG+EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC1114839292.44e-15785.77Show/hide
Query:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSS-KHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVD
        MLSLKFG KWVAVA  SE VVGRQRHLR+LCCFSNRI SNG+SS+YQ  F+SP SS K   SLEGLEDVMVGY FGKKRATEVAHSVWK +++ GDTVVD
Subjt:  MLSLKFGSKWVAVAT-SESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSS-KHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVD

Query:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQ
        ATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALESTSALLDESLSEKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT Q
Subjt:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQ

Query:  ALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        AL+AA+RILKPGGLISLVVYVGHPGG+EEL+TI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  ALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

SwissProt top hitse value%identityAlignment
O34614 Putative rRNA methylase YtqB8.6e-2035.76Show/hide
Query:  KRATEVAHSVWKCIVKKGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDS--PVS
        K+    +  + K    +GD VVDAT GNG+DT  + ++V +   +  VYA D+Q  A+ +T     E L +  +    L    H ++ + +  ++   V+
Subjt:  KRATEVAHSVWKCIVKKGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDS--PVS

Query:  LVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDL
           FNLGYLPGG+K+ITT   +T +A++    I+K  GLI LVVY GHP G  E   + +F  DL
Subjt:  LVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDL

Arabidopsis top hitse value%identityAlignment
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.9e-7863.93Show/hide
Query:  FNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQNEALESTSALLDESLSE
        F+S  S   +  + GLEDV VGY FG+K+ATEVAH VW+ +++KGDTV+DATCGNG DTLAM+KMV  +S      VYAMD+Q +A+ESTS+LLD+++  
Subjt:  FNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQNEALESTSALLDESLSE

Query:  KEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICC
        KEK+ VKL ++CHS+M +++ E++ V +VAFNLGYLPGGNK+I T S+TT  ALKAA RILKPGGLISLVVY+GHPGG EELE +E F S L V +WICC
Subjt:  KEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICC

Query:  KLQMLNRPLAPVPVFLFKR
        K QMLNRPLAPV VF+FKR
Subjt:  KLQMLNRPLAPVPVFLFKR

AT3G20020.1 protein arginine methyltransferase 61.1e-0427.05Show/hide
Query:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITT
        +G  VVD  CG G     ++ +   ++G+ RVYA+D  + A+++   +    LS+K         + H R+EDV +++    +++  +GY+         
Subjt:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITT

Query:  KSETTFQALKAAHRILKPGGLI
                + A  R LKPGGLI
Subjt:  KSETTFQALKAAHRILKPGGLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATCTTTGAAATTTGGGTCAAAATGGGTGGCGGTTGCTACCTCCGAATCAGTCGTAGGACGTCAAAGACACTTGAGAGATCTTTGCTGTTTTTCCAACCGTATTCA
GTCAAATGGATTATCTTCTCAATATCAGATTGATTTCAATTCACCATTGTCGTCGAAACATTCTTTGTCTTTGGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GGAAGAAGAGGGCGACAGAAGTTGCTCACTCTGTTTGGAAATGTATTGTCAAAAAAGGGGATACAGTAGTAGATGCTACTTGTGGAAATGGTTATGATACTCTAGCTATG
GTCAAGATGGTTGCAGATGAATCTGGTTCTGCACGTGTTTATGCAATGGATGTACAAAATGAGGCTTTAGAAAGTACTTCTGCATTGCTGGATGAATCACTCAGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCTAGAGGATTCCCCTGTTAGTCTTGTTGCATTTAACCTAGGGTACCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTTCAAGCACTTAAAGCTGCACACAGGATTCTGAAACCTGGAGGGCTTATCAGCCTAGTGGTTTATGTGGGG
CATCCTGGTGGAATGGAAGAATTGGAGACTATCGAAAAATTTTCTAGCGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTACAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTATTCTTATTCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTATCTTTGAAATTTGGGTCAAAATGGGTGGCGGTTGCTACCTCCGAATCAGTCGTAGGACGTCAAAGACACTTGAGAGATCTTTGCTGTTTTTCCAACCGTATTCA
GTCAAATGGATTATCTTCTCAATATCAGATTGATTTCAATTCACCATTGTCGTCGAAACATTCTTTGTCTTTGGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GGAAGAAGAGGGCGACAGAAGTTGCTCACTCTGTTTGGAAATGTATTGTCAAAAAAGGGGATACAGTAGTAGATGCTACTTGTGGAAATGGTTATGATACTCTAGCTATG
GTCAAGATGGTTGCAGATGAATCTGGTTCTGCACGTGTTTATGCAATGGATGTACAAAATGAGGCTTTAGAAAGTACTTCTGCATTGCTGGATGAATCACTCAGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCTAGAGGATTCCCCTGTTAGTCTTGTTGCATTTAACCTAGGGTACCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTTCAAGCACTTAAAGCTGCACACAGGATTCTGAAACCTGGAGGGCTTATCAGCCTAGTGGTTTATGTGGGG
CATCCTGGTGGAATGGAAGAATTGGAGACTATCGAAAAATTTTCTAGCGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTACAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTATTCTTATTCAAGAGATGA
Protein sequenceShow/hide protein sequence
MLSLKFGSKWVAVATSESVVGRQRHLRDLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLSLEGLEDVMVGYFFGKKRATEVAHSVWKCIVKKGDTVVDATCGNGYDTLAM
VKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVG
HPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR