; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G11120 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G11120
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionS-adenosyl-L-methionine-dependent methyltransferases superfamily protein
Genome locationChr6:9722288..9727293
RNA-Seq ExpressionCSPI06G11120
SyntenyCSPI06G11120
Gene Ontology termsNA
InterPro domainsIPR010719 - Putative rRNA methylase
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]1.5e-12285.24Show/hide
Query:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHS-----LPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGD
        MLSLKFG KWVAV A SE VVGRQRHLRNLCC SNRI SNG+SS+YQ DF+SP SS  S       LEGLEDVMVGY FGKKRATEVAHSVWK V++ GD
Subjt:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHS-----LPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGD

Query:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSE
        TVVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALESTSALLDESLSEKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSE
Subjt:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSE

Query:  TTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        TT QAL+AA+RILKPGGLISLVVYVGHPGG+EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_004139901.1 uncharacterized protein LOC101214958 isoform X1 [Cucumis sativus]2.4e-14498.87Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLR+LCCFSNRIQSNGLSSQYQIDFNSPLSSKHSL LEGLEDVMVGYFFGKKRATEVAHSVWKC+VKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_011656977.1 uncharacterized protein LOC101214958 isoform X2 [Cucumis sativus]1.7e-14298.49Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLR+LCCFSNRIQSNGLSSQYQIDFNSPLSSKHSL LEGLEDVMVGYFFGKKRATEVAHSVWKC+VKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKE KLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]5.4e-12889.43Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFGSKWVAVA  + VVG QRH+RN+C  +NRIQSNGLSS+YQIDFNSPLS K    LEGLEDVMVGYFFGKKRATEVAHSVWK VV+KGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMVADESGSARVYAMDVQ EALE+TSA LDESLSEKEKKLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG+EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902809.1 putative rRNA methylase YtqB isoform X2 [Benincasa hispida]3.9e-12689.06Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFGSKWVAVA  + VVG QRH+RN+C  +NRIQSNGLSS+YQIDFNSPLS K    LEGLEDVMVGYFFGKKRATEVAHSVWK VV+KGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMVADESGSARVYAMDVQ EALE+TSA LDESLSEKE KLVKLSSICHSRMEDVI EDSPV LVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG+EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

TrEMBL top hitse value%identityAlignment
A0A0A0KEF8 Uncharacterized protein1.2e-14498.87Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFGSKWVAVATSESVVGRQRHLR+LCCFSNRIQSNGLSSQYQIDFNSPLSSKHSL LEGLEDVMVGYFFGKKRATEVAHSVWKC+VKKGDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X22.6e-11281.51Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFG K VAV T + VV RQRH RNL   SN IQSNGLS +YQ +F+SP SSK    LEGLEDVMVGY  GKKRATEVAHSVWK ++++GDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQ EAL  TSALL+ESL E+E KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG+EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X13.7e-11481.89Show/hide
Query:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT
        MLSLKFG K VAV T + VV RQRH RNL   SN IQSNGLS +YQ +F+SP SSK    LEGLEDVMVGY  GKKRATEVAHSVWK ++++GDTVVDAT
Subjt:  MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQ EAL  TSALL+ESL E+EKKLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG+EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC1114482483.7e-12284.81Show/hide
Query:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHS----LPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDT
        MLSLKFG KWVAV A SE VVGRQRHLRN CCFSNRI SNG+SS+YQ DF+SP SS  S      LEGLEDVMVGY FGKKRATEVAHSVWK V++ GDT
Subjt:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHS----LPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDT

Query:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALES SALLDESL EKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSET

Query:  TFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        T QAL+AA+RILKPGGLISLVVYVGHPGG+EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC1114839299.6e-12386.14Show/hide
Query:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSP-LSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVD
        MLSLKFG KWVAV A SE VVGRQRHLRNLCCFSNRI SNG+SS+YQ  F+SP  SSK    LEGLEDVMVGY FGKKRATEVAHSVWK V++ GDTVVD
Subjt:  MLSLKFGSKWVAV-ATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSP-LSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVD

Query:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQ
        ATCGNGYDTLAM+KMVADE+GSARVYAMDVQ EALESTSALLDESLSEKE+KLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT Q
Subjt:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQ

Query:  ALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        AL+AA+RILKPGGLISLVVYVGHPGG+EEL+TI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  ALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

SwissProt top hitse value%identityAlignment
O34614 Putative rRNA methylase YtqB8.7e-2035.76Show/hide
Query:  KRATEVAHSVWKCVVKKGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDS--PVS
        K+    +  + K    +GD VVDAT GNG+DT  + ++V +   +  VYA D+Q  A+ +T     E L +  +    L    H ++ + +  ++   V+
Subjt:  KRATEVAHSVWKCVVKKGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDS--PVS

Query:  LVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDL
           FNLGYLPGG+K+ITT   +T +A++    I+K  GLI LVVY GHP G  E   + +F  DL
Subjt:  LVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDL

Arabidopsis top hitse value%identityAlignment
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.3e-7964.84Show/hide
Query:  FNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQNEALESTSALLDESLSE
        F+S  S   + P+ GLEDV VGY FG+K+ATEVAH VW+ V++KGDTV+DATCGNG DTLAM+KMV  +S      VYAMD+Q +A+ESTS+LLD+++  
Subjt:  FNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQNEALESTSALLDESLSE

Query:  KEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICC
        KEK+ VKL ++CHS+M +++ E++ V +VAFNLGYLPGGNK+I T S+TT  ALKAA RILKPGGLISLVVY+GHPGG EELE +E F S L V +WICC
Subjt:  KEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGMEELETIEKFSSDLAVENWICC

Query:  KLQMLNRPLAPVPVFLFKR
        K QMLNRPLAPV VF+FKR
Subjt:  KLQMLNRPLAPVPVFLFKR

AT3G20020.1 protein arginine methyltransferase 61.1e-0427.05Show/hide
Query:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITT
        +G  VVD  CG G     ++ +   ++G+ RVYA+D  + A+++   +    LS+K         + H R+EDV +++    +++  +GY+         
Subjt:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITT

Query:  KSETTFQALKAAHRILKPGGLI
                + A  R LKPGGLI
Subjt:  KSETTFQALKAAHRILKPGGLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATCTTTGAAATTTGGGTCAAAATGGGTGGCGGTTGCTACCTCCGAATCAGTCGTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGTTTTTCCAACCGTATTCA
GTCAAATGGATTATCTTCTCAATATCAGATTGATTTCAATTCACCATTGTCGTCGAAACATTCTTTGCCTTTGGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GGAAGAAGAGGGCGACAGAAGTTGCTCACTCTGTTTGGAAATGTGTTGTCAAAAAAGGGGATACAGTAGTAGATGCTACTTGTGGAAATGGTTATGATACTCTAGCTATG
GTCAAGATGGTTGCAGATGAATCTGGTTCTGCACGTGTTTATGCAATGGATGTACAAAATGAGGCTTTAGAAAGTACTTCTGCATTGCTGGACGAATCACTCAGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCTAGAGGATTCCCCTGTTAGTCTTGTTGCATTTAACCTAGGGTACCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTTCAAGCACTTAAAGCTGCACACAGGATTCTGAAACCTGGAGGGCTTATCAGCCTAGTGGTTTATGTGGGG
CATCCTGGTGGAATGGAAGAATTGGAGACTATCGAAAAATTTTCTAGCGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTACAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTATTCTTATTCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTATCTTTGAAATTTGGGTCAAAATGGGTGGCGGTTGCTACCTCCGAATCAGTCGTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGTTTTTCCAACCGTATTCA
GTCAAATGGATTATCTTCTCAATATCAGATTGATTTCAATTCACCATTGTCGTCGAAACATTCTTTGCCTTTGGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GGAAGAAGAGGGCGACAGAAGTTGCTCACTCTGTTTGGAAATGTGTTGTCAAAAAAGGGGATACAGTAGTAGATGCTACTTGTGGAAATGGTTATGATACTCTAGCTATG
GTCAAGATGGTTGCAGATGAATCTGGTTCTGCACGTGTTTATGCAATGGATGTACAAAATGAGGCTTTAGAAAGTACTTCTGCATTGCTGGACGAATCACTCAGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCTAGAGGATTCCCCTGTTAGTCTTGTTGCATTTAACCTAGGGTACCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTTCAAGCACTTAAAGCTGCACACAGGATTCTGAAACCTGGAGGGCTTATCAGCCTAGTGGTTTATGTGGGG
CATCCTGGTGGAATGGAAGAATTGGAGACTATCGAAAAATTTTCTAGCGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTACAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTATTCTTATTCAAGAGATGAAAGTGACAGTTTATGAAGTTGGGACAATGCTCATGCATAGTTCACTTGCTATTGCCTCTTTTGGTAGCCTTGGTGGCATTGG
ATTAATTTCATATCAATCTTGTCATGCAGCTTCATTACCACAAGTAAGTCACAGTAAAGTTATAATTTATTATAGCTCAATGTTACTTGACATGTATTTATTTTATCGAA
GTCGGGGGTTCAAATATCTAACGTGTCACTTGTACTGAAAAAATGGAGTACTATCAATTTCGTGC
Protein sequenceShow/hide protein sequence
MLSLKFGSKWVAVATSESVVGRQRHLRNLCCFSNRIQSNGLSSQYQIDFNSPLSSKHSLPLEGLEDVMVGYFFGKKRATEVAHSVWKCVVKKGDTVVDATCGNGYDTLAM
VKMVADESGSARVYAMDVQNEALESTSALLDESLSEKEKKLVKLSSICHSRMEDVILEDSPVSLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVG
HPGGMEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR