; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001633 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001633
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionS-adenosylmethionine-dependent methyltransferase, putative
Genome locationChr09:18890656..18895854
RNA-Seq ExpressionHG10001633
SyntenyHG10001633
Gene Ontology termsNA
InterPro domainsIPR010719 - Putative rRNA methylase
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]1.8e-12386.35Show/hide
Query:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP-----LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGD
        MLSLKFG K VAV AA EPVVGRQRHLRNLCC SNRI SNG+SSE Q DF+SP      SSKD SS EGLEDVMVGY+FGKKRATEVAHSVWKRV+R GD
Subjt:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP-----LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGD

Query:  TVVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSE
        TVVDATCGNGYDTLAM+KMV+DE+GS RVYAMDVQKEALESTSALLDESL EKE+KLVKLSSICHSRMEDVIPE S VRLVAFNLGYLPGGNKAITTKSE
Subjt:  TVVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSE

Query:  TTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        TT QAL+AA+RILKPGGLISLVVYVGHPGG EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_004139901.1 uncharacterized protein LOC101214958 isoform X1 [Cucumis sativus]2.6e-13091.7Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFGSK VAVA  E VVGRQRHLR+LCCFSNRIQSNGLSS+ QIDFNSPLSSK S S EGLEDVMVGY FGKKRATEVAHSVWK +V+KGDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMV+DESGS RVYAMDVQ EALESTSALLDESL EKEKKLVKLSSICHSRMEDVI EDS V LVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG EELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_011656977.1 uncharacterized protein LOC101214958 isoform X2 [Cucumis sativus]2.4e-12891.32Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFGSK VAVA  E VVGRQRHLR+LCCFSNRIQSNGLSS+ QIDFNSPLSSK S S EGLEDVMVGY FGKKRATEVAHSVWK +V+KGDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMV+DESGS RVYAMDVQ EALESTSALLDESL EKE KLVKLSSICHSRMEDVI EDS V LVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG EELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]3.7e-12990.57Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFGSK VAVAAF+PVVG QRH+RN+C  +NRIQSNGLSSE QIDFNSPLS KD SS EGLEDVMVGY FGKKRATEVAHSVWK VVRKGDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMV+DESGS RVYAMDVQKEALE+TSA LDESL EKEKKLVKLSSICHSRMEDVIPEDS VRLVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

XP_038902809.1 putative rRNA methylase YtqB isoform X2 [Benincasa hispida]3.5e-12790.19Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFGSK VAVAAF+PVVG QRH+RN+C  +NRIQSNGLSSE QIDFNSPLS KD SS EGLEDVMVGY FGKKRATEVAHSVWK VVRKGDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDT AMVKMV+DESGS RVYAMDVQKEALE+TSA LDESL EKE KLVKLSSICHSRMEDVIPEDS VRLVAFNLGYLPGGNKAITTKSETT +AL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG EELETI+KFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

TrEMBL top hitse value%identityAlignment
A0A0A0KEF8 Uncharacterized protein1.3e-13091.7Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFGSK VAVA  E VVGRQRHLR+LCCFSNRIQSNGLSS+ QIDFNSPLSSK S S EGLEDVMVGY FGKKRATEVAHSVWK +V+KGDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
        CGNGYDTLAMVKMV+DESGS RVYAMDVQ EALESTSALLDESL EKEKKLVKLSSICHSRMEDVI EDS V LVAFNLGYLPGGNKAITTKSETTFQAL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        KAAHRILKPGGLISLVVYVGHPGG EELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X24.0e-11382.64Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFG K VAV   +PVV RQRH RNL   SN IQSNGLS E Q +F+SP SSK+ SS EGLEDVMVGY+ GKKRATEVAHSVWK ++R+GDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMV+DESGS  VYAMDVQKEAL  TSALL+ESL E+E KLVKLSSICHSRMEDVIPE S VRLVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X14.3e-11583.02Show/hide
Query:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT
        MLSLKFG K VAV   +PVV RQRH RNL   SN IQSNGLS E Q +F+SP SSK+ SS EGLEDVMVGY+ GKKRATEVAHSVWK ++R+GDTVVDAT
Subjt:  MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL
         GNGYDTLAMVKMV+DESGS  VYAMDVQKEAL  TSALL+ESL E+EKKLVKLSSICHSRMEDVIPE S VRLVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQAL

Query:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG EELETI+KF+S+LAVENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  KAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC1114482482.5e-12386.3Show/hide
Query:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP----LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDT
        MLSLKFG K VAV AA EPVVGRQRHLRN CCFSNRI SNG+SSE Q DF+SP     SSKD SS EGLEDVMVGY+FGKKRATEVAHSVWKRV+R GDT
Subjt:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP----LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDT

Query:  VVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAM+KMV+DE+GS RVYAMDVQKEALES SALLDESL EKE+KLVKLSSICHSRMEDVIPE S VRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSET

Query:  TFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        T QAL+AA+RILKPGGLISLVVYVGHPGG EELETI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  TFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC1114839298.7e-12487.27Show/hide
Query:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP-LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVD
        MLSLKFG K VAV AA EPVVGRQRHLRNLCCFSNRI SNG+SSE Q  F+SP  SSKD SS EGLEDVMVGY+FGKKRATEVAHSVWKRV+R GDTVVD
Subjt:  MLSLKFGSKLVAV-AAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSP-LSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVD

Query:  ATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQ
        ATCGNGYDTLAM+KMV+DE+GS RVYAMDVQKEALESTSALLDESL EKE+KLVKLSSICHSRMEDVIPE S VRLVAFNLGYLPGGNKAITTKSETT Q
Subjt:  ATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQ

Query:  ALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR
        AL+AA+RILKPGGLISLVVYVGHPGG EEL+TI+KFS +L VENWICCKLQMLNRPLAPVPVFLFKR
Subjt:  ALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR

SwissProt top hitse value%identityAlignment
O34614 Putative rRNA methylase YtqB5.4e-2236.97Show/hide
Query:  KRATEVAHSVWKRVVRKGDTVVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLV
        K+    +  + K    +GD VVDAT GNG+DT  + ++V +      VYA D+Q+ A+ +T     E LG+  +    L    H ++ + +P ++  ++ 
Subjt:  KRATEVAHSVWKRVVRKGDTVVDATCGNGYDTLAMVKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLV

Query:  A--FNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDL
        A  FNLGYLPGG+K+ITT   +T +A++    I+K  GLI LVVY GHP G+ E   + +F  DL
Subjt:  A--FNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDL

Arabidopsis top hitse value%identityAlignment
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein2.8e-8265.47Show/hide
Query:  CQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDATCGNGYDTLAMVKMVSDESGSV--RVYAMDVQKEALESTSALLDE
        C   F+S  S   +    GLEDV VGY+FG+K+ATEVAH VW++V++KGDTV+DATCGNG DTLAM+KMV  +S      VYAMD+QK+A+ESTS+LLD+
Subjt:  CQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDATCGNGYDTLAMVKMVSDESGSV--RVYAMDVQKEALESTSALLDE

Query:  SLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVEN
        ++G KEK+ VKL ++CHS+M +++PE++ VR+VAFNLGYLPGGNK+I T S+TT  ALKAA RILKPGGLISLVVY+GHPGG+EELE +E F S L V +
Subjt:  SLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVGHPGGQEELETIEKFSSDLAVEN

Query:  WICCKLQMLNRPLAPVPVFLFKR
        WICCK QMLNRPLAPV VF+FKR
Subjt:  WICCKLQMLNRPLAPVPVFLFKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTCTTTGAAATTTGGGTCAAAATTGGTGGCGGTTGCTGCCTTCGAACCAGTCGTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGCTTTTCTAACCGTATTCA
GTCAAACGGGTTATCTTCTGAATGTCAGATTGATTTCAATTCACCATTATCGTCGAAAGATTCTTCGTCTTTTGAAGGACTGGAGGATGTCATGGTCGGCTACATTTTTG
GGAAGAAGAGAGCTACAGAAGTTGCTCACTCTGTATGGAAACGTGTTGTCAGAAAAGGGGATACGGTGGTAGATGCTACTTGTGGAAATGGGTATGATACGCTAGCTATG
GTCAAAATGGTTTCAGATGAATCTGGTTCTGTACGTGTTTATGCAATGGATGTTCAAAAAGAGGCTTTAGAAAGTACTTCTGCATTGCTGGACGAGTCACTCGGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGATTCTCTGGTTAGGCTTGTTGCATTTAACCTAGGCTACCTACCTG
GTGGTAACAAAGCAATAACTACAAAGTCGGAAACAACATTTCAAGCACTTAAAGCTGCCCATAGAATTCTGAAACCTGGAGGGCTTATCAGCCTTGTGGTTTATGTGGGG
CATCCTGGTGGACAGGAAGAATTGGAGACTATCGAAAAATTTTCTAGTGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTTCAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTGTTTTTATTCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTCTTTGAAATTTGGGTCAAAATTGGTGGCGGTTGCTGCCTTCGAACCAGTCGTAGGACGTCAAAGACACTTGAGAAATCTTTGCTGCTTTTCTAACCGTATTCA
GTCAAACGGGTTATCTTCTGAATGTCAGATTGATTTCAATTCACCATTATCGTCGAAAGATTCTTCGTCTTTTGAAGGACTGGAGGATGTCATGGTCGGCTACATTTTTG
GGAAGAAGAGAGCTACAGAAGTTGCTCACTCTGTATGGAAACGTGTTGTCAGAAAAGGGGATACGGTGGTAGATGCTACTTGTGGAAATGGGTATGATACGCTAGCTATG
GTCAAAATGGTTTCAGATGAATCTGGTTCTGTACGTGTTTATGCAATGGATGTTCAAAAAGAGGCTTTAGAAAGTACTTCTGCATTGCTGGACGAGTCACTCGGTGAAAA
AGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGATTCTCTGGTTAGGCTTGTTGCATTTAACCTAGGCTACCTACCTG
GTGGTAACAAAGCAATAACTACAAAGTCGGAAACAACATTTCAAGCACTTAAAGCTGCCCATAGAATTCTGAAACCTGGAGGGCTTATCAGCCTTGTGGTTTATGTGGGG
CATCCTGGTGGACAGGAAGAATTGGAGACTATCGAAAAATTTTCTAGTGACCTGGCTGTTGAGAATTGGATTTGTTGTAAGCTTCAGATGTTAAACCGGCCACTAGCTCC
AGTGCCTGTGTTTTTATTCAAGAGATGA
Protein sequenceShow/hide protein sequence
MLSLKFGSKLVAVAAFEPVVGRQRHLRNLCCFSNRIQSNGLSSECQIDFNSPLSSKDSSSFEGLEDVMVGYIFGKKRATEVAHSVWKRVVRKGDTVVDATCGNGYDTLAM
VKMVSDESGSVRVYAMDVQKEALESTSALLDESLGEKEKKLVKLSSICHSRMEDVIPEDSLVRLVAFNLGYLPGGNKAITTKSETTFQALKAAHRILKPGGLISLVVYVG
HPGGQEELETIEKFSSDLAVENWICCKLQMLNRPLAPVPVFLFKR