; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009772 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009772
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionS-adenosylmethionine-dependent methyltransferase, putative
Genome locationchr9:42199119..42202023
RNA-Seq ExpressionLag0009772
SyntenyLag0009772
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR010719 - Putative rRNA methylase
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]2.9e-12185.24Show/hide
Query:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-----SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGD
        MLSLKFG +WV V AA + +V RQR+LRN  C SNR  SNG+SSE+Q+DFSSP     SS+K+FSSLEGLEDVMVGY FGKKRATEVAHSVW RVIR GD
Subjt:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-----SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGD

Query:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE
        TVVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQKEALESTSALL+ESL+EKE+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE
Subjt:  TVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSE

Query:  TTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        TTLQALEAA RILKPGGLISLVVYVGHPGGLEELETIQ+F+  L VENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  TTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

XP_022149584.1 uncharacterized protein LOC111017978 isoform X1 [Momordica charantia]9.9e-12286.42Show/hide
Query:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT
        MLSLKFG + V V  PK +VRRQR+ RN W  SN  QSNGLS E+QT+FSSPSS+KEFSSLEGLEDVMVGY  GKKRATEVAHSVW  +IR+GDTVVDAT
Subjt:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQKEAL  TSALLEESL+E+EKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        EAANRILKPGGLISLVVYVGHPGG+EELETIQ+FAS LAVENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

XP_022986072.1 uncharacterized protein LOC111483929 [Cucurbita maxima]1.3e-12186.52Show/hide
Query:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD
        MLSLKFG +WV V AA + +V RQR+LRN  CFSNR  SNG+SSE+QT FSSP SS+K+FSSLEGLEDVMVGY FGKKRATEVAHSVW RVIR GDTVVD
Subjt:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
        ATCGNGYDTLAM+KMVADE+GSARVYAMDVQKEALESTSALL+ESL+EKE+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA RILKPGGLISLVVYVGHPGGLEEL+TIQ+F+  L VENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

XP_023511899.1 uncharacterized protein LOC111776775 [Cucurbita pepo subsp. pepo]2.9e-12186.14Show/hide
Query:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD
        MLSLKFG +WV V AA + +V RQR+LRN  CFSNR  SNG+SSE+Q+DFSSP SS+K+FSSLEGLEDVMVGY FGKKRATEVAHSVW RVIR GDTVVD
Subjt:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
        ATCGNGYDTLAM+KMVADE+GSARVYAMDVQKEALESTSALL+ESL +KE+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA R+LKPGGLISLVVYVGHPGGLEELETIQ+F   L VENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]2.2e-12186.04Show/hide
Query:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT
        MLSLKFGS+WV VAA K +V  QR++RN    +NR QSNGLSSE+Q DF+SP S K+FSSLEGLEDVMVGYFFGKKRATEVAHSVW  V+RKGDTVVDAT
Subjt:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
        CGNGYDT AMVKMVADESGSARVYAMDVQKEALE+TSA L+ESL+EKEKKLVKLSSICHSRMEDVIPE SPVRLVAFNLGYLPGGNKAITTKSETTL+AL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGGLEELETIQ+F+S LAVENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

TrEMBL top hitse value%identityAlignment
A0A0A0KEF8 Uncharacterized protein2.6e-12084.15Show/hide
Query:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT
        MLSLKFGS+WV VA  + +V RQR+LR+  CFSNR QSNGLSS++Q DF+SP S+K   SLEGLEDVMVGYFFGKKRATEVAHSVW  +++KGDTVVDAT
Subjt:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
        CGNGYDTLAMVKMVADESGSARVYAMDVQ EALESTSALL+ESL+EKEKKLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNKAITTKSETT QAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG+EELETI++F+S LAVENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X23.4e-12086.04Show/hide
Query:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT
        MLSLKFG + V V  PK +VRRQR+ RN W  SN  QSNGLS E+QT+FSSPSS+KEFSSLEGLEDVMVGY  GKKRATEVAHSVW  +IR+GDTVVDAT
Subjt:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQKEAL  TSALLEESL+E+E KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        EAANRILKPGGLISLVVYVGHPGG+EELETIQ+FAS LAVENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X14.8e-12286.42Show/hide
Query:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT
        MLSLKFG + V V  PK +VRRQR+ RN W  SN  QSNGLS E+QT+FSSPSS+KEFSSLEGLEDVMVGY  GKKRATEVAHSVW  +IR+GDTVVDAT
Subjt:  MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
         GNGYDTLAMVKMVADESGS  VYAMDVQKEAL  TSALLEESL+E+EKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL
Subjt:  CGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        EAANRILKPGGLISLVVYVGHPGG+EELETIQ+FAS LAVENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC1114482482.4e-12185.56Show/hide
Query:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP----SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDT
        MLSLKFG +WV V AA + +V RQR+LRN  CFSNR  SNG+SSE+Q+DFSSP    SS+K+FSSLEGLEDVMVGY FGKKRATEVAHSVW RVIR GDT
Subjt:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP----SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDT

Query:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
        VVDATCGNGYDTLAM+KMVADE+GSARVYAMDVQKEALES SALL+ESL EKE+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET
Subjt:  VVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSET

Query:  TLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        TLQALEAA RILKPGGLISLVVYVGHPGGLEELETIQ+F+  L VENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC1114839296.2e-12286.52Show/hide
Query:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD
        MLSLKFG +WV V AA + +V RQR+LRN  CFSNR  SNG+SSE+QT FSSP SS+K+FSSLEGLEDVMVGY FGKKRATEVAHSVW RVIR GDTVVD
Subjt:  MLSLKFGSQWVTV-AAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSP-SSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
        ATCGNGYDTLAM+KMVADE+GSARVYAMDVQKEALESTSALL+ESL+EKE+KLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA RILKPGGLISLVVYVGHPGGLEEL+TIQ+F+  L VENW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR

SwissProt top hitse value%identityAlignment
O34614 Putative rRNA methylase YtqB6.6e-2038.93Show/hide
Query:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVA--FNLGYLPGGNKAI
        +GD VVDAT GNG+DT  + ++V +   +  VYA D+Q+ A+ +T    +E L +  +    L    H ++ + +P  +  ++ A  FNLGYLPGG+K+I
Subjt:  KGDTVVDATCGNGYDTLAMVKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVA--FNLGYLPGGNKAI

Query:  TTKSETTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGL
        TT   +T++A+E    I+K  GLI LVVY GHP G  E   + +F   L
Subjt:  TTKSETTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGL

Arabidopsis top hitse value%identityAlignment
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein2.0e-8066.06Show/hide
Query:  SSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQKEALESTSALLEESLNEK
        S+PS ++ F  + GLEDV VGY FG+K+ATEVAH VW +VI+KGDTV+DATCGNG DTLAM+KMV  +S      VYAMD+QK+A+ESTS+LL++++  K
Subjt:  SSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDATCGNGYDTLAMVKMVADES--GSARVYAMDVQKEALESTSALLEESLNEK

Query:  EKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCK
        EK+ VKL ++CHS+M +++PE + VR+VAFNLGYLPGGNK+I T S+TTL AL+AA RILKPGGLISLVVY+GHPGG EELE ++ F SGL V +W CCK
Subjt:  EKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAANRILKPGGLISLVVYVGHPGGLEELETIQQFASGLAVENWTCCK

Query:  LQMLNRPLAPVPVFLFKR
         QMLNRPLAPV VF+FKR
Subjt:  LQMLNRPLAPVPVFLFKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATCTCTGAAATTTGGGTCACAATGGGTTACGGTTGCTGCCCCCAAACTAATCGTTCGACGTCAAAGAAACTTGAGAAATCAATGGTGCTTTTCTAATCGTTTTCA
GTCAAATGGGCTGTCTTCTGAATTTCAGACGGATTTCTCTTCACCATCGTCGGCGAAAGAGTTTTCGTCTTTAGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GAAAGAAGAGAGCGACAGAAGTTGCTCACTCTGTATGGAACCGTGTCATCAGAAAAGGGGATACTGTGGTAGATGCTACTTGTGGAAATGGGTATGATACTCTAGCTATG
GTCAAAATGGTTGCAGATGAATCTGGTTCTGCTCGTGTTTATGCGATGGATGTTCAGAAAGAGGCTTTAGAAAGTACGTCTGCATTGCTGGAAGAATCACTCAATGAAAA
GGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGGTTCTCCCGTTAGGCTTGTTGCATTTAACCTTGGCTATCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTGCAAGCACTTGAAGCTGCCAATAGAATTCTAAAACCTGGAGGACTTATCAGCCTTGTGGTTTATGTGGGG
CATCCTGGTGGACTGGAAGAATTGGAGACTATACAACAATTTGCTAGCGGCTTGGCTGTTGAGAATTGGACTTGTTGTAAGCTTCAGATGCTAAACCGGCCACTAGCTCC
AGTGCCTGTGTTCTTATTCAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTATCTCTGAAATTTGGGTCACAATGGGTTACGGTTGCTGCCCCCAAACTAATCGTTCGACGTCAAAGAAACTTGAGAAATCAATGGTGCTTTTCTAATCGTTTTCA
GTCAAATGGGCTGTCTTCTGAATTTCAGACGGATTTCTCTTCACCATCGTCGGCGAAAGAGTTTTCGTCTTTAGAAGGACTGGAGGATGTCATGGTCGGCTACTTTTTTG
GAAAGAAGAGAGCGACAGAAGTTGCTCACTCTGTATGGAACCGTGTCATCAGAAAAGGGGATACTGTGGTAGATGCTACTTGTGGAAATGGGTATGATACTCTAGCTATG
GTCAAAATGGTTGCAGATGAATCTGGTTCTGCTCGTGTTTATGCGATGGATGTTCAGAAAGAGGCTTTAGAAAGTACGTCTGCATTGCTGGAAGAATCACTCAATGAAAA
GGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTCCAGAGGGTTCTCCCGTTAGGCTTGTTGCATTTAACCTTGGCTATCTACCTG
GTGGTAACAAAGCAATCACTACAAAGTCAGAAACAACATTGCAAGCACTTGAAGCTGCCAATAGAATTCTAAAACCTGGAGGACTTATCAGCCTTGTGGTTTATGTGGGG
CATCCTGGTGGACTGGAAGAATTGGAGACTATACAACAATTTGCTAGCGGCTTGGCTGTTGAGAATTGGACTTGTTGTAAGCTTCAGATGCTAAACCGGCCACTAGCTCC
AGTGCCTGTGTTCTTATTCAAGAGATGA
Protein sequenceShow/hide protein sequence
MLSLKFGSQWVTVAAPKLIVRRQRNLRNQWCFSNRFQSNGLSSEFQTDFSSPSSAKEFSSLEGLEDVMVGYFFGKKRATEVAHSVWNRVIRKGDTVVDATCGNGYDTLAM
VKMVADESGSARVYAMDVQKEALESTSALLEESLNEKEKKLVKLSSICHSRMEDVIPEGSPVRLVAFNLGYLPGGNKAITTKSETTLQALEAANRILKPGGLISLVVYVG
HPGGLEELETIQQFASGLAVENWTCCKLQMLNRPLAPVPVFLFKR