CuGenDBv2

Sed0012120 (gene) of Chayote v1 genome

Gene ID: Sed0012120
Organism: Sechium edule (Chayote v1)
Description: S-adenosyl-L-methionine-dependent methyltransferases superfamily protein
Genome location: LG05:42403978..42409364
RNA-Seq Expression: Sed0012120
Synteny: Sed0012120
Gene Ontology terms: NA
InterPro domains: IPR010719 - Putative rRNA methylase
                  IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
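
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span of the gene computed. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

print(parse_location("LG05:42403978..42409364"))
# -> ('LG05', 42403978, 42409364, 5387)
```

Under that assumption the gene spans 5387 bp on LG05, which is longer than the mRNA listed further down, as expected for a locus that includes introns.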


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570591.1 hypothetical protein SDJN03_29506, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.0e-114; identity: 80.81%
Query:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-----------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGD
        MLSLKFG KWV V AA +PVVG Q+HLRNL C SNRI SNG SS++Q  F +P           SSLEGLEDVMVGY+ GKKRATEVAHSVWK VIR GD
Subjt:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-----------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGD

Query:  TVVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSE
        TVVDATCGNGYDTLAM+KMV+DE+GSARVYA+DVQKEALESTSALL+E L+EKE+KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSE
Subjt:  TVVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSE

Query:  TTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        TTLQALEAA RILKPGGLISLVVYVGHPGG EELETIQKF+ +L  +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  TTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

XP_004139901.1 uncharacterized protein LOC101214958 isoform X1 [Cucumis sativus]; e value: 4.6e-115; identity: 81.51%
Query:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT
        MLSLKFGSKWV VA  + VVG Q+HLR+L CFSNRIQSNG SS++Q  F +P       SLEGLEDVMVGY  GKKRATEVAHSVWK +++KGDTVVDAT
Subjt:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL
        CGNGYDTLAMVKMV+DESGSARVYA+DVQ EALESTSALL+E L+EKEKKLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNK +TTKSETT QAL
Subjt:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG EELETI+KF+SDLA +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

XP_022986072.1 uncharacterized protein LOC111483929 [Cucurbita maxima]; e value: 1.2e-115; identity: 82.02%
Query:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD
        MLSLKFG KWV V AA +PVVG Q+HLRNL CFSNRI SNG SS++Q  F +P       SSLEGLEDVMVGY+ GKKRATEVAHSVWK VIR GDTVVD
Subjt:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ
        ATCGNGYDTLAM+KMV+DE+GSARVYA+DVQKEALESTSALL+E L+EKE+KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA RILKPGGLISLVVYVGHPGG EEL+TIQKF+ +L  +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

XP_023511899.1 uncharacterized protein LOC111776775 [Cucurbita pepo subsp. pepo]; e value: 1.0e-114; identity: 81.65%
Query:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD
        MLSLKFG KWV V AA +PVVG Q+HLRNL CFSNRI SNG SS++Q  F +P       SSLEGLEDVMVGY+ GKKRATEVAHSVWK VIR GDTVVD
Subjt:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ
        ATCGNGYDTLAM+KMV+DE+GSARVYA+DVQKEALESTSALL+E L +KE+KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA R+LKPGGLISLVVYVGHPGG EELETIQKF  +L  +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

XP_038902806.1 putative rRNA methylase YtqB isoform X1 [Benincasa hispida]; e value: 4.1e-116; identity: 82.64%
Query:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT
        MLSLKFGSKWV VAA KPVVG Q+H+RN+   +NRIQSNG SS++Q  F +P      SSLEGLEDVMVGY  GKKRATEVAHSVWKHV+RKGDTVVDAT
Subjt:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL
        CGNGYDT AMVKMV+DESGSARVYA+DVQKEALE+TSA L+E L+EKEKKLVKLSSICHSRMEDVI E SPVRLVAFNLGYLPGGNK +TTKSETTL+AL
Subjt:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG EELETIQKF+SDLA +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KEF8 Uncharacterized protein; e value: 2.2e-115; identity: 81.51%
Query:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT
        MLSLKFGSKWV VA  + VVG Q+HLR+L CFSNRIQSNG SS++Q  F +P       SLEGLEDVMVGY  GKKRATEVAHSVWK +++KGDTVVDAT
Subjt:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL
        CGNGYDTLAMVKMV+DESGSARVYA+DVQ EALESTSALL+E L+EKEKKLVKLSSICHSRMEDVI E SPV LVAFNLGYLPGGNK +TTKSETT QAL
Subjt:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        +AA+RILKPGGLISLVVYVGHPGG EELETI+KF+SDLA +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1D648 uncharacterized protein LOC111017978 isoform X2; e value: 5.7e-111; identity: 81.13%
Query:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT
        MLSLKFG K V V  PKPVV  Q+H RNL   SN IQSNG S ++Q  F +P      SSLEGLEDVMVGY++GKKRATEVAHSVWKH+IR+GDTVVDAT
Subjt:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL
         GNGYDTLAMVKMV+DESGS  VYA+DVQKEAL  TSALLEE L+E+E KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSETTLQAL
Subjt:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        EAANRILKPGGLISLVVYVGHPGG EELETIQKFAS+LA +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1D7G1 uncharacterized protein LOC111017978 isoform X1; e value: 6.0e-113; identity: 81.51%
Query:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT
        MLSLKFG K V V  PKPVV  Q+H RNL   SN IQSNG S ++Q  F +P      SSLEGLEDVMVGY++GKKRATEVAHSVWKH+IR+GDTVVDAT
Subjt:  MLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDAT

Query:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL
         GNGYDTLAMVKMV+DESGS  VYA+DVQKEAL  TSALLEE L+E+EKKLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSETTLQAL
Subjt:  CGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQAL

Query:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        EAANRILKPGGLISLVVYVGHPGG EELETIQKFAS+LA +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  EAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1FUH7 uncharacterized protein LOC111448248; e value: 1.9e-114; identity: 80.74%
Query:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP----------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDT
        MLSLKFG KWV V AA +PVVG Q+HLRN  CFSNRI SNG SS++Q  F +P          SSLEGLEDVMVGY+ GKKRATEVAHSVWK VIR GDT
Subjt:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP----------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDT

Query:  VVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSET
        VVDATCGNGYDTLAM+KMV+DE+GSARVYA+DVQKEALES SALL+E L EKE+KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSET
Subjt:  VVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSET

Query:  TLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        TLQALEAA RILKPGGLISLVVYVGHPGG EELETIQKF+ +L  +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  TLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

A0A6J1J6K5 uncharacterized protein LOC111483929; e value: 5.8e-116; identity: 82.02%
Query:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD
        MLSLKFG KWV V AA +PVVG Q+HLRNL CFSNRI SNG SS++Q  F +P       SSLEGLEDVMVGY+ GKKRATEVAHSVWK VIR GDTVVD
Subjt:  MLSLKFGSKWVTV-AAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQ--FVAP-------SSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVD

Query:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ
        ATCGNGYDTLAM+KMV+DE+GSARVYA+DVQKEALESTSALL+E L+EKE+KLVKLSSICHSRMEDVI EGSPVRLVAFNLGYLPGGNK +TTKSETTLQ
Subjt:  ATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQ

Query:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
        ALEAA RILKPGGLISLVVYVGHPGG EEL+TIQKF+ +L  +NW CCKLQMLNRPLAPVPVFLFKR
Subjt:  ALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR

SwissProt top hits (e value, % identity, alignment)
O34614 Putative rRNA methylase YtqB; e value: 1.2e-20; identity: 37.87%
Query:  KRATEVAHSVWKHVIRKGDTVVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSP----
        K+    +  + K    +GD VVDAT GNG+DT  + ++V +   +  VYA D+Q+ A+ +T    +E+L +  +    L    H    D IAE  P    
Subjt:  KRATEVAHSVWKHVIRKGDTVVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSP----

Query:  --VRLVAFNLGYLPGGNKTVTTKSETTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDL
          V    FNLGYLPGG+K++TT   +T++A+E    I+K  GLI LVVY GHP G  E   + +F  DL
Subjt:  --VRLVAFNLGYLPGGNKTVTTKSETTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDL

Arabidopsis top hits (e value, % identity, alignment)
AT1G16445.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein; e value: 3.4e-76; identity: 62.56%
Query:  FSSKHQFVAPSSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDATCGNGYDTLAMVKMVSDES--GSARVYAIDVQKEALESTSALLEEKLNE
        FSS         + GLEDV VGY+ G+K+ATEVAH VW+ VI+KGDTV+DATCGNG DTLAM+KMV  +S      VYA+D+QK+A+ESTS+LL++ +  
Subjt:  FSSKHQFVAPSSLEGLEDVMVGYILGKKRATEVAHSVWKHVIRKGDTVVDATCGNGYDTLAMVKMVSDES--GSARVYAIDVQKEALESTSALLEEKLNE

Query:  KEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCC
        KEK+ VKL ++CHS+M +++ E + VR+VAFNLGYLPGGNK++ T S+TTL AL+AA RILKPGGLISLVVY+GHPGG EELE ++ F S L   +W CC
Subjt:  KEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYLPGGNKTVTTKSETTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCC

Query:  KLQMLNRPLAPVPVFLFKR
        K QMLNRPLAPV VF+FKR
Subjt:  KLQMLNRPLAPVPVFLFKR


Sequences
CDS sequence
ATGGGCTCATTGGTAACTCGAGCCGATCGTATCTTGACAGGGCTTTGTTTCAATTCGACCCATCTTTTGATAATCATTCCGTCCCTTCTCCGGCGACAAAAAAATTTCCG
GCCAAGGAAATTTAGAGGATCGAGAGAAATGTTATCCTTGAAATTTGGGTCTAAATGGGTTACGGTTGCAGCCCCAAAACCAGTTGTAGGACATCAAAAACACTTGAGAA
ATCTTGGGTGTTTCTCCAATCGTATTCAGTCAAATGGGTTCTCTTCAAAACATCAGTTCGTTGCACCTTCATCTTTAGAAGGATTGGAGGATGTGATGGTTGGCTACATT
CTTGGAAAGAAGAGAGCCACAGAAGTTGCTCACTCTGTATGGAAACATGTCATCAGAAAAGGGGATACTGTGGTAGATGCTACTTGTGGAAACGGGTATGATACTCTAGC
TATGGTCAAAATGGTTTCAGATGAATCCGGGTCTGCTCGTGTTTATGCGATCGATGTTCAGAAAGAGGCTTTAGAAAGTACTTCTGCACTGCTGGAAGAAAAACTCAATG
AAAAAGAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTGCAGAGGGTTCTCCTGTTAGGCTTGTTGCATTTAATCTAGGCTACCTA
CCTGGTGGTAACAAAACAGTAACTACAAAGTCAGAAACAACATTACAAGCACTTGAAGCTGCCAATAGAATTCTAAAACCTGGAGGGCTTATCAGCCTTGTGGTTTATGT
GGGCCATCCTGGTGGAGGGGAAGAATTGGAGACCATACAAAAATTTGCTAGCGACTTAGCTTTTGACAATTGGACTTGTTGTAAGCTTCAAATGTTAAACCGCCCGCTAG
CTCCAGTGCCTGTGTTCTTATTCAAAAGATGA
mRNA sequence
AACTAATCTCAAGGAATGATCCATAGGTTATAGAATGTCAAAATGTAATTGAAAGATTACAAACAAATGCTCTTGATTCGGACATGGTGACCCAATTTCTCCTTAATGGG
CTCATTGGTAACTCGAGCCGATCGTATCTTGACAGGGCTTTGTTTCAATTCGACCCATCTTTTGATAATCATTCCGTCCCTTCTCCGGCGACAAAAAAATTTCCGGCCAA
GGAAATTTAGAGGATCGAGAGAAATGTTATCCTTGAAATTTGGGTCTAAATGGGTTACGGTTGCAGCCCCAAAACCAGTTGTAGGACATCAAAAACACTTGAGAAATCTT
GGGTGTTTCTCCAATCGTATTCAGTCAAATGGGTTCTCTTCAAAACATCAGTTCGTTGCACCTTCATCTTTAGAAGGATTGGAGGATGTGATGGTTGGCTACATTCTTGG
AAAGAAGAGAGCCACAGAAGTTGCTCACTCTGTATGGAAACATGTCATCAGAAAAGGGGATACTGTGGTAGATGCTACTTGTGGAAACGGGTATGATACTCTAGCTATGG
TCAAAATGGTTTCAGATGAATCCGGGTCTGCTCGTGTTTATGCGATCGATGTTCAGAAAGAGGCTTTAGAAAGTACTTCTGCACTGCTGGAAGAAAAACTCAATGAAAAA
GAGAAGAAACTTGTTAAACTCTCTTCCATTTGCCACAGCAGAATGGAGGATGTCATTGCAGAGGGTTCTCCTGTTAGGCTTGTTGCATTTAATCTAGGCTACCTACCTGG
TGGTAACAAAACAGTAACTACAAAGTCAGAAACAACATTACAAGCACTTGAAGCTGCCAATAGAATTCTAAAACCTGGAGGGCTTATCAGCCTTGTGGTTTATGTGGGCC
ATCCTGGTGGAGGGGAAGAATTGGAGACCATACAAAAATTTGCTAGCGACTTAGCTTTTGACAATTGGACTTGTTGTAAGCTTCAAATGTTAAACCGCCCGCTAGCTCCA
GTGCCTGTGTTCTTATTCAAAAGATGAAAATGATGACTATCTTGGATTGAACTAGTCATGAAGCGGAGGCCTAGAATATAACACGGTCAAGTGACCGAGTTATATCACCT
TGGCTCACGCAGGAAATTTTCACTAACACATCTCTACTAAGACCAAGTTTAACTTCTGAATCAGACGAGTGATCTGACAGGTTACTTTTATATTTGTATCCATCTTTTTT
ATAAAACAATATTTTGTGGAGTGAGATTTAAACCTAACTTTATGGTTACTGAGCTGTATAAGATGCAATTGAAATTATGTTA
Protein sequence
MGSLVTRADRILTGLCFNSTHLLIIIPSLLRRQKNFRPRKFRGSREMLSLKFGSKWVTVAAPKPVVGHQKHLRNLGCFSNRIQSNGFSSKHQFVAPSSLEGLEDVMVGYI
LGKKRATEVAHSVWKHVIRKGDTVVDATCGNGYDTLAMVKMVSDESGSARVYAIDVQKEALESTSALLEEKLNEKEKKLVKLSSICHSRMEDVIAEGSPVRLVAFNLGYL
PGGNKTVTTKSETTLQALEAANRILKPGGLISLVVYVGHPGGGEELETIQKFASDLAFDNWTCCKLQMLNRPLAPVPVFLFKR
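
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base of the CDS; the function name `translate` is hypothetical, and only the first 24 and last 12 bases of the CDS listed above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0 until a stop codon.

    Assumption: the sequence contains only A, C, G and T. Trailing
    bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS listed above.
print(translate("ATGGGCTCATTGGTAACTCGAGCC"))  # -> MGSLVTRA
# Last 12 bases of the CDS listed above, including the stop codon.
print(translate("TTCAAAAGATGA"))  # -> FKR
```

Both fragments agree with the start (MGSLVTRA) and end (FKR) of the protein sequence above. The full CDS is 912 bases, which corresponds to 303 residues plus a stop codon and matches the length of the listed protein.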