; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04870 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04870
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTENA_THI-4 domain-containing protein
Genome locationClcChr06:5028624..5029939
RNA-Seq ExpressionClc06G04870
SyntenyClc06G04870
Gene Ontology termsGO:0006772 - thiamine metabolic process (biological process)
GO:0005829 - cytosol (cellular component)
InterPro domainsIPR004305 - Thiaminase-2/PQQC
IPR016084 - Haem oxygenase-like, multi-helical


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148207.1 probable bifunctional TENA-E protein [Cucumis sativus]2.0e-11283.33Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MADPK RAQL G MTAT++WLRKHRLIYT ATRHPF+LTIRDGT+DLSAF+TWLEQ+  FLRSFAAFVGS LVKAWKESD+RADEEVIL  LA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKE+LKRDINL+E++PQ ATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFA+CLEEG KTPLELRE CERWGNEGFG YCN LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG
        GS EVSKKAEVG LRVLEYEV FWNM+ P PHRTA + EG
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG

XP_008458358.1 PREDICTED: probable bifunctional TENA-E protein isoform X1 [Cucumis melo]3.1e-11888.51Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MAD K RAQL G MTATD+WLRKHRLIYT ATRHPF+LTIRDGTVDLSAF+TWLEQECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFAYCLEEG KTPLELRE CERWG+EGF KYC+ LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA
        GSGEV+KKAEVGLLRVLEYEVGFWNM+RP  HRTA
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA

XP_008458359.1 PREDICTED: probable bifunctional TENA-E protein isoform X2 [Cucumis melo]1.9e-11587.66Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MAD K RAQL G MTATD+WLRKHRLIYT ATRHPF+LTIRDGTVDLSAF+TWL  ECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFAYCLEEG KTPLELRE CERWG+EGF KYC+ LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA
        GSGEV+KKAEVGLLRVLEYEVGFWNM+RP  HRTA
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA

XP_022138361.1 probable bifunctional TENA-E protein [Momordica charantia]1.7e-11182.57Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MADPKTR QLGG M ATDAWLRKHRLIY EATRHPFVLTIRDGT+D SAF +W+EQECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLASL+DE+ 
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKR +NL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW +EAVYHESFAYCL +G+KTP ELRE CERWGNEGFGKYCN LK IADRR+EM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLR----PHPHRTAPL
         +GEV+KKAEV LLRVLEYEVGFWNM R    P P  T P+
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLR----PHPHRTAPL

XP_038875310.1 probable bifunctional TENA-E protein [Benincasa hispida]2.4e-12692.5Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MADPKTRAQL GGMTATD+WLRKHRLIYTEATRHPFVLTIRDGTVDLSAFR WLEQECEFLRSFAAFV S LVKAWKESD+RADEEVILGSLASL+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINLTEI+PQKAT GYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYC EEG KTPLELRE C RWGNEGFGKYCNRLKEIADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG
        GSGEVSKKAEVGLLRVLEYEVGFWNM+RP PHRTAP+V G
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG

TrEMBL top hitse value%identityAlignment
A0A0A0KEX4 TENA_THI-4 domain-containing protein9.6e-11383.33Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MADPK RAQL G MTAT++WLRKHRLIYT ATRHPF+LTIRDGT+DLSAF+TWLEQ+  FLRSFAAFVGS LVKAWKESD+RADEEVIL  LA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKE+LKRDINL+E++PQ ATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFA+CLEEG KTPLELRE CERWGNEGFG YCN LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG
        GS EVSKKAEVG LRVLEYEV FWNM+ P PHRTA + EG
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEG

A0A1S3C775 probable bifunctional TENA-E protein isoform X11.5e-11888.51Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MAD K RAQL G MTATD+WLRKHRLIYT ATRHPF+LTIRDGTVDLSAF+TWLEQECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFAYCLEEG KTPLELRE CERWG+EGF KYC+ LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA
        GSGEV+KKAEVGLLRVLEYEVGFWNM+RP  HRTA
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA

A0A1S3C7T1 probable bifunctional TENA-E protein isoform X29.2e-11687.66Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MAD K RAQL G MTATD+WLRKHRLIYT ATRHPF+LTIRDGTVDLSAF+TWL  ECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFAYCLEEG KTPLELRE CERWG+EGF KYC+ LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA
        GSGEV+KKAEVGLLRVLEYEVGFWNM+RP  HRTA
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA

A0A5D3BVD1 Putative bifunctional TENA-E protein isoform X11.5e-11888.51Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MAD K RAQL G MTATD+WLRKHRLIYT ATRHPF+LTIRDGTVDLSAF+TWLEQECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLA+L+DE A
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKRDINL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW IEAVYHESFAYCLEEG KTPLELRE CERWG+EGF KYC+ LK+IADRRLEM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA
        GSGEV+KKAEVGLLRVLEYEVGFWNM+RP  HRTA
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTA

A0A6J1C9H5 probable bifunctional TENA-E protein8.1e-11282.57Show/hide
Query:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA
        MADPKTR QLGG M ATDAWLRKHRLIY EATRHPFVLTIRDGT+D SAF +W+EQECEFLRSFAAFVGS LVKAWKESD+RADEEVILGSLASL+DE+ 
Subjt:  MADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELA

Query:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM
        WFKKEALKR +NL+EI+PQKATAGYSRFLESLMRPEVEYTVAITALW +EAVYHESFAYCL +G+KTP ELRE CERWGNEGFGKYCN LK IADRR+EM
Subjt:  WFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEM

Query:  GSGEVSKKAEVGLLRVLEYEVGFWNMLR----PHPHRTAPL
         +GEV+KKAEV LLRVLEYEVGFWNM R    P P  T P+
Subjt:  GSGEVSKKAEVGLLRVLEYEVGFWNMLR----PHPHRTAPL

SwissProt top hitse value%identityAlignment
B6TPF2 Bifunctional TENA2 protein1.1e-6049.54Show/hide
Query:  GGGM--TATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALK
        GGG+    T AW+ KHR +Y  ATRHPF ++IRDGTVD+SAF+ WL Q+  F+R F AF+ S L+K  K+ D  +D E+ILG +AS+SDE++WFK EA  
Subjt:  GGGM--TATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALK

Query:  RDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKK
          ++L  + P KA   Y RFL S   PE+ Y VA+T  W IE VY +SF +C+++GNKTP EL   C+RWG+ GF +YC  L+ I DR L     +  + 
Subjt:  RDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKK

Query:  AEVGLLRVLEYEVGFWNM
        AE   +RVLE E+GFW+M
Subjt:  AEVGLLRVLEYEVGFWNM

Q9ASY9 Bifunctional TENA-E protein1.6e-6958.29Show/hide
Query:  DAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEII
        D W+ KHR IYT ATRH FV++IRDG+VDLS+FRTWL Q+  F+R F  FV S L++A K+S E +D EV+LG +ASL+DE+ WFK+E  K D++ + ++
Subjt:  DAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEII

Query:  PQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRVL
        PQ+A   Y RFLE LM  EV+Y V +TA W IEAVY ESFA+CLE+GNKTP+EL   C RWGN+GF +YC+ +K IA+R LE  SGEV  +AE  L+RVL
Subjt:  PQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRVL

Query:  EYEVGFWNMLR
        E EV FW M R
Subjt:  EYEVGFWNMLR

Q9SWB6 Probable bifunctional TENA-E protein3.2e-7360.38Show/hide
Query:  TDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEI
        T+ WL+KHRL+Y  ATRHP +++IRDGT++ ++F+TWL Q+  F+R+F  FV S L+KAWKESD   D EVILG +ASL DE++WFK EA K  I+L+++
Subjt:  TDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEI

Query:  IPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRV
        +PQ+A   Y   LESLM P+ EYTVAITA W IE VY ESFA+C+EEG+KTP EL+E C RWGNE FGKYC  L+ IA+R L+  S E  KKAEV LL V
Subjt:  IPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRV

Query:  LEYEVGFWNMLR
        LE+EV FWNM R
Subjt:  LEYEVGFWNMLR

Arabidopsis top hitse value%identityAlignment
AT3G16990.1 Haem oxygenase-like, multi-helical1.2e-7058.29Show/hide
Query:  DAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEII
        D W+ KHR IYT ATRH FV++IRDG+VDLS+FRTWL Q+  F+R F  FV S L++A K+S E +D EV+LG +ASL+DE+ WFK+E  K D++ + ++
Subjt:  DAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGSALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEII

Query:  PQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRVL
        PQ+A   Y RFLE LM  EV+Y V +TA W IEAVY ESFA+CLE+GNKTP+EL   C RWGN+GF +YC+ +K IA+R LE  SGEV  +AE  L+RVL
Subjt:  PQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGNEGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRVL

Query:  EYEVGFWNMLR
        E EV FW M R
Subjt:  EYEVGFWNMLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCACGTGTACAAGAAAACCCCTTCAAGTAATCACACGCGTACAATTTCTCCCCCGCTATTATCTATTTCACTCAGACTTCTTTCCTTCCATTCTGAATTCATAAG
CAACACAAAAATGGCGGACCCGAAAACCAGAGCTCAACTCGGCGGAGGAATGACCGCCACCGACGCATGGCTCAGAAAGCATCGCCTCATCTACACGGAGGCCACACGCC
ATCCTTTCGTCCTCACCATTCGCGACGGCACCGTCGATCTCTCCGCCTTCAGAACCTGGCTGGAACAGGAATGCGAATTTCTCCGATCTTTCGCTGCATTCGTCGGAAGT
GCGTTGGTGAAAGCATGGAAAGAATCGGACGAACGCGCGGACGAGGAAGTGATTCTCGGAAGCTTAGCATCTCTTAGCGACGAATTGGCGTGGTTCAAGAAAGAAGCCCT
GAAACGAGACATCAATTTGACTGAAATTATTCCTCAGAAAGCCACGGCCGGCTATTCTAGGTTTCTGGAGAGTTTGATGAGGCCGGAAGTGGAATACACGGTGGCGATTA
CGGCGCTTTGGGGGATAGAAGCGGTGTACCATGAGAGCTTTGCGTATTGCTTGGAAGAAGGAAACAAAACGCCATTGGAATTGAGAGAAGGTTGCGAAAGATGGGGCAAT
GAAGGATTTGGGAAGTACTGTAATAGATTGAAGGAGATTGCGGATAGAAGATTGGAAATGGGGAGTGGAGAAGTGAGTAAAAAAGCAGAAGTGGGTCTGTTGAGAGTTCT
TGAATATGAAGTTGGGTTTTGGAATATGCTCCGCCCTCATCCCCACCGCACTGCTCCGCTTGTCGAAGGGTTTTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAGTTAATTTTTTAATTTAAAAATATTCGTTAAATGCCACGAAGCATGTGACGGCGCATAAGCAATGCTCCACGTGTACAAGAAAACCCCTTCAAGTAATCACACGC
GTACAATTTCTCCCCCGCTATTATCTATTTCACTCAGACTTCTTTCCTTCCATTCTGAATTCATAAGCAACACAAAAATGGCGGACCCGAAAACCAGAGCTCAACTCGGC
GGAGGAATGACCGCCACCGACGCATGGCTCAGAAAGCATCGCCTCATCTACACGGAGGCCACACGCCATCCTTTCGTCCTCACCATTCGCGACGGCACCGTCGATCTCTC
CGCCTTCAGAACCTGGCTGGAACAGGAATGCGAATTTCTCCGATCTTTCGCTGCATTCGTCGGAAGTGCGTTGGTGAAAGCATGGAAAGAATCGGACGAACGCGCGGACG
AGGAAGTGATTCTCGGAAGCTTAGCATCTCTTAGCGACGAATTGGCGTGGTTCAAGAAAGAAGCCCTGAAACGAGACATCAATTTGACTGAAATTATTCCTCAGAAAGCC
ACGGCCGGCTATTCTAGGTTTCTGGAGAGTTTGATGAGGCCGGAAGTGGAATACACGGTGGCGATTACGGCGCTTTGGGGGATAGAAGCGGTGTACCATGAGAGCTTTGC
GTATTGCTTGGAAGAAGGAAACAAAACGCCATTGGAATTGAGAGAAGGTTGCGAAAGATGGGGCAATGAAGGATTTGGGAAGTACTGTAATAGATTGAAGGAGATTGCGG
ATAGAAGATTGGAAATGGGGAGTGGAGAAGTGAGTAAAAAAGCAGAAGTGGGTCTGTTGAGAGTTCTTGAATATGAAGTTGGGTTTTGGAATATGCTCCGCCCTCATCCC
CACCGCACTGCTCCGCTTGTCGAAGGGTTTTGAATGACAAAAGACAATAAGGGTTAAATATTATTATTATTTATTTATTTTTAATTTCAGTTTTATTTCATTTTGGTCCC
CCTATCATCATAATGTCTAATTTTAGTAGTTTTTTTTCAAGTCCAAATATGTATATAGTCTCTGTTAGTCTTTTGTATCTTTACCCATATGAAAATAACTAAGACAAATA
TTTTTGGATTTGGGGTGTGGGTCAATTATTCAAATATTTTT
Protein sequenceShow/hide protein sequence
MLHVYKKTPSSNHTRTISPPLLSISLRLLSFHSEFISNTKMADPKTRAQLGGGMTATDAWLRKHRLIYTEATRHPFVLTIRDGTVDLSAFRTWLEQECEFLRSFAAFVGS
ALVKAWKESDERADEEVILGSLASLSDELAWFKKEALKRDINLTEIIPQKATAGYSRFLESLMRPEVEYTVAITALWGIEAVYHESFAYCLEEGNKTPLELREGCERWGN
EGFGKYCNRLKEIADRRLEMGSGEVSKKAEVGLLRVLEYEVGFWNMLRPHPHRTAPLVEGF