; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019339 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019339
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTENA_THI-4 domain-containing protein
Genome locationscaffold611:482388..483326
RNA-Seq ExpressionMS019339
SyntenyMS019339
Gene Ontology termsGO:0006772 - thiamine metabolic process (biological process)
GO:0005829 - cytosol (cellular component)
InterPro domainsIPR004305 - Thiaminase-2/PQQC
IPR016084 - Haem oxygenase-like, multi-helical


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594010.1 putative bifunctional TENA-E protein, partial [Cucurbita argyrosperma subsp. sororia]2.2e-10781.82Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MADPKTR QL G M ATD+W+RKHRLIY +ATRHPFVL+IRDGT+D SAF +WVEQ+CEFLRSF AFV SVLVKAWKESDDRADEEVILGSLA+LNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR ++L++IVPQ ATAGYSRFLESLMRPE+EYTVAITALWA+EAVYHESFAYC+ DGSKTP ELREACERWGNEGFG YCNTLK I DRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP
        AAGE++KK EVALLRVLE EV FWNM+R  P
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP

XP_008458358.1 PREDICTED: probable bifunctional TENA-E protein isoform X1 [Cucumis melo]1.2e-10881.51Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MAD K R QL G M ATD+WLRKHRLIY  ATRHPF+LTIRDGT+D SAF +W+EQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLA+LNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR +NLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWA+EAVYHESFAYCL +G+KTP ELREACERWG+EGF KYC+TLK IADRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT
         +GEV KKAEV LLRVLEYEVGFWNM R       +C+
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT

XP_022138361.1 probable bifunctional TENA-E protein [Momordica charantia]2.2e-139100Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT
        AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT

XP_023514083.1 bifunctional TENA-E protein isoform X2 [Cucurbita pepo subsp. pepo]2.8e-10781.82Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MADPKTR QL G M ATD+W+RKHRLIY +ATRHP VL+IRDGT+D +AF +WVEQECEFLRSF AFV SVLVKAWKESDDRADEEVILGSLASLNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR ++L++IVPQ ATAGYSRFLESLMRPE+EYTVAITALWA+EAVYHESFAYC+ DGSKTP ELREACERWGNEGFG YCNTLK I DRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP
        AAGE++KK EVALLRVLE EV FWNM+R  P
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP

XP_038875310.1 probable bifunctional TENA-E protein [Benincasa hispida]2.7e-11082.3Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MADPKTR QL G M ATD+WLRKHRLIY EATRHPFVLTIRDGT+D SAF  W+EQECEFLRSFAAFV SVLVKAWKESDDRADEEVILGSLASLNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR +NL+EIVPQKAT GYSRFLESLMRPEVEYTVAITALW +EAVYHESFAYC  +G+KTP ELREAC RWGNEGFGKYCN LK IADRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAG
         +GEV+KKAEV LLRVLEYEVGFWNM R    P P  T PV G
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAG

TrEMBL top hitse value%identityAlignment
A0A1S3C775 probable bifunctional TENA-E protein isoform X15.6e-10981.51Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MAD K R QL G M ATD+WLRKHRLIY  ATRHPF+LTIRDGT+D SAF +W+EQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLA+LNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR +NLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWA+EAVYHESFAYCL +G+KTP ELREACERWG+EGF KYC+TLK IADRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT
         +GEV KKAEV LLRVLEYEVGFWNM R       +C+
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT

A0A1S3C7T1 probable bifunctional TENA-E protein isoform X23.4e-10680.67Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MAD K R QL G M ATD+WLRKHRLIY  ATRHPF+LTIRDGT+D SAF +W+  ECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLA+LNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR +NLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWA+EAVYHESFAYCL +G+KTP ELREACERWG+EGF KYC+TLK IADRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT
         +GEV KKAEV LLRVLEYEVGFWNM R       +C+
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT

A0A5D3BVD1 Putative bifunctional TENA-E protein isoform X15.6e-10981.51Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MAD K R QL G M ATD+WLRKHRLIY  ATRHPF+LTIRDGT+D SAF +W+EQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLA+LNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR +NLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWA+EAVYHESFAYCL +G+KTP ELREACERWG+EGF KYC+TLK IADRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT
         +GEV KKAEV LLRVLEYEVGFWNM R       +C+
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCT

A0A6J1C9H5 probable bifunctional TENA-E protein1.0e-139100Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT
        AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDPVPLPQCTVPVAGAT

A0A6J1KLN7 bifunctional TENA-E protein1.2e-10681.39Show/hide
Query:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG
        MAD KTR QL G M ATD+W+RKHRLIY +ATRHPFVL+IRDGT+D SAF +WVEQEC+FLRSF AFV SVLVKAWKESDDRADEEVILGSLASLNDE  
Subjt:  MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIG

Query:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM
        WFKKEALKR ++L++IVPQ ATAGYSRFLESLMRPE+EYTVAITALWA+EAVYHESFAYC+ DGSKTP ELREACERWGNEGFG YCNTLK I DRR+EM
Subjt:  WFKKEALKRGMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEM

Query:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP
        AAGE++KK E+ALLRVLE EV FWNM+R  P
Subjt:  AAGEVAKKAEVALLRVLEYEVGFWNMNRRDP

SwissProt top hitse value%identityAlignment
B6TPF2 Bifunctional TENA2 protein5.8e-6350.46Show/hide
Query:  GGV-MAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKR
        GGV  A T AW+ KHR +Y  ATRHPF ++IRDGT+D SAF  W+ Q+  F+R F AF+ SVL+K  K+ +D +D E+ILG +AS++DEI WFK EA   
Subjt:  GGV-MAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKR

Query:  GMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKA
        G++L+ + P KA   Y RFL S   PE+ Y VA+T  W +E VY +SF +C+ DG+KTPPEL   C+RWG+ GF +YC +L++I DR +  A  +  + A
Subjt:  GMNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKA

Query:  EVALLRVLEYEVGFWNMN
        E A +RVLE E+GFW+M+
Subjt:  EVALLRVLEYEVGFWNMN

Q9ASY9 Bifunctional TENA-E protein7.9e-6856.87Show/hide
Query:  DAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEIV
        D W+ KHR IY  ATRH FV++IRDG++D S+F +W+ Q+  F+R F  FV SVL++A K+S + +D EV+LG +ASLNDEI WFK+E  K  ++ S +V
Subjt:  DAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEIV

Query:  PQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRVL
        PQ+A   Y RFLE LM  EV+Y V +TA WA+EAVY ESFA+CL DG+KTP EL  AC RWGN+GF +YC+++KNIA+R +E A+GEV  +AE  L+RVL
Subjt:  PQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRVL

Query:  EYEVGFWNMNR
        E EV FW M+R
Subjt:  EYEVGFWNMNR

Q9SWB6 Probable bifunctional TENA-E protein1.0e-7561.79Show/hide
Query:  TDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEI
        T+ WL+KHRL+Y  ATRHP +++IRDGTI+ ++F +W+ Q+  F+R+F  FV SVL+KAWKESD   D EVILG +ASL DEI WFK EA K G++LS++
Subjt:  TDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEI

Query:  VPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRV
        VPQ+A   Y   LESLM P+ EYTVAITA WA+E VY ESFA+C+ +GSKTPPEL+E C RWGNE FGKYC +L+NIA+R ++ A+ E  KKAEV LL V
Subjt:  VPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRV

Query:  LEYEVGFWNMNR
        LE+EV FWNM+R
Subjt:  LEYEVGFWNMNR

Arabidopsis top hitse value%identityAlignment
AT3G16990.1 Haem oxygenase-like, multi-helical5.6e-6956.87Show/hide
Query:  DAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEIV
        D W+ KHR IY  ATRH FV++IRDG++D S+F +W+ Q+  F+R F  FV SVL++A K+S + +D EV+LG +ASLNDEI WFK+E  K  ++ S +V
Subjt:  DAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRGMNLSEIV

Query:  PQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRVL
        PQ+A   Y RFLE LM  EV+Y V +TA WA+EAVY ESFA+CL DG+KTP EL  AC RWGN+GF +YC+++KNIA+R +E A+GEV  +AE  L+RVL
Subjt:  PQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRVL

Query:  EYEVGFWNMNR
        E EV FW M+R
Subjt:  EYEVGFWNMNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACCCGAAGACCCGACCGCAACTCGGCGGAGTAATGGCCGCCACCGACGCATGGCTCAGAAAGCACCGCCTCATCTACATCGAAGCCACTCGCCATCCTTTCGT
CCTAACCATTCGCGACGGCACCATCGATTTCTCCGCCTTCTCTTCTTGGGTGGAACAGGAATGCGAATTTCTGCGATCCTTCGCGGCGTTCGTCGGGAGTGTGTTGGTGA
AGGCATGGAAAGAATCGGACGACAGAGCGGACGAGGAGGTGATACTTGGAAGTTTGGCTTCTCTGAACGACGAAATCGGGTGGTTCAAGAAGGAAGCCCTAAAACGGGGG
ATGAATTTGAGTGAGATTGTTCCTCAGAAGGCAACCGCCGGCTACTCCAGGTTTCTGGAGAGTCTGATGCGGCCGGAAGTGGAATATACGGTGGCGATTACGGCTCTTTG
GGCGATGGAAGCCGTGTACCACGAGAGCTTTGCGTACTGCCTGGGAGATGGATCAAAAACGCCGCCGGAATTGAGAGAAGCGTGCGAGAGGTGGGGGAATGAAGGATTTG
GAAAGTACTGCAATACCTTGAAGAACATTGCGGATCGGCGGATGGAGATGGCCGCCGGAGAAGTGGCGAAGAAAGCAGAAGTGGCGTTGTTGCGAGTTCTTGAATATGAG
GTTGGGTTCTGGAATATGAACCGCAGGGATCCAGTTCCGCTCCCACAATGCACCGTTCCGGTGGCCGGAGCCACC
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACCCGAAGACCCGACCGCAACTCGGCGGAGTAATGGCCGCCACCGACGCATGGCTCAGAAAGCACCGCCTCATCTACATCGAAGCCACTCGCCATCCTTTCGT
CCTAACCATTCGCGACGGCACCATCGATTTCTCCGCCTTCTCTTCTTGGGTGGAACAGGAATGCGAATTTCTGCGATCCTTCGCGGCGTTCGTCGGGAGTGTGTTGGTGA
AGGCATGGAAAGAATCGGACGACAGAGCGGACGAGGAGGTGATACTTGGAAGTTTGGCTTCTCTGAACGACGAAATCGGGTGGTTCAAGAAGGAAGCCCTAAAACGGGGG
ATGAATTTGAGTGAGATTGTTCCTCAGAAGGCAACCGCCGGCTACTCCAGGTTTCTGGAGAGTCTGATGCGGCCGGAAGTGGAATATACGGTGGCGATTACGGCTCTTTG
GGCGATGGAAGCCGTGTACCACGAGAGCTTTGCGTACTGCCTGGGAGATGGATCAAAAACGCCGCCGGAATTGAGAGAAGCGTGCGAGAGGTGGGGGAATGAAGGATTTG
GAAAGTACTGCAATACCTTGAAGAACATTGCGGATCGGCGGATGGAGATGGCCGCCGGAGAAGTGGCGAAGAAAGCAGAAGTGGCGTTGTTGCGAGTTCTTGAATATGAG
GTTGGGTTCTGGAATATGAACCGCAGGGATCCAGTTCCGCTCCCACAATGCACCGTTCCGGTGGCCGGAGCCACC
Protein sequenceShow/hide protein sequence
MADPKTRPQLGGVMAATDAWLRKHRLIYIEATRHPFVLTIRDGTIDFSAFSSWVEQECEFLRSFAAFVGSVLVKAWKESDDRADEEVILGSLASLNDEIGWFKKEALKRG
MNLSEIVPQKATAGYSRFLESLMRPEVEYTVAITALWAMEAVYHESFAYCLGDGSKTPPELREACERWGNEGFGKYCNTLKNIADRRMEMAAGEVAKKAEVALLRVLEYE
VGFWNMNRRDPVPLPQCTVPVAGAT