CuGenDBv2

MS007684 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS007684
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: transcription factor bHLH92
Genome location: scaffold13:466271..467343
RNA-Seq Expression: MS007684
Synteny: MS007684
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR044658 - Transcription factor bHLH92/bHLH041-like
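The genome location above can be split into its scaffold name and coordinates for downstream use. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; the format is inferred
    from the 'Genome location' field of this record.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("scaffold13:466271..467343")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both end points.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 1073 bp on scaffold13.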


Homology
GenBank top hits (columns: e value, %identity, alignment)
KAG6593858.1 Transcription factor basic helix-loop-helix 92, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.2e-68; %identity: 60
Query:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM
        MDDGFPVEFW +D +WLD      D AP  + SAF PY  +      Q NN  T   +    S N++KR+IE+WRK+W EK K A  G DLERE+++RHM
Subjt:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM

Query:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV
        LNER+RR+K R+SY ELHSMLP KTKNDKNSIVQMAA TI+ELK  E +L+ RN+ELEMAL+A+KR++E G TT IRVA+ANPSSGINSML +LN+LKTV
Subjt:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV

Query:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER
        GVN K I A F D+ FS  + ID    M AAEVER +Q+T++EAERKFQ Q + R
Subjt:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER

XP_022138697.1 transcription factor bHLH92 [Momordica charantia]; e value: 3.8e-134; %identity: 96.96
Query:  MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERL
        MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLAT AA+VGDNSRNINKRMIEFWRKKW EKNKAAAAGEDLEREKNYRHMLNERL
Subjt:  MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERL

Query:  RRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK
        RR+KQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILE R+LELE+ALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK
Subjt:  RRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK

Query:  VISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
        VI+ANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
Subjt:  VISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN

XP_022930317.1 transcription factor bHLH92-like [Cucurbita moschata]; e value: 6.2e-68; %identity: 60
Query:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM
        MDDGFPVEFW +D  WLD     +D AP  + SAF PY  +      Q NN  T   ++   S N++KR+IE+WRK+W EK K A  G DLERE+++RHM
Subjt:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM

Query:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV
        LNER+RR+K R+SY ELHSMLP KTKNDKNSIVQMAA TI+ELK  E +L+ RN+ELEMAL+A+KR++E G TT IRVA+ANPSSGINSML +LN+LKTV
Subjt:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV

Query:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER
        GVN K I A F D+ FS  + ID    M AAEVER +Q+TL+EAERK Q Q + R
Subjt:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER

XP_023000539.1 transcription factor bHLH92-like [Cucurbita maxima]; e value: 5.6e-69; %identity: 60
Query:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM
        MDDGFPVEFW +D +WLD     +D AP  + SAF PY  +     GQ NN      +    S NI+KR+I++WRKKW EK K A  G DLEREK+++HM
Subjt:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM

Query:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV
        LNER+RR+K R+SY ELHSMLP KTKNDKNSIVQMAA TI+ELK  E +L+ RN+ELEMAL+A+KR++E G TT IRVA+ANPSSGINSML +LN+LKTV
Subjt:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV

Query:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIK
        GVN K I A F D++FS  + I+    M AAEVER +Q+TL+EAERKFQ QC+E +  +K
Subjt:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIK

XP_038875776.1 transcription factor bHLH92 [Benincasa hispida]; e value: 6.6e-78; %identity: 60.57
Query:  MDDGFPVEFWQSDAFWLDADL---------APVARTSAFDPYSTRTNARFGQDNNLATPAAVV-------GDNSRNINKRMIEFWRKKWLEKNKAAAAGE
        MDD F VE+W +D FWLDA +         AP  + SAF PY TR     GQ+NN  T AA           NSRN+NKRM+E+WRK W EK +  + G 
Subjt:  MDDGFPVEFWQSDAFWLDADL---------APVARTSAFDPYSTRTNARFGQDNNLATPAAVV-------GDNSRNINKRMIEFWRKKWLEKNKAAAAGE

Query:  DLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINS
        D EREK +RHMLNER+RR+KQ++SYL LHSMLPK TKNDKNSI+Q A  TIQE+K LE  L+ RNLELEMA+A +K+EKE GTT  I VAL+NPS GINS
Subjt:  DLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINS

Query:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
        MLA+LN LKTVGVN K I A FF+++FSAQL ID    MGAAEVER +Q+TL EAERKFQ QC+E +K IK+++FNFNN
Subjt:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN

TrEMBL top hits (columns: e value, %identity, alignment)
A0A1S3CPU0 transcription factor bHLH92; e value: 1.1e-67; %identity: 57.76
Query:  MDDGFPVEFWQSDAFWLDADLA----------PVARTSAFDPY-STRTNARFGQDNN---LATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDL
        MDD FPVEFW +D FWLDA ++          P  + SAF PY S       GQDNN     T  A    +SRN+NKRMIE+W K W EK +   +  DL
Subjt:  MDDGFPVEFWQSDAFWLDADLA----------PVARTSAFDPY-STRTNARFGQDNN---LATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDL

Query:  EREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRN--LELEMALAARKREKENGTTTTIRVALANPSSGINS
        EREK +RHMLNER+RR+KQ++SYL LHSMLPK TKNDKNSIVQ AA TIQE+K LE  L+ RN  LE+E+A+A +K+EKE      I VAL N S GINS
Subjt:  EREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRN--LELEMALAARKREKENGTTTTIRVALANPSSGINS

Query:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNF
        ML +LNVLKTVGVN K I A FF+++FSAQL ID    MGAAEVER +Q+TL EAERKF+ Q  E +K IK +YF F
Subjt:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNF

A0A5A7UQX2 Transcription factor bHLH92; e value: 1.1e-67; %identity: 57.76
Query:  MDDGFPVEFWQSDAFWLDADLA----------PVARTSAFDPY-STRTNARFGQDNN---LATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDL
        MDD FPVEFW +D FWLDA ++          P  + SAF PY S       GQDNN     T  A    +SRN+NKRMIE+W K W EK +   +  DL
Subjt:  MDDGFPVEFWQSDAFWLDADLA----------PVARTSAFDPY-STRTNARFGQDNN---LATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDL

Query:  EREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRN--LELEMALAARKREKENGTTTTIRVALANPSSGINS
        EREK +RHMLNER+RR+KQ++SYL LHSMLPK TKNDKNSIVQ AA TIQE+K LE  L+ RN  LE+E+A+A +K+EKE      I VAL N S GINS
Subjt:  EREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRN--LELEMALAARKREKENGTTTTIRVALANPSSGINS

Query:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNF
        ML +LNVLKTVGVN K I A FF+++FSAQL ID    MGAAEVER +Q+TL EAERKF+ Q  E +K IK +YF F
Subjt:  MLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNF

A0A6J1CBV8 transcription factor bHLH92; e value: 1.9e-134; %identity: 96.96
Query:  MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERL
        MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLAT AA+VGDNSRNINKRMIEFWRKKW EKNKAAAAGEDLEREKNYRHMLNERL
Subjt:  MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERL

Query:  RRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK
        RR+KQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILE R+LELE+ALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK
Subjt:  RRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVK

Query:  VISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
        VI+ANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
Subjt:  VISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN

A0A6J1EQ49 transcription factor bHLH92-like; e value: 3.0e-68; %identity: 60
Query:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM
        MDDGFPVEFW +D  WLD     +D AP  + SAF PY  +      Q NN  T   ++   S N++KR+IE+WRK+W EK K A  G DLERE+++RHM
Subjt:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM

Query:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV
        LNER+RR+K R+SY ELHSMLP KTKNDKNSIVQMAA TI+ELK  E +L+ RN+ELEMAL+A+KR++E G TT IRVA+ANPSSGINSML +LN+LKTV
Subjt:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV

Query:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER
        GVN K I A F D+ FS  + ID    M AAEVER +Q+TL+EAERK Q Q + R
Subjt:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQER

A0A6J1KK97 transcription factor bHLH92-like; e value: 2.7e-69; %identity: 60
Query:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM
        MDDGFPVEFW +D +WLD     +D AP  + SAF PY  +     GQ NN      +    S NI+KR+I++WRKKW EK K A  G DLEREK+++HM
Subjt:  MDDGFPVEFWQSDAFWLD-----ADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHM

Query:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV
        LNER+RR+K R+SY ELHSMLP KTKNDKNSIVQMAA TI+ELK  E +L+ RN+ELEMAL+A+KR++E G TT IRVA+ANPSSGINSML +LN+LKTV
Subjt:  LNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTV

Query:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIK
        GVN K I A F D++FS  + I+    M AAEVER +Q+TL+EAERKFQ QC+E +  +K
Subjt:  GVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIK

SwissProt top hits (columns: e value, %identity, alignment)
Q75KV9 Transcription factor BHLH148; e value: 5.7e-16; %identity: 30.13
Query:  APVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMI-------------------EFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRR
        A  AR SAF  Y    +A           A  V     NI++R++                   E   ++ + + +    G D+E  + +RHM+ ER RR
Subjt:  APVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMI-------------------EFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRR

Query:  DKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVI
        +K  +SY +L++M+  ++K DKNSIVQ AA  I ELK     L+ RN EL+  +       E     T++  +  PSS I+SM+A L  LK + V  + I
Subjt:  DKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVI

Query:  SANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAER
         ++        +++++ T  + A EVE+ V+  L E ER
Subjt:  SANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAER

Q9FIX5 Transcription factor bHLH92; e value: 6.1e-26; %identity: 42
Query:  QDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLE
        +D  L   +  +  +  N+ KRM+   RK W EK    A     E+E++ RHML ER RR+KQ++SYL LHS+LP  TKNDKNSIV+ A   I +L+ L+
Subjt:  QDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLE

Query:  AILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERK
          L  R   +E   A  K   +  + T +RV L  P SG++SML  L+ LK++G  +K + ANF   EFSA + I+ TQ+ G  EVE+ V+  L E E K
Subjt:  AILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERK

Arabidopsis top hits (columns: e value, %identity, alignment)
AT5G43650.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein; e value: 4.4e-27; %identity: 42
Query:  QDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLE
        +D  L   +  +  +  N+ KRM+   RK W EK    A     E+E++ RHML ER RR+KQ++SYL LHS+LP  TKNDKNSIV+ A   I +L+ L+
Subjt:  QDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLE

Query:  AILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERK
          L  R   +E   A  K   +  + T +RV L  P SG++SML  L+ LK++G  +K + ANF   EFSA + I+ TQ+ G  EVE+ V+  L E E K
Subjt:  AILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHTQMMGAAEVERVVQLTLTEAERK

AT5G56960.1 basic helix-loop-helix (bHLH) DNA-binding family protein; e value: 4.0e-04; %identity: 41.1
Query:  RHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKEN
        +HM++ER RR+K  +S+  L S+LP  TK DK S++ +A   +  L+   + L  RN E+E  LA  +RE EN
Subjt:  RHMLNERLRRDKQRKSYLELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKEN


Sequences
CDS sequence
ATGGACGACGGCTTTCCGGTGGAATTCTGGCAGAGCGATGCCTTTTGGCTCGATGCCGATCTCGCTCCGGTGGCCCGAACAAGTGCTTTTGACCCGTACTCGACCCGAAC
CAATGCCAGGTTCGGGCAAGATAACAATCTCGCCACCCCTGCAGCCGTTGTCGGTGACAATTCTCGGAACATCAACAAGAGGATGATCGAATTCTGGAGAAAGAAGTGGC
TTGAAAAAAATAAAGCAGCAGCTGCAGGAGAAGATCTAGAGAGGGAGAAGAATTATAGGCATATGTTGAATGAAAGGTTGAGGAGGGATAAACAAAGAAAGAGCTACCTT
GAGCTGCACTCCATGCTCCCAAAAAAAACTAAGAACGATAAGAATTCGATCGTTCAAATGGCGGCGAGTACAATACAAGAGCTTAAAACATTAGAGGCAATTTTAGAGGG
GAGAAATTTAGAGTTGGAGATGGCATTAGCGGCACGGAAGAGAGAGAAAGAAAATGGGACGACGACGACGATTAGGGTAGCGTTGGCGAACCCTTCGTCGGGGATCAACT
CGATGCTTGCCATTCTCAACGTTCTCAAAACCGTCGGAGTAAACGTGAAAGTCATTAGCGCTAATTTCTTCGACGCTGAGTTTTCAGCACAATTAGACATTGATCACACC
CAAATGATGGGAGCTGCCGAAGTGGAAAGAGTGGTGCAGCTGACGCTAACCGAAGCAGAGAGGAAATTTCAAGGGCAGTGCCAGGAAAGAACAAAATTCATAAAACAAAG
TTATTTTAATTTTAATAAT
mRNA sequence
ATGGACGACGGCTTTCCGGTGGAATTCTGGCAGAGCGATGCCTTTTGGCTCGATGCCGATCTCGCTCCGGTGGCCCGAACAAGTGCTTTTGACCCGTACTCGACCCGAAC
CAATGCCAGGTTCGGGCAAGATAACAATCTCGCCACCCCTGCAGCCGTTGTCGGTGACAATTCTCGGAACATCAACAAGAGGATGATCGAATTCTGGAGAAAGAAGTGGC
TTGAAAAAAATAAAGCAGCAGCTGCAGGAGAAGATCTAGAGAGGGAGAAGAATTATAGGCATATGTTGAATGAAAGGTTGAGGAGGGATAAACAAAGAAAGAGCTACCTT
GAGCTGCACTCCATGCTCCCAAAAAAAACTAAGAACGATAAGAATTCGATCGTTCAAATGGCGGCGAGTACAATACAAGAGCTTAAAACATTAGAGGCAATTTTAGAGGG
GAGAAATTTAGAGTTGGAGATGGCATTAGCGGCACGGAAGAGAGAGAAAGAAAATGGGACGACGACGACGATTAGGGTAGCGTTGGCGAACCCTTCGTCGGGGATCAACT
CGATGCTTGCCATTCTCAACGTTCTCAAAACCGTCGGAGTAAACGTGAAAGTCATTAGCGCTAATTTCTTCGACGCTGAGTTTTCAGCACAATTAGACATTGATCACACC
CAAATGATGGGAGCTGCCGAAGTGGAAAGAGTGGTGCAGCTGACGCTAACCGAAGCAGAGAGGAAATTTCAAGGGCAGTGCCAGGAAAGAACAAAATTCATAAAACAAAG
TTATTTTAATTTTAATAAT
Protein sequence
MDDGFPVEFWQSDAFWLDADLAPVARTSAFDPYSTRTNARFGQDNNLATPAAVVGDNSRNINKRMIEFWRKKWLEKNKAAAAGEDLEREKNYRHMLNERLRRDKQRKSYL
ELHSMLPKKTKNDKNSIVQMAASTIQELKTLEAILEGRNLELEMALAARKREKENGTTTTIRVALANPSSGINSMLAILNVLKTVGVNVKVISANFFDAEFSAQLDIDHT
QMMGAAEVERVVQLTLTEAERKFQGQCQERTKFIKQSYFNFNN
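
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies (the record does not name a translation table); `translate` is a hypothetical helper, and only the first 30 and last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Hypothetical helper; trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 24 nt of the CDS listed above.
cds_start = "ATGGACGACGGCTTTCCGGTGGAATTCTGG"
cds_end = "CAAAGTTATTTTAATTTTAATAAT"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The two translated fragments correspond to the first ten and last eight residues of the protein sequence. The listed CDS ends on an AAT (N) codon, so no stop codon appears in the translation of the final fragment.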