; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009661 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009661
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription factor bHLH162-like
Genome locationchr9:41215049..41216092
RNA-Seq ExpressionLag0009661
SyntenyLag0009661
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR015660 - Achaete-scute transcription factor-related
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604789.1 Transcription factor basic helix-loop-helix 162, partial [Cucurbita argyrosperma subsp. sororia]2.2e-6165.5Show/hide
Query:  MEGSREGNIQQN---VGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQ
        ME +RE   QQN   V + KVERKVMEKNRR QMKLLYS LNSLLP + S + PLTVSDQI+EAIKYIKSLE KL+K  EKKE  L RL  S + S  + 
Subjt:  MEGSREGNIQQN---VGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQ

Query:  HTVPTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEE
              S N NSPEL+IKEMGS VEVV T G EDQ LF E+I +F EER EIIN++YSV EN  LYS+HAEIEDVVYE GAKKL ERL +LV E KSD E
Subjt:  HTVPTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEE

Query:  MQAGASSSSGHGGTDLPANTATTSSDRHF
        M AG +SSSGHGGT  PAN  T SS  HF
Subjt:  MQAGASSSSGHGGTDLPANTATTSSDRHF

XP_022944108.1 transcription factor bHLH162-like [Cucurbita moschata]7.8e-5969Show/hide
Query:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-
        MK LYSKLNSLLPTHHSNELPL+V DQI+EAIKYIKSLE KL+KD+EKKE F RR       SSSS +  PTRS+N N PEL+IKEMGSAVEVVL+ GL 
Subjt:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-

Query:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHFWSH
        ED+F+FYEII IF EER EI+NVSYSV+ N+VLYSL+AEIEDVVYEFGA K TER++RLV   ++D EM+A A SSS HGG   P N  +T SD  F+SH
Subjt:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHFWSH

XP_023512282.1 transcription factor bHLH162-like [Cucurbita pepo subsp. pepo]3.2e-6069.9Show/hide
Query:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGLE
        MKLLYSKLNSLLPTHHSNELPL+VSDQI+EAIKYIKSLE KL+KD+EKKE F RR      +SSSS +  PTR++N N PEL+IKEMGS VEVVL+ GLE
Subjt:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGLE

Query:  DQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHF
        D+F+FYEII IF EER EI+NVSYSV+ N+VLYSLHAEIEDVVYEFGA K TER++RLV   ++D EM+A A SSS HGG   P N  +T SD  F
Subjt:  DQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHF

XP_038902350.1 transcription factor bHLH162-like [Benincasa hispida]1.2e-6269.01Show/hide
Query:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE
        K+ERKV+EKNRR +MK LYS LNSLLPT HSN LPLTVS+QI+E IK IKSLE KL+KD+EKKE  LR+ K SLS+ S   +T  + +QN N PELKIKE
Subjt:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE

Query:  MGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPAN
        MGSAVEVVLT GLED+ +FYEIIRIF EERVEIINVSYS LE+T++YSLHAEIEDVVYEFG  KL ERL++LV E  +D E+QA A  SS HGGT+ PA+
Subjt:  MGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPAN

Query:  TATTSSDRHFWSH
          TTSS  +FWS+
Subjt:  TATTSSDRHFWSH

XP_038902351.1 transcription factor bHLH162-like [Benincasa hispida]2.1e-6768.56Show/hide
Query:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV
        M  +RE N Q N    KVERKVMEKNRR +MK LY+ LNSLLP  HSNELPLTV DQID+AIKYIKSLE  L+KD+EKKE  LR+ K S   SSSS +T 
Subjt:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV

Query:  PTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQA
         + S N N PELKIKEMGSAVEVVLT GLED+ +FYEIIRIF EERVEIIN+SYS+LENT++YSLH EIEDVVYEFG  KL ERL++LV E  +D E+QA
Subjt:  PTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQA

Query:  GASSSSGHGGTDLPANTATTSSDRHFWSH
         A SSS HGGT+ PA+  T SSD +FWSH
Subjt:  GASSSSGHGGTDLPANTATTSSDRHFWSH

TrEMBL top hitse value%identityAlignment
A0A6J1FXP8 transcription factor bHLH162-like3.8e-5969Show/hide
Query:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-
        MK LYSKLNSLLPTHHSNELPL+V DQI+EAIKYIKSLE KL+KD+EKKE F RR       SSSS +  PTRS+N N PEL+IKEMGSAVEVVL+ GL 
Subjt:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-

Query:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHFWSH
        ED+F+FYEII IF EER EI+NVSYSV+ N+VLYSL+AEIEDVVYEFGA K TER++RLV   ++D EM+A A SSS HGG   P N  +T SD  F+SH
Subjt:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHFWSH

A0A6J1G8B1 transcription factor bHLH162-like2.1e-5763.8Show/hide
Query:  MEGSRE--GNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQH
        ME  RE  GN QQN  + KVERKV+EKNRR QMKLLYSKLNSLLP + S + PLTV DQI+EAIKYI+SL  KL+K KEKK+  L               
Subjt:  MEGSRE--GNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQH

Query:  TVPTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEM
        + PTRS + N PELKIKEMGSAVEVV    L D+FLFYEIIRIF EER EIIN +YSVL++TVLYSLHAEIEDV+Y FGA KLTERL+RL  E KSD E 
Subjt:  TVPTRSQNCNSPELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEM

Query:  QAGASSSSGHGGTDLPANTAT
        QA  +SSSG+GG+    +T T
Subjt:  QAGASSSSGHGGTDLPANTAT

A0A6J1G8Q1 transcription factor bHLH162-like1.0e-5666.18Show/hide
Query:  MEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVE
        MEKNRR QMKLLYS LNSLLP + S + PLTVSDQI+EAIKYIKSLE KL+K KEKKE  L RL  S + S  +       S N NSPEL+IKEMGS VE
Subjt:  MEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVE

Query:  VVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSS
        VV T G ED+ LF E+I +F EER EIIN++YSV E+  LYS+HA+IEDVVYE GAKKL ERL +LV E KSD EM AG +SSSGHGGT  PAN  T SS
Subjt:  VVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSS

Query:  DRHF
         RHF
Subjt:  DRHF

A0A6J1I4R7 transcription factor bHLH162-like6.7e-5665.2Show/hide
Query:  MEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVE
        MEKNRR QMKLLYS LNSLLP + S + PLTVSDQI+EAIKYIKSLE KL+K KEKKE  L  L  S + S  +       S N NSPEL+IKEMGS VE
Subjt:  MEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVE

Query:  VVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSS
        VV   G EDQ LF E+I +F EER EIIN++YS+ E+ +LYS+HAEIEDVVYE GAKKL ERL +LV E KSD E+ AG +SSSGHGGT  PA   T SS
Subjt:  VVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSS

Query:  DRHF
        DRHF
Subjt:  DRHF

A0A6J1JF05 transcription factor bHLH162-like3.0e-5667.51Show/hide
Query:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-
        MK LYSKLNSLLPTHHSNE+PL+VSDQI+EAIKYIKSLE KL+K++EKKE F R+       SSSS +T PTRS+N N PEL+IKE+GSAVEVVL+ GL 
Subjt:  MKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKEMGSAVEVVLTCGL-

Query:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHF
        ED+F+FYEII IF +ER EI+NVSYSV+ N+VLYSLHAEIEDVVYEFGA K TER++RLV    +D EM+A A SSS  GG+  P N  +T SD  F
Subjt:  EDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATTSSDRHF

SwissProt top hitse value%identityAlignment
F4JIJ7 Transcription factor bHLH1624.5e-2541.9Show/hide
Query:  SIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFL--RRLKRSLSTSSSSQHTVPTRSQNCNSPEL
        S  V+RK +EKNRR QMK LYS+L SLLP HHS+  PLT+ DQ+DEA  YIK L+  +EK +E+K + +    L++  S  SSS  +    S     P++
Subjt:  SIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFL--RRLKRSLSTSSSSQHTVPTRSQNCNSPEL

Query:  KIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREE-RVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAK-KLTERLRRLV
        +I+E GS   + L   LE +F+F EIIR+  EE   EI +  YS++++ V ++LH ++E+  +++GA+ ++ ERL ++V
Subjt:  KIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREE-RVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAK-KLTERLRRLV

Q9FLI1 Transcription factor bHLH361.6e-0626.29Show/hide
Query:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE
        K+  +  E+ RR +M  LY+ L SLLP H       T SDQ++EA+ YIK L+ K+++   +++  +   + SL  SS+          +     + +++
Subjt:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE

Query:  MGSAVEVVLT---CGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRL
            VE++L+   CG   Q  F  ++++  E  + ++N   S++++ ++Y++ AE+ D+       +L +RL R+
Subjt:  MGSAVEVVLT---CGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRL

Q9XIJ1 Transcription factor bHLH1684.0e-0525.13Show/hide
Query:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV
        ME +RE   + +  S++ +R + EK RR +MK L+    S+L +H S    L V   ID+A+ Y+  L+ K+        ++L  +KR +         V
Subjt:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV

Query:  PTRSQNCN-SPELKIKEMGSAVEVVLTCGLEDQ-FLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLV
          RS+  +  P+L I+ + S +E+ L   L  +  + ++++ +F EE  ++++ +   L +   Y++ A+           ++ ERLR ++
Subjt:  PTRSQNCN-SPELKIKEMGSAVEVVLTCGLEDQ-FLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLV

Arabidopsis top hitse value%identityAlignment
AT1G10586.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.8e-0625.13Show/hide
Query:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV
        ME +RE   + +  S++ +R + EK RR +MK L+    S+L +H S    L V   ID+A+ Y+  L+ K+        ++L  +KR +         V
Subjt:  MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTV

Query:  PTRSQNCN-SPELKIKEMGSAVEVVLTCGLEDQ-FLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLV
          RS+  +  P+L I+ + S +E+ L   L  +  + ++++ +F EE  ++++ +   L +   Y++ A+           ++ ERLR ++
Subjt:  PTRSQNCN-SPELKIKEMGSAVEVVLTCGLEDQ-FLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLV

AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.2e-2641.9Show/hide
Query:  SIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFL--RRLKRSLSTSSSSQHTVPTRSQNCNSPEL
        S  V+RK +EKNRR QMK LYS+L SLLP HHS+  PLT+ DQ+DEA  YIK L+  +EK +E+K + +    L++  S  SSS  +    S     P++
Subjt:  SIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFL--RRLKRSLSTSSSSQHTVPTRSQNCNSPEL

Query:  KIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREE-RVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAK-KLTERLRRLV
        +I+E GS   + L   LE +F+F EIIR+  EE   EI +  YS++++ V ++LH ++E+  +++GA+ ++ ERL ++V
Subjt:  KIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREE-RVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAK-KLTERLRRLV

AT5G51780.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.1e-0726.29Show/hide
Query:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE
        K+  +  E+ RR +M  LY+ L SLLP H       T SDQ++EA+ YIK L+ K+++   +++  +   + SL  SS+          +     + +++
Subjt:  KVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSPELKIKE

Query:  MGSAVEVVLT---CGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRL
            VE++L+   CG   Q  F  ++++  E  + ++N   S++++ ++Y++ AE+ D+       +L +RL R+
Subjt:  MGSAVEVVLT---CGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTTCTCGTGAAGGGAATATTCAACAAAATGTTGGTTCTATTAAGGTCGAAAGAAAGGTTATGGAGAAGAATCGTAGAACCCAGATGAAATTGCTTTACTCTAA
ACTCAATTCTCTCCTCCCAACCCATCACTCCAATGAGCTGCCACTGACAGTGTCGGATCAGATCGACGAAGCCATAAAGTACATAAAATCACTCGAGACCAAGCTGGAGA
AGGATAAGGAGAAGAAGGAGAGCTTTTTGAGAAGATTGAAAAGGTCGTTGTCGACGTCGTCGTCGTCGCAGCACACGGTGCCGACGAGGAGCCAGAATTGTAACTCACCT
GAACTGAAAATCAAAGAGATGGGTTCGGCCGTGGAGGTTGTTTTAACATGTGGGTTGGAAGATCAGTTCTTATTTTACGAGATTATTCGCATCTTTCGTGAGGAGCGAGT
CGAGATCATCAATGTCAGTTATTCTGTTCTCGAGAATACCGTCTTGTATTCACTCCATGCAGAGATTGAAGACGTGGTGTATGAATTTGGAGCAAAGAAACTAACGGAGA
GGCTAAGAAGATTAGTTTGCGAATCGAAGAGTGATGAAGAAATGCAAGCAGGAGCTTCTTCTTCGAGCGGTCATGGGGGAACCGATCTGCCGGCGAACACTGCGACGACG
AGTTCCGATCGCCATTTTTGGAGCCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGTTCTCGTGAAGGGAATATTCAACAAAATGTTGGTTCTATTAAGGTCGAAAGAAAGGTTATGGAGAAGAATCGTAGAACCCAGATGAAATTGCTTTACTCTAA
ACTCAATTCTCTCCTCCCAACCCATCACTCCAATGAGCTGCCACTGACAGTGTCGGATCAGATCGACGAAGCCATAAAGTACATAAAATCACTCGAGACCAAGCTGGAGA
AGGATAAGGAGAAGAAGGAGAGCTTTTTGAGAAGATTGAAAAGGTCGTTGTCGACGTCGTCGTCGTCGCAGCACACGGTGCCGACGAGGAGCCAGAATTGTAACTCACCT
GAACTGAAAATCAAAGAGATGGGTTCGGCCGTGGAGGTTGTTTTAACATGTGGGTTGGAAGATCAGTTCTTATTTTACGAGATTATTCGCATCTTTCGTGAGGAGCGAGT
CGAGATCATCAATGTCAGTTATTCTGTTCTCGAGAATACCGTCTTGTATTCACTCCATGCAGAGATTGAAGACGTGGTGTATGAATTTGGAGCAAAGAAACTAACGGAGA
GGCTAAGAAGATTAGTTTGCGAATCGAAGAGTGATGAAGAAATGCAAGCAGGAGCTTCTTCTTCGAGCGGTCATGGGGGAACCGATCTGCCGGCGAACACTGCGACGACG
AGTTCCGATCGCCATTTTTGGAGCCACTAG
Protein sequenceShow/hide protein sequence
MEGSREGNIQQNVGSIKVERKVMEKNRRTQMKLLYSKLNSLLPTHHSNELPLTVSDQIDEAIKYIKSLETKLEKDKEKKESFLRRLKRSLSTSSSSQHTVPTRSQNCNSP
ELKIKEMGSAVEVVLTCGLEDQFLFYEIIRIFREERVEIINVSYSVLENTVLYSLHAEIEDVVYEFGAKKLTERLRRLVCESKSDEEMQAGASSSSGHGGTDLPANTATT
SSDRHFWSH