; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021933 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021933
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBasic helix-loop-helix (bHLH) DNA-binding family protein
Genome locationscaffold110:304388..304894
RNA-Seq ExpressionMS021933
SyntenyMS021933
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR045084 - Transcription factor AIB/MYC-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441948.1 PREDICTED: transcription factor MYC3-like [Cucumis melo]1.3e-2951.52Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV
        H EA+R RREKLN  F+SLR ++PN S  D  +A++LSDAVSYINEL++K+ ++ES +  KK       N  +E  S   +    +E++VKIIG DRAV+
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV

Query:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL
        R++++N+SYAVA+LMEALRDL LKV H +M N  D+TLQDLV+ +P      ++EGIK +LL IL
Subjt:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL

XP_022144463.1 transcription factor bHLH14-like [Momordica charantia]4.3e-3053.57Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKS-------VMEVEVEVKIIGLDRAV
        H EA+R RREKLNH F  LRS++PN S  D  +A++LSDAV YI+ELQ K++DLE     +  + E +  +   +         +EVEVEVKI+GL+ AV
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKS-------VMEVEVEVKIIGLDRAV

Query:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP---QVETTEEGIKASLLKIL
        +R+QTKNMS  +A+LMEALRDLELKVHHA+M N ND TL D+V+GL P   Q    EE I+ SLLK L
Subjt:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP---QVETTEEGIKASLLKIL

XP_022144464.1 transcription factor bHLH14-like [Momordica charantia]6.8e-7695.73Show/hide
Query:  MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKSVMEVEVEVKIIGLDRAV
        MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK NPEDEATSGAKKSVMEVEVEVKIIGLDRAV
Subjt:  MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKSVMEVEVEVKIIGLDRAV

Query:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKI
        VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE     LLK+
Subjt:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKI

XP_031736394.1 transcription factor bHLH14-like [Cucumis sativus]2.5e-3051.5Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--------PNPEDEATSGAKKSVMEVEVEVKIIGLDRA
        H EA+R RREKLN  F+SLRS++PN S  D  +A++LSDAVSYINEL++K+ ++ES +  +K         N  +E  S   +    +E++VKIIG DRA
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--------PNPEDEATSGAKKSVMEVEVEVKIIGLDRA

Query:  VVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL
        V+R++++N+SYAVA+LMEALRDLELKV H +M N  D+TLQDLV+ +P     +++EGIK +LL IL
Subjt:  VVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL

XP_038890552.1 transcription factor MYC3-like [Benincasa hispida]7.4e-3057.75Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVVRIQT
        H EA+R RREKLNH F+SLRS++P+ S  D  +A++LS+A+SYINELQ K+ ++ES +  KK     E+EA+S  ++  + +E++VKIIG D+AV+R+QT
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVVRIQT

Query:  KNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLP
        KN+SYAVA+LMEALRDLELKV HA+M N  D+TLQDLV+ +P
Subjt:  KNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLP

TrEMBL top hitse value%identityAlignment
A0A0A0LKN8 BHLH domain-containing protein1.2e-3051.5Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--------PNPEDEATSGAKKSVMEVEVEVKIIGLDRA
        H EA+R RREKLN  F+SLRS++PN S  D  +A++LSDAVSYINEL++K+ ++ES +  +K         N  +E  S   +    +E++VKIIG DRA
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK--------PNPEDEATSGAKKSVMEVEVEVKIIGLDRA

Query:  VVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL
        V+R++++N+SYAVA+LMEALRDLELKV H +M N  D+TLQDLV+ +P     +++EGIK +LL IL
Subjt:  VVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL

A0A1S3B4K7 transcription factor MYC3-like6.1e-3051.52Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV
        H EA+R RREKLN  F+SLR ++PN S  D  +A++LSDAVSYINEL++K+ ++ES +  KK       N  +E  S   +    +E++VKIIG DRAV+
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV

Query:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL
        R++++N+SYAVA+LMEALRDL LKV H +M N  D+TLQDLV+ +P      ++EGIK +LL IL
Subjt:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL

A0A5A7V6X0 Transcription factor MYC3-like6.1e-3051.52Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV
        H EA+R RREKLN  F+SLR ++PN S  D  +A++LSDAVSYINEL++K+ ++ES +  KK       N  +E  S   +    +E++VKIIG DRAV+
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK------PNPEDEATSGAKKSVMEVEVEVKIIGLDRAVV

Query:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL
        R++++N+SYAVA+LMEALRDL LKV H +M N  D+TLQDLV+ +P      ++EGIK +LL IL
Subjt:  RIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP-QVETTEEGIKASLLKIL

A0A6J1CSD3 transcription factor bHLH14-like3.3e-7695.73Show/hide
Query:  MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKSVMEVEVEVKIIGLDRAV
        MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKK NPEDEATSGAKKSVMEVEVEVKIIGLDRAV
Subjt:  MNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKSVMEVEVEVKIIGLDRAV

Query:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKI
        VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE     LLK+
Subjt:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKI

A0A6J1CTB5 transcription factor bHLH14-like2.1e-3053.57Show/hide
Query:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKS-------VMEVEVEVKIIGLDRAV
        H EA+R RREKLNH F  LRS++PN S  D  +A++LSDAV YI+ELQ K++DLE     +  + E +  +   +         +EVEVEVKI+GL+ AV
Subjt:  HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKS-------VMEVEVEVKIIGLDRAV

Query:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP---QVETTEEGIKASLLKIL
        +R+QTKNMS  +A+LMEALRDLELKVHHA+M N ND TL D+V+GL P   Q    EE I+ SLLK L
Subjt:  VRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPP---QVETTEEGIKASLLKIL

SwissProt top hitse value%identityAlignment
A0A060KY90 Transcription factor MYC12.0e-1735.37Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDVKKPNPEDEATSGAKKS--------VMEVEVEV
        +H EA+R RREKLN  F +LR+++PN S  D  +A++L DA++YINEL+ K+       E+L S ++  +    ++ +S    S        +++++++V
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDVKKPNPEDEATSGAKKS--------VMEVEVEV

Query:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE
        K+IG D A++RIQ    ++  ARLM AL+DL+L VHHA++   ND+ +Q   + +  ++   E+
Subjt:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE

A0A3Q7HRZ6 Transcription factor MYC21.6e-1939.31Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHL-DVKK----------PNPEDEATSGAKKSVMEVE
        +H EA+R RREKLN  F +LR+++PN S  D  +A++L DA+SYINEL+ K+       EDL+S + D+KK          PN + + +S     +++V+
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHL-DVKK----------PNPEDEATSGAKKSVMEVE

Query:  VEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASL
        ++VKIIG D A++RIQ    ++  ARLM AL +L+L VHHA++   ND+ +Q   + +  +   TEE ++ +L
Subjt:  VEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASL

O23090 Transcription factor bHLH142.3e-1835.23Show/hide
Query:  HAEA-KRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDE---ATSGAKKSVME---------------VEV
        H EA K+RREKLNH F +LR+++P  S  D  +A++LSDAVSYI  L+ K++DLE+ +   K    D+   ++S    S +E               +EV
Subjt:  HAEA-KRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDE---ATSGAKKSVME---------------VEV

Query:  EVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKIL
        +VKI+G + A++R+QT+N+++  + LM AL +++ +V HA     + + +QD+V+ L P+   +E+ ++ +L++ L
Subjt:  EVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKIL

O49687 Transcription factor MYC46.7e-1839.02Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDV-------KKPNPEDEATSGAKKSVM-EVEVEV
        +H EA+R RREKLN  F SLR+++PN S  D  +A++L DA+SYI+EL+ K+       E+L+  +DV        K + +D      + SV+ E+EV+V
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDV-------KKPNPEDEATSGAKKSVM-EVEVEV

Query:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE
        KIIG D A++RIQ    ++  A+ MEAL++L+L+V+HA++   ND+ +Q   + +  Q  T ++
Subjt:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE

Q39204 Transcription factor MYC23.9e-1838.37Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESH-LDVKKPNPE---------------DEATSGAKKSVMEVEVE
        +H EA+R RREKLN  F +LR+++PN S  D  +A++L DA++YINEL+ K+   ES  L +K    E               D ++S +    + +E+E
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESH-LDVKKPNPE---------------DEATSGAKKSVMEVEVE

Query:  VKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLL
        VKIIG D A++R+++   ++  ARLM AL DLEL+V+HA+M   ND+ +Q   + +  ++ T E+ ++ASL+
Subjt:  VKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLL

Arabidopsis top hitse value%identityAlignment
AT1G32640.1 Basic helix-loop-helix (bHLH) DNA-binding family protein2.8e-1938.37Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESH-LDVKKPNPE---------------DEATSGAKKSVMEVEVE
        +H EA+R RREKLN  F +LR+++PN S  D  +A++L DA++YINEL+ K+   ES  L +K    E               D ++S +    + +E+E
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESH-LDVKKPNPE---------------DEATSGAKKSVMEVEVE

Query:  VKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLL
        VKIIG D A++R+++   ++  ARLM AL DLEL+V+HA+M   ND+ +Q   + +  ++ T E+ ++ASL+
Subjt:  VKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLL

AT4G00870.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-1935.23Show/hide
Query:  HAEA-KRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDE---ATSGAKKSVME---------------VEV
        H EA K+RREKLNH F +LR+++P  S  D  +A++LSDAVSYI  L+ K++DLE+ +   K    D+   ++S    S +E               +EV
Subjt:  HAEA-KRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDE---ATSGAKKSVME---------------VEV

Query:  EVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKIL
        +VKI+G + A++R+QT+N+++  + LM AL +++ +V HA     + + +QD+V+ L P+   +E+ ++ +L++ L
Subjt:  EVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKIL

AT4G17880.1 Basic helix-loop-helix (bHLH) DNA-binding family protein4.8e-1939.02Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDV-------KKPNPEDEATSGAKKSVM-EVEVEV
        +H EA+R RREKLN  F SLR+++PN S  D  +A++L DA+SYI+EL+ K+       E+L+  +DV        K + +D      + SV+ E+EV+V
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKM-------EDLESHLDV-------KKPNPEDEATSGAKKSVM-EVEVEV

Query:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE
        KIIG D A++RIQ    ++  A+ MEAL++L+L+V+HA++   ND+ +Q   + +  Q  T ++
Subjt:  KIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE

AT5G46760.1 Basic helix-loop-helix (bHLH) DNA-binding family protein2.0e-1737.04Show/hide
Query:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLES---HLDVKKPNPEDEATSG----------------AKKSVMEV
        +H EA+R RREKLN  F SLR+++PN S  D  +A++L DA+SYINEL+ K++  ES    +  K      E  +G                +  S +E+
Subjt:  DHAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLES---HLDVKKPNPEDEATSG----------------AKKSVMEV

Query:  EVEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQ
        E++VKIIG D  ++R+Q     +  AR MEAL++L+L+V+HA++   ND+ +Q   + +  Q
Subjt:  EVEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQ

AT5G46830.1 NACL-inducible gene 17.6e-1736.93Show/hide
Query:  KEGMNKGAGND----HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKME--DLESHLDVKKPNPEDEATSGAKKSVMEV---
        K G     G D    H EA+R RREKLNH F +LR+++PN S  D  + ++L DAV YINEL+ K E  +LE H  ++    E +  +G + ++  V   
Subjt:  KEGMNKGAGND----HAEAKR-RREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKME--DLESHLDVKKPNPEDEATSGAKKSVMEV---

Query:  --------EVEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE
                ++EVKI+  D A+VR++++   +  ARLM AL DLEL+V+HA++   ND+ +Q   + +  ++   EE
Subjt:  --------EVEVKIIGLDRAVVRIQTKNMSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAAGAAGGGATGAACAAAGGAGCTGGGAATGATCATGCCGAGGCAAAGCGGCGGCGGGAGAAGCTCAACCACCACTTCTCCTCCCTCCGCTCGCTACTCCCCAATGCTTC
GAATAGCGACAATAGCGAAGCGGCTGTTCTGTCTGATGCTGTTTCGTACATCAACGAGCTCCAAGTCAAGATGGAGGACTTGGAGTCTCATCTAGACGTCAAGAAACCCA
ATCCAGAGGACGAAGCGACATCGGGAGCAAAAAAGAGTGTGATGGAAGTGGAAGTTGAAGTGAAGATTATAGGATTGGATCGGGCAGTGGTGAGAATTCAGACAAAGAAT
ATGAGCTATGCAGTGGCTAGGCTAATGGAGGCCCTTAGAGACTTGGAGTTGAAGGTCCACCATGCCACCATGTGCAACTCAAATGATATCACATTGCAAGATCTGGTCAT
TGGGCTTCCTCCACAAGTAGAAACCACAGAAGAAGGTATTAAAGCCTCACTTCTCAAAATATTATGC
mRNA sequenceShow/hide mRNA sequence
AAAGAAGGGATGAACAAAGGAGCTGGGAATGATCATGCCGAGGCAAAGCGGCGGCGGGAGAAGCTCAACCACCACTTCTCCTCCCTCCGCTCGCTACTCCCCAATGCTTC
GAATAGCGACAATAGCGAAGCGGCTGTTCTGTCTGATGCTGTTTCGTACATCAACGAGCTCCAAGTCAAGATGGAGGACTTGGAGTCTCATCTAGACGTCAAGAAACCCA
ATCCAGAGGACGAAGCGACATCGGGAGCAAAAAAGAGTGTGATGGAAGTGGAAGTTGAAGTGAAGATTATAGGATTGGATCGGGCAGTGGTGAGAATTCAGACAAAGAAT
ATGAGCTATGCAGTGGCTAGGCTAATGGAGGCCCTTAGAGACTTGGAGTTGAAGGTCCACCATGCCACCATGTGCAACTCAAATGATATCACATTGCAAGATCTGGTCAT
TGGGCTTCCTCCACAAGTAGAAACCACAGAAGAAGGTATTAAAGCCTCACTTCTCAAAATATTATGC
Protein sequenceShow/hide protein sequence
KEGMNKGAGNDHAEAKRRREKLNHHFSSLRSLLPNASNSDNSEAAVLSDAVSYINELQVKMEDLESHLDVKKPNPEDEATSGAKKSVMEVEVEVKIIGLDRAVVRIQTKN
MSYAVARLMEALRDLELKVHHATMCNSNDITLQDLVIGLPPQVETTEEGIKASLLKILC