CuGenDBv2

MC03g1045 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1045
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: transcription factor bHLH162-like
Genome location: MC03:16874771..16877506
RNA-Seq Expression: MC03g1045
Synteny: MC03g1045
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:2000112 - regulation of cellular macromolecule biosynthetic process (biological process)
GO:0090575 - RNA polymerase II transcription factor complex (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR015660 - Achaete-scute transcription factor-related
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008438283.1 PREDICTED: uncharacterized protein LOC103483440 [Cucumis melo] | e-value: 2.44e-49 | identity: 59.48%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH--HHHHQASRSPGGGGSTLP
        G + +S +LDRKT+EKNRR HMKSLCS L +L+PPSHFKI K+L+SQQNQI +VIAYINELKERVE LEK+K+AL    H  H H QA+ +      TLP
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH--HHHHQASRSPGGGGSTLP

Query:  VVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
        ++E+ DLG+ ++ +MLIS +NR+F+L Q+I+++EEEGGQVVNA  ST+G K+ 
Subjt:  VVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

XP_022137301.1 transcription factor bHLH162-like [Momordica charantia] | e-value: 9.74e-140 | identity: 100%
Query:  MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST
        MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST
Subjt:  MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST

Query:  LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK
        LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK
Subjt:  LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK

Query:  RKKEKRNSKN
        RKKEKRNSKN
Subjt:  RKKEKRNSKN

XP_022954432.1 transcription factor bHLH168-like [Cucurbita moschata] | e-value: 3.57e-48 | identity: 62.25%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV
        GGA  S+KLDRK +EKNRR+HMKSLCSKL +L+P SH+KISKDLLSQQ QISYVIAYI ELKERVE LEK+K AL+   H      S+ P    ST P+V
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV

Query:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
        E+R+L S ++ V LIS ++RN +L ++I +VEEEGG+VV+ASFSTLG K+ 
Subjt:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

XP_022994121.1 transcription factor bHLH168-like [Cucurbita maxima] | e-value: 5.34e-48 | identity: 58.48%
Query:  PLPPKSLLCLSHVRAM-AKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH
        PL P S   L++  A     GGA  S KLDRKT+EKNRR+HMKSLCSKL  L+P SH+KISKDLLSQQ QISYVIAYINELKERVE LEK+K AL+   H
Subjt:  PLPPKSLLCLSHVRAM-AKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH

Query:  HHHHQASRSPGGGGSTLPVVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
              S+      ST P+VE+++L S ++ V LIS ++RN +L ++I ++EEEGG+VV+ASFSTLG KI 
Subjt:  HHHHQASRSPGGGGSTLPVVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

XP_023542199.1 transcription factor bHLH168-like [Cucurbita pepo subsp. pepo] | e-value: 3.91e-50 | identity: 64.24%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV
        GGA  S KLDRKT+EKNRR+HMKSLCSKL +L+P SH+KISKDLLSQQ QISYVIAYINELKERVE LEKKK AL+   H      S+ P    ST P+V
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV

Query:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
        E+R+L S ++ V LIS ++RN +L ++I +VEEEGG+VV+ASFSTLG K+ 
Subjt:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3AW43 uncharacterized protein LOC103483440 | e-value: 1.18e-49 | identity: 59.48%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH--HHHHQASRSPGGGGSTLP
        G + +S +LDRKT+EKNRR HMKSLCS L +L+PPSHFKI K+L+SQQNQI +VIAYINELKERVE LEK+K+AL    H  H H QA+ +      TLP
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH--HHHHQASRSPGGGGSTLP

Query:  VVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
        ++E+ DLG+ ++ +MLIS +NR+F+L Q+I+++EEEGGQVVNA  ST+G K+ 
Subjt:  VVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

A0A6J1C669 transcription factor bHLH162-like | e-value: 4.72e-140 | identity: 100%
Query:  MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST
        MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST
Subjt:  MAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST

Query:  LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK
        LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK
Subjt:  LPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGTGEPKLIGFWEDQSANELLAK

Query:  RKKEKRNSKN
        RKKEKRNSKN
Subjt:  RKKEKRNSKN

A0A6J1GQZ4 transcription factor bHLH168-like | e-value: 1.73e-48 | identity: 62.25%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV
        GGA  S+KLDRK +EKNRR+HMKSLCSKL +L+P SH+KISKDLLSQQ QISYVIAYI ELKERVE LEK+K AL+   H      S+ P    ST P+V
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVV

Query:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
        E+R+L S ++ V LIS ++RN +L ++I +VEEEGG+VV+ASFSTLG K+ 
Subjt:  ELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

A0A6J1JY79 transcription factor bHLH168-like | e-value: 2.59e-48 | identity: 58.48%
Query:  PLPPKSLLCLSHVRAM-AKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH
        PL P S   L++  A     GGA  S KLDRKT+EKNRR+HMKSLCSKL  L+P SH+KISKDLLSQQ QISYVIAYINELKERVE LEK+K AL+   H
Subjt:  PLPPKSLLCLSHVRAM-AKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHH

Query:  HHHHQASRSPGGGGSTLPVVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
              S+      ST P+VE+++L S ++ V LIS ++RN +L ++I ++EEEGG+VV+ASFSTLG KI 
Subjt:  HHHHQASRSPGGGGSTLPVVELRDLGS-VIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

A0A6P3Z232 transcription factor bHLH162-like | e-value: 5.62e-41 | identity: 51.2%
Query:  VRAMAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGG
        VR M K   +  SLKLDRKT+E+NRR+HMK LC KL SLIPP HFK SKD+LSQQ+Q++    YI +L E++EK++ KK A          +A  +  G 
Subjt:  VRAMAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGG

Query:  ---------GSTLPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
                 G  LPV+EL+DLGS IEV+LIS L +NF+L +VI+++EEEG +VV+ASFST+G K+ 
Subjt:  ---------GSTLPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

SwissProt top hits (e-value, % identity, alignment)
A2WZ60 Protein IRON-RELATED TRANSCRIPTION FACTOR 2 | e-value: 1.3e-06 | identity: 31.61%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPS-HFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTL--
        GG+ +  KL     E++RR  +  L S L +L+P + H K     LS    +S V+ YI EL+++VE LE+KK+ L  T        +  PG  GS L  
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPS-HFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTL--

Query:  ----PVVE---LRDLGSVIEVMLISDLNRNFL-LCQVIAIVEEEGGQVVNASFST
            P+V    + D+  +++V L+S++  + L L + I ++E EG   +++S S+
Subjt:  ----PVVE---LRDLGSVIEVMLISDLNRNFL-LCQVIAIVEEEGGQVVNASFST

F4I4E1 Transcription factor bHLH167 | e-value: 3.2e-08 | identity: 33.11%
Query:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE
        G  +SL+  R   EK+RR+ MK L S L S + P+        L   + I    +Y+ +LKE V  L++KKR L      + ++       G   LP + 
Subjt:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE

Query:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK
        +R   S IE+ LI DLN +  +L ++++I EEEG QV++A+   L  +
Subjt:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK

F4JIJ7 Transcription factor bHLH162 | e-value: 4.3e-13 | identity: 30.95%
Query:  SLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST-------LP
        S  +DRKT+EKNRR+ MKSL S+L+SL+P      S + L+  +Q+     YI +L+  VEK  ++KR L  T       +  S     S        LP
Subjt:  SLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST-------LP

Query:  VVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEE-GGQVVNASFSTLGGKILPLSPYPGENFSSGSR
         +E+++ GS+  + L++ L   F+ C++I ++ EE G ++ +A +S +   +        E    G+R
Subjt:  VVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEE-GGQVVNASFSTLGGKILPLSPYPGENFSSGSR

Q0JFZ0 Protein IRON-RELATED TRANSCRIPTION FACTOR 2 | e-value: 1.0e-06 | identity: 31.61%
Query:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPS-HFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTL--
        GG+ +  KL     E++RR  +  L S L +L+P + H K     LS    +S V+ YI EL+++VE LE+KK+ L  T        +  PG  GS L  
Subjt:  GGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPS-HFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTL--

Query:  ----PVVE---LRDLGSVIEVMLISDLNRNFL-LCQVIAIVEEEGGQVVNASFST
            P+V    + D+  +++V L+S++  + L L + I ++E EG   +++S S+
Subjt:  ----PVVE---LRDLGSVIEVMLISDLNRNFL-LCQVIAIVEEEGGQVVNASFST

Q9XIJ1 Transcription factor bHLH168 | e-value: 2.9e-09 | identity: 31.76%
Query:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE
        G+ +SL+  R   EK RR+ MK L S L S + P+       L+ Q       ++Y+ +LKE+V  L + KR +      +  +       G S LP + 
Subjt:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE

Query:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK
        +R L S+IE+ L+ DLN +  +L +++++ EEEG QV++A+   L  +
Subjt:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G10585.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e-value: 2.3e-09 | identity: 33.11%
Query:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE
        G  +SL+  R   EK+RR+ MK L S L S + P+        L   + I    +Y+ +LKE V  L++KKR L      + ++       G   LP + 
Subjt:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE

Query:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK
        +R   S IE+ LI DLN +  +L ++++I EEEG QV++A+   L  +
Subjt:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK

AT1G10586.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e-value: 2.1e-10 | identity: 31.76%
Query:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE
        G+ +SL+  R   EK RR+ MK L S L S + P+       L+ Q       ++Y+ +LKE+V  L + KR +      +  +       G S LP + 
Subjt:  GAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVE

Query:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK
        +R L S+IE+ L+ DLN +  +L +++++ EEEG QV++A+   L  +
Subjt:  LRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASFSTLGGK

AT1G10586.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e-value: 2.4e-06 | identity: 31.13%
Query:  LLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVELRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASF
        LL     I   ++Y+ +LKE+V  L + KR +      +  +       G S LP + +R L S+IE+ L+ DLN +  +L +++++ EEEG QV++A+ 
Subjt:  LLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVELRDLGSVIEVMLISDLN-RNFLLCQVIAIVEEEGGQVVNASF

Query:  STLGGK
          L  +
Subjt:  STLGGK

AT2G41240.2 basic helix-loop-helix protein 100 | e-value: 7.6e-05 | identity: 31.54%
Query:  KLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHH-----HHHQASRSPGGGGSTLPVVEL
        KL+    E+ RR  + ++ S L S +PP+    ++  LS    +S  + YI EL+E+V+KL KKK  L F         +  Q S+S  G  S    V  
Subjt:  KLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHH-----HHHQASRSPGGGGSTLPVVEL

Query:  RDLGSVIEVMLISDL-NRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL
          L     ++ IS L         V++ VEE+G  +V AS S   G+ L
Subjt:  RDLGSVIEVMLISDL-NRNFLLCQVIAIVEEEGGQVVNASFSTLGGKIL

AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e-value: 3.1e-14 | identity: 30.95%
Query:  SLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST-------LP
        S  +DRKT+EKNRR+ MKSL S+L+SL+P      S + L+  +Q+     YI +L+  VEK  ++KR L  T       +  S     S        LP
Subjt:  SLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVEKLEKKKRALEFTHHHHHHQASRSPGGGGST-------LP

Query:  VVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEE-GGQVVNASFSTLGGKILPLSPYPGENFSSGSR
         +E+++ GS+  + L++ L   F+ C++I ++ EE G ++ +A +S +   +        E    G+R
Subjt:  VVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEE-GGQVVNASFSTLGGKILPLSPYPGENFSSGSR


Sequences
CDS sequence
ATGGATTTTATTTATTTATTATTATTTTTTATGCAGGGATTTCATGACACTATAAAACAGCGCTTCCCCGTTGCCCCGCTCCCTCCCAAATCACTTCTGTGTCTCTCACA
CGTACGGGCGATGGCGAAAAGGGGCGGCGCAGAAACTTCTTTGAAACTCGACCGGAAAACCATCGAGAAAAACCGCAGAGTTCACATGAAATCTCTCTGTTCGAAGCTTG
TTTCGCTTATTCCTCCGTCCCATTTCAAAATCTCTAAGGATTTGCTCTCGCAACAGAATCAGATCTCCTACGTCATCGCGTACATAAACGAACTAAAAGAAAGAGTGGAG
AAATTGGAGAAAAAGAAACGAGCTTTAGAATTTACACATCATCATCATCATCATCAAGCAAGTCGGAGTCCCGGCGGCGGCGGCTCCACGTTGCCGGTGGTTGAACTCAG
AGACTTGGGCAGCGTCATTGAAGTAATGCTCATCAGTGACCTCAACAGAAACTTTTTGCTCTGCCAAGTTATCGCCATCGTCGAGGAGGAAGGAGGCCAAGTCGTCAACG
CAAGCTTCTCCACGCTCGGTGGCAAGATTCTTCCACTCTCTCCATATCCAGGCGAAAATTTCTCGAGTGGGAGTAGAGACTTCGAGAGTAGAGAAAAGACTGCAGGAACT
GGTGAACCAAAACTTATTGGATTTTGGGAGGATCAGTCAGCAAACGAATTATTAGCAAAAAGGAAAAAAGAAAAAAGAAATTCGAAGAATTGA
mRNA sequence
GGGGATGGGCAGGACAAGAATTTGGCAAAAGTCAAAGTCAAAGTGGGAGACCACCCAGAAGCGAAAGCTAACAACTGTGCATGGATTTTATTTATTTATTATTATTTTTT
ATGCAGGGATTTCATGACACTATAAAACAGCGCTTCCCCGTTGCCCCGCTCCCTCCCAAATCACTTCTGTGTCTCTCACACGTACGGGCGATGGCGAAAAGGGGCGGCGC
AGAAACTTCTTTGAAACTCGACCGGAAAACCATCGAGAAAAACCGCAGAGTTCACATGAAATCTCTCTGTTCGAAGCTTGTTTCGCTTATTCCTCCGTCCCATTTCAAAA
TCTCTAAGGATTTGCTCTCGCAACAGAATCAGATCTCCTACGTCATCGCGTACATAAACGAACTAAAAGAAAGAGTGGAGAAATTGGAGAAAAAGAAACGAGCTTTAGAA
TTTACACATCATCATCATCATCATCAAGCAAGTCGGAGTCCCGGCGGCGGCGGCTCCACGTTGCCGGTGGTTGAACTCAGAGACTTGGGCAGCGTCATTGAAGTAATGCT
CATCAGTGACCTCAACAGAAACTTTTTGCTCTGCCAAGTTATCGCCATCGTCGAGGAGGAAGGAGGCCAAGTCGTCAACGCAAGCTTCTCCACGCTCGGTGGCAAGATTC
TTCCACTCTCTCCATATCCAGGCGAAAATTTCTCGAGTGGGAGTAGAGACTTCGAGAGTAGAGAAAAGACTGCAGGAACTGGTGAACCAAAACTTATTGGATTTTGGGAG
GATCAGTCAGCAAACGAATTATTAGCAAAAAGGAAAAAAGAAAAAAGAAATTCGAAGAATTGAATCTTTTTTCTTTTTATTTTTCTTCAGTGGATGAACATAGACTAAGA
TTGTGTGGCCCCCGACGTGAAACGCGTAATGTCTAAATTTTAATGTAGTTTTAGTTTTCAGTAACGACCAATTTCGTCCCAGTACAATGTTTTACAAATATGTCTGTCCC
GTGACTAATAAATATGTCCTTTTTATGTGCTATCTTTACTATGTTTTTGTTATTCTGATATGTTTAAAAGTCTCTAGTAGTTAGATCGTTGAAAGTTCAGATAACTTATT
AGATATGATTGTGAAAGTTTGATGACTATCTTTTAATTTTATAGTTCATGTGTATTGAATGCTAAAATTCTAT
Protein sequence
MDFIYLLLFFMQGFHDTIKQRFPVAPLPPKSLLCLSHVRAMAKRGGAETSLKLDRKTIEKNRRVHMKSLCSKLVSLIPPSHFKISKDLLSQQNQISYVIAYINELKERVE
KLEKKKRALEFTHHHHHHQASRSPGGGGSTLPVVELRDLGSVIEVMLISDLNRNFLLCQVIAIVEEEGGQVVNASFSTLGGKILPLSPYPGENFSSGSRDFESREKTAGT
GEPKLIGFWEDQSANELLAKRKKEKRNSKN
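
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the function name `translate` and the table construction are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in the conventional T, C, A, G order for each of the three positions.
# Assumption: this table is appropriate for the nuclear gene shown above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks (as in the wrapped sequence above) are ignored.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGGATTTTATTTAT"))  # MDFIY
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence (after removing line breaks) is one quick way to confirm that the two records agree.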