; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G003510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G003510
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBHLH domain-containing protein
Genome locationCmo_Chr18:2297311..2300966
RNA-Seq ExpressionCmoCh18G003510
SyntenyCmoCh18G003510
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:2000112 - regulation of cellular macromolecule biosynthetic process (biological process)
GO:0090575 - RNA polymerase II transcription factor complex (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR015660 - Achaete-scute transcription factor-related
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573270.1 Transcription factor basic helix-loop-helix 162, partial [Cucurbita argyrosperma subsp. sororia]1.4e-8394.86Show/hide
Query:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL
        M DNFIHYSTSTKTDRKI+EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGS+EGEIKARLL+QVEAHQVGSSLEFL
Subjt:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL

Query:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        LTTGS+YHLVLSQILQLLQEN TQIVHINHSTIIDR+FHKIV QMVGEGMNSEGVDGERICETVKKFVSQYKDG+
Subjt:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

KAG7012436.1 Transcription factor bHLH [Cucurbita argyrosperma subsp. argyrosperma]4.9e-9763.91Show/hide
Query:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL
        M DNFIHYSTSTKTDRKI+EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLM S+EGEIKARLL+QVEAHQVGSSLEFL
Subjt:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL

Query:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQ--------------------------YKDG
        LTTGSDYHLVLSQILQLLQEN TQIVHINHSTIIDR+FHKIV QMVGEGMNSEGVDGERICETVKKF+ +                            + 
Subjt:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQ--------------------------YKDG

Query:  ERGEIESEERETNGIGR--------KKPKE----------------------------------------------RQILQLLQENETQIVHINQSTVTD
        E   ++  E  TN I +        K+ KE                                              +Q+LQLL+EN TQIV+IN STV D
Subjt:  ERGEIESEERETNGIGR--------KKPKE----------------------------------------------RQILQLLQENETQIVHINQSTVTD

Query:  LVFHKIITEMVGEGTTFESAEGERICKIVKKFVSQYKD
         VFHKII EMVGEGTT ES EGERIC+ VKKFVSQYKD
Subjt:  LVFHKIITEMVGEGTTFESAEGERICKIVKKFVSQYKD

XP_022994474.1 transcription factor bHLH167-like [Cucurbita maxima]4.5e-6679.89Show/hide
Query:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG
        M +NFIH  +STK DRK+I      EMK LFS LHSLVPNQSSTE ETTL  QLENATNYI+QLKENVEKLKEKREKLMGS+EGE KARLLVQVEAHQVG
Subjt:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG

Query:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD
        SSLE LLTTGSDY  VL+QILQLLQEN TQIV+INHST+ DRVFHKI+A+MVGEG  SE   GERICETVKKFVSQYKD
Subjt:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD

XP_022994481.1 transcription factor bHLH167-like [Cucurbita maxima]1.3e-7890.86Show/hide
Query:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL
        M DNFIHYSTSTKTDRK IEMK LFSTLHSLVPNQ STEGETTLPVQ+ENATNYI+QLKENVEKLKEKREKL+GS+EGEIKARLLVQVEAHQVGSSLE L
Subjt:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL

Query:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        LTTGSDY LVL+QILQLLQEN TQI+HINHSTII+RVFHKIVAQMVGEGM+SEGVDGERICETVKKFVSQYKDG+
Subjt:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

XP_023542786.1 transcription factor bHLH167-like [Cucurbita pepo subsp. pepo]2.0e-8294.86Show/hide
Query:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL
        M DNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSS EGE TLPVQLENATNYI+QLKENVEKLKEKREKLMGS+EGEIKARLLVQVEAHQVGSSLEFL
Subjt:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL

Query:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        LTTGSDYHLVLSQILQLLQEN TQIVHINHSTIIDRVFHKIVAQMVGEGM+SEGVDGERICETVKKFVSQYK+G+
Subjt:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

TrEMBL top hitse value%identityAlignment
A0A6J1GT92 uncharacterized protein LOC1114572296.5e-6375Show/hide
Query:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLM-----GSQEGEIKARLLVQVE
        M +NFIH  +S K DRK+I      EMK LFS LHSLVPNQSS E ETTL  QLENATNYI+QLKENVEKLKEK+EKLM      ++  EIKARLLVQVE
Subjt:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLM-----GSQEGEIKARLLVQVE

Query:  AHQVGSSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD
        AHQVGSSLEFLLTT SDYHLVL Q+LQLLQEN TQIV+INHST+ DRVFHKI+A+MVGEG  SE ++GERICETVKKFVSQYKD
Subjt:  AHQVGSSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD

A0A6J1GVE3 transcription factor bHLH162-like3.3e-5166.85Show/hide
Query:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG
        M +N IH  + +++D K+I      EMK LFS LHSLVPN+S  EGET L  Q+ENATNYI QLKENVEKLKEK+EKLM   E       +VQVEA  VG
Subjt:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG

Query:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        SSLEFLLTT  DYH +L QILQLLQEN T+IVHIN ST+ DR+FHKI+ QMVGEGMNSE  +GERICE VKKFVSQYKDG+
Subjt:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

A0A6J1JZ96 transcription factor bHLH167-like6.5e-7990.86Show/hide
Query:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL
        M DNFIHYSTSTKTDRK IEMK LFSTLHSLVPNQ STEGETTLPVQ+ENATNYI+QLKENVEKLKEKREKL+GS+EGEIKARLLVQVEAHQVGSSLE L
Subjt:  MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFL

Query:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        LTTGSDY LVL+QILQLLQEN TQI+HINHSTII+RVFHKIVAQMVGEGM+SEGVDGERICETVKKFVSQYKDG+
Subjt:  LTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

A0A6J1K2Y3 transcription factor bHLH167-like2.2e-6679.89Show/hide
Query:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG
        M +NFIH  +STK DRK+I      EMK LFS LHSLVPNQSSTE ETTL  QLENATNYI+QLKENVEKLKEKREKLMGS+EGE KARLLVQVEAHQVG
Subjt:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVG

Query:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD
        SSLE LLTTGSDY  VL+QILQLLQEN TQIV+INHST+ DRVFHKI+A+MVGEG  SE   GERICETVKKFVSQYKD
Subjt:  SSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKD

A0A6J1K483 transcription factor bHLH162-like1.7e-5870.97Show/hide
Query:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQE-----GEIKARLLVQVE
        M DNFI+  +S KTD++II      EMK L S LHSLVPNQ+STEGETTLP QLENATNYI+QLKENVEKLKEKREKLMG  E      E KAR++VQVE
Subjt:  MTDNFIHYSTSTKTDRKII------EMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQE-----GEIKARLLVQVE

Query:  AHQVGSSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE
        AH VGSSLE LLTTGSDYHLVL QI+QLLQEN T+IV IN ST+ +R FHKI+AQM GEG   EG  GERICE VKKFVS YKD +
Subjt:  AHQVGSSLEFLLTTGSDYHLVLSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGE

SwissProt top hitse value%identityAlignment
F4I4E1 Transcription factor bHLH1674.9e-0730.63Show/hide
Query:  DRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEI--KARLLVQVEAHQVGSSLEF-LLTTGSDYHLVL
        DR+ + MK LFS L S V    S   +  +P  ++ AT+Y+ QLKENV  LKEK+  L+  + G +   + LL ++      S++E  L+   +   ++L
Subjt:  DRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEI--KARLLVQVEAHQVGSSLEF-LLTTGSDYHLVL

Query:  SQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQY
         +++ + +E   Q++  N   + DR  + I+AQ +   ++  G+D  RI E V+K +  Y
Subjt:  SQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQY

F4JIJ7 Transcription factor bHLH1623.6e-1033.77Show/hide
Query:  STKTDRKIIE------MKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKR---------EKLMGSQEGEIKA-------RLLVQV
        S   DRK +E      MK+L+S L SL+P+ SSTE   TLP QL+ A NYI++L+ NVEK +E++         EKL       + +       R L ++
Subjt:  STKTDRKIIE------MKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKR---------EKLMGSQEGEIKA-------RLLVQV

Query:  EAHQVGSSLEFLLTTGSDYHLVLSQILQLL-QENETQIVHINHSTIIDRVFHKI
        E  + GS     L T  ++  +  +I+++L +E   +I H  +S + D VFH +
Subjt:  EAHQVGSSLEFLLTTGSDYHLVLSQILQLL-QENETQIVHINHSTIIDRVFHKI

Q9XIJ1 Transcription factor bHLH1684.6e-0528.66Show/hide
Query:  IEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLS
        + MK LFS L S V    S      +P  ++ A +Y+ QLKE V  L E + +++G   GE+K R     LL ++    + S +E  L    +   ++L 
Subjt:  IEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLS

Query:  QILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVS
        +++ + +E   Q++  N   + DR F+ I+AQ +   +   G+D  RI E ++  +S
Subjt:  QILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVS

Arabidopsis top hitse value%identityAlignment
AT1G10585.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.5e-0830.63Show/hide
Query:  DRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEI--KARLLVQVEAHQVGSSLEF-LLTTGSDYHLVL
        DR+ + MK LFS L S V    S   +  +P  ++ AT+Y+ QLKENV  LKEK+  L+  + G +   + LL ++      S++E  L+   +   ++L
Subjt:  DRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEI--KARLLVQVEAHQVGSSLEF-LLTTGSDYHLVL

Query:  SQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQY
         +++ + +E   Q++  N   + DR  + I+AQ +   ++  G+D  RI E V+K +  Y
Subjt:  SQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQY

AT1G10586.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.2e-0628.66Show/hide
Query:  IEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLS
        + MK LFS L S V    S      +P  ++ A +Y+ QLKE V  L E + +++G   GE+K R     LL ++    + S +E  L    +   ++L 
Subjt:  IEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLS

Query:  QILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVS
        +++ + +E   Q++  N   + DR F+ I+AQ +   +   G+D  RI E ++  +S
Subjt:  QILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVS

AT1G10586.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.6e-0527.27Show/hide
Query:  LPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLSQILQLLQENETQIVHINHSTIIDRV
        +P  ++ A +Y+ QLKE V  L E + +++G   GE+K R     LL ++    + S +E  L    +   ++L +++ + +E   Q++  N   + DR 
Subjt:  LPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKAR-----LLVQVEAHQVGSSLEFLLTTGSDYH-LVLSQILQLLQENETQIVHINHSTIIDRV

Query:  FHKIVAQMVGEGMNSEGVDGERICETVKKFVS
        F+ I+AQ +   +   G+D  RI E ++  +S
Subjt:  FHKIVAQMVGEGMNSEGVDGERICETVKKFVS

AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.6e-1133.77Show/hide
Query:  STKTDRKIIE------MKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKR---------EKLMGSQEGEIKA-------RLLVQV
        S   DRK +E      MK+L+S L SL+P+ SSTE   TLP QL+ A NYI++L+ NVEK +E++         EKL       + +       R L ++
Subjt:  STKTDRKIIE------MKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKR---------EKLMGSQEGEIKA-------RLLVQV

Query:  EAHQVGSSLEFLLTTGSDYHLVLSQILQLL-QENETQIVHINHSTIIDRVFHKI
        E  + GS     L T  ++  +  +I+++L +E   +I H  +S + D VFH +
Subjt:  EAHQVGSSLEFLLTTGSDYHLVLSQILQLL-QENETQIVHINHSTIIDRVFHKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGACAACTTCATCCATTATTCAACATCGACCAAAACGGATAGAAAGATCATAGAGATGAAGACCCTTTTCTCCACACTCCATTCTCTTGTCCCCAACCAAAGCTC
AACGGAAGGTGAAACGACACTACCGGTTCAGTTAGAAAATGCCACAAATTATATAAGACAATTGAAGGAGAACGTAGAGAAATTGAAAGAGAAGAGAGAGAAGTTAATGG
GATCACAAGAAGGTGAAATTAAAGCAAGATTATTGGTGCAAGTTGAAGCTCATCAAGTGGGTTCTTCATTGGAGTTTCTCTTGACTACTGGATCTGATTATCACTTGGTT
TTAAGTCAAATCCTTCAGCTGCTTCAAGAAAACGAAACTCAGATCGTCCATATCAATCACTCAACGATCATAGATCGAGTTTTTCACAAGATAGTAGCTCAGATGGTGGG
AGAAGGGATGAACTCCGAAGGTGTTGATGGAGAAAGGATCTGTGAGACAGTGAAGAAGTTTGTTTCACAGTACAAAGATGGAGAACGTGGAGAAATTGAAAGTGAAGAGA
GAGAAACTAATGGGATTGGGAGAAAAAAACCTAAGGAGAGACAAATCCTCCAGCTGCTTCAAGAAAATGAAACTCAGATAGTCCATATCAATCAGTCAACGGTTACCGAT
TTGGTCTTTCACAAGATAATAACAGAGATGGTGGGAGAAGGAACGACCTTTGAAAGCGCTGAAGGTGAAAGGATTTGCAAGATAGTGAAGAAGTTTGTTTCTCAATACAA
AGATGACCCATGCACAGGTTAA
mRNA sequenceShow/hide mRNA sequence
TAACTATTCTTCGAAAAAACCAGAACTTATTAAGGTTCTCAAGTTTCCAACCATGACGGACAACTTCATCCATTATTCAACATCGACCAAAACGGATAGAAAGATCATAG
AGATGAAGACCCTTTTCTCCACACTCCATTCTCTTGTCCCCAACCAAAGCTCAACGGAAGGTGAAACGACACTACCGGTTCAGTTAGAAAATGCCACAAATTATATAAGA
CAATTGAAGGAGAACGTAGAGAAATTGAAAGAGAAGAGAGAGAAGTTAATGGGATCACAAGAAGGTGAAATTAAAGCAAGATTATTGGTGCAAGTTGAAGCTCATCAAGT
GGGTTCTTCATTGGAGTTTCTCTTGACTACTGGATCTGATTATCACTTGGTTTTAAGTCAAATCCTTCAGCTGCTTCAAGAAAACGAAACTCAGATCGTCCATATCAATC
ACTCAACGATCATAGATCGAGTTTTTCACAAGATAGTAGCTCAGATGGTGGGAGAAGGGATGAACTCCGAAGGTGTTGATGGAGAAAGGATCTGTGAGACAGTGAAGAAG
TTTGTTTCACAGTACAAAGATGGAGAACGTGGAGAAATTGAAAGTGAAGAGAGAGAAACTAATGGGATTGGGAGAAAAAAACCTAAGGAGAGACAAATCCTCCAGCTGCT
TCAAGAAAATGAAACTCAGATAGTCCATATCAATCAGTCAACGGTTACCGATTTGGTCTTTCACAAGATAATAACAGAGATGGTGGGAGAAGGAACGACCTTTGAAAGCG
CTGAAGGTGAAAGGATTTGCAAGATAGTGAAGAAGTTTGTTTCTCAATACAAAGATGACCCATGCACAGGTTAATGTTTTGTTATCTCCAAGATTTGTGATGCCTTCTGG
GTGAGGATTCAAATTTCTAATCTTTTAAATAAGGAA
Protein sequenceShow/hide protein sequence
MTDNFIHYSTSTKTDRKIIEMKTLFSTLHSLVPNQSSTEGETTLPVQLENATNYIRQLKENVEKLKEKREKLMGSQEGEIKARLLVQVEAHQVGSSLEFLLTTGSDYHLV
LSQILQLLQENETQIVHINHSTIIDRVFHKIVAQMVGEGMNSEGVDGERICETVKKFVSQYKDGERGEIESEERETNGIGRKKPKERQILQLLQENETQIVHINQSTVTD
LVFHKIITEMVGEGTTFESAEGERICKIVKKFVSQYKDDPCTG