; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0594 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0594
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionBHLH domain-containing protein
Genome locationMC05:4553526..4557888
RNA-Seq ExpressionMC05g0594
SyntenyMC05g0594
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582238.1 Transcription factor basic helix-loop-helix 95, partial [Cucurbita argyrosperma subsp. sororia]2.23e-9180.49Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+P STNSDS GGG VIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

XP_022138122.1 uncharacterized protein LOC111009370 [Momordica charantia]5.62e-132100Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
        MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK

Query:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
Subjt:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

Query:  LNK
        LNK
Subjt:  LNK

XP_022955986.1 uncharacterized protein LOC111457820 [Cucurbita moschata]2.23e-9180.49Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+P STNSDS GGG VIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

XP_022979931.1 uncharacterized protein LOC111479473 [Cucurbita maxima]3.57e-8978.47Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKR RA LEA GG Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI
        +KK P ESV  TM+P STNSDS GG    GGVIVS S NIVLFGI+ ASVRRGMVT+ILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I
Subjt:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI

Query:  KNDILSLNK
        +NDILSL K
Subjt:  KNDILSLNK

XP_023528352.1 uncharacterized protein LOC111791298 [Cucurbita pepo subsp. pepo]7.07e-9179.23Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQL RLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGG--GVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN
        +KK P ESV  TM+P STNSDS GGG  GVIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TAS+HGY+EN IE+I+N
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGG--GVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN

Query:  DILSLNK
        DILSL K
Subjt:  DILSLNK

TrEMBL top hitse value%identityAlignment
A0A1S4DSM0 transcription factor bHLH95-like5.04e-7971.01Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKG-RAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEAC+DSV+PL   ILP+   EA +  +AS SRKR RA LEA GG+QK  + KRKEM++SFDVL+SLVPNLSPKATRE IVS  IQFI+FL+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKG-RAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGG--VIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN
        +KK  SESV  T++P S NSDSSGG G  VIVS SGNIVLFG I+ASV+RGMVTQIL+ FER++ EVLAANV VSHGNL+LT+TASVHGY+EN IE+I+N
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGG--VIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN

Query:  DILSLNK
        DILSL K
Subjt:  DILSLNK

A0A5A7U767 Transcription factor bHLH95-like5.04e-7971.01Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKG-RAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEAC+DSV+PL   ILP+   EA +  +AS SRKR RA LEA GG+QK  + KRKEM++SFDVL+SLVPNLSPKATRE IVS  IQFI+FL+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKG-RAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGG--VIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN
        +KK  SESV  T++P S NSDSSGG G  VIVS SGNIVLFG I+ASV+RGMVTQIL+ FER++ EVLAANV VSHGNL+LT+TASVHGY+EN IE+I+N
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGG--VIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN

Query:  DILSLNK
        DILSL K
Subjt:  DILSLNK

A0A6J1CA71 uncharacterized protein LOC1110093702.72e-132100Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
        MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK

Query:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
Subjt:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

Query:  LNK
        LNK
Subjt:  LNK

A0A6J1GWN9 uncharacterized protein LOC1114578201.08e-9180.49Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+P STNSDS GGG VIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

A0A6J1IS55 uncharacterized protein LOC1114794731.73e-8978.47Show/hide
Query:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LL  ILPV +SEAA+ NKASTSRKR RA LEA GG Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI
        +KK P ESV  TM+P STNSDS GG    GGVIVS S NIVLFGI+ ASVRRGMVT+ILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I
Subjt:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI

Query:  KNDILSLNK
        +NDILSL K
Subjt:  KNDILSLNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01260.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

AT1G01260.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

AT1G01260.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCTGCACCGACTCTGTCATTCCACTTTTGACGCAGATTTTGCCGGTCCGTGAATCTGAAGCCGCCGATCACAACAAGGCTTCCACCTCGAGAAAGCGTCGCAG
AGCCGATCTGGAGGCCGGCGGAGGTCTACAGAAAGGGAGAGCGAAGAGGAAGGAGATGAACCAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATCTCTCTCCCAAGG
CCACGAGGGAGAATATTGTTTCCGAAACGATCCAGTTCATCGATTTTCTGGAGAAGCAGTTGATGAGGCTGGAAATGAAGAAGAAATTGCCATCGGAATCGGTGATCGCG
ACGATGATTCCGCCGAGTACGAACTCGGATTCATCCGGCGGAGGCGGCGTTATCGTCTCGGCCTCCGGCAACATCGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGG
TATGGTGACACAGATTTTAATGGCGTTTGAAAGAAACCAGGCTGAAGTTCTTGCAGCAAATGTTGCAGTCAGCCATGGAAATTTAAGTTTGACAATCACGGCTTCTGTAC
ACGGTTACATTGAGAATGCCATAGAGAAGATTAAAAACGATATCCTGAGCTTAAACAAGTAA
mRNA sequenceShow/hide mRNA sequence
CGCAGGAGTGTACAATCATGATCGAAATAACAAGTCAATCAAGAAAGAATATATGTCAATTTTGAAAAATGATCGGTTTTAGGGTGAAACCGACCAATGAACGGTTCAAT
CAAGTGGTAAAAGAACCCAAGTAACACTATTTACACATTTCAACTCATATGAACTGGGTTGAAAGATGAAATCAGCACCACTAATTTGTGATCAAATCACAAATGGTTGA
TCGAATCGGACGGCGAGAATGGAAACAGAAAAAGAAAAGTACAATCAACAGCAGAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAATCAAAACAGTAAAATCGGGAGTTGG
AATTTTCCATCGTATGGACTCTTCTCTGCGCTCTGTACTGCACACACAATTATTCTAACTGCAAATTTCTTCCACTTCAGACCAATCTCTCTCTCTCTCTCTCTCTCTCA
CGCACATTCTGCAACTGTTTTCATTGCTCCATAAAAACCCTCATCAATGGCAGAATAATGAACACCAACAAGACGGCGCCGCGGAGATTTGCAACCTTCTACTTCTTCTT
CTACTAATGGAGGCCTGCACCGACTCTGTCATTCCACTTTTGACGCAGATTTTGCCGGTCCGTGAATCTGAAGCCGCCGATCACAACAAGGCTTCCACCTCGAGAAAGCG
TCGCAGAGCCGATCTGGAGGCCGGCGGAGGTCTACAGAAAGGGAGAGCGAAGAGGAAGGAGATGAACCAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATCTCTCTC
CCAAGGCCACGAGGGAGAATATTGTTTCCGAAACGATCCAGTTCATCGATTTTCTGGAGAAGCAGTTGATGAGGCTGGAAATGAAGAAGAAATTGCCATCGGAATCGGTG
ATCGCGACGATGATTCCGCCGAGTACGAACTCGGATTCATCCGGCGGAGGCGGCGTTATCGTCTCGGCCTCCGGCAACATCGTGTTGTTTGGGATTCTTGCTTCTGTTCG
ACGAGGTATGGTGACACAGATTTTAATGGCGTTTGAAAGAAACCAGGCTGAAGTTCTTGCAGCAAATGTTGCAGTCAGCCATGGAAATTTAAGTTTGACAATCACGGCTT
CTGTACACGGTTACATTGAGAATGCCATAGAGAAGATTAAAAACGATATCCTGAGCTTAAACAAGTAATAAATTCCATTTTCACGAGATTCTTCCATTTTTATTTACTCC
GAGGGGATCTTCAATACTTCTACACAAAAAAACAGCTTAGAAAGTGAAAGAATACAATACAATGGTCATATATATATATATTTGGATTTTGCTTAAAATCAATGTTATTG
ATCAATATTGATCAGGTTCAAGTTGGGTTCATCTTCTGGTTCTACATATATAATATTATATATTTGGCAACAAATCTCTCTCTCTCTCTTTCTCTCTCTCCACTGGTTTA
TTTTGATGGAAACTGTTCTGTTTCTGAGATCAATGTTCTGATTATAGTTGTTATCATTTCTATTTTCCATCTTGGTCGGCTTCAACCTTTCCGGATATTATTTCTATTTT
CAAAATATCCATTCAATAAATTTTAAAAATTGTGATAATAAATTTGCATTAAACAATTTCTTTTGCTGAAATTTTGCTAATATAGAGACCAAATTAATAATTTTTTAGTA
CAACAAATGAGTTGTGCTCATATTGATCAAATTAGTCCATTTTGAAAAATACATGAACCAAAAATGAACATTTTGAAAGTACAATGACCAAAATAAACCAAAACTAAAAA
TCCCGAAACCAAAATAGGATATAAACCTATAGACTTTTACAAATTTGTTATGGGTCTCTTATTAGACATTGAGAGCTGACCCAAACACATATAAGTCACTAGGTGAAATT
AAATTAGTAATGGTGGACTCAAATCCTGACCCATTGAATTTTCTTTATCTCATTTTTGGCAAAGCTTGTCTGTTTAGGAATGGAATATTTTTTGAACAAAGAGGGTTGAT
GAAACATCCAACCAAATTAAATTTGGTCATAAATTGTTGCAAAAAGATTCCTCAAAGCCACATGAGAAGAAACTCTATAATATATGTGTGTGTGTGTATATATATCTGAT
GGTTAGGATTCAGCACAGTCAACAGGAGCAAAGGAAATAAGTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGCTGAAAGTTCCCCAACACAGAAATTGGAGA
TCCAGAGCGCAGAAGGGCAAGGCACACAACTCCATCATCCTCCATCTTCACAAAGGTACTTTCTAAACTAAGAACTAAATCTGCACCATCAAAATGAACTGTAACGACTG
GTAATGACTCCAAATCATCTCCATTTCCTGCAAAGCACACTTCAAATCTGTTTCTAGGGTCATCTTTTCTCTTTGGTAAACCTGGAAATGCAACCAACTTATCTATCAAA
CCATCAAATGCATCTGTTTCGAGGCTCGAGTATGTCATTCCTGAATCTACGATCCATCCATCCCCTACATCAAACACATCAGAAACTCCATCCAAATACAGCTCATCTTT
GCCAAGGCTGACTCCCACAACCTTCACATAATAGGCATCTAAATTGGGATACAACAGAGGAGTTTGGCCCCCAGAAGTCACAGGCAGTGATCCAAAATACATTTTACTTG
CTGATCCCAAATTGAATGGGACCAAACAGTAGGAGAATTTCTTGACACCCAGTTGAGAGATTAGTGAGAGGGGTGTCTGGTTCAAGCCCACACTGCCCATGTAACTCTGC
AACCCTCCTGCTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATCTGAGAAGAATTGCAGGTTTGGAAGCCTGTCAAGGAATTGCAG
AAGTTAGAGCCACATGGCTCCTTCTCATAGGTGGAGGATTTGGAGGAGCTGAACTTGGTGTGGCCTTTTTCTGCCTCACAATGGCTTCTACAATCTGAGCATTGCACCCA
AATGAGACCATTTGATGTGTCTGCAAACCCCACCACTCTACTTGGAGGATTTCCAATGAAGAAACTCATGAGGTAATTGAGCCGGGACCGAGAACGGTGAACGGTTGCCT
CGATCCGTGCAGTGTCTGTTATGGAATGATTGTAAAATGGTGATAAAGGCGAGTCGCGGTGAATCAAATGTGCAGTGAAGCCAACTTCAGTTGGTAAGACCATATGTCTT
GCTGTTGATTCAAGGATGAAGGAAAAAAGGAAGAAAATCTTGCTTCCAAAACAATGGTTGGGACTCATTTGATTAAGCTTTATATCAAGAACAATTTTTTCATTTCCTTT
TATATGGTTGGGGAAGGGGAGATAGTTTGTTAAGTCACACCATTGGTGATTAGAGTTTGTGCAGTTGTAAAAATTCAGGTGGTGCATGATCTTCTTGTGGTTTCCAATGT
GCTAGTCGAAAAAATGGACAAAGAACTAACAGCAATTTTGTTCGATGGTGTTTGCCAACTCAAAAGAGAAGTAAAGACCATCCTTTGCTATCCTTGGCATTGTCTCGAGT
CTCTTGAACCACAGCATCTCGATGTCTTGGTAATGAAAAAACTCGGTTCAACTTCCCAATTGGATTCAGCTCATGACTATATGTTGATTACAAAAGCACATTTTCGAATC
GTGCTGAAATTTCATCTGAAATCAAGGCACTTCTTCTACTGATTTAGTTTTTTGCCTTTACGTCCACCTCTAATCCAGCATTTTGTAAACTACCGAGCAATGTAGTAATA
TAAGATTTAAGATCTGCATCAAATTCCTGGGAAATGAAACAGGAGATAAATGTTTAACTTATTTCTCATGAATGATGAAAGAGATGAATAAATAATGTAAGATATTAGAC
CTCTTTGTGAGCTTCTTTTTGGTGCTTACTAACTTGCACAGGCCACTCATTGACCAGATTTTCAATCTCAACCATGAAGTCTGACTTTGAATTTGGAAAGGCCTGCCAAT
AAAGAAACGGAATGAGCTATGAAATGTTTAATACTCAAGTATTCATTTTGTCCGCCGCCTTAACATAGCCCAACTGTCAGAAAAAAATTGAACATTTCAATCACCACCTC
ACATGTTTAAC
Protein sequenceShow/hide protein sequence
MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIA
TMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILSLNK