; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G017570 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G017570
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptiontranscription factor bHLH93
Genome locationCiama_Chr01:30998342..30999973
RNA-Seq ExpressionCaUC01G017570
SyntenyCaUC01G017570
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143539.2 transcription factor bHLH93 isoform X2 [Cucumis sativus]4.1e-17188.8Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNF++FDENPQM +S F NFP IQT NDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGFIE ETAPSVCKVEMEQMG+RE NGS MGVAELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLKEEEE GLDSNHVGFFNGISKEGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+EGSAQKAVASSDDIKE+LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

XP_008440501.1 PREDICTED: transcription factor bHLH93 isoform X2 [Cucumis melo]1.3e-16988.24Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFS+FDENPQMG+S FPNFP IQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGF+E ETAPSVCKVEMEQMG+RE NGS MG+AELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLK EEE GLDSNHVG FNGIS EGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+EGSAQKAVASSDDIK++LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

XP_011657932.1 transcription factor bHLH93 isoform X1 [Cucumis sativus]1.0e-16988.55Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNF++FDENPQM +S F NFP IQT NDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGFIE ETAPSVCKVEMEQMG+RE NGS MGVAELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLKEEEE GLDSNHVGFFNGISKEGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+E GSAQKAVASSDDIKE+LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL

XP_038882569.1 transcription factor bHLH93-like isoform X1 [Benincasa hispida]5.0e-16989.14Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNF++FDENPQMGTS FPNFP IQ ANDFSFADQQLY NFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNS-GYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM
        E+SNKNS GYPPAAMEEEELGFIE ETAPSVCKVEMEQ+G+RETN SKMGVAELGKR+S KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM
Subjt:  EMSNKNS-GYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM

Query:  DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLST
        DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKE KSNEVQVRNSPK                F+VER ERETRIDICCATRPGLLLST
Subjt:  DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLST

Query:  VNTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL
        VNTLEALGLEIQQCVISCFNDFSMQASCSE GSAQKA+ASSD IKE+LFRNAGYGGKCL
Subjt:  VNTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL

XP_038882570.1 transcription factor bHLH93-like isoform X2 [Benincasa hispida]2.0e-17089.39Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNF++FDENPQMGTS FPNFP IQ ANDFSFADQQLY NFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNS-GYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM
        E+SNKNS GYPPAAMEEEELGFIE ETAPSVCKVEMEQ+G+RETN SKMGVAELGKR+S KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM
Subjt:  EMSNKNS-GYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKM

Query:  DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLST
        DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKE KSNEVQVRNSPK                F+VER ERETRIDICCATRPGLLLST
Subjt:  DRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLST

Query:  VNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        VNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKA+ASSD IKE+LFRNAGYGGKCL
Subjt:  VNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

TrEMBL top hitse value%identityAlignment
A0A0A0KG39 BHLH domain-containing protein2.0e-17188.8Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNF++FDENPQM +S F NFP IQT NDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGFIE ETAPSVCKVEMEQMG+RE NGS MGVAELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLKEEEE GLDSNHVGFFNGISKEGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+EGSAQKAVASSDDIKE+LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

A0A1S3B0U8 transcription factor bHLH93 isoform X26.3e-17088.24Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFS+FDENPQMG+S FPNFP IQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGF+E ETAPSVCKVEMEQMG+RE NGS MG+AELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLK EEE GLDSNHVG FNGIS EGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+EGSAQKAVASSDDIK++LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

A0A1S3B204 transcription factor bHLH93 isoform X11.6e-16887.99Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFS+FDENPQMG+S FPNFP IQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKN+GYPP AMEEEELGF+E ETAPSVCKVEMEQMG+RE NGS MG+AELGKRSS KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLK EEE GLDSNHVG FNGIS EGKSNEVQVRNSPK                F+VER E+ETRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQASC+E GSAQKAVASSDDIK++LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL

A0A6J1GCM1 transcription factor bHLH93-like isoform X19.5e-15882.96Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPW+SSYSNGFNDFFQN WNF  FDENPQMGTS  P+FP +QTA DFSFADQ LY+NF+EGFAMPELDSSSYT+NNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKNSG+PP  MEEEELGF+E E APSVCKVEMEQMG RETN SKMGVAE  KR+S KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLKEE+E GLDSNHVG FN ISKEGK NEVQVRNSPK                F++E+ E +TRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQA CSE GS +KAVASSDDIKE+LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSE-GSAQKAVASSDDIKESLFRNAGYGGKCL

A0A6J1GDE8 transcription factor bHLH93-like isoform X23.9e-15983.19Show/hide
Query:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE
        MELSQHGFLEELLASTPW+SSYSNGFNDFFQN WNF  FDENPQMGTS  P+FP +QTA DFSFADQ LY+NF+EGFAMPELDSSSYT+NNET PF+SQE
Subjt:  MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQE

Query:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
        EMSNKNSG+PP  MEEEELGF+E E APSVCKVEMEQMG RETN SKMGVAE  KR+S KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD
Subjt:  EMSNKNSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMD

Query:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV
        RTSILGDTIDYVKEL+ERINNLKEE+E GLDSNHVG FN ISKEGK NEVQVRNSPK                F++E+ E +TRIDICCATRPGLLLSTV
Subjt:  RTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTV

Query:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        NTLEALGLEIQQCVISCFNDFSMQA CSEGS +KAVASSDDIKE+LFRNAGYGGKCL
Subjt:  NTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

SwissProt top hitse value%identityAlignment
Q10S44 Transcription factor BHLH32.5e-5460.61Show/hide
Query:  KIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGI--SKEGKSNEVQVRNSPKSAI
        K+ G PSKNLMAERRRRKRLNDRLSMLR+IVPKISKMDRTSILGDTIDYVKEL ERI  L  EEEIG+    +   N +  S  G +NE+ VRNS     
Subjt:  KIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGI--SKEGKSNEVQVRNSPKSAI

Query:  VELMAETLLFVTQFEVE-RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
                   T+F+VE R    TRI+ICC   PG+LLSTV+ LE LGLEI+QCV+SCF+DF MQASC +   ++ V S+D+IK++LFR+AGYGG+CL
Subjt:  VELMAETLLFVTQFEVE-RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

Q336V8 Basic helix-loop-helix protein 0041.3e-4254.36Show/hide
Query:  GQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSN-EVQVRNSPKSAIVELM
        G PSKNLMAERRRRKRLNDRLSMLR++VP+ISKMDRTSILGDTI YVKEL++RI NL+ E   G  S+        S E  S  ++     P S+     
Subjt:  GQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSN-EVQVRNSPKSAIVELM

Query:  AETLLFVTQFEVERTER-ETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVA-SSDDIKESLFRNAGYGGKCL
           +   T+FEVER E   TRI++ CA  P LL ST+  LEALG+EI+QCVISCF+DF+MQASC +   ++ +   +++IK++LFR+AGYG  CL
Subjt:  AETLLFVTQFEVERTER-ETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVA-SSDDIKESLFRNAGYGGKCL

Q9LSE2 Transcription factor ICE13.3e-3046.5Show/hide
Query:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA
        K KK +G P+KNLMAERRRRK+LNDRL MLR++VPKISKMDR SILGD IDY+KEL++RIN+L  E    L+S   G     S          +      
Subjt:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA

Query:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG
          EL   +L      Q  VE   R  R   I + C  RPGLLL+T+  L+ LGL++QQ VISCFN F++    +E   +      D IK  LF  AGY G
Subjt:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG

Q9LSL1 Transcription factor bHLH934.2e-6245.31Show/hide
Query:  MELS-QHGFLEELLASTPWTS--------SYSNGF----NDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFAD-------QQLYSNFLEGFAMP
        MELS Q    EELL  T   +        S++ GF    + FF NG+N      N +    N   +P        SF D         L+         P
Subjt:  MELS-QHGFLEELLASTPWTS--------SYSNGF----NDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFAD-------QQLYSNFLEGFAMP

Query:  ELDSSSYTKNNETSPFI-SQEEMSNKNSGYPPAAMEE-EELGFIEIETAPSVCKVEMEQMGLRETNGSK---MGVAELGKRSSIKAKKIEGQPSKNLMAE
         L SS+        PF+ + +E+ + +S  PP  ++  +E  F    + PS          L E++ SK   +G    G+ +  K+KK+EGQPSKNLMAE
Subjt:  ELDSSSYTKNNETSPFI-SQEEMSNKNSGYPPAAMEE-EELGFIEIETAPSVCKVEMEQMGLRETNGSK---MGVAELGKRSSIKAKKIEGQPSKNLMAE

Query:  RRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLK-EEEEIGLDSN-HVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQ
        RRRRKRLNDRLSMLR+IVPKISKMDRTSILGD IDY+KEL+++IN L+ EE+E+G  +N H     G  K+  +NE  VRNSPK                
Subjt:  RRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLK-EEEEIGLDSN-HVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQ

Query:  FEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        FE++R + +TR+DICC+ +PGLLLSTVNTLE LGLEI+QCVISCF+DFS+QASCSEG+ Q+   +S+DIK++LFRNAGYGG CL
Subjt:  FEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL

Q9LXA9 Transcription factor bHLH611.8e-5244.19Show/hide
Query:  NDFSFADQQLYSNFLEGFAMPELDSSSYTKN--NETSPFISQEEMSNKNSGYPPAAMEEEELGFIE------IETAPSVCKVEMEQMG----LRETNGSK
        ND++F D   ++N L+     +  SSS   N  ++  P + Q    +      P      +  F+E          P +     E       L E + S 
Subjt:  NDFSFADQQLYSNFLEGFAMPELDSSSYTKN--NETSPFISQEEMSNKNSGYPPAAMEEEELGFIE------IETAPSVCKVEMEQMG----LRETNGSK

Query:  MGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKS
        + + E  K+ S   KK+EGQPSKNLMAERRRRKRLNDRLS+LR+IVPKI+KMDRTSILGD IDY+KEL+++IN L+E+E+    ++H+           +
Subjt:  MGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKS

Query:  NEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLF
        NE  VRNS K                FEV++ E  T IDICC T+PGL++STV+TLE LGLEI+QCVISCF+DFS+QASC E   Q+ + +S+  K++L 
Subjt:  NEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLF

Query:  RNAGYGGKCL
        RNAGYGG+CL
Subjt:  RNAGYGGKCL

Arabidopsis top hitse value%identityAlignment
AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.3e-3146.5Show/hide
Query:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA
        K KK +G P+KNLMAERRRRK+LNDRL MLR++VPKISKMDR SILGD IDY+KEL++RIN+L  E    L+S   G     S          +      
Subjt:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA

Query:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG
          EL   +L      Q  VE   R  R   I + C  RPGLLL+T+  L+ LGL++QQ VISCFN F++    +E   +      D IK  LF  AGY G
Subjt:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG

AT3G26744.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.3e-3146.5Show/hide
Query:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA
        K KK +G P+KNLMAERRRRK+LNDRL MLR++VPKISKMDR SILGD IDY+KEL++RIN+L  E    L+S   G     S          +      
Subjt:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA

Query:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG
          EL   +L      Q  VE   R  R   I + C  RPGLLL+T+  L+ LGL++QQ VISCFN F++    +E   +      D IK  LF  AGY G
Subjt:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG

AT3G26744.4 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.3e-3146.5Show/hide
Query:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA
        K KK +G P+KNLMAERRRRK+LNDRL MLR++VPKISKMDR SILGD IDY+KEL++RIN+L  E    L+S   G     S          +      
Subjt:  KAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSA

Query:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG
          EL   +L      Q  VE   R  R   I + C  RPGLLL+T+  L+ LGL++QQ VISCFN F++    +E   +      D IK  LF  AGY G
Subjt:  IVELMAETLLFV--TQFEVE---RTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGG

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.3e-5344.19Show/hide
Query:  NDFSFADQQLYSNFLEGFAMPELDSSSYTKN--NETSPFISQEEMSNKNSGYPPAAMEEEELGFIE------IETAPSVCKVEMEQMG----LRETNGSK
        ND++F D   ++N L+     +  SSS   N  ++  P + Q    +      P      +  F+E          P +     E       L E + S 
Subjt:  NDFSFADQQLYSNFLEGFAMPELDSSSYTKN--NETSPFISQEEMSNKNSGYPPAAMEEEELGFIE------IETAPSVCKVEMEQMG----LRETNGSK

Query:  MGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKS
        + + E  K+ S   KK+EGQPSKNLMAERRRRKRLNDRLS+LR+IVPKI+KMDRTSILGD IDY+KEL+++IN L+E+E+    ++H+           +
Subjt:  MGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKS

Query:  NEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLF
        NE  VRNS K                FEV++ E  T IDICC T+PGL++STV+TLE LGLEI+QCVISCF+DFS+QASC E   Q+ + +S+  K++L 
Subjt:  NEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLF

Query:  RNAGYGGKCL
        RNAGYGG+CL
Subjt:  RNAGYGGKCL

AT5G65640.1 beta HLH protein 933.0e-6345.31Show/hide
Query:  MELS-QHGFLEELLASTPWTS--------SYSNGF----NDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFAD-------QQLYSNFLEGFAMP
        MELS Q    EELL  T   +        S++ GF    + FF NG+N      N +    N   +P        SF D         L+         P
Subjt:  MELS-QHGFLEELLASTPWTS--------SYSNGF----NDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFAD-------QQLYSNFLEGFAMP

Query:  ELDSSSYTKNNETSPFI-SQEEMSNKNSGYPPAAMEE-EELGFIEIETAPSVCKVEMEQMGLRETNGSK---MGVAELGKRSSIKAKKIEGQPSKNLMAE
         L SS+        PF+ + +E+ + +S  PP  ++  +E  F    + PS          L E++ SK   +G    G+ +  K+KK+EGQPSKNLMAE
Subjt:  ELDSSSYTKNNETSPFI-SQEEMSNKNSGYPPAAMEE-EELGFIEIETAPSVCKVEMEQMGLRETNGSK---MGVAELGKRSSIKAKKIEGQPSKNLMAE

Query:  RRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLK-EEEEIGLDSN-HVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQ
        RRRRKRLNDRLSMLR+IVPKISKMDRTSILGD IDY+KEL+++IN L+ EE+E+G  +N H     G  K+  +NE  VRNSPK                
Subjt:  RRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTIDYVKELIERINNLK-EEEEIGLDSN-HVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQ

Query:  FEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL
        FE++R + +TR+DICC+ +PGLLLSTVNTLE LGLEI+QCVISCF+DFS+QASCSEG+ Q+   +S+DIK++LFRNAGYGG CL
Subjt:  FEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVISCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTCAGTCAACATGGTTTCTTAGAAGAGTTATTAGCTTCAACGCCTTGGACCTCTTCATACTCAAATGGTTTCAATGATTTCTTCCAAAATGGATGGAAT
TTCAGCACTTTTGATGAGAATCCCCAAATGGGTACATCTAATTTCCCTAATTTTCCCGGTATCCAAACAGCTAATGATTTTTCTTTCGCCGATCAACAGCTCTAC
AGCAATTTTCTCGAAGGGTTTGCAATGCCGGAGCTTGACTCCTCATCGTACACTAAGAACAACGAAACCTCACCATTTATTTCTCAAGAAGAAATGAGTAATAAG
AACAGTGGTTACCCTCCGGCGGCGATGGAGGAGGAAGAACTTGGTTTTATAGAGATTGAAACAGCTCCAAGTGTTTGCAAAGTGGAAATGGAGCAAATGGGTTTG
CGTGAAACTAATGGTTCTAAAATGGGTGTGGCAGAATTAGGGAAAAGAAGCAGCATCAAGGCTAAAAAGATTGAAGGACAGCCCTCAAAGAATTTAATGGCGGAA
AGAAGAAGAAGGAAGCGGTTGAATGATCGGCTTTCAATGCTCAGAGCCATAGTCCCTAAAATAAGCAAGATGGATAGAACGTCTATACTTGGAGACACAATCGAT
TATGTGAAAGAGCTGATAGAAAGAATCAATAACTTGAAAGAAGAAGAGGAAATTGGTTTAGATTCAAATCACGTGGGCTTCTTCAATGGGATCTCCAAGGAAGGG
AAGTCCAACGAAGTTCAAGTGAGGAATTCCCCAAAGTCTGCCATTGTTGAGCTAATGGCTGAAACTTTGTTGTTTGTTACACAGTTCGAAGTTGAAAGGACGGAG
AGGGAGACTCGAATCGACATTTGCTGTGCAACGAGGCCAGGGTTATTGCTGTCTACAGTCAACACATTAGAAGCATTGGGGCTTGAGATTCAACAGTGTGTTATT
AGCTGTTTCAATGATTTTTCAATGCAAGCTTCTTGTTCAGAGGGAAGTGCTCAGAAAGCTGTGGCAAGTTCTGATGATATAAAGGAATCACTGTTCAGAAATGCA
GGATATGGAGGGAAGTGCTTGTAG
mRNA sequenceShow/hide mRNA sequence
CCAACTTGGTGCAACTCTATGTTCATTAGTCTGTTTTCCACTTTCTTCAGACCAAAAGAAGGAAGAAGAAGACAATGGAGCTCAGTCAACATGGTTTCTTAGAAG
AGTTATTAGCTTCAACGCCTTGGACCTCTTCATACTCAAATGGTTTCAATGATTTCTTCCAAAATGGATGGAATTTCAGCACTTTTGATGAGAATCCCCAAATGG
GTACATCTAATTTCCCTAATTTTCCCGGTATCCAAACAGCTAATGATTTTTCTTTCGCCGATCAACAGCTCTACAGCAATTTTCTCGAAGGGTTTGCAATGCCGG
AGCTTGACTCCTCATCGTACACTAAGAACAACGAAACCTCACCATTTATTTCTCAAGAAGAAATGAGTAATAAGAACAGTGGTTACCCTCCGGCGGCGATGGAGG
AGGAAGAACTTGGTTTTATAGAGATTGAAACAGCTCCAAGTGTTTGCAAAGTGGAAATGGAGCAAATGGGTTTGCGTGAAACTAATGGTTCTAAAATGGGTGTGG
CAGAATTAGGGAAAAGAAGCAGCATCAAGGCTAAAAAGATTGAAGGACAGCCCTCAAAGAATTTAATGGCGGAAAGAAGAAGAAGGAAGCGGTTGAATGATCGGC
TTTCAATGCTCAGAGCCATAGTCCCTAAAATAAGCAAGATGGATAGAACGTCTATACTTGGAGACACAATCGATTATGTGAAAGAGCTGATAGAAAGAATCAATA
ACTTGAAAGAAGAAGAGGAAATTGGTTTAGATTCAAATCACGTGGGCTTCTTCAATGGGATCTCCAAGGAAGGGAAGTCCAACGAAGTTCAAGTGAGGAATTCCC
CAAAGTCTGCCATTGTTGAGCTAATGGCTGAAACTTTGTTGTTTGTTACACAGTTCGAAGTTGAAAGGACGGAGAGGGAGACTCGAATCGACATTTGCTGTGCAA
CGAGGCCAGGGTTATTGCTGTCTACAGTCAACACATTAGAAGCATTGGGGCTTGAGATTCAACAGTGTGTTATTAGCTGTTTCAATGATTTTTCAATGCAAGCTT
CTTGTTCAGAGGGAAGTGCTCAGAAAGCTGTGGCAAGTTCTGATGATATAAAGGAATCACTGTTCAGAAATGCAGGATATGGAGGGAAGTGCTTGTAGGGGAAAC
ACTTCAAAGTTTAAATCCGGAGCTCTCACCTTGGGGAACCTTGAAGTGAAGAACATTAATGGCATTCTTCATATGGATGAGTGAATCTGCCGCAATTAGAAATAA
TGGATCCGGAAGGCTCTTTCCAATAATTAATCTGGGTTTTATTTGCTGCATTTAGTGATTAGACTTAAATATACCGAAAATGACACTCCAATAATGTACTGAATG
AGATGAACAAAAGAG
Protein sequenceShow/hide protein sequence
MELSQHGFLEELLASTPWTSSYSNGFNDFFQNGWNFSTFDENPQMGTSNFPNFPGIQTANDFSFADQQLYSNFLEGFAMPELDSSSYTKNNETSPFISQEEMSNK
NSGYPPAAMEEEELGFIEIETAPSVCKVEMEQMGLRETNGSKMGVAELGKRSSIKAKKIEGQPSKNLMAERRRRKRLNDRLSMLRAIVPKISKMDRTSILGDTID
YVKELIERINNLKEEEEIGLDSNHVGFFNGISKEGKSNEVQVRNSPKSAIVELMAETLLFVTQFEVERTERETRIDICCATRPGLLLSTVNTLEALGLEIQQCVI
SCFNDFSMQASCSEGSAQKAVASSDDIKESLFRNAGYGGKCL