CuGenDBv2

Lag0020534 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020534
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: transcription factor bHLH57-like
Genome location: chr7:145836..147583
RNA-Seq Expression: Lag0020534
Synteny: Lag0020534
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
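
The genome location field can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the `chrom:start..end` layout seen in this record and 1-based inclusive coordinates; both are assumptions inferred from this single value, and `parse_location` and `span_length` are hypothetical helper names.

```python
import re

# Assumed layout "chrom:start..end", inferred from the single
# "Genome location" value in this record (not from a documented spec).
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split e.g. 'chr7:145836..147583' into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr7:145836..147583")
print(chrom, start, end, span_length(start, end))
```

Under these assumptions the gene spans 1748 bp on chr7.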


Homology
GenBank top hits (e value, %identity, alignment)
XP_004134273.1 transcription factor bHLH67 isoform X2 [Cucumis sativus] | e value: 6.3e-143 | %identity: 78.61
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN
        MERLQGPINPC YGEY+E G SEQEFT+L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHL KP WE  V++IQELV+L+SSP+N
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN

Query:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS
        SETKDQNQ P S    + V SECN        QM KAPPV KERRKRKRS+PTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQAS
Subjt:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS

Query:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        IIGGAIDFVKELEQ+LESL+A RKERKG E GECKGEQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ QLLK IV
Subjt:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR
        ALE+LRLTVLHLNI TSQ  ATMLYS NLKIEDECKL S EQIAATV++IFSFIN+GR+VNEAK NFRQYSGSR
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR

XP_008437748.1 PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo] | e value: 8.5e-140 | %identity: 78.53
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN
        MERLQGPINPC YGEY+E G SEQEF++L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHLNKP WEE VS+IQELVEL+SSP+N
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN

Query:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS
        SETKDQNQ PNS    + V SECN        QM K PPV KERRKRKRS+PTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQAS
Subjt:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS

Query:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        IIGGAIDFVKELEQ+LESL+A RKERKG E GECK EQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQGQLLK IV
Subjt:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR
        ALE+LRLTVLHLNI TSQ  ATMLYS NLKIEDECKL S EQIAATV+QIFSF+N+GR+VNEAK  F+
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR

XP_022145118.1 LOW QUALITY PROTEIN: transcription factor bHLH67 [Momordica charantia] | e value: 4.5e-157 | %identity: 87.29
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPV
        MERLQG I+PC YGEY+ERG SEQ FTSLRFEE EEAYFLTS+LE+KMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKP WE EEVSQIQELVELYSSP+
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPV

Query:  NSETKDQNQHPNSASCTEGVSSECN-----QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASII
        NSETKDQNQHPNSAS T+GVSSECN     QMAKAPPVTKERRKRKR+RP KNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYI RGDQASII
Subjt:  NSETKDQNQHPNSASCTEGVSSECN-----QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASII

Query:  GGAIDFVKELEQVLESLKAQRKERKGEEGG-ECKGEQSSLGSPT-SSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        GGAI FVKELEQ+LESL+AQ   RKGEEGG + KGE SS+GS + +SSAMGMASNGRIGEGVCAE KSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
Subjt:  GGAIDFVKELEQVLESLKAQRKERKGEEGG-ECKGEQSSLGSPT-SSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNE
        ALE+LRLTVLHLNI+TSQATATM YS NLKIEDECK+GSAEQIAATVHQIFSF+NDGR+V E
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNE

XP_031738937.1 transcription factor bHLH70 isoform X1 [Cucumis sativus] | e value: 7.4e-136 | %identity: 78.18
Query:  YGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNS
        YGEY+E G SEQEFT+L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHL KP WE  V++IQELV+L+SSP+NSETKDQNQ P S
Subjt:  YGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNS

Query:  ASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKEL
            + V SECN        QM KAPPV KERRKRKRS+PTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIGGAIDFVKEL
Subjt:  ASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKEL

Query:  EQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHL
        EQ+LESL+A RKERKG E GECKGEQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ QLLK IVALE+LRLTVLHL
Subjt:  EQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHL

Query:  NITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR
        NI TSQ  ATMLYS NLKIEDECKL S EQIAATV++IFSFIN+GR+VNEAK NFRQYSGSR
Subjt:  NITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR

XP_038889465.1 transcription factor bHLH67 isoform X1 [Benincasa hispida] | e value: 2.7e-154 | %identity: 83.78
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF--KEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSP
        MERLQGPINPC YGEY+E G SEQEFT+L FEE EE  FLTSSLE+K+PFLQMLQSVESQ    KEPNFQ+LLKLQHLNKP WEE VS+IQELVEL+SSP
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF--KEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSP

Query:  VNSETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ
        +NSETKDQNQ PNS    EGVSSECN        QMAKAPPVTKERRKRKRS+PTK+KEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ
Subjt:  VNSETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ

Query:  ASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA
        ASIIGGAIDFVKELEQ+LESL+A RKERKG E G CKGEQS           G+ASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA
Subjt:  ASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA

Query:  IVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR
        IVALE+LRLTVLHLNITTSQATA MLYS NLKIEDEC+LGSAEQIAATV+QIFSFINDGR+VNEAKANFRQ SGSR
Subjt:  IVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L6W2 BHLH domain-containing protein | e value: 3.0e-143 | %identity: 78.61
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN
        MERLQGPINPC YGEY+E G SEQEFT+L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHL KP WE  V++IQELV+L+SSP+N
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN

Query:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS
        SETKDQNQ P S    + V SECN        QM KAPPV KERRKRKRS+PTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQAS
Subjt:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS

Query:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        IIGGAIDFVKELEQ+LESL+A RKERKG E GECKGEQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ QLLK IV
Subjt:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR
        ALE+LRLTVLHLNI TSQ  ATMLYS NLKIEDECKL S EQIAATV++IFSFIN+GR+VNEAK NFRQYSGSR
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR

A0A1S3AVB5 transcription factor bHLH67 isoform X1 | e value: 4.1e-140 | %identity: 78.53
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN
        MERLQGPINPC YGEY+E G SEQEF++L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHLNKP WEE VS+IQELVEL+SSP+N
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVN

Query:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS
        SETKDQNQ PNS    + V SECN        QM K PPV KERRKRKRS+PTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQAS
Subjt:  SETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQAS

Query:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        IIGGAIDFVKELEQ+LESL+A RKERKG E GECK EQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQGQLLK IV
Subjt:  IIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR
        ALE+LRLTVLHLNI TSQ  ATMLYS NLKIEDECKL S EQIAATV+QIFSF+N+GR+VNEAK  F+
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR

A0A5D3DAW7 Transcription factor bHLH67 isoform X1 | e value: 8.3e-133 | %identity: 78.09
Query:  YGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNS
        YGEY+E G SEQEF++L FEE EE   LTSSLE+K+PFLQMLQSVESQ FKEPNFQ+LLKLQHLNKP WEE VS+IQELVEL+SSP+NSETKDQNQ PNS
Subjt:  YGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNS

Query:  ASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKEL
            + V SECN        QM K PPV KERRKRKRS+PTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIGGAIDFVKEL
Subjt:  ASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKEL

Query:  EQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHL
        EQ+LESL+A RKERKG E GECK EQS          + +ASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQGQLLK IVALE+LRLTVLHL
Subjt:  EQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHL

Query:  NITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR
        NI TSQ  ATMLYS NLKIEDECKL S EQIAATV+QIFSF+N+GR+VNEAK  F+
Subjt:  NITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFR

A0A6J1CV34 LOW QUALITY PROTEIN: transcription factor bHLH67 | e value: 2.2e-157 | %identity: 87.29
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPV
        MERLQG I+PC YGEY+ERG SEQ FTSLRFEE EEAYFLTS+LE+KMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKP WE EEVSQIQELVELYSSP+
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPV

Query:  NSETKDQNQHPNSASCTEGVSSECN-----QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASII
        NSETKDQNQHPNSAS T+GVSSECN     QMAKAPPVTKERRKRKR+RP KNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYI RGDQASII
Subjt:  NSETKDQNQHPNSASCTEGVSSECN-----QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASII

Query:  GGAIDFVKELEQVLESLKAQRKERKGEEGG-ECKGEQSSLGSPT-SSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
        GGAI FVKELEQ+LESL+AQ   RKGEEGG + KGE SS+GS + +SSAMGMASNGRIGEGVCAE KSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV
Subjt:  GGAIDFVKELEQVLESLKAQRKERKGEEGG-ECKGEQSSLGSPT-SSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIV

Query:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNE
        ALE+LRLTVLHLNI+TSQATATM YS NLKIEDECK+GSAEQIAATVHQIFSF+NDGR+V E
Subjt:  ALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNE

A0A6J1IB13 transcription factor bHLH57-like isoform X1 | e value: 1.9e-124 | %identity: 71.2
Query:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKL-QHL-NKPAWEEEVSQIQELVELYSSP
        MERLQGPINP  YG  ++ G S+Q+F+SL F+E +EAY  TSSL+EKMPFL MLQ VE +PFKEP+FQNLLKL QHL N   W++EV             
Subjt:  MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKL-QHL-NKPAWEEEVSQIQELVELYSSP

Query:  VNSETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ
                    NSASC   VS+ECN        QM  + P TKERRKRKR RP KNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ
Subjt:  VNSETKDQNQHPNSASCTEGVSSECN--------QMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ

Query:  ASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA
        ASIIGGAIDFVKELEQ+LE L+AQRKERKGE   E        GSPTSS+A GMASNGRIGEGVCAE+KSEV EIEVTMIQ HVNLKIKCPKRQGQLLKA
Subjt:  ASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKA

Query:  IVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGS
        IVALE+LRL+VLHLNI+TSQATAT+LYS NLKIEDECKLGSA QIA  VH+I SFINDG  VNE K N RQYSGS
Subjt:  IVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGS

SwissProt top hits (e value, %identity, alignment)
O81037 Transcription factor bHLH70 | e value: 2.5e-62 | %identity: 47.65
Query:  QEFTSLRFEELEEAYFLTSSLEE-KMPFLQMLQSVESQ----PFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEG
        ++  S   EE E+     S L++  +PFLQMLQ  E       FK+P+F  LL LQ L KP WE E     E+ E + SP++SET     +P+     E 
Subjt:  QEFTSLRFEELEEAYFLTSSLEE-KMPFLQMLQSVESQ----PFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEG

Query:  VSSE---CNQMAKAPP------------VTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE
        +S++    N +  A              +T+E+RKR+R++PTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+SYIQRGDQASI+GGAIDFVK 
Subjt:  VSSE---CNQMAKAPP------------VTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE

Query:  LEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLH
        LEQ L+SL+AQ++ ++ ++  E   E +SL + +S+     ASN         E +S   +IE T+I++HVNLKI+C ++QGQLL++I+ LE+LR TVLH
Subjt:  LEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLH

Query:  LNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF
        LNI TS    ++ YS NLK+EDEC LGSA++I A + QIF
Subjt:  LNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF

Q56YJ8 Transcription factor FAMA | e value: 8.5e-42 | %identity: 45.74
Query:  ETKDQNQHPNSASCTEGVSSECNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFV
        E +D +   NS         E ++  K     + + KRKR+R +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SY+QRGDQASIIGGAI+FV
Subjt:  ETKDQNQHPNSASCTEGVSSECNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFV

Query:  KELEQVLESLKAQRKER-KGEEGGECKGEQSSLGSP---TSSSAMGMASNGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLK
        +ELEQ+L+ L++Q++ R  GE G +     +S  SP    ++ A  +   G +     G G+    AE KS +A++EV ++     +KI   +R GQL+K
Subjt:  KELEQVLESLKAQRKER-KGEEGGECKGEQSSLGSP---TSSSAMGMASNGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLK

Query:  AIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFIN
         I ALE+L L++LH NITT +   T+LYS N+KI  E +  +AE IA+++ QIFSFI+
Subjt:  AIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFIN

Q700E4 Transcription factor bHLH67 | e value: 4.0e-60 | %identity: 42.67
Query:  MERLQGPINPCPYGEYAE------RGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQE
        MER QG INPC +    +      +G++E +  S  F+E EE      SL++ +PFLQMLQS +   F   KEPNF  LL LQ L +P WE E    +++
Subjt:  MERLQGPINPCPYGEYAE------RGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQE

Query:  LVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK-----APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQ
            + SPV SET    +  N A                 S +  +S+   +  K        +T+E+RKR++++P+KN EE+E+QR+ HIAVERNRRRQ
Subjt:  LVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK-----APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQ

Query:  MNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVT
        MN+H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SL++Q++ ++       +   + L         G++SN         E ++ + +IE T
Subjt:  MNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVT

Query:  MIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF
        +IQ HV+LK++C K+QGQLLK I++LE+L+LTVLHLNITTS + +++ YS NLK+EDEC L SA++I A VH+IF
Subjt:  MIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF

Q9M128 Transcription factor bHLH57 | e value: 8.7e-55 | %identity: 44.58
Query:  LRFEELEEAY--FLTSSLEEKMPFLQMLQSVESQPF--KEPN--FQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEGVSSE
        + F ELE+ +     +++EEK+PFLQMLQ +E  PF   EPN   Q+LL++Q L      E  S +     +   P  ++  +++    + + T      
Subjt:  LRFEELEEAY--FLTSSLEEKMPFLQMLQSVESQPF--KEPN--FQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEGVSSE

Query:  CNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEE
                   KE+RKRKR+R  KNK+EVE+QRMTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQ+L+SL+A++++   +E
Subjt:  CNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEE

Query:  ---GGECKGEQSSLGSPTSSSAMGMAS-NG---RIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATM
              C    S   + +S S++   S NG   R G G       +  E+E T+IQ HV+LK++C + + Q+LKAIV++EEL+L +LHL I++S     +
Subjt:  ---GGECKGEQSSLGSPTSSSAMGMAS-NG---RIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATM

Query:  LYSINLKIEDECKLGSAEQIAATVHQIFSFIN
        +YS NLK+ED CKLGSA++IA  VHQIF  IN
Subjt:  LYSINLKIEDECKLGSAEQIAATVHQIFSFIN

Q9SK91 Transcription factor bHLH94 | e value: 2.7e-40 | %identity: 46.79
Query:  PVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQ
        P  + RRKR+R+R  KNKEE+E+QRMTHIAVERNRR+QMN++L V++SL+P+SY QRGDQASI+GGAI++VKELE +L+S++ +R      +G   K   
Subjt:  PVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQ

Query:  SSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLG
        SSL  P +        + +    V  E  S  AEIEVT+ ++H N+KI   K+  QLLK I +L+ LRLT+LHLN+TT     ++LYSI++++E+  +L 
Subjt:  SSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLG

Query:  SAEQIAATVHQIFSFIND
        + + IA  ++Q    I +
Subjt:  SAEQIAATVHQIFSFIND

Arabidopsis top hits (e value, %identity, alignment)
AT2G46810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.8e-63 | %identity: 47.65
Query:  QEFTSLRFEELEEAYFLTSSLEE-KMPFLQMLQSVESQ----PFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEG
        ++  S   EE E+     S L++  +PFLQMLQ  E       FK+P+F  LL LQ L KP WE E     E+ E + SP++SET     +P+     E 
Subjt:  QEFTSLRFEELEEAYFLTSSLEE-KMPFLQMLQSVESQ----PFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEG

Query:  VSSE---CNQMAKAPP------------VTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE
        +S++    N +  A              +T+E+RKR+R++PTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+SYIQRGDQASI+GGAIDFVK 
Subjt:  VSSE---CNQMAKAPP------------VTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE

Query:  LEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLH
        LEQ L+SL+AQ++ ++ ++  E   E +SL + +S+     ASN         E +S   +IE T+I++HVNLKI+C ++QGQLL++I+ LE+LR TVLH
Subjt:  LEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLH

Query:  LNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF
        LNI TS    ++ YS NLK+EDEC LGSA++I A + QIF
Subjt:  LNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF

AT3G24140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 6.0e-43 | %identity: 45.74
Query:  ETKDQNQHPNSASCTEGVSSECNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFV
        E +D +   NS         E ++  K     + + KRKR+R +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SY+QRGDQASIIGGAI+FV
Subjt:  ETKDQNQHPNSASCTEGVSSECNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFV

Query:  KELEQVLESLKAQRKER-KGEEGGECKGEQSSLGSP---TSSSAMGMASNGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLK
        +ELEQ+L+ L++Q++ R  GE G +     +S  SP    ++ A  +   G +     G G+    AE KS +A++EV ++     +KI   +R GQL+K
Subjt:  KELEQVLESLKAQRKER-KGEEGGECKGEQSSLGSP---TSSSAMGMASNGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLK

Query:  AIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFIN
         I ALE+L L++LH NITT +   T+LYS N+KI  E +  +AE IA+++ QIFSFI+
Subjt:  AIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIFSFIN

AT3G61950.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 2.9e-61 | %identity: 42.67
Query:  MERLQGPINPCPYGEYAE------RGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQE
        MER QG INPC +    +      +G++E +  S  F+E EE      SL++ +PFLQMLQS +   F   KEPNF  LL LQ L +P WE E    +++
Subjt:  MERLQGPINPCPYGEYAE------RGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQE

Query:  LVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK-----APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQ
            + SPV SET    +  N A                 S +  +S+   +  K        +T+E+RKR++++P+KN EE+E+QR+ HIAVERNRRRQ
Subjt:  LVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK-----APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQ

Query:  MNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVT
        MN+H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SL++Q++ ++       +   + L         G++SN         E ++ + +IE T
Subjt:  MNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVT

Query:  MIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF
        +IQ HV+LK++C K+QGQLLK I++LE+L+LTVLHLNITTS + +++ YS NLK+EDEC L SA++I A VH+IF
Subjt:  MIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLGSAEQIAATVHQIF

AT3G61950.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 3.8e-53 | %identity: 43.53
Query:  MLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK---
        MLQS +   F   KEPNF  LL LQ L +P WE E    +++    + SPV SET    +  N A                 S +  +S+   +  K   
Subjt:  MLQSVESQPF---KEPNFQNLLKLQHLNKPAWE-EEVSQIQELVELYSSPVNSETKDQNQHPNSA-----------------SCTEGVSSECNQMAK---

Query:  --APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGEC
             +T+E+RKR++++P+KN EE+E+QR+ HIAVERNRRRQMN+H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SL++Q++ ++       
Subjt:  --APPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEEGGEC

Query:  KGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDE
        +   + L         G++SN         E ++ + +IE T+IQ HV+LK++C K+QGQLLK I++LE+L+LTVLHLNITTS + +++ YS NLK+EDE
Subjt:  KGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDE

Query:  CKLGSAEQIAATVHQIF
        C L SA++I A VH+IF
Subjt:  CKLGSAEQIAATVHQIF

AT4G01460.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 6.2e-56 | %identity: 44.58
Query:  LRFEELEEAY--FLTSSLEEKMPFLQMLQSVESQPF--KEPN--FQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEGVSSE
        + F ELE+ +     +++EEK+PFLQMLQ +E  PF   EPN   Q+LL++Q L      E  S +     +   P  ++  +++    + + T      
Subjt:  LRFEELEEAY--FLTSSLEEKMPFLQMLQSVESQPF--KEPN--FQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHPNSASCTEGVSSE

Query:  CNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEE
                   KE+RKRKR+R  KNK+EVE+QRMTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQ+L+SL+A++++   +E
Subjt:  CNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKGEE

Query:  ---GGECKGEQSSLGSPTSSSAMGMAS-NG---RIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATM
              C    S   + +S S++   S NG   R G G       +  E+E T+IQ HV+LK++C + + Q+LKAIV++EEL+L +LHL I++S     +
Subjt:  ---GGECKGEQSSLGSPTSSSAMGMAS-NG---RIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATM

Query:  LYSINLKIEDECKLGSAEQIAATVHQIFSFIN
        +YS NLK+ED CKLGSA++IA  VHQIF  IN
Subjt:  LYSINLKIEDECKLGSAEQIAATVHQIFSFIN
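
A percent-identity figure like those listed for each hit can be approximated from the alignment blocks themselves. The sketch below assumes the usual BLAST-style convention, in which the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. `percent_identity` is a hypothetical helper, and the denominator used here (all aligned columns, gaps included) is an assumption that may differ from how the page computed its values.

```python
def percent_identity(blocks):
    """Estimate percent identity from BLAST-style alignment blocks.

    blocks: iterable of (query_line, midline) pairs with the "Query:" label
    and leading padding already removed, so both strings start at column 0.
    """
    identical = 0
    aligned = 0
    for query, midline in blocks:
        aligned += len(query)
        # Trailing spaces on the midline are often stripped; pad them back.
        midline = midline.ljust(len(query))
        # A column is an identity when the midline repeats the query residue.
        identical += sum(
            1 for q, m in zip(query, midline) if q != "-" and m == q
        )
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identical / aligned


# Toy block: 3 identical columns out of 5 aligned columns.
print(percent_identity([("MERLQ", "ME+ Q")]))
```

To apply it to a hit above, pass each block's `Query:` sequence together with the line directly beneath it.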


Sequences
CDS sequence
ATGGAGAGGCTCCAAGGACCCATCAATCCTTGTCCTTATGGTGAATATGCAGAGAGAGGTTACTCGGAACAAGAATTCACAAGCTTAAGATTTGAAGAACTAGAAGAAGC
CTATTTCTTAACATCAAGTTTGGAAGAAAAAATGCCATTTCTTCAGATGCTGCAGAGTGTGGAATCCCAACCATTCAAGGAGCCTAACTTTCAAAACTTGCTGAAGCTGC
AGCACCTAAACAAACCAGCATGGGAAGAGGAAGTTAGTCAAATTCAGGAGCTTGTAGAGTTGTATTCTTCACCAGTTAACTCAGAAACAAAAGACCAAAATCAACATCCA
AATTCCGCTTCATGTACTGAGGGAGTGAGTTCAGAGTGCAACCAAATGGCAAAGGCTCCTCCAGTCACCAAGGAAAGAAGAAAACGAAAGAGATCGAGACCAACTAAGAA
CAAGGAAGAAGTGGAGAGCCAGAGAATGACCCATATTGCCGTCGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCCCTCATACCTACATCCT
ACATACAAAGGGGTGATCAGGCATCGATAATTGGGGGTGCAATAGACTTCGTGAAGGAATTGGAGCAGGTACTTGAATCTTTGAAAGCACAGAGGAAAGAAAGGAAAGGA
GAGGAAGGTGGTGAGTGTAAGGGTGAGCAATCCTCGCTAGGTTCACCCACATCCTCCTCAGCAATGGGAATGGCCTCCAATGGGAGAATAGGAGAAGGGGTTTGTGCAGA
GCTCAAGTCAGAAGTGGCTGAGATAGAGGTGACCATGATTCAAACCCATGTAAACTTGAAGATAAAATGCCCCAAAAGGCAAGGCCAATTGTTGAAAGCCATTGTTGCTT
TGGAAGAACTTAGGCTCACAGTTTTGCATCTCAACATTACTACTTCACAAGCCACTGCCACCATGCTCTACTCCATCAATCTGAAGATAGAAGATGAATGTAAGCTAGGA
TCAGCGGAGCAGATTGCAGCAACAGTTCATCAAATATTCAGTTTTATCAACGATGGCAGGGTGGTCAACGAGGCAAAGGCAAATTTCAGGCAGTACAGTGGCAGTCGCTG
A
mRNA sequence
ATGGAGAGGCTCCAAGGACCCATCAATCCTTGTCCTTATGGTGAATATGCAGAGAGAGGTTACTCGGAACAAGAATTCACAAGCTTAAGATTTGAAGAACTAGAAGAAGC
CTATTTCTTAACATCAAGTTTGGAAGAAAAAATGCCATTTCTTCAGATGCTGCAGAGTGTGGAATCCCAACCATTCAAGGAGCCTAACTTTCAAAACTTGCTGAAGCTGC
AGCACCTAAACAAACCAGCATGGGAAGAGGAAGTTAGTCAAATTCAGGAGCTTGTAGAGTTGTATTCTTCACCAGTTAACTCAGAAACAAAAGACCAAAATCAACATCCA
AATTCCGCTTCATGTACTGAGGGAGTGAGTTCAGAGTGCAACCAAATGGCAAAGGCTCCTCCAGTCACCAAGGAAAGAAGAAAACGAAAGAGATCGAGACCAACTAAGAA
CAAGGAAGAAGTGGAGAGCCAGAGAATGACCCATATTGCCGTCGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCCCTCATACCTACATCCT
ACATACAAAGGGGTGATCAGGCATCGATAATTGGGGGTGCAATAGACTTCGTGAAGGAATTGGAGCAGGTACTTGAATCTTTGAAAGCACAGAGGAAAGAAAGGAAAGGA
GAGGAAGGTGGTGAGTGTAAGGGTGAGCAATCCTCGCTAGGTTCACCCACATCCTCCTCAGCAATGGGAATGGCCTCCAATGGGAGAATAGGAGAAGGGGTTTGTGCAGA
GCTCAAGTCAGAAGTGGCTGAGATAGAGGTGACCATGATTCAAACCCATGTAAACTTGAAGATAAAATGCCCCAAAAGGCAAGGCCAATTGTTGAAAGCCATTGTTGCTT
TGGAAGAACTTAGGCTCACAGTTTTGCATCTCAACATTACTACTTCACAAGCCACTGCCACCATGCTCTACTCCATCAATCTGAAGATAGAAGATGAATGTAAGCTAGGA
TCAGCGGAGCAGATTGCAGCAACAGTTCATCAAATATTCAGTTTTATCAACGATGGCAGGGTGGTCAACGAGGCAAAGGCAAATTTCAGGCAGTACAGTGGCAGTCGCTG
A
Protein sequence
MERLQGPINPCPYGEYAERGYSEQEFTSLRFEELEEAYFLTSSLEEKMPFLQMLQSVESQPFKEPNFQNLLKLQHLNKPAWEEEVSQIQELVELYSSPVNSETKDQNQHP
NSASCTEGVSSECNQMAKAPPVTKERRKRKRSRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQVLESLKAQRKERKG
EEGGECKGEQSSLGSPTSSSAMGMASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGQLLKAIVALEELRLTVLHLNITTSQATATMLYSINLKIEDECKLG
SAEQIAATVHQIFSFINDGRVVNEAKANFRQYSGSR
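
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. `translate` is a hypothetical helper written without external libraries, and whitespace in the input is ignored so the wrapped lines can be pasted directly.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Unknown codons (e.g. containing N) become 'X'. With to_stop=True,
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS listed above.
print(translate("ATGGAGAGGCTCCAAGGACCC"))  # MERLQGP
```

Translating the full CDS and comparing the result with the listed protein sequence is a quick consistency check on the record.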