; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G000030 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G000030
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptiontranscription factor bHLH57-like
Genome locationCG_Chr05:56108..60354
RNA-Seq ExpressionClCG05G000030
SyntenyClCG05G000030
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134273.1 transcription factor bHLH67 isoform X2 [Cucumis sativus]1.7e-16487.5Show/hide
Query:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
        MERLQGPINPCFYGEYSETGCSEQEFT+LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPI
Subjt:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI

Query:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG
        NSETKDQ +PP S+ + SE N+NQG+ +TQM KAPPV KERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIG
Subjt:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG

Query:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN
        GAIDFVKELEQLLESLEALRKERKG E  CKGEQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ  LLK IVALEDLRLTVLHLN
Subjt:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN

Query:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN
        I TSQ  ATMLYSFNLKIEDECKL S EQIAATVN+IFSFIN+GRLVNEAK NFRQYSG+
Subjt:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN

XP_008437748.1 PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo]1.4e-16187.89Show/hide
Query:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
        MERLQGPINPC YGEYSETGCSEQEF++LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
Subjt:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI

Query:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG
        NSETKDQ +PPNS+ + SE N+NQGL +TQM K PPV KERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIG
Subjt:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG

Query:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN
        GAIDFVKELEQLLESLEALRKERKG E  CK EQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQG LLK IVALEDLRLTVLHLN
Subjt:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN

Query:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR
        I TSQ  ATMLYSFNLKIEDECKL S EQIAATVNQIFSF+N+GRLVNEAK  F+
Subjt:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR

XP_031738937.1 transcription factor bHLH70 isoform X1 [Cucumis sativus]6.0e-15787.11Show/hide
Query:  FYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPP
        FYGEYSETGCSEQEFT+LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPINSETKDQ +PP
Subjt:  FYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPP

Query:  NSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQ
         S+ + SE N+NQG+ +TQM KAPPV KERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIGGAIDFVKELEQ
Subjt:  NSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQ

Query:  LLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATML
        LLESLEALRKERKG E  CKGEQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ  LLK IVALEDLRLTVLHLNI TSQ  ATML
Subjt:  LLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATML

Query:  YSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN
        YSFNLKIEDECKL S EQIAATVN+IFSFIN+GRLVNEAK NFRQYSG+
Subjt:  YSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN

XP_038889465.1 transcription factor bHLH67 isoform X1 [Benincasa hispida]4.0e-19393.49Show/hide
Query:  MLVTPRQRSRKAKLTHFKTCSAVQMERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQ
        MLVT RQRSRKAKL HF TCSAVQMERLQGPINPCFYGEYSETGCSEQEFT+LGFEESEEVCFLTSSLED++PFLQMLQSVESQS KDKEPNFQSLLKLQ
Subjt:  MLVTPRQRSRKAKLTHFKTCSAVQMERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQ

Query:  HLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND
        HLNKPWEEGVSKIQELVELFSSPINSETKDQ +PPNSEG+SSE N+ QGLCR QM KAPPVTKERRKRKRSKPTK+KEEVESQRMTHIAVERNRRRQMND
Subjt:  HLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND

Query:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP
        HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKG E+GCKGEQ E GVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP
Subjt:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP

Query:  KRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN
        KRQG LLKAIVALEDLRLTVLHLNITTSQATA MLYSFNLKIEDEC+LGSAEQIAATVNQIFSFINDGRLVNEAKANFRQ SG+
Subjt:  KRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN

XP_038889530.1 transcription factor bHLH67 isoform X2 [Benincasa hispida]6.2e-17093.55Show/hide
Query:  MLVTPRQRSRKAKLTHFKTCSAVQMERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQ
        MLVT RQRSRKAKL HF TCSAVQMERLQGPINPCFYGEYSETGCSEQEFT+LGFEESEEVCFLTSSLED++PFLQMLQSVESQS KDKEPNFQSLLKLQ
Subjt:  MLVTPRQRSRKAKLTHFKTCSAVQMERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQ

Query:  HLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND
        HLNKPWEEGVSKIQELVELFSSPINSETKDQ +PPNSEG+SSE N+ QGLCR QM KAPPVTKERRKRKRSKPTK+KEEVESQRMTHIAVERNRRRQMND
Subjt:  HLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND

Query:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP
        HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKG E+GCKGEQ E GVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP
Subjt:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCP

Query:  KRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLK
        KRQG LLKAIVALEDLRLTVLHLNITTSQATA MLYSFNLK
Subjt:  KRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLK

TrEMBL top hitse value%identityAlignment
A0A0A0L6W2 BHLH domain-containing protein8.4e-16587.5Show/hide
Query:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
        MERLQGPINPCFYGEYSETGCSEQEFT+LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPI
Subjt:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI

Query:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG
        NSETKDQ +PP S+ + SE N+NQG+ +TQM KAPPV KERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIG
Subjt:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG

Query:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN
        GAIDFVKELEQLLESLEALRKERKG E  CKGEQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+CPKRQ  LLK IVALEDLRLTVLHLN
Subjt:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN

Query:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN
        I TSQ  ATMLYSFNLKIEDECKL S EQIAATVN+IFSFIN+GRLVNEAK NFRQYSG+
Subjt:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGN

A0A1S3AUE4 transcription factor bHLH70 isoform X21.9e-13287.46Show/hide
Query:  MLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKN
        MLQSVESQSF  KEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQ +PPNS+ + SE N+NQGL +TQM K PPV KERRKRKRSKPTKN
Subjt:  MLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKN

Query:  KEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAE
        KEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIGGAIDFVKELEQLLESLEALRKERKG E  CK EQ EV VASN RIGEGVCAE
Subjt:  KEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAE

Query:  LKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKA
        L+SEVAEIEVTMIQTHVNLKI+C KRQG LLK IVALEDLRLTVLHLNI TSQ  ATMLYSFNLKIEDECKL S EQIAATVNQIFSF+N+GRLVNEAK 
Subjt:  LKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKA

Query:  NFR
         F+
Subjt:  NFR

A0A1S3AVB5 transcription factor bHLH67 isoform X16.7e-16287.89Show/hide
Query:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
        MERLQGPINPC YGEYSETGCSEQEF++LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI
Subjt:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPI

Query:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG
        NSETKDQ +PPNS+ + SE N+NQGL +TQM K PPV KERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIG
Subjt:  NSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIG

Query:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN
        GAIDFVKELEQLLESLEALRKERKG E  CK EQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQG LLK IVALEDLRLTVLHLN
Subjt:  GAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLN

Query:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR
        I TSQ  ATMLYSFNLKIEDECKL S EQIAATVNQIFSF+N+GRLVNEAK  F+
Subjt:  ITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR

A0A5D3DAW7 Transcription factor bHLH67 isoform X11.8e-15487.76Show/hide
Query:  YGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPN
        YGEYSETGCSEQEF++LGFEESEEVC LTSSLED+IPFLQMLQSVESQSF  KEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQ +PPN
Subjt:  YGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPN

Query:  SEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQL
        S+ + SE N+NQGL +TQM K PPV KERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSY+QRGDQASIIGGAIDFVKELEQL
Subjt:  SEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQL

Query:  LESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY
        LESLEALRKERKG E  CK EQ EV VASN RIGEGVCAEL+SEVAEIEVTMIQTHVNLKI+C KRQG LLK IVALEDLRLTVLHLNI TSQ  ATMLY
Subjt:  LESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY

Query:  SFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR
        SFNLKIEDECKL S EQIAATVNQIFSF+N+GRLVNEAK  F+
Subjt:  SFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNEAKANFR

A0A6J1CV34 LOW QUALITY PROTEIN: transcription factor bHLH675.7e-14578.24Show/hide
Query:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSP
        MERLQG I+PCFYGEYSE GCSEQ FTSL FEESEE  FLTS+LED++PFLQMLQSVESQ F  KEPNFQ+LLKLQHLNKPWE E VS+IQELVEL+SSP
Subjt:  MERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSP

Query:  INSETKDQKRPPNS----EGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ
        INSETKDQ + PNS    +G+SSE N+NQ   RTQM KAPPVTKERRKRKR++P KNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYI RGDQ
Subjt:  INSETKDQKRPPNS----EGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQ

Query:  ASIIGGAIDFVKELEQLLESLEALRKERKGTESGC--KGEQLEV----------GVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLL
        ASIIGGAI FVKELEQLLESLEA   +RKG E GC  KGE   V          G+ASNGRIGEGVCAE KSEVAEIEVTMIQTHVNLKIKCPKRQG LL
Subjt:  ASIIGGAIDFVKELEQLLESLEALRKERKGTESGC--KGEQLEV----------GVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLL

Query:  KAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNE-AKANFRQYSGNGLVRMAGQ
        KAIVALEDLRLTVLHLNI+TSQATATM YSFNLKIEDECK+GSAEQIAATV+QIFSF+NDGRLV E  K  F+      L    G+
Subjt:  KAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIFSFINDGRLVNE-AKANFRQYSGNGLVRMAGQ

SwissProt top hitse value%identityAlignment
O81037 Transcription factor bHLH707.3e-6549.85Show/hide
Query:  QEFTSLGFEESEEVCFLTSSLED-RIPFLQMLQSVESQS--FKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSES
        ++  S   EE E+     S L+D  IPFLQMLQ  E  S     K+P+F +LL LQ L KPWE       E+ E F SPI+SET      P+ EG+ +E+
Subjt:  QEFTSLGFEESEEVCFLTSSLED-RIPFLQMLQSVESQS--FKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSES

Query:  NRNQGLCRTQMEKAPP------------VTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE
          NQ L    +E A              +T+E+RKR+R+KPTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+SYIQRGDQASI+GGAIDFVK 
Subjt:  NRNQGLCRTQMEKAPP------------VTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE

Query:  LEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVA---EIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQ
        LEQ L+SLEA   +++  +S    EQ+    +        + A  K E +   +IE T+I++HVNLKI+C ++QG LL++I+ LE LR TVLHLNI TS 
Subjt:  LEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVA---EIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQ

Query:  ATATMLYSFNLKIEDECKLGSAEQIAATVNQIF
           ++ YSFNLK+EDEC LGSA++I A + QIF
Subjt:  ATATMLYSFNLKIEDECKLGSAEQIAATVNQIF

Q56YJ8 Transcription factor FAMA7.3e-4148.7Show/hide
Query:  VTKE--RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQ
        VTK+  + KRKR++ +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SY+QRGDQASIIGGAI+FV+ELEQLL+ LE+ ++ R   E+G     
Subjt:  VTKE--RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQ

Query:  LEVGVAS--------------NGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY
             +S               G +     G G+    AE KS +A++EV ++     +KI   +R G L+K I ALEDL L++LH NITT +   T+LY
Subjt:  LEVGVAS--------------NGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY

Query:  SFNLKIEDECKLGSAEQIAATVNQIFSFIN
        SFN+KI  E +  +AE IA+++ QIFSFI+
Subjt:  SFNLKIEDECKLGSAEQIAATVNQIFSFIN

Q700E4 Transcription factor bHLH672.3e-6645.33Show/hide
Query:  MERLQGPINPCFYGEYSETGCSE----QEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVE
        MER QG INPCF+    +    E     E  S  F+E EE      SL+D +PFLQMLQS +  S F  KEPNF +LL LQ L +PWE E    +++   
Subjt:  MERLQGPINPCFYGEYSETGCSE----QEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVE

Query:  LFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK-------APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND
         F SP+ SET             Q+ P +   M+  S+ +  L      K          +T+E+RKR+++KP+KN EE+E+QR+ HIAVERNRRRQMN+
Subjt:  LFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK-------APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND

Query:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIK
        H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SLE+ ++ ++ + S      L    G++SN         E ++ + +IE T+IQ HV+LK++
Subjt:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIK

Query:  CPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIF
        C K+QG LLK I++LE L+LTVLHLNITTS + +++ YSFNLK+EDEC L SA++I A V++IF
Subjt:  CPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIF

Q9M128 Transcription factor bHLH571.7e-5344.59Show/hide
Query:  SSLEDRIPFLQMLQSVESQSFKDKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTK
        +++E++IPFLQMLQ +E   F   EPN   QSLL++Q L                   S +  ET  ++ P  ++    +     G             K
Subjt:  SSLEDRIPFLQMLQSVESQSFKDKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTK

Query:  ERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVG-
        E+RKRKR++  KNK+EVE+QRMTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  K + GT+   K        
Subjt:  ERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVG-

Query:  --VASNGRIG-------EGVCAEL-KSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAE
            +N  I         G  A     +  E+E T+IQ HV+LK++C + +  +LKAIV++E+L+L +LHL I++S     ++YSFNLK+ED CKLGSA+
Subjt:  --VASNGRIG-------EGVCAEL-KSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAE

Query:  QIAATVNQIFSFIN
        +IA  V+QIF  IN
Subjt:  QIAATVNQIFSFIN

Q9SK91 Transcription factor bHLH944.0e-3944.25Show/hide
Query:  GLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERK
        GL    +E  PP  + RRKR+R++  KNKEE+E+QRMTHIAVERNRR+QMN++L V++SL+P+SY QRGDQASI+GGAI++VKELE +L+S+E  R    
Subjt:  GLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERK

Query:  GTESGCKGEQLEVGVASN-------GRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLK
          +         VG  ++                E  S  AEIEVT+ ++H N+KI   K+   LLK I +L+ LRLT+LHLN+TT     ++LYS +++
Subjt:  GTESGCKGEQLEVGVASN-------GRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLK

Query:  IEDECKLGSAEQIAATVNQIFSFIND
        +E+  +L + + IA  +NQ    I +
Subjt:  IEDECKLGSAEQIAATVNQIFSFIND

Arabidopsis top hitse value%identityAlignment
AT2G46810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.2e-6649.85Show/hide
Query:  QEFTSLGFEESEEVCFLTSSLED-RIPFLQMLQSVESQS--FKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSES
        ++  S   EE E+     S L+D  IPFLQMLQ  E  S     K+P+F +LL LQ L KPWE       E+ E F SPI+SET      P+ EG+ +E+
Subjt:  QEFTSLGFEESEEVCFLTSSLED-RIPFLQMLQSVESQS--FKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSES

Query:  NRNQGLCRTQMEKAPP------------VTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE
          NQ L    +E A              +T+E+RKR+R+KPTKN EE+ESQRMTHIAVERNRRRQMN HLN ++S+IP+SYIQRGDQASI+GGAIDFVK 
Subjt:  NRNQGLCRTQMEKAPP------------VTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKE

Query:  LEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVA---EIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQ
        LEQ L+SLEA   +++  +S    EQ+    +        + A  K E +   +IE T+I++HVNLKI+C ++QG LL++I+ LE LR TVLHLNI TS 
Subjt:  LEQLLESLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVA---EIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQ

Query:  ATATMLYSFNLKIEDECKLGSAEQIAATVNQIF
           ++ YSFNLK+EDEC LGSA++I A + QIF
Subjt:  ATATMLYSFNLKIEDECKLGSAEQIAATVNQIF

AT3G24140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.2e-4248.7Show/hide
Query:  VTKE--RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQ
        VTK+  + KRKR++ +K  EEVESQRMTHIAVERNRR+QMN+HL V++SL+P SY+QRGDQASIIGGAI+FV+ELEQLL+ LE+ ++ R   E+G     
Subjt:  VTKE--RRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQ

Query:  LEVGVAS--------------NGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY
             +S               G +     G G+    AE KS +A++EV ++     +KI   +R G L+K I ALEDL L++LH NITT +   T+LY
Subjt:  LEVGVAS--------------NGRI-----GEGV---CAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLY

Query:  SFNLKIEDECKLGSAEQIAATVNQIFSFIN
        SFN+KI  E +  +AE IA+++ QIFSFI+
Subjt:  SFNLKIEDECKLGSAEQIAATVNQIFSFIN

AT3G61950.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-6745.33Show/hide
Query:  MERLQGPINPCFYGEYSETGCSE----QEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVE
        MER QG INPCF+    +    E     E  S  F+E EE      SL+D +PFLQMLQS +  S F  KEPNF +LL LQ L +PWE E    +++   
Subjt:  MERLQGPINPCFYGEYSETGCSE----QEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVE

Query:  LFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK-------APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND
         F SP+ SET             Q+ P +   M+  S+ +  L      K          +T+E+RKR+++KP+KN EE+E+QR+ HIAVERNRRRQMN+
Subjt:  LFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK-------APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMND

Query:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIK
        H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SLE+ ++ ++ + S      L    G++SN         E ++ + +IE T+IQ HV+LK++
Subjt:  HLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIK

Query:  CPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIF
        C K+QG LLK I++LE L+LTVLHLNITTS + +++ YSFNLK+EDEC L SA++I A V++IF
Subjt:  CPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQIAATVNQIF

AT3G61950.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.7e-5745.78Show/hide
Query:  MLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK------
        MLQS +  S F  KEPNF +LL LQ L +PWE E    +++    F SP+ SET             Q+ P +   M+  S+ +  L      K      
Subjt:  MLQSVESQS-FKDKEPNFQSLLKLQHLNKPWE-EGVSKIQELVELFSSPINSET-----------KDQKRPPNSEGMSSESNRNQGLCRTQMEK------

Query:  -APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKG
            +T+E+RKR+++KP+KN EE+E+QR+ HIAVERNRRRQMN+H+N +++L+P SYIQRGDQASI+GGAI++VK LEQ+++SLE+ ++ ++ + S    
Subjt:  -APPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKG

Query:  EQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQI
          L    G++SN         E ++ + +IE T+IQ HV+LK++C K+QG LLK I++LE L+LTVLHLNITTS + +++ YSFNLK+EDEC L SA++I
Subjt:  EQLE--VGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAEQI

Query:  AATVNQIF
         A V++IF
Subjt:  AATVNQIF

AT4G01460.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.2e-5444.59Show/hide
Query:  SSLEDRIPFLQMLQSVESQSFKDKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTK
        +++E++IPFLQMLQ +E   F   EPN   QSLL++Q L                   S +  ET  ++ P  ++    +     G             K
Subjt:  SSLEDRIPFLQMLQSVESQSFKDKEPN--FQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTK

Query:  ERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVG-
        E+RKRKR++  KNK+EVE+QRMTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  K + GT+   K        
Subjt:  ERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGTESGCKGEQLEVG-

Query:  --VASNGRIG-------EGVCAEL-KSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAE
            +N  I         G  A     +  E+E T+IQ HV+LK++C + +  +LKAIV++E+L+L +LHL I++S     ++YSFNLK+ED CKLGSA+
Subjt:  --VASNGRIG-------EGVCAEL-KSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKLGSAE

Query:  QIAATVNQIFSFIN
        +IA  V+QIF  IN
Subjt:  QIAATVNQIFSFIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAGATACGAGATGAGTTGAATGTGGCAATGAGAGGAGAGGAGAGGAAGAATGTAAGGAGGAGAGAGTATGTGGTTGAAGCGCAAAGTGGCGACAATGGAAAGGG
GCCCTTGAAAAGATTGTATTTGAAAGAATTACACAGACAACCCAACCCACCCAGCAGCCACTGCCTTGCTAACGAGGAAAGTATACAGCAGTCTCCTGCATACAAAATTT
GTGTCCCGACACATTTAATGCTGAATCAAGACCATTGCCATATCACTGTGGAATGTATGTTGGTTACCCCTAGACAGAGAAGTAGAAAGGCTAAATTAACACACTTCAAG
ACCTGCTCTGCTGTTCAAATGGAGAGGCTCCAAGGACCCATTAATCCTTGCTTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCACAAGCTTGGGATT
TGAAGAATCAGAAGAAGTTTGTTTCCTAACCTCAAGTTTGGAAGATAGAATACCGTTCCTTCAGATGCTGCAGAGTGTAGAATCGCAATCATTCAAGGACAAGGAGCCTA
ACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAAACAAACCGTGGGAAGAGGGAGTTAGTAAAATTCAGGAGCTTGTAGAGTTGTTTTCTTCGCCAATAAACTCAGAAACG
AAGGACCAAAAACGACCTCCAAATTCGGAGGGAATGAGTTCAGAGAGCAACCGAAATCAAGGCTTATGCCGGACACAAATGGAAAAGGCTCCTCCAGTCACAAAGGAAAG
AAGAAAACGAAAGAGATCGAAACCAACAAAGAACAAGGAAGAAGTAGAGAGCCAAAGAATGACCCATATTGCCGTCGAGCGCAACCGGAGACGGCAAATGAACGACCATC
TCAACGTTATCAAGTCCCTCATACCTACCTCCTACATACAGAGGGGTGACCAGGCATCCATAATTGGGGGTGCAATTGACTTCGTGAAGGAATTGGAGCAGCTACTAGAA
TCTTTGGAAGCACTGAGGAAAGAAAGGAAGGGAACGGAAAGTGGGTGTAAGGGTGAGCAATTAGAAGTGGGAGTGGCCTCAAATGGGAGAATAGGAGAAGGGGTTTGCGC
AGAGCTCAAGTCAGAAGTGGCTGAGATAGAAGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAAATGCCCCAAAAGGCAAGGCCTGTTGTTGAAAGCCATTGTTG
CTTTGGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATTACTACCTCGCAAGCCACTGCCACCATGCTTTACTCCTTCAATCTAAAGATAGAAGATGAATGTAAGCTA
GGATCAGCGGAGCAGATTGCAGCAACGGTTAATCAAATATTCAGTTTTATCAACGATGGCAGACTGGTCAATGAGGCAAAGGCAAATTTCAGGCAGTACAGTGGCAATGG
GTTGGTCAGGATGGCAGGACAGATGATGTTTACGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCAGATACGAGATGAGTTGAATGTGGCAATGAGAGGAGAGGAGAGGAAGAATGTAAGGAGGAGAGAGTATGTGGTTGAAGCGCAAAGTGGCGACAATGGAAAGGG
GCCCTTGAAAAGATTGTATTTGAAAGAATTACACAGACAACCCAACCCACCCAGCAGCCACTGCCTTGCTAACGAGGAAAGTATACAGCAGTCTCCTGCATACAAAATTT
GTGTCCCGACACATTTAATGCTGAATCAAGACCATTGCCATATCACTGTGGAATGTATGTTGGTTACCCCTAGACAGAGAAGTAGAAAGGCTAAATTAACACACTTCAAG
ACCTGCTCTGCTGTTCAAATGGAGAGGCTCCAAGGACCCATTAATCCTTGCTTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCACAAGCTTGGGATT
TGAAGAATCAGAAGAAGTTTGTTTCCTAACCTCAAGTTTGGAAGATAGAATACCGTTCCTTCAGATGCTGCAGAGTGTAGAATCGCAATCATTCAAGGACAAGGAGCCTA
ACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAAACAAACCGTGGGAAGAGGGAGTTAGTAAAATTCAGGAGCTTGTAGAGTTGTTTTCTTCGCCAATAAACTCAGAAACG
AAGGACCAAAAACGACCTCCAAATTCGGAGGGAATGAGTTCAGAGAGCAACCGAAATCAAGGCTTATGCCGGACACAAATGGAAAAGGCTCCTCCAGTCACAAAGGAAAG
AAGAAAACGAAAGAGATCGAAACCAACAAAGAACAAGGAAGAAGTAGAGAGCCAAAGAATGACCCATATTGCCGTCGAGCGCAACCGGAGACGGCAAATGAACGACCATC
TCAACGTTATCAAGTCCCTCATACCTACCTCCTACATACAGAGGGGTGACCAGGCATCCATAATTGGGGGTGCAATTGACTTCGTGAAGGAATTGGAGCAGCTACTAGAA
TCTTTGGAAGCACTGAGGAAAGAAAGGAAGGGAACGGAAAGTGGGTGTAAGGGTGAGCAATTAGAAGTGGGAGTGGCCTCAAATGGGAGAATAGGAGAAGGGGTTTGCGC
AGAGCTCAAGTCAGAAGTGGCTGAGATAGAAGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAAATGCCCCAAAAGGCAAGGCCTGTTGTTGAAAGCCATTGTTG
CTTTGGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATTACTACCTCGCAAGCCACTGCCACCATGCTTTACTCCTTCAATCTAAAGATAGAAGATGAATGTAAGCTA
GGATCAGCGGAGCAGATTGCAGCAACGGTTAATCAAATATTCAGTTTTATCAACGATGGCAGACTGGTCAATGAGGCAAAGGCAAATTTCAGGCAGTACAGTGGCAATGG
GTTGGTCAGGATGGCAGGACAGATGATGTTTACGGCTTAA
Protein sequenceShow/hide protein sequence
MMQIRDELNVAMRGEERKNVRRREYVVEAQSGDNGKGPLKRLYLKELHRQPNPPSSHCLANEESIQQSPAYKICVPTHLMLNQDHCHITVECMLVTPRQRSRKAKLTHFK
TCSAVQMERLQGPINPCFYGEYSETGCSEQEFTSLGFEESEEVCFLTSSLEDRIPFLQMLQSVESQSFKDKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSET
KDQKRPPNSEGMSSESNRNQGLCRTQMEKAPPVTKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYIQRGDQASIIGGAIDFVKELEQLLE
SLEALRKERKGTESGCKGEQLEVGVASNGRIGEGVCAELKSEVAEIEVTMIQTHVNLKIKCPKRQGLLLKAIVALEDLRLTVLHLNITTSQATATMLYSFNLKIEDECKL
GSAEQIAATVNQIFSFINDGRLVNEAKANFRQYSGNGLVRMAGQMMFTA