; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019189 (gene) of Snake gourd v1 genome

Gene IDTan0019189
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor bHLH111
Genome locationLG04:48391597..48398379
RNA-Seq ExpressionTan0019189
SyntenyTan0019189
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140441.1 transcription factor bHLH111 [Cucumis sativus]7.1e-17373.84Show/hide
Query:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL
        MA+EC ESSVATS STP  WWD   NHN+H        YNSHWL QNPNS NSSC+EDVSIS SSFTNA          SNHLL  H SD N LWTQ+LL
Subjt:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL

Query:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLITTHSHMLENERLLKLSNLVNTWSIA
        NIGN VEL+SNE+ I  NFLETI+S+ SMSTTGIFE TACSDYLKKMDT     NNW +T  TF    NNNN L+T+H+HML+NER LKLSNLVN WSIA
Subjt:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLITTHSHMLENERLLKLSNLVNTWSIA

Query:  LPSPDSHLRHL-MDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-
        LP+PD HLRHL MD++HDHLRA+T+P+H  L+PDG +   GL+   S   RRS  N             +N+GDYISFNGRL KPVV ING SNNP FK 
Subjt:  LPSPDSHLRHL-MDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-

Query:  SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK
        SLNLSADSKKQIHQ+  PTRISGRG+ GVS+EGKKKRSEE SSETSTKKAKQDNST +SNK+QQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK
Subjt:  SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK

Query:  FLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        FLQEQVQLLSNPYMKTNSYKDPWQSLERKE KGDGK+DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  FLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

XP_008460218.1 PREDICTED: transcription factor bHLH111 [Cucumis melo]1.4e-16873.26Show/hide
Query:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL
        MA+EC ESSVATS STP  WWD   NHN+H        YNSHWL  NPNS NSSC+EDVSIS SSFT             NHLL  H SD N LWTQ+LL
Subjt:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL

Query:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLIT-THSHMLENERLLKLSNLVNTWSI
        NIGN VELQSNE++I  NFLETI+S+ SMSTTGIFEPTACSDYLKKMDT     NNW +T  TF    NNNN L+T + +HML+NER LKLSNLVN WSI
Subjt:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLIT-THSHMLENERLLKLSNLVNTWSI

Query:  ALPSPDSHLRHLMDEEHDHLRATTVPSHGGLD-PDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDIN-GSNNPSFK
        ALPSPD HLRHL D++HDHLRATTVP+H  L+  DG V   GL+   S   RR+  N             +N+GDYISFNGRL KP+V IN  SNNP FK
Subjt:  ALPSPDSHLRHLMDEEHDHLRATTVPSHGGLD-PDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDIN-GSNNPSFK

Query:  -SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI
         SLNLSADSKKQIHQ+  PTRISGRG+ GVS+EGKKKRSEE SSETSTKKAKQDNST  SNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI
Subjt:  -SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI

Query:  KFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        KFLQEQVQLLSNPYMKTNSYKDPWQSLERKE KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  KFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

XP_022158889.1 transcription factor bHLH111 isoform X1 [Momordica charantia]1.6e-17772.92Show/hide
Query:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI
        M DEC ESSVATSSTPP WWD  NHN+H         Y+SHW HQNPNS N+SCD+DVS+S SSF     HS LTV+SS  L     SDNQLWTQ+LLNI
Subjt:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI

Query:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP
        GNGVELQS+EQEIG NFLE I+SK++STTGIFE  ACSDYLKKMDT   NNW +   F      NNNNGLITTHSH  ENERLLKLS+LVNTWSIALP  
Subjt:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP

Query:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN
               MD+E DHLRA TVP  G GL  +    G VA   LE  G+ +FRRS     F N IG KQ+      DNA TRNF DYISFNGRLGKPV++IN
Subjt:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN

Query:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE
        G NNP FKSLNLSAD+KKQIHQ S PTRISGRG+GVSSEGKKKRSEE SET+TKK KQ+N+TAASNKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTE
Subjt:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE

Query:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        TIGYIKFLQEQ+QLLSNPYMKTN+YKDPW+S ERK+ KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

XP_022158891.1 transcription factor bHLH111 isoform X2 [Momordica charantia]3.4e-16770.42Show/hide
Query:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI
        M DEC ESSVATSSTPP WWD  NHN+H         Y+SHW HQNPNS N+SCD+DVS+S SSF     HS LTV+SS  L     SDNQLWTQ+LLNI
Subjt:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI

Query:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP
        GNGVELQS+EQEIG NFLE I+SK++STTGIFE  ACSDYLKKMDT   NNW +   F      NNNNGLITTHSH  ENERLLKLS+LVNTWSIALP  
Subjt:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP

Query:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN
               MD+E DHLRA TVP  G GL  +    G VA   LE  G+ +FRRS     F N IG KQ+      DNA TRNF DYISFNGRLGKPV++IN
Subjt:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN

Query:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE
        G NNP FKSLNLSAD+KKQIHQ S PTRISGRG+GVSSEGKKKRSEE SET+TKK KQ+N+TAASNKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTE
Subjt:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE

Query:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        TIGYIKFLQEQ+Q             DPW+S ERK+ KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

XP_038877207.1 transcription factor bHLH111 isoform X1 [Benincasa hispida]3.3e-17875.75Show/hide
Query:  MADECIESSVATS-STPPIWWD-NHNNH-------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNIGN
        MA+EC ESSVATS STP  WWD NHN H       YNSHWL QNPNS NSSC++D     SSFTNASNHS LT   SNHLL   +  N  WTQ+LLNIGN
Subjt:  MADECIESSVATS-STPPIWWD-NHNNH-------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNIGN

Query:  GVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNW-ETLHTF-----NNNNGLITT-HSHMLENERLLKLSNLVNTWSIALPSPD
         VELQSNE++I ANFLETI+S+SMST GIFEPTACSDYLKKMDT   NNW +T  TF     NNNNGL+TT H+HML+NER LKLSNLVNTWSIALP  D
Subjt:  GVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNW-ETLHTF-----NNNNGLITT-HSHMLENERLLKLSNLVNTWSIALPSPD

Query:  SHLRHLMDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-SLNLSA
        + LRHLM+++HDHLRATTVP+H  L+PDG V   GL+   S + RRS HN             +N+GDYISFNGRL K VV ING SNNP FK SLNLSA
Subjt:  SHLRHLMDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-SLNLSA

Query:  DSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQL
        DSKKQIHQ+S PTRISGRG+GV SEGKKKRSEESSETSTKKAKQDNST ASNK+QQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQL
Subjt:  DSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQL

Query:  LSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        LSNPYMKTNSYKDPWQSLERKE KG+GK DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  LSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

TrEMBL top hitse value%identityAlignment
A0A0A0KQW5 BHLH domain-containing protein3.4e-17373.84Show/hide
Query:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL
        MA+EC ESSVATS STP  WWD   NHN+H        YNSHWL QNPNS NSSC+EDVSIS SSFTNA          SNHLL  H SD N LWTQ+LL
Subjt:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL

Query:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLITTHSHMLENERLLKLSNLVNTWSIA
        NIGN VEL+SNE+ I  NFLETI+S+ SMSTTGIFE TACSDYLKKMDT     NNW +T  TF    NNNN L+T+H+HML+NER LKLSNLVN WSIA
Subjt:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLITTHSHMLENERLLKLSNLVNTWSIA

Query:  LPSPDSHLRHL-MDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-
        LP+PD HLRHL MD++HDHLRA+T+P+H  L+PDG +   GL+   S   RRS  N             +N+GDYISFNGRL KPVV ING SNNP FK 
Subjt:  LPSPDSHLRHL-MDEEHDHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDING-SNNPSFK-

Query:  SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK
        SLNLSADSKKQIHQ+  PTRISGRG+ GVS+EGKKKRSEE SSETSTKKAKQDNST +SNK+QQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK
Subjt:  SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIK

Query:  FLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        FLQEQVQLLSNPYMKTNSYKDPWQSLERKE KGDGK+DLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  FLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

A0A1S3CD90 transcription factor bHLH1116.7e-16973.26Show/hide
Query:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL
        MA+EC ESSVATS STP  WWD   NHN+H        YNSHWL  NPNS NSSC+EDVSIS SSFT             NHLL  H SD N LWTQ+LL
Subjt:  MADECIESSVATS-STPPIWWD---NHNNH--------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASD-NQLWTQLLL

Query:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLIT-THSHMLENERLLKLSNLVNTWSI
        NIGN VELQSNE++I  NFLETI+S+ SMSTTGIFEPTACSDYLKKMDT     NNW +T  TF    NNNN L+T + +HML+NER LKLSNLVN WSI
Subjt:  NIGNGVELQSNEQEIGANFLETITSK-SMSTTGIFEPTACSDYLKKMDT-----NNW-ETLHTF----NNNNGLIT-THSHMLENERLLKLSNLVNTWSI

Query:  ALPSPDSHLRHLMDEEHDHLRATTVPSHGGLD-PDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDIN-GSNNPSFK
        ALPSPD HLRHL D++HDHLRATTVP+H  L+  DG V   GL+   S   RR+  N             +N+GDYISFNGRL KP+V IN  SNNP FK
Subjt:  ALPSPDSHLRHLMDEEHDHLRATTVPSHGGLD-PDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDIN-GSNNPSFK

Query:  -SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI
         SLNLSADSKKQIHQ+  PTRISGRG+ GVS+EGKKKRSEE SSETSTKKAKQDNST  SNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI
Subjt:  -SLNLSADSKKQIHQVSLPTRISGRGN-GVSSEGKKKRSEE-SSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYI

Query:  KFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        KFLQEQVQLLSNPYMKTNSYKDPWQSLERKE KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  KFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

A0A6J1E0R5 transcription factor bHLH111 isoform X17.9e-17872.92Show/hide
Query:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI
        M DEC ESSVATSSTPP WWD  NHN+H         Y+SHW HQNPNS N+SCD+DVS+S SSF     HS LTV+SS  L     SDNQLWTQ+LLNI
Subjt:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI

Query:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP
        GNGVELQS+EQEIG NFLE I+SK++STTGIFE  ACSDYLKKMDT   NNW +   F      NNNNGLITTHSH  ENERLLKLS+LVNTWSIALP  
Subjt:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP

Query:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN
               MD+E DHLRA TVP  G GL  +    G VA   LE  G+ +FRRS     F N IG KQ+      DNA TRNF DYISFNGRLGKPV++IN
Subjt:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN

Query:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE
        G NNP FKSLNLSAD+KKQIHQ S PTRISGRG+GVSSEGKKKRSEE SET+TKK KQ+N+TAASNKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTE
Subjt:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE

Query:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        TIGYIKFLQEQ+QLLSNPYMKTN+YKDPW+S ERK+ KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

A0A6J1E2A3 transcription factor bHLH111 isoform X21.6e-16770.42Show/hide
Query:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI
        M DEC ESSVATSSTPP WWD  NHN+H         Y+SHW HQNPNS N+SCD+DVS+S SSF     HS LTV+SS  L     SDNQLWTQ+LLNI
Subjt:  MADECIESSVATSSTPPIWWD--NHNNH---------YNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNI

Query:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP
        GNGVELQS+EQEIG NFLE I+SK++STTGIFE  ACSDYLKKMDT   NNW +   F      NNNNGLITTHSH  ENERLLKLS+LVNTWSIALP  
Subjt:  GNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDT---NNWETLHTF------NNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSP

Query:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN
               MD+E DHLRA TVP  G GL  +    G VA   LE  G+ +FRRS     F N IG KQ+      DNA TRNF DYISFNGRLGKPV++IN
Subjt:  DSHLRHLMDEEHDHLRATTVPSHG-GLDPD----GAVAQAGLESGGSGVFRRS-----FHNLIGVKQF-----YDNANTRNFGDYISFNGRLGKPVVDIN

Query:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE
        G NNP FKSLNLSAD+KKQIHQ S PTRISGRG+GVSSEGKKKRSEE SET+TKK KQ+N+TAASNKMQQPKVK+GDRITALQQIVSPFGKTDTASVLTE
Subjt:  GSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTE

Query:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        TIGYIKFLQEQ+Q             DPW+S ERK+ KGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
Subjt:  TIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

A0A6J1FVQ7 transcription factor bHLH1112.2e-15669.3Show/hide
Query:  MADECIESSVATSSTPPIWWDNHNNHYNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSN------HLLSDHASDNQLWTQLLLNIGNGVE
        MAD+C ++SVATSSTPP WWD HN+H++ +       +++SSCD+DVSIS SSFTNASNHS LT+ SS+      HLL  HASD+ LWTQ+LLNIGNGVE
Subjt:  MADECIESSVATSSTPPIWWDNHNNHYNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSN------HLLSDHASDNQLWTQLLLNIGNGVE

Query:  LQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDTN-NW-ETLHTFNNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSPDSHLRHLMDEEH
              +I  NFL              EP ACSDYLKKMDTN NW +T  TFNNNNGL+TT    LENERLLKLSNLVNTWSIALPSPD+HLRHLMD+E 
Subjt:  LQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDTN-NW-ETLHTFNNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSPDSHLRHLMDEEH

Query:  DHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDN--ANTRNFGDYISFNGRLGKPVVDINGSNNPSFKSLNLSADSKKQIHQVSL
          LR TT+     LDPD     A L+   S  FRRS HN +  K FYDN  A TRN+GDYISFN R  KP++ +    NPS KSLNLSA SKKQI Q+S 
Subjt:  DHLRATTVPSHGGLDPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDN--ANTRNFGDYISFNGRLGKPVVDINGSNNPSFKSLNLSADSKKQIHQVSL

Query:  PTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNS-TAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNS
        PTR SGRG+GV +EGKKKRSEESSET TKKAKQDNS T AS K+QQPKVKIGDRIT LQQIVSPFGKTDTASVL ETIGYIKFLQEQVQLL+NPYMK NS
Subjt:  PTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNS-TAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNS

Query:  YKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
        YKD WQSLERKESKG+GK++LRSRGLCLVPISCTPQVYREN+GSDYWTPYRGCFYR
Subjt:  YKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

SwissProt top hitse value%identityAlignment
Q8GXT3 Transcription factor bHLH1232.2e-2348.61Show/hide
Query:  SSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE
        SS  + KR     + + K+AK +   A+ +   + K K+GDRI ALQQ+VSPFGKTD ASVL+E I YIKFL +QV  LSNPYMK+ +     QS    E
Subjt:  SSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE

Query:  SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
         +   + DLRSRGLCLVP+S T  V  + T  D+WTP  G  +R
Subjt:  SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

Q8S3D1 Transcription factor bHLH685.3e-2240.21Show/hide
Query:  SGRGNGVS-SEGKKKRSEESSE--------TSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYM
        S   NG+  SEG+    + SSE        ++ KK +   S ++ + ++  K K+G RI AL Q+VSPFGKTDTASVL+E IGYI+FLQ Q++ LS+PY 
Subjt:  SGRGNGVS-SEGKKKRSEESSE--------TSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYM

Query:  KTNSY------------------KDPWQ---------------SLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG
         T +                   +DP Q               S + + +  + K DLRSRGLCLVPISCT QV  +N G+DYW P  G
Subjt:  KTNSY------------------KDPWQ---------------SLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG

Q9FYJ6 Transcription factor bHLH1112.1e-4744.78Show/hide
Query:  NERLLKLSNLVNT-WSIALP-SPD--SHLRHLMDEEHDHLRATTVPSHGGL----DPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYI
        ++RL KL++LV   WSIA P +PD   +L H  D  HDH +   +  +       + +      G   GGS      FH+ I         ++R+F D  
Subjt:  NERLLKLSNLVNT-WSIALP-SPD--SHLRHLMDEEHDHLRATTVPSHGGL----DPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYI

Query:  SFNGRLGKPVVDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIV
            RL +P+ DIN S  P FK+LN+S  +KK+ HQ +    ++    G ++ GKKKR EE S+  +KKAK    +  S + + PK K+ D+IT LQQIV
Subjt:  SFNGRLGKPVVDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIV

Query:  SPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE--SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR
        SPFGKTDTASVL E I YI F QEQV+LLS PYMK +S KDPW   +R++   +G   +DLRSRGLCLVPIS TP  YR+N+ +DYW P YRG  YR
Subjt:  SPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE--SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR

Q9LT67 Transcription factor bHLH1132.2e-2037.7Show/hide
Query:  VDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTAS
        +D   SN+      +  + +KK+          +G GNG  S+  +K  ++       K  Q+ S+    K++  K ++G+RI ALQQ+VSP+GKTD AS
Subjt:  VDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTAS

Query:  VLTETIGYIKFLQEQVQLLSNPYMKTNSYK------DPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP
        VL E +GYIKFLQ+Q+Q+L +PY+  +S        D   +++ K        DLRSRGLCLVP+S T  V   N G+D+W+P
Subjt:  VLTETIGYIKFLQEQVQLLSNPYMKTNSYK------DPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP

Q9SFZ3 Transcription factor bHLH1101.1e-1944.9Show/hide
Query:  SSEGKKKR---SEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQS-L
        +SEGK+     + ++ E ++KK + + S ++    +  K K+GDRI ALQQ+VSPFGKTDTASVL E IGYIKFLQ Q++ LS PYM+ +  +    S L
Subjt:  SSEGKKKR---SEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQS-L

Query:  ERKESKGDGK--IDLRSRGLCLVPISCTPQVYRE------NTGSDYW
          +  +GD +   DLRSRGLCLVP+SC   V  +        G+ +W
Subjt:  ERKESKGDGK--IDLRSRGLCLVPISCTPQVYRE------NTGSDYW

Arabidopsis top hitse value%identityAlignment
AT1G31050.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein9.2e-5437.42Show/hide
Query:  MADECIESSVATSSTPPIWWDNHNNHYNSH--------WLHQNPNSN---NSSCDEDVSISPSSFTNASNHSGLTVQSSNH---------------LLSD
        + +EC  SS         WW++  +H+N H        + H++ N+N   N+SC+ED ++S S+   ASN   LT +SSNH               LL D
Subjt:  MADECIESSVATSSTPPIWWDNHNNHYNSH--------WLHQNPNSN---NSSCDEDVSISPSSFTNASNHSGLTVQSSNH---------------LLSD

Query:  H--ASDNQLWTQLLL-NIGNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDTNNWETLHTFNNNNGLITTHSHM----LENERLLKLS
        H  +S N LW+   L     G ++  +   I +       S + S    FEP AC +                 N NG I   + +      ++RL KL+
Subjt:  H--ASDNQLWTQLLL-NIGNGVELQSNEQEIGANFLETITSKSMSTTGIFEPTACSDYLKKMDTNNWETLHTFNNNNGLITTHSHM----LENERLLKLS

Query:  NLVNT-WSIALP-SPD--SHLRHLMDEEHDHLRATTVPSHGGL----DPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGK
        +LV   WSIA P +PD   +L H  D  HDH +   +  +       + +      G   GGS      FH+ I         ++R+F D      RL +
Subjt:  NLVNT-WSIALP-SPD--SHLRHLMDEEHDHLRATTVPSHGGL----DPDGAVAQAGLESGGSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGK

Query:  PVVDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDT
        P+ DIN S  P FK+LN+S  +KK+ HQ +    ++    G ++ GKKKR EE S+  +KKAK    +  S + + PK K+ D+IT LQQIVSPFGKTDT
Subjt:  PVVDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDT

Query:  ASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE--SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR
        ASVL E I YI F QEQV+LLS PYMK +S KDPW   +R++   +G   +DLRSRGLCLVPIS TP  YR+N+ +DYW P YRG  YR
Subjt:  ASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE--SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP-YRGCFYR

AT1G49830.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.7e-2147.5Show/hide
Query:  KKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKI-DLRSRGLCL
        K+ K+D   ++    +  K K+G++IT LQ +VSP+GKTD ASVL ET+GYIKFLQ+QVQ+LS PY K N    P    +  E     K+ +LRS GLCL
Subjt:  KKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKI-DLRSRGLCL

Query:  VPISCTPQVYRENTGSDYWT
        VP++ T  V   N G+D W+
Subjt:  VPISCTPQVYRENTGSDYWT

AT3G19500.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-2137.7Show/hide
Query:  VDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTAS
        +D   SN+      +  + +KK+          +G GNG  S+  +K  ++       K  Q+ S+    K++  K ++G+RI ALQQ+VSP+GKTD AS
Subjt:  VDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTAS

Query:  VLTETIGYIKFLQEQVQLLSNPYMKTNSYK------DPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP
        VL E +GYIKFLQ+Q+Q+L +PY+  +S        D   +++ K        DLRSRGLCLVP+S T  V   N G+D+W+P
Subjt:  VLTETIGYIKFLQEQVQLLSNPYMKTNSYK------DPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTP

AT3G20640.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.5e-2448.61Show/hide
Query:  SSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE
        SS  + KR     + + K+AK +   A+ +   + K K+GDRI ALQQ+VSPFGKTD ASVL+E I YIKFL +QV  LSNPYMK+ +     QS    E
Subjt:  SSEGKKKRSEESSETSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKE

Query:  SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR
         +   + DLRSRGLCLVP+S T  V  + T  D+WTP  G  +R
Subjt:  SKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRGCFYR

AT4G29100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.8e-2340.21Show/hide
Query:  SGRGNGVS-SEGKKKRSEESSE--------TSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYM
        S   NG+  SEG+    + SSE        ++ KK +   S ++ + ++  K K+G RI AL Q+VSPFGKTDTASVL+E IGYI+FLQ Q++ LS+PY 
Subjt:  SGRGNGVS-SEGKKKRSEESSE--------TSTKKAKQDNSTAASNKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYM

Query:  KTNSY------------------KDPWQ---------------SLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG
         T +                   +DP Q               S + + +  + K DLRSRGLCLVPISCT QV  +N G+DYW P  G
Subjt:  KTNSY------------------KDPWQ---------------SLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGACGAATGCATTGAGAGCTCTGTTGCGACCTCCTCCACTCCGCCCATTTGGTGGGATAACCATAATAATCACTATAATTCCCATTGGCTTCACCAAAACCCTAA
TTCCAACAACTCCTCTTGTGATGAAGATGTTTCCATCTCACCCTCTTCCTTTACCAACGCTTCCAATCACTCTGGACTCACTGTTCAATCCTCCAACCACCTTCTTTCTG
ATCATGCTTCCGATAATCAACTCTGGACCCAACTTTTGCTTAACATAGGAAACGGTGTGGAATTGCAAAGCAATGAACAAGAGATAGGAGCAAATTTCCTTGAGACAATA
ACATCGAAAAGCATGTCGACCACCGGAATCTTCGAACCCACCGCATGTAGCGATTACCTTAAAAAAATGGACACCAATAATTGGGAGACTCTTCACACCTTCAACAACAA
CAATGGCCTAATTACAACCCATAGCCACATGCTTGAAAACGAGAGGTTATTGAAGCTTTCGAATCTCGTGAACACGTGGTCCATTGCTCTGCCGAGCCCAGACTCCCATC
TGCGGCACCTGATGGACGAGGAACATGACCATCTCCGAGCCACGACTGTGCCGAGCCATGGGGGACTCGACCCAGACGGTGCCGTGGCTCAGGCTGGGCTCGAGTCAGGA
GGTTCGGGTGTGTTTAGGAGGTCGTTTCATAATCTGATTGGTGTGAAGCAGTTTTATGATAACGCGAATACAAGAAATTTTGGTGATTATATTTCATTTAACGGGCGATT
GGGTAAGCCGGTGGTCGATATTAATGGTTCGAATAATCCTTCTTTTAAGTCGTTGAATTTGTCTGCTGATAGTAAGAAGCAAATTCACCAAGTTTCTTTGCCGACAAGAA
TCAGTGGACGAGGAAATGGAGTTTCGAGCGAAGGGAAGAAGAAAAGGTCCGAAGAATCTTCTGAAACTTCCACTAAAAAGGCAAAGCAAGATAACTCTACAGCTGCATCT
AATAAGATGCAGCAACCAAAGGTCAAAATTGGAGACAGGATAACGGCCCTTCAACAAATTGTGTCGCCATTTGGAAAGACTGATACCGCGTCGGTTCTAACCGAAACCAT
AGGATACATAAAGTTCTTACAGGAGCAAGTCCAGTTACTGAGCAATCCTTACATGAAGACAAATTCATACAAGGATCCATGGCAAAGCTTGGAGAGAAAAGAATCAAAAG
GGGATGGAAAGATTGACCTAAGGAGCAGGGGGCTTTGTCTTGTTCCAATTTCATGTACGCCCCAAGTCTACAGGGAGAACACAGGATCTGACTATTGGACACCTTATAGA
GGATGTTTCTATAGATAG
mRNA sequenceShow/hide mRNA sequence
CTTCCTTCTTATATTCTATATTCCCCTTTGTTCCACTCACAATAACCAAACCAAATTACAGTTTGATCAGTAACAAAACAAGGAGATATAGGAGGAGAAGAAATCGCACT
TTGATCAAAGAAGTGATAATGGCAGACGAATGCATTGAGAGCTCTGTTGCGACCTCCTCCACTCCGCCCATTTGGTGGGATAACCATAATAATCACTATAATTCCCATTG
GCTTCACCAAAACCCTAATTCCAACAACTCCTCTTGTGATGAAGATGTTTCCATCTCACCCTCTTCCTTTACCAACGCTTCCAATCACTCTGGACTCACTGTTCAATCCT
CCAACCACCTTCTTTCTGATCATGCTTCCGATAATCAACTCTGGACCCAACTTTTGCTTAACATAGGAAACGGTGTGGAATTGCAAAGCAATGAACAAGAGATAGGAGCA
AATTTCCTTGAGACAATAACATCGAAAAGCATGTCGACCACCGGAATCTTCGAACCCACCGCATGTAGCGATTACCTTAAAAAAATGGACACCAATAATTGGGAGACTCT
TCACACCTTCAACAACAACAATGGCCTAATTACAACCCATAGCCACATGCTTGAAAACGAGAGGTTATTGAAGCTTTCGAATCTCGTGAACACGTGGTCCATTGCTCTGC
CGAGCCCAGACTCCCATCTGCGGCACCTGATGGACGAGGAACATGACCATCTCCGAGCCACGACTGTGCCGAGCCATGGGGGACTCGACCCAGACGGTGCCGTGGCTCAG
GCTGGGCTCGAGTCAGGAGGTTCGGGTGTGTTTAGGAGGTCGTTTCATAATCTGATTGGTGTGAAGCAGTTTTATGATAACGCGAATACAAGAAATTTTGGTGATTATAT
TTCATTTAACGGGCGATTGGGTAAGCCGGTGGTCGATATTAATGGTTCGAATAATCCTTCTTTTAAGTCGTTGAATTTGTCTGCTGATAGTAAGAAGCAAATTCACCAAG
TTTCTTTGCCGACAAGAATCAGTGGACGAGGAAATGGAGTTTCGAGCGAAGGGAAGAAGAAAAGGTCCGAAGAATCTTCTGAAACTTCCACTAAAAAGGCAAAGCAAGAT
AACTCTACAGCTGCATCTAATAAGATGCAGCAACCAAAGGTCAAAATTGGAGACAGGATAACGGCCCTTCAACAAATTGTGTCGCCATTTGGAAAGACTGATACCGCGTC
GGTTCTAACCGAAACCATAGGATACATAAAGTTCTTACAGGAGCAAGTCCAGTTACTGAGCAATCCTTACATGAAGACAAATTCATACAAGGATCCATGGCAAAGCTTGG
AGAGAAAAGAATCAAAAGGGGATGGAAAGATTGACCTAAGGAGCAGGGGGCTTTGTCTTGTTCCAATTTCATGTACGCCCCAAGTCTACAGGGAGAACACAGGATCTGAC
TATTGGACACCTTATAGAGGATGTTTCTATAGATAGATAGATCGAAAGGTATATATATAGAAAATATCAACTTTTATAGGAAGGAATATATTAAATACCAACTCTCCTTT
ACAAATGTAATTTCAAGACTTGTATTTCTCTCACTCTATCTCATGTATTTCTTAATTTCAATCAGATGAAATCAACAAACGAGCTGAGAATTAAGGTTGAAACAATATTT
TACAGCC
Protein sequenceShow/hide protein sequence
MADECIESSVATSSTPPIWWDNHNNHYNSHWLHQNPNSNNSSCDEDVSISPSSFTNASNHSGLTVQSSNHLLSDHASDNQLWTQLLLNIGNGVELQSNEQEIGANFLETI
TSKSMSTTGIFEPTACSDYLKKMDTNNWETLHTFNNNNGLITTHSHMLENERLLKLSNLVNTWSIALPSPDSHLRHLMDEEHDHLRATTVPSHGGLDPDGAVAQAGLESG
GSGVFRRSFHNLIGVKQFYDNANTRNFGDYISFNGRLGKPVVDINGSNNPSFKSLNLSADSKKQIHQVSLPTRISGRGNGVSSEGKKKRSEESSETSTKKAKQDNSTAAS
NKMQQPKVKIGDRITALQQIVSPFGKTDTASVLTETIGYIKFLQEQVQLLSNPYMKTNSYKDPWQSLERKESKGDGKIDLRSRGLCLVPISCTPQVYRENTGSDYWTPYR
GCFYR