; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000996 (gene) of Chayote v1 genome

Gene IDSed0000996
OrganismSechium edule (Chayote v1)
Descriptionheat stress transcription factor B-2b-like
Genome locationLG01:68338162..68340089
RNA-Seq ExpressionSed0000996
SyntenySed0000996
Gene Ontology termsGO:0006012 - galactose metabolic process (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003978 - UDP-glucose 4-epimerase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042789.1 bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 1 [Cucumis melo var. makuwa]5.1e-13177.08Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        ++GEK LLRDIQRRK A+SV     T TT +A  A PV +AASPA+   VISPANS EEQVTSSNSS MA  R TS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE
        LTQLKGLCNNILSLMT+YAS  HH  ES SV DG+ L+L+P  ARQ  ED+GAVSDG  EVRLK+EE + AAAA A  G+TP LFGVSIG KR+RR  EE
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE

Query:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS
        E+EEMVGQN VQSEEGETGSEIKAEPLDENSE      +PWLELGNQGS
Subjt:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS

KAG6606116.1 Heat stress transcription factor B-2b, partial [Cucurbita argyrosperma subsp. sororia]1.6e-13278.67Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDP +DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        KRGEK LLRDIQRRK AVSV    A   TP +V+ P VT+AASPA+   VISP NS EEQVTSSNSS M   RGTS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED
        LTQLKGLCNNILSLMT+YA SGH + ES SV DG+ LDL+P  ARQ  +D+GAVSDGIQEVRLKVEE   A A   AEG TP LFGVSIG KRVRREEED
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED

Query:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS
        EEMVG N VQSEE ETGSEIKAEPLDENSENP      WLELGNQGS
Subjt:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS

XP_022958418.1 heat stress transcription factor B-2b-like [Cucurbita moschata]7.9e-13278.1Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDP +DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        KRGEK LLRDIQRRK A+SV    A   TP +V+ P VT+AASPA+   VISP NS EEQVTSSNSS M   RGTS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED
        LTQLKGLCNNILSLMT+YA SGH + ES SV DG+ LDL+P  ARQ  +D+GAVSDGIQEVRLKVEE   A A   AEG TP LFGVSIG KRVRREE+D
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED

Query:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS
        EEMVG N VQSEE ETGSEIKAEPLDENSENP      WLELGNQGS
Subjt:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS

XP_023521234.1 heat stress transcription factor B-2b-like [Cucurbita pepo subsp. pepo]1.6e-13278.67Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSG G+SQRSIPTPFLTKTYQLVDDP +DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        KRGEK LLRDIQRRK AVSV    A  TTP +V+ P VT+AASPA+   VISP NS EEQVTSSNSS M   RGTS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED
        LTQLKGLCNNILSLMT+YA SGH + ES SV DG+ LDL+P  ARQ  +D+GAVSDGIQEVRLKVEE   A A   AEG TP LFGVSIG KRVRREEED
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED

Query:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS
        EEMVG N VQSEE ETGSEIKAEPLDENSENP      WLELGNQGS
Subjt:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS

XP_038875590.1 heat stress transcription factor B-2b [Benincasa hispida]5.4e-13378.51Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        +RGEK LLRDIQRRK A+S+  TTATA TP AVA  PV +AASPA+   VISPANS EEQVTSSNSS M   RGTS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED
        LTQLKGLCNNILSLMT+YAS  +H+ ES SV DG+ L+L+P  ARQ  ED+GAVSDG QEVRLK+EE +AAA      GMTP LFGVSIG KR+RREE+D
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED

Query:  --EEMVGQNRVQSEEGETGSEIKAEPLDENSEN------PWLELGNQGS
          EEMVGQN VQSEEGETGSEIKAEPLDENSEN      PWLELGNQGS
Subjt:  --EEMVGQNRVQSEEGETGSEIKAEPLDENSEN------PWLELGNQGS

TrEMBL top hitse value%identityAlignment
A0A0A0KNZ8 HSF_DOMAIN domain-containing protein9.4e-13176.79Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        ++GEK LLRDIQRRK  +SV     T TT +A  A PVT+A SPA+   VISPANS EEQVTSSNSS MA  R TS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE
        LTQLKGLCNNILSLMT+YAS  H + ES SV DG+ L+L+P  ARQ  ED+GAVSDG  EVRLK+EE + AAA  AA GMTP LFGVSIG KR+RR  EE
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE

Query:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS
        E+EEMVGQN VQSEEGETGSEIKAEPLDENSE      +PWLELGNQGS
Subjt:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS

A0A1S3AU16 heat stress transcription factor B-2b1.6e-13076.79Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        ++GEK LLRDIQRRK A+SV     T TT +A  A PV +AASPA+   VISPANS EEQVTSSNSS MA  R TS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE
        LTQLKGLCNNILSLMT+YAS  HH  ES SV DG+ L+L+P  ARQ  ED+GAVSDG  EVRLK+ E +AAAAA    G+TP LFGVSIG KR+RR  EE
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE

Query:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS
        E+EEMVGQN VQSEEGETGSEIKAEPLDENSE      +PWLELGNQGS
Subjt:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS

A0A5A7TN46 Bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 12.5e-13177.08Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        ++GEK LLRDIQRRK A+SV     T TT +A  A PV +AASPA+   VISPANS EEQVTSSNSS MA  R TS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE
        LTQLKGLCNNILSLMT+YAS  HH  ES SV DG+ L+L+P  ARQ  ED+GAVSDG  EVRLK+EE + AAAA A  G+TP LFGVSIG KR+RR  EE
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EE

Query:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS
        E+EEMVGQN VQSEEGETGSEIKAEPLDENSE      +PWLELGNQGS
Subjt:  EDEEMVGQNRVQSEEGETGSEIKAEPLDENSE------NPWLELGNQGS

A0A6J1DU11 heat stress transcription factor B-2b4.6e-13077.71Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M+P PAE IG+SGTG+SQRSIPTPFLTKT+QLVDDPA+DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAA--PPVTIAASPAM--VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSH
        +RGEK LLRDIQRRK A+SV      ATTP   AA   PVT+AA+PA+  VISPANS EEQVTSSNSS MA  RGTS  +TPEL  ENERLRKEN QLSH
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAA--PPVTIAASPAM--VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSH

Query:  ELTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--E
        ELTQLKGLCNNILSLMT+YAS   H+ ES SV DG+ L+LMPA+ +   ED+GAVSDGIQE+RLKVEE   AAAA AAEG+TP LFGVSIG KRVRR  E
Subjt:  ELTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--E

Query:  EEDEEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS
        EE+EEMVGQN VQSEEGE GSEIKAEPLDENS+NP      WLELGNQGS
Subjt:  EEDEEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS

A0A6J1H317 heat stress transcription factor B-2b-like3.8e-13278.1Show/hide
Query:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF
        M P PAE IGDSGTG+SQRSIPTPFLTKTYQLVDDP +DDLISWNEDGSTFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND F
Subjt:  MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGF

Query:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE
        KRGEK LLRDIQRRK A+SV    A   TP +V+ P VT+AASPA+   VISP NS EEQVTSSNSS M   RGTS  +TPEL  ENERLRKEN QLSHE
Subjt:  KRGEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAM---VISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHE

Query:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED
        LTQLKGLCNNILSLMT+YA SGH + ES SV DG+ LDL+P  ARQ  +D+GAVSDGIQEVRLKVEE   A A   AEG TP LFGVSIG KRVRREE+D
Subjt:  LTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEED

Query:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS
        EEMVG N VQSEE ETGSEIKAEPLDENSENP      WLELGNQGS
Subjt:  EEMVGQNRVQSEEGETGSEIKAEPLDENSENP------WLELGNQGS

SwissProt top hitse value%identityAlignment
P22335 Heat shock factor protein HSF241.2e-4540.8Show/hide
Query:  SQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKA
        SQR+ P PFL KTYQLVDD A DD+ISWNE G+TF+V + AEFA+DLLPKYFKHNNFSSFVRQLNTYGFRK+VPD+WEFAN+ FKRG+K LL  I+RRK 
Subjt:  SQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKA

Query:  AVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSSAMANARGTSSGSTP-------ELATENERLRKENKQLSHELTQLKGLCNNIL
          S PA               V   AS     SP NS ++  +SS SS   +++   S  TP       +L+ ENE+L+K+N+ LS EL Q K  CN ++
Subjt:  AVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSSAMANARGTSSGSTP-------ELATENERLRKENKQLSHELTQLKGLCNNIL

Query:  SLMTSY---ASSGHHRPESASVGDGRVLD--LMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EEEDEEMVG
        + ++ Y   A    +R  S     G  L+  +      +  E+ G+ +D   +                 +G T  LFGV +  K+ +R  +E  E   G
Subjt:  SLMTSY---ASSGHHRPESASVGDGRVLD--LMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRR--EEEDEEMVG

Query:  QNRVQSEEGETGSEIK-AEPLDENSE
        + ++       G  +K + P  E+S+
Subjt:  QNRVQSEEGETGSEIK-AEPLDENSE

Q652B0 Heat stress transcription factor B-2c2.3e-6550.3Show/hide
Query:  QRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRK--
        QRS+PTPFLTKTYQLV+DPA+DD+ISWNEDGSTF+V RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFAND F+RGEK LL DI RRK  
Subjt:  QRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRK--

Query:  ---AAVSVPATTATATTPTAVAAPPVTIAASPAMVI-------SPANSVEEQVTSSNS-SAMANARGTSSGSTP-----------ELATENERLRKENKQ
           AA   P +   AT   AVA+  VT+AA+P  +        SPA+S EEQV SSNS S   + + + SGS P           ++  ENERLR+EN +
Subjt:  ---AAVSVPATTATATTPTAVAAPPVTIAASPAMVI-------SPANSVEEQVTSSNS-SAMANARGTSSGSTP-----------ELATENERLRKENKQ

Query:  LSHELTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAAT--AAEG---MTPMLFGVSIGA
        L+ EL  +K LCNNIL LM+ YA++ H      S G   + +    S+ +A      +   I ++      +  AAAA   A +G    +  LFGVSIG 
Subjt:  LSHELTQLKGLCNNILSLMTSYASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAAT--AAEG---MTPMLFGVSIGA

Query:  KRVRRE---EEDEEMVGQNRVQSEEGETGSEIKAEPLD
        KR R +     DE+  G++  Q+E G  G+++K E  D
Subjt:  KRVRRE---EEDEEMVGQNRVQSEEGETGSEIKAEPLD

Q6Z9C8 Heat stress transcription factor B-2b1.3e-6547.9Show/hide
Query:  PLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKR
        P PA +    G G+ QR++PTPFLTKTYQLVDDPA+DD+ISWN+DGSTF+V RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFAND F+R
Subjt:  PLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKR

Query:  GEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSS--------AMANARGTSSGSTPELATENERLRKENKQL
        GE+ LL +I RRK     PA    ATT    AA P+ +  +     SP  S EEQV SS+SS           +  G+   ++ ++  ENERLR+EN QL
Subjt:  GEKVLLRDIQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSS--------AMANARGTSSGSTPELATENERLRKENKQL

Query:  SHELTQLKGLCNNILSLMTSYASSGHHRPESAS--VGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATA--AEG-MTPMLFGVSIGAK
        + EL+Q++ LCNNIL LM+ YAS+      +AS   G+    +    SA  AT         + ++        +AAA  +   EG M+  LFGVSIG K
Subjt:  SHELTQLKGLCNNILSLMTSYASSGHHRPESAS--VGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATA--AEG-MTPMLFGVSIGAK

Query:  RVRREEEDEEMVGQNRVQSEEGETGSEIKAEPLD
        R+R +   ++            +  + +KAEP+D
Subjt:  RVRREEEDEEMVGQNRVQSEEGETGSEIKAEPLD

Q9SCW4 Heat stress transcription factor B-2a1.5e-5645.35Show/hide
Query:  SGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDI
        +G   SQRSIPTPFLTKT+ LV+D +IDD+ISWNEDGS+FIV  P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND FKRGEK LLR+I
Subjt:  SGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDI

Query:  QRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEE----QVTSSNSSAMANARGTSSGS---TPELATENERLRKENKQLSHELTQLKGL
        QRRK          T T  T VA  P +   +  MV+SP+NS E+    QV SS+ S+    +  ++G+   + EL  ENE+LR +N QL+ ELTQ+K +
Subjt:  QRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEE----QVTSSNSSAMANARGTSSGS---TPELATENERLRKENKQLSHELTQLKGL

Query:  CNNILSLMTSY-ASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMVGQ
        C+NI SLM++Y  S    R  S      + ++ +PA                +   +++EE          E  +P LFGV IG KR R E        Q
Subjt:  CNNILSLMTSY-ASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMVGQ

Query:  NRVQSEEGETGSEIKAEPLDENSENPWLELGNQ
         +  +  GE   E          E PWL   N+
Subjt:  NRVQSEEGETGSEIKAEPLDENSENPWLELGNQ

Q9T0D3 Heat stress transcription factor B-2b5.9e-8255.24Show/hide
Query:  GDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLR
        G  G G+SQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+ND FKRGEK+LLR
Subjt:  GDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLR

Query:  DIQRRKAAVSVPA-TTATATTPTAVAAPPVTIAASP--AMVISPANSVEEQVTSSNSSAMANA-------------RGTSSGSTPELATENERLRKENKQ
        DIQRRK  +S PA   A A    AVAA  VT+AA P  A ++SP+NS EEQV SSNSS  A A             R TS  + PEL  ENERLRK+N++
Subjt:  DIQRRKAAVSVPA-TTATATTPTAVAAPPVTIAASP--AMVISPANSVEEQVTSSNSSAMANA-------------RGTSSGSTPELATENERLRKENKQ

Query:  LSHELTQLKGLCNNILSLMTSYA----SSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAK
        L  E+T+LKGL  NI +LM ++        H  PE      G+ LDL+P   RQ   +    S+    + LK+            E +TP LFGVSIG K
Subjt:  LSHELTQLKGLCNNILSLMTSYA----SSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAK

Query:  RVRREEE----DEEMVGQNRVQSEEGETGSEIKAEPLDENS----ENPWLELG
        R RREEE    +EE   +    ++EGE  S++KAEP++EN+       WLELG
Subjt:  RVRREEE----DEEMVGQNRVQSEEGETGSEIKAEPLDENS----ENPWLELG

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.9e-3569.79Show/hide
Query:  PTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKAA
        P PFL+KTY +VDD   D ++SW+ + ++FIV +P EFARDLLPK FKHNNFSSFVRQLNTYGFRKV PDRWEFAN+GF RG+K LL+ I RRK A
Subjt:  PTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKAA

AT1G46264.1 heat shock transcription factor B45.2e-4146.3Show/hide
Query:  RSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKAAV
        +++P PFLTKTYQLVDDPA D ++SW +D +TF+V RP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEFAN+ FKRGEK LL +I RRK + 
Subjt:  RSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKAAV

Query:  SVPATTATATTPTAVAAPPVTIAASPAMVISP--ANSVEEQVTSSNSSAMANARG-----TSSGSTPELATENERLRKENKQLSHELTQLKGLCNNILSL
         +P   +   +    A P +  +      + P    + EE     + S  +  R       ++     L+ +NERLR+ N  L  EL  +K L N+I+  
Subjt:  SVPATTATATTPTAVAAPPVTIAASPAMVISP--ANSVEEQVTSSNSSAMANARG-----TSSGSTPELATENERLRKENKQLSHELTQLKGLCNNILSL

Query:  MTSYASSGHHRPESAS
           Y    H +P + S
Subjt:  MTSYASSGHHRPESAS

AT4G11660.1 winged-helix DNA-binding transcription factor family protein4.2e-8355.24Show/hide
Query:  GDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLR
        G  G G+SQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIV RPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+ND FKRGEK+LLR
Subjt:  GDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLR

Query:  DIQRRKAAVSVPA-TTATATTPTAVAAPPVTIAASP--AMVISPANSVEEQVTSSNSSAMANA-------------RGTSSGSTPELATENERLRKENKQ
        DIQRRK  +S PA   A A    AVAA  VT+AA P  A ++SP+NS EEQV SSNSS  A A             R TS  + PEL  ENERLRK+N++
Subjt:  DIQRRKAAVSVPA-TTATATTPTAVAAPPVTIAASP--AMVISPANSVEEQVTSSNSSAMANA-------------RGTSSGSTPELATENERLRKENKQ

Query:  LSHELTQLKGLCNNILSLMTSYA----SSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAK
        L  E+T+LKGL  NI +LM ++        H  PE      G+ LDL+P   RQ   +    S+    + LK+            E +TP LFGVSIG K
Subjt:  LSHELTQLKGLCNNILSLMTSYA----SSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAK

Query:  RVRREEE----DEEMVGQNRVQSEEGETGSEIKAEPLDENS----ENPWLELG
        R RREEE    +EE   +    ++EGE  S++KAEP++EN+       WLELG
Subjt:  RVRREEE----DEEMVGQNRVQSEEGETGSEIKAEPLDENS----ENPWLELG

AT4G36990.1 heat shock factor 49.5e-4338.89Show/hide
Query:  SQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKA
        +QRS+P PFL+KTYQLVDD + DD++SWNE+G+ F+V + AEFA+DLLP+YFKHNNFSSF+RQLNTYGFRK VPD+WEFAND F+RG + LL DI+RRK+
Subjt:  SQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDIQRRKA

Query:  AVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSSAMANAR-GTSSGSTPELATENERLRKENKQLSHELTQLKGLCNNILSLMTSY
         +             A  A    +  SP+   S +   ++  +SS SS  ++   G+      +L+ ENE+L++EN  LS EL   K   + +++ +T +
Subjt:  AVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSSAMANAR-GTSSGSTPELATENERLRKENKQLSHELTQLKGLCNNILSLMTSY

Query:  ASSGHHRPESAS--VGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMV
              RPE     +  G+     P  + + +E +G    G              A     EG+   LFGV +  +R +R+ +++  V
Subjt:  ASSGHHRPESAS--VGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMV

AT5G62020.1 heat shock transcription factor B2A1.0e-5745.35Show/hide
Query:  SGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDI
        +G   SQRSIPTPFLTKT+ LV+D +IDD+ISWNEDGS+FIV  P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND FKRGEK LLR+I
Subjt:  SGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRDI

Query:  QRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEE----QVTSSNSSAMANARGTSSGS---TPELATENERLRKENKQLSHELTQLKGL
        QRRK          T T  T VA  P +   +  MV+SP+NS E+    QV SS+ S+    +  ++G+   + EL  ENE+LR +N QL+ ELTQ+K +
Subjt:  QRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEE----QVTSSNSSAMANARGTSSGS---TPELATENERLRKENKQLSHELTQLKGL

Query:  CNNILSLMTSY-ASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMVGQ
        C+NI SLM++Y  S    R  S      + ++ +PA                +   +++EE          E  +P LFGV IG KR R E        Q
Subjt:  CNNILSLMTSY-ASSGHHRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMVGQ

Query:  NRVQSEEGETGSEIKAEPLDENSENPWLELGNQ
         +  +  GE   E          E PWL   N+
Subjt:  NRVQSEEGETGSEIKAEPLDENSENPWLELGNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCCGTTGCCGGCGGAACAGATCGGCGATTCCGGAACGGGAGAATCTCAGAGATCGATTCCGACGCCGTTTCTGACGAAAACGTATCAACTTGTGGATGATCCGGC
GATCGACGATCTCATCTCGTGGAACGAAGATGGATCTACGTTCATAGTTTTGCGACCTGCAGAATTTGCTCGAGATTTGCTTCCCAAATACTTTAAACACAATAATTTCT
CTAGTTTTGTCCGTCAACTTAATACTTACGGATTTCGAAAAGTTGTGCCGGATCGATGGGAATTTGCGAACGATGGATTCAAGAGAGGTGAGAAAGTTCTTCTCCGAGAT
ATCCAGCGACGGAAGGCGGCGGTGTCGGTACCGGCCACGACGGCGACGGCGACGACTCCGACCGCCGTGGCTGCACCACCTGTGACGATTGCAGCGTCACCGGCTATGGT
GATATCGCCGGCGAACTCAGTGGAAGAGCAGGTGACATCCTCGAACTCGTCGGCGATGGCTAACGCGCGAGGGACCAGCAGCGGCTCCACGCCGGAACTCGCGACGGAGA
ACGAGCGGCTGAGGAAGGAGAACAAGCAGCTGAGTCACGAGTTGACTCAATTGAAAGGACTCTGCAACAACATACTGTCGTTGATGACGAGCTACGCTTCTTCAGGTCAC
CACCGGCCGGAGTCGGCGAGCGTCGGCGACGGGAGGGTGCTGGATCTCATGCCGGCGTCGGCTAGGCAGGCGACGGAAGACGACGGCGCCGTCAGCGACGGGATTCAGGA
GGTGAGGTTGAAGGTGGAGGAGATGTTGGCGGCGGCGGCGGCGACGGCGGCGGAGGGAATGACGCCGATGCTGTTCGGAGTTTCGATCGGGGCAAAGCGCGTGAGGAGAG
AGGAAGAAGACGAAGAAATGGTGGGGCAGAATCGCGTACAGTCGGAAGAAGGTGAGACCGGGTCGGAGATCAAAGCTGAGCCGTTGGATGAGAACTCTGAAAATCCATGG
CTGGAACTCGGAAATCAAGGCTCCTGA
mRNA sequenceShow/hide mRNA sequence
AAATAATTTCCTTATCATTTACTCTCCTTACCCAGAAACTTCTAGAAACAACAAAAACTCCTCTCTACGCGGCGGCGCTGCCAGACGTTGCAGTGGCGGAAACCTTTCTC
AGGCACGGCGGCCTTCTTCTTCACCCCTCCGGCTCCGAGGAGAATCTCGTCCGTTGATTAGGTTTTAGACGAAACTTCTGGTGGTTTAGATCTGGGAGGCGAGGCAATGA
CTCCGTTGCCGGCGGAACAGATCGGCGATTCCGGAACGGGAGAATCTCAGAGATCGATTCCGACGCCGTTTCTGACGAAAACGTATCAACTTGTGGATGATCCGGCGATC
GACGATCTCATCTCGTGGAACGAAGATGGATCTACGTTCATAGTTTTGCGACCTGCAGAATTTGCTCGAGATTTGCTTCCCAAATACTTTAAACACAATAATTTCTCTAG
TTTTGTCCGTCAACTTAATACTTACGGATTTCGAAAAGTTGTGCCGGATCGATGGGAATTTGCGAACGATGGATTCAAGAGAGGTGAGAAAGTTCTTCTCCGAGATATCC
AGCGACGGAAGGCGGCGGTGTCGGTACCGGCCACGACGGCGACGGCGACGACTCCGACCGCCGTGGCTGCACCACCTGTGACGATTGCAGCGTCACCGGCTATGGTGATA
TCGCCGGCGAACTCAGTGGAAGAGCAGGTGACATCCTCGAACTCGTCGGCGATGGCTAACGCGCGAGGGACCAGCAGCGGCTCCACGCCGGAACTCGCGACGGAGAACGA
GCGGCTGAGGAAGGAGAACAAGCAGCTGAGTCACGAGTTGACTCAATTGAAAGGACTCTGCAACAACATACTGTCGTTGATGACGAGCTACGCTTCTTCAGGTCACCACC
GGCCGGAGTCGGCGAGCGTCGGCGACGGGAGGGTGCTGGATCTCATGCCGGCGTCGGCTAGGCAGGCGACGGAAGACGACGGCGCCGTCAGCGACGGGATTCAGGAGGTG
AGGTTGAAGGTGGAGGAGATGTTGGCGGCGGCGGCGGCGACGGCGGCGGAGGGAATGACGCCGATGCTGTTCGGAGTTTCGATCGGGGCAAAGCGCGTGAGGAGAGAGGA
AGAAGACGAAGAAATGGTGGGGCAGAATCGCGTACAGTCGGAAGAAGGTGAGACCGGGTCGGAGATCAAAGCTGAGCCGTTGGATGAGAACTCTGAAAATCCATGGCTGG
AACTCGGAAATCAAGGCTCCTGATATGAAACGGCGTCATATTGAGATGACGAAGAAGATAAGGGTTTTTTTTTTTTGAGAATAGAGAATGCAGAGACCAGACCGGCCCTG
GACAGCACGAAGAATCTCACGTGCCGAACACGTGAGCTGGAAAAATAAACCGAATTTAAAATGTTATTTTGGTTGGGCCTTGTAAAAGGCGAAGCCGTGAAAAAAACCGA
ACGGCTGGGCCGGTTTGTGAAGGGGACAAAATGACGAAAAGCCCTAGGGGGGAGAATGGGTATTAATTTCTGTTCGAAATTTTAATTTATGAAAAAGAAAATAAAAAAAT
TGTCGGTTCTGTGGTTCGGTGACTAGTGGCCTGATTTAAATGTCACGTGTAATTGCATGAGGTTTTTGTAACTTTAATCTCGTAAATACTATTTTTAGGTTTTTTTTTAT
TGAATACCGGAGGCTGAATTTGCA
Protein sequenceShow/hide protein sequence
MTPLPAEQIGDSGTGESQRSIPTPFLTKTYQLVDDPAIDDLISWNEDGSTFIVLRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDGFKRGEKVLLRD
IQRRKAAVSVPATTATATTPTAVAAPPVTIAASPAMVISPANSVEEQVTSSNSSAMANARGTSSGSTPELATENERLRKENKQLSHELTQLKGLCNNILSLMTSYASSGH
HRPESASVGDGRVLDLMPASARQATEDDGAVSDGIQEVRLKVEEMLAAAAATAAEGMTPMLFGVSIGAKRVRREEEDEEMVGQNRVQSEEGETGSEIKAEPLDENSENPW
LELGNQGS