CuGenDBv2

PI0023636 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0023636
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: heat stress transcription factor B-2b
Genome location: chr09:2269320..2272084
RNA-Seq Expression: PI0023636
Synteny: PI0023636
Gene Ontology terms:
GO:0006012 - galactose metabolic process (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003978 - UDP-glucose 4-epimerase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
IPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
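
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split and the genomic span computed, the Python below assumes the coordinates are 1-based and inclusive (the record itself does not state this); `parse_location` is a hypothetical helper, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr09:2269320..2272084")
# Assumption: both endpoints are part of the gene, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on the chromosome, so it is longer than the CDS listed in the Sequences section.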


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0042789.1 bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 1 [Cucumis melo var. makuwa]; e value: 1.6e-177; % identity: 96.47
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
        RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPV VAASP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
Subjt:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN
        GLCNNILSLMTNYASGQ H  ESGSVRDGKALELLPARQVMEDEGAVSDGA EVRLKMEE MTAA A  G+TPKLFGVSIGVKRMRREV+EEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN

Query:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

KAE8647868.1 hypothetical protein Csa_000645 [Cucumis sativus]; e value: 9.5e-178; % identity: 96.48
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
        RKGEKGLLRDIQRRKV LSV TTTTTSAAVAVPVTVA SP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
Subjt:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ
        KGLCNNILSLMTNYASGQ  QLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEE MTAA A VGMTPKLFGVSIG+KRMRRE++EEEEEMVGQ
Subjt:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

XP_004143930.1 heat stress transcription factor B-2b [Cucumis sativus]; e value: 9.5e-178; % identity: 96.48
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
        RKGEKGLLRDIQRRKV LSV TTTTTSAAVAVPVTVA SP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
Subjt:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ
        KGLCNNILSLMTNYASGQ  QLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEE MTAA A VGMTPKLFGVSIG+KRMRRE++EEEEEMVGQ
Subjt:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

XP_008437221.1 PREDICTED: heat stress transcription factor B-2b [Cucumis melo]; e value: 9.9e-175; % identity: 95.88
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
        RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPV VAASP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
Subjt:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN
        GLCNNILSLMTNYASGQ H  ESGSVRDGKALELLPARQVMEDEGAVSDGA EVRLKM E M AA A  G+TPKLFGVSIGVKRMRREV+EEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN

Query:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

XP_038875590.1 heat stress transcription factor B-2b [Benincasa hispida]; e value: 8.1e-169; % identity: 92.71
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTT---TSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELT
        R+GEKGLLRDIQRRKVALS+TTTT   T AAVAVPV VAASP VLAHVISPANS EEQVTSSNSSPM FQR TSCTTTPELVRENERLRKENMQLSHELT
Subjt:  RKGEKGLLRDIQRRKVALSVTTTT---TSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELT

Query:  QLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMV
        QLKGLCNNILSLMTNYAS   HQLES SVRDGKALELLPARQVMEDEGAVSDGA EVRLKMEETM AA   VGMTPKLFGVSIGVKRMRRE  +EEEEMV
Subjt:  QLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMV

Query:  GQNHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        GQNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  GQNHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KNZ8 HSF_DOMAIN domain-containing protein; e value: 4.6e-178; % identity: 96.48
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
        RKGEKGLLRDIQRRKV LSV TTTTTSAAVAVPVTVA SP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
Subjt:  RKGEKGLLRDIQRRKVALSV-TTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ
        KGLCNNILSLMTNYASGQ  QLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEE MTAA A VGMTPKLFGVSIG+KRMRRE++EEEEEMVGQ
Subjt:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

A0A1S3AU16 heat stress transcription factor B-2b; e value: 4.8e-175; % identity: 95.88
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
        RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPV VAASP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
Subjt:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN
        GLCNNILSLMTNYASGQ H  ESGSVRDGKALELLPARQVMEDEGAVSDGA EVRLKM E M AA A  G+TPKLFGVSIGVKRMRREV+EEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN

Query:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

A0A5A7TN46 Bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 1; e value: 7.9e-178; % identity: 96.47
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
        RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPV VAASP VLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK
Subjt:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN
        GLCNNILSLMTNYASGQ H  ESGSVRDGKALELLPARQVMEDEGAVSDGA EVRLKMEE MTAA A  G+TPKLFGVSIGVKRMRREV+EEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQN

Query:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
Subjt:  HVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

A0A6J1DU11 heat stress transcription factor B-2b; e value: 2.9e-156; % identity: 87.46
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTT-TTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKVALSV TT  T AA+  PVTVAA+P V AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RKGEKGLLRDIQRRKVALSVTTT-TTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQV-MEDEGAVSDGAHEVRLKMEETMTAAVAGV-GMTPKLFGVSIGVKRMRREVDEEEEEMV
        KGLCNNILSLMTNYASG  HQ ES SVRDGKALEL+PA QV MEDEGAVSDG  E+RLK+EE  TAA A   G+TPKLFGVSIGVKR+RRE +EEEEEMV
Subjt:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQV-MEDEGAVSDGAHEVRLKMEETMTAAVAGV-GMTPKLFGVSIGVKRMRREVDEEEEEMV

Query:  GQNHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

A0A6J1H317 heat stress transcription factor B-2b-like; e value: 1.1e-152; % identity: 86.51
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDP VDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVP-VTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL
        ++GEK LLRDIQRRKVALSV    T A+V+VP VTVAASP V A VISP NSAEEQVTSSNSSPM FQR TSC TTPELVRENERLRKENMQLSHELTQL
Subjt:  RKGEKGLLRDIQRRKVALSVTTTTTSAAVAVP-VTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ
        KGLCNNILSLMTNYASG Q QLES SVRDGKAL+LLPARQ+M+DEGAVSDG  EVRLK+EE +  A A  G TPKLFGVSIGVKR+RRE  E++EEMVG 
Subjt:  KGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS
        NHVQSEE ETGSEIKAEPLDENSE+P+GSAS WLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS

SwissProt top hits (columns: e value, % identity, alignment)
P22335 Heat shock factor protein HSF24; e value: 2.1e-50; % identity: 42.24
Query:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKV
        SQR+ P PFL KTYQLVDD A DD+ISWNE G+TF+VW+ AEFA+DLLPKYFKHNNFSSFVRQLNTYGFRK+VPD+WEFAN+ F++G+K LL  I+RRK 
Subjt:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKV

Query:  ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTP-------ELVRENERLRKENMQLSHELTQLKGLCNNILSL
             T T++ A    V   AS        SP NS ++  +SS SSP +  ++     TP       +L  ENE+L+K+N  LS EL Q K  CN +++ 
Subjt:  ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTP-------ELVRENERLRKENMQLSHELTQLKGLCNNILSL

Query:  MTNY---ASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSD----GAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQNHV
        ++ Y   A    +++ S     G +LE     +++++ G V D    G++      E+         G T KLFGV +  K+ +R  DE  E   G+  +
Subjt:  MTNY---ASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSD----GAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQNHV

Query:  QSEEGETGSEIK-AEPLDENSE
               G  +K + P  E+S+
Subjt:  QSEEGETGSEIK-AEPLDENSE

Q652B0 Heat stress transcription factor B-2c; e value: 1.2e-66; % identity: 49.14
Query:  QRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKVA
        QRS+PTPFLTKTYQLV+DPAVDD+ISWNEDGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFANDCFR+GEK LL DI RRKV 
Subjt:  QRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKVA

Query:  LSVTT---------TTTSAAVAV-PVTVAASPTVLAHVI----SPANSAEEQVTSSNSSPMAFQRSTS------------CTTTPELVRENERLRKENMQ
         +             T +AAVA   VTVAA+P  +A  +    SPA+S+EEQV SSNS      R  S              +  ++  ENERLR+EN +
Subjt:  LSVTT---------TTTSAAVAV-PVTVAASPTVLAHVI----SPANSAEEQVTSSNSSPMAFQRSTS------------CTTTPELVRENERLRKENMQ

Query:  LSHELTQLKGLCNNILSLMTNYASGQQHQLESG--SVRD--GKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGM--------TPKLFGVS
        L+ EL  +K LCNNIL LM+ YA+ Q  +  +G  S+ +  G++ E +P    +    A+ D      +     +  A A  G+        + +LFGVS
Subjt:  LSHELTQLKGLCNNILSLMTNYASGQQHQLESG--SVRD--GKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGM--------TPKLFGVS

Query:  IGVKRMRREVDEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSEHPDG
        IG+KR R +     +E  G    Q+E G  G+++K E  D    HP G
Subjt:  IGVKRMRREVDEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSEHPDG

Q6Z9C8 Heat stress transcription factor B-2b; e value: 6.8e-70; % identity: 49.41
Query:  PSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRK
        P+PA      G G  QR++PTPFLTKTYQLVDDPAVDD+ISWN+DGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFANDCFR+
Subjt:  PSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRK

Query:  GEKGLLRDIQRRKVALSVTTTTTSA-AVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSS---PMAFQRSTSCT-----TTPELVRENERLRKENMQLS
        GE+ LL +I RRKV       TT+A A A+P+   A P       SP  S EEQV SS+SS   P+   ++ S +      + ++  ENERLR+EN QL+
Subjt:  GEKGLLRDIQRRKVALSVTTTTTSA-AVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSS---PMAFQRSTSCT-----TTPELVRENERLRKENMQLS

Query:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDG------------KALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIG
         EL+Q++ LCNNIL LM+ YAS QQ    + S   G            +A   LP   V+ D      GA      + +          M+ KLFGVSIG
Subjt:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDG------------KALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIG

Query:  VKRMRREVDEEEEEMVGQNHVQSEEGETGSEIKAEPLD
         KRMR              H    + +  + +KAEP+D
Subjt:  VKRMRREVDEEEEEMVGQNHVQSEEGETGSEIKAEPLD

Q9SCW4 Heat stress transcription factor B-2a; e value: 4.3e-64; % identity: 48.66
Query:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDI
        +G   SQRSIPTPFLTKT+ LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND F++GEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDI

Query:  QRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSS--NSSPMAFQRSTSCTT-----TPELVRENERLRKENMQLSHELTQLKGLCN
        QRRK+      TTT   V  P +   + T+   V+SP+NS E+   +   +SSP ++    + TT     + EL+ ENE+LR +N+QL+ ELTQ+K +C+
Subjt:  QRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSS--NSSPMAFQRSTSCTT-----TPELVRENERLRKENMQLSHELTQLKGLCN

Query:  NILSLMTNYASGQ-QHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRRE-VDEEEEEMVGQN
        NI SLM+NY   Q   +  S      + +E LPA++  E             +++EE   A       +P+LFGV IG+KR R E V  +   +VG+N
Subjt:  NILSLMTNYASGQ-QHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRRE-VDEEEEEMVGQN

Q9T0D3 Heat stress transcription factor B-2b; e value: 1.4e-94; % identity: 57.8
Query:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLR
        G  G GDSQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCF++GEK LLR
Subjt:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLR

Query:  DIQRRKV---ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMA-------------FQRSTSCTTTPELVRENERLRKENMQLS
        DIQRRK+   A++      +AAVA      A+  V+AH++SP+NS EEQV SSNSSP A              QR+TSCTT PELV ENERLRK+N +L 
Subjt:  DIQRRKV---ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMA-------------FQRSTSCTTTPELVRENERLRKENMQLS

Query:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRR--EVDE
         E+T+LKGL  NI +LM N+  GQ+    +  + +GK L+LLP RQ M +    S+    + LK+         G  +TP+LFGVSIGVKR RR  E+  
Subjt:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRR--EVDE

Query:  EEEEMVGQNHVQSEEGETGSEIKAEPLDE-NSEHPDGSASPWLELG
         EEE   +    ++EGE  S++KAEP++E NS + +GS   WLELG
Subjt:  EEEEMVGQNHVQSEEGETGSEIKAEPLDE-NSEHPDGSASPWLELG

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G46264.1 heat shock transcription factor B4; e value: 2.5e-43; % identity: 47.78
Query:  RSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKVAL
        +++P PFLTKTYQLVDDPA D ++SW +D +TF+VWRP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEFAN+ F++GEK LL +I RRK + 
Subjt:  RSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKVAL

Query:  SVTTTTT------SAAVAVPVTVAA-SPTVLAHVISPANSAEEQVTSSNSSPMAF-QRSTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLM
         +    +       A   +P +  +  P     V +P         S  S P    Q+  +      L  +NERLR+ N  L  EL  +K L N+I+  +
Subjt:  SVTTTTT------SAAVAVPVTVAA-SPTVLAHVISPANSAEEQVTSSNSSPMAF-QRSTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLM

Query:  TNY
         N+
Subjt:  TNY

AT4G11660.1 winged-helix DNA-binding transcription factor family protein; e value: 9.7e-96; % identity: 57.8
Query:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLR
        G  G GDSQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCF++GEK LLR
Subjt:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLR

Query:  DIQRRKV---ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMA-------------FQRSTSCTTTPELVRENERLRKENMQLS
        DIQRRK+   A++      +AAVA      A+  V+AH++SP+NS EEQV SSNSSP A              QR+TSCTT PELV ENERLRK+N +L 
Subjt:  DIQRRKV---ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMA-------------FQRSTSCTTTPELVRENERLRKENMQLS

Query:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRR--EVDE
         E+T+LKGL  NI +LM N+  GQ+    +  + +GK L+LLP RQ M +    S+    + LK+         G  +TP+LFGVSIGVKR RR  E+  
Subjt:  HELTQLKGLCNNILSLMTNYASGQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRR--EVDE

Query:  EEEEMVGQNHVQSEEGETGSEIKAEPLDE-NSEHPDGSASPWLELG
         EEE   +    ++EGE  S++KAEP++E NS + +GS   WLELG
Subjt:  EEEEMVGQNHVQSEEGETGSEIKAEPLDE-NSEHPDGSASPWLELG

AT4G17750.1 heat shock factor 1; e value: 2.7e-37; % identity: 44.28
Query:  APSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFR
        AP P  P     T  +  S+P PFL+KTY +V+DPA D ++SW+   ++FIVW P EF+RDLLPKYFKHNNFSSFVRQLNTYGFRKV PDRWEFAN+ F 
Subjt:  APSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFR

Query:  KGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPE--LVRENERLRKENMQLSHELTQL
        +G+K LL+ I RRK                        +V  H  S +N   +Q++    S  A    +SC    +  L  E E+L+++   L  EL +L
Subjt:  KGEKGLLRDIQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPE--LVRENERLRKENMQLSHELTQL

Query:  K
        +
Subjt:  K

AT4G36990.1 heat shock factor 4; e value: 3.1e-49; % identity: 42.76
Query:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKV
        +QRS+P PFL+KTYQLVDD + DD++SWNE+G+ F+VW+ AEFA+DLLP+YFKHNNFSSF+RQLNTYGFRK VPD+WEFAND FR+G + LL DI+RRK 
Subjt:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDIQRRKV

Query:  ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSC-TTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNYAS
          SV  +T    V     V  SP+      S +   ++  +SS SSP + +   S      +L  ENE+L++EN  LS EL   K   + +++ +T +  
Subjt:  ALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSC-TTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNYAS

Query:  GQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVG
         +  Q++   ++ GK     P     E E    DG              A  GVG   KLFGV +  +R +R+ DE+   + G
Subjt:  GQQHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVG

AT5G62020.1 heat shock transcription factor B2A; e value: 3.1e-65; % identity: 48.66
Query:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDI
        +G   SQRSIPTPFLTKT+ LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND F++GEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRDI

Query:  QRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSS--NSSPMAFQRSTSCTT-----TPELVRENERLRKENMQLSHELTQLKGLCN
        QRRK+      TTT   V  P +   + T+   V+SP+NS E+   +   +SSP ++    + TT     + EL+ ENE+LR +N+QL+ ELTQ+K +C+
Subjt:  QRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSS--NSSPMAFQRSTSCTT-----TPELVRENERLRKENMQLSHELTQLKGLCN

Query:  NILSLMTNYASGQ-QHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRRE-VDEEEEEMVGQN
        NI SLM+NY   Q   +  S      + +E LPA++  E             +++EE   A       +P+LFGV IG+KR R E V  +   +VG+N
Subjt:  NILSLMTNYASGQ-QHQLESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRRE-VDEEEEEMVGQN
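
In the alignment blocks above, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap (the usual BLAST convention, assumed here to apply to this page). As a rough sketch, the Python below counts identities for a single block, using the last block of the Benincasa hispida hit (XP_038875590.1). `midline_stats` is a hypothetical helper, and the result describes that one block only, not the whole-alignment % identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    # Trailing blanks of the midline may have been stripped; pad to query length.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


query = "GQNHVQSEEGETGSEIKAEPLDENSEHPDGSASPWLELGNQGS"
midline = "GQNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS"

identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```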


Sequences
CDS sequence
ATGGCTCCGTCGCCGGCCGAACCGATCGGCGATTCTGGAACGGGAGATTCTCAGAGATCTATACCCACGCCGTTTCTAACGAAAACTTATCAACTGGTTGATGATCCGGC
TGTAGATGATCTTATCTCCTGGAATGAAGATGGATCTACCTTCATAGTTTGGCGACCGGCTGAATTTGCTCGAGATTTACTTCCTAAATACTTTAAACACAATAATTTCT
CTAGTTTCGTCCGTCAACTCAACACTTACGGATTCCGAAAGGTTGTGCCGGACCGATGGGAATTTGCGAATGATTGTTTCCGGAAAGGTGAGAAAGGACTTCTCCGAGAC
ATTCAGCGGCGGAAAGTAGCGCTGTCGGTAACGACTACAACTACGTCAGCTGCGGTAGCTGTGCCGGTGACGGTAGCAGCGTCTCCAACTGTGCTGGCTCACGTGATATC
GCCGGCGAACTCTGCGGAAGAGCAGGTTACGTCCTCGAACTCATCGCCGATGGCATTCCAACGAAGTACGAGCTGCACCACGACGCCGGAACTTGTAAGAGAGAACGAAC
GATTAAGGAAGGAAAATATGCAACTGAGTCACGAGTTGACTCAGTTGAAAGGACTCTGTAACAACATACTATCGTTAATGACGAATTACGCTTCAGGTCAGCAGCACCAG
TTGGAGTCAGGGAGCGTCCGGGACGGAAAGGCTTTGGAGCTGTTACCGGCGAGACAGGTAATGGAAGACGAAGGAGCCGTCAGCGACGGGGCTCATGAGGTGAGACTAAA
GATGGAGGAGACGATGACAGCGGCGGTGGCAGGGGTAGGAATGACGCCAAAATTGTTCGGAGTGTCGATCGGAGTGAAGCGGATGAGGAGAGAGGTAGATGAGGAAGAAG
AAGAGATGGTGGGGCAAAATCATGTACAGTCGGAAGAAGGTGAGACCGGGTCAGAGATCAAAGCAGAACCGTTGGATGAAAACTCTGAACATCCAGATGGATCAGCGTCG
CCGTGGCTTGAACTCGGTAATCAAGGCTCCTGA
mRNA sequence
CAACTAAAAAACAACGATGAAGATGGTGTTTGATGAAGTAATTGATTTTTGAGTTATTGAAAAAAGAGTTAAGAAGTAGTAGTGAAGTGAAAGTTGTAAAAAGTGAAAGG
TTGAGATGTAGGAAAGGTGTTTTAGAATAGTATTGGGCTAATAATTGGAGTAGTGGCCCATAAAAAGGCCCAAAGAAAAAGGGTAAAAGAGGAAAGAAGGTAAGGGAAGC
TTCTGGAGAAGAAGAAAAGAAATAAAAAAGGGAGAAAATCCAGAAGGGATAAAGAGACACGCACGCTTCTTTACCTCAATCCTTCTGGTGCCTTCCTATCCTTTCTCTCT
TTCTCTATAATGTCTCTTCTATTGTGGACCCCATTTCTTTCTCTCTTTTAATTTCCTTTTTTCTTTTATTTTTCGAAAAATAAAATTAAAAATATTTCCCTTTCCTTCTC
GAACCCACGTTTTTCTTTTTTTCCTTTTTCCCCCCTTCTCTTCCCTATCATTTACTCTCCTTACCCAGAAACTTCTAGAAACAACAAAACCTCTCTCTCTCTCCAACCAT
TGCAGTGGCGGAGCTCATTCTCAGGCACGGCGGTCTTCTTTACTCCTCCGATTCCGATTCCGGATTCTCCTTTCGATTGAAACGAGAGCTGTTGAAAATCGGAGATCCTT
GTGCTTCAGATCTGGGAGAAGAGGAGGCGATGGCTCCGTCGCCGGCCGAACCGATCGGCGATTCTGGAACGGGAGATTCTCAGAGATCTATACCCACGCCGTTTCTAACG
AAAACTTATCAACTGGTTGATGATCCGGCTGTAGATGATCTTATCTCCTGGAATGAAGATGGATCTACCTTCATAGTTTGGCGACCGGCTGAATTTGCTCGAGATTTACT
TCCTAAATACTTTAAACACAATAATTTCTCTAGTTTCGTCCGTCAACTCAACACTTACGGATTCCGAAAGGTTGTGCCGGACCGATGGGAATTTGCGAATGATTGTTTCC
GGAAAGGTGAGAAAGGACTTCTCCGAGACATTCAGCGGCGGAAAGTAGCGCTGTCGGTAACGACTACAACTACGTCAGCTGCGGTAGCTGTGCCGGTGACGGTAGCAGCG
TCTCCAACTGTGCTGGCTCACGTGATATCGCCGGCGAACTCTGCGGAAGAGCAGGTTACGTCCTCGAACTCATCGCCGATGGCATTCCAACGAAGTACGAGCTGCACCAC
GACGCCGGAACTTGTAAGAGAGAACGAACGATTAAGGAAGGAAAATATGCAACTGAGTCACGAGTTGACTCAGTTGAAAGGACTCTGTAACAACATACTATCGTTAATGA
CGAATTACGCTTCAGGTCAGCAGCACCAGTTGGAGTCAGGGAGCGTCCGGGACGGAAAGGCTTTGGAGCTGTTACCGGCGAGACAGGTAATGGAAGACGAAGGAGCCGTC
AGCGACGGGGCTCATGAGGTGAGACTAAAGATGGAGGAGACGATGACAGCGGCGGTGGCAGGGGTAGGAATGACGCCAAAATTGTTCGGAGTGTCGATCGGAGTGAAGCG
GATGAGGAGAGAGGTAGATGAGGAAGAAGAAGAGATGGTGGGGCAAAATCATGTACAGTCGGAAGAAGGTGAGACCGGGTCAGAGATCAAAGCAGAACCGTTGGATGAAA
ACTCTGAACATCCAGATGGATCAGCGTCGCCGTGGCTTGAACTCGGTAATCAAGGCTCCTGATATAAAACGACGTCGTTGTAATGGCCGTGTAGTCATAAATGACGGAGA
AAAAGAAGATAAGGTTTTTTTTTTTCCTTTTTTTTTTTGGGGGAAATTAGAGAGAAGAGACCAAACCGGGCCTAGACAGAAGGAAGAATCCCACGTGCAAAACACGTGAG
GTAGAAAATATACAAAATTTTAAAATAATTTAAATTTCCTTTCCCTTGATCTTTTTAAAGACGGGCTTAGGGTTGATTGGTCCTTGTAAAAGAGGAAAGCACGGAAAAAT
CAAGCGGGTGGGTCGGTTGGTGAAACGGGTAAAGTAAAATGACGGAAAAACCCATGAAGAAGGGTAGTTTGTAATTTATCCCAGTTTCTGTATTCTAAATTTTAATTTTA
AGTACGGACGAAAGAAAACCCCACTCATGTGCTTCTCTTCTTATCCCGTTTCCACCAACCCTTCTAATTGTATTGCCCCTCCCAATGTAACGCTTTATTTTTTCCTTATG
TATTTCTCATTCTCTTCGCCTTTGAGAGTGGAAAATAGTAACTTCAATCTTGTAAATATCAATTTTGGTAACTTTCTCCTTTCTTTCTTTCTTTTTTTATTTTCTTCCAT
AAATTTTAATTAATCTTCATATTTTGTTTAATTTTAGATGTGTG
Protein sequence
MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRKGEKGLLRD
IQRRKVALSVTTTTTSAAVAVPVTVAASPTVLAHVISPANSAEEQVTSSNSSPMAFQRSTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNYASGQQHQ
LESGSVRDGKALELLPARQVMEDEGAVSDGAHEVRLKMEETMTAAVAGVGMTPKLFGVSIGVKRMRREVDEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSEHPDGSAS
PWLELGNQGS
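
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, assuming the standard nuclear code applies; `translate` is a hypothetical helper written for this example. To stay short it translates only the first 30 nucleotides and the final 33 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCTCCGTCGCCGGCCGAACCGATCGGC"   # first 30 nt of the CDS
cds_end = "CCGTGGCTTGAACTCGGTAATCAAGGCTCCTGA"  # last 33 nt, ending in the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the first and last ten residues of the protein sequence listed above; running `translate` on the full CDS is the natural extension.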