CuGenDBv2

Sgr012033 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr012033
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: heat stress transcription factor A-6b-like
Genome location: tig00153201:51986..53626
RNA-Seq Expression: Sgr012033
Synteny: Sgr012033
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
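
The Genome location field above packs the contig name and coordinates into a single string. As a minimal sketch, assuming the `contig:start..end` layout uses 1-based inclusive coordinates (an assumption, not stated on this page), the field can be split as follows. `GenomeLocation` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return GenomeLocation(contig, start, end)


loc = parse_location("tig00153201:51986..53626")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under that coordinate assumption the gene spans 1641 bp on the contig.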


Homology
GenBank top hits | e value | % identity | Alignment
XP_022155815.1 heat stress transcription factor A-6b-like [Momordica charantia] | 2.9e-139 | 76.99
Query:  NPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY
        NPLFPVK+EFPG  SSE  G+R+ M  PLPMEGLHDVG PPFLTKTF+IVDD NT+HVISWSFGG+SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNTY
Subjt:  NPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY

Query:  -----------------------------RRRATY-QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE
                                     RRRATY QQSLHSQQA GACVEVGQFG D++V RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE
Subjt:  -----------------------------RRRATY-QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE

Query:  IKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESE-EYGFGITELEALALEMQGLSKARHE
        IKQRQMMNFLARA+QNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQ      RFSGE SN IK EPLESE +YGFG+TELEALALEMQGL KARHE
Subjt:  IKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESE-EYGFGITELEALALEMQGLSKARHE

Query:  EVEEE--------DGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        E EEE        + K LQ ED DKVLDE FWEEL SERLEVAR E+EDV VLADRLGYLGSSPR
Subjt:  EVEEE--------DGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

XP_022954107.1 heat stress transcription factor A-6b-like [Cucurbita moschata] | 7.1e-138 | 73.26
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLF VKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ ASGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLE EEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +               L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS+PR
Subjt:  HEEVEEEDGK---------------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

XP_022991636.1 heat stress transcription factor A-6b-like isoform X1 [Cucurbita maxima] | 1.3e-139 | 74.73
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLFPVKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ +SGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLESEEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +         L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS PR
Subjt:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

XP_022991639.1 heat stress transcription factor A-6b-like isoform X2 [Cucurbita maxima] | 1.3e-139 | 74.73
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLFPVKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ +SGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLESEEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +         L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS PR
Subjt:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

XP_038899271.1 heat stress transcription factor A-2b-like [Benincasa hispida] | 4.9e-139 | 74.53
Query:  MNPLFPVKEEFPGSSSSEPVGERS-VMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLN
        MNPLFPVKEEFPGSSSS+  GERS +M PPLPMEGLHD G PPFLTKTF+IVDD NT  VISWSFGGTSF++WDPHCFST+LLPRFFKHNNFSSFVRQLN
Subjt:  MNPLFPVKEEFPGSSSSEPVGERS-VMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLN

Query:  TY-----------------------------RRRATYQQSLH--------SQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQA
        TY                             RRRAT     H        SQ ASGACVEVGQFG+D+++ RLKRDKQVLMMELV LRQEQQNTRAYLQA
Subjt:  TY-----------------------------RRRATYQQSLH--------SQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQA

Query:  MEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEM
        MEQRLRGTEIKQRQMMNFLARA++NPSFIQQL+QQKEKRKELEEAITKKRRRPIEQ GQ SSGGSGRF GEGSN IKIEPLES+EYGFGITELEALALEM
Subjt:  MEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEM

Query:  QGLSKARH--------EEVEEEDGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP
        QGL K RH        EE EEE    L PED DKVLDE FWEEL SERLE AR+EDE+V VLADRLGYLGSSP
Subjt:  QGLSKARH--------EEVEEEDGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP

TrEMBL top hits | e value | % identity | Alignment
A0A1S4DYR8 heat stress transcription factor A-6b-like isoform X3 | 4.5e-138 | 74.59
Query:  MNPLFPVKEEFPGSSSSEPVGERS-VMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLN
        MNPLFP+KEEF GSSSS+  GERS V+ PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSF GTSF++WDPHCFST+LLPRFFKHNNFSSFVRQLN
Subjt:  MNPLFPVKEEFPGSSSSEPVGERS-VMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLN

Query:  TY-----------------------------RRRAT---YQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRL
        TY                             RRR T   + Q+L SQ ASGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRL
Subjt:  TY-----------------------------RRRAT---YQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRL

Query:  RGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSS--GGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLS
        RGTEIKQ+QMMNFLARA++NPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ  Q +  GGSGRF GEGSN IKIEPLES+EYGFGITELEALALEMQGL 
Subjt:  RGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSS--GGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLS

Query:  KARH-----EEVEEEDGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP
        K R+     EE EE+D  LL PED DKVLDE FWEEL SERLE AR+EDE+V VLADRLGYLGSSP
Subjt:  KARH-----EEVEEEDGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP

A0A6J1DNG9 heat stress transcription factor A-6b-like | 1.4e-139 | 76.99
Query:  NPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY
        NPLFPVK+EFPG  SSE  G+R+ M  PLPMEGLHDVG PPFLTKTF+IVDD NT+HVISWSFGG+SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNTY
Subjt:  NPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY

Query:  -----------------------------RRRATY-QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE
                                     RRRATY QQSLHSQQA GACVEVGQFG D++V RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE
Subjt:  -----------------------------RRRATY-QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTE

Query:  IKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESE-EYGFGITELEALALEMQGLSKARHE
        IKQRQMMNFLARA+QNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQ      RFSGE SN IK EPLESE +YGFG+TELEALALEMQGL KARHE
Subjt:  IKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESE-EYGFGITELEALALEMQGLSKARHE

Query:  EVEEE--------DGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        E EEE        + K LQ ED DKVLDE FWEEL SERLEVAR E+EDV VLADRLGYLGSSPR
Subjt:  EVEEE--------DGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

A0A6J1GQ41 heat stress transcription factor A-6b-like | 3.4e-138 | 73.26
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLF VKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ ASGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLE EEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +               L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS+PR
Subjt:  HEEVEEEDGK---------------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

A0A6J1JRB0 heat stress transcription factor A-6b-like isoform X1 | 6.3e-140 | 74.73
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLFPVKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ +SGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLESEEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +         L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS PR
Subjt:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

A0A6J1JVE6 heat stress transcription factor A-6b-like isoform X2 | 6.3e-140 | 74.73
Query:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT
        MNPLFPVKEEFPGS+SSE   ER  M PP+PMEGLHD G PPFLTKTF+IVDD NT HVISWSFGG SFV+WDPHCFSTELLPRFFKHNNFSSFVRQLNT
Subjt:  MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNT

Query:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
        Y                             RRRATY   Q+  SQ +SGACVEVGQFG+D+++ RLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG
Subjt:  Y-----------------------------RRRATY--QQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRG

Query:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR
        TEIKQRQMMNFLARA+QNPSFIQQL+QQKEKRK LEEAIT+KRRRPI+Q G+QS GGSGRF GEGS+ IKIEPLESEEYGFGITELEALALEMQGL +AR
Subjt:  TEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQ-GQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKAR

Query:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
        HEE EEE+ +         L  P D DKVLDE FWEEL SERLEVA  +DEDVKVLADRLGYLGS PR
Subjt:  HEEVEEEDGK---------LLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR

SwissProt top hits | e value | % identity | Alignment
Q338B0 Heat stress transcription factor A-2c | 6.6e-62 | 45.95
Query:  PLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY---------------------------
        P PMEGLH+VG PPFLTKT+D+V+D  T  V+SWS  G SFV+WDPH F+  LLPR FKHNNFSSFVRQLNTY                           
Subjt:  PLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY---------------------------

Query:  --RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKE
          RR+         QQ+  +C+EVG+FG + ++ RLKRDK +L+ E+VKLRQEQQ T+ +++AME RLR  E KQ QMM FLARA++NP F QQL QQKE
Subjt:  --RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKE

Query:  KRKELEEAITKKRRRPIEQGQ-QSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELL
        KRKELE+AI+KKRRRPI+       G + +     S  +    + +E    GI ELE LA+ +Q L K + +E  +         +    L + FW ELL
Subjt:  KRKELEEAITKKRRRPIEQGQ-QSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELL

Query:  SERLEVARSEDE-DVKV-----LADRLGYLGSS
         E       + E D K+     LA +LGYL S+
Subjt:  SERLEVARSEDE-DVKV-----LADRLGYLGSS

Q6F388 Heat stress transcription factor A-2e | 2.5e-61 | 44.63
Query:  PVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYR---
        PVK E  G S+    G+     PP PM+GL D G PPFLTKT+D+VDD  T  V+SWS    SFV+WDPH F   LLPR+FKHNNFSSFVRQLNTY    
Subjt:  PVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYR---

Query:  ------------------------RRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMM
                                +R     S  SQQ+ G+ +EVG FG + ++ +LKRDK +LM E+VKLRQEQQNT++ LQAMEQ+L+GTE KQ+ MM
Subjt:  ------------------------RRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMM

Query:  NFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF-GI-TELEALALEMQGLSKARHE--EVEE
         FL+R + NP FI+QL  Q E RKELEE ++KKRRR I+QG +        S E  + +  EP +  +  F G+ ++LE+ ++E  G  KA+ +      
Subjt:  NFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF-GI-TELEALALEMQGLSKARHE--EVEE

Query:  EDGKLLQPEDADKVLDEWFWEELLSE---RLEVARSEDEDVKVLADRLGYLGSS
        E GK ++P + +  L+E FWE+LL E     +      +D+ +L+ ++GYL SS
Subjt:  EDGKLLQPEDADKVLDEWFWEELLSE---RLEVARSEDEDVKVLADRLGYLGSS

Q6VBB2 Heat stress transcription factor A-2b | 2.8e-68 | 46.85
Query:  VPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY-------------------------
        V P PMEGLHD G PPFLTKT+D+VDD  T   +SWS    SFV+WDPH F+T LLPRFFKHNNFSSFVRQLNTY                         
Subjt:  VPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY-------------------------

Query:  ---RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQK
           +RR     +  +QQ+ G  +EVG FG D+++ RLKRDKQ+LM E+VKLRQEQQNT+A L+AME RL+GTE +Q+QMM FLAR ++NP F++QL+ Q 
Subjt:  ---RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQK

Query:  EKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF-GI-TELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEE
        E RKEL++AI+KKRRR I+QG +        S E  +    +P ES E+   GI ++LE  A++  GL + +  +V   + + + P+     L++ FWEE
Subjt:  EKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF-GI-TELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEE

Query:  LLSERLEVARSE----DEDVKVLADRLGYLGSS
        LL+E L    ++    ++D+ VL++++GYL S+
Subjt:  LLSERLEVARSE----DEDVKVLADRLGYLGSS

Q8H7Y6 Heat stress transcription factor A-2d | 4.6e-63 | 44.1
Query:  VKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY-----
        VKEE+P SS  E  GE      P PMEGLH+VG PPFLTKTFD+V D  T  V+SW   G+SFV+WDPH F+   LPRFFKHNNFSSFVRQLNTY     
Subjt:  VKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY-----

Query:  -----------------------RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMM
                               +RR        SQQA G C+EVGQFG+D ++ RLKRDK +L+ E+VKLR +QQ+T+A ++AME+RL+  E KQ QMM
Subjt:  -----------------------RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMM

Query:  NFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF--GITELEALALEMQGLSKARHEEVEEED
         FLARA+QNP F  QL+ Q++K K LE+  +KKR R I+     + G      +  + +  +P    E       +ELE LAL +QGL K + +     +
Subjt:  NFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGF--GITELEALALEMQGLSKARHEEVEEED

Query:  GKLLQPEDADKVLDEWFWEELLSERLE-------VARSEDEDVKVLADRLGYLGSS
            Q  +  ++ D+ FWEELL+E          + R     V  LA +LGYL +S
Subjt:  GKLLQPEDADKVLDEWFWEELLSERLE-------VARSEDEDVKVLADRLGYLGSS

Q9LUH8 Heat stress transcription factor A-6b | 2.0e-74 | 44.03
Query:  MNPLFP-VKEEFPGSSSSEP------------------VGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTEL
        M+P F  +KEEFP   S  P                  + + + +  P P+EGLH+ G PPFLTKT+D+V+D  T+HV+SWS    SF++WDP  FS  L
Subjt:  MNPLFP-VKEEFPGSSSSEP------------------VGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTEL

Query:  LPRFFKHNNFSSFVRQLNTY----------------------------RRRATYQQSLHSQQASGA--------CVEVGQFGIDSDVGRLKRDKQVLMME
        LPRFFKHNNFSSFVRQLNTY                            RRR T   S   QQ   +        C+EVG++G+D ++  L+RDKQVLMME
Subjt:  LPRFFKHNNFSSFVRQLNTY----------------------------RRRATYQQSLHSQQASGA--------CVEVGQFGIDSDVGRLKRDKQVLMME

Query:  LVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPL---
        LV+LRQ+QQ+T+ YL  +E++L+ TE KQ+QMM+FLARA+QNP FIQQLV+QKEKRKE+EEAI+KKR+RPI+QG+++    G  SG G++          
Subjt:  LVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPL---

Query:  ESEEYGFG------ITELEALALEMQGL---SKARHE---------EVEEEDGKLLQPEDADKVLDEWFWEELLSERLEV-ARSEDEDVKVLADRLGYLG
         S+EY +G      ++EL+ LA+ +QGL   S AR E         E E ED +    ++ +++  E FWE+LL+E        + E+V VL  +LGYLG
Subjt:  ESEEYGFG------ITELEALALEMQGL---SKARHE---------EVEEEDGKLLQPEDADKVLDEWFWEELLSERLEV-ARSEDEDVKVLADRLGYLG

Query:  SS
        SS
Subjt:  SS

Arabidopsis top hits | e value | % identity | Alignment
AT1G32330.1 heat shock transcription factor A1D | 1.0e-41 | 41.86
Query:  PLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY---------------------------
        P P   L     PPFL+KT+D+VDD NT  ++SWS    SF++W P  F+ +LLP+ FKHNNFSSFVRQLNTY                           
Subjt:  PLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY---------------------------

Query:  --RRRAT------YQQSLHS---QQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSF
          RR+        +Q+S HS     +  ACVEVG+FG++ +V RLKRDK VLM ELV+LRQ+QQ+T   LQ M QRL+G E +Q+Q+M+FLA+A+Q+P F
Subjt:  --RRRAT------YQQSLHS---QQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSF

Query:  IQQLVQQKEKRKELEEAI--TKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEE
        + Q +QQ+ ++ E    I  T K+RR    G   +  S    G+    +K +P   E+
Subjt:  IQQLVQQKEKRKELEEAI--TKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEE

AT2G26150.1 heat shock transcription factor A2 | 3.6e-47 | 37.75
Query:  FPGS-SSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY--------
        F GS ++S  VG  S    P PMEGL++ G PPFLTKT+++V+D  T  V+SWS G  SFV+WD H FST LLPR+FKH+NFSSF+RQLNTY        
Subjt:  FPGS-SSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTY--------

Query:  ---------------------RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNF
                             RRR    Q+++ Q +  +CVEVGQ+G D +V RLKRD  VL+ E+V+LRQ+Q ++++ + AMEQRL  TE +Q+QMM F
Subjt:  ---------------------RRRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNF

Query:  LARALQNPSFIQQL-VQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKL
        LA+AL NP+F+QQ  V  KEK+      + +KRR        S+   G       N +  +  +  +    +    A+  E       + E+  E    +
Subjt:  LARALQNPSFIQQL-VQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKL

Query:  LQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP
        ++  + +  LD    E+L+   L+    + +D+  + D++G+LGS P
Subjt:  LQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSP

AT3G22830.1 heat shock transcription factor A6B | 1.4e-75 | 44.03
Query:  MNPLFP-VKEEFPGSSSSEP------------------VGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTEL
        M+P F  +KEEFP   S  P                  + + + +  P P+EGLH+ G PPFLTKT+D+V+D  T+HV+SWS    SF++WDP  FS  L
Subjt:  MNPLFP-VKEEFPGSSSSEP------------------VGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTEL

Query:  LPRFFKHNNFSSFVRQLNTY----------------------------RRRATYQQSLHSQQASGA--------CVEVGQFGIDSDVGRLKRDKQVLMME
        LPRFFKHNNFSSFVRQLNTY                            RRR T   S   QQ   +        C+EVG++G+D ++  L+RDKQVLMME
Subjt:  LPRFFKHNNFSSFVRQLNTY----------------------------RRRATYQQSLHSQQASGA--------CVEVGQFGIDSDVGRLKRDKQVLMME

Query:  LVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPL---
        LV+LRQ+QQ+T+ YL  +E++L+ TE KQ+QMM+FLARA+QNP FIQQLV+QKEKRKE+EEAI+KKR+RPI+QG+++    G  SG G++          
Subjt:  LVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPL---

Query:  ESEEYGFG------ITELEALALEMQGL---SKARHE---------EVEEEDGKLLQPEDADKVLDEWFWEELLSERLEV-ARSEDEDVKVLADRLGYLG
         S+EY +G      ++EL+ LA+ +QGL   S AR E         E E ED +    ++ +++  E FWE+LL+E        + E+V VL  +LGYLG
Subjt:  ESEEYGFG------ITELEALALEMQGL---SKARHE---------EVEEEDGKLLQPEDADKVLDEWFWEELLSERLEV-ARSEDEDVKVLADRLGYLG

Query:  SS
        SS
Subjt:  SS

AT3G51910.1 heat shock transcription factor A7A | 3.8e-57 | 48.26
Query:  PPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYRRR---------ATYQQSLHSQQASG
        PP PMEGLH+   PPFLTKTF++VDD NT H++SW+ GGTSFV+WD H FST LLPR FKH+NFSSF+RQLNTY  R         A  +  L  +Q   
Subjt:  PPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYRRR---------ATYQQSLHSQQASG

Query:  ACVEVGQFGIDSD-----VGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKR-KELEEAITKKR
               F   S         L+R+KQVLMME+V LRQ+QQ T++Y++AMEQR+ GTE KQRQMM+FLARA+Q+PSF+ QL++Q++K+ KELE+  + KR
Subjt:  ACVEVGQFGIDSD-----VGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKR-KELEEAITKKR

Query:  RRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELLSE
        +R                  GS++              ++ELE LALEMQG  K R+  +EEED +L+     ++ LD+ FWEELLS+
Subjt:  RRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELLSE

AT3G63350.1 winged-helix DNA-binding transcription factor family protein | 1.6e-50 | 41.07
Query:  MVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYR-----------------------
        M PP+PMEGL + G  PFLTKTF++V D NT+H++SW+ GG SFV+WDPH FS  +LP +FKHNNFSSFVRQLNTY                        
Subjt:  MVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYR-----------------------

Query:  ----RRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQK
            +R T   S  S   S +  E    G+  ++ +L+ ++ VLMME+  LRQE+Q  R Y+QAMEQR+ G E KQR MM+FL RA++NPS +QQ+ +QK
Subjt:  ----RRATYQQSLHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQK

Query:  EKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELL
          R+E            I+Q               +  IK+E +E       ++ELEALALEMQG  + R + VE E             LD+ FWEELL
Subjt:  EKRKELEEAITKKRRRPIEQGQQSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELL

Query:  SERLEVARSEDEDVKVLAD
           +    S++E+  V  D
Subjt:  SERLEVARSEDEDVKVLAD
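
The % identity column in the hit tables above is conventionally the number of identical positions divided by the alignment length, gap columns included. The sketch below assumes a BLAST-style midline in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. `percent_identity` is a hypothetical helper, and the toy alignment is made up for illustration rather than taken from this record.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent identity for one BLAST-style alignment block."""
    aln_len = len(query_line)
    if aln_len == 0:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often stripped in copied text,
    # so pad (or trim) it to the length of the query line.
    padded = midline.ljust(aln_len)[:aln_len]
    # Only letters in the midline count as identical positions.
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / aln_len


query = "MNPLF-VKEE"
mid = "MNPLF VK+E"
print(round(percent_identity(query, mid), 2))
```

For a hit that spans several blocks, the identical counts and block lengths would be summed across blocks before dividing.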


Sequences
CDS sequence
ATGAATCCTCTGTTTCCGGTTAAGGAAGAGTTTCCGGGATCGAGTTCGTCGGAGCCGGTCGGAGAGCGATCGGTGATGGTTCCTCCTCTGCCAATGGAGGGGCTCCACGA
CGTCGGTCTTCCGCCATTTCTGACGAAGACGTTTGATATCGTCGATGACCTTAATACCAGCCATGTGATTTCTTGGAGCTTTGGGGGAACTAGCTTCGTCATCTGGGATC
CTCATTGCTTCTCCACTGAATTACTTCCACGGTTTTTCAAGCACAATAACTTCTCCAGCTTCGTTCGGCAGCTTAATACTTACAGGAGGAGGGCAACGTATCAGCAGTCT
CTCCACTCGCAGCAAGCTAGCGGAGCTTGTGTAGAGGTCGGCCAGTTTGGGATAGATTCAGACGTGGGTCGTCTAAAACGTGACAAGCAAGTGCTAATGATGGAGTTAGT
GAAGCTAAGGCAAGAGCAGCAGAACACTAGAGCGTATCTTCAAGCAATGGAGCAAAGGCTGAGAGGAACTGAAATCAAGCAGAGGCAAATGATGAACTTCTTAGCAAGAG
CCCTGCAAAATCCATCATTTATTCAGCAGTTAGTCCAGCAAAAGGAGAAGAGGAAGGAGCTTGAAGAAGCTATAACTAAGAAAAGGAGAAGGCCCATTGAGCAAGGGCAA
CAGAGCAGTGGTGGAAGTGGGAGATTTTCAGGTGAAGGATCAAACGCCATAAAGATTGAGCCTCTAGAATCTGAAGAATATGGGTTTGGAATAACAGAGCTAGAAGCACT
TGCCTTGGAGATGCAGGGGTTGAGTAAGGCAAGACATGAGGAGGTGGAAGAAGAAGATGGTAAGTTACTGCAACCAGAGGATGCAGATAAAGTACTTGATGAGTGGTTTT
GGGAAGAATTGTTAAGTGAAAGGTTGGAGGTGGCGAGAAGTGAAGATGAAGATGTGAAAGTGTTGGCTGATCGACTGGGCTACTTGGGCTCAAGCCCAAGATAG
mRNA sequence
ATGAATCCTCTGTTTCCGGTTAAGGAAGAGTTTCCGGGATCGAGTTCGTCGGAGCCGGTCGGAGAGCGATCGGTGATGGTTCCTCCTCTGCCAATGGAGGGGCTCCACGA
CGTCGGTCTTCCGCCATTTCTGACGAAGACGTTTGATATCGTCGATGACCTTAATACCAGCCATGTGATTTCTTGGAGCTTTGGGGGAACTAGCTTCGTCATCTGGGATC
CTCATTGCTTCTCCACTGAATTACTTCCACGGTTTTTCAAGCACAATAACTTCTCCAGCTTCGTTCGGCAGCTTAATACTTACAGGAGGAGGGCAACGTATCAGCAGTCT
CTCCACTCGCAGCAAGCTAGCGGAGCTTGTGTAGAGGTCGGCCAGTTTGGGATAGATTCAGACGTGGGTCGTCTAAAACGTGACAAGCAAGTGCTAATGATGGAGTTAGT
GAAGCTAAGGCAAGAGCAGCAGAACACTAGAGCGTATCTTCAAGCAATGGAGCAAAGGCTGAGAGGAACTGAAATCAAGCAGAGGCAAATGATGAACTTCTTAGCAAGAG
CCCTGCAAAATCCATCATTTATTCAGCAGTTAGTCCAGCAAAAGGAGAAGAGGAAGGAGCTTGAAGAAGCTATAACTAAGAAAAGGAGAAGGCCCATTGAGCAAGGGCAA
CAGAGCAGTGGTGGAAGTGGGAGATTTTCAGGTGAAGGATCAAACGCCATAAAGATTGAGCCTCTAGAATCTGAAGAATATGGGTTTGGAATAACAGAGCTAGAAGCACT
TGCCTTGGAGATGCAGGGGTTGAGTAAGGCAAGACATGAGGAGGTGGAAGAAGAAGATGGTAAGTTACTGCAACCAGAGGATGCAGATAAAGTACTTGATGAGTGGTTTT
GGGAAGAATTGTTAAGTGAAAGGTTGGAGGTGGCGAGAAGTGAAGATGAAGATGTGAAAGTGTTGGCTGATCGACTGGGCTACTTGGGCTCAAGCCCAAGATAG
Protein sequence
MNPLFPVKEEFPGSSSSEPVGERSVMVPPLPMEGLHDVGLPPFLTKTFDIVDDLNTSHVISWSFGGTSFVIWDPHCFSTELLPRFFKHNNFSSFVRQLNTYRRRATYQQS
LHSQQASGACVEVGQFGIDSDVGRLKRDKQVLMMELVKLRQEQQNTRAYLQAMEQRLRGTEIKQRQMMNFLARALQNPSFIQQLVQQKEKRKELEEAITKKRRRPIEQGQ
QSSGGSGRFSGEGSNAIKIEPLESEEYGFGITELEALALEMQGLSKARHEEVEEEDGKLLQPEDADKVLDEWFWEELLSERLEVARSEDEDVKVLADRLGYLGSSPR
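
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, not a tool used by the database. It stops at the first stop codon and writes `X` for any codon it does not recognise.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGAATCCTCTGTTTCCGGTTAAGGAAGAG"))
print(translate("AGCCCAAGATAG"))
```

The two fragments translate to `MNPLFPVKEE` and `SPR`, which agree with the start and end of the protein sequence listed above.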