; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003137 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003137
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHeat stress transcription factor
Genome locationtig00001981:19463..20667
RNA-Seq ExpressionSgr003137
SyntenySgr003137
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598287.1 Heat shock factor protein HSF30, partial [Cucurbita argyrosperma subsp. sororia]1.2e-14978.53Show/hide
Query:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG
        SSSSSS SS+TPQPI+G+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWD H+FSS+LLPRYFKH NFSSF+RQLNTYGFRKVDPDRWEFAN G
Subjt:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG

Query:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF
        FLGGQR+LL+TIKRRRHS+QSFQ HQGG CVELG++GLEGELERL+RD+S L+A+LVRLRQQHQSSREQI+AMEDRL+ +E KQKQIMTFLS+ALKNPSF
Subjt:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF

Query:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW
        ++KFIHSNQ RELRG EIGRKRRLT+SPSVENL+EE+V VA+K E     EPD+ETLL VN E ES+ EI DPVSDG+PMAA+DVGLSAHEEL  + E+W
Subjt:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW

Query:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         ED  AG PEEE I+V +QS+ D EVEDL AEPPDW  +LQ+LVDQM +LR KP
Subjt:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

XP_022132202.1 heat shock factor protein HSF30 [Momordica charantia]3.0e-17488.76Show/hide
Query:  AAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWE
        AAG +SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDP TDSIVSWSKARNSFIVWDSH+FSSTLLPRYFKHRNFSSF+RQLNTYGFRKVDPDRWE
Subjt:  AAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWE

Query:  FANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALK
        FAN GFLGGQRNLLKTIKRRRH+SQSF HQGGTCVELGQ+GL+GELERLRRD+S L+A+LVRLRQQHQSSREQ+ AMEDRLQNAERKQKQIMTFLS+ALK
Subjt:  FANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALK

Query:  NPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDE
        NPSFI+KFIHSNQRRELRG EIGRKRRLTASPSVENL+EENVLVA++ EE+ETLEPDIETLLTVNLEDESSSE+KDPVSDGIPMAA+D+G  A EELWDE
Subjt:  NPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDE

Query:  IWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
        +WTEDL+AG PEEE IIV DQSEAD EVEDL AEPPDWT DLQELVDQMG+LRS P
Subjt:  IWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

XP_022961710.1 heat stress transcription factor A-2-like [Cucurbita moschata]1.2e-14978.53Show/hide
Query:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG
        SSSSSS SS+TPQPI+G+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWD H+FSS+LLPRYFKH NFSSF+RQLNTYGFRKVDPDRWEFAN G
Subjt:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG

Query:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF
        FLGGQR+LL+TIKRRRHS+QSFQ HQGG CVELG++GLEGELERL+RD+S L+A+LVRLRQQHQSSREQI+AMEDRL+ +E KQKQIMTFLS+ALKNPSF
Subjt:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF

Query:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW
        ++KFIHSNQ RELRG EIGRKRRLT+SPSVENL+EE+V VA+K E     EPD+ETLL VN E ES+ EI DPVSDG+PMAA+DVGLSAHEEL  + E+W
Subjt:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW

Query:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         ED  AG PEEE I+V +QS+ D EVEDL AEPPDW  +LQ+LVDQM +LR KP
Subjt:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

XP_023545907.1 heat stress transcription factor A-2-like [Cucurbita pepo subsp. pepo]5.2e-15078.81Show/hide
Query:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG
        SSSSSS SS+TPQPI+G+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWD H+FSS+LLPRYFKH NFSSF+RQLNTYGFRKVDPDRWEFAN G
Subjt:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG

Query:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF
        FLGGQR+LL+TIKRRRHS+QSFQ HQGG CVELG++GLEGELERL+RD+S L+A+LVRLRQQHQSSREQI+AMEDRL+ +E KQKQIMTFLS+ALKNPSF
Subjt:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF

Query:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW
        ++KFIHSNQ RELRG EIGRKRRLT+SPSVENL+EE+V VA+K E     EPD+ETLL VN E ES+ EI DPVSDGIPMAA+DVGLSAHEEL  + E+W
Subjt:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW

Query:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         ED  AG PEEE I+V +QS+ D EVEDL AEPPDW  +LQ+LVDQM +LR KP
Subjt:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

XP_038885370.1 heat stress transcription factor A-2 [Benincasa hispida]1.5e-14979.17Show/hide
Query:  AGAAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDR
        A ++  +SS SSSSSS+TPQPI+GLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWD H+FS+TLLPRYFKH NFSSFIRQLNTYGFRKVDPDR
Subjt:  AGAAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDR

Query:  WEFANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRA
        WEFAN GFLGGQR+LL+TIKRRRHS Q+ QHQGGTCVELGQ+GLE +LERLRRD+S L+A+LVRLRQQHQSSREQIMAMEDRL+ AE KQKQIMTFLS+A
Subjt:  WEFANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRA

Query:  LKNPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL-
        LKNPSFI+KFIHSNQ +ELR  EIGRKRRLTASPSVENL++ENV VA+K EELET EPDIETLLTVN EDESS EI DPVS        D+G SAHEEL 
Subjt:  LKNPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL-

Query:  -WDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         + ++W EDL+AG+P EE I V +QS+ D EVEDL AEP DWT DLQELVDQMG+LRSKP
Subjt:  -WDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

TrEMBL top hitse value%identityAlignment
A0A0A0LKS1 HSF_DOMAIN domain-containing protein2.8e-14978.95Show/hide
Query:  MAGAAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPD
        +A A   +SS SSSSSS+TPQPI+GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWD H+FSSTLLPRYFKH NFSSFIRQLNTYGFRKVDPD
Subjt:  MAGAAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPD

Query:  RWEFANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSR
        RWEFAN GFLGGQRNLL+TIKRRRHS QS QH GGTCVELGQ+GLE +LERLRRD+S L+A+LVRLRQQHQSSR++IM MEDRL+ AE KQKQIMTFLS+
Subjt:  RWEFANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSR

Query:  ALKNPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE-
        ALKNPSFI+KFI+SNQ RELRG EIGRKRRLTASPSVENL +ENV VALK EELET EPDIETLLTVN EDESS EI DPVS        D+G S HEE 
Subjt:  ALKNPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE-

Query:  -LWDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         ++  +W EDL+AG+PEE  IIV +QS+ D EVEDL AEP DWT DLQELVDQMG+LRSKP
Subjt:  -LWDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

A0A1S4DVL9 heat stress transcription factor A-21.2e-14979.55Show/hide
Query:  AGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEF
        A  +SSSSSSSSS+TPQPI GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWD H+FSSTLLPRYFKH NFSSFIRQLNTYGFRKVDPDRWEF
Subjt:  AGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEF

Query:  ANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN
        AN GFLGGQRNLL+TIKRRRHS QS QH GGTCVELGQ+GLE +LERLRRD+S L+A+LVRLRQQHQSSREQIMAMEDRL+ AE KQKQIMTFLS+ALKN
Subjt:  ANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN

Query:  PSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE--LWD
        PSF++KFI+SNQ RELRG EIGRKRRLTASPSVENL++ENV VA+K EELET EPDIETLLTVN EDESS EI DPVS        D+G S HEE  ++ 
Subjt:  PSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE--LWD

Query:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
        + W ED +A +PEE  I+V +QS+ D EVEDL AEPPDWT DLQELVDQMG LRSKP
Subjt:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

A0A455PAZ2 HSF5.6e-15078.53Show/hide
Query:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG
        SSSSSS SS+TPQPI+G+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWD H+FSS+LLPRYFKH NFSSF+RQLNTYGFRKVDPDRWEFAN G
Subjt:  SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGG

Query:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF
        FLGGQR+LL+TIKRRRHS+QSFQ HQGG CVELG++GLEGELERL+RD+S L+A+LVRLRQQHQSSREQI+AMEDRL+ +E KQKQIMTFLS+ALKNPSF
Subjt:  FLGGQRNLLKTIKRRRHSSQSFQ-HQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSF

Query:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW
        ++KFIHSNQ RELRG EIGRKRRLT+SPSVENL+EE+V VA+K E     EPD+ETLL VN E ES+ EI DPVSDG+PMAA+DVGLSAHEEL  + E+W
Subjt:  IRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEEL--WDEIW

Query:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
         ED  AG PEEE I+V +QS+ D EVEDL AEPPDW  +LQ+LVDQM +LR KP
Subjt:  TEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

A0A5A7V104 Heat stress transcription factor A-21.2e-14979.55Show/hide
Query:  AGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEF
        A  +SSSSSSSSS+TPQPI GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWD H+FSSTLLPRYFKH NFSSFIRQLNTYGFRKVDPDRWEF
Subjt:  AGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEF

Query:  ANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN
        AN GFLGGQRNLL+TIKRRRHS QS QH GGTCVELGQ+GLE +LERLRRD+S L+A+LVRLRQQHQSSREQIMAMEDRL+ AE KQKQIMTFLS+ALKN
Subjt:  ANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN

Query:  PSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE--LWD
        PSF++KFI+SNQ RELRG EIGRKRRLTASPSVENL++ENV VA+K EELET EPDIETLLTVN EDESS EI DPVS        D+G S HEE  ++ 
Subjt:  PSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEE--LWD

Query:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
        + W ED +A +PEE  I+V +QS+ D EVEDL AEPPDWT DLQELVDQMG LRSKP
Subjt:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

A0A6J1BVL5 heat shock factor protein HSF301.5e-17488.76Show/hide
Query:  AAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWE
        AAG +SSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDP TDSIVSWSKARNSFIVWDSH+FSSTLLPRYFKHRNFSSF+RQLNTYGFRKVDPDRWE
Subjt:  AAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWE

Query:  FANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALK
        FAN GFLGGQRNLLKTIKRRRH+SQSF HQGGTCVELGQ+GL+GELERLRRD+S L+A+LVRLRQQHQSSREQ+ AMEDRLQNAERKQKQIMTFLS+ALK
Subjt:  FANGGFLGGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALK

Query:  NPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDE
        NPSFI+KFIHSNQRRELRG EIGRKRRLTASPSVENL+EENVLVA++ EE+ETLEPDIETLLTVNLEDESSSE+KDPVSDGIPMAA+D+G  A EELWDE
Subjt:  NPSFIRKFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDE

Query:  IWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP
        +WTEDL+AG PEEE IIV DQSEAD EVEDL AEPPDWT DLQELVDQMG+LRS P
Subjt:  IWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRSKP

SwissProt top hitse value%identityAlignment
O80982 Heat stress transcription factor A-24.1e-8950.42Show/hide
Query:  GGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFA
        G  ++SSS  SS +P+P++GL++ GPPPFLTKT+EMVEDP TD++VSWS  RNSF+VWDSH+FS+TLLPRYFKH NFSSFIRQLNTYGFRK+DPDRWEFA
Subjt:  GGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFA

Query:  NGGFLGGQRNLLKTIKRRRHSS-QSFQHQGG--TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRAL
        N GFL GQ++LLK IKRRR+   Q+   QG   +CVE+GQYG +GE+ERL+RD  +L+A++VRLRQQ  SS+ Q+ AME RL   E++Q+Q+MTFL++AL
Subjt:  NGGFLGGQRNLLKTIKRRRHSS-QSFQHQGG--TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRAL

Query:  KNPSFIRKF-IHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELW
         NP+F+++F + S +++ L G ++GRKRRLT++PS+  + EEN+   L  +E + ++ D+E L    ++DE+++                  +   EE  
Subjt:  KNPSFIRKF-IHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELW

Query:  DEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDW-TGDLQELVDQMGYLRSKP
         E     +  GN E         +  D +VEDL   P DW + DL ++VDQMG+L S+P
Subjt:  DEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDW-TGDLQELVDQMGYLRSKP

P41152 Heat shock factor protein HSF301.9e-9451.92Show/hide
Query:  PIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKTIK
        P++GLHDVGPPPFL+KT+EMVED  TD ++SWS  RNSFIVWDSH+FS+TLLPR+FKH NFSSFIRQLNTYGFRKVDPDRWEFAN GFLGGQ++LLKTIK
Subjt:  PIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKTIK

Query:  RRRHSSQSFQHQG-GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSN-QRRE
        RRR+  QS   QG G C+E+G YG+E ELERL+RDK++L+ ++V+LRQQ QS+R QI+AM ++++  ERKQ Q+M+FL++   NP+F+++++     R++
Subjt:  RRRHSSQSFQHQG-GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSN-QRRE

Query:  LRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDP---VSDGIPMAAIDVGLSAHEELWDEIWTEDLMAGNPEE
         +  E+G+KRRLT +PSV    ++ +  +   +E E     IE L +  +++ESSS ++      ++G  M  +       +++W+E+ +EDL++G+   
Subjt:  LRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDP---VSDGIPMAAIDVGLSAHEELWDEIWTEDLMAGNPEE

Query:  EVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYL
        E ++V +Q E D EVEDL  + P+W  +LQ+LVDQ+G+L
Subjt:  EVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYL

Q338B0 Heat stress transcription factor A-2c4.2e-7043.38Show/hide
Query:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT
        P+P++GLH+VGPPPFLTKT+++VEDP TD +VSWS+A NSF+VWD H F+  LLPR FKH NFSSF+RQLNTYGFRKVDPDRWEFAN GFL GQR+LLKT
Subjt:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT

Query:  IKRRRHSSQ---SFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQ
        IKRR+  S    S Q    +C+E+G++G E E++RL+RDK++L+ ++V+LRQ+ Q++++ + AMEDRL+ AE+KQ Q+M FL+RA++NP F ++     +
Subjt:  IKRRRHSSQ---SFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQ

Query:  RRELRGAEIGRKRRLTASPSVENLREENVLVALKPEEL------------ETLEPDIETL--LTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWD
        +R+     I +KRR      ++N+   +     + E+L            E  EP I  L  L VN++D    ++ +   +           +   EL D
Subjt:  RRELRGAEIGRKRRLTASPSVENLREENVLVALKPEEL------------ETLEPDIETL--LTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWD

Query:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRS
        + W E L+     E+     +QSE D +++            + EL  Q+GYL S
Subjt:  EIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRS

Q6VBB2 Heat stress transcription factor A-2b1.0e-6842.31Show/hide
Query:  GASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFAN
        G   +++ ++ + P+P++GLHD GPPPFLTKT++MV+D  TD+ VSWS   NSF+VWD H F++ LLPR+FKH NFSSF+RQLNTYGFRKVDPDRWEFAN
Subjt:  GASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFAN

Query:  GGFLGGQRNLLKTIKRRRHSSQSFQHQG--GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN
          FL GQR+LLK IKRR+  S +  +Q   G  +E+G +G + E++RL+RDK LL+A++V+LRQ+ Q+++  + AMEDRLQ  E++Q+Q+M FL+R +KN
Subjt:  GGFLGGQRNLLKTIKRRRHSSQSFQHQG--GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKN

Query:  PSFIRKFIHSNQ-RRELRGA-EIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDG---IPMAAIDVGLSAHE-
        P F+++ +  N+ R+EL+ A    R+RR+   P V+++   + +    P   +  E  +E L+     D   S++++   D    +     DVG S  + 
Subjt:  PSFIRKFIHSNQ-RRELRGA-EIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDG---IPMAAIDVGLSAHE-

Query:  -----ELWDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRS
             EL D  W E L  G       +V ++++ +  VED          D+  L ++MGYL S
Subjt:  -----ELWDEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDWTGDLQELVDQMGYLRS

Q8H7Y6 Heat stress transcription factor A-2d2.0e-6746.56Show/hide
Query:  SSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLG
        SS      P+P++GLH+VGPPPFLTKTF++V DP TD +VSW +A +SF+VWD H F++  LPR+FKH NFSSF+RQLNTYGFRK+DPDRWEFAN GFL 
Subjt:  SSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLG

Query:  GQRNLLKTIKRRRHSS--QSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIR
        GQR+LLK IKRRR  S     Q   GTC+E+GQ+GL+ E++RL+RDK++LLA++V+LR + QS++  + AME+RLQ+AE+KQ Q+M FL+RA++NP F  
Subjt:  GQRNLLKTIKRRRHSS--QSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIR

Query:  KFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLL-----TVNLEDE-SSSEIKDPVSD--GIPMAAIDVGLSAHE---
        + IH  Q+ +++G E    ++ T        R  +++  L P E+   +    TLL        L DE + SE+++   +  G+     DV  + ++   
Subjt:  KFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLL-----TVNLEDE-SSSEIKDPVSD--GIPMAAIDVGLSAHE---

Query:  ------ELWDEIWTEDLMAG
              EL D+ W E L  G
Subjt:  ------ELWDEIWTEDLMAG

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.3e-5850.4Show/hide
Query:  GASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFAN
        G   S  +  S  PQP   L    PPPFL+KT++MV+D  TDSIVSWS   NSFIVW    F+  LLP+ FKH NFSSF+RQLNTYGFRKVDPDRWEFAN
Subjt:  GASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFAN

Query:  GGFLGGQRNLLKTIKRRR------HSSQSFQHQGG------TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQI
         GFL GQ++LL++I RR+         Q  QH  G       CVE+G++GLE E+ERL+RDK++L+ +LVRLRQQ QS+  Q+  M  RLQ  E +Q+Q+
Subjt:  GGFLGGQRNLLKTIKRRR------HSSQSFQHQGG------TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQI

Query:  MTFLSRALKNPSFIRKFI-HSNQRRE--LRGAEIGRKRRLTASPSVEN
        M+FL++A+++P F+ +F+   NQ+ E   R ++  +KRR      V N
Subjt:  MTFLSRALKNPSFIRKFI-HSNQRRE--LRGAEIGRKRRLTASPSVEN

AT2G26150.1 heat shock transcription factor A22.9e-9050.42Show/hide
Query:  GGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFA
        G  ++SSS  SS +P+P++GL++ GPPPFLTKT+EMVEDP TD++VSWS  RNSF+VWDSH+FS+TLLPRYFKH NFSSFIRQLNTYGFRK+DPDRWEFA
Subjt:  GGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFA

Query:  NGGFLGGQRNLLKTIKRRRHSS-QSFQHQGG--TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRAL
        N GFL GQ++LLK IKRRR+   Q+   QG   +CVE+GQYG +GE+ERL+RD  +L+A++VRLRQQ  SS+ Q+ AME RL   E++Q+Q+MTFL++AL
Subjt:  NGGFLGGQRNLLKTIKRRRHSS-QSFQHQGG--TCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRAL

Query:  KNPSFIRKF-IHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELW
         NP+F+++F + S +++ L G ++GRKRRLT++PS+  + EEN+   L  +E + ++ D+E L    ++DE+++                  +   EE  
Subjt:  KNPSFIRKF-IHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELW

Query:  DEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDW-TGDLQELVDQMGYLRSKP
         E     +  GN E         +  D +VEDL   P DW + DL ++VDQMG+L S+P
Subjt:  DEIWTEDLMAGNPEEEVIIVSDQSEADAEVEDLFAEPPDW-TGDLQELVDQMGYLRSKP

AT3G22830.1 heat shock transcription factor A6B7.7e-6741.87Show/hide
Query:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT
        PQP++GLH+ GPPPFLTKT+++VED  T+ +VSWSK+ NSFIVWD   FS TLLPR+FKH NFSSF+RQLNTYGFRKV+PDRWEFAN GFL GQ++LLK 
Subjt:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT

Query:  IKRRRHSSQSFQHQ----------GGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIR
        I+RR+ S+ S Q Q             C+E+G+YGL+GE++ LRRDK +L+ +LVRLRQQ QS++  +  +E++L+  E KQKQ+M+FL+RA++NP FI+
Subjt:  IKRRRHSSQSFQHQ----------GGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIR

Query:  KFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDEIWTEDL
        + +   ++R+     I +KR+        N+ +         +   +    I         + S  E+ +     + +  +    SA EE+       ++
Subjt:  KFIHSNQRRELRGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDEIWTEDL

Query:  MAGNPEEEVIIVSDQSE----------ADAEVEDLFAEPP--DWTGDLQE---LVDQMGYLRS
          GN EEE   V DQ +           +   EDL  E    D+ GD +    L+ Q+GYL S
Subjt:  MAGNPEEEVIIVSDQSE----------ADAEVEDLFAEPP--DWTGDLQE---LVDQMGYLRS

AT3G51910.1 heat shock transcription factor A7A3.2e-5751.6Show/hide
Query:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT
        PQP++GLH+  PPPFLTKTFEMV+DP TD IVSW++   SF+VWD H FS+ LLPR+FKH NFSSFIRQLNTYGFRK++ +RWEFAN  FL GQR LLK 
Subjt:  PQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFLGGQRNLLKT

Query:  IKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQR--
        IKRR   + S       C E            LRR+K +L+ ++V LRQQ Q+++  I AME R++  ERKQ+Q+M+FL+RA+++PSF+ + +    +  
Subjt:  IKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQR--

Query:  RELRGAEIGRKRRLTASPS
        +EL   E  +++R ++S S
Subjt:  RELRGAEIGRKRRLTASPS

AT4G17750.1 heat shock factor 12.9e-5849.21Show/hide
Query:  MAGAAGGASSSSSSSSS---ITPQPIDGLH-DVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRK
        M G  GG ++   + ++     P P   L+ +  PPPFL+KT++MVEDP TD+IVSWS   NSFIVWD   FS  LLP+YFKH NFSSF+RQLNTYGFRK
Subjt:  MAGAAGGASSSSSSSSS---ITPQPIDGLH-DVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRK

Query:  VDPDRWEFANGGFLGGQRNLLKTIKRRR------------HSSQSFQHQG-----GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMA
        VDPDRWEFAN GFL GQ++LLK I RR+             S Q  Q QG      +CVE+G++GLE E+E+L+RDK++L+ +LV+LRQQ Q++  ++  
Subjt:  VDPDRWEFANGGFLGGQRNLLKTIKRRR------------HSSQSFQHQG-----GTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMA

Query:  MEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQRRELRGAEIGRKRRL
        +   LQ  E++Q+QIM+FL++A++NP+F+ +FI       +   E  +KRRL
Subjt:  MEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQRRELRGAEIGRKRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGCGCGGCAGGGGGCGCGTCGTCTTCTTCTTCATCGTCTTCGAGCATCACGCCCCAGCCGATTGACGGCTTACATGACGTGGGTCCGCCTCCATTTCTGACGAA
GACCTTTGAGATGGTGGAAGATCCATTGACGGACTCCATTGTTTCATGGAGTAAGGCTCGGAATAGCTTCATCGTTTGGGACTCTCACAGATTCTCCAGTACCTTGCTTC
CTCGATACTTCAAACACCGTAATTTCTCCAGCTTCATTCGTCAGCTCAATACTTATGGTTTTCGAAAAGTCGATCCTGATCGTTGGGAGTTTGCGAATGGAGGGTTTCTG
GGAGGACAGCGAAATCTGTTGAAAACCATAAAGAGGAGGAGGCATTCATCCCAGAGCTTCCAACATCAAGGAGGAACGTGTGTGGAGTTGGGGCAATATGGACTGGAAGG
CGAACTCGAAAGGTTGAGAAGGGACAAAAGCTTGTTACTGGCGGATTTAGTAAGATTGAGGCAGCAACACCAGAGCTCAAGAGAGCAAATAATGGCCATGGAGGATAGGC
TACAGAATGCAGAGAGAAAACAGAAGCAGATCATGACATTCCTTAGCAGAGCCCTTAAGAATCCCTCCTTCATTCGGAAATTTATCCATAGTAATCAGCGAAGAGAACTG
AGAGGCGCCGAAATCGGACGAAAGCGAAGACTAACTGCCAGCCCAAGTGTAGAGAATCTCCGGGAAGAAAATGTACTAGTAGCTCTAAAACCAGAAGAACTAGAAACTTT
GGAGCCAGATATAGAGACGCTGTTAACAGTCAACTTAGAAGACGAGTCAAGCAGTGAGATCAAGGACCCAGTTTCAGATGGCATTCCCATGGCAGCCATCGATGTGGGTC
TCTCTGCTCATGAGGAATTATGGGATGAAATCTGGACTGAAGATCTAATGGCTGGAAATCCAGAGGAGGAAGTCATAATCGTTAGTGATCAATCAGAAGCCGATGCGGAA
GTGGAGGATCTGTTCGCTGAACCCCCTGATTGGACAGGGGACTTGCAGGAACTCGTCGATCAAATGGGCTATCTCAGGTCGAAGCCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGCGCGGCAGGGGGCGCGTCGTCTTCTTCTTCATCGTCTTCGAGCATCACGCCCCAGCCGATTGACGGCTTACATGACGTGGGTCCGCCTCCATTTCTGACGAA
GACCTTTGAGATGGTGGAAGATCCATTGACGGACTCCATTGTTTCATGGAGTAAGGCTCGGAATAGCTTCATCGTTTGGGACTCTCACAGATTCTCCAGTACCTTGCTTC
CTCGATACTTCAAACACCGTAATTTCTCCAGCTTCATTCGTCAGCTCAATACTTATGGTTTTCGAAAAGTCGATCCTGATCGTTGGGAGTTTGCGAATGGAGGGTTTCTG
GGAGGACAGCGAAATCTGTTGAAAACCATAAAGAGGAGGAGGCATTCATCCCAGAGCTTCCAACATCAAGGAGGAACGTGTGTGGAGTTGGGGCAATATGGACTGGAAGG
CGAACTCGAAAGGTTGAGAAGGGACAAAAGCTTGTTACTGGCGGATTTAGTAAGATTGAGGCAGCAACACCAGAGCTCAAGAGAGCAAATAATGGCCATGGAGGATAGGC
TACAGAATGCAGAGAGAAAACAGAAGCAGATCATGACATTCCTTAGCAGAGCCCTTAAGAATCCCTCCTTCATTCGGAAATTTATCCATAGTAATCAGCGAAGAGAACTG
AGAGGCGCCGAAATCGGACGAAAGCGAAGACTAACTGCCAGCCCAAGTGTAGAGAATCTCCGGGAAGAAAATGTACTAGTAGCTCTAAAACCAGAAGAACTAGAAACTTT
GGAGCCAGATATAGAGACGCTGTTAACAGTCAACTTAGAAGACGAGTCAAGCAGTGAGATCAAGGACCCAGTTTCAGATGGCATTCCCATGGCAGCCATCGATGTGGGTC
TCTCTGCTCATGAGGAATTATGGGATGAAATCTGGACTGAAGATCTAATGGCTGGAAATCCAGAGGAGGAAGTCATAATCGTTAGTGATCAATCAGAAGCCGATGCGGAA
GTGGAGGATCTGTTCGCTGAACCCCCTGATTGGACAGGGGACTTGCAGGAACTCGTCGATCAAATGGGCTATCTCAGGTCGAAGCCGTAG
Protein sequenceShow/hide protein sequence
MAGAAGGASSSSSSSSSITPQPIDGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDSHRFSSTLLPRYFKHRNFSSFIRQLNTYGFRKVDPDRWEFANGGFL
GGQRNLLKTIKRRRHSSQSFQHQGGTCVELGQYGLEGELERLRRDKSLLLADLVRLRQQHQSSREQIMAMEDRLQNAERKQKQIMTFLSRALKNPSFIRKFIHSNQRREL
RGAEIGRKRRLTASPSVENLREENVLVALKPEELETLEPDIETLLTVNLEDESSSEIKDPVSDGIPMAAIDVGLSAHEELWDEIWTEDLMAGNPEEEVIIVSDQSEADAE
VEDLFAEPPDWTGDLQELVDQMGYLRSKP