; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G015890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G015890
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHeat stress transcription factor
Genome locationchr08:23892060..23896892
RNA-Seq ExpressionLsi08G015890
SyntenyLsi08G015890
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598287.1 Heat shock factor protein HSF30, partial [Cucurbita argyrosperma subsp. sororia]2.8e-15982.62Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MDE+KVKPEES VA G A  +SS SSS SSVTPQPIEG+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSS+LLPRYFKHSNFSSF+R
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQ-HQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEK
        QLNTYGFRKVDPDRWEFANEGFLGGQR+LLRTIKRRRHS QS Q HQGG CVELG+FGLE ELERL+RDRSSLMAELVRLRQQHQSSREQI+AMEDRLEK
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQ-HQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEK

Query:  AESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS----
        +E+KQKQIMTFLSKALKNPSF+QKFIHSNQGRELRGVEIGRKRRLT+SPSVENLQ+E+VPVAVKQE     EPD+ETLL VNFE ES+ EITDPVS    
Subjt:  AESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS----

Query:  ----DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
            D+G  AH ELG FS+LWAED  AG+P EE I+VG+QSD+DVEVEDLIAEP DW E+LQ+LVDQM FLR K
Subjt:  ----DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

XP_016900029.1 PREDICTED: heat stress transcription factor A-2 [Cucumis melo]6.4e-18091.26Show/hide
Query:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
        MDELKVKPEES  A AT  A ++SS SSSSSSVTPQPI GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
Subjt:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF

Query:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE
        IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQH GGTCVELGQFGLEA+LERLRRDRS+LMAELVRLRQQHQSSREQIMAMEDRLE
Subjt:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE

Query:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG
        KAESKQKQIMTFLSKALKNPSF+QKFI+SNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEI DPVSDMG
Subjt:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG

Query:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
        H  H E GIFSQ WAED +A HPEE  IV +QSD+DVEVEDLIAEP DWTEDLQELVDQMG LRSK
Subjt:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

XP_022132202.1 heat shock factor protein HSF30 [Momordica charantia]6.0e-16282.53Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MD+LKVK EE   A   A ++SS SSSSSS+TPQPI+GLHDVGPPPFLTKTFEMVEDP TDSIVSWSKARNSFIVWD HKFSSTLLPRYFKH NFSSF+R
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA
        QLNTYGFRKVDPDRWEFANEGFLGGQRNLL+TIKRRRH+ QS  HQGGTCVELGQFGL+ ELERLRRDRSSLMAELVRLRQQHQSSREQ+ AMEDRL+ A
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA

Query:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS-----
        E KQKQIMTFLSKALKNPSFIQKFIHSNQ RELRG+EIGRKRRLTASPSVENLQ+ENV VAV+QEE+ET EPDIETLLTVN EDESS E+ DPVS     
Subjt:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS-----

Query:  ---DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRS
           D+GH+A  EL  + +LW EDL+AG+P EEAIIVGDQS+ DVEVEDLIAEP DWTEDLQELVDQMGFLRS
Subjt:  ---DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRS

XP_031736391.1 heat stress transcription factor A-2 [Cucumis sativus]8.4e-18091.21Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MDELKVKPEES VAT   AS+SS SSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA
        QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHS QSIQH GGTCVELGQFGLEA+LERLRRDRS+LMAELVRLRQQHQSSR++IM MEDRLEKA
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA

Query:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMGHF
        ESKQKQIMTFLSKALKNPSFIQKFI+SNQGRELRGVEIGRKRRLTASPSVENL DENVPVA+KQEELETSEPDIETLLTVNFEDESSIEI DPVSD+GH 
Subjt:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMGHF

Query:  AHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
         H E GIFS LW EDL+AGHPEE  I+ +QSD+DVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
Subjt:  AHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

XP_038885370.1 heat stress transcription factor A-2 [Benincasa hispida]1.4e-18794.82Show/hide
Query:  MDELKVKPEESAVATGRAA---SASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSS
        MDELKVKPEESAVATG AA   S+SS SSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFS+TLLPRYFKHSNFSS
Subjt:  MDELKVKPEESAVATGRAA---SASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSS

Query:  FIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL
        FIRQLNTYGFRKVDPDRWEFANEGFLGGQR+LLRTIKRRRHSHQ+IQHQGGTCVELGQFGLEA+LERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL
Subjt:  FIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL

Query:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDM
        EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQG+ELR VEIGRKRRLTASPSVENLQDENV VAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDM
Subjt:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDM

Query:  GHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
        GH AH ELG+F QLWAEDLIAGHPEEAI VG+QSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
Subjt:  GHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

TrEMBL top hitse value%identityAlignment
A0A0A0LKS1 HSF_DOMAIN domain-containing protein4.0e-18091.21Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MDELKVKPEES VAT   AS+SS SSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA
        QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHS QSIQH GGTCVELGQFGLEA+LERLRRDRS+LMAELVRLRQQHQSSR++IM MEDRLEKA
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA

Query:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMGHF
        ESKQKQIMTFLSKALKNPSFIQKFI+SNQGRELRGVEIGRKRRLTASPSVENL DENVPVA+KQEELETSEPDIETLLTVNFEDESSIEI DPVSD+GH 
Subjt:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMGHF

Query:  AHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
         H E GIFS LW EDL+AGHPEE  I+ +QSD+DVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
Subjt:  AHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

A0A1S4DVL9 heat stress transcription factor A-23.1e-18091.26Show/hide
Query:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
        MDELKVKPEES  A AT  A ++SS SSSSSSVTPQPI GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
Subjt:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF

Query:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE
        IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQH GGTCVELGQFGLEA+LERLRRDRS+LMAELVRLRQQHQSSREQIMAMEDRLE
Subjt:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE

Query:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG
        KAESKQKQIMTFLSKALKNPSF+QKFI+SNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEI DPVSDMG
Subjt:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG

Query:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
        H  H E GIFSQ WAED +A HPEE  IV +QSD+DVEVEDLIAEP DWTEDLQELVDQMG LRSK
Subjt:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

A0A455PAZ2 HSF3.9e-15982.35Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MDE++VKPEES VA G A  +SS SSS SSVTPQPIEG+HDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSS+LLPRYFKHSNFSSF+R
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQ-HQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEK
        QLNTYGFRKVDPDRWEFANEGFLGGQR+LLRTIKRRRHS QS Q HQGG CVELG+FGLE ELERL+RDRSSLMAELVRLRQQHQSSREQI+AMEDRLEK
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQ-HQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEK

Query:  AESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS----
        +E+KQKQIMTFLSKALKNPSF+QKFIHSNQGRELRGVEIGRKRRLT+SPSVENLQ+E+VPVAVKQE     EPD+ETLL VNFE ES+ EITDPVS    
Subjt:  AESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS----

Query:  ----DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
            D+G  AH ELG FS+LWAED  AG+P EE I+VG+QSD+DVEVEDLIAEP DW E+LQ+LVDQM FLR K
Subjt:  ----DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

A0A5A7V104 Heat stress transcription factor A-23.1e-18091.26Show/hide
Query:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
        MDELKVKPEES  A AT  A ++SS SSSSSSVTPQPI GLHDVGPPPFLTKTFEMVEDPLTDSIVSWS+ARNSFIVWDYHKFSSTLLPRYFKHSNFSSF
Subjt:  MDELKVKPEES--AVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSF

Query:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE
        IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQH GGTCVELGQFGLEA+LERLRRDRS+LMAELVRLRQQHQSSREQIMAMEDRLE
Subjt:  IRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLE

Query:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG
        KAESKQKQIMTFLSKALKNPSF+QKFI+SNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEI DPVSDMG
Subjt:  KAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMG

Query:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK
        H  H E GIFSQ WAED +A HPEE  IV +QSD+DVEVEDLIAEP DWTEDLQELVDQMG LRSK
Subjt:  HFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSK

A0A6J1BVL5 heat shock factor protein HSF302.9e-16282.53Show/hide
Query:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR
        MD+LKVK EE   A   A ++SS SSSSSS+TPQPI+GLHDVGPPPFLTKTFEMVEDP TDSIVSWSKARNSFIVWD HKFSSTLLPRYFKH NFSSF+R
Subjt:  MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIR

Query:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA
        QLNTYGFRKVDPDRWEFANEGFLGGQRNLL+TIKRRRH+ QS  HQGGTCVELGQFGL+ ELERLRRDRSSLMAELVRLRQQHQSSREQ+ AMEDRL+ A
Subjt:  QLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA

Query:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS-----
        E KQKQIMTFLSKALKNPSFIQKFIHSNQ RELRG+EIGRKRRLTASPSVENLQ+ENV VAV+QEE+ET EPDIETLLTVN EDESS E+ DPVS     
Subjt:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS-----

Query:  ---DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRS
           D+GH+A  EL  + +LW EDL+AG+P EEAIIVGDQS+ DVEVEDLIAEP DWTEDLQELVDQMGFLRS
Subjt:  ---DMGHFAHGELGIFSQLWAEDLIAGHP-EEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRS

SwissProt top hitse value%identityAlignment
O80982 Heat stress transcription factor A-21.8e-9252.16Show/hide
Query:  MDELKVKPEESAVA-TGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFI
        M+ELKV+ EE  V  TG  A++SSV SSSS   P+P+EGL++ GPPPFLTKT+EMVEDP TD++VSWS  RNSF+VWD HKFS+TLLPRYFKHSNFSSFI
Subjt:  MDELKVKPEESAVA-TGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFI

Query:  RQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRH-SHQSIQHQGG--TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDR
        RQLNTYGFRK+DPDRWEFANEGFL GQ++LL+ IKRRR+   Q++  QG   +CVE+GQ+G + E+ERL+RD   L+AE+VRLRQQ  SS+ Q+ AME R
Subjt:  RQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRH-SHQSIQHQGG--TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDR

Query:  LEKAESKQKQIMTFLSKALKNPSFIQKF-IHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS
        L   E +Q+Q+MTFL+KAL NP+F+Q+F + S + + L G+++GRKRRLT++PS+  +++      +  +E +  + D+E L     +DE++       +
Subjt:  LEKAESKQKQIMTFLSKALKNPSFIQKF-IHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS

Query:  DMGHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDW-TEDLQELVDQMGFLRSK
         M       L   + +  +    G+ E A+        DV+VEDL+  PLDW ++DL ++VDQMGFL S+
Subjt:  DMGHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDW-TEDLQELVDQMGFLRSK

P41152 Heat shock factor protein HSF306.2e-9351.76Show/hide
Query:  DELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQ
        D +KVK EE  + T                   P+EGLHDVGPPPFL+KT+EMVED  TD ++SWS  RNSFIVWD HKFS+TLLPR+FKHSNFSSFIRQ
Subjt:  DELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQ

Query:  LNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQG-GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA
        LNTYGFRKVDPDRWEFANEGFLGGQ++LL+TIKRRR+  QS+  QG G C+E+G +G+E ELERL+RD++ LM E+V+LRQQ QS+R QI+AM +++E  
Subjt:  LNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQG-GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKA

Query:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGR-ELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIE------ITDP
        E KQ Q+M+FL+K   NP+F+Q+++     R + + +E+G+KRRLT +PSV    D+ +  +   +E E     IE L +   ++ESS        +T  
Subjt:  ESKQKQIMTFLSKALKNPSFIQKFIHSNQGR-ELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIE------ITDP

Query:  VSDMGHFAHGELGIFSQLWAEDLIAG-HPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFL
         +DM   A     I+ +L +EDLI+G    E ++V +Q + DVEVEDL+ +  +W E+LQ+LVDQ+GFL
Subjt:  VSDMGHFAHGELGIFSQLWAEDLIAG-HPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFL

Q338B0 Heat stress transcription factor A-2c3.3e-7044Show/hide
Query:  PQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRT
        P+P+EGLH+VGPPPFLTKT+++VEDP TD +VSWS+A NSF+VWD H F+  LLPR FKH+NFSSF+RQLNTYGFRKVDPDRWEFANEGFL GQR+LL+T
Subjt:  PQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRT

Query:  IKRRR---HSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQKFIHSNQ
        IKRR+   ++  S Q    +C+E+G+FG E E++RL+RD++ L+ E+V+LRQ+ Q++++ + AMEDRL  AE KQ Q+M FL++A++NP F Q+     +
Subjt:  IKRRR---HSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQKFIHSNQ

Query:  GRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEEL------------ETSEPDIETL--LTVNFEDESSIEITDPVSDMGHFAHGELGIFSQLWAED
         R+     I +KRR      ++N+   +     + E+L            E SEP I  L  L VN +D    ++ +   +  +  +G+  +    WAE 
Subjt:  GRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEEL------------ETSEPDIETL--LTVNFEDESSIEITDPVSDMGHFAHGELGIFSQLWAED

Query:  LIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSKS
        L+    E+     +QS++D ++           + + EL  Q+G+L S S
Subjt:  LIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSKS

Q8H7Y6 Heat stress transcription factor A-2d9.9e-6755.56Show/hide
Query:  SSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLG
        SS      P+P+EGLH+VGPPPFLTKTF++V DP TD +VSW +A +SF+VWD H F++  LPR+FKH+NFSSF+RQLNTYGFRK+DPDRWEFAN+GFL 
Subjt:  SSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLG

Query:  GQRNLLRTIKRRRHSH--QSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQ
        GQR+LL+ IKRRR        Q   GTC+E+GQFGL+ E++RL+RD++ L+AE+V+LR + QS++  + AME+RL+ AE KQ Q+M FL++A++NP F  
Subjt:  GQRNLLRTIKRRRHSH--QSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQ

Query:  KFIHSNQGRELRGVEIGRKRRLTAS
        + IH  Q  +++G+E    ++ T S
Subjt:  KFIHSNQGRELRGVEIGRKRRLTAS

Q9LUH8 Heat stress transcription factor A-6b4.5e-6750Show/hide
Query:  PEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGF
        P  S+     + + ++++  ++   PQP+EGLH+ GPPPFLTKT+++VED  T+ +VSWSK+ NSFIVWD   FS TLLPR+FKH+NFSSF+RQLNTYGF
Subjt:  PEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGF

Query:  RKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQ----------GGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL
        RKV+PDRWEFANEGFL GQ++LL+ I+RR+ S+ S Q Q             C+E+G++GL+ E++ LRRD+  LM ELVRLRQQ QS++  +  +E++L
Subjt:  RKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQ----------GGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL

Query:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD
        +K ESKQKQ+M+FL++A++NP FIQ+ +   + R+     I +KR+        N++D
Subjt:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.9e-6053.36Show/hide
Query:  SVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNL
        S  PQP   L    PPPFL+KT++MV+D  TDSIVSWS   NSFIVW   +F+  LLP+ FKH+NFSSF+RQLNTYGFRKVDPDRWEFANEGFL GQ++L
Subjt:  SVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNL

Query:  LRTIKRRR------HSHQSIQHQGG------TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKN
        L++I RR+        HQ  QH  G       CVE+G+FGLE E+ERL+RD++ LM ELVRLRQQ QS+  Q+  M  RL+  E++Q+Q+M+FL+KA+++
Subjt:  LRTIKRRR------HSHQSIQHQGG------TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKN

Query:  PSFIQKFI-HSNQGRE--LRGVEIGRKRRLTASPSVEN
        P F+ +F+   NQ  E   R  +  +KRR      V N
Subjt:  PSFIQKFI-HSNQGRE--LRGVEIGRKRRLTASPSVEN

AT2G26150.1 heat shock transcription factor A21.3e-9352.16Show/hide
Query:  MDELKVKPEESAVA-TGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFI
        M+ELKV+ EE  V  TG  A++SSV SSSS   P+P+EGL++ GPPPFLTKT+EMVEDP TD++VSWS  RNSF+VWD HKFS+TLLPRYFKHSNFSSFI
Subjt:  MDELKVKPEESAVA-TGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFI

Query:  RQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRH-SHQSIQHQGG--TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDR
        RQLNTYGFRK+DPDRWEFANEGFL GQ++LL+ IKRRR+   Q++  QG   +CVE+GQ+G + E+ERL+RD   L+AE+VRLRQQ  SS+ Q+ AME R
Subjt:  RQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRRH-SHQSIQHQGG--TCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDR

Query:  LEKAESKQKQIMTFLSKALKNPSFIQKF-IHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS
        L   E +Q+Q+MTFL+KAL NP+F+Q+F + S + + L G+++GRKRRLT++PS+  +++      +  +E +  + D+E L     +DE++       +
Subjt:  LEKAESKQKQIMTFLSKALKNPSFIQKF-IHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVS

Query:  DMGHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDW-TEDLQELVDQMGFLRSK
         M       L   + +  +    G+ E A+        DV+VEDL+  PLDW ++DL ++VDQMGFL S+
Subjt:  DMGHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQSDVDVEVEDLIAEPLDW-TEDLQELVDQMGFLRSK

AT3G22830.1 heat shock transcription factor A6B3.2e-6850Show/hide
Query:  PEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGF
        P  S+     + + ++++  ++   PQP+EGLH+ GPPPFLTKT+++VED  T+ +VSWSK+ NSFIVWD   FS TLLPR+FKH+NFSSF+RQLNTYGF
Subjt:  PEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGF

Query:  RKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQ----------GGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL
        RKV+PDRWEFANEGFL GQ++LL+ I+RR+ S+ S Q Q             C+E+G++GL+ E++ LRRD+  LM ELVRLRQQ QS++  +  +E++L
Subjt:  RKVDPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQ----------GGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRL

Query:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD
        +K ESKQKQ+M+FL++A++NP FIQ+ +   + R+     I +KR+        N++D
Subjt:  EKAESKQKQIMTFLSKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD

AT4G17750.1 heat shock factor 14.9e-6154.59Show/hide
Query:  PPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRR------
        PPPFL+KT++MVEDP TD+IVSWS   NSFIVWD  +FS  LLP+YFKH+NFSSF+RQLNTYGFRKVDPDRWEFANEGFL GQ++LL+ I RR+      
Subjt:  PPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFANEGFLGGQRNLLRTIKRRR------

Query:  ------HSHQSIQHQG-----GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQKFIH
               S Q  Q QG      +CVE+G+FGLE E+E+L+RD++ LM ELV+LRQQ Q++  ++  +   L+  E +Q+QIM+FL+KA++NP+F+ +FI 
Subjt:  ------HSHQSIQHQG-----GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSFIQKFIH

Query:  SNQGRELRGVEIGRKRRL
              +   E  +KRRL
Subjt:  SNQGRELRGVEIGRKRRL

AT5G16820.1 heat shock factor 36.0e-5948.77Show/hide
Query:  SASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFAN
        S  S +S++ S+ P P+  +     PPFL+KT++MV+DPLT+ +VSWS   NSF+VW   +FS  LLP+YFKH+NFSSF+RQLNTYGFRKVDPDRWEFAN
Subjt:  SASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKVDPDRWEFAN

Query:  EGFLGGQRNLLRTIKRRRHSH-----QSIQHQG---GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFL
        EGFL G++ LL++I RR+ SH     Q  Q Q    G CVE+G+FG+E E+ERL+RD++ LM ELVRLRQQ Q++  Q+  +  +++  E +Q+Q+M+FL
Subjt:  EGFLGGQRNLLRTIKRRRHSH-----QSIQHQG---GTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFL

Query:  SKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD
        +KA+++P F+ + +  N     R +    K+R       EN  D
Subjt:  SKALKNPSFIQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAACTGAAAGTGAAACCGGAGGAGTCAGCAGTAGCCACCGGCAGGGCGGCGTCTGCTTCTTCTGTTTCTTCATCGTCTTCATCCGTCACTCCCCAGCCGATCGA
AGGCCTACACGATGTCGGTCCGCCTCCATTTCTGACGAAGACCTTTGAAATGGTGGAAGATCCATTGACGGACTCCATTGTTTCCTGGAGTAAAGCTCGCAATAGTTTCA
TCGTTTGGGATTATCATAAATTCTCTAGTACTTTGCTGCCTCGTTACTTCAAACATTCCAATTTCTCTAGTTTCATTCGTCAACTCAATACTTATGGTTTTCGAAAAGTC
GATCCTGATCGTTGGGAGTTTGCGAATGAGGGGTTTCTGGGAGGACAGAGGAATCTGTTGAGAACCATAAAGAGGAGGAGACATTCACACCAAAGCATCCAGCACCAAGG
AGGAACATGTGTGGAATTAGGGCAATTTGGACTGGAAGCCGAACTTGAGAGGTTGAGAAGGGACAGAAGCTCGTTAATGGCGGAATTAGTGAGATTGAGGCAGCAACACC
AGAGCTCAAGAGAGCAAATAATGGCCATGGAGGATAGGTTAGAGAAAGCAGAGAGTAAACAGAAGCAGATCATGACATTTCTCAGCAAAGCCCTCAAGAATCCTTCCTTC
ATTCAGAAATTTATTCATAGTAATCAGGGAAGGGAATTGAGAGGCGTTGAAATTGGAAGAAAGCGGAGACTAACTGCCAGCCCAAGTGTAGAGAATCTCCAGGATGAGAA
TGTGCCGGTGGCTGTTAAACAAGAAGAGCTAGAAACTTCTGAACCAGATATAGAGACGCTGTTAACGGTTAACTTTGAAGACGAGTCAAGCATTGAGATCACGGACCCTG
TTTCTGATATGGGTCACTTTGCTCATGGGGAATTGGGGATCTTCAGTCAACTTTGGGCTGAAGATCTTATAGCTGGACATCCAGAAGAAGCCATAATCGTCGGTGATCAA
TCAGACGTTGATGTGGAAGTGGAGGATCTGATTGCTGAACCCCTGGATTGGACTGAGGACCTGCAGGAACTCGTCGATCAAATGGGGTTTCTCCGATCGAAGTCGTAG
mRNA sequenceShow/hide mRNA sequence
AAAAAGACATGCGCAGTCCTAGAACCTTCTACTTCGACGCCATTAAAATTCAGTTTCTTCCCTCCATTTATCATTCAATCAAACTCTAAAAACTCCTTTCTTCTTCAACT
AATCAGTCATTAACCTCCCAATCTCGGATTAACTTCTTCCGATCAAAGTAGTAATGGATGAACTGAAAGTGAAACCGGAGGAGTCAGCAGTAGCCACCGGCAGGGCGGCG
TCTGCTTCTTCTGTTTCTTCATCGTCTTCATCCGTCACTCCCCAGCCGATCGAAGGCCTACACGATGTCGGTCCGCCTCCATTTCTGACGAAGACCTTTGAAATGGTGGA
AGATCCATTGACGGACTCCATTGTTTCCTGGAGTAAAGCTCGCAATAGTTTCATCGTTTGGGATTATCATAAATTCTCTAGTACTTTGCTGCCTCGTTACTTCAAACATT
CCAATTTCTCTAGTTTCATTCGTCAACTCAATACTTATGGTTTTCGAAAAGTCGATCCTGATCGTTGGGAGTTTGCGAATGAGGGGTTTCTGGGAGGACAGAGGAATCTG
TTGAGAACCATAAAGAGGAGGAGACATTCACACCAAAGCATCCAGCACCAAGGAGGAACATGTGTGGAATTAGGGCAATTTGGACTGGAAGCCGAACTTGAGAGGTTGAG
AAGGGACAGAAGCTCGTTAATGGCGGAATTAGTGAGATTGAGGCAGCAACACCAGAGCTCAAGAGAGCAAATAATGGCCATGGAGGATAGGTTAGAGAAAGCAGAGAGTA
AACAGAAGCAGATCATGACATTTCTCAGCAAAGCCCTCAAGAATCCTTCCTTCATTCAGAAATTTATTCATAGTAATCAGGGAAGGGAATTGAGAGGCGTTGAAATTGGA
AGAAAGCGGAGACTAACTGCCAGCCCAAGTGTAGAGAATCTCCAGGATGAGAATGTGCCGGTGGCTGTTAAACAAGAAGAGCTAGAAACTTCTGAACCAGATATAGAGAC
GCTGTTAACGGTTAACTTTGAAGACGAGTCAAGCATTGAGATCACGGACCCTGTTTCTGATATGGGTCACTTTGCTCATGGGGAATTGGGGATCTTCAGTCAACTTTGGG
CTGAAGATCTTATAGCTGGACATCCAGAAGAAGCCATAATCGTCGGTGATCAATCAGACGTTGATGTGGAAGTGGAGGATCTGATTGCTGAACCCCTGGATTGGACTGAG
GACCTGCAGGAACTCGTCGATCAAATGGGGTTTCTCCGATCGAAGTCGTAGACATAACAGTGTAGTCAATTTGTTTGGCATTTTGCTTCTCTTCGTGTGGAATAGGTTTC
AGCTGGCTCGTGTAGGAAGGAATCGTGGATGTTGCAGGGGAGCTACAGATTGTTCCCGTTTCCTTATATCCTTTGAAACTCATTGGTTTTTCAATATTATGTACTAAATA
TTTTTGGTTGTGTTAAATTGGTTTATAGTTATTTTTTTCTTCCGTGGTTTTATTTGCAGGCATGTGCAGAAAAAAGGAGAACTGCGAGAAGATTCAGTCATTTGCTGATT
TCTTTACCATTAAGTGAAAGTCGGAGTAATTATCTCATTCATTAATTGAGAACCAATAACTGCTATTCATTTGAAGTGTACGAGTCTGCTGAAAACTAGAACCAAATTTT
TTCCCTCTAGTGAGGTGTTCGCTATTTTTTGTGGAAGATATCAAAAAGTAGCTATTTTCCGTTGTCATAACTTAGGAAATATAGTTTTGATTAAATGTATTTTGAATCTT
GATAGGCTTTTGGTCCACGATTGCTGGTTCGTGTTTGAGCTTAATAATGTTAAGCTCAGCTATGTAAGATGGAATAAACAAATTCAGAGTTCTAGGCTTCATTAACTTAA
TAGAAATTCTTTGATAATCTTCTATGCTGTCATGTATAATTCAATTGGTTGATAGATATCCTTTTCTGACAGAATTATAAAGAATCGCATGCTTGTCTTCTCTAGAGGAA
GAAATGAAATGTACCTAGAGGTAGTAGATGCTCTCTCTTTCTTTCTCTCTGGAATGCACCTTCTTAAAGGATGAGAACTTTTGGTCAATGTCAATGGATTGGTTTTCCAT
ATTCATGGCTTGACTGTAACTTTCCTCCATGGCATTATAAAGCATCTCACGTTTTTCTTTTTACATTTGAGATCTCTTGCATTTATAGCTTTGCTGGGATAGAAACACAC
ATCAAGATATCATGGGCCCTAAACTGTTAAGTTGATTTCCTTTTCCCTCACGTTCGGGGGATTGGAAAATAAAATATCTGTTCTTTATTCCAGGAAGTCAATGATCCAGT
GAAACAAGACATGTACTTTTCAAATAGTTACAGGTAGAAAGAACAAAGATGAGTTCCATTTACACGAATATGCGATCCAAATCAGAGGTTTTGTTAGGGGCTCCGTTAGA
TGGGCGCTATCACAGGCACCCTGGGGCTGGGGATCCCCATATATGAGAGCCTAGGAGACATCCTGATCTTTGGGCTTGGCCGAGGTGAAGGTATCGGAAGCGAGCTTCCG
AAGAAACCCGGTGAGGGTCTTGGTGACAAGTTCACTTGCTCCAGTGCCCGTGCTTGCAACTCTGCTGGATACTCCTTCACACACCCAATTCTGGGGCCAACTCCACTGCT
CCATCTGCACAGCAAACGCTTTGGTACTGCCAAATCGACTTCGTCTGCAACCATGTCTTCTGAAGATGTCGTCGATGTCTCTTCGCTCGAGTCTTTATATGAGTAGTTGT
CGTCATCTACTGAACATCTCTGAGTAATTATGAAGAAAACCCCATTGGCGAAATTCAGGAAAACAAGTTGTGCAATCTGAAAAGAAGTAGATACAAAAAAACTGCATCCA
TGTAGATATGTCCAGATAATTACCTTGACGTTGGTCAAGTCCACTGTATGTTCTTCAAGGAAACTAATGAACTCCTTAAAATTGTTCTCTGTTGGAAGATAATGACCACT
GTATGGCCATATAGCCTAAAGAAAATCCAAATGAAAACATCACCTCTATGGATTACTCTACAGAATGATTATCCAAAACAACACCCATATTCTCAAGCATTCAATTATGC
ACCTTAAGAATTCCATCAATAGCAACCAATCTTCCTGCAGCTGTTATAGCCCCTCCAGACAGGAAGCTGGAGTGTTGAAAGAGACCTTTCTTCTTCTGCCCCACATACAG
ATCTCTTGATGTGCTGAGTACAAAAATCCACTTGGAATCTTCAACTGTGTTAATGGGAATTCTGCTTTGCTTGTAAACAAGCTTCCTATTCTCCACAATCACCTGGTACT
CCTCCCTTTCTTTCTGCAACAGAAATGAAGAAAAAACACTTGCTTACTGCTTGCAATGGAGTAGAAACTACAATACAATGAGGTCACCGAGAGTGGGGCTTACTGGTCCA
AGGTACTTGATGCATTGCTTATACAGGACTGATCTACGGCACTTCTCAAGGTTTACCCTCTTCCCATCACCAATATCCAACCTTTTAAAAGAAAGCTTGAAGGATAAGAA
TGAAAGCTTTCAATAGTGCACATGGGGGAGGATTTACTTATGTAATGATACCAGCAGTTGGAGATGGAAATTTACCAGTAGAAGAAAGGTTGAGTGCTCTTGCTGTCAAA
CCAAACATCATAGTAAAAATGCAAGTTATGTCCATAGCGGTGGCGAGGATCAATCTGCTCAAAAGTCCAACAAAGGTAACTCAAATACAACTTTTTCAGTTGCAGATGAC
AATGTAGAGGATACAACAAAAGACTGAATCATATGAACTTACTGCTTCAAGCCAGTGTTGTAGAGCTAGTTTTTGAGCATTCTCATCCTTTGATAAACCTTTCCCGACCT
AAACAACTCTTGTCCTTGCTCGGGACCAGCGGGACGTAGCAGTTTCTGTCTTTCCGACGTCGAAGAATGAAACAGAGCTTACCTTGAGAGCTGCAAAGTCTAAAGCTTTC
CACCTTCAAAGTTCATTGAGCAGAATGAGAAACAAAGCACTCAACTTCCTTAAACCAAAAAATGAAAAAGAGAAATAGAATTCTTTCACAGACCATAGCTCCTCAACCAC
AACTGCACAATCTGCCAGGTTCCTTCTGGTTCTATAGCTTTTGTAGACCTTTTGTAGCTTAGTTGCTGCAGAATCCAATTCACTGACTGGCTTAGTAGAAGACATAACGA
CCGGCTTAGGAAGTGAAATGCTGGGTCTTCTTAGATTTTCAGGTTGTCGAATTTTTCTATCTGAGGAAGATAAACTTTTTTGGTCAACCCCCATATTGTTGAAAGAATCA
GAGACTTGTTTTAGTTGCAAACTCTGTAATCTTGGTGCGTCTTTTGCTCTGATTCTTTTACCACCTCCTTCCTCTTCCAAGCTAGATTTCTGGACATTCATCTCTCTGCT
CTGCAAATCTGAAGGCAAAAGGGGGTTAAATTCATTCATGG
Protein sequenceShow/hide protein sequence
MDELKVKPEESAVATGRAASASSVSSSSSSVTPQPIEGLHDVGPPPFLTKTFEMVEDPLTDSIVSWSKARNSFIVWDYHKFSSTLLPRYFKHSNFSSFIRQLNTYGFRKV
DPDRWEFANEGFLGGQRNLLRTIKRRRHSHQSIQHQGGTCVELGQFGLEAELERLRRDRSSLMAELVRLRQQHQSSREQIMAMEDRLEKAESKQKQIMTFLSKALKNPSF
IQKFIHSNQGRELRGVEIGRKRRLTASPSVENLQDENVPVAVKQEELETSEPDIETLLTVNFEDESSIEITDPVSDMGHFAHGELGIFSQLWAEDLIAGHPEEAIIVGDQ
SDVDVEVEDLIAEPLDWTEDLQELVDQMGFLRSKS