; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000651 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000651
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionheat stress transcription factor A-4c-like
Genome locationscaffold8:42657958..42659609
RNA-Seq ExpressionSpg000651
SyntenySpg000651
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604741.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. sororia]6.9e-14282.55Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQSQ    SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT
         K+RK+GKL EFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFLT
Subjt:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT

Query:  EIPGSSNAGEVYLDRRNNVVR
        E+PG SN GEVYLDRR+NV R
Subjt:  EIPGSSNAGEVYLDRRNNVVR

KAG7034870.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-14282.87Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQSQ  S SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT
         K+RK+GKL EFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFLT
Subjt:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT

Query:  EIPGSSNAGEVYLDRRNNVVR
        E+PG SN GEVYLDRR+NV R
Subjt:  EIPGSSNAGEVYLDRRNNVVR

XP_022948138.1 heat stress transcription factor A-4d-like [Cucurbita moschata]1.5e-14182.24Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQ      SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HENEKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT
         K+RK+GKL EFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFLT
Subjt:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT

Query:  EIPGSSNAGEVYLDRRNNVVR
        E+PG SN GEVYLDRR+NV+R
Subjt:  EIPGSSNAGEVYLDRRNNVVR

XP_022970770.1 heat stress transcription factor A-4c-like [Cucurbita maxima]2.6e-14182.02Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQSQ    SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEIPG
         K+RK+GKL EFL E+ LE E++    + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+ K KEME K  G+KEGE R ENG NDVFWEQFLTE+PG
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEIPG

Query:  SSNAGEVYLDRRNNVVR
         SNAGEVYLDRR+NV+R
Subjt:  SSNAGEVYLDRRNNVVR

XP_023534238.1 heat stress transcription factor A-4c-like [Cucurbita pepo subsp. pepo]3.1e-14282.55Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQSQ    SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT
         K+RK+GKL EFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFLT
Subjt:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT

Query:  EIPGSSNAGEVYLDRRNNVVR
        E+PG SN GEVYLDRR+NV+R
Subjt:  EIPGSSNAGEVYLDRRNNVVR

TrEMBL top hitse value%identityAlignment
A0A0A0KBI1 HSF_DOMAIN domain-containing protein2.0e-12676.49Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        MDGS+GS  GAPPPFLTKTYEMVDDPMTNS+VSW+QSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPIYSHSQS       SQ +G GAPLSE ER ELE KIKTL++EKT LQSQLQKHENEKEQIG QI  IC++LWRMGNQQKQLI +LGAEL+K++ 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEI
         KKRK+GK+ EFL E+  EFEK+ L  K V VPPLEL+GKLELSLG CEDLL NV +VL E      KEMEVK    KEGEMR  +G NDVFWE FLTEI
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEI

Query:  PGSSNAGEVYLDRRNNVVR
        PGSSN  +V+LDRRNNVVR
Subjt:  PGSSNAGEVYLDRRNNVVR

A0A5D3BGW6 Heat stress transcription factor A-4c-like5.4e-12475.55Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M  S+GS  GAPPPFLTKTYEMVDDPM+NS+VSWSQSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKP+YSHSQS       SQ +G GAPLSE ERQELE KIKTL++EKT L+SQLQKHENEKEQIG QI  IC++LWRMG+QQKQLI +LGAEL+KH+ 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEI
         KKRK+GK+ E L E+  EFE++ L  K V V PLELMGKLELSL  CEDLLCNVA+VL E      KEMEVK    KEGEMR  +G NDVFWE FLTEI
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEI

Query:  PGSSNAGEVYLDRRNNVVR
        PGSS   EVYLDRRNNVVR
Subjt:  PGSSNAGEVYLDRRNNVVR

A0A6J1D7W3 heat stress transcription factor A-4c-like2.6e-12675.93Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        MD S+ S  GAPPPFLTKTYEMVDDP TN+VVSWSQSGFSFVVWNPPEFAKELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGF+RGRTHLL++
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSH---GSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK
        IHRRKPIYSHSQ+QS S +Q QSH    +G P SEPERQELE KIK L++E T LQSQLQKHE EKE+I RQI ++C QLWRMGN+QKQLI ML A+LQK
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSH---GSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK

Query:  HQPSKKRKLGKLREFLAEDSLEF---EKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEM-EVKVVGLKEGEMRRENGANDVFWEQ
          PSKKR+L KL + L ED  E+   EKNG K+VM+PPLELMGKLE SLG CE+LLC+VA+V+GEE    SK + EVKVVG KEG++R  NGANDVFWEQ
Subjt:  HQPSKKRKLGKLREFLAEDSLEF---EKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEM-EVKVVGLKEGEMRRENGANDVFWEQ

Query:  FLTEIPGSSNAGEV-YLDRRNNVV
        FLTEIPGSSNAGEV YL+RRNNVV
Subjt:  FLTEIPGSSNAGEV-YLDRRNNVV

A0A6J1G8J3 heat stress transcription factor A-4d-like7.4e-14282.24Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQ      SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HENEKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT
         K+RK+GKL EFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFLT
Subjt:  SKKRKLGKLREFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLT

Query:  EIPGSSNAGEVYLDRRNNVVR
        E+PG SN GEVYLDRR+NV+R
Subjt:  EIPGSSNAGEVYLDRRNNVVR

A0A6J1I3T7 heat stress transcription factor A-4c-like1.3e-14182.02Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        M+GSDGS GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLKS
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKPI+SHSQSQ    SQSQSHGSGAPLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q 
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEIPG
         K+RK+GKL EFL E+ LE E++    + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+ K KEME K  G+KEGE R ENG NDVFWEQFLTE+PG
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRENGANDVFWEQFLTEIPG

Query:  SSNAGEVYLDRRNNVVR
         SNAGEVYLDRR+NV+R
Subjt:  SSNAGEVYLDRRNNVVR

SwissProt top hitse value%identityAlignment
O49403 Heat stress transcription factor A-4a7.1e-4934.05Show/hide
Query:  DGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI
        + + G    + PPFLTKTYEMVDD  ++S+VSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K+I
Subjt:  DGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI

Query:  HRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK----
        HRRKP++SHS        Q+Q +    PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q+  + ++L  M  +QK +++ +   L+K    
Subjt:  HRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK----

Query:  --------HQPSKKRKLGKLREFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME------------
                    +KR+  ++  F  E  LE  K  +   +     P        + +LE S+   E+L+ +  E + +  +  + +++            
Subjt:  --------HQPSKKRKLGKLREFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME------------

Query:  ---------------VKVVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN
                        +++ +    +G   +            GAND FW+QF +E PGS+   EV L+R+++
Subjt:  ---------------VKVVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN

Q40152 Heat shock factor protein HSF86.7e-4750.52Show/hide
Query:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH
        APPPFL KTY+MVDDP T+ +VSWS +  SFVVW+PPEFAK+LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLKSI RRKP + H
Subjt:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH

Query:  SQSQSP----SPSQSQSHGSGAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK
        +Q Q      +  Q Q  G  A +    E  +  LE +++ L R+K +L  +L +   +++    Q+  + Q+L  M  +Q+Q+++ L   + +
Subjt:  SQSQSP----SPSQSQSHGSGAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK

Q93VB5 Heat stress transcription factor A-4d1.6e-5345.89Show/hide
Query:  SVNSMDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTH
        S N   G  G GGG PPPFL KTYEMV+D  TN VVSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID ++WEFANE FIRG TH
Subjt:  SVNSMDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTH

Query:  LLKSIHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
        LLK+IHRRKP++SHS        Q+Q +G   PL+E ER+ELE +I  L  EK+IL + LQ+   ++  I  Q+  +  +L  M  +QK ++A L   LQ
Subjt:  LLKSIHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

Query:  KH------------QPSKKRKLGKLREFL-------AEDSLEFEKNGLKNVMVPPL------ELMGKLELSLGFCEDLL--CNVAEVLGEEM
        +               SKKR++ K+  F+        +   +F+  G     +PP+      E   ++ELSL   E L    N A    EEM
Subjt:  KH------------QPSKKRKLGKLREFL-------AEDSLEFEKNGLKNVMVPPL------ELMGKLELSLGFCEDLL--CNVAEVLGEEM

Q94J16 Heat stress transcription factor A-4b6.0e-4841.73Show/hide
Query:  GSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRK
        G GGG+ PPFL+KTYEMVDDP T++VV W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D +QWEFANE FI+G+ H LK+IHRRK
Subjt:  GSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRK

Query:  PIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIA----------MLGAEL
        PI+SHS         S S G+G PL++ ER++ E +I+ L  +   L S+LQ +  +K  + +++  + ++L+ + +QQ+ LI+           L + +
Subjt:  PIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIA----------MLGAEL

Query:  QKHQPSKKRKLGKLREFLAEDSLEFEKNGLKNVMVP-----------PLELMGKLELSLGFCEDLLCNVAEVLGEEMN
        Q+    +K++   +     ED+     N  +N ++P             E   K+E SL   E+ L   +E  G +++
Subjt:  QKHQPSKKRKLGKLREFLAEDSLEFEKNGLKNVMVP-----------PLELMGKLELSLGFCEDLLCNVAEVLGEEMN

Q9FK72 Heat stress transcription factor A-4c1.8e-5239.76Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        MD ++G G  + PPFLTKTYEMVDD  ++SVV+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKP++SHS        Q+Q+     PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +         
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRREN------------
           + LGK    L  ++ E  K   +   +PP    +E + KLE SL F E+L+    E  G  +   S + +     L  G+ R ++            
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRREN------------

Query:  --------GANDVFWEQFLTEIPGSSNAGEVYLDRRN
                G ND FWEQ LTE PGS+   EV  +RR+
Subjt:  --------GANDVFWEQFLTEIPGSSNAGEVYLDRRN

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.4e-4746.48Show/hide
Query:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH
        APPPFL+KTY+MVDD  T+S+VSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLL+SI RRKP +  
Subjt:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH

Query:  SQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLR
         Q    S   +  + S +   E  +  LE +++ L R+K +L  +L +   +++    Q+ T+ Q+L  M N+Q+QL++ L   +Q            L 
Subjt:  SQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLR

Query:  EFLAEDSLEFEKN
        +FL + + + E N
Subjt:  EFLAEDSLEFEKN

AT4G17750.1 heat shock factor 11.8e-4750.53Show/hide
Query:  PPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHS
        PPPFL+KTY+MV+DP T+++VSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHS

Query:  QSQSPSPSQ--SQSHGSGAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
         S S   SQ  SQ  GS A LS   E  +  LE +++ L R+K +L  +L K   +++    ++  + + L  M  +Q+Q+++ L   +Q
Subjt:  QSQSPSPSQ--SQSHGSGAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

AT4G18880.1 heat shock transcription factor A4A5.1e-5034.05Show/hide
Query:  DGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI
        + + G    + PPFLTKTYEMVDD  ++S+VSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K+I
Subjt:  DGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI

Query:  HRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK----
        HRRKP++SHS        Q+Q +    PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q+  + ++L  M  +QK +++ +   L+K    
Subjt:  HRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK----

Query:  --------HQPSKKRKLGKLREFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME------------
                    +KR+  ++  F  E  LE  K  +   +     P        + +LE S+   E+L+ +  E + +  +  + +++            
Subjt:  --------HQPSKKRKLGKLREFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME------------

Query:  ---------------VKVVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN
                        +++ +    +G   +            GAND FW+QF +E PGS+   EV L+R+++
Subjt:  ---------------VKVVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN

AT5G16820.1 heat shock factor 33.8e-4552.72Show/hide
Query:  PPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSQ
        PPFL+KTY+MVDDP+TN VVSWS    SFVVW+ PEF+K LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RGR  LLKSI RRKP  SH Q
Subjt:  PPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSQ

Query:  SQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
         Q+   +Q QS   GA + E  +  +E +++ L R+K +L  +L +   +++    Q+  + Q++  M  +Q+Q+++ L   +Q
Subjt:  SQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

AT5G45710.1 winged-helix DNA-binding transcription factor family protein1.3e-5339.76Show/hide
Query:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS
        MD ++G G  + PPFLTKTYEMVDD  ++SVV+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+
Subjt:  MDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKS

Query:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP
        IHRRKP++SHS        Q+Q+     PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +         
Subjt:  IHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQP

Query:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRREN------------
           + LGK    L  ++ E  K   +   +PP    +E + KLE SL F E+L+    E  G  +   S + +     L  G+ R ++            
Subjt:  SKKRKLGKLREFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRREN------------

Query:  --------GANDVFWEQFLTEIPGSSNAGEVYLDRRN
                G ND FWEQ LTE PGS+   EV  +RR+
Subjt:  --------GANDVFWEQFLTEIPGSSNAGEVYLDRRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTTCCTTCTCGTTCCTTCAATCCTCAAATTTCCCTTCTTCCTTTTTCCTTCAAGTTCTTTCTAATCCTTGGTCACTCTGGTTTCATCACGGCGGCGACGACGACGGCGT
CAGCGACGGACTCTATCGTTCGGTGAACTCAATGGATGGCTCAGATGGAAGCGGCGGCGGCGCGCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGA
TGACCAATTCCGTCGTCTCATGGAGTCAAAGCGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAACTTCTCCCTGTTTATTTCAAACACAACAACTTC
TCTAGCTTCGTTCGACAATTAAACACTTATGGGTTCAGGAAGATCGATCGAGACCAATGGGAATTTGCGAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAAG
CATCCATAGACGCAAACCAATCTACAGTCACAGCCAGAGCCAGAGCCCGAGCCCGAGCCAGAGCCAGAGCCATGGAAGTGGAGCTCCATTGTCCGAACCAGAGAGACAGG
AGCTCGAGCTAAAAATAAAAACCCTTCATCGAGAAAAGACCATCCTCCAATCCCAGCTGCAGAAACACGAGAACGAAAAGGAACAGATAGGGCGTCAAATCCACACGATC
TGTCAGCAGTTATGGCGAATGGGGAATCAACAGAAGCAGCTAATCGCGATGTTGGGGGCAGAGTTGCAGAAGCACCAGCCGAGCAAGAAGAGAAAGTTGGGGAAGTTGCG
TGAGTTCTTGGCTGAGGATTCGTTGGAATTTGAGAAGAATGGTTTGAAGAATGTGATGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCTTTGGGATTCTGCG
AGGATTTGCTGTGCAATGTCGCTGAGGTTTTGGGCGAGGAGATGAATGGGAAGTCGAAGGAAATGGAGGTGAAGGTTGTGGGATTGAAAGAAGGGGAAATGAGAAGGGAA
AATGGAGCGAATGATGTGTTTTGGGAACAGTTTTTGACTGAGATTCCAGGGTCTTCGAATGCTGGGGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATTTCCTTCTCGTTCCTTCAATCCTCAAATTTCCCTTCTTCCTTTTTCCTTCAAGTTCTTTCTAATCCTTGGTCACTCTGGTTTCATCACGGCGGCGACGACGACGGCGT
CAGCGACGGACTCTATCGTTCGGTGAACTCAATGGATGGCTCAGATGGAAGCGGCGGCGGCGCGCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGA
TGACCAATTCCGTCGTCTCATGGAGTCAAAGCGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAACTTCTCCCTGTTTATTTCAAACACAACAACTTC
TCTAGCTTCGTTCGACAATTAAACACTTATGGGTTCAGGAAGATCGATCGAGACCAATGGGAATTTGCGAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAAG
CATCCATAGACGCAAACCAATCTACAGTCACAGCCAGAGCCAGAGCCCGAGCCCGAGCCAGAGCCAGAGCCATGGAAGTGGAGCTCCATTGTCCGAACCAGAGAGACAGG
AGCTCGAGCTAAAAATAAAAACCCTTCATCGAGAAAAGACCATCCTCCAATCCCAGCTGCAGAAACACGAGAACGAAAAGGAACAGATAGGGCGTCAAATCCACACGATC
TGTCAGCAGTTATGGCGAATGGGGAATCAACAGAAGCAGCTAATCGCGATGTTGGGGGCAGAGTTGCAGAAGCACCAGCCGAGCAAGAAGAGAAAGTTGGGGAAGTTGCG
TGAGTTCTTGGCTGAGGATTCGTTGGAATTTGAGAAGAATGGTTTGAAGAATGTGATGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCTTTGGGATTCTGCG
AGGATTTGCTGTGCAATGTCGCTGAGGTTTTGGGCGAGGAGATGAATGGGAAGTCGAAGGAAATGGAGGTGAAGGTTGTGGGATTGAAAGAAGGGGAAATGAGAAGGGAA
AATGGAGCGAATGATGTGTTTTGGGAACAGTTTTTGACTGAGATTCCAGGGTCTTCGAATGCTGGGGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGGTAG
Protein sequenceShow/hide protein sequence
ISFSFLQSSNFPSSFFLQVLSNPWSLWFHHGGDDDGVSDGLYRSVNSMDGSDGSGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNF
SSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSQSQSPSPSQSQSHGSGAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTI
CQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLREFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKVVGLKEGEMRRE
NGANDVFWEQFLTEIPGSSNAGEVYLDRRNNVVR