CuGenDBv2

Lag0009722 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009722
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: heat stress transcription factor A-4c-like
Genome location: chr9:41750186..41751536
RNA-Seq Expression: Lag0009722
Synteny: Lag0009722
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0034605 - cellular response to heat (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
  IPR000232 - Heat shock factor (HSF)-type, DNA-binding
  IPR027725 - Heat shock transcription factor family
  IPR036388 - Winged helix-like DNA-binding domain superfamily
  IPR036390 - Winged helix DNA-binding domain superfamily
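The genome location string in this record can be split into its chromosome and coordinates with a few lines of Python. This is a minimal sketch, not part of CuGenDB: `parse_location` and `span_length` are hypothetical helper names, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr9:41750186..41751536")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1351 bp on chr9.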


Homology
GenBank top hits | e value | % identity | Alignment
KAG6604741.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-139 | 81.99
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQSQ    SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL
          K+RK+GKLSEFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFL
Subjt:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL

Query:  TEIPGSSNAGEVYLDRRNNVVR
        TE+PG SN GEVYLDRR+NV R
Subjt:  TEIPGSSNAGEVYLDRRNNVVR

KAG7034870.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.5e-141 | 82.61
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQSQSQ  SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL
          K+RK+GKLSEFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFL
Subjt:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL

Query:  TEIPGSSNAGEVYLDRRNNVVR
        TE+PG SN GEVYLDRR+NV R
Subjt:  TEIPGSSNAGEVYLDRRNNVVR

XP_022948138.1 heat stress transcription factor A-4d-like [Cucurbita moschata] | 1.9e-139 | 81.68
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQ      SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HENEKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL
          K+RK+GKLSEFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFL
Subjt:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL

Query:  TEIPGSSNAGEVYLDRRNNVVR
        TE+PG SN GEVYLDRR+NV+R
Subjt:  TEIPGSSNAGEVYLDRRNNVVR

XP_022970770.1 heat stress transcription factor A-4c-like [Cucurbita maxima] | 4.1e-139 | 81.45
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQSQ    SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTEIP
          K+RK+GKLSEFL E+ LE E++    + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+ K KEME K  G+KEGE R ENG NDVFWEQFLTE+P
Subjt:  PSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTEIP

Query:  GSSNAGEVYLDRRNNVVR
        G SNAGEVYLDRR+NV+R
Subjt:  GSSNAGEVYLDRRNNVVR

XP_023534238.1 heat stress transcription factor A-4c-like [Cucurbita pepo subsp. pepo] | 4.9e-140 | 81.99
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQSQ    SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL
          K+RK+GKLSEFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFL
Subjt:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL

Query:  TEIPGSSNAGEVYLDRRNNVVR
        TE+PG SN GEVYLDRR+NV+R
Subjt:  TEIPGSSNAGEVYLDRRNNVVR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KBI1 HSF_DOMAIN domain-containing protein | 8.2e-125 | 76.25
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        MDGSEGS   GAPPPFLTKTYEMVDDPMTNS+VSW+QSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPIYSHSQS       SQ +G  APLSE ER ELE KIKTL++EKT LQSQLQKHENEKEQIG QI  IC++LWRMGNQQKQLI +LGAEL+K++
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTE
          KKRK+GK++EFL E+  EFEK+ L  K V VPPLEL+GKLELSLG CEDLL NV +VL E      KEMEVK    KEGEMR  +G NDVFWE FLTE
Subjt:  PSKKRKLGKLSEFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTE

Query:  IPGSSNAGEVYLDRRNNVVR
        IPGSSN  +V+LDRRNNVVR
Subjt:  IPGSSNAGEVYLDRRNNVVR

A0A5D3BGW6 Heat stress transcription factor A-4c-like | 2.2e-122 | 75.31
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M  SEGS   GAPPPFLTKTYEMVDDPM+NS+VSWSQSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKP+YSHSQS       SQ +G  APLSE ERQELE KIKTL++EKT L+SQLQKHENEKEQIG QI  IC++LWRMG+QQKQLI +LGAEL+KH+
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTE
          KKRK+GK++E L E+  EFE++ L  K V V PLELMGKLELSL  CEDLLCNVA+VL E      KEMEVK    KEGEMR  +G NDVFWE FLTE
Subjt:  PSKKRKLGKLSEFLAEDSLEFEKNGL--KNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTE

Query:  IPGSSNAGEVYLDRRNNVVR
        IPGSS   EVYLDRRNNVVR
Subjt:  IPGSSNAGEVYLDRRNNVVR

A0A6J1D7W3 heat stress transcription factor A-4c-like | 2.5e-126 | 75.38
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        MD SE S   GAPPPFLTKTYEMVDDP TN+VVSWSQSGFSFVVWNPPEFAKELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGF+RGRTHLL+
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVA---PLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
        +IHRRKPIYSHSQ+QS S +Q QSH   +   P SEPERQELE KIK L++E T LQSQLQKHE EKE+I RQI ++C QLWRMGN+QKQLI ML A+LQ
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVA---PLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

Query:  KHQPSKKRKLGKLSEFLAEDSLEF---EKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEM-EVKIVGLKEGEMRRENGANDVFWE
        K  PSKKR+L KL++ L ED  E+   EKNG K+VM+PPLELMGKLE SLG CE+LLC+VA+V+GEE    SK + EVK+VG KEG++R  NGANDVFWE
Subjt:  KHQPSKKRKLGKLSEFLAEDSLEF---EKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEM-EVKIVGLKEGEMRRENGANDVFWE

Query:  QFLTEIPGSSNAGEV-YLDRRNNVV
        QFLTEIPGSSNAGEV YL+RRNNVV
Subjt:  QFLTEIPGSSNAGEV-YLDRRNNVV

A0A6J1G8J3 heat stress transcription factor A-4d-like | 9.0e-140 | 81.68
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQ      SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HENEKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL
          K+RK+GKLSEFL E+ LE    E+NGLKN + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+GK KEME K  G+KEGE R ENG NDVFWEQFL
Subjt:  PSKKRKLGKLSEFLAEDSLEF---EKNGLKN-VMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFL

Query:  TEIPGSSNAGEVYLDRRNNVVR
        TE+PG SN GEVYLDRR+NV+R
Subjt:  TEIPGSSNAGEVYLDRRNNVVR

A0A6J1I3T7 heat stress transcription factor A-4c-like | 2.0e-139 | 81.45
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        M+GS+GS  GGAPPPFLTKTYEMVDDPMTNSVVSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ
        SIHRRKPI+SHSQSQ    SQSQSHGS APLSE ERQELELKIKTLH+EKTILQ+QLQ+HE+EKEQIGRQI T+CQQ+WRMGNQQKQLIA++ AELQK Q
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQ

Query:  PSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTEIP
          K+RK+GKLSEFL E+ LE E++    + VP LELMGKLE+SLG CEDLLCNVAEVLG EM+ K KEME K  G+KEGE R ENG NDVFWEQFLTE+P
Subjt:  PSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTEIP

Query:  GSSNAGEVYLDRRNNVVR
        G SNAGEVYLDRR+NV+R
Subjt:  GSSNAGEVYLDRRNNVVR

SwissProt top hits | e value | % identity | Alignment
O49403 Heat stress transcription factor A-4a | 6.4e-50 | 34.13
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        MD +       + PPFLTKTYEMVDD  ++S+VSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK--
        +IHRRKP++SHS    Q+         + PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q+  + ++L  M  +QK +++ +   L+K  
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK--

Query:  ----------HQPSKKRKLGKLSEFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME----------
                      +KR+  ++  F  E  LE  K  +   +     P        + +LE S+   E+L+ +  E + +  +  + +++          
Subjt:  ----------HQPSKKRKLGKLSEFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME----------

Query:  -----------------VKIVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN
                          +I+ +    +G   +            GAND FW+QF +E PGS+   EV L+R+++
Subjt:  -----------------VKIVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN

Q93VB5 Heat stress transcription factor A-4d | 3.3e-54 | 46.02
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        + G  G GGGG PPPFL KTYEMV+D  TN VVSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID ++WEFANE FIRG THLLK
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKH-
        +IHRRKP++SHS        Q+Q +G   PL+E ER+ELE +I  L  EK+IL + LQ+   ++  I  Q+  +  +L  M  +QK ++A L   LQ+  
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKH-

Query:  -----------QPSKKRKLGKLSEFL-------AEDSLEFEKNGLKNVMVPPL------ELMGKLELSLGFCEDLL--CNVAEVLGEEM
                     SKKR++ K+  F+        +   +F+  G     +PP+      E   ++ELSL   E L    N A    EEM
Subjt:  -----------QPSKKRKLGKLSEFL-------AEDSLEFEKNGLKNVMVPPL------ELMGKLELSLGFCEDLL--CNVAEVLGEEM

Q94J16 Heat stress transcription factor A-4b | 1.6e-48 | 42.14
Query:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR
        EG GGGG+ PPFL+KTYEMVDDP T++VV W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D +QWEFANE FI+G+ H LK+IHR
Subjt:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR

Query:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIA----------MLGA
        RKPI+SHS       S SQ  G   PL++ ER++ E +I+ L  +   L S+LQ +  +K  + +++  + ++L+ + +QQ+ LI+           L +
Subjt:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIA----------MLGA

Query:  ELQKHQPSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVP-----------PLELMGKLELSLGFCEDLLCNVAEVLGEEMN
         +Q+    +K++   +     ED+     N  +N ++P             E   K+E SL   E+ L   +E  G +++
Subjt:  ELQKHQPSKKRKLGKLSEFLAEDSLEFEKNGLKNVMVP-----------PLELMGKLELSLGFCEDLLCNVAEVLGEEMN

Q9FK72 Heat stress transcription factor A-4c | 4.3e-54 | 39.52
Query:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR
        E +GG  + PPFLTKTYEMVDD  ++SVV+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+IHR
Subjt:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR

Query:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKK
        RKP++SHS    Q+ +         PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +            
Subjt:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKK

Query:  RKLGKLSEFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRREN---------------
        + LGK    L  ++ E  K   +   +PP    +E + KLE SL F E+L+    E  G  +   S + +     L  G+ R ++               
Subjt:  RKLGKLSEFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRREN---------------

Query:  -----GANDVFWEQFLTEIPGSSNAGEVYLDRRN
             G ND FWEQ LTE PGS+   EV  +RR+
Subjt:  -----GANDVFWEQFLTEIPGSSNAGEVYLDRRN

Q9LQM7 Heat stress transcription factor A-1d | 3.5e-48 | 47.42
Query:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH
        APPPFL+KTY+MVDD  T+S+VSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLL+SI RRKP +  
Subjt:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH

Query:  SQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLS
         Q   +S   +  + SV+   E  +  LE +++ L R+K +L  +L +   +++    Q+ T+ Q+L  M N+Q+QL++ L   +Q            LS
Subjt:  SQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLS

Query:  EFLAEDSLEFEKN
        +FL + + + E N
Subjt:  EFLAEDSLEFEKN

Arabidopsis top hits | e value | % identity | Alignment
AT1G32330.1 heat shock transcription factor A1D | 2.5e-49 | 47.42
Query:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH
        APPPFL+KTY+MVDD  T+S+VSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLL+SI RRKP +  
Subjt:  APPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSH

Query:  SQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLS
         Q   +S   +  + SV+   E  +  LE +++ L R+K +L  +L +   +++    Q+ T+ Q+L  M N+Q+QL++ L   +Q            LS
Subjt:  SQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLS

Query:  EFLAEDSLEFEKN
        +FL + + + E N
Subjt:  EFLAEDSLEFEKN

AT4G17750.1 heat shock factor 1 | 4.7e-48 | 50.53
Query:  PPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHS
        PPPFL+KTY+MV+DP T+++VSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHS

Query:  QSQSQSPSQ--SQSHGSVAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
         S S   SQ  SQ  GS+A LS   E  +  LE +++ L R+K +L  +L K   +++    ++  + + L  M  +Q+Q+++ L   +Q
Subjt:  QSQSQSPSQ--SQSHGSVAPLS---EPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

AT4G18880.1 heat shock transcription factor A4A | 4.5e-51 | 34.13
Query:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK
        MD +       + PPFLTKTYEMVDD  ++S+VSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K
Subjt:  MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLK

Query:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK--
        +IHRRKP++SHS    Q+         + PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q+  + ++L  M  +QK +++ +   L+K  
Subjt:  SIHRRKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQK--

Query:  ----------HQPSKKRKLGKLSEFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME----------
                      +KR+  ++  F  E  LE  K  +   +     P        + +LE S+   E+L+ +  E + +  +  + +++          
Subjt:  ----------HQPSKKRKLGKLSEFLAEDSLEFEKNGL---KNVMVPPL-----ELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEME----------

Query:  -----------------VKIVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN
                          +I+ +    +G   +            GAND FW+QF +E PGS+   EV L+R+++
Subjt:  -----------------VKIVGLK---EGEMRREN----------GANDVFWEQFLTEIPGSSNAGEVYLDRRNN

AT5G16820.1 heat shock factor 3 | 1.3e-45 | 53.26
Query:  PPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSQ
        PPFL+KTY+MVDDP+TN VVSWS    SFVVW+ PEF+K LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RGR  LLKSI RRKP  SH Q
Subjt:  PPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSQ

Query:  SQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ
         Q+Q  +Q QS  SV    E  +  +E +++ L R+K +L  +L +   +++    Q+  + Q++  M  +Q+Q+++ L   +Q
Subjt:  SQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQ

AT5G45710.1 winged-helix DNA-binding transcription factor family protein | 3.0e-55 | 39.52
Query:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR
        E +GG  + PPFLTKTYEMVDD  ++SVV+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+IHR
Subjt:  EGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR

Query:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKK
        RKP++SHS    Q+ +         PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +            
Subjt:  RKPIYSHSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKK

Query:  RKLGKLSEFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRREN---------------
        + LGK    L  ++ E  K   +   +PP    +E + KLE SL F E+L+    E  G  +   S + +     L  G+ R ++               
Subjt:  RKLGKLSEFLAEDSLEFEKNGLKNVMVPP----LELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRREN---------------

Query:  -----GANDVFWEQFLTEIPGSSNAGEVYLDRRN
             G ND FWEQ LTE PGS+   EV  +RR+
Subjt:  -----GANDVFWEQFLTEIPGSSNAGEVYLDRRN


Sequences
CDS sequence
ATGGATGGCTCAGAAGGAAGCGGCGGCGGCGGCGCGCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCGTCGTCTCATGGAGTCA
AAGCGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAACTTCTCCCTGTTTATTTCAAACACAACAATTTCTCTAGCTTCGTTCGACAATTAAACACTT
ATGGGTTCAGGAAGATCGATCGAGACCAATGGGAATTTGCGAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGACGCAAACCAATCTACAGT
CACAGCCAGAGCCAGAGCCAGAGTCCGAGCCAGAGCCAGAGCCATGGAAGTGTGGCTCCATTGTCCGAACCAGAGAGACAAGAACTCGAGCTAAAAATCAAAACCCTTCA
TCGAGAAAAGACCATCCTCCAATCCCAGCTGCAGAAACACGAGAACGAAAAGGAACAGATCGGGCGTCAAATCCACACGATCTGTCAGCAGTTATGGCGAATGGGGAATC
AACAGAAGCAGCTAATCGCGATGTTGGGGGCAGAGTTGCAGAAGCATCAGCCGAGCAAGAAGAGAAAGTTGGGGAAGTTGAGTGAGTTCTTGGCTGAGGATTCGTTGGAA
TTTGAGAAGAATGGTTTGAAGAATGTGATGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCTTTGGGATTCTGTGAGGATTTGCTGTGCAATGTGGCTGAGGT
TTTGGGCGAGGAGATGAATGGGAAGTCGAAGGAAATGGAGGTGAAGATTGTGGGATTGAAAGAAGGGGAAATGAGAAGGGAAAATGGAGCGAATGATGTGTTTTGGGAAC
AGTTTTTGACTGAGATTCCTGGGTCTTCGAATGCTGGGGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGATCCCAAACTATGGAGGAGTCAATTCCTACCATCATA
TGTGTTCCTCTAGGTAACTCATGGCCAGGTGATCACGTCTTCCTTGGAGATGCTCCTTCGACCAAACATTTCATCGTTTCGCGTGTTTGA
mRNA sequence
ATGGATGGCTCAGAAGGAAGCGGCGGCGGCGGCGCGCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCGTCGTCTCATGGAGTCA
AAGCGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAACTTCTCCCTGTTTATTTCAAACACAACAATTTCTCTAGCTTCGTTCGACAATTAAACACTT
ATGGGTTCAGGAAGATCGATCGAGACCAATGGGAATTTGCGAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGACGCAAACCAATCTACAGT
CACAGCCAGAGCCAGAGCCAGAGTCCGAGCCAGAGCCAGAGCCATGGAAGTGTGGCTCCATTGTCCGAACCAGAGAGACAAGAACTCGAGCTAAAAATCAAAACCCTTCA
TCGAGAAAAGACCATCCTCCAATCCCAGCTGCAGAAACACGAGAACGAAAAGGAACAGATCGGGCGTCAAATCCACACGATCTGTCAGCAGTTATGGCGAATGGGGAATC
AACAGAAGCAGCTAATCGCGATGTTGGGGGCAGAGTTGCAGAAGCATCAGCCGAGCAAGAAGAGAAAGTTGGGGAAGTTGAGTGAGTTCTTGGCTGAGGATTCGTTGGAA
TTTGAGAAGAATGGTTTGAAGAATGTGATGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCTTTGGGATTCTGTGAGGATTTGCTGTGCAATGTGGCTGAGGT
TTTGGGCGAGGAGATGAATGGGAAGTCGAAGGAAATGGAGGTGAAGATTGTGGGATTGAAAGAAGGGGAAATGAGAAGGGAAAATGGAGCGAATGATGTGTTTTGGGAAC
AGTTTTTGACTGAGATTCCTGGGTCTTCGAATGCTGGGGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGATCCCAAACTATGGAGGAGTCAATTCCTACCATCATA
TGTGTTCCTCTAGGTAACTCATGGCCAGGTGATCACGTCTTCCTTGGAGATGCTCCTTCGACCAAACATTTCATCGTTTCGCGTGTTTGA
Protein sequence
MDGSEGSGGGGAPPPFLTKTYEMVDDPMTNSVVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYS
HSQSQSQSPSQSQSHGSVAPLSEPERQELELKIKTLHREKTILQSQLQKHENEKEQIGRQIHTICQQLWRMGNQQKQLIAMLGAELQKHQPSKKRKLGKLSEFLAEDSLE
FEKNGLKNVMVPPLELMGKLELSLGFCEDLLCNVAEVLGEEMNGKSKEMEVKIVGLKEGEMRRENGANDVFWEQFLTEIPGSSNAGEVYLDRRNNVVRSQTMEESIPTII
CVPLGNSWPGDHVFLGDAPSTKHFIVSRV
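
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies and that the CDS is in frame from its first base. `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. The two test strings are the first and last 21 bases of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGATGGCTCAGAAGGAAGC"))   # first 21 bases of the CDS: MDGSEGS
print(translate("TTCATCGTTTCGCGTGTTTGA"))   # last 21 bases of the CDS: FIVSRV
```

Passing the full CDS, with its line breaks left in, should give the protein sequence listed above if the stated assumptions hold.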