CuGenDBv2

Tan0007529 (gene) of Snake gourd v1 genome

Gene ID: Tan0007529
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heat stress transcription factor A-4c-like
Genome location: LG05:4841029..4842768
RNA-Seq Expression: Tan0007529
Synteny: Tan0007529
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0034605 - cellular response to heat (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
  IPR000232 - Heat shock factor (HSF)-type, DNA-binding
  IPR027725 - Heat shock transcription factor family
  IPR036388 - Winged helix-like DNA-binding domain superfamily
  IPR036390 - Winged helix DNA-binding domain superfamily
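The genome location above uses a seqid:start..end notation. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The helper names (Location, parse_location) are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as LG05:4841029..4842768 (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# seqid, a colon, then start and end separated by two dots
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' into a Location, or raise ValueError."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("LG05:4841029..4842768")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1740 bases on LG05.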


Homology
GenBank top hits (hit; e value; % identity)
KAG6604741.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.2e-147; 84.39% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHSQ Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HE+EKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS
        K+GKL+EFL EE LE ERDE +NGLKNG KVP LELM KLE++LG CEDLLCNVAEVLGGEM+GK KEME K VKEGE R ENGVND+FW QFLTE+PG 
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS

Query:  SNAGEIYLDRRNNV
        SN GE+YLDRR+NV
Subjt:  SNAGEIYLDRRNNV

KAG7034870.1 Heat stress transcription factor A-4b, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.0e-146; 83.86% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSH--SQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSK
        IHRRKPI+SH  SQ Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HE+EKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K
Subjt:  IHRRKPIYSH--SQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSK

Query:  KRKLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIP
        +RK+GKL+EFL EE LE ERDE +NGLKNG KVP LELM KLE++LG CEDLLCNVAEVLGGEM+GK KEME K VKEGE R ENGVND+FW QFLTE+P
Subjt:  KRKLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIP

Query:  GSSNAGEIYLDRRNNV
        G SN GE+YLDRR+NV
Subjt:  GSSNAGEIYLDRRNNV

XP_022948138.1 heat stress transcription factor A-4d-like [Cucurbita moschata]; e value 3.1e-146; 83.54% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHS  Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HENEKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS
        K+GKL+EFL EE LE ERDE +NGLKNG +VP LELM KLE++LG CEDLLCNVAEVLGGEM+GK KEME K VKEGE R ENGVND+FW QFLTE+PG 
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS

Query:  SNAGEIYLDRRNNVVK
        SN GE+YLDRR+NV++
Subjt:  SNAGEIYLDRRNNVVK

XP_022970770.1 heat stress transcription factor A-4c-like [Cucurbita maxima]; e value 1.2e-145; 83.49% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHSQ Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HE+EKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS
        K+GKL+EFL EE LE ERDE KNGL   KVP LELM KLE++LG CEDLLCNVAEVLGGEM+ K KEME K VKEGE R ENGVND+FW QFLTE+PG S
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS

Query:  NAGEIYLDRRNNVVK
        NAGE+YLDRR+NV++
Subjt:  NAGEIYLDRRNNVVK

XP_023534238.1 heat stress transcription factor A-4c-like [Cucurbita pepo subsp. pepo]; e value 5.6e-148; 83.86% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHSQ Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HE+EKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS
        K+GKL+EFL EE LE ERDE +NGLKNG KVP LELM KLE++LG CEDLLCNVAEVLGGEM+GK KEME K VKEGE R ENGVND+FW QFLTE+PG 
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS

Query:  SNAGEIYLDRRNNVVK
        SN GE+YLDRR+NV++
Subjt:  SNAGEIYLDRRNNVVK

TrEMBL top hits (hit; e value; % identity)
A0A0A0KBI1 HSF_DOMAIN domain-containing protein; e value 2.9e-126; 76.83% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        MDGSEGS  GAPPPFLTKTYEMVDDPMTNSIVSW+QSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPIYSHSQ    SQ +G GAPLSE ER ELE KIKTL+QEK+ LQSQLQKHENEKEQIG QIQ IC++LWRMGNQQKQLI ILGAEL+K++  KKR
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS
        K+GK+NEFLVEE  EFE+D  K   K   VPPLEL+ KLEL+LG CEDLL NV +VL      + KEMEVK  KEGEMR  +GVND+FW  FLTEIPGSS
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS

Query:  NAGEIYLDRRNNVVK
        N  +++LDRRNNVV+
Subjt:  NAGEIYLDRRNNVVK

A0A1S3C373 heat stress transcription factor A-4c-like; e value 7.2e-125; 76.51% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M  SEGS  GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKP+YSHSQ    SQ +G GAPLSE ERQELE KIKTL+QEK+ L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLI ILGAEL+KH+  KKR
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS
        K+GK+NE LVEE  EFERD+ K   K   V PLELM KLEL+L  CEDLLCNVA+VL      + KEMEVK  KEGEMR  +GVND+FW  FLTEIPGSS
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS

Query:  NAGEIYLDRRNNVVK
           E+YLDRRNNVV+
Subjt:  NAGEIYLDRRNNVVK

A0A5D3BGW6 Heat stress transcription factor A-4c-like; e value 7.2e-125; 76.51% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M  SEGS  GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFA+ELLP+YFKHNNFSSFVRQLNTYGFRKIDR+QWEFANEGFIRG+THLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKP+YSHSQ    SQ +G GAPLSE ERQELE KIKTL+QEK+ L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLI ILGAEL+KH+  KKR
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS
        K+GK+NE LVEE  EFERD+ K   K   V PLELM KLEL+L  CEDLLCNVA+VL      + KEMEVK  KEGEMR  +GVND+FW  FLTEIPGSS
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS

Query:  NAGEIYLDRRNNVVK
           E+YLDRRNNVV+
Subjt:  NAGEIYLDRRNNVVK

A0A6J1G8J3 heat stress transcription factor A-4d-like; e value 1.5e-146; 83.54% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHS  Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HENEKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS
        K+GKL+EFL EE LE ERDE +NGLKNG +VP LELM KLE++LG CEDLLCNVAEVLGGEM+GK KEME K VKEGE R ENGVND+FW QFLTE+PG 
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNG-KVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGS

Query:  SNAGEIYLDRRNNVVK
        SN GE+YLDRR+NV++
Subjt:  SNAGEIYLDRRNNVVK

A0A6J1I3T7 heat stress transcription factor A-4c-like; e value 5.6e-146; 83.49% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+GS+GSYGGAPPPFLTKTYEMVDDPMTNS+VSWS+SG+SFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRK+DRDQWEFANEGFIRGRTHLLK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKPI+SHSQ Q QSQSHGSGAPLSE ERQELELKIKTLHQEK+ILQ+QLQ+HE+EKEQIGRQIQT+CQQ+WRMGNQQKQLIAI+ AELQK Q  K+R
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS
        K+GKL+EFL EE LE ERDE KNGL   KVP LELM KLE++LG CEDLLCNVAEVLGGEM+ K KEME K VKEGE R ENGVND+FW QFLTE+PG S
Subjt:  KLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSS

Query:  NAGEIYLDRRNNVVK
        NAGE+YLDRR+NV++
Subjt:  NAGEIYLDRRNNVVK

SwissProt top hits (hit; e value; % identity)
O49403 Heat stress transcription factor A-4a; e value 1.3e-51; 34.32% identity
Query:  DGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGI
        + + G    + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K I
Subjt:  DGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGI

Query:  HRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQK--------
        HRRKP++SHS   +Q+Q +    PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q++ + ++L  M  +QK +++ +   L+K        
Subjt:  HRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQK--------

Query:  ----HQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMR-----KLELALGFCEDLLCNVAEVLGGEMNGKTKE-----------------
                +KR+  ++ EF  +E +  E        + G   P    R     +LE ++   E+L+ +  E +    +  T +                 
Subjt:  ----HQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMR-----KLELALGFCEDLLCNVAEVLGGEMNGKTKE-----------------

Query:  ---------------MEVKIVKEGEMRREN----------GVNDMFWAQFLTEIPGSSNAGEIYLDRRNN
                       +++    +G   +            G ND FW QF +E PGS+   E+ L+R+++
Subjt:  ---------------MEVKIVKEGEMRREN----------GVNDMFWAQFLTEIPGSSNAGEIYLDRRNN

P41153 Heat shock factor protein HSF8; e value 1.5e-47; 52.13% identity
Query:  APPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSH
        APPPFL KTY+MVDDP T+ IVSWS +  SFVVW+PPEFAK+LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLK I RRKP + H
Subjt:  APPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSH

Query:  SQIQIQSQSH--------GSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAIL
        +Q Q Q   H        G  A +    E  +  LE +++ L ++K++L  +L +   +++    Q+Q + Q+L  M  +Q+Q+++ L
Subjt:  SQIQIQSQSH--------GSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAIL

Q93VB5 Heat stress transcription factor A-4d; e value 1.1e-53; 42.25% identity
Query:  GSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIH
        G  G  GG PPPFL KTYEMV+D  TN +VSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID ++WEFANE FIRG THLLK IH
Subjt:  GSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIH

Query:  RRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKH--------
        RRKP++SHS   +Q+Q +G   PL+E ER+ELE +I  L  EKSIL + LQ+   ++  I  Q+Q +  +L  M  +QK ++A L   LQ+         
Subjt:  RRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKH--------

Query:  ----QPSKKRKLGKLNEFLVEESLEFERDE----KKNGLKNGKVPPL------ELMRKLELALGFCEDLL------CNVAEVLGGEMNGKTKEMEVKIVK
              SKKR++ K++ F V++    E  +    +  G     +PP+      E   ++EL+L   E L       C  AE +    +G T+       +
Subjt:  ----QPSKKRKLGKLNEFLVEESLEFERDE----KKNGLKNGKVPPL------ELMRKLELALGFCEDLL------CNVAEVLGGEMNGKTKEMEVKIVK

Query:  E-GEMRRENGVNDMFWAQFLTEIPGSSNA
        E      E G++    A      P + NA
Subjt:  E-GEMRRENGVNDMFWAQFLTEIPGSSNA

Q94J16 Heat stress transcription factor A-4b; e value 2.3e-48; 41.97% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        M+G  G  GG+ PPFL+KTYEMVDDP T+++V W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D +QWEFANE FI+G+ H LK 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIA----------ILGAE
        IHRRKPI+SHS     S S G+G PL++ ER++ E +I+ L  + + L S+LQ +  +K  + +++Q + ++L+ + +QQ+ LI+           L + 
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIA----------ILGAE

Query:  LQKHQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPL--ELMRKLELALGFCEDLLCNVAEVLGGEMN
        +Q+    +K++   +     E++   E       L N        E   K+E +L   E+ L   +E  G +++
Subjt:  LQKHQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPL--ELMRKLELALGFCEDLLCNVAEVLGGEMN

Q9FK72 Heat stress transcription factor A-4c; e value 4.4e-55; 39.58% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        MD + G    + PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKP++SHS + +Q+Q+     PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +   L K  P    
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDE-KKNGLKNGKVPP----LELMRKLELALGFCEDLLCNVAEVLGGEMN-------------GKTKEMEVKIVKEGE-----
                    SL  E  E +K   +   +PP    +E + KLE +L F E+L+    E  G + +             G T+    KI    E     
Subjt:  KLGKLNEFLVEESLEFERDE-KKNGLKNGKVPP----LELMRKLELALGFCEDLLCNVAEVLGGEMN-------------GKTKEMEVKIVKEGE-----

Query:  --MRRENGVNDMFWAQFLTEIPGSSNAGEIYLDRRN
             + GVND FW Q LTE PGS+   E+  +RR+
Subjt:  --MRRENGVNDMFWAQFLTEIPGSSNAGEIYLDRRN

Arabidopsis top hits (hit; e value; % identity)
AT1G32330.1 heat shock transcription factor A1D; e value 5.4e-48; 52.15% identity
Query:  APPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSH
        APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLL+ I RRKP +  
Subjt:  APPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSH

Query:  SQIQIQSQ-SHGSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ
         Q   +SQ S+G  + +S   E  +  LE +++ L ++K++L  +L +   +++    Q+QT+ Q+L  M N+Q+QL++ L   +Q
Subjt:  SQIQIQSQ-SHGSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ

AT4G17750.1 heat shock factor 1; e value 5.9e-47; 50% identity
Query:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSHS
        PPPFL+KTY+MV+DP T++IVSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSHS

Query:  ------QIQIQSQSHGSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ
              Q Q  SQ  GS A LS   E  +  LE +++ L ++K++L  +L K   +++    ++Q + + L  M  +Q+Q+++ L   +Q
Subjt:  ------QIQIQSQSHGSGAPLS---EPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ

AT4G18880.1 heat shock transcription factor A4A; e value 9.4e-53; 34.32% identity
Query:  DGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGI
        + + G    + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K I
Subjt:  DGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGI

Query:  HRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQK--------
        HRRKP++SHS   +Q+Q +    PL++ ER  +  +I+ L +EK  L  +L K + E+E    Q++ + ++L  M  +QK +++ +   L+K        
Subjt:  HRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQK--------

Query:  ----HQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMR-----KLELALGFCEDLLCNVAEVLGGEMNGKTKE-----------------
                +KR+  ++ EF  +E +  E        + G   P    R     +LE ++   E+L+ +  E +    +  T +                 
Subjt:  ----HQPSKKRKLGKLNEFLVEESLEFERDEKKNGLKNGKVPPLELMR-----KLELALGFCEDLLCNVAEVLGGEMNGKTKE-----------------

Query:  ---------------MEVKIVKEGEMRREN----------GVNDMFWAQFLTEIPGSSNAGEIYLDRRNN
                       +++    +G   +            G ND FW QF +E PGS+   E+ L+R+++
Subjt:  ---------------MEVKIVKEGEMRREN----------GVNDMFWAQFLTEIPGSSNAGEIYLDRRNN

AT5G16820.1 heat shock factor 3; e value 8.5e-46; 51.38% identity
Query:  PPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIY-SHS
        PPFL+KTY+MVDDP+TN +VSWS    SFVVW+ PEF+K LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RGR  LLK I RRKP +   +
Subjt:  PPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIY-SHS

Query:  QIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ
        Q Q Q QS   GA + E  +  +E +++ L ++K++L  +L +   +++    Q+Q + Q++  M  +Q+Q+++ L   +Q
Subjt:  QIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQ

AT5G45710.1 winged-helix DNA-binding transcription factor family protein; e value 3.1e-56; 39.58% identity
Query:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG
        MD + G    + PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K 
Subjt:  MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKG

Query:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR
        IHRRKP++SHS + +Q+Q+     PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++   Q+ T+  +L  M   QK ++A +   L K  P    
Subjt:  IHRRKPIYSHSQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKR

Query:  KLGKLNEFLVEESLEFERDE-KKNGLKNGKVPP----LELMRKLELALGFCEDLLCNVAEVLGGEMN-------------GKTKEMEVKIVKEGE-----
                    SL  E  E +K   +   +PP    +E + KLE +L F E+L+    E  G + +             G T+    KI    E     
Subjt:  KLGKLNEFLVEESLEFERDE-KKNGLKNGKVPP----LELMRKLELALGFCEDLLCNVAEVLGGEMN-------------GKTKEMEVKIVKEGE-----

Query:  --MRRENGVNDMFWAQFLTEIPGSSNAGEIYLDRRN
             + GVND FW Q LTE PGS+   E+  +RR+
Subjt:  --MRRENGVNDMFWAQFLTEIPGSSNAGEIYLDRRN
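In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the function below tallies these for one alignment row. The name midline_stats is hypothetical, and the percentage it returns is per row, so it is not expected to reproduce the whole-alignment % identity reported for each hit.

```python
def midline_stats(query_row: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment row.

    query_row: the residues after the 'Query:' label, gaps included
    midline:   the matching middle line, without its leading indentation
    """
    columns = len(query_row)
    # Trailing spaces are often trimmed from the midline, so pad it back.
    midline = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / columns if columns else 0.0,
    }


# First eight columns of the first GenBank hit shown above.
print(midline_stats("MDGSEGSY", "M+GS+GSY"))
```

For those eight columns, six residues are identical and two are conservative substitutions.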


Sequences
CDS sequence
ATGGATGGCTCAGAGGGAAGCTACGGCGGCGCACCGCCGCCATTTCTGACTAAAACATATGAGATGGTGGATGATCCAATGACCAATTCCATCGTGTCATGGAGTCAGAG
CGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAATTGCTCCCTGTTTATTTCAAACACAACAATTTCTCTAGCTTCGTTCGTCAATTAAACACTTATG
GGTTCAGGAAAATCGATCGAGATCAATGGGAATTCGCCAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAGGCATCCATAGACGCAAACCAATCTACAGCCAT
AGCCAGATCCAGATCCAGAGCCAGAGCCATGGAAGTGGAGCTCCATTATCAGAACCAGAGAGACAAGAACTCGAGCTAAAAATCAAAACCCTTCATCAAGAAAAAAGCAT
CCTCCAATCCCAGCTACAGAAACACGAAAACGAAAAGGAACAAATCGGGCGTCAAATTCAAACAATCTGTCAGCAGTTATGGCGAATGGGAAATCAACAGAAGCAGCTGA
TTGCGATATTGGGGGCAGAGTTGCAGAAGCATCAGCCGAGCAAGAAGAGAAAACTGGGGAAATTGAACGAGTTTTTGGTTGAGGAGTCGTTGGAATTTGAGAGAGATGAG
AAGAAGAATGGTTTGAAGAATGGAAAGGTTCCGCCATTGGAGCTGATGAGGAAGCTGGAATTGGCGTTGGGGTTCTGTGAGGATTTGCTCTGCAATGTGGCGGAGGTTTT
GGGCGGAGAGATGAATGGGAAGACGAAGGAAATGGAAGTTAAGATTGTGAAAGAAGGGGAAATGAGAAGAGAAAATGGAGTGAATGATATGTTTTGGGCACAATTTTTGA
CCGAAATTCCGGGGTCTTCGAATGCCGGGGAAATTTATTTGGATAGAAGGAATAATGTTGTAAAGTAG
mRNA sequence
TAAAGTATAAGTATCCAATGGTATTTAAAAAGAAAAGAAGAAATGGTTGGTCTTCATAATCCAGTCAAATTGACCTAAATTCATTCAAAGATCCCCAAAAAAACATAAAA
TAGAAAAACATACGACTTTCTCCTCAGAATCGAGAGTTGAAAATTCACCCATTGCTACAGAGCTCCGGTCAACCCCTTTTTAAGTCTCTCTTTTTCTTTCTTTTAGACTT
TTTCCACCACCGCGTCTATAAATCCCTCTCCCATTTCTTCATTTTTCAGTTCAATCAAAATTCAAAATCCATCTCCTTCAAATATCCCTTCAAGCTTTTCCTTCAAGTTC
TCATCTTGTTTCTAATCTCAGCAAAACTCTATTTCCTCTGTTTTTCTTCAACATACCCTCCTCCGTCACTCTGCTTACAGCACGGCGGCGACGACGACGACGGCGCCGGC
GAGGGACTCTATTGTTCAGTGAAAGAAATGGATGGCTCAGAGGGAAGCTACGGCGGCGCACCGCCGCCATTTCTGACTAAAACATATGAGATGGTGGATGATCCAATGAC
CAATTCCATCGTGTCATGGAGTCAGAGCGGTTTCAGCTTCGTGGTTTGGAACCCACCGGAATTCGCCAAAGAATTGCTCCCTGTTTATTTCAAACACAACAATTTCTCTA
GCTTCGTTCGTCAATTAAACACTTATGGGTTCAGGAAAATCGATCGAGATCAATGGGAATTCGCCAACGAGGGGTTCATAAGAGGACGAACCCATCTTCTAAAAGGCATC
CATAGACGCAAACCAATCTACAGCCATAGCCAGATCCAGATCCAGAGCCAGAGCCATGGAAGTGGAGCTCCATTATCAGAACCAGAGAGACAAGAACTCGAGCTAAAAAT
CAAAACCCTTCATCAAGAAAAAAGCATCCTCCAATCCCAGCTACAGAAACACGAAAACGAAAAGGAACAAATCGGGCGTCAAATTCAAACAATCTGTCAGCAGTTATGGC
GAATGGGAAATCAACAGAAGCAGCTGATTGCGATATTGGGGGCAGAGTTGCAGAAGCATCAGCCGAGCAAGAAGAGAAAACTGGGGAAATTGAACGAGTTTTTGGTTGAG
GAGTCGTTGGAATTTGAGAGAGATGAGAAGAAGAATGGTTTGAAGAATGGAAAGGTTCCGCCATTGGAGCTGATGAGGAAGCTGGAATTGGCGTTGGGGTTCTGTGAGGA
TTTGCTCTGCAATGTGGCGGAGGTTTTGGGCGGAGAGATGAATGGGAAGACGAAGGAAATGGAAGTTAAGATTGTGAAAGAAGGGGAAATGAGAAGAGAAAATGGAGTGA
ATGATATGTTTTGGGCACAATTTTTGACCGAAATTCCGGGGTCTTCGAATGCCGGGGAAATTTATTTGGATAGAAGGAATAATGTTGTAAAGTAGATTAGTTTTTTTTTC
TTTTTTTTTTTGTAAATGATTTTGTTCTGTATACTTTTTTGAATTAGAATTCTAGTTTTGGAGATGGAAGATTTGTTCATTACAATTATCTTGTGGTGTTTTTTTTTTCT
CGTAGCTGGTTGTATCAATTTTAACTATGAACTTTCAATTTTATCAATTTCAAC
Protein sequence
MDGSEGSYGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAKELLPVYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKGIHRRKPIYSH
SQIQIQSQSHGSGAPLSEPERQELELKIKTLHQEKSILQSQLQKHENEKEQIGRQIQTICQQLWRMGNQQKQLIAILGAELQKHQPSKKRKLGKLNEFLVEESLEFERDE
KKNGLKNGKVPPLELMRKLELALGFCEDLLCNVAEVLGGEMNGKTKEMEVKIVKEGEMRRENGVNDMFWAQFLTEIPGSSNAGEIYLDRRNNVVK
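The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal illustration, assuming the standard genetic code; the translate helper is hypothetical and is applied only to the first and last few codons of the CDS to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 6 codons of the CDS listed above.
print(translate("ATGGATGGCTCAGAGGGAAGCTACGGCGGCGCACCGCCG"))
print(translate("AATAATGTTGTAAAGTAG"))
```

The two fragments should correspond to the start (MDGSEGSYGGAPP) and end (NNVVK) of the protein sequence listed above; passing the full CDS with its line breaks works the same way.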