; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003701 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003701
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHeat shock transcription factor A9
Genome locationscaffold4:48988909..48991747
RNA-Seq ExpressionSpg003701
SyntenySpg003701
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141170.1 heat stress transcription factor A-7a isoform X1 [Cucumis sativus]2.8e-11363.02Show/hide
Query:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYG
        +KEE         A+  + DG      KPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLS  LLPKYFKH NFSSFIRQLNTYG
Subjt:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYG

Query:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE
        FRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N    ++HL +++   TL+DLTKP  VETE L+ L+TDNN L+VE+ KLREQQQDS NQLT VEE
Subjt:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE

Query:  RVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEV
        RVR  E K QQM  FL KMS NPAF RQL+QKRMLR ++   NG +EFGKK ++L +Q H+NLGL   D S DVN +N  QVQE +L + SEL E+FPEV
Subjt:  RVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEV

Query:  VEP---GRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        +EP   G +ETPFQAS                    E+MVVDE ++SNDS FFL+L+DL+ KP DC +GYVQKQ F+G VGSIP
Subjt:  VEP---GRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

XP_022999504.1 heat stress transcription factor A-7a-like [Cucurbita maxima]1.5e-11458.59Show/hide
Query:  GGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV
        GGG C +GAATAS ++MD       Q+TVKEEEIE                                          G G SD  GDG SASS  AKPM 
Subjt:  GGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV

Query:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LSI LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
Subjt:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR
        RYN  KQ   HL M M LQDLTK P VE EL ALK+DN  L+VE+LKLREQQ DSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQLVQ+RMLR
Subjt:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR

Query:  KEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGR--VETPFQASMNSKSRSSDAACMPPSNIFAE
        K+L  NG+EFG  RRLLA+QGH+NL                           SEL   EMFP V+EPG   +E P +AS                     
Subjt:  KEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGR--VETPFQASMNSKSRSSDAACMPPSNIFAE

Query:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP
         MVVDE+   +DSKFFLELEDLIKKP        DC +GYVQ+Q FHG VGSIP
Subjt:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP

XP_023546638.1 heat stress transcription factor A-2-like [Cucurbita pepo subsp. pepo]3.3e-11457.79Show/hide
Query:  GGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDC--SGDGGSASS-
        GGGGGGG C +GAATAS ++MD       Q++VKEEE+E                                +  ET     G G SDC   GDGG ASS 
Subjt:  GGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDC--SGDGGSASS-

Query:  -AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK
         AKPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LSI LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK
Subjt:  -AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK

Query:  NIKRRSRYN-YNKQ----HLAMAMTLQDLTKPG-VETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ
        NIKRRSRYN Y KQ    HL M M LQDL+K   VE EL+ALK+DNN L+VE+LKLREQQQDSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQ
Subjt:  NIKRRSRYN-YNKQ----HLAMAMTLQDLTKPG-VETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ

Query:  LVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGRV--ETPFQASMNSKSRSSDAACM
        LVQ+RMLRK+L  NG+EFG  RRLLA+QGH+NL                           SEL   EMFP+ +EPG V  E P +AS             
Subjt:  LVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGRV--ETPFQASMNSKSRSSDAACM

Query:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKK-------PHDC-AGYVQKQVFHGCVGSIP
                  +VDE+   +DSKFFLELEDLI+K       P DC + YV++Q FHG VGSIP
Subjt:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKK-------PHDC-AGYVQKQVFHGCVGSIP

XP_038890548.1 heat stress transcription factor A-9-like isoform X1 [Benincasa hispida]7.1e-12560.13Show/hide
Query:  MVQPDGGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA
        MV PD  GG     R+GA TA  EV+D     +    VKEEE    E+N                             E +TAA N        G GG  
Subjt:  MVQPDGGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA

Query:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
           KPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS++RNSFIVWDSHQ S  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
Subjt:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL

Query:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAF
        KNIKRR+R     NY KQHL +AM+   L+DLTKP  VETEL+ LKTDNN L++E+ KLR+QQQDSQNQLT VEERV+CVE K QQM  FL KMS NPAF
Subjt:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAF

Query:  FRQLVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMP
         RQLVQ+RML+K+   N +EFGKKR+ LA+QGH+NLG+E IDASRDVN E   +VQE ++ M SEL ++FP+V++ G+++TPFQA          A CMP
Subjt:  FRQLVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMP

Query:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        P     ++MVVDEEL+SN    FLELEDLIKKP DC +GYVQKQ F+  V SIP
Subjt:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

XP_038890549.1 heat stress transcription factor A-9-like isoform X2 [Benincasa hispida]9.0e-12059.28Show/hide
Query:  MVQPDGGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA
        MV PD  GG     R+GA TA  EV+D     +    VKEEE    E+N                             E +TAA N        G GG  
Subjt:  MVQPDGGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA

Query:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
           KPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS++RNSFIVWDSHQ S  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
Subjt:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL

Query:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAF
        KNIKRR+R     NY KQHL +AM+   L+DLTKP  VETEL+ LKTDNN L++E+ KLR+QQQDSQNQLT VEERV+CVE K QQM  FL KMS NPAF
Subjt:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAF

Query:  FRQLVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMP
         RQLVQ+RML+K+   N +EFGKKR+ LA+QGH+NLG+E IDASRDVN E   +VQE ++ M SEL ++FP+V++ G+++TPFQA          A CMP
Subjt:  FRQLVQKRMLRKEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMP

Query:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQK
        P     ++MVVDEEL+SN    FLELEDLIKKP DC   +++
Subjt:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQK

TrEMBL top hitse value%identityAlignment
A0A0A0LEU6 HSF_DOMAIN domain-containing protein1.4e-11363.02Show/hide
Query:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYG
        +KEE         A+  + DG      KPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLS  LLPKYFKH NFSSFIRQLNTYG
Subjt:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYG

Query:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE
        FRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N    ++HL +++   TL+DLTKP  VETE L+ L+TDNN L+VE+ KLREQQQDS NQLT VEE
Subjt:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE

Query:  RVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEV
        RVR  E K QQM  FL KMS NPAF RQL+QKRMLR ++   NG +EFGKK ++L +Q H+NLGL   D S DVN +N  QVQE +L + SEL E+FPEV
Subjt:  RVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEV

Query:  VEP---GRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        +EP   G +ETPFQAS                    E+MVVDE ++SNDS FFL+L+DL+ KP DC +GYVQKQ F+G VGSIP
Subjt:  VEP---GRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A1S3CAP8 heat shock factor protein HSF30-like isoform X15.4e-11060.96Show/hide
Query:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYF
        DG+ + S  S  + E+ +  + G+   D      + +     AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLS  LLPKYF
Subjt:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYF

Query:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE
        KH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLRE
Subjt:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE

Query:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESI
        QQQDS NQLT VEERVRC E K QQM  FL KMS NPAF RQL+QKRMLRK++   NG +EFGKKR+LLAVQ H+N                  QVQE +
Subjt:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESI

Query:  LCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
          + SEL EMFPEV+E  PG + T F+ S++                 +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  LCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A5A7V4T3 Heat shock factor protein HSF30-like isoform X11.3e-11166.2Show/hide
Query:  AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN
        AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLS  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN
Subjt:  AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN

Query:  IKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ
        IKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLREQQQDS NQLT VEERVRC E K QQM  FL KMS NPAF RQ
Subjt:  IKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ

Query:  LVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACM
        L+QKRMLRK++   NG +EFGKKR+LLAVQ H+N                  QVQE +  + SEL EMFPEV+E  PG + T F+ S++           
Subjt:  LVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACM

Query:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
              +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A5D3CFL8 Heat shock factor protein HSF30-like isoform X14.1e-11060.96Show/hide
Query:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYF
        DG+ + S  S  + E+ +  + G+   D      + +     AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLS  LLPKYF
Subjt:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYF

Query:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE
        KH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLRE
Subjt:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE

Query:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESI
        QQQDS NQLT VEERVRC E K QQM  FL KMS NPAF RQL+QKRMLRK++   NG +EFGKKR+LLAVQ H+N                  QVQE +
Subjt:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKEL---NG-NEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESI

Query:  LCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
          + SEL EMFPEV+E  PG + T F+ S++                 +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  LCMQSELNEMFPEVVE--PGRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A6J1KH95 heat stress transcription factor A-7a-like7.2e-11558.59Show/hide
Query:  GGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV
        GGG C +GAATAS ++MD       Q+TVKEEEIE                                          G G SD  GDG SASS  AKPM 
Subjt:  GGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV

Query:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LSI LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
Subjt:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR
        RYN  KQ   HL M M LQDLTK P VE EL ALK+DN  L+VE+LKLREQQ DSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQLVQ+RMLR
Subjt:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR

Query:  KEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGR--VETPFQASMNSKSRSSDAACMPPSNIFAE
        K+L  NG+EFG  RRLLA+QGH+NL                           SEL   EMFP V+EPG   +E P +AS                     
Subjt:  KEL--NGNEFGKKRRLLAVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSEL--NEMFPEVVEPGR--VETPFQASMNSKSRSSDAACMPPSNIFAE

Query:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP
         MVVDE+   +DSKFFLELEDLIKKP        DC +GYVQ+Q FHG VGSIP
Subjt:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP

SwissProt top hitse value%identityAlignment
O80982 Heat stress transcription factor A-22.4e-6254.55Show/hide
Query:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFR
        EE T    G  A+  S   GS+SS +PM GL+E GPPPFL KTYEMVEDP TD VVSWS  RNSF+VWDSH+ S  LLP+YFKH NFSSFIRQLNTYGFR
Subjt:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFR

Query:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK
        KID D+WEFANEGF  G+KHLLKNIKRR      N N+Q     M+  ++ + G + E+E LK D+  L  E+++LR+QQ  S++Q+  +E+R+   E +
Subjt:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK

Query:  QQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL
        QQQM +FL K  +NP F +Q       +K L G + G+KRRL
Subjt:  QQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL

P41152 Heat shock factor protein HSF306.2e-6342.73Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        + G  ++  PM GLH++GPPPFL KTYEMVED  TD V+SWS  RNSFIVWDSH+ S  LLP++FKH NFSSFIRQLNTYGFRK+D D+WEFANEGF GG
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ
        +KHLLK IKRR     +         + ++   G+E ELE LK D N L  EI+KLR+QQQ ++NQ+  + E++   E KQ QM SFL K+ SNP F +Q
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQ

Query:  LVQKRMLRKELNGNEFGKKRRLL---AVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMPPS
         + K++ RK+    E G+KRRL    +V G +    +P++ S  +  E+  ++    +   + ++      V P  V T     M   +       +   
Subjt:  LVQKRMLRKELNGNEFGKKRRLL---AVQGHENLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMPPS

Query:  NIFAENMVVDEELTSNDSKFFLELEDLIKK
        ++ + +   +E +     +F +E+EDL+ K
Subjt:  NIFAENMVVDEELTSNDSKFFLELEDLIKK

Q338B0 Heat stress transcription factor A-2c1.5e-5653.15Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        DGG+A   +PM GLHE+GPPPFL KTY++VEDP TD VVSWS+A NSF+VWD H  +  LLP+ FKH NFSSF+RQLNTYGFRK+D D+WEFANEGF  G
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMT-LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFR
        ++HLLK IKRR   +        ++T   ++ + G E E++ LK D N L  E++KLR++QQ +++ +  +E+R+R  E KQ QM  FL +   NP FF+
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMT-LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFR

Query:  QLVQKRMLRKELNGNEFGKKRR
        QL Q++  RKEL  +   KKRR
Subjt:  QLVQKRMLRKELNGNEFGKKRR

Q6F388 Heat stress transcription factor A-2e4.7e-5545.42Show/hide
Query:  KPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNI
        +PM GL + GPPPFL KTY+MV+DP TD VVSWS   NSF+VWD H     LLP+YFKH NFSSF+RQLNTYGFRK+D DKWEFANEGF  G+KHLLK+I
Subjt:  KPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNI

Query:  KRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR
        KRR   N +    ++   L ++   G E E++ LK D + L  E++KLR++QQ++++ L  +E++++  E KQQ M +FL+++  NP F RQL  +  +R
Subjt:  KRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR

Query:  KELNGNEF-GKKRRLLAVQGHE----NLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMPPSN
        KEL   EF  KKRR    QG E      G  P   S+ V  E H  V      + S+L     E           +A  +  S SS+   + PSN
Subjt:  KELNGNEF-GKKRRLLAVQGHE----NLGLEPIDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMPPSN

Q9LVW2 Heat stress transcription factor A-93.3e-5654.63Show/hide
Query:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        LHEIG   PFL+KT+E+V+D  TDPVVSWS  R SFI+WDS++ S NLLPKYFKH NFSSFIRQLN+YGF+K+DSD+WEFANEGFQGGKKHLLKNIKRRS
Subjt:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR--KE
        +        A   T         ETE+E+LK + + +++E+LKL++QQ++SQ+Q+  V+E++  V+ +QQ M SF  K++ +  F  +LV+KR ++  +E
Subjt:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR--KE

Query:  LNGNEFGKKRRLLAVQ
        L   EF KK +LL  Q
Subjt:  LNGNEFGKKRRLLAVQ

Arabidopsis top hitse value%identityAlignment
AT2G26150.1 heat shock transcription factor A21.7e-6354.55Show/hide
Query:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFR
        EE T    G  A+  S   GS+SS +PM GL+E GPPPFL KTYEMVEDP TD VVSWS  RNSF+VWDSH+ S  LLP+YFKH NFSSFIRQLNTYGFR
Subjt:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFR

Query:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK
        KID D+WEFANEGF  G+KHLLKNIKRR      N N+Q     M+  ++ + G + E+E LK D+  L  E+++LR+QQ  S++Q+  +E+R+   E +
Subjt:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK

Query:  QQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL
        QQQM +FL K  +NP F +Q       +K L G + G+KRRL
Subjt:  QQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL

AT3G22830.1 heat shock transcription factor A6B4.4e-5647.7Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        D  + S  +P+ GLHE GPPPFL KTY++VED  T+ VVSWS++ NSFIVWD    S+ LLP++FKH NFSSF+RQLNTYGFRK++ D+WEFANEGF  G
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQ--------DLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMS
        +KHLLKNI+RR   N + Q      + Q        ++ + G++ E+++L+ D   L +E+++LR+QQQ ++  LT +EE+++  E KQ+QM SFL +  
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQ--------DLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMS

Query:  SNPAFFRQLVQKRMLRKELNGNEFGKKRRLLAVQGHENL
         NP F +QLV+++  RKE+      KKR+    QG  N+
Subjt:  SNPAFFRQLVQKRMLRKELNGNEFGKKRRLLAVQGHENL

AT4G17750.1 heat shock factor 17.3e-5147.62Show/hide
Query:  DGASDCSGDGGSASSAKPMVGLHEIG-------PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKI
        DG +    + G A +A P    H          PPPFL KTY+MVEDP TD +VSWS   NSFIVWD  + S +LLPKYFKH NFSSF+RQLNTYGFRK+
Subjt:  DGASDCSGDGGSASSAKPMVGLHEIG-------PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKI

Query:  DSDKWEFANEGFQGGKKHLLKNIKRR--------SRYNYNKQHL-------AMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEV
        D D+WEFANEGF  G+KHLLK I RR        S  N   Q L       A   +  ++ K G+E E+E LK D N L  E++KLR+QQQ + N+L  +
Subjt:  DSDKWEFANEGFQGGKKHLLKNIKRR--------SRYNYNKQHL-------AMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEV

Query:  EERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL
         + ++ +E +QQQ+ SFL K   NP F  Q +QK+     ++  E  KKRRL
Subjt:  EERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRL

AT5G16820.1 heat shock factor 34.7e-5050.23Show/hide
Query:  PPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRR--SRYNYN
        PPFL KTY+MV+DP T+ VVSWS   NSF+VW + + S  LLPKYFKH NFSSF+RQLNTYGFRK+D D+WEFANEGF  G+K LLK+I RR  S    N
Subjt:  PPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRR--SRYNYN

Query:  KQHLAMAMT----LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRML--RKEL
        +Q   +  +      ++ K G+E E+E LK D N L  E+++LR+QQQ ++NQL  V ++V+ +E +QQQM SFL K   +P F  QLVQ+      +++
Subjt:  KQHLAMAMT----LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRML--RKEL

Query:  NGNEFGKKRRLLAVQGHENLG
         G+    K+R L V   EN G
Subjt:  NGNEFGKKRRLLAVQGHENLG

AT5G54070.1 heat shock transcription factor A92.3e-5754.63Show/hide
Query:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        LHEIG   PFL+KT+E+V+D  TDPVVSWS  R SFI+WDS++ S NLLPKYFKH NFSSFIRQLN+YGF+K+DSD+WEFANEGFQGGKKHLLKNIKRRS
Subjt:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR--KE
        +        A   T         ETE+E+LK + + +++E+LKL++QQ++SQ+Q+  V+E++  V+ +QQ M SF  K++ +  F  +LV+KR ++  +E
Subjt:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLR--KE

Query:  LNGNEFGKKRRLLAVQ
        L   EF KK +LL  Q
Subjt:  LNGNEFGKKRRLLAVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCAGCCTGACGGTGGCGGTGGCGGTGGCGGTGTTTGCCGCGAAGGCGCTGCAACTGCAAGCGATGAAGTCATGGATGTATTAAACGAGCCACAGATTCAAAAGAC
TGTTAAGGAAGAGGAGATAGAGGAGCAAGAGCTCAATCGCAATAACAATGGTAATAGTTTCAATAACGACCTAATTCTATTAGATGGATCGGCTTCTTCTTCTTGTTCTT
CTTGTTTGATTAAAGAAGAAACTACGGCGGCGATGAACGGCGACGGAGCTTCTGATTGCAGCGGCGATGGCGGTTCTGCGTCATCGGCGAAACCAATGGTAGGGTTGCAT
GAGATCGGGCCGCCGCCGTTTCTGAAGAAGACGTATGAGATGGTGGAGGATCCGGAAACCGACCCGGTTGTATCGTGGAGTCAAGCTCGCAATAGCTTCATTGTTTGGGA
TTCTCATCAACTCTCCATAAATCTTCTCCCCAAATACTTCAAGCACTGCAATTTCTCCAGCTTCATTCGCCAGCTTAATACTTATGGTTTTAGGAAAATTGATTCTGATA
AATGGGAGTTTGCAAATGAAGGGTTTCAGGGAGGGAAGAAACATTTGCTCAAGAATATTAAGAGAAGAAGCAGGTACAATTACAACAAGCAGCATTTAGCCATGGCAATG
ACTTTACAAGATTTGACAAAGCCAGGAGTGGAAACAGAGCTTGAAGCTCTGAAAACTGATAACAACTTTTTGAAAGTAGAGATCTTGAAGCTCAGAGAGCAGCAGCAGGA
CTCACAGAACCAACTCACCGAGGTCGAAGAGCGCGTCCGGTGCGTTGAGTGCAAGCAGCAACAGATGTCCTCTTTCCTTACCAAAATGTCGAGCAACCCCGCCTTTTTCC
GACAGTTGGTCCAGAAGAGAATGCTTAGAAAGGAGCTGAATGGGAATGAGTTTGGAAAGAAAAGGAGATTACTTGCTGTGCAAGGCCATGAAAATCTCGGCCTCGAGCCA
ATCGATGCTTCTCGTGATGTTAATTGTGAAAACCATGTCCAAGTTCAAGAAAGCATTCTGTGCATGCAGTCTGAGCTCAATGAAATGTTTCCAGAAGTTGTCGAACCTGG
ACGGGTCGAAACGCCATTCCAAGCATCGATGAATAGTAAATCAAGAAGTTCAGATGCTGCTTGTATGCCACCATCCAATATATTTGCAGAGAATATGGTGGTTGATGAAG
AATTGACTTCTAATGACTCCAAATTCTTTCTGGAACTGGAGGATCTGATTAAGAAGCCTCATGATTGTGCTGGTTATGTACAGAAACAAGTTTTCCATGGCTGTGTTGGA
TCCATTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCAGCCTGACGGTGGCGGTGGCGGTGGCGGTGTTTGCCGCGAAGGCGCTGCAACTGCAAGCGATGAAGTCATGGATGTATTAAACGAGCCACAGATTCAAAAGAC
TGTTAAGGAAGAGGAGATAGAGGAGCAAGAGCTCAATCGCAATAACAATGGTAATAGTTTCAATAACGACCTAATTCTATTAGATGGATCGGCTTCTTCTTCTTGTTCTT
CTTGTTTGATTAAAGAAGAAACTACGGCGGCGATGAACGGCGACGGAGCTTCTGATTGCAGCGGCGATGGCGGTTCTGCGTCATCGGCGAAACCAATGGTAGGGTTGCAT
GAGATCGGGCCGCCGCCGTTTCTGAAGAAGACGTATGAGATGGTGGAGGATCCGGAAACCGACCCGGTTGTATCGTGGAGTCAAGCTCGCAATAGCTTCATTGTTTGGGA
TTCTCATCAACTCTCCATAAATCTTCTCCCCAAATACTTCAAGCACTGCAATTTCTCCAGCTTCATTCGCCAGCTTAATACTTATGGTTTTAGGAAAATTGATTCTGATA
AATGGGAGTTTGCAAATGAAGGGTTTCAGGGAGGGAAGAAACATTTGCTCAAGAATATTAAGAGAAGAAGCAGGTACAATTACAACAAGCAGCATTTAGCCATGGCAATG
ACTTTACAAGATTTGACAAAGCCAGGAGTGGAAACAGAGCTTGAAGCTCTGAAAACTGATAACAACTTTTTGAAAGTAGAGATCTTGAAGCTCAGAGAGCAGCAGCAGGA
CTCACAGAACCAACTCACCGAGGTCGAAGAGCGCGTCCGGTGCGTTGAGTGCAAGCAGCAACAGATGTCCTCTTTCCTTACCAAAATGTCGAGCAACCCCGCCTTTTTCC
GACAGTTGGTCCAGAAGAGAATGCTTAGAAAGGAGCTGAATGGGAATGAGTTTGGAAAGAAAAGGAGATTACTTGCTGTGCAAGGCCATGAAAATCTCGGCCTCGAGCCA
ATCGATGCTTCTCGTGATGTTAATTGTGAAAACCATGTCCAAGTTCAAGAAAGCATTCTGTGCATGCAGTCTGAGCTCAATGAAATGTTTCCAGAAGTTGTCGAACCTGG
ACGGGTCGAAACGCCATTCCAAGCATCGATGAATAGTAAATCAAGAAGTTCAGATGCTGCTTGTATGCCACCATCCAATATATTTGCAGAGAATATGGTGGTTGATGAAG
AATTGACTTCTAATGACTCCAAATTCTTTCTGGAACTGGAGGATCTGATTAAGAAGCCTCATGATTGTGCTGGTTATGTACAGAAACAAGTTTTCCATGGCTGTGTTGGA
TCCATTCCATGA
Protein sequenceShow/hide protein sequence
MVQPDGGGGGGGVCREGAATASDEVMDVLNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASSAKPMVGLH
EIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSINLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYNKQHLAMAM
TLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPAFFRQLVQKRMLRKELNGNEFGKKRRLLAVQGHENLGLEP
IDASRDVNCENHVQVQESILCMQSELNEMFPEVVEPGRVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQKQVFHGCVG
SIP