; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007046 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007046
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHeat shock transcription factor A9
Genome locationchr6:48329260..48331144
RNA-Seq ExpressionLag0007046
SyntenyLag0007046
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141170.1 heat stress transcription factor A-7a isoform X1 [Cucumis sativus]6.3e-11363.54Show/hide
Query:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYG
        +KEE         A+  + DG      KPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLSK LLPKYFKH NFSSFIRQLNTYG
Subjt:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYG

Query:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE
        FRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N    ++HL +++   TL+DLTKP  VETE L+ L+TDNN L+VE+ KLREQQQDS NQLT VEE
Subjt:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE

Query:  RVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KELN--EIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEV
        RVR  E K QQM  FL KMS NP F RQL QKRMLR   ELN  + EFGKK ++L +Q H+N GL   D S DVN +N  QVQE LLS+ SEL E+FPEV
Subjt:  RVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KELN--EIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEV

Query:  VEP---GLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        +EP   G +ETPFQAS                    E+MVVDE ++SNDS FFL+L+DL+ KP DC +GYVQKQ F+G VGSIP
Subjt:  VEP---GLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

XP_022999504.1 heat stress transcription factor A-7a-like [Cucurbita maxima]5.3e-11258.15Show/hide
Query:  GGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV
        GGG C +GAATAS +IMD       Q+TVKEEEIE                                          G G SD  GDG SASS  AKPM 
Subjt:  GGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV

Query:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LS  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
Subjt:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR
        RYN  KQ   HL M M LQDLTK P VE EL ALK+DN  L+VE+LKLREQQ DSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQL Q+RMLR
Subjt:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR

Query:  KEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSEL--NEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPPSNIFAE
        K+L  N  EFG  RRLLAMQGH+N                            SEL   EMFP V+EPG  ++E P +AS                     
Subjt:  KEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSEL--NEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPPSNIFAE

Query:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP
         MVVDE+   +DSKFFLELEDLIKKP        DC +GYVQ+Q FHG VGSIP
Subjt:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP

XP_023546638.1 heat stress transcription factor A-2-like [Cucurbita pepo subsp. pepo]9.1e-11257.39Show/hide
Query:  GGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDC--SGDGGSASS-
        GGGGGGG C +GAATAS +IMD       Q++VKEEE+E                                +  ET     G G SDC   GDGG ASS 
Subjt:  GGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDC--SGDGGSASS-

Query:  -AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK
         AKPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LS  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK
Subjt:  -AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLK

Query:  NIKRRSRYN-YNKQ----HLAMAMTLQDLTKPG-VETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ
        NIKRRSRYN Y KQ    HL M M LQDL+K   VE EL+ALK+DNN L+VE+LKLREQQQDSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQ
Subjt:  NIKRRSRYN-YNKQ----HLAMAMTLQDLTKPG-VETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ

Query:  LAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPP
        L Q+RMLRK+L  N  EFG  RRLLAMQGH+N                       LLS      EMFP+ +EPG  ++E P +AS               
Subjt:  LAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPP

Query:  SNIFAENMVVDEELTSNDSKFFLELEDLIKK-------PHDC-AGYVQKQVFHGCVGSIP
                +VDE+   +DSKFFLELEDLI+K       P DC + YV++Q FHG VGSIP
Subjt:  SNIFAENMVVDEELTSNDSKFFLELEDLIKK-------PHDC-AGYVQKQVFHGCVGSIP

XP_038890548.1 heat stress transcription factor A-9-like isoform X1 [Benincasa hispida]1.0e-12359.69Show/hide
Query:  MVQPDGGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA
        MV PD  GG     R+GA TA  E++D     +    VKEEE    E+N                             E +TAA N        G GG  
Subjt:  MVQPDGGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA

Query:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
           KPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS++RNSFIVWDSHQ SK LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
Subjt:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL

Query:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTF
        KNIKRR+R     NY KQHL +AM+   L+DLTKP  VETEL+ LKTDNN L++E+ KLR+QQQDSQNQLT VEERV+CVE K QQM  FL KMS NP F
Subjt:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTF

Query:  FRQLAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMP
         RQL Q+RML+K+   N  EFGKKR+ LA+QGH+N G++ IDASRDVN E   +VQE L+SM SEL ++FP+V++ G ++TPFQA          A CMP
Subjt:  FRQLAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMP

Query:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        P     ++MVVDEEL+SN    FLELEDLIKKP DC +GYVQKQ F+  V SIP
Subjt:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

XP_038890549.1 heat stress transcription factor A-9-like isoform X2 [Benincasa hispida]1.3e-11858.82Show/hide
Query:  MVQPDGGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA
        MV PD  GG     R+GA TA  E++D     +    VKEEE    E+N                             E +TAA N        G GG  
Subjt:  MVQPDGGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSA

Query:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
           KPM GLHE+GP PFLKKTYEMVEDPETDPVVSWS++RNSFIVWDSHQ SK LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL
Subjt:  SSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLL

Query:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTF
        KNIKRR+R     NY KQHL +AM+   L+DLTKP  VETEL+ LKTDNN L++E+ KLR+QQQDSQNQLT VEERV+CVE K QQM  FL KMS NP F
Subjt:  KNIKRRSR----YNYNKQHLAMAMT---LQDLTKP-GVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTF

Query:  FRQLAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMP
         RQL Q+RML+K+   N  EFGKKR+ LA+QGH+N G++ IDASRDVN E   +VQE L+SM SEL ++FP+V++ G ++TPFQA          A CMP
Subjt:  FRQLAQKRMLRKEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMP

Query:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQK
        P     ++MVVDEEL+SN    FLELEDLIKKP DC   +++
Subjt:  PSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQK

TrEMBL top hitse value%identityAlignment
A0A0A0LEU6 HSF_DOMAIN domain-containing protein3.0e-11363.54Show/hide
Query:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYG
        +KEE         A+  + DG      KPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLSK LLPKYFKH NFSSFIRQLNTYG
Subjt:  IKEETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYG

Query:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE
        FRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N    ++HL +++   TL+DLTKP  VETE L+ L+TDNN L+VE+ KLREQQQDS NQLT VEE
Subjt:  FRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN----KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEE

Query:  RVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KELN--EIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEV
        RVR  E K QQM  FL KMS NP F RQL QKRMLR   ELN  + EFGKK ++L +Q H+N GL   D S DVN +N  QVQE LLS+ SEL E+FPEV
Subjt:  RVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KELN--EIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEV

Query:  VEP---GLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
        +EP   G +ETPFQAS                    E+MVVDE ++SNDS FFL+L+DL+ KP DC +GYVQKQ F+G VGSIP
Subjt:  VEP---GLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A1S3CAP8 heat shock factor protein HSF30-like isoform X12.7e-10960.45Show/hide
Query:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYF
        DG+ + S  S  + E+ +  + G+   D      + +     AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLSK LLPKYF
Subjt:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYF

Query:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE
        KH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLRE
Subjt:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE

Query:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESL
        QQQDS NQLT VEERVRC E K QQM  FL KMS NP F RQL QKRMLRK++     + EFGKKR+LLA+Q H+N                  QVQE L
Subjt:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESL

Query:  LSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
         ++ SEL EMFPEV+E  PG + T F+ S++                 +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  LSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A5A7V4T3 Heat shock factor protein HSF30-like isoform X16.3e-11165.63Show/hide
Query:  AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN
        AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLSK LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN
Subjt:  AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKN

Query:  IKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ
        IKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLREQQQDS NQLT VEERVRC E K QQM  FL KMS NP F RQ
Subjt:  IKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ

Query:  LAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACM
        L QKRMLRK++     + EFGKKR+LLA+Q H+N                  QVQE L ++ SEL EMFPEV+E  PG + T F+ S++           
Subjt:  LAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACM

Query:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
              +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  PPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A5D3CFL8 Heat shock factor protein HSF30-like isoform X12.0e-10960.45Show/hide
Query:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYF
        DG+ + S  S  + E+ +  + G+   D      + +     AKPM GLH++GPPPFLKKTYEMVEDPETDPVVSWS+ R SFIVWDSHQLSK LLPKYF
Subjt:  DGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS----AKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYF

Query:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE
        KH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKR+++YN N  KQHL +++   TL+DLTKP  VETE L+ LKTDNN L+VE+ KLRE
Subjt:  KHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYN--KQHLAMAM---TLQDLTKP-GVETE-LEALKTDNNFLKVEILKLRE

Query:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESL
        QQQDS NQLT VEERVRC E K QQM  FL KMS NP F RQL QKRMLRK++     + EFGKKR+LLA+Q H+N                  QVQE L
Subjt:  QQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKEL----NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESL

Query:  LSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP
         ++ SEL EMFPEV+E  PG + T F+ S++                 +ENMVVDE L+SNDS  FL+L+DLIKKP DC +GYVQKQ F+G VGS+P
Subjt:  LSMQSELNEMFPEVVE--PGLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDC-AGYVQKQVFHGCVGSIP

A0A6J1KH95 heat stress transcription factor A-7a-like2.6e-11258.15Show/hide
Query:  GGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV
        GGG C +GAATAS +IMD       Q+TVKEEEIE                                          G G SD  GDG SASS  AKPM 
Subjt:  GGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASS--AKPMV

Query:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        GLHE+GP PFLKKTYEMVEDPETDPVVSWS+  NSFIVWDSH+LS  LLPKYFKH NFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
Subjt:  GLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR
        RYN  KQ   HL M M LQDLTK P VE EL ALK+DN  L+VE+LKLREQQ DSQNQLT VE+RVRCVE K QQM SF++KMS NP F RQL Q+RMLR
Subjt:  RYNYNKQ---HLAMAMTLQDLTK-PGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR

Query:  KEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSEL--NEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPPSNIFAE
        K+L  N  EFG  RRLLAMQGH+N                            SEL   EMFP V+EPG  ++E P +AS                     
Subjt:  KEL--NEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSEL--NEMFPEVVEPG--LVETPFQASMNSKSRSSDAACMPPSNIFAE

Query:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP
         MVVDE+   +DSKFFLELEDLIKKP        DC +GYVQ+Q FHG VGSIP
Subjt:  NMVVDEELTSNDSKFFLELEDLIKKP-------HDC-AGYVQKQVFHGCVGSIP

SwissProt top hitse value%identityAlignment
O80982 Heat stress transcription factor A-21.8e-6254.55Show/hide
Query:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFR
        EE T    G  A+  S   GS+SS +PM GL+E GPPPFL KTYEMVEDP TD VVSWS  RNSF+VWDSH+ S  LLP+YFKH NFSSFIRQLNTYGFR
Subjt:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFR

Query:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK
        KID D+WEFANEGF  G+KHLLKNIKRR      N N+Q     M+  ++ + G + E+E LK D+  L  E+++LR+QQ  S++Q+  +E+R+   E +
Subjt:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK

Query:  QQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL
        QQQM +FL K  +NP F +Q A     +K L  ++ G+KRRL
Subjt:  QQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL

P41152 Heat shock factor protein HSF301.2e-6343.43Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        + G  ++  PM GLH++GPPPFL KTYEMVED  TD V+SWS  RNSFIVWDSH+ S  LLP++FKH NFSSFIRQLNTYGFRK+D D+WEFANEGF GG
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ
        +KHLLK IKRR     +         + ++   G+E ELE LK D N L  EI+KLR+QQQ ++NQ+  + E++   E KQ QM SFL K+ SNPTF +Q
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQ

Query:  LAQKRMLRKELNEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMPPSNIF
           K++ RK+   IE G+KRR L M        +P++ S  +  E+  ++    +   + ++      V P  V T     M   +       +   ++ 
Subjt:  LAQKRMLRKELNEIEFGKKRRLLAMQGHENFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMPPSNIF

Query:  AENMVVDEELTSNDSKFFLELEDLIKK
        + +   +E +     +F +E+EDL+ K
Subjt:  AENMVVDEELTSNDSKFFLELEDLIKK

Q338B0 Heat stress transcription factor A-2c1.7e-5753.15Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        DGG+A   +PM GLHE+GPPPFL KTY++VEDP TD VVSWS+A NSF+VWD H  +  LLP+ FKH NFSSF+RQLNTYGFRK+D D+WEFANEGF  G
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMT-LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFR
        ++HLLK IKRR   +        ++T   ++ + G E E++ LK D N L  E++KLR++QQ +++ +  +E+R+R  E KQ QM  FL +   NP FF+
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMT-LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFR

Query:  QLAQKRMLRKELNEIEFGKKRR
        QLAQ++  RKEL +    K+RR
Subjt:  QLAQKRMLRKELNEIEFGKKRR

Q6F388 Heat stress transcription factor A-2e5.6e-5645.24Show/hide
Query:  KPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNI
        +PM GL + GPPPFL KTY+MV+DP TD VVSWS   NSF+VWD H     LLP+YFKH NFSSF+RQLNTYGFRK+D DKWEFANEGF  G+KHLLK+I
Subjt:  KPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNI

Query:  KRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR
        KRR   N +    ++   L ++   G E E++ LK D + L  E++KLR++QQ++++ L  +E++++  E KQQ M +FL+++  NP F RQL  +  +R
Subjt:  KRRSRYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR

Query:  KELNEIEFGKKRRLLAMQGHE----NFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMPPSN
        KEL E    KKRR    QG E      G  P   S+ V  E H  V      + S+L     E           +A  +  S SS+   + PSN
Subjt:  KELNEIEFGKKRRLLAMQGHE----NFGLKPIDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMPPSN

Q9LVW2 Heat stress transcription factor A-95.6e-5654.17Show/hide
Query:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        LHEIG   PFL+KT+E+V+D  TDPVVSWS  R SFI+WDS++ S+NLLPKYFKH NFSSFIRQLN+YGF+K+DSD+WEFANEGFQGGKKHLLKNIKRRS
Subjt:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KE
        +        A   T         ETE+E+LK + + +++E+LKL++QQ++SQ+Q+  V+E++  V+ +QQ M SF  K++ +  F  +L +KR ++  +E
Subjt:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KE

Query:  LNEIEFGKKRRLLAMQ
        L   EF KK +LL  Q
Subjt:  LNEIEFGKKRRLLAMQ

Arabidopsis top hitse value%identityAlignment
AT2G26150.1 heat shock transcription factor A21.3e-6354.55Show/hide
Query:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFR
        EE T    G  A+  S   GS+SS +PM GL+E GPPPFL KTYEMVEDP TD VVSWS  RNSF+VWDSH+ S  LLP+YFKH NFSSFIRQLNTYGFR
Subjt:  EETTAAMNGDGASDCSGDGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFR

Query:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK
        KID D+WEFANEGF  G+KHLLKNIKRR      N N+Q     M+  ++ + G + E+E LK D+  L  E+++LR+QQ  S++Q+  +E+R+   E +
Subjt:  KIDSDKWEFANEGFQGGKKHLLKNIKRRSR---YNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECK

Query:  QQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL
        QQQM +FL K  +NP F +Q A     +K L  ++ G+KRRL
Subjt:  QQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL

AT3G22830.1 heat shock transcription factor A6B2.9e-5547.9Show/hide
Query:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG
        D  + S  +P+ GLHE GPPPFL KTY++VED  T+ VVSWS++ NSFIVWD    S  LLP++FKH NFSSF+RQLNTYGFRK++ D+WEFANEGF  G
Subjt:  DGGSASSAKPMVGLHEIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGG

Query:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQ--------DLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMS
        +KHLLKNI+RR   N + Q      + Q        ++ + G++ E+++L+ D   L +E+++LR+QQQ ++  LT +EE+++  E KQ+QM SFL +  
Subjt:  KKHLLKNIKRRSRYNYNKQHLAMAMTLQ--------DLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMS

Query:  SNPTFFRQLAQKRMLRKELNEIEFGKKRRLLAMQGHEN
         NP F +QL +++  RKE+ E    KKR+    QG  N
Subjt:  SNPTFFRQLAQKRMLRKELNEIEFGKKRRLLAMQGHEN

AT4G17750.1 heat shock factor 11.9e-5148.02Show/hide
Query:  DGASDCSGDGGSASSAKPMVGLHEIG-------PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKI
        DG +    + G A +A P    H          PPPFL KTY+MVEDP TD +VSWS   NSFIVWD  + S++LLPKYFKH NFSSF+RQLNTYGFRK+
Subjt:  DGASDCSGDGGSASSAKPMVGLHEIG-------PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKI

Query:  DSDKWEFANEGFQGGKKHLLKNIKRR--------SRYNYNKQHL-------AMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEV
        D D+WEFANEGF  G+KHLLK I RR        S  N   Q L       A   +  ++ K G+E E+E LK D N L  E++KLR+QQQ + N+L  +
Subjt:  DSDKWEFANEGFQGGKKHLLKNIKRR--------SRYNYNKQHL-------AMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEV

Query:  EERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL
         + ++ +E +QQQ+ SFL K   NPTF  Q  QK+     ++  E  KKRRL
Subjt:  EERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRL

AT5G16820.1 heat shock factor 34.7e-5050.23Show/hide
Query:  PPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRR--SRYNYN
        PPFL KTY+MV+DP T+ VVSWS   NSF+VW + + SK LLPKYFKH NFSSF+RQLNTYGFRK+D D+WEFANEGF  G+K LLK+I RR  S    N
Subjt:  PPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRR--SRYNYN

Query:  KQHLAMAMT----LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNE
        +Q   +  +      ++ K G+E E+E LK D N L  E+++LR+QQQ ++NQL  V ++V+ +E +QQQM SFL K   +P F  QL Q+        +
Subjt:  KQHLAMAMT----LQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNE

Query:  IEFGKKRRLLAMQGHENFG
        I    K+R L +   EN G
Subjt:  IEFGKKRRLLAMQGHENFG

AT5G54070.1 heat shock transcription factor A94.0e-5754.17Show/hide
Query:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS
        LHEIG   PFL+KT+E+V+D  TDPVVSWS  R SFI+WDS++ S+NLLPKYFKH NFSSFIRQLN+YGF+K+DSD+WEFANEGFQGGKKHLLKNIKRRS
Subjt:  LHEIG-PPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRS

Query:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KE
        +        A   T         ETE+E+LK + + +++E+LKL++QQ++SQ+Q+  V+E++  V+ +QQ M SF  K++ +  F  +L +KR ++  +E
Subjt:  RYNYNKQHLAMAMTLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLR--KE

Query:  LNEIEFGKKRRLLAMQ
        L   EF KK +LL  Q
Subjt:  LNEIEFGKKRRLLAMQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCAGCCTGACGGTGGCGGTGGCGGTGGCGGTGTCTGCCGCGAAGGCGCTGCAACTGCAAGCGATGAAATCATGGATGTATTCAACGAGCCACAGATTCAAAAGAC
TGTTAAGGAAGAGGAGATAGAGGAGCAAGAGCTCAATCGCAATAACAATGGTAATAGTTTCAATAACGACCTAATTCTATTAGATGGATCGGCTTCTTCTTCTTGTTCTT
CTTGTTTAATTAAAGAAGAAACCACGGCGGCGATGAACGGCGACGGAGCTTCTGATTGCAGCGGCGATGGCGGTTCTGCGTCATCGGCGAAACCAATGGTAGGGTTGCAT
GAGATCGGGCCGCCGCCGTTTCTGAAGAAGACGTATGAGATGGTGGAGGATCCGGAAACCGACCCGGTTGTATCGTGGAGTCAAGCTCGCAATAGCTTCATTGTTTGGGA
TTCTCATCAACTCTCCAAAAATCTTCTCCCCAAATACTTCAAGCACTGCAATTTCTCCAGCTTCATTCGCCAGCTTAATACTTATGGTTTTAGGAAGATTGATTCTGATA
AATGGGAGTTTGCAAATGAAGGGTTTCAGGGAGGGAAGAAACATTTGCTCAAGAATATTAAGAGAAGAAGCAGGTACAATTACAACAAGCAGCATTTAGCCATGGCAATG
ACTTTACAAGATTTGACAAAGCCAGGAGTGGAAACAGAGCTTGAAGCTCTGAAAACTGATAACAACTTTTTGAAAGTAGAGATCTTGAAGCTCAGAGAGCAGCAGCAGGA
CTCACAGAACCAACTCACCGAGGTCGAAGAGCGCGTCCGGTGCGTCGAGTGCAAGCAGCAACAGATGTCCTCTTTCCTTACCAAAATGTCGAGCAACCCCACCTTTTTCC
GACAGTTGGCCCAGAAGAGAATGCTTAGAAAGGAGCTGAATGAGATTGAGTTTGGAAAGAAAAGGAGATTACTTGCAATGCAAGGCCATGAAAATTTCGGCCTCAAGCCA
ATCGATGCTTCTCGGGATGTTAATTGTGAAAACCATGTCCAAGTCCAAGAAAGCCTTCTGAGCATGCAGTCTGAGCTCAATGAAATGTTTCCAGAAGTTGTCGAACCAGG
ACTGGTCGAAACGCCATTCCAAGCATCGATGAATAGTAAATCAAGAAGTTCAGATGCTGCTTGTATGCCACCATCCAATATATTTGCAGAGAATATGGTGGTTGATGAAG
AATTGACTTCTAATGACTCCAAATTCTTTCTGGAACTGGAGGATCTGATCAAGAAGCCTCATGATTGTGCTGGTTATGTACAGAAACAAGTTTTCCATGGCTGTGTTGGA
TCCATTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCAGCCTGACGGTGGCGGTGGCGGTGGCGGTGTCTGCCGCGAAGGCGCTGCAACTGCAAGCGATGAAATCATGGATGTATTCAACGAGCCACAGATTCAAAAGAC
TGTTAAGGAAGAGGAGATAGAGGAGCAAGAGCTCAATCGCAATAACAATGGTAATAGTTTCAATAACGACCTAATTCTATTAGATGGATCGGCTTCTTCTTCTTGTTCTT
CTTGTTTAATTAAAGAAGAAACCACGGCGGCGATGAACGGCGACGGAGCTTCTGATTGCAGCGGCGATGGCGGTTCTGCGTCATCGGCGAAACCAATGGTAGGGTTGCAT
GAGATCGGGCCGCCGCCGTTTCTGAAGAAGACGTATGAGATGGTGGAGGATCCGGAAACCGACCCGGTTGTATCGTGGAGTCAAGCTCGCAATAGCTTCATTGTTTGGGA
TTCTCATCAACTCTCCAAAAATCTTCTCCCCAAATACTTCAAGCACTGCAATTTCTCCAGCTTCATTCGCCAGCTTAATACTTATGGTTTTAGGAAGATTGATTCTGATA
AATGGGAGTTTGCAAATGAAGGGTTTCAGGGAGGGAAGAAACATTTGCTCAAGAATATTAAGAGAAGAAGCAGGTACAATTACAACAAGCAGCATTTAGCCATGGCAATG
ACTTTACAAGATTTGACAAAGCCAGGAGTGGAAACAGAGCTTGAAGCTCTGAAAACTGATAACAACTTTTTGAAAGTAGAGATCTTGAAGCTCAGAGAGCAGCAGCAGGA
CTCACAGAACCAACTCACCGAGGTCGAAGAGCGCGTCCGGTGCGTCGAGTGCAAGCAGCAACAGATGTCCTCTTTCCTTACCAAAATGTCGAGCAACCCCACCTTTTTCC
GACAGTTGGCCCAGAAGAGAATGCTTAGAAAGGAGCTGAATGAGATTGAGTTTGGAAAGAAAAGGAGATTACTTGCAATGCAAGGCCATGAAAATTTCGGCCTCAAGCCA
ATCGATGCTTCTCGGGATGTTAATTGTGAAAACCATGTCCAAGTCCAAGAAAGCCTTCTGAGCATGCAGTCTGAGCTCAATGAAATGTTTCCAGAAGTTGTCGAACCAGG
ACTGGTCGAAACGCCATTCCAAGCATCGATGAATAGTAAATCAAGAAGTTCAGATGCTGCTTGTATGCCACCATCCAATATATTTGCAGAGAATATGGTGGTTGATGAAG
AATTGACTTCTAATGACTCCAAATTCTTTCTGGAACTGGAGGATCTGATCAAGAAGCCTCATGATTGTGCTGGTTATGTACAGAAACAAGTTTTCCATGGCTGTGTTGGA
TCCATTCCATGA
Protein sequenceShow/hide protein sequence
MVQPDGGGGGGGVCREGAATASDEIMDVFNEPQIQKTVKEEEIEEQELNRNNNGNSFNNDLILLDGSASSSCSSCLIKEETTAAMNGDGASDCSGDGGSASSAKPMVGLH
EIGPPPFLKKTYEMVEDPETDPVVSWSQARNSFIVWDSHQLSKNLLPKYFKHCNFSSFIRQLNTYGFRKIDSDKWEFANEGFQGGKKHLLKNIKRRSRYNYNKQHLAMAM
TLQDLTKPGVETELEALKTDNNFLKVEILKLREQQQDSQNQLTEVEERVRCVECKQQQMSSFLTKMSSNPTFFRQLAQKRMLRKELNEIEFGKKRRLLAMQGHENFGLKP
IDASRDVNCENHVQVQESLLSMQSELNEMFPEVVEPGLVETPFQASMNSKSRSSDAACMPPSNIFAENMVVDEELTSNDSKFFLELEDLIKKPHDCAGYVQKQVFHGCVG
SIP