; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005604 (gene) of Snake gourd v1 genome

Gene IDTan0005604
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionheat stress transcription factor B-4-like
Genome locationLG01:113483497..113487480
RNA-Seq ExpressionTan0005604
SyntenyTan0005604
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0008356 - asymmetric cell division (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608467.1 Heat stress transcription factor B-4, partial [Cucurbita argyrosperma subsp. sororia]1.8e-17489.43Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVA--APNFNNSVTALSEDNERLRRSNN
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQS+SP LG+ N  FYH+PAR+SISPSDSDD NPNWCDSPPLSS    +    NFNNSVTALSEDNERLRRSNN
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVA--APNFNNSVTALSEDNERLRRSNN

Query:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE
        MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPV QR PNHHLLGYHPTNAKQVSGQTH ++GTPN       NNNGSSKSFVTI+EE
Subjt:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE

Query:  PKTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        PKTKLFGV++QSKKR+HPEYGSNNI KENNKARLVLEKDDLGLNLMPPSA
Subjt:  PKTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

XP_022940239.1 heat stress transcription factor B-4-like [Cucurbita moschata]1.4e-17489.94Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQS+SP LG+ N  FYH+PAR+SISPSDSDD NPNWCDSPPLSS       NFNNSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPV QR PNHHLLGYHPTNAKQVSGQTH ++GTPN       NNNGSSKSFVTI+EEPK
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK

Query:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        TKLFGV++QSKKR+HPEYGSNNI KENNKARLVLEKDDLGLNLMPPSA
Subjt:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

XP_022982267.1 heat stress transcription factor B-4-like [Cucurbita maxima]1.8e-17489.94Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQS+SP LG+ N  FYH+PAR+SISPSDSDD NPNWCDSPPLSS        FNNSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPV QR PNHHLLGYHPTNAKQVSGQTH + GTPN+N     NNNGSSKSFVTI+EEPK
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK

Query:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        TKLFGV++QSKKR+HPEYGSNNI KENNKARLVLEKDDLGLNLMPPSA
Subjt:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

XP_023536624.1 heat stress transcription factor B-4-like [Cucurbita pepo subsp. pepo]4.3e-15583.48Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALM+DNCEGVL+SLDSHK IPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFAN+FF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKG+KHLLCEIHRRKTAQPQL     HQS SP L +NNP FYHF  R SISPSDSDD N NWCDSPPLSS G     N NNSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE-
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYST      FPVVQR PNHHL+GY +PTNAKQVSGQTHL++G PN+NN NN       KSFV I+EE 
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE-

Query:  PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA
         KTKLFGVAI SKKRLHPEY SNNIGKE NNKAR VLEKDDLGLNLMPPSA
Subjt:  PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA

XP_038897232.1 heat stress transcription factor B-4-like [Benincasa hispida]8.9e-16184.57Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP--NFNNSVTALSEDNERLRRSNN
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQSHSP LG+NNP FYHFP R+SISPSDSDD N NWCDSP LSSP    AP  N NNSVTALSEDNERLRRSNN
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP--NFNNSVTALSEDNERLRRSNN

Query:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TSLLSDGFPVV-QRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNN--------TNNNGS
        MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSD FPVV QRPPNHH   YH         Q HL++GT  +NNNNN         NNN  
Subjt:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TSLLSDGFPVV-QRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNN--------TNNNGS

Query:  SKSFVTILEE--PKTKLFGVAIQSKKRLHPEYGSNNIGKE--NNKARLVLEKDDLGLNLMPPS
        +KSFVTILEE   KTKLFGVAIQSKKRLHPEYGSNNIGKE  NNKARLVLE DDLGLNLMPPS
Subjt:  SKSFVTILEE--PKTKLFGVAIQSKKRLHPEYGSNNIGKE--NNKARLVLEKDDLGLNLMPPS

TrEMBL top hitse value%identityAlignment
A0A0A0L4W8 HSF_DOMAIN domain-containing protein1.6e-14779.21Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLD C+GVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFP-ARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNM
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQ HSP+    NP FYHFP ARLSISPSDSDD N  WCDSP   SP      N NNSVTALSEDNERLRRSNNM
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFP-ARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNM

Query:  LMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS--TSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILE
        LMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS  TSLLSDGFPVV++P ++H   +H  + +QVS Q                NN   +KSFVTILE
Subjt:  LMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS--TSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILE

Query:  E-----PKTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        E      KTKLFGVAIQSKKRLHPEYG++N    NNKARLVLEKDDLGLNLMPPSA
Subjt:  E-----PKTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

A0A6J1F8Q8 heat stress transcription factor B-4-like1.8e-15483.19Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALM+DNCEGVL+SL+SHK IPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFAN+FF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKG+KHLLCEIHRRKTAQP     QH QS SP L +NNP FYHF AR SISPSDSDD N NWCDSPPLSS G     N NNSV+ALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE-
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYST      FPVVQR PNHHL+GY +PTNAKQVSGQTHL++ TPN+NNNNN       KSFV I+EE 
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEE-

Query:  PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA
         KTKLFGVAI SKKRLHPEY SNNIGKE NNKAR VLEKDDLGLNLMPPSA
Subjt:  PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA

A0A6J1FJH2 heat stress transcription factor B-4-like6.9e-17589.94Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQS+SP LG+ N  FYH+PAR+SISPSDSDD NPNWCDSPPLSS       NFNNSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPV QR PNHHLLGYHPTNAKQVSGQTH ++GTPN       NNNGSSKSFVTI+EEPK
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK

Query:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        TKLFGV++QSKKR+HPEYGSNNI KENNKARLVLEKDDLGLNLMPPSA
Subjt:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

A0A6J1IFM1 heat stress transcription factor B-4-like4.3e-15381.87Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALM+DNCEGVL+SL+ HK IPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFAN+FF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPG--GVAAPNFNNSVTALSEDNERLRRSNN
        RKG+KHLLCEIHRRKTAQP     QHHQS SP L +NNP FYHF  R SISPSDSDD N NWCDSPPLSS G       N++NSVTALSEDNERLRRSNN
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPG--GVAAPNFNNSVTALSEDNERLRRSNN

Query:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILE
        MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYST      FPVVQR PNHHL+GY +PTNAK+VSGQTHL++GTPN       NNN S+KS V I+E
Subjt:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGY-HPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILE

Query:  E-PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA
        E  KTKLFGVAI SKKRLHPEY SNNIGKE NNKAR VLEKDDLGLNLMPPSA
Subjt:  E-PKTKLFGVAIQSKKRLHPEYGSNNIGKE-NNKARLVLEKDDLGLNLMPPSA

A0A6J1IW73 heat stress transcription factor B-4-like9.0e-17589.94Show/hide
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQ+TVNQHHQS+SP LG+ N  FYH+PAR+SISPSDSDD NPNWCDSPPLSS        FNNSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPV QR PNHHLLGYHPTNAKQVSGQTH + GTPN+N     NNNGSSKSFVTI+EEPK
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPK

Query:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        TKLFGV++QSKKR+HPEYGSNNI KENNKARLVLEKDDLGLNLMPPSA
Subjt:  TKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

SwissProt top hitse value%identityAlignment
Q10KX8 Heat stress transcription factor B-4d1.6e-6755.29Show/hide
Query:  MALMLDNCEG-VLLSLD-SH----------KAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI
        MA +++ C G +++S++ SH           A PAPFL+KTYQLVDDP+TD +VSWGED+ TFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI
Subjt:  MALMLDNCEG-VLLSLD-SH----------KAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI

Query:  VPDRWEFANEFFRKGEKHLLCEIHRRKTAQ-------PQLTVNQHH---------QSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPG
        V DRWEFANEFFRKG KHLL EIHRRK++        P   ++QH+            SP +G    + YHF                 +C SP   + G
Subjt:  VPDRWEFANEFFRKGEKHLLCEIHRRKTAQ-------PQLTVNQHH---------QSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPG

Query:  GVAAPNFNNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAP
        G       + + ALSEDN +LRR N++L+SELAHM+KLYNDIIYF+QNHV+PVAP
Subjt:  GVAAPNFNNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAP

Q67U94 Heat stress transcription factor B-4c5.4e-6844.88Show/hide
Query:  NCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDD-TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEK
        +CE    +  + KA+PAPFLTKTYQLVDDPATDHIVSWG+D  +TFVVWRPPEFARD+LPNYFKHNNFSSFVRQLNTYGFRK+VP+RWEFANEFFRKGEK
Subjt:  NCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDD-TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEK

Query:  HLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDS---DDN------------NPNWCD------------SPPLSSPGGVAAPN
         LL EIHRRKT+    + +    S  P     +   +H P   +     +   DD              P+W +              P  SP    A  
Subjt:  HLLCEIHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDS---DDN------------NPNWCD------------SPPLSSPGGVAAPN

Query:  FNNSVTA--LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQT--HLMSGT
           + TA  L E+NERLRRSN  L+ ELAHM+KLYNDIIYFVQNHV+PVAPS +   +  L   G    ++P   ++L     +    S  T     S  
Subjt:  FNNSVTA--LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQT--HLMSGT

Query:  PNSNNNNNTNNNGSSKSFVTILEEPKTKLFGVAIQ-------SKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA
        P   +     + G + +         TKLFGV +        SK+   PE   +       K RLVLE DDL L + P S+
Subjt:  PNSNNNNNTNNNGSSKSFVTILEEPKTKLFGVAIQ-------SKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPSA

Q6Z9R8 Putative heat stress transcription factor B-4a2.2e-6151.13Show/hide
Query:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDD-----TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEI
        S   +PAPFLTKTYQLVDDPATDH+VSW +DD     ++FVVWRPPEFARD+LPNYFKH+NFSSFVRQLNTYGFRK+VP+RWEFANEFFRKGEK LLCEI
Subjt:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDD-----TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEI

Query:  HRRKTA----------QPQLTVNQH--------------HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVT---
        HRRK+A           P     +H              H     ++       +   A L ++PS           S  LS  G V AP    + T   
Subjt:  HRRKTA----------QPQLTVNQH--------------HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVT---

Query:  -ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPP
         AL ++N RL R N  L+ ELAHM+KLY+DIIYFVQNHV+PVAPS     + +    G  V++ PP
Subjt:  -ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPP

Q7XHZ0 Heat stress transcription factor B-4b7.5e-7056.35Show/hide
Query:  MALMLDNCEGVLLSLD----------SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA +++ C  +++S++          + K +PAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIV 
Subjt:  MALMLDNCEGVLLSLD----------SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQP-----QLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTA
        DRWEFANEFFRKG KHLL EIHRRK++QP           HH   +P      P  YH    +   P+ +       C        GG       + + A
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQP-----QLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTA

Query:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSL
        LSEDN +LRR N++L+SELAHMKKLYNDIIYF+QNHV PV  + +   ST++
Subjt:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSL

Q9C635 Heat stress transcription factor B-41.7e-9053.33Show/hide
Query:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA+M++N  G           L+     KA+PAPFLTKTYQLVDDPATDH+VSWG+DDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
Subjt:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQLTVNQH-----HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVT
        DRWEFANEFF++GEKHLLCEIHRRKT+  Q+   QH     H    P +  +  SF+  P     +P    + +  WCD  P S P  +    +    VT
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQLTVNQH-----HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVT

Query:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTN
        ALSEDNERLRRSN +LMSELAHMKKLYNDIIYFVQNHVKPVAPSN+  Y +S L       Q+PP    L Y+ T              T N+ N N  N
Subjt:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTN

Query:  NN-GSSKSFVTILEEP-----------KTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPS
        ++  +S+S +T+LE+            KTKLFGV++ S K+      S++   + +K RLVL++ DL LNLM  S
Subjt:  NN-GSSKSFVTILEEP-----------KTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPS

Arabidopsis top hitse value%identityAlignment
AT1G46264.1 heat shock transcription factor B41.2e-9153.33Show/hide
Query:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA+M++N  G           L+     KA+PAPFLTKTYQLVDDPATDH+VSWG+DDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
Subjt:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQLTVNQH-----HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVT
        DRWEFANEFF++GEKHLLCEIHRRKT+  Q+   QH     H    P +  +  SF+  P     +P    + +  WCD  P S P  +    +    VT
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQLTVNQH-----HQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVT

Query:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTN
        ALSEDNERLRRSN +LMSELAHMKKLYNDIIYFVQNHVKPVAPSN+  Y +S L       Q+PP    L Y+ T              T N+ N N  N
Subjt:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTN

Query:  NN-GSSKSFVTILEEP-----------KTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPS
        ++  +S+S +T+LE+            KTKLFGV++ S K+      S++   + +K RLVL++ DL LNLM  S
Subjt:  NN-GSSKSFVTILEEP-----------KTKLFGVAIQSKKRLHPEYGSNNIGKENNKARLVLEKDDLGLNLMPPS

AT4G11660.1 winged-helix DNA-binding transcription factor family protein9.8e-4948.13Show/hide
Query:  DSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK
        DS ++IP PFLTKTYQLV+DP  D ++SW ED TTF+VWRP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEF+N+ F++GEK LL +I RRK
Subjt:  DSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK

Query:  TAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSP-------GGVAAPNFNNSVTA--LSEDNERLRRSNNMLMSELAH
         +QP +       + +        +     A + +SPS+S +      +S P ++        GG +     +  TA  L E+NERLR+ N  L  E+  
Subjt:  TAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSP-------GGVAAPNFNNSVTA--LSEDNERLRRSNNMLMSELAH

Query:  MKKLYNDIIYFVQN
        +K LY +I   + N
Subjt:  MKKLYNDIIYFVQN

AT4G17750.1 heat shock factor 11.9e-3965.45Show/hide
Query:  AIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKTAQP
        ++P PFL+KTY +V+DPATD IVSW   + +F+VW PPEF+RDLLP YFKHNNFSSFVRQLNTYGFRK+ PDRWEFANE F +G+KHLL +I RRK+ Q 
Subjt:  AIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKTAQP

Query:  QLTVNQHHQS
          + + + QS
Subjt:  QLTVNQHHQS

AT4G36990.1 heat shock factor 45.4e-4748.56Show/hide
Query:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT
        + +++PAPFL+KTYQLVDD +TD +VSW E+ T FVVW+  EFA+DLLP YFKHNNFSSF+RQLNTYGFRK VPD+WEFAN++FR+G + LL +I RRK 
Subjt:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT

Query:  AQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDII
             +V         V+G               SPS+S+    +   S   SSPG    P +  N V  LS +NE+L+R NN L SELA  KK  ++++
Subjt:  AQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAP-NFNNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDII

Query:  YFVQNHVK
         F+  H+K
Subjt:  YFVQNHVK

AT5G62020.1 heat shock transcription factor B2A8.6e-4543.81Show/hide
Query:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT
        S ++IP PFLTKT+ LV+D + D ++SW ED ++F+VW P +FA+DLLP +FKHNNFSSFVRQLNTYGF+K+VPDRWEF+N+FF++GEK LL EI RRK 
Subjt:  SHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT

Query:  AQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDS-DDNNPN----------WCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNMLMSELA
              +   HQ+      V  PS       + +SPS+S +DNN N          +C     +  GG++          L E+NE+LR  N  L  EL 
Subjt:  AQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDS-DDNNPN----------WCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNMLMSELA

Query:  HMKKLYNDIIYFVQNHVKPVAPSNSY
         MK + ++I   + N+V       SY
Subjt:  HMKKLYNDIIYFVQNHVKPVAPSNSY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTAATGCTTGATAATTGCGAAGGCGTTTTGCTTTCCTTAGACTCCCACAAAGCAATCCCAGCTCCATTTCTTACTAAAACTTACCAGCTCGTTGACGATCCCGC
CACCGACCATATTGTCTCATGGGGCGAAGATGATACCACCTTCGTTGTTTGGCGCCCTCCTGAATTTGCTAGAGATCTCCTTCCTAACTATTTCAAGCACAACAACTTCT
CTAGCTTCGTCCGCCAGCTCAACACCTATGGTTTTAGAAAAATTGTGCCGGATAGATGGGAATTTGCGAATGAGTTCTTCAGAAAAGGAGAGAAACATTTGTTATGTGAG
ATCCATCGACGAAAAACCGCTCAGCCTCAACTCACCGTCAACCAACATCACCAATCTCATTCTCCAGTACTCGGTGTCAACAATCCCAGTTTCTACCATTTTCCCGCCCG
ACTCAGCATCTCCCCGTCCGACTCCGACGACAACAACCCCAATTGGTGTGACTCGCCACCTCTCTCCTCCCCCGGAGGTGTCGCTGCACCCAACTTCAACAACTCCGTCA
CCGCCTTGTCGGAAGACAACGAGCGTCTTCGTCGGAGCAACAACATGCTCATGTCCGAATTAGCCCACATGAAAAAACTCTACAACGACATCATCTATTTCGTTCAAAAC
CACGTCAAGCCCGTTGCCCCGAGTAATTCATATCAATATTCAACGTCTCTACTTTCCGACGGTTTTCCGGTGGTGCAACGGCCGCCGAATCACCACCTTCTCGGGTACCA
TCCGACGAACGCGAAACAAGTTTCCGGGCAGACCCATTTGATGAGCGGAACTCCAAATAGTAATAATAATAATAATACTAATAATAATGGTTCCTCAAAAAGCTTCGTGA
CGATTCTAGAAGAACCCAAGACGAAGCTTTTTGGAGTGGCGATTCAATCGAAGAAACGGCTGCACCCGGAGTATGGTTCTAATAATATCGGCAAAGAGAACAACAAGGCT
CGATTGGTGTTGGAAAAAGATGATTTGGGCCTCAATCTCATGCCCCCTTCCGCTTTTCTACACAGAAAGAAGGGCAAAATATTTTTGAAAATATTTGTATCAGAAGAAGA
TAAAGAGATGAGAAAGACCGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTAATGCTTGATAATTGCGAAGGCGTTTTGCTTTCCTTAGACTCCCACAAAGCAATCCCAGCTCCATTTCTTACTAAAACTTACCAGCTCGTTGACGATCCCGC
CACCGACCATATTGTCTCATGGGGCGAAGATGATACCACCTTCGTTGTTTGGCGCCCTCCTGAATTTGCTAGAGATCTCCTTCCTAACTATTTCAAGCACAACAACTTCT
CTAGCTTCGTCCGCCAGCTCAACACCTATGGTTTTAGAAAAATTGTGCCGGATAGATGGGAATTTGCGAATGAGTTCTTCAGAAAAGGAGAGAAACATTTGTTATGTGAG
ATCCATCGACGAAAAACCGCTCAGCCTCAACTCACCGTCAACCAACATCACCAATCTCATTCTCCAGTACTCGGTGTCAACAATCCCAGTTTCTACCATTTTCCCGCCCG
ACTCAGCATCTCCCCGTCCGACTCCGACGACAACAACCCCAATTGGTGTGACTCGCCACCTCTCTCCTCCCCCGGAGGTGTCGCTGCACCCAACTTCAACAACTCCGTCA
CCGCCTTGTCGGAAGACAACGAGCGTCTTCGTCGGAGCAACAACATGCTCATGTCCGAATTAGCCCACATGAAAAAACTCTACAACGACATCATCTATTTCGTTCAAAAC
CACGTCAAGCCCGTTGCCCCGAGTAATTCATATCAATATTCAACGTCTCTACTTTCCGACGGTTTTCCGGTGGTGCAACGGCCGCCGAATCACCACCTTCTCGGGTACCA
TCCGACGAACGCGAAACAAGTTTCCGGGCAGACCCATTTGATGAGCGGAACTCCAAATAGTAATAATAATAATAATACTAATAATAATGGTTCCTCAAAAAGCTTCGTGA
CGATTCTAGAAGAACCCAAGACGAAGCTTTTTGGAGTGGCGATTCAATCGAAGAAACGGCTGCACCCGGAGTATGGTTCTAATAATATCGGCAAAGAGAACAACAAGGCT
CGATTGGTGTTGGAAAAAGATGATTTGGGCCTCAATCTCATGCCCCCTTCCGCTTTTCTACACAGAAAGAAGGGCAAAATATTTTTGAAAATATTTGTATCAGAAGAAGA
TAAAGAGATGAGAAAGACCGTGTAATTAATTATTAAATAAATAAGCTGTGCTGCTGACACGCAGGAGAAGGGAAAAACCAAAAAAAAAAGAAAAAAAAAGGTTTTCCCTG
TGGTATTGAACTCGAAAGAAAAAGGAAAGGTTTTCCTCTGATATTAAAGTTGGCAGAAAATTCAGCCCAACATGGGCTGCAGAAATGGGAAGGAATGATTAAGCCCACCC
Protein sequenceShow/hide protein sequence
MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPATDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCE
IHRRKTAQPQLTVNQHHQSHSPVLGVNNPSFYHFPARLSISPSDSDDNNPNWCDSPPLSSPGGVAAPNFNNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQN
HVKPVAPSNSYQYSTSLLSDGFPVVQRPPNHHLLGYHPTNAKQVSGQTHLMSGTPNSNNNNNTNNNGSSKSFVTILEEPKTKLFGVAIQSKKRLHPEYGSNNIGKENNKA
RLVLEKDDLGLNLMPPSAFLHRKKGKIFLKIFVSEEDKEMRKTV