CuGenDBv2

ClCG05G020010 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G020010
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: heat stress transcription factor B-4-like
Genome location: CG_Chr05:32219334..32222916
RNA-Seq Expression: ClCG05G020010
Synteny: ClCG05G020010
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
IPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6608467.1 Heat stress transcription factor B-4, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.2e-159; identity: 84.51%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQS+SPLG+ N GFYH+P RVSISPSDSDD  NWCDSPPLSS+  +S  +NN  NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSDGFPV Q R PNHH   +H TN KQVS Q   V A TPNNN   N ++KSFVTI+EE   
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGV++QSKKR+HPEY SNNI KE NNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

XP_004141262.2 heat stress transcription factor B-4 [Cucumis sativus] (e-value: 4.3e-155; identity: 82.96%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLD C+GVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFP-GRVSISPSDSDDQNN-WCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNN
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQ HSPL   NPGFYHFP  R+SISPSDSDDQNN WCDSP           +NNN NSVTALSEDNERLRRSNN
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFP-GRVSISPSDSDDQNN-WCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNN

Query:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEE
        MLMSELAH+KKLYNDIIYFVQNHVKPVAPSNSYQYS TTSLLSDGFPVV  R PNH+HH+HH  + +QVSSQI+           N    TKSFVTILEE
Subjt:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEE

Query:  QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
          +QQQQTKTKLFGVAIQSKKRLHPEY ++N    NNNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

XP_022940239.1 heat stress transcription factor B-4-like [Cucurbita moschata] (e-value: 1.6e-157; identity: 84.23%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQS+SPLG+ N GFYH+P RVSISPSDSDD  NWCDSPPLSS+      +NN  NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSDGFPV Q R PNHH   +H TN KQVS Q   V A TPNNN   N ++KSFVTI+EE   
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGV++QSKKR+HPEY SNNI KE NNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

XP_022982267.1 heat stress transcription factor B-4-like [Cucurbita maxima] (e-value: 1.1e-158; identity: 84.23%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQS+SPLG+ N GFYH+P RVSISPSDSDD  NWCDSPPLSS+  ++ V N   NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSDGFPV Q R PNHH   +H TN KQVS Q   V   TPNNN N N ++KSFVTI+EE   
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGV++QSKKR+HPEY SNNI KE NNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

XP_038897232.1 heat stress transcription factor B-4-like [Benincasa hispida] (e-value: 4.2e-174; identity: 87.16%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSP LSS  P    HNNN NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNT-----------NKNAATK
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSD FPVVQ RPPNHHHH++H        SQI LVT TTP NN            N N+ TK
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNT-----------NKNAATK

Query:  SFVTILEEQQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKE-NNNKARLVLEKDDLGLNLMPPS
        SFVTILEE     Q TKTKLFGVAIQSKKRLHPEY SNNIGKE NNNKARLVLE DDLGLNLMPPS
Subjt:  SFVTILEEQQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKE-NNNKARLVLEKDDLGLNLMPPS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L4W8 HSF_DOMAIN domain-containing protein (e-value: 7.2e-156; identity: 83.24%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLD C+GVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFP-GRVSISPSDSDDQNN-WCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNN
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQ HSPL   NPGFYHFP  R+SISPSDSDDQNN WCDSP           +NNN NSVTALSEDNERLRRSNN
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFP-GRVSISPSDSDDQNN-WCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNN

Query:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEE
        MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TTSLLSDGFPVV  R PNH+HH+HH  + +QVSSQI+           N    TKSFVTILEE
Subjt:  MLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS-TTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEE

Query:  QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
          +QQQQTKTKLFGVAIQSKKRLHPEY ++N    NNNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

A0A6J1F8Q8 heat stress transcription factor B-4-like (e-value: 1.4e-154; identity: 82.25%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALM+DNCEGVL+SL+SHK IPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFAN+FF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKG+KHLLCEIHRRKTAQP     QH QS SPL +NNPGFYHF  R SISPSDSDDQNNWCDSPPLSSSG      NNN NSV+ALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYST       FPVVQ +P +H   Y++ TN KQVS Q  LVTA TPNNN N N   KSFV I+EEQQ 
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGVAI SKKRLHPEYASNNIGKENNNKAR VLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

A0A6J1FJH2 heat stress transcription factor B-4-like (e-value: 7.7e-158; identity: 84.23%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQS+SPLG+ N GFYH+P RVSISPSDSDD  NWCDSPPLSS+      +NN  NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSDGFPV Q R PNHH   +H TN KQVS Q   V A TPNNN   N ++KSFVTI+EE   
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGV++QSKKR+HPEY SNNI KE NNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

A0A6J1IFM1 heat stress transcription factor B-4-like (e-value: 1.0e-154; identity: 81.41%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALM+DNCEGVL+SL+ HK IPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFAN+FF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKG+KHLLCEIHRRKTAQP     QHHQS SPL +NNPGFYHF GR SISPSDSDDQNNWCDSPPLSSSG ++  +NN +NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYST       FPVVQ +P +H   Y++ TN K+VS Q  LVT  TPNNN + N   KS V I+EEQQ 
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGVAI SKKRLHPEYASNNIGKENNNKAR VLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

A0A6J1IW73 heat stress transcription factor B-4-like (e-value: 5.3e-159; identity: 84.23%)
Query:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
        MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDP+TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF
Subjt:  MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFF

Query:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML
        RKGEKHLLCEIHRRKTAQPQVTVNQHHQS+SPLG+ N GFYH+P RVSISPSDSDD  NWCDSPPLSS+  ++ V N   NSVTALSEDNERLRRSNNML
Subjt:  RKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNML

Query:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ
        MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYS TSLLSDGFPV Q R PNHH   +H TN KQVS Q   V   TPNNN N N ++KSFVTI+EE   
Subjt:  MSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQ

Query:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
             KTKLFGV++QSKKR+HPEY SNNI KE NNKARLVLEKDDLGLNLMPPSA
Subjt:  QQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

SwissProt top hits (e-value, % identity, alignment)
Q10KX8 Heat stress transcription factor B-4d (e-value: 7.9e-67; identity: 58.02%)
Query:  MALMLDNCEG-VLLSLD-SH----------KAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI
        MA +++ C G +++S++ SH           A PAPFL+KTYQLVDDPSTD +VSWGED+ TFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI
Subjt:  MALMLDNCEG-VLLSLD-SH----------KAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKI

Query:  VPDRWEFANEFFRKGEKHLLCEIHRRKT---AQPQVTVNQHHQSHSPLGI-NNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVT
        V DRWEFANEFFRKG KHLL EIHRRK+   +QPQ         H PL + + P     P   + + +    Q  +C SP   + G          + + 
Subjt:  VPDRWEFANEFFRKGEKHLLCEIHRRKT---AQPQVTVNQHHQSHSPLGI-NNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVT

Query:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAP
        ALSEDN +LRR N++L+SELAHM+KLYNDIIYF+QNHV+PVAP
Subjt:  ALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAP

Q67U94 Heat stress transcription factor B-4c (e-value: 2.8e-64; identity: 44.3%)
Query:  NCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDD-TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEK
        +CE    +  + KA+PAPFLTKTYQLVDDP+TDHIVSWG+D  +TFVVWRPPEFARD+LPNYFKHNNFSSFVRQLNTYGFRK+VP+RWEFANEFFRKGEK
Subjt:  NCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDD-TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEK

Query:  HLLCEIHRRKTA----------------------QPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWC-------DSPPLSSSGPHSGVH
         LL EIHRRKT+                       P V   QHH  H+ +G +     H  G     P   +                P  SS    G  
Subjt:  HLLCEIHRRKTA----------------------QPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWC-------DSPPLSSSGPHSGVH

Query:  NNNTNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTAT
           T +V  L E+NERLRRSN  L+ ELAHM+KLYNDIIYFVQNHV+PVAPS        +    G  +   + P   +  ++       SS + +    
Subjt:  NNNTNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTAT

Query:  TPNNNTNKNAATKSFVTILEEQQQQQQQTKTKLFGVAIQ-------SKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA
        +P       A  KS           +    TKLFGV +        SK+   PE    +       K RLVLE DDL L + P S+
Subjt:  TPNNNTNKNAATKSFVTILEEQQQQQQQTKTKLFGVAIQ-------SKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPSA

Q6Z9R8 Putative heat stress transcription factor B-4a (e-value: 1.0e-58; identity: 41.05%)
Query:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDD-----TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEI
        S   +PAPFLTKTYQLVDDP+TDH+VSW +DD     ++FVVWRPPEFARD+LPNYFKH+NFSSFVRQLNTYGFRK+VP+RWEFANEFFRKGEK LLCEI
Subjt:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDD-----TTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEI

Query:  HRRKTA---------QPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPP---------LSSSGPHSGVHNN--NTNSVTALSEDNE
        HRRK+A          P       H +      +  G  H  GR+    + + ++ +W +S           LS  GP          T    AL ++N 
Subjt:  HRRKTA---------QPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPP---------LSSSGPHSGVHNN--NTNSVTALSEDNE

Query:  RLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAA----
        RL R N  L+ ELAHM+KLY+DIIYFVQNHV+PVAPS        +    G  V+  RPP           GK  +S+++  +  +  ++++   A    
Subjt:  RLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAA----

Query:  -------TKSFVTILEEQQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNN-------KARLVLEKDDLGLNLMPP
                ++   I+ E         TKLFGV + S        AS                K  LV+E  +L L+++ P
Subjt:  -------TKSFVTILEEQQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNN-------KARLVLEKDDLGLNLMPP

Q7XHZ0 Heat stress transcription factor B-4b (e-value: 1.7e-69; identity: 54.79%)
Query:  MALMLDNCEGVLLSLD----------SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA +++ C  +++S++          + K +PAPFLTKTYQLVDDP TDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIV 
Subjt:  MALMLDNCEGVLLSLD----------SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDN
        DRWEFANEFFRKG KHLL EIHRRK++QP      H   H    +N       P  +   P      +   + P  ++    +G      + + ALSEDN
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDN

Query:  ERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPP
         +LRR N++L+SELAHMKKLYNDIIYF+QNHV PV        +TT+  S      QH  P
Subjt:  ERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPP

Q9C635 Heat stress transcription factor B-4 (e-value: 5.3e-87; identity: 52.28%)
Query:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA+M++N  G           L+     KA+PAPFLTKTYQLVDDP+TDH+VSWG+DDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
Subjt:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQ--PQ--VTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNS-VTA
        DRWEFANEFF++GEKHLLCEIHRRKT+Q  PQ       HH +   +  +   F+  P     +P   ++ + WCD  P   S P       +T + VTA
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQ--PQ--VTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNS-VTA

Query:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKN
        LSEDNERLRRSN +LMSELAHMKKLYNDIIYFVQNHVKPVAPSN+  Y     LS      Q + P    +Y+  T           V AT  N   +  
Subjt:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKN

Query:  AATKSFVTILEE----QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPS
          ++S +T+LE+       Q    KTKLFGV++ S K+    ++      +  +K RLVL++ DL LNLM  S
Subjt:  AATKSFVTILEE----QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPS

Arabidopsis top hits (e-value, % identity, alignment)
AT1G46264.1 heat shock transcription factor B4 (e-value: 3.7e-88; identity: 52.28%)
Query:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
        MA+M++N  G           L+     KA+PAPFLTKTYQLVDDP+TDH+VSWG+DDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP
Subjt:  MALMLDNCEG----------VLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVP

Query:  DRWEFANEFFRKGEKHLLCEIHRRKTAQ--PQ--VTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNS-VTA
        DRWEFANEFF++GEKHLLCEIHRRKT+Q  PQ       HH +   +  +   F+  P     +P   ++ + WCD  P   S P       +T + VTA
Subjt:  DRWEFANEFFRKGEKHLLCEIHRRKTAQ--PQ--VTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNS-VTA

Query:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKN
        LSEDNERLRRSN +LMSELAHMKKLYNDIIYFVQNHVKPVAPSN+  Y     LS      Q + P    +Y+  T           V AT  N   +  
Subjt:  LSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQNHVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKN

Query:  AATKSFVTILEE----QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPS
          ++S +T+LE+       Q    KTKLFGV++ S K+    ++      +  +K RLVL++ DL LNLM  S
Subjt:  AATKSFVTILEE----QQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIGKENNNKARLVLEKDDLGLNLMPPS

AT4G11660.1 winged-helix DNA-binding transcription factor family protein (e-value: 2.7e-46; identity: 48.62%)
Query:  DSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK
        DS ++IP PFLTKTYQLV+DP  D ++SW ED TTF+VWRP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEF+N+ F++GEK LL +I RRK
Subjt:  DSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK

Query:  TAQPQVTVNQHHQSHSPLG-----INNPGFYHFPGRVSISPSDS-DDQNNWCDSPPLSSSGPHSGV-----HNNNTNSVTA--LSEDNERLRRSNNMLMS
         +QP +       + +           P   H      +SPS+S ++Q    +S P +++    GV         T+  TA  L E+NERLR+ N  L  
Subjt:  TAQPQVTVNQHHQSHSPLG-----INNPGFYHFPGRVSISPSDS-DDQNNWCDSPPLSSSGPHSGV-----HNNNTNSVTA--LSEDNERLRRSNNMLMS

Query:  ELAHMKKLYNDIIYFVQN
        E+  +K LY +I   + N
Subjt:  ELAHMKKLYNDIIYFVQN

AT4G17750.1 heat shock factor 1 (e-value: 2.5e-39; identity: 64.55%)
Query:  AIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKTAQP
        ++P PFL+KTY +V+DP+TD IVSW   + +F+VW PPEF+RDLLP YFKHNNFSSFVRQLNTYGFRK+ PDRWEFANE F +G+KHLL +I RRK+ Q 
Subjt:  AIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKTAQP

Query:  QVTVNQHHQS
          + + + QS
Subjt:  QVTVNQHHQS

AT4G36990.1 heat shock factor 4 (e-value: 7.1e-47; identity: 49.06%)
Query:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK-
        + +++PAPFL+KTYQLVDD STD +VSW E+ T FVVW+  EFA+DLLP YFKHNNFSSF+RQLNTYGFRK VPD+WEFAN++FR+G + LL +I RRK 
Subjt:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRK-

Query:  ----TAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNMLMSELAHMKKLY
            TA   V V    +S+S  G                    DD  +   S P SS  P S       N V  LS +NE+L+R NN L SELA  KK  
Subjt:  ----TAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNMLMSELAHMKKLY

Query:  NDIIYFVQNHVK
        ++++ F+  H+K
Subjt:  NDIIYFVQNHVK

AT5G62020.1 heat shock transcription factor B2A (e-value: 1.9e-44; identity: 45.21%)
Query:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT
        S ++IP PFLTKT+ LV+D S D ++SW ED ++F+VW P +FA+DLLP +FKHNNFSSFVRQLNTYGF+K+VPDRWEF+N+FF++GEK LL EI RRK 
Subjt:  SHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCEIHRRKT

Query:  AQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALS----EDNERLRRSNNMLMSELAHMKKLYN
              +   HQ+     +  P        + +SPS+S + NN  ++  +SSS      H   T     LS    E+NE+LR  N  L  EL  MK + +
Subjt:  AQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALS----EDNERLRRSNNMLMSELAHMKKLYN

Query:  DIIYFVQNHVKPVAPSNSY
        +I   + N+V       SY
Subjt:  DIIYFVQNHVKPVAPSNSY
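
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of how one block could be summarised from that midline. The function name `midline_stats` is hypothetical and not part of CuGenDB. The reported % identity is computed over the whole alignment, so a single block will not necessarily reproduce it.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the residue string from a "Query:" line (gaps written as "-"),
    `midline` is the unlabelled line beneath it. Trailing spaces of the
    midline are often lost in text dumps, so it is padded to the query length.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    # Letters on the midline are identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # "+" marks a conservative (positive-scoring) substitution.
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Illustrative input only (not taken from a specific hit above).
stats = midline_stats("MALMLDNCEG", "MALM+D CEG")
print(stats["identities"], stats["positives"], stats["percent_identity"])
```

Padding the midline to the query length keeps columns that were all-space at the end of a block in the denominator.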


Sequences
CDS sequence
ATGGCTCTAATGCTTGATAATTGCGAAGGCGTTTTGCTTTCCTTAGACTCTCACAAAGCAATCCCAGCTCCGTTTCTCACTAAAACCTACCAACTGGTTGATGATCCTTC
CACTGACCATATTGTCTCATGGGGTGAAGATGACACTACCTTCGTCGTTTGGCGTCCTCCTGAATTCGCTAGAGATCTCCTTCCTAACTATTTCAAACACAACAATTTCT
CTAGCTTCGTCCGTCAGCTCAACACTTATGGTTTTAGAAAAATTGTGCCGGACAGATGGGAATTTGCGAATGAGTTCTTTAGAAAAGGAGAGAAACATTTGTTATGTGAG
ATCCATAGACGGAAAACCGCTCAACCTCAAGTTACCGTCAACCAACACCACCAATCTCATTCTCCACTTGGTATTAACAATCCCGGGTTTTACCATTTTCCCGGCAGAGT
TAGTATCTCCCCCTCCGATTCCGACGACCAAAACAATTGGTGCGACTCGCCTCCACTCTCTTCCTCTGGTCCCCACAGCGGCGTTCACAACAACAACACCAACTCCGTCA
CCGCTTTGTCGGAGGACAACGAACGCCTCCGCCGAAGTAACAACATGCTGATGTCCGAATTAGCCCATATGAAAAAACTCTACAACGACATCATTTATTTCGTTCAAAAC
CATGTGAAGCCTGTCGCCCCGAGTAATTCATATCAATATTCAACCACATCCTTACTTTCCGACGGCTTTCCGGTGGTGCAACACCGGCCACCAAACCACCACCATCATTA
CCATCATCTGACGAATGGGAAACAAGTTTCGAGTCAGATCCAATTGGTGACGGCAACTACTCCAAATAATAATACTAACAAGAATGCTGCAACGAAAAGCTTCGTGACGA
TTCTAGAAGAACAACAACAGCAACAACAACAAACGAAAACAAAGCTTTTTGGTGTGGCGATTCAATCCAAGAAACGGTTACACCCAGAGTATGCTTCTAACAACATTGGG
AAAGAGAACAACAACAAAGCTAGATTGGTTTTGGAAAAAGACGATTTAGGCCTCAATCTCATGCCTCCTTCCGCTTGGGGCCTCGCACGTGATTCGGGTGTGCACGTGAA
GCCTAAAACAACGCAGCTGGGTTTGGGAAAGAAAGGATGA
mRNA sequence
TGTCAGTAGAGAGATAAAAATAAAAATTATAGAAATATATACATATATATATATATATATATTTTTTTTTTAAAAAAAAAACATGTCACGGGTTTGTCTTGAAACCCTCT
TATTTCACAGACACATATTCTCTAATTTGTTAATATAAAATCAAAAACTAACTGCACACCACACGACACACTACCTTTCCTCTTTCTTTTTGCTTCTCTTTTATTCCTTC
ATCTGTTCTCAACACAAACACAAACACAAACACAAAACCTCTGAAATGGCTCTAATGCTTGATAATTGCGAAGGCGTTTTGCTTTCCTTAGACTCTCACAAAGCAATCCC
AGCTCCGTTTCTCACTAAAACCTACCAACTGGTTGATGATCCTTCCACTGACCATATTGTCTCATGGGGTGAAGATGACACTACCTTCGTCGTTTGGCGTCCTCCTGAAT
TCGCTAGAGATCTCCTTCCTAACTATTTCAAACACAACAATTTCTCTAGCTTCGTCCGTCAGCTCAACACTTATGGTTTTAGAAAAATTGTGCCGGACAGATGGGAATTT
GCGAATGAGTTCTTTAGAAAAGGAGAGAAACATTTGTTATGTGAGATCCATAGACGGAAAACCGCTCAACCTCAAGTTACCGTCAACCAACACCACCAATCTCATTCTCC
ACTTGGTATTAACAATCCCGGGTTTTACCATTTTCCCGGCAGAGTTAGTATCTCCCCCTCCGATTCCGACGACCAAAACAATTGGTGCGACTCGCCTCCACTCTCTTCCT
CTGGTCCCCACAGCGGCGTTCACAACAACAACACCAACTCCGTCACCGCTTTGTCGGAGGACAACGAACGCCTCCGCCGAAGTAACAACATGCTGATGTCCGAATTAGCC
CATATGAAAAAACTCTACAACGACATCATTTATTTCGTTCAAAACCATGTGAAGCCTGTCGCCCCGAGTAATTCATATCAATATTCAACCACATCCTTACTTTCCGACGG
CTTTCCGGTGGTGCAACACCGGCCACCAAACCACCACCATCATTACCATCATCTGACGAATGGGAAACAAGTTTCGAGTCAGATCCAATTGGTGACGGCAACTACTCCAA
ATAATAATACTAACAAGAATGCTGCAACGAAAAGCTTCGTGACGATTCTAGAAGAACAACAACAGCAACAACAACAAACGAAAACAAAGCTTTTTGGTGTGGCGATTCAA
TCCAAGAAACGGTTACACCCAGAGTATGCTTCTAACAACATTGGGAAAGAGAACAACAACAAAGCTAGATTGGTTTTGGAAAAAGACGATTTAGGCCTCAATCTCATGCC
TCCTTCCGCTTGGGGCCTCGCACGTGATTCGGGTGTGCACGTGAAGCCTAAAACAACGCAGCTGGGTTTGGGAAAGAAAGGATGAAAAAAGGGAAATAAGGAAAGAAAAA
AAAAAACTCCATTAAAGATGCTTTAAAGGACTTCATTGTAGGTTTTTGTTTGCTACCCAAACCAGCCGTATTCAACCCATTCTCCCATCGGGGATGATTCTTCCAACCTC
TAAAGTAAAAGGAAAATATTAAATTCTTTAAATAATTTTCTTTTAGTAAAGAGAAATGCATGTCCACTTTGTCTGAATCGACTTTATTGACATTTTTCTCCTTTCGTTTT
CTT
Protein sequence
MALMLDNCEGVLLSLDSHKAIPAPFLTKTYQLVDDPSTDHIVSWGEDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANEFFRKGEKHLLCE
IHRRKTAQPQVTVNQHHQSHSPLGINNPGFYHFPGRVSISPSDSDDQNNWCDSPPLSSSGPHSGVHNNNTNSVTALSEDNERLRRSNNMLMSELAHMKKLYNDIIYFVQN
HVKPVAPSNSYQYSTTSLLSDGFPVVQHRPPNHHHHYHHLTNGKQVSSQIQLVTATTPNNNTNKNAATKSFVTILEEQQQQQQQTKTKLFGVAIQSKKRLHPEYASNNIG
KENNNKARLVLEKDDLGLNLMPPSAWGLARDSGVHVKPKTTQLGLGKKG
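
The protein sequence above corresponds to the CDS sequence read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the relationship can be checked as follows. The `translate` helper is hypothetical and written only for illustration; it is not the tool used by the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than T/C/A/G are rendered as "X".
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCTCTAATGCTTGATAATTGCGAAGGC"))  # MALMLDNCEG
```

Pasting the full CDS (line breaks included) into `translate` should yield the listed protein sequence if the record is internally consistent.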