; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G19250 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G19250
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionheat stress transcription factor A-4c-like
Genome locationClcChr02:31823376..31830452
RNA-Seq ExpressionClc02G19250
SyntenyClc02G19250
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR001810 - F-box domain
IPR006566 - FBD domain
IPR027725 - Heat shock transcription factor family
IPR032675 - Leucine-rich repeat domain superfamily
IPR036047 - F-box-like domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054510.1 F-box/FBD/LRR-repeat protein [Cucumis melo var. makuwa]1.6e-21777.58Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPES+LVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQEFTL
        TFMCER IYE+VQMLL KLAE+       W + +   + +G   +   LL      +  H  G+     + +WLE     T+F+ + P      S    L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQEFTL

Query:  TEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFS
        TEEAKWMEAY+FDGKDYWKS+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVLEKMVISTKRTLQPIHQYELFKDAV DQED FTPEELLQFS
Subjt:  TEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFS

Query:  QKLLTFPRASKSA
        QKLLT PRASKSA
Subjt:  QKLLTFPRASKSA

XP_004143230.1 putative F-box/LRR-repeat protein At3g18150 [Cucumis sativus]9.7e-22879.69Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPES+LVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YDGENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCPGLKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHAT---LLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQE
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L +   +     +       +   L+ +  H  G+     + HWLE     T    + P      S  
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHAT---LLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQE

Query:  FTLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEEL
          LTEEAKWMEAY+FDGK YWKSQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVLEKMVISTK+TLQPIHQYELFKDAVFDQED FTP+EL
Subjt:  FTLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEEL

Query:  LQFSQKLLTFPRASKSA
        LQFSQKLLT PRASKSA
Subjt:  LQFSQKLLTFPRASKSA

XP_008456389.1 PREDICTED: heat stress transcription factor A-4c-like [Cucumis melo]3.0e-14487.42Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

XP_008465581.2 PREDICTED: F-box/FBD/LRR-repeat protein At1g78750-like [Cucumis melo]2.4e-16285.5Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPES+LVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L  +S+
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV

XP_038902159.1 heat stress transcription factor A-4c-like [Benincasa hispida]4.5e-14889.4Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MDGS+G+SG APPPFLTKTYEMVDDPMTNSIVSW+QSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RGRTHLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKPIYSHSQS+ +G GAPLSEQER+ELEQKIKTL+QEK ILQSQLQKHENEKEQIGLQIQTICQQLW+MGNQQKQLIGILGAELQ+H+QSKKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEW+EF+RDE     KKVKVPPLELM KLELSLGLCEDLLCNVA+VL+EG+ MEVKK GEMRSGVND+FWEQFLTEIPGSS +SEVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

TrEMBL top hitse value%identityAlignment
A0A0A0KDY5 F-box domain-containing protein4.7e-22879.69Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPES+LVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YDGENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCPGLKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHAT---LLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQE
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L +   +     +       +   L+ +  H  G+     + HWLE     T    + P      S  
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHAT---LLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQE

Query:  FTLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEEL
          LTEEAKWMEAY+FDGK YWKSQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVLEKMVISTK+TLQPIHQYELFKDAVFDQED FTP+EL
Subjt:  FTLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEEL

Query:  LQFSQKLLTFPRASKSA
        LQFSQKLLT PRASKSA
Subjt:  LQFSQKLLTFPRASKSA

A0A1S3C373 heat stress transcription factor A-4c-like1.4e-14487.42Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

A0A1S3CPK7 F-box/FBD/LRR-repeat protein At1g78750-like1.2e-16285.5Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPES+LVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L  +S+
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV

A0A5A7UGX2 F-box/FBD/LRR-repeat protein7.5e-21877.58Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPES+LVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQEFTL
        TFMCER IYE+VQMLL KLAE+       W + +   + +G   +   LL      +  H  G+     + +WLE     T+F+ + P      S    L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQEFTL

Query:  TEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFS
        TEEAKWMEAY+FDGKDYWKS+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVLEKMVISTKRTLQPIHQYELFKDAV DQED FTPEELLQFS
Subjt:  TEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFS

Query:  QKLLTFPRASKSA
        QKLLT PRASKSA
Subjt:  QKLLTFPRASKSA

A0A5D3BGW6 Heat stress transcription factor A-4c-like1.4e-14487.42Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

SwissProt top hitse value%identityAlignment
O49403 Heat stress transcription factor A-4a3.6e-5234.52Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG+ HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKMWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG
            +KR+  ++    +E ++EE        +              + +LE S+ + E+L+ +  + + + + M                    ++  + 
Subjt:  HQQSKKRKMWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG

Query:  EMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN
         ++S                               G ND FW+QF +E PGS++  EV L+R+++
Subjt:  EMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN

Q93VB5 Heat stress transcription factor A-4d2.1e-5251.09Show/hide
Query:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIH
        G  G  GG PPPFL KTYEMV+D  TN +VSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID E+WEFANE F+RG THLLK+IH
Subjt:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIH

Query:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVN
        RRKP++SHS  N   NG PL+E ER+ELE++I  L  EK+IL + LQ+   ++  I  Q+Q +  +L  M  +QK ++  L   LQ       R+   V+
Subjt:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVN

Query:  ELLVEEWTEFERDEKNNMKKKVKVPPLEL
          L+         E ++  KK +VP ++L
Subjt:  ELLVEEWTEFERDEKNNMKKKVKVPPLEL

Q94J16 Heat stress transcription factor A-4b3.2e-4850Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M+G  G  GG+ PPFL+KTYEMVDDP T+++V W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D EQWEFANE F++G+ H LK+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK
        IHRRKPI+SHS   SH  GA PL++ ER++ E++I+ L  +   L S+LQ +  +K  +  ++Q + ++L+ + +QQ+ LI          G L + +Q+
Subjt:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK

Query:  HQQSKKRK
            +K++
Subjt:  HQQSKKRK

Q9FK72 Heat stress transcription factor A-4c2.8e-5237.29Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRGR +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK       Q++G  G  L      
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS

Query:  KKRKMWKVNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM
        ++++ ++ N L                      PP    +E + KLE SL   E+L+                       ++     +  ++++  E  +
Subjt:  KKRKMWKVNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM

Query:  -------RSGVNDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY
               ++GVND FWEQ LTE PGS++  EV  +RR+       N +  ++TY
Subjt:  -------RSGVNDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY

Q9LQM7 Heat stress transcription factor A-1d1.5e-4551.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.1e-4651.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G13980.1 winged-helix DNA-binding transcription factor family protein2.9e-4450.56Show/hide
Query:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHR
        S G   G P PFL KTYEMVDD  T+ IVSWS +  SF+VWN  EF++ LLP YFKHNNFSSF+RQLNTYGFRKID E+WEF N+ F++ + HLLK+IHR
Subjt:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHR

Query:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL
        RKPI+SHS        A  ++QER  L++++  L +EK  ++++L K + +K     Q + + + +  M N+QK+L+  L
Subjt:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL

AT4G17750.1 heat shock factor 11.7e-4446.32Show/hide
Query:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHS
        PPPFL+KTY+MV+DP T++IVSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHS

Query:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
         S+S+     LS+ +             +  LE++++ L ++K +L  +L K   +++    ++Q + + L  M  +Q+Q++  L   +Q
Subjt:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G18880.1 heat shock transcription factor A4A2.6e-5334.52Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG+ HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKMWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG
            +KR+  ++    +E ++EE        +              + +LE S+ + E+L+ +  + + + + M                    ++  + 
Subjt:  HQQSKKRKMWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG

Query:  EMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN
         ++S                               G ND FW+QF +E PGS++  EV L+R+++
Subjt:  EMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN

AT5G45710.1 winged-helix DNA-binding transcription factor family protein2.0e-5337.29Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRGR +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK       Q++G  G  L      
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS

Query:  KKRKMWKVNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM
        ++++ ++ N L                      PP    +E + KLE SL   E+L+                       ++     +  ++++  E  +
Subjt:  KKRKMWKVNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM

Query:  -------RSGVNDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY
               ++GVND FWEQ LTE PGS++  EV  +RR+       N +  ++TY
Subjt:  -------RSGVNDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTCCCATCAATCTCATTCCTTCAATCTAATTTCCCATCTTTTCTTCCTTTTTTC
CTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTCTACTTTCACCACGGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATGGATGGCTCAG
ACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAAAGTGGTTTCAGCTTT
GTGGTTTGGAACCCACCGGAATTCGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAACACTTATGGGTTTAGAAAAAT
CGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCAATCTACAGCCATAGCCAGAGCAATA
GCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAGACCATTCTCCAATCCCAATTACAGAAACAC
GAAAACGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAAAAGCAGCTAATTGGGATATTGGGAGCAGAGTTGCA
GAAGCATCAGCAGAGCAAAAAGAGGAAAATGTGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAATTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGA
AGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCATTGGGGTTGTGTGAGGATTTGCTTTGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTG
AAAAAAGAAGGGGAAATGAGGAGTGGAGTGAATGATGTGTTTTGGGAACAATTCTTGACGGAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTAGATAGAAGGAA
CAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACGGTTTCGTTTCGGTATTTGTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAACAAATAGCCGTCCGTCGT
CGTCTCAAATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTTTCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAGGGTCAAAATGAT
TGGATTAGTCAGCTTCCGGAATCTATTCTTGTTGACATTCTTTCGTATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTA
TATTCGTTGTCTCTCCTTTGATGAATGTGCATATCATGACCACAGTAGTTATGATGGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTC
ATGTCTTGATACTTCATGAACGTACTACAATTGATGAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCA
TCTAAAGAAAGACGAATGGCAAGTGAGCTTACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCC
AGAAGTAAATTATGAACTGCCTACTAGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGTGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTC
TCACTGTGCTTTCGCTCAAGGAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTT
CGTAAGCTGAAGTTAACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTGGTCTCAAATC
ATTAGAACTTGCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATG
AAAAAGTTCAAATGCTGCTGTGGAAGCTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATCTGTGAATTCTGGGCTAGCACGCATGCAT
GCAACTCTGCTAATCATTATTTTGATGTCATCAGATCTTCACAATATGGGAGTTGACATATGTCCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACTTCTTT
TCACAAAGTGGCACCTGCCTGGAATATGCAGCATTCTCAGGAATTCACATTGACTGAGGAAGCAAAGTGGATGGAGGCATATGAATTTGATGGTAAAGATTACTGGAAGT
CACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACGGTTATGATATACGGATATGTGACTGAGCCATATGTGTTGGAGCTTGTAGAATTTCTATTGAAGAAT
GCCTTGGTCCTCGAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGACGCTGTCTTTGATCAGGAGGATCATTTCACTCC
TGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTCCAAGTCTGCTGATGCTGAGATTGGGGCAGGGCCTACTCTTCTAGAACTGCCTTTTGTAA
TGGAATCAGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTCCCATCAATCTCATTCCTTCAATCTAATTTCCCATCTTTTCTTCCTTTTTTC
CTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTCTACTTTCACCACGGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATGGATGGCTCAG
ACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAAAGTGGTTTCAGCTTT
GTGGTTTGGAACCCACCGGAATTCGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAACACTTATGGGTTTAGAAAAAT
CGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCAATCTACAGCCATAGCCAGAGCAATA
GCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAGACCATTCTCCAATCCCAATTACAGAAACAC
GAAAACGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAAAAGCAGCTAATTGGGATATTGGGAGCAGAGTTGCA
GAAGCATCAGCAGAGCAAAAAGAGGAAAATGTGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAATTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGA
AGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTGTCATTGGGGTTGTGTGAGGATTTGCTTTGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTG
AAAAAAGAAGGGGAAATGAGGAGTGGAGTGAATGATGTGTTTTGGGAACAATTCTTGACGGAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTAGATAGAAGGAA
CAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACGGTTTCGTTTCGGTATTTGTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAACAAATAGCCGTCCGTCGT
CGTCTCAAATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTTTCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAGGGTCAAAATGAT
TGGATTAGTCAGCTTCCGGAATCTATTCTTGTTGACATTCTTTCGTATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTA
TATTCGTTGTCTCTCCTTTGATGAATGTGCATATCATGACCACAGTAGTTATGATGGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTC
ATGTCTTGATACTTCATGAACGTACTACAATTGATGAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCA
TCTAAAGAAAGACGAATGGCAAGTGAGCTTACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCC
AGAAGTAAATTATGAACTGCCTACTAGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGTGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTC
TCACTGTGCTTTCGCTCAAGGAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTT
CGTAAGCTGAAGTTAACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTGGTCTCAAATC
ATTAGAACTTGCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATG
AAAAAGTTCAAATGCTGCTGTGGAAGCTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATCTGTGAATTCTGGGCTAGCACGCATGCAT
GCAACTCTGCTAATCATTATTTTGATGTCATCAGATCTTCACAATATGGGAGTTGACATATGTCCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACTTCTTT
TCACAAAGTGGCACCTGCCTGGAATATGCAGCATTCTCAGGAATTCACATTGACTGAGGAAGCAAAGTGGATGGAGGCATATGAATTTGATGGTAAAGATTACTGGAAGT
CACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACGGTTATGATATACGGATATGTGACTGAGCCATATGTGTTGGAGCTTGTAGAATTTCTATTGAAGAAT
GCCTTGGTCCTCGAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGACGCTGTCTTTGATCAGGAGGATCATTTCACTCC
TGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTCCAAGTCTGCTGATGCTGAGATTGGGGCAGGGCCTACTCTTCTAGAACTGCCTTTTGTAA
TGGAATCAGGATAACTTAGCTTTATTGTGCTGCAGTTCATTTTTTGTTTATCTAAGGAACGACCCCAGAGGGGTTTTTATTTATTTTAATTAATTTGTTTATTTTTATTT
GATAGCGGTCTTTTTGACTTTTGATGTTTTTCATCTGAGTTTACCATGTTTTTATGGTAAAGCTACAGTCTAAGAGTGTGTTTGGATTGGCTTTTTAAGTATTTAAATAA
GTGTTTATAATTGAAAAAAAAAAAAGTGTTCGTAAACACCTAGAAAGTCAATCCAAACCGGTCTTAAGATATGTAAATTAGTATGTATGAGAGAAGGTCAAGGCCAAGAT
TCTCTTATCAAA
Protein sequenceShow/hide protein sequence
MTASINPSPIFIILSQQSNSHQSHSFNLISHLFFLFSFNFSPFCFYFPLFYFHHGGAAQGFYCCCSEHKMDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSF
VVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKH
ENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEV
KKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNVVRGKQTYRTYGFVSVFVPLGFSADIRNSTNSRPSSSQIGAKPCICKQFRKVSCPTVMRSSKRTRSEEGQND
WISQLPESILVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTIDEFHLKFAFNLFNAIHDDHYNSDGYA
SKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSLTVLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGL
RKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSRTFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMH
ATLLIIILMSSDLHNMGVDICPNSCHWLEICRVQTSFHKVAPAWNMQHSQEFTLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKN
ALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFSQKLLTFPRASKSADAEIGAGPTLLELPFVMESG