; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G043690 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G043690
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionheat stress transcription factor A-4c-like
Genome locationCiama_Chr02:31354771..31362347
RNA-Seq ExpressionCaUC02G043690
SyntenyCaUC02G043690
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR001810 - F-box domain
IPR006566 - FBD domain
IPR027725 - Heat shock transcription factor family
IPR032675 - Leucine-rich repeat domain superfamily
IPR036047 - F-box-like domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054510.1 F-box/FBD/LRR-repeat protein [Cucumis melo var. makuwa]1.2e-24984Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETL-T
        TFMCER IYE+VQMLL KLAE                              IFTIWELTYVPI VTGWKSVEFRLLFTKWH+PGICSILRNS+WLET+ T
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETL-T

Query:  FYIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE
        FYIYPGSYSTFLTEEAKWMEAY+FDGKDYWKS+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVLEKMVISTKRTLQPIHQYELFKDAV DQE
Subjt:  FYIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE

Query:  DHFTPEELLQFSQKLLTFPRASKSA
        D FTPEELLQFSQKLLT PRASKSA
Subjt:  DHFTPEELLQFSQKLLTFPRASKSA

XP_004143230.1 putative F-box/LRR-repeat protein At3g18150 [Cucumis sativus]4.4e-26587.43Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPESVLVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YDGENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCPGLKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETLTF
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L                  IFTIW+LTYVPIPV GWKSVEFRLLFTKWH+PGICSILRNSHWLETLTF
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETLTF

Query:  YIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE
        YIYPGSYSTFLTEEAKWMEAY+FDGK YWKSQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVLEKMVISTK+TLQPIHQYELFKDAVFDQE
Subjt:  YIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE

Query:  DHFTPEELLQFSQKLLTFPRASKSA
        D FTP+ELLQFSQKLLT PRASKSA
Subjt:  DHFTPEELLQFSQKLLTFPRASKSA

XP_008456389.1 PREDICTED: heat stress transcription factor A-4c-like [Cucumis melo]2.5e-14387.09Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTE ERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

XP_008465581.2 PREDICTED: F-box/FBD/LRR-repeat protein At1g78750-like [Cucumis melo]1.4e-16285.8Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L  +S+
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV

XP_038902159.1 heat stress transcription factor A-4c-like [Benincasa hispida]4.9e-14789.07Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MDGS+G+SG APPPFLTKTYEMVDDPMTNSIVSW+QSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RGRTHLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKPIYSHSQS+ +G GAPLSEQER+ELEQKIKTL+QEK ILQSQLQKHENEKEQIGLQIQTICQQLW+MGNQQKQLIGILGAELQ+H+QSKKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEW+E +RDE     KKVKVPPLELM KLELSLGLCEDLLCNVA+VL+EG+ MEVKK GEMRSGVND+FWEQFLTEIPGSS +SEVYLDRRNNV
Subjt:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

TrEMBL top hitse value%identityAlignment
A0A0A0KDY5 F-box domain-containing protein2.1e-26587.43Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPESVLVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YDGENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCPGLKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETLTF
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L                  IFTIW+LTYVPIPV GWKSVEFRLLFTKWH+PGICSILRNSHWLETLTF
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETLTF

Query:  YIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE
        YIYPGSYSTFLTEEAKWMEAY+FDGK YWKSQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVLEKMVISTK+TLQPIHQYELFKDAVFDQE
Subjt:  YIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE

Query:  DHFTPEELLQFSQKLLTFPRASKSA
        D FTP+ELLQFSQKLLT PRASKSA
Subjt:  DHFTPEELLQFSQKLLTFPRASKSA

A0A1S3C373 heat stress transcription factor A-4c-like1.2e-14387.09Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTE ERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

A0A1S3CPK7 F-box/FBD/LRR-repeat protein At1g78750-like6.9e-16385.8Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L  +S+
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISV

A0A5A7UGX2 F-box/FBD/LRR-repeat protein5.7e-25084Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y GENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC GLKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETL-T
        TFMCER IYE+VQMLL KLAE                              IFTIWELTYVPI VTGWKSVEFRLLFTKWH+PGICSILRNS+WLET+ T
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMHATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETL-T

Query:  FYIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE
        FYIYPGSYSTFLTEEAKWMEAY+FDGKDYWKS+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVLEKMVISTKRTLQPIHQYELFKDAV DQE
Subjt:  FYIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQE

Query:  DHFTPEELLQFSQKLLTFPRASKSA
        D FTPEELLQFSQKLLT PRASKSA
Subjt:  DHFTPEELLQFSQKLLTFPRASKSA

A0A5D3BGW6 Heat stress transcription factor A-4c-like1.2e-14387.09Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG+THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRKM K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWK

Query:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTE ERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSGVNDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

SwissProt top hitse value%identityAlignment
O49403 Heat stress transcription factor A-4a1.6e-5234.78Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG+ HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKMWKV----NELLVEE---WTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVK
            +KR+  ++    +E ++EE      V  +   +     +   +E   +LE S+ + E+L+ +  + + + + M                    ++ 
Subjt:  HQQSKKRKMWKV----NELLVEE---WTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVK

Query:  KEGEMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN
         +  ++S                               G ND FW+QF +E PGS++  EV L+R+++
Subjt:  KEGEMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN

Q93VB5 Heat stress transcription factor A-4d2.8e-5251.09Show/hide
Query:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIH
        G  G  GG PPPFL KTYEMV+D  TN +VSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID E+WEFANE F+RG THLLK+IH
Subjt:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIH

Query:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVN
        RRKP++SHS  N   NG PL+E ER+ELE++I  L  EK+IL + LQ+   ++  I  Q+Q +  +L  M  +QK ++  L   LQ       R+   V+
Subjt:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVN

Query:  ELLVEEWTEVERDEKNNMKKKVKVPPLEL
          L+         E ++  KK +VP ++L
Subjt:  ELLVEEWTEVERDEKNNMKKKVKVPPLEL

Q94J16 Heat stress transcription factor A-4b3.2e-4850Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        M+G  G  GG+ PPFL+KTYEMVDDP T+++V W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D EQWEFANE F++G+ H LK+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK
        IHRRKPI+SHS   SH  GA PL++ ER++ E++I+ L  +   L S+LQ +  +K  +  ++Q + ++L+ + +QQ+ LI          G L + +Q+
Subjt:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK

Query:  HQQSKKRK
            +K++
Subjt:  HQQSKKRK

Q9FK72 Heat stress transcription factor A-4c1.4e-5138.44Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRGR +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK       Q++G  G  L      
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS

Query:  KKRKMWKVNELLVEEWTEVERDEKNNMK----KKVKVPPLELMGKLELSLGL-CEDLLCNVAQVLREGKEMEVKKEGEM-------RSGVNDVFWEQFLT
        ++++ ++ N L     + +E+ EK        + +     E  G    S+     +   ++     +  ++++  E  +       ++GVND FWEQ LT
Subjt:  KKRKMWKVNELLVEEWTEVERDEKNNMK----KKVKVPPLELMGKLELSLGL-CEDLLCNVAQVLREGKEMEVKKEGEM-------RSGVNDVFWEQFLT

Query:  EIPGSSKVSEVYLDRRN-------NVVRGKQTY
        E PGS++  EV  +RR+       N +  ++TY
Subjt:  EIPGSSKVSEVYLDRRN-------NVVRGKQTY

Q9LQM7 Heat stress transcription factor A-1d1.5e-4551.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.0e-4651.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G13980.1 winged-helix DNA-binding transcription factor family protein2.9e-4450.56Show/hide
Query:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHR
        S G   G P PFL KTYEMVDD  T+ IVSWS +  SF+VWN  EF++ LLP YFKHNNFSSF+RQLNTYGFRKID E+WEF N+ F++ + HLLK+IHR
Subjt:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHR

Query:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL
        RKPI+SHS        A  ++QER  L++++  L +EK  ++++L K + +K     Q + + + +  M N+QK+L+  L
Subjt:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL

AT4G17750.1 heat shock factor 11.7e-4446.32Show/hide
Query:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHS
        PPPFL+KTY+MV+DP T++IVSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG+ HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHS

Query:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
         S+S+     LS+ +             +  LE++++ L ++K +L  +L K   +++    ++Q + + L  M  +Q+Q++  L   +Q
Subjt:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G18880.1 heat shock transcription factor A4A1.2e-5334.78Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG+ HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKMWKV----NELLVEE---WTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVK
            +KR+  ++    +E ++EE      V  +   +     +   +E   +LE S+ + E+L+ +  + + + + M                    ++ 
Subjt:  HQQSKKRKMWKV----NELLVEE---WTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVK

Query:  KEGEMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN
         +  ++S                               G ND FW+QF +E PGS++  EV L+R+++
Subjt:  KEGEMRS-------------------------------GVNDVFWEQFLTEIPGSSKVSEVYLDRRNN

AT5G45710.1 winged-helix DNA-binding transcription factor family protein9.8e-5338.44Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRGR +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK       Q++G  G  L      
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQK-------QLIGILGAELQKHQQS

Query:  KKRKMWKVNELLVEEWTEVERDEKNNMK----KKVKVPPLELMGKLELSLGL-CEDLLCNVAQVLREGKEMEVKKEGEM-------RSGVNDVFWEQFLT
        ++++ ++ N L     + +E+ EK        + +     E  G    S+     +   ++     +  ++++  E  +       ++GVND FWEQ LT
Subjt:  KKRKMWKVNELLVEEWTEVERDEKNNMK----KKVKVPPLELMGKLELSLGL-CEDLLCNVAQVLREGKEMEVKKEGEM-------RSGVNDVFWEQFLT

Query:  EIPGSSKVSEVYLDRRN-------NVVRGKQTY
        E PGS++  EV  +RR+       N +  ++TY
Subjt:  EIPGSSKVSEVYLDRRN-------NVVRGKQTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTCCCATCAATCTCATTCCTTCAATCTAATTTCCCATCTTTTCTTCCTTTTTTC
CTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTCTACTTTCACCACGGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATGGATGGCTCAG
ACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAAAGTGGTTTCAGCTTT
GTGGTTTGGAACCCACCGGAATTTGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAACACTTATGGGTTTAGAAAAAT
CGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCAATCTACAGCCATAGCCAGAGCAATA
GCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAGACCATTCTCCAATCCCAATTACAGAAACAC
GAAAACGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAAAAGCAGCTAATTGGGATATTGGGAGCAGAGTTGCA
GAAGCATCAGCAGAGCAAAAAGAGGAAAATGTGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAAGTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGA
AGGTTCCGCCATTAGAGCTGATGGGGAAGCTGGAATTGTCATTGGGGTTGTGTGAGGATTTGCTTTGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTG
AAAAAAGAAGGGGAAATGAGGAGTGGAGTGAATGATGTGTTTTGGGAACAATTCTTGACGGAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTGGATAGAAGGAA
CAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACGGTTTGGTTTCGGTATTTGTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAACAAATAGCCGTCCGTCGT
CGTCTCAAATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTTTCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAAGGTCAAAATGAT
TGGATTAGTCAGCTTCCGGAATCTGTTCTTGTTGACATTCTTTCGTATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTA
TATTCGTTGTCTCTCCTTTGATGAATGTGCATATCATGACCACAGTAGTTATGATGGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTC
ATGTCTTGATACTTCATGAACGTACTACAATTGATGAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCA
TCCAAAGAAAGACGAATGGCAAGTGAGCTTACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCC
AGAAGTAAATTATGAACTGCCTACTAGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGCGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTC
TCACTATGCTTTCGCTCAAGGAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTT
CGTAAGCTGAAGTTGACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTGGTCTCAAATC
ATTAGAACTTGCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATG
AAAAAGTTCAAATGCTGCTGTGGAAACTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATCTGTGAATTCTGGGCTAGCACGCATGCAT
GCAACTCTGCTAATCATCTTCACAATATGGGAGTTGACATATGTGCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACTTCTTTTCACAAAGTGGCACCTGCC
TGGAATATGCAGCATTCTCAGGAATTCACATTGGTTGGAAACACTGACTTTCTACATTTACCCAGGATCATACTCTACATTCTTGACTGAGGAAGCAAAGTGGATGGAGG
CATATGAATTTGATGGTAAAGATTACTGGAAGTCACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACAGTTATGATATACGGATATGTGACTGAGCCATAT
GTGTTGGAGCTTGTAGAATTTCTATTGAAGAATGCCTTGGTCCTTGAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGA
CGCTGTCTTTGATCAGGAGGATCATTTCACTCCTGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTCCAAGTCTGCTCGGTCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTCCCATCAATCTCATTCCTTCAATCTAATTTCCCATCTTTTCTTCCTTTTTTC
CTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTCTACTTTCACCACGGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATGGATGGCTCAG
ACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAAAGTGGTTTCAGCTTT
GTGGTTTGGAACCCACCGGAATTTGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAACACTTATGGGTTTAGAAAAAT
CGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGACGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCAATCTACAGCCATAGCCAGAGCAATA
GCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAGACCATTCTCCAATCCCAATTACAGAAACAC
GAAAACGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAAAAGCAGCTAATTGGGATATTGGGAGCAGAGTTGCA
GAAGCATCAGCAGAGCAAAAAGAGGAAAATGTGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAAGTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGA
AGGTTCCGCCATTAGAGCTGATGGGGAAGCTGGAATTGTCATTGGGGTTGTGTGAGGATTTGCTTTGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTG
AAAAAAGAAGGGGAAATGAGGAGTGGAGTGAATGATGTGTTTTGGGAACAATTCTTGACGGAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTGGATAGAAGGAA
CAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACGGTTTGGTTTCGGTATTTGTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAACAAATAGCCGTCCGTCGT
CGTCTCAAATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTTTCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAAGGTCAAAATGAT
TGGATTAGTCAGCTTCCGGAATCTGTTCTTGTTGACATTCTTTCGTATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTA
TATTCGTTGTCTCTCCTTTGATGAATGTGCATATCATGACCACAGTAGTTATGATGGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTC
ATGTCTTGATACTTCATGAACGTACTACAATTGATGAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCA
TCCAAAGAAAGACGAATGGCAAGTGAGCTTACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCC
AGAAGTAAATTATGAACTGCCTACTAGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGCGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTC
TCACTATGCTTTCGCTCAAGGAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTT
CGTAAGCTGAAGTTGACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTGGTCTCAAATC
ATTAGAACTTGCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATG
AAAAAGTTCAAATGCTGCTGTGGAAACTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATCTGTGAATTCTGGGCTAGCACGCATGCAT
GCAACTCTGCTAATCATCTTCACAATATGGGAGTTGACATATGTGCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACTTCTTTTCACAAAGTGGCACCTGCC
TGGAATATGCAGCATTCTCAGGAATTCACATTGGTTGGAAACACTGACTTTCTACATTTACCCAGGATCATACTCTACATTCTTGACTGAGGAAGCAAAGTGGATGGAGG
CATATGAATTTGATGGTAAAGATTACTGGAAGTCACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACAGTTATGATATACGGATATGTGACTGAGCCATAT
GTGTTGGAGCTTGTAGAATTTCTATTGAAGAATGCCTTGGTCCTTGAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGA
CGCTGTCTTTGATCAGGAGGATCATTTCACTCCTGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTCCAAGTCTGCTCGGTCTTTTTGA
Protein sequenceShow/hide protein sequence
MTASINPSPIFIILSQQSNSHQSHSFNLISHLFFLFSFNFSPFCFYFPLFYFHHGGAAQGFYCCCSEHKMDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSF
VVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGRTHLLKSIHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKH
ENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKMWKVNELLVEEWTEVERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEV
KKEGEMRSGVNDVFWEQFLTEIPGSSKVSEVYLDRRNNVVRGKQTYRTYGLVSVFVPLGFSADIRNSTNSRPSSSQIGAKPCICKQFRKVSCPTVMRSSKRTRSEEGQND
WISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDGENYDGPHYDESFLNLIRHVLILHERTTIDEFHLKFAFNLFNAIHDDHYNSDGYA
SKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSLTMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGL
RKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPGLKSLELAGSIQLVQLKYSSSISDASLYYSRTFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYISVNSGLARMH
ATLLIIFTIWELTYVPIPVTGWKSVEFRLLFTKWHLPGICSILRNSHWLETLTFYIYPGSYSTFLTEEAKWMEAYEFDGKDYWKSQNGDYRGLRKYLKTVMIYGYVTEPY
VLELVEFLLKNALVLEKMVISTKRTLQPIHQYELFKDAVFDQEDHFTPEELLQFSQKLLTFPRASKSARSF