; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G035590 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G035590
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionheat stress transcription factor A-4c-like
Genome locationCicolChr02:31384555..31391844
RNA-Seq ExpressionCcUC02G035590
SyntenyCcUC02G035590
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR001810 - F-box domain
IPR006566 - FBD domain
IPR027725 - Heat shock transcription factor family
IPR032675 - Leucine-rich repeat domain superfamily
IPR036047 - F-box-like domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054510.1 F-box/FBD/LRR-repeat protein [Cucumis melo var. makuwa]6.5e-21676.74Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y  ENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC  LKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYIFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHSQKFTL
        TFMCER IYE+VQMLL KLAE+       W + +   + +G   +   LL      +  H  G+     + +WLE     T+F+ + P      S    L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYIFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHSQKFTL

Query:  TEEAKWMEAYEFDGKDYWESQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPEELLQFS
        TEEAKWMEAY+FDGKDYW+S+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVL+KMVISTKRTLQPIHQYELFKDAV DQEDRFTPEELLQFS
Subjt:  TEEAKWMEAYEFDGKDYWESQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPEELLQFS

Query:  QKLLTFPRAFKSAVAE
        QKLLT PRA KSAV +
Subjt:  QKLLTFPRAFKSAVAE

XP_004143230.1 putative F-box/LRR-repeat protein At3g18150 [Cucumis sativus]4.1e-22678.74Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPESVLVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YD ENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCP LKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFY-----IFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHS
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L +      +V   +A   +    ++   +  H  G+     + HWLE     T    + P      S
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFY-----IFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHS

Query:  QKFTLTEEAKWMEAYEFDGKDYWESQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPE
            LTEEAKWMEAY+FDGK YW+SQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVL+KMVISTK+TLQPIHQYELFKDAVFDQEDRFTP+
Subjt:  QKFTLTEEAKWMEAYEFDGKDYWESQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPE

Query:  ELLQFSQKLLTFPRAFKSAVAE
        ELLQFSQKLLT PRA KSAV +
Subjt:  ELLQFSQKLLTFPRAFKSAVAE

XP_008456389.1 PREDICTED: heat stress transcription factor A-4c-like [Cucumis melo]3.3e-14386.75Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRK+ K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSG NDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

XP_008465581.2 PREDICTED: F-box/FBD/LRR-repeat protein At1g78750-like [Cucumis melo]1.7e-16086.19Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y  ENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC  LKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTIL
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTIL

XP_038902159.1 heat stress transcription factor A-4c-like [Benincasa hispida]1.4e-14688.41Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        MDGS+G+SG APPPFLTKTYEMVDDPMTNSIVSW+QSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKPIYSHSQS+ +G GAPLSEQER+ELEQKIKTL+QEK ILQSQLQKHENEKEQIGLQIQTICQQLW+MGNQQKQLIGILGAELQ+H+QSKKRK+ K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEW+EF+RDE     KKVKVPPLELM KLELSLGLCEDLLCNVA+VL+EG+ MEVKK GEMRSG ND+FWEQFLTEIPGSS +SEVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

TrEMBL top hitse value%identityAlignment
A0A0A0KDY5 F-box domain-containing protein2.0e-22678.74Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SKRTR+EE +NDWISQLPESVLVDILSYLPT+DAVKM LISRFRNLWTYI  LSFDECAYHDH+ YD ENYDGPHYDE FLNLIRHVLILHERT I
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFNAIHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTN YLKELSL GCGIEEKGRI LTSL
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLKEI+LSDKIMGEI++GCPMLEELSLDGCCGL KLKLTTSNIKRL+I +GWRNE++NSRLEISCP LKSLELAG+I LVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFY-----IFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHS
        TFMCER IYEKVQMLLWKLAEV+VFIPCTWT L +      +V   +A   +    ++   +  H  G+     + HWLE     T    + P      S
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFY-----IFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHS

Query:  QKFTLTEEAKWMEAYEFDGKDYWESQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPE
            LTEEAKWMEAY+FDGK YW+SQNGD+R GLRKYLKTV IYGYVTEPYVLEL+EFLLKNALVL+KMVISTK+TLQPIHQYELFKDAVFDQEDRFTP+
Subjt:  QKFTLTEEAKWMEAYEFDGKDYWESQNGDYR-GLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPE

Query:  ELLQFSQKLLTFPRAFKSAVAE
        ELLQFSQKLLT PRA KSAV +
Subjt:  ELLQFSQKLLTFPRAFKSAVAE

A0A1S3C373 heat stress transcription factor A-4c-like1.6e-14386.75Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRK+ K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSG NDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

A0A1S3CPK7 F-box/FBD/LRR-repeat protein At1g78750-like8.5e-16186.19Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y  ENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC  LKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTIL
        TFMCER IYE+VQMLL KLAEV+VFIPCTWT L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTIL

A0A5A7UGX2 F-box/FBD/LRR-repeat protein3.2e-21676.74Show/hide
Query:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI
        +MR SK T+ EEGQNDWISQLPESVLVDILSYLPTKDAVKM LISRFRNLWTYI CLSFDECAYHDH+ Y  ENYDGPHYDE FLNLIRHVLILHERTTI
Subjt:  VMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTI

Query:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL
        DEFHLKFAFNLFN IHDD YNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPT ILTNNYLKELSL GCGIEEKG I L  L
Subjt:  DEFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSL

Query:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR
        + LSLK+I+LSDKIMGEI++GCPMLEELSLDGCCGL KLKL T NIKRL+I +GWRN+++NSRLEISC  LKSLE AG+IQLVQLKYS+SI DASLYYSR
Subjt:  TMLSLKEIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSR

Query:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYIFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHSQKFTL
        TFMCER IYE+VQMLL KLAE+       W + +   + +G   +   LL      +  H  G+     + +WLE     T+F+ + P      S    L
Subjt:  TFMCERMIYEKVQMLLWKLAEVNVFIPCTWTILFYIFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHSQKFTL

Query:  TEEAKWMEAYEFDGKDYWESQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPEELLQFS
        TEEAKWMEAY+FDGKDYW+S+NGD+RGLRKYLKTVMIYGYVTEPYVLEL+EFLLKNALVL+KMVISTKRTLQPIHQYELFKDAV DQEDRFTPEELLQFS
Subjt:  TEEAKWMEAYEFDGKDYWESQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPEELLQFS

Query:  QKLLTFPRAFKSAVAE
        QKLLT PRA KSAV +
Subjt:  QKLLTFPRAFKSAVAE

A0A5D3BGW6 Heat stress transcription factor A-4c-like1.6e-14386.75Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        M  S+G+S GAPPPFLTKTYEMVDDPM+NSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGF+RG THLLKS
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKP+YSHSQS+    GAPLSEQERQELE KIKTL+QEKT L+SQLQKHENEKEQIG QIQ IC++LWRMG+QQKQLIGILGAEL+KH+Q KKRK+ K
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV
        VNELLVEEWTEFERD+    KKKV V PLELMGKLELSL LCEDLLCNVAQVL+EGKEMEVKKEGEMRSG NDVFWE FLTEIPGSSKV+EVYLDRRNNV
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNV

Query:  VR
        VR
Subjt:  VR

SwissProt top hitse value%identityAlignment
O49403 Heat stress transcription factor A-4a1.6e-5234.79Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG  HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKIWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG
            +KR+  ++    +E ++EE        +              + +LE S+ + E+L+ +  + + + + M                    ++  + 
Subjt:  HQQSKKRKIWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG

Query:  EMRS-------------------------------GANDVFWEQFLTEIPGSSKVSEVYLDRRNN
         ++S                               GAND FW+QF +E PGS++  EV L+R+++
Subjt:  EMRS-------------------------------GANDVFWEQFLTEIPGSSKVSEVYLDRRNN

Q93VB5 Heat stress transcription factor A-4d2.1e-5251.09Show/hide
Query:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIH
        G  G  GG PPPFL KTYEMV+D  TN +VSW   G SFVVWNP +F+++LLP YFKHNNFSSF+RQLNTYGFRKID E+WEFANE F+RG THLLK+IH
Subjt:  GSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIH

Query:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWKVN
        RRKP++SHS  N   NG PL+E ER+ELE++I  L  EK+IL + LQ+   ++  I  Q+Q +  +L  M  +QK ++  L   LQ       R+   V+
Subjt:  RRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWKVN

Query:  ELLVEEWTEFERDEKNNMKKKVKVPPLEL
          L+         E ++  KK +VP ++L
Subjt:  ELLVEEWTEFERDEKNNMKKKVKVPPLEL

Q94J16 Heat stress transcription factor A-4b3.2e-4850Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        M+G  G  GG+ PPFL+KTYEMVDDP T+++V W+ +G SFVV N PEF ++LLP YFKHNNFSSFVRQLNTYGFRK+D EQWEFANE F++G  H LK+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK
        IHRRKPI+SHS   SH  GA PL++ ER++ E++I+ L  +   L S+LQ +  +K  +  ++Q + ++L+ + +QQ+ LI          G L + +Q+
Subjt:  IHRRKPIYSHSQSNSHGNGA-PLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLI----------GILGAELQK

Query:  HQQSKKRK
            +K++
Subjt:  HQQSKKRK

Q9FK72 Heat stress transcription factor A-4c4.0e-5137.75Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRG  +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK ++  +   L K   S       
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM-------
           L +E     +R  + N      +PP    +E + KLE SL   E+L+                       ++     +  ++++  E  +       
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM-------

Query:  RSGANDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY
        ++G ND FWEQ LTE PGS++  EV  +RR+       N +  ++TY
Subjt:  RSGANDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY

Q9LQM7 Heat stress transcription factor A-1d1.9e-4551.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG  HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.4e-4651.32Show/hide
Query:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPI
        S  APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA++LLP  FKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG  HLL+SI RRKP 
Subjt:  SGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPI

Query:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
        +   Q    + H NG   S     E  +  LE++++ L ++K +L  +L +   +++    Q+QT+ Q+L  M N+Q+QL+  L   +Q
Subjt:  YSHSQS---NSHGNGAPLS-----EQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G13980.1 winged-helix DNA-binding transcription factor family protein3.8e-4450.56Show/hide
Query:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHR
        S G   G P PFL KTYEMVDD  T+ IVSWS +  SF+VWN  EF++ LLP YFKHNNFSSF+RQLNTYGFRKID E+WEF N+ F++   HLLK+IHR
Subjt:  SDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHR

Query:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL
        RKPI+SHS        A  ++QER  L++++  L +EK  ++++L K + +K     Q + + + +  M N+QK+L+  L
Subjt:  RKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGIL

AT4G17750.1 heat shock factor 12.9e-4446.32Show/hide
Query:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPIYSHS
        PPPFL+KTY+MV+DP T++IVSWS +  SF+VW+PPEF+++LLP YFKHNNFSSFVRQLNTYGFRK+D ++WEFANEGF+RG  HLLK I RRK +  H 
Subjt:  PPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPIYSHS

Query:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ
         S+S+     LS+ +             +  LE++++ L ++K +L  +L K   +++    ++Q + + L  M  +Q+Q++  L   +Q
Subjt:  QSNSHGNGAPLSEQE-------------RQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQ

AT4G18880.1 heat shock transcription factor A4A1.2e-5334.79Show/hide
Query:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSI
        + + G S  + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF+++LLP +FKHNNFSSF+RQLNTYGFRK D EQWEFAN+ FVRG  HL+K+I
Subjt:  DGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSI

Query:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------
        HRRKP++SHS  N      PL++ ER  +  +I+ L +EK  L  +L K + E+E   +Q++ + ++L  M  +QK ++  +   L+K            
Subjt:  HRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQK------------

Query:  HQQSKKRKIWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG
            +KR+  ++    +E ++EE        +              + +LE S+ + E+L+ +  + + + + M                    ++  + 
Subjt:  HQQSKKRKIWKV----NELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLLCNVAQVLREGKEM--------------------EVKKEG

Query:  EMRS-------------------------------GANDVFWEQFLTEIPGSSKVSEVYLDRRNN
         ++S                               GAND FW+QF +E PGS++  EV L+R+++
Subjt:  EMRS-------------------------------GANDVFWEQFLTEIPGSSKVSEVYLDRRNN

AT5G45710.1 winged-helix DNA-binding transcription factor family protein2.9e-5237.75Show/hide
Query:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS
        MD ++G S    PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF+++LLP +FKH NFSSF+RQLNTYGFRK+D E+WEF N+ FVRG  +L+K+
Subjt:  MDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQSGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKS

Query:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK
        IHRRKP++SHS  N      PL+E ER+ +E +I+ L  EK  L ++LQ  E E+++  LQ+ T+  +L  M   QK ++  +   L K   S       
Subjt:  IHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEKTILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWK

Query:  VNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM-------
           L +E     +R  + N      +PP    +E + KLE SL   E+L+                       ++     +  ++++  E  +       
Subjt:  VNELLVEEWTEFERDEKNNMKKKVKVPP----LELMGKLELSLGLCEDLL----------------------CNVAQVLREGKEMEVKKEGEM-------

Query:  RSGANDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY
        ++G ND FWEQ LTE PGS++  EV  +RR+       N +  ++TY
Subjt:  RSGANDVFWEQFLTEIPGSSKVSEVYLDRRN-------NVVRGKQTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTGCCATCAATCTCATTCCTTCAATGTAATTTCCCATCTTTTCTTCCTT
TTTTCCTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTTTACTTTCACCACTGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATG
GATGGCTCAGACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAA
AGTGGTTTCAGCTTTGTGGTTTGGAACCCACCGGAATTCGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAAC
ACTTATGGGTTTAGAAAAATCGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGAGGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCA
ATCTACAGCCATAGCCAGAGCAATAGCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAG
ACCATTCTCCAATCCCAATTACAGAAACACGAAAATGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAA
AAGCAGCTAATTGGGATACTGGGAGCAGAGTTGCAGAAGCATCAGCAGAGCAAAAAGAGGAAAATATGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAA
TTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGAAGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTATCATTGGGGTTGTGTGAGGATTTGCTT
TGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTGAAAAAAGAAGGGGAAATGAGGAGTGGAGCGAATGATGTGTTTTGGGAACAATTCTTGACG
GAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACCGTTTCGTTTCGGTATTT
GTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAATATATAGCCGTCCGTCGTCGTCTCATATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTT
TCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAGGGTCAAAATGATTGGATTAGTCAGCTTCCGGAATCTGTTCTTGTTGACATTCTTTCG
TATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTATATTCGTTGCCTCTCCTTTGATGAATGTGCATATCATGAC
CACAGTAGTTATGATAGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTCATGTCTTGATACTTCATGAACGTACTACAATTGAC
GAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCATCTAAAGAAAGACGAATGGCAAGTGAGCTT
ACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCCAGAAGTAAATTATGAACTGCCTACT
AGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGTGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTCTCACTATGCTTTCGCTCAAG
GAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTTCGTAAGCTGAAGTTG
ACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTAGTCTCAAATCATTAGAACTT
GCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATGAAAAA
GTGCAAATGCTGCTGTGGAAGCTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATTTGTGAATTCTGGACTAGCACGCATGCAT
GCAACTCTGCTTATCATTATTTTGATGCCATCAGATCTTCACAATATGGGAGTTGACATATGTGCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACT
TCTTTTCACAAAGTGGCACCTGCCTGGAATATGCAGCATTCTCAGAAATTCACATTGACTGAGGAAGCAAAGTGGATGGAGGCATATGAATTTGATGGTAAAGAT
TACTGGGAGTCACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACGGTTATGATATACGGATATGTGACTGAGCCATATGTGTTGGAGCTTGTAGAA
TTTCTATTGAAGAATGCCTTGGTCCTCAAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGACGCTGTCTTTGAT
CAGGAGGATCGTTTCACTCCTGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTTCAAGTCTGCTGTTGCTGAGATTGGGGGAGGGCCT
ACTCTTCTAGAATTGCCTTTTGTAATGGAATCAGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCGCGTCTATAAATCCTTCACCCATCTTCATCATTTTATCACAACAATCAAATTGCCATCAATCTCATTCCTTCAATGTAATTTCCCATCTTTTCTTCCTT
TTTTCCTTCAACTTCTCGCCGTTCTGTTTTTACTTTCCCCTGTTTTACTTTCACCACTGTGGCGCCGCCCAGGGATTCTACTGTTGCTGCAGTGAACACAAAATG
GATGGCTCAGACGGAACCTCTGGCGGCGCTCCGCCGCCATTTCTGACCAAAACATATGAGATGGTGGATGATCCGATGACCAATTCCATCGTATCGTGGAGTCAA
AGTGGTTTCAGCTTTGTGGTTTGGAACCCACCGGAATTCGCACAAGAACTACTTCCGATTTATTTCAAACACAACAATTTTTCTAGTTTCGTTCGTCAATTAAAC
ACTTATGGGTTTAGAAAAATCGATCGAGAACAATGGGAATTCGCGAACGAGGGGTTTGTAAGAGGAGGAACCCATCTTCTAAAAAGCATCCATAGAAGAAAACCA
ATCTACAGCCATAGCCAGAGCAATAGCCATGGAAATGGAGCTCCATTATCAGAACAAGAAAGACAAGAACTCGAGCAAAAAATCAAAACCCTTCATCAAGAAAAG
ACCATTCTCCAATCCCAATTACAGAAACACGAAAATGAAAAAGAACAAATTGGGCTTCAAATTCAAACAATCTGCCAGCAATTATGGCGAATGGGGAATCAACAA
AAGCAGCTAATTGGGATACTGGGAGCAGAGTTGCAGAAGCATCAGCAGAGCAAAAAGAGGAAAATATGGAAAGTGAATGAGTTATTAGTTGAAGAATGGACAGAA
TTTGAGAGAGATGAGAAGAATAATATGAAGAAGAAGGTGAAGGTTCCGCCATTGGAGCTGATGGGGAAGCTGGAATTATCATTGGGGTTGTGTGAGGATTTGCTT
TGCAATGTGGCGCAGGTTCTGAGGGAAGGGAAGGAAATGGAAGTGAAAAAAGAAGGGGAAATGAGGAGTGGAGCGAATGATGTGTTTTGGGAACAATTCTTGACG
GAGATTCCAGGGTCGTCCAAAGTTAGTGAAGTTTATTTGGATAGAAGGAACAATGTTGTAAGGGGAAAGCAAACGTACCGTACGTACCGTTTCGTTTCGGTATTT
GTGCCCTTGGGGTTTTCGGCCGATATCCGAAATTCAATATATAGCCGTCCGTCGTCGTCTCATATCGGTGCTAAACCCTGTATCTGTAAGCAATTCCGTAAAGTT
TCTTGCCCAACAGTTATGCGTAGCTCAAAGAGGACGAGGTCTGAAGAGGGTCAAAATGATTGGATTAGTCAGCTTCCGGAATCTGTTCTTGTTGACATTCTTTCG
TATTTACCAACAAAGGATGCTGTAAAGATGACATTGATTTCAAGATTCAGGAATCTTTGGACCTATATTCGTTGCCTCTCCTTTGATGAATGTGCATATCATGAC
CACAGTAGTTATGATAGTGAAAATTATGATGGTCCACATTATGATGAAAGTTTCCTAAATCTAATTCGTCATGTCTTGATACTTCATGAACGTACTACAATTGAC
GAGTTTCATCTCAAATTTGCTTTCAATTTGTTTAATGCTATTCATGATGATCATTATAATTCTGATGGTTATGCATCTAAAGAAAGACGAATGGCAAGTGAGCTT
ACTACATGGATTAAATTTTCTTTGAGGAAACAAGTTAAGGTTCTTGATATTGATTTATTAGGATGTGGTTTGTCAGAGCCAGAAGTAAATTATGAACTGCCTACT
AGTATTTTAACTAATAACTATCTAAAGGAGCTCAGTTTGGCTGGCTGTGGAATCGAGGAAAAAGGGCGTATTCATCTGACGTCTCTCACTATGCTTTCGCTCAAG
GAAATAATATTGAGTGATAAGATTATGGGTGAAATCATTGTAGGATGCCCAATGCTTGAAGAACTTTCTCTTGATGGATGTTGTGGTCTTCGTAAGCTGAAGTTG
ACTACTTCTAATATCAAGAGATTGAGGATTTTTATTGGATGGAGAAATGAAGTGGCAAATTCAAGGTTGGAGATTAGCTGCCCTAGTCTCAAATCATTAGAACTT
GCTGGATCAATACAGCTTGTACAGTTAAAGTATTCATCTTCTATTTCCGACGCCTCCCTTTATTATAGCCGCACCTTCATGTGCGAACGCATGATATATGAAAAA
GTGCAAATGCTGCTGTGGAAGCTAGCTGAAGTCAATGTTTTCATACCATGCACATGGACTATTTTGTTTTATATATTTGTGAATTCTGGACTAGCACGCATGCAT
GCAACTCTGCTTATCATTATTTTGATGCCATCAGATCTTCACAATATGGGAGTTGACATATGTGCCAATTCCTGTCACTGGTTGGAAATCTGTAGAGTTCAGACT
TCTTTTCACAAAGTGGCACCTGCCTGGAATATGCAGCATTCTCAGAAATTCACATTGACTGAGGAAGCAAAGTGGATGGAGGCATATGAATTTGATGGTAAAGAT
TACTGGGAGTCACAGAATGGCGATTACCGTGGCCTCAGAAAGTACCTCAAGACGGTTATGATATACGGATATGTGACTGAGCCATATGTGTTGGAGCTTGTAGAA
TTTCTATTGAAGAATGCCTTGGTCCTCAAAAAGATGGTCATTTCCACCAAGAGGACTCTCCAGCCCATTCACCAATATGAATTATTCAAAGACGCTGTCTTTGAT
CAGGAGGATCGTTTCACTCCTGAGGAACTACTTCAGTTTTCTCAGAAACTGTTAACTTTTCCCAGGGCCTTCAAGTCTGCTGTTGCTGAGATTGGGGGAGGGCCT
ACTCTTCTAGAATTGCCTTTTGTAATGGAATCAGGATAACTTAGCTTTATTCTGCTGCAGTTCATTTTTTGTGTTATCTAAGGAATGACCCCAGAGGGGTTTTTA
TTTATTAATTAATTTGTTTATTTTTTTTTACTTTTGATGTTTTTCATTTGAGTTTACCATGTTTTTATGGTAAAGCTACAGAATATTTGACTAAAAGTGTGTTTG
GATTGACTTTTTAAGTATTTAAATAGGTGTTTATTATTGAAAAAAAATGTGTTCATAAACACCTAGAAAGTCAATCCAAATCGGTCTTAAGATATGTAAATTAGT
ATGTATGAGAGAAGGTCAAGGCCATGATTCTCTAATCAAG
Protein sequenceShow/hide protein sequence
MTASINPSPIFIILSQQSNCHQSHSFNVISHLFFLFSFNFSPFCFYFPLFYFHHCGAAQGFYCCCSEHKMDGSDGTSGGAPPPFLTKTYEMVDDPMTNSIVSWSQ
SGFSFVVWNPPEFAQELLPIYFKHNNFSSFVRQLNTYGFRKIDREQWEFANEGFVRGGTHLLKSIHRRKPIYSHSQSNSHGNGAPLSEQERQELEQKIKTLHQEK
TILQSQLQKHENEKEQIGLQIQTICQQLWRMGNQQKQLIGILGAELQKHQQSKKRKIWKVNELLVEEWTEFERDEKNNMKKKVKVPPLELMGKLELSLGLCEDLL
CNVAQVLREGKEMEVKKEGEMRSGANDVFWEQFLTEIPGSSKVSEVYLDRRNNVVRGKQTYRTYRFVSVFVPLGFSADIRNSIYSRPSSSHIGAKPCICKQFRKV
SCPTVMRSSKRTRSEEGQNDWISQLPESVLVDILSYLPTKDAVKMTLISRFRNLWTYIRCLSFDECAYHDHSSYDSENYDGPHYDESFLNLIRHVLILHERTTID
EFHLKFAFNLFNAIHDDHYNSDGYASKERRMASELTTWIKFSLRKQVKVLDIDLLGCGLSEPEVNYELPTSILTNNYLKELSLAGCGIEEKGRIHLTSLTMLSLK
EIILSDKIMGEIIVGCPMLEELSLDGCCGLRKLKLTTSNIKRLRIFIGWRNEVANSRLEISCPSLKSLELAGSIQLVQLKYSSSISDASLYYSRTFMCERMIYEK
VQMLLWKLAEVNVFIPCTWTILFYIFVNSGLARMHATLLIIILMPSDLHNMGVDICANSCHWLEICRVQTSFHKVAPAWNMQHSQKFTLTEEAKWMEAYEFDGKD
YWESQNGDYRGLRKYLKTVMIYGYVTEPYVLELVEFLLKNALVLKKMVISTKRTLQPIHQYELFKDAVFDQEDRFTPEELLQFSQKLLTFPRAFKSAVAEIGGGP
TLLELPFVMESG