; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G19610 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G19610
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionH15 domain-containing protein
Genome locationChr5:20856447..20858674
RNA-Seq ExpressionCSPI05G19610
SyntenyCSPI05G19610
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046320.1 transcription regulatory protein SNF2-like isoform X3 [Cucumis melo var. makuwa]9.6e-24186.51Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEIS SQLSSI PPP+NL+S SS  PHSDHRHSLIAGR RDALFSAVAAKYS  NGTAHSLPF SDQFKSVIDCR+ ENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESIS FI+NEY+DLPWAHSAYLRRHLGKLCENGE+VKLKCGRYNFKVEDKGVKRKK RRK+GGRSRYREV+SADEIEE FDRKKRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        +IGPRVEEVVTS G+EEQSD  REVTVGVE VDHVGEGQ+VV+EQK+VEVDEMVDKQHGEKSKH +GAKVFN K QS+NLVILGL APLANKE+EKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
         GEEV EV EGDH KGGQIQVRGEV+EVQ DV+IHQ CEKEVKSR GFQDFD+KKQSQNV AGN+GAQEA TM WNEEKRGSPREEICGAKE GYD DRQ
Subjt:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK
         IMIYELKEVNG D VEDFGGRKQSQDL++VGLH+KEALMTKGTED+CSSFRKNV DGVEGKHAQAGQ EVLDKFKEVQVE+IDEHPEEEKQGERMEEPK
Subjt:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK

Query:  ELLS
        E  S
Subjt:  ELLS

KAE8648608.1 hypothetical protein Csa_009414 [Cucumis sativus]2.7e-28396Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  LLSSSLTAGGGGVLLSLSVGSERKP
          S           +SL++ +   P
Subjt:  LLSSSLTAGGGGVLLSLSVGSERKP

XP_011655461.2 uncharacterized protein LOC105435537 isoform X1 [Cucumis sativus]3.5e-28399.6Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  LLS
          S
Subjt:  LLS

XP_031741073.1 uncharacterized protein LOC105435537 isoform X2 [Cucumis sativus]3.5e-22799.51Show/hide
Query:  MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK
        MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK
Subjt:  MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK

Query:  KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE
        KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE
Subjt:  KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE

Query:  IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG
        IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG
Subjt:  IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG

Query:  YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE
        YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE
Subjt:  YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE

Query:  RMEEPKELLS
        RMEEPKE  S
Subjt:  RMEEPKELLS

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]2.0e-19372.67Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ Q SSI PPP  L+  SS  P SDHRHSL+AGRFRDALFSAVAAKYS  NG+AHS PF S+QFKSV+DCRIHENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        E G+EDGLSEESIS FI+NEY+DLPWAH A+LRRHLGKLCE+GE+VK  CGRYNFKVE  GVKRKK RRKS GR+R RE++SADEIEEDFDRKKRSKKL+
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPR EEVVTS GTEEQSDL REV VG   VDH   GQ+V+ E +E++ DEM+DK+HGEK K  +G K F  KKQS  LVI+GL AP+A  EIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LGEEV+E E G+ +KGGQIQV GEV+EVQ DV+IHQ CEKEVKSRD  QDFDE+KQSQNV AGN+GAQEA TM  N EK GS REEI GAKE   D DRQ
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELK--------------EVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEV-QVEIIDE
        VI IY+LK              EVN RDG+EDFGG KQSQDLV+VGLH+KEAL TKGTEDQCSS RK V DG EG HAQAGQTE L KFKEV +VE+ID+
Subjt:  VIMIYELK--------------EVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEV-QVEIIDE

Query:  HPEEEKQGERMEEPKE
        H EEE+QGE MEEP E
Subjt:  HPEEEKQGERMEEPKE

TrEMBL top hitse value%identityAlignment
A0A0A0KPL0 H15 domain-containing protein1.1e-303100Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  LLSSSLTAGGGGVLLSLSVGSERKPEIGFHLVLVGR
        LLSSSLTAGGGGVLLSLSVGSERKPEIGFHLVLVGR
Subjt:  LLSSSLTAGGGGVLLSLSVGSERKPEIGFHLVLVGR

A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X34.6e-24186.51Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEIS SQLSSI PPP+NL+S SS  PHSDHRHSLIAGR RDALFSAVAAKYS  NGTAHSLPF SDQFKSVIDCR+ ENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESIS FI+NEY+DLPWAHSAYLRRHLGKLCENGE+VKLKCGRYNFKVEDKGVKRKK RRK+GGRSRYREV+SADEIEE FDRKKRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        +IGPRVEEVVTS G+EEQSD  REVTVGVE VDHVGEGQ+VV+EQK+VEVDEMVDKQHGEKSKH +GAKVFN K QS+NLVILGL APLANKE+EKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
         GEEV EV EGDH KGGQIQVRGEV+EVQ DV+IHQ CEKEVKSR GFQDFD+KKQSQNV AGN+GAQEA TM WNEEKRGSPREEICGAKE GYD DRQ
Subjt:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK
         IMIYELKEVNG D VEDFGGRKQSQDL++VGLH+KEALMTKGTED+CSSFRKNV DGVEGKHAQAGQ EVLDKFKEVQVE+IDEHPEEEKQGERMEEPK
Subjt:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK

Query:  ELLS
        E  S
Subjt:  ELLS

A0A6J1DFJ3 uncharacterized protein LOC1110200329.6e-13859.13Show/hide
Query:  PPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIAEVGEEDGLSEESI
        P   + S  S+P  SD RHSL++ + RD LFSA+  KY+  +G+  SLPF S++ KS ++ R+HE  PSF TPTHLPYASMI RAIAEVGEEDGLSEESI
Subjt:  PPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIAEVGEEDGLSEESI

Query:  SGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLIIIGPRVEEVVTSN
        S FI+NEY+DLPWAH+A LRRHLGKLCE+GE+VK KCGRYNF+VEDKGVKRK+ RRKS GRSRYR V+S D+IEED D K++SKKL IIGPR  EVVTS 
Subjt:  SGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLIIIGPRVEEVVTSN

Query:  GTEEQSDLCREVTVGVEKVDHVGEGQIV-VHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREV-EGD
        GTEEQ+ L RE   GVE  D    GQ+  + E KEV+ DEM+D+   E+ K   G + F+W+ QSQNLV+LGL AP   KEIE+QSGSLG++VR   E D
Subjt:  GTEEQSDLCREVTVGVEKVDHVGEGQIV-VHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREV-EGD

Query:  HTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEV--
        H KGGQ+         + DV I + CEKEV+ RDG QDFD+KK+SQN +A  + A+E       EEK G  REEI  A+E G+    Q IMIYELKEV  
Subjt:  HTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEV--

Query:  ----NG--------RDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME
            NG        RDG EDFGG++Q QDLV+VGLH  EA  TKGTE+QCSS R+ + DG EG  AQ GQTEVLDK KEVQ VE+I  + EEE Q   ME
Subjt:  ----NG--------RDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME

Query:  EPKE
        EP E
Subjt:  EPKE

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X17.6e-16766.27Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ  LS+I  PP+N    SS  PHSDHR+SLIAGRFRDALFSA AAKY+  NG+AHSLPF S+QFKSVI+C +H+NFPSFRTPTHLPYASMI +AIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        E+GEEDGLSEE IS FI+NEYKDLPWAH A+LRRHLGKLCE+GE+VK KCG+YNFKVE K VKRKK RRKS GRSR REV+S DEIEEDF+R KRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        I GP  E VVTS G++EQ++  REV +G E  DH   G++V+ E +EV+ DEM+DK H E+ K+K+GA  FN  K+S+NLVI+GL AP+A KEI KQS S
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LG +V E  EGDH KGGQIQV G+V EVQ DV+I Q CEKEVKSR   QD DEK+QSQ V A N+G QEA  M   E K GS REEI G  E      R+
Subjt:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGR--DGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME
        V MI +  +V  +  D  EDFG  KQSQDL++VGLH+K+AL TKGTEDQCSS RKNV DG EG   QAGQTEVL  FK  Q VE+IDEH EEE+QGE ME
Subjt:  VIMIYELKEVNGR--DGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME

Query:  EPKELLS
        EPKE  S
Subjt:  EPKELLS

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X13.9e-16366.27Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ  LS+I  PP+N    SS  PHSDHR+SLIAGRFRDALFSA AAKY+  NG+AHSLPF S+QFKSVI+C +HENFPSFRTPTHLPYASMI +AIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEE IS FI+NEYKDLPWAH A+LRRHLGKLCE+GE+VK KCG+YNFKVE K VKRKK RRKS GRSR REV+S DEIE D DR KRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        I GP  EEVVTS GT+E++D   EV VG E  DH   GQ+++ E +EV+ DEM+DK H E+ K+K+GA  FN  K+S+NLVI+GL AP+A K IEKQS S
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LG +V E  EGDH KGGQIQV G+V EVQ DV+I Q CEK+VKSR   QD DE +QSQ V A N+GAQEA  M   E K G  REEI G  +      R+
Subjt:  LGEEVREV-EGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYE--LKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME
        V MI +    EV   D  EDFG  KQSQDL++VGLH+K+AL TKGTEDQCSS RKNV  G EG   QAGQTEVL  FK  Q VE+IDEH EEE+QGE ME
Subjt:  VIMIYE--LKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME

Query:  EPKELLS
        EPKE  S
Subjt:  EPKELLS

SwissProt top hitse value%identityAlignment
P23444 Histone H17.9e-0437.88Show/hide
Query:  TPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYK-DLPWAHSAYLRRHLGKLCENGEIVKLK
        +PTHLPYA M+  AI  + E  G S  +I+ F+ +++K  LP      L   L KL   G++ K+K
Subjt:  TPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYK-DLPWAHSAYLRRHLGKLCENGEIVKLK

Q9FYS5 HMG-Y-related protein A7.6e-0737.97Show/hide
Query:  PYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGR
        PY  MIL AI  + ++ G ++ +IS +I  +Y  LP AH++ L  HL ++ E+GE+V LK   +     D   KR +GR
Subjt:  PYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGR

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein5.8e-1031.58Show/hide
Query:  RTPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGR--YNFKVEDKGVKRKKGRRKSGGRS-RYREV
        RTP H  Y++MI  AI ++ +E G SE++IS FI ++YK+LP+AH+  L  HL KL E  EI+   C    Y+   E K V     +RKS   + R  + 
Subjt:  RTPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGR--YNFKVEDKGVKRKKGRRKSGGRS-RYREV

Query:  KSADEIEEDFDRKKRSKKLIIIGPRV----EEVVTSNGTEEQSDLCREVTVGVEKVDHVGEG------QIVVHEQKEVEVDEMVDKQHGE
        ++ADE+    ++++  + L    P+V    E+ +T + T  +   C  + V +E +D    G         V   ++  V E+VD ++ E
Subjt:  KSADEIEEDFDRKKRSKKLIIIGPRV----EEVVTSNGTEEQSDLCREVTVGVEKVDHVGEG------QIVVHEQKEVEVDEMVDKQHGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTTCCCAATCCCAACTCTCTTCCATTCATCCGCCACCCGATAATCTAACGTCTTCCTCTTCTAAGCCACCGCATTCTGATCACCGTCATTCGCTCATTGCGGG
GAGGTTCAGAGACGCCCTCTTCTCCGCCGTCGCTGCCAAATATTCGATCAATAATGGCACCGCCCACTCTTTGCCCTTCCGCTCCGACCAGTTCAAGTCTGTTATCGACT
GCCGAATTCATGAGAACTTTCCCTCCTTTCGAACTCCTACACATCTTCCTTATGCCTCGATGATACTGAGAGCAATTGCTGAAGTTGGAGAGGAAGATGGGTTGAGTGAG
GAGTCAATATCAGGGTTTATCATGAATGAGTACAAGGACTTGCCATGGGCACACTCAGCGTATTTGCGTCGCCATTTGGGGAAGCTCTGTGAAAATGGGGAGATAGTGAA
ATTAAAATGTGGGAGGTATAACTTTAAGGTGGAGGATAAGGGAGTAAAGAGGAAGAAAGGGAGGAGGAAGTCGGGAGGAAGGAGTCGGTACCGAGAAGTGAAGAGTGCTG
ATGAGATAGAAGAGGATTTTGATAGGAAAAAACGATCAAAGAAATTGATCATCATAGGACCCCGTGTGGAGGAGGTGGTCACAAGTAATGGGACTGAAGAACAAAGTGAT
TTGTGTAGAGAGGTGACTGTTGGAGTTGAAAAGGTAGATCACGTTGGAGAAGGGCAAATTGTGGTGCATGAACAAAAAGAAGTTGAAGTAGATGAAATGGTTGACAAACA
GCATGGAGAGAAAAGCAAGCATAAACATGGGGCTAAAGTTTTTAATTGGAAAAAGCAATCTCAAAATTTGGTGATCTTAGGTCTTCGTGCTCCATTAGCTAATAAGGAGA
TTGAGAAACAAAGTGGTTCATTGGGGGAAGAAGTTCGTGAAGTTGAAGGAGATCACACGAAAGGAGGCCAAATTCAAGTGCGTGGTGAAGTTGATGAAGTTCAACCAGAT
GTATTGATTCACCAAACTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTTTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTGGTGGCTGGAAATATCGGTGCACA
GGAGGCACCAACAATGGCATGGAATGAAGAAAAACGTGGTTCGCCGAGAGAAGAAATTTGTGGAGCTAAAGAAAGTGGTTATGACCATGACAGGCAAGTAATAATGATAT
ATGAACTTAAAGAAGTTAATGGTAGAGATGGGGTTGAAGATTTTGGTGGGAGAAAACAATCACAGGATCTAGTGATTGTTGGACTTCATTCAAAAGAGGCACTAATGACT
AAAGGGACTGAAGACCAATGTAGTTCGTTCAGAAAAAATGTTGACGACGGGGTTGAAGGAAAACATGCACAAGCTGGCCAAACTGAAGTGCTAGATAAATTCAAAGAAGT
TCAAGTTGAAATAATTGACGAGCACCCTGAGGAGGAAAAACAAGGAGAAAGGATGGAAGAACCAAAAGAGCTCCTTTCTTCCTCACTCACTGCTGGTGGAGGTGGTGTCC
TGCTCTCGTTGTCAGTTGGCTCTGAAAGAAAGCCAGAAATAGGATTTCATCTTGTGCTTGTTGGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTTCCCAATCCCAACTCTCTTCCATTCATCCGCCACCCGATAATCTAACGTCTTCCTCTTCTAAGCCACCGCATTCTGATCACCGTCATTCGCTCATTGCGGG
GAGGTTCAGAGACGCCCTCTTCTCCGCCGTCGCTGCCAAATATTCGATCAATAATGGCACCGCCCACTCTTTGCCCTTCCGCTCCGACCAGTTCAAGTCTGTTATCGACT
GCCGAATTCATGAGAACTTTCCCTCCTTTCGAACTCCTACACATCTTCCTTATGCCTCGATGATACTGAGAGCAATTGCTGAAGTTGGAGAGGAAGATGGGTTGAGTGAG
GAGTCAATATCAGGGTTTATCATGAATGAGTACAAGGACTTGCCATGGGCACACTCAGCGTATTTGCGTCGCCATTTGGGGAAGCTCTGTGAAAATGGGGAGATAGTGAA
ATTAAAATGTGGGAGGTATAACTTTAAGGTGGAGGATAAGGGAGTAAAGAGGAAGAAAGGGAGGAGGAAGTCGGGAGGAAGGAGTCGGTACCGAGAAGTGAAGAGTGCTG
ATGAGATAGAAGAGGATTTTGATAGGAAAAAACGATCAAAGAAATTGATCATCATAGGACCCCGTGTGGAGGAGGTGGTCACAAGTAATGGGACTGAAGAACAAAGTGAT
TTGTGTAGAGAGGTGACTGTTGGAGTTGAAAAGGTAGATCACGTTGGAGAAGGGCAAATTGTGGTGCATGAACAAAAAGAAGTTGAAGTAGATGAAATGGTTGACAAACA
GCATGGAGAGAAAAGCAAGCATAAACATGGGGCTAAAGTTTTTAATTGGAAAAAGCAATCTCAAAATTTGGTGATCTTAGGTCTTCGTGCTCCATTAGCTAATAAGGAGA
TTGAGAAACAAAGTGGTTCATTGGGGGAAGAAGTTCGTGAAGTTGAAGGAGATCACACGAAAGGAGGCCAAATTCAAGTGCGTGGTGAAGTTGATGAAGTTCAACCAGAT
GTATTGATTCACCAAACTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTTTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTGGTGGCTGGAAATATCGGTGCACA
GGAGGCACCAACAATGGCATGGAATGAAGAAAAACGTGGTTCGCCGAGAGAAGAAATTTGTGGAGCTAAAGAAAGTGGTTATGACCATGACAGGCAAGTAATAATGATAT
ATGAACTTAAAGAAGTTAATGGTAGAGATGGGGTTGAAGATTTTGGTGGGAGAAAACAATCACAGGATCTAGTGATTGTTGGACTTCATTCAAAAGAGGCACTAATGACT
AAAGGGACTGAAGACCAATGTAGTTCGTTCAGAAAAAATGTTGACGACGGGGTTGAAGGAAAACATGCACAAGCTGGCCAAACTGAAGTGCTAGATAAATTCAAAGAAGT
TCAAGTTGAAATAATTGACGAGCACCCTGAGGAGGAAAAACAAGGAGAAAGGATGGAAGAACCAAAAGAGCTCCTTTCTTCCTCACTCACTGCTGGTGGAGGTGGTGTCC
TGCTCTCGTTGTCAGTTGGCTCTGAAAGAAAGCCAGAAATAGGATTTCATCTTGTGCTTGTTGGGAGATGA
Protein sequenceShow/hide protein sequence
MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIAEVGEEDGLSE
ESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLIIIGPRVEEVVTSNGTEEQSD
LCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPD
VLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMT
KGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKELLSSSLTAGGGGVLLSLSVGSERKPEIGFHLVLVGR