; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G9655 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G9655
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionH15 domain-containing protein
Genome locationctg1673:4274519..4281498
RNA-Seq ExpressionCucsat.G9655
SyntenyCucsat.G9655
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0006413 - translational initiation (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003743 - translation initiation factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046320.1 transcription regulatory protein SNF2-like isoform X3 [Cucumis melo var. makuwa]0.086.8Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEIS SQLSSI PPP+NL+S SS  PHSDHRHSLIAGR RDALFSAVAAKYS  NGTAHSLPF SDQFKSVIDCR+ ENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESIS FI+NEY+DLPWAHSAYLRRHLGKLCENGE+VKLKCGRYNFKVEDKGVKRKK RRK+GGRSRYREV+SADEIEE FDRKKRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        +IGPRVEEVVTS G+EEQSD  REVTVGVE VDHVGEGQ+VV+EQK+VEVDEMVDKQHGEKSKH +GAKVFN K QS+NLVILGL APLANKE+EKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
         GEEV EVE GDH KGGQIQVRGEV+EVQ DV+IHQ CEKEVKSR GFQDFD+KKQSQNV AGN+GAQEA TM WNEEKRGSPREEICGAKE GYD DRQ
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK
         IMIYELKEVNG D VEDFGGRKQSQDL++VGLH+KEALMTKGTED+CSSFRKNV DGVEGKHAQAGQ EVLDKFKEVQVE+IDEHPEEEKQGERMEEPK
Subjt:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK

Query:  ERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDD
        ERAS+GSIR+PVEEATLEFFDAMSYHSNAEEN VIDDAEGCKKLLEENEN EFFDAKSDHGYDGVNE IGAQSSK+ +LGEVSNKQNRLEE RPSKFSDD
Subjt:  ERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDD

Query:  QTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRY
        QT+ R GCEAEDLQLTKEHSQVRWPS+IT TLAKHSKQ   R +EADKNEKSEALSPEDIICSPSQPWGHRG+GRPRKLKVQE LATSLSS A DGDQRY
Subjt:  QTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRY

Query:  LASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
        LASNVVD EAS  NT  GT HIDQQGLNLPRGRGRG GR RVVRQDQNS S+  SPSKHLNHRQSPGK RGRP
Subjt:  LASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP

KAE8648608.1 hypothetical protein Csa_009414 [Cucumis sativus]0.094.95Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  -----------------------------------------RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDH
                                                 RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDH
Subjt:  -----------------------------------------RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDH

Query:  GYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDII
        GYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDII
Subjt:  GYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDII

Query:  CSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLN
        CSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLN
Subjt:  CSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLN

Query:  HRQSPGKRRGRP
        HRQSPGKRRGRP
Subjt:  HRQSPGKRRGRP

XP_011655461.2 uncharacterized protein LOC105435537 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQT
        RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQT
Subjt:  RASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQT

Query:  KTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLA
        KTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLA
Subjt:  KTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLA

Query:  SNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
        SNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
Subjt:  SNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP

XP_031741073.1 uncharacterized protein LOC105435537 isoform X2 [Cucumis sativus]0.0100Show/hide
Query:  MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK
        MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK
Subjt:  MILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRK

Query:  KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE
        KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE
Subjt:  KRSKKLIIIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKE

Query:  IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG
        IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG
Subjt:  IEKQSGSLGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESG

Query:  YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE
        YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE
Subjt:  YDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGE

Query:  RMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPS
        RMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPS
Subjt:  RMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPS

Query:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHD
        KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHD
Subjt:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHD

Query:  GDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
        GDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
Subjt:  GDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]0.071.3Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ Q SSI PPP  L+  SS  P SDHRHSL+AGRFRDALFSAVAAKYS  NG+AHS PF S+QFKSV+DCRIHENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        E G+EDGLSEESIS FI+NEY+DLPWAH A+LRRHLGKLCE+GE+VK  CGRYNFKVE  GVKRKK RRKS GR+R RE++SADEIEEDFDRKKRSKKL+
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPR EEVVTS GTEEQSDL REV VG   VDH   GQ+V+ E +E++ DEM+DK+HGEK K  +G K F  KKQS  LVI+GL AP+A  EIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LGEEV+E E G+ +KGGQIQV GEV+EVQ DV+IHQ CEKEVKSRD  QDFDE+KQSQNV AGN+GAQEA TM  N EK GS REEI GAKE   D DRQ
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEV--------------NGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEV-QVEIIDE
        VI IY+LKEV              N RDG+EDFGG KQSQDLV+VGLH+KEAL TKGTEDQCSS RK VD G EG HAQAGQTE L KFKEV +VE+ID+
Subjt:  VIMIYELKEV--------------NGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEV-QVEIIDE

Query:  HPEEEKQGERMEEPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVS
        H EEE+QGE MEEP ER S+GS  +  P EEA LEFFDA S HSN EEN VI DAEGCKKL EENENLEFFDA+SDH  D VNE IGAQSSKEM+LGEVS
Subjt:  HPEEEKQGERMEEPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVS

Query:  NKQNRLEE-RPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHR-GRGRPRKLKVQ
        N+QNRLEE RPSK SD+QT  RKG EAED QL+KEH QVRWPS+IT T  KHS+Q   R  EADKNE S+AL P DII  PS PWGH  GRGRPR LKVQ
Subjt:  NKQNRLEE-RPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHR-GRGRPRKLKVQ

Query:  ETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
        ETLATSL + A D D               P+ GDGT HIDQQ L LPRGRGRG GRPR+VRQDQ S S  FSPSKH +H+QSPGKR GRP
Subjt:  ETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP

TrEMBL top hitse value%identityAlignment
A0A0A0KPL0 H15 domain-containing protein0.099.6Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
        LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV
Subjt:  LGEEVREVEGDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQV

Query:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
        IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE
Subjt:  IMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKE

Query:  RAS
          S
Subjt:  RAS

A0A1S3CU41 uncharacterized protein LOC1035045665.18e-30986.76Show/hide
Query:  MVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFD
        MVDKQHGEKSKH +GAKVFN K QS+NLVILGL APLANKE+EKQSGS GEEV EVE GDH KGGQIQVRGEV+EVQ DV+IHQ CEKEVKSR GFQDFD
Subjt:  MVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFD

Query:  EKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFR
        +KKQSQNV AGN+GAQEA TM WNEEKRGSPREEICGAKE GYD DRQ IMIYELKEVNG D VEDFGGRKQSQDL++VGLH+KEALMTKGTED+CSSFR
Subjt:  EKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFR

Query:  KNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLE
        KNV DGVEGKHAQAGQ EVLDKFKEVQVE+IDEHPEEEKQGERMEEPKERAS+GSIR+PVEEATLEFFDAMSYHSNAEEN VIDDAEGCKKLLEENEN E
Subjt:  KNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLE

Query:  FFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKS
        FFDAKSDHGYDGVNE IGAQSSK+ +LGEVSNKQNRLEE RPSKFSDDQT+ R GCEAEDLQLTKEHSQVRWPS+IT TLAKHSKQ   R +EADKNEKS
Subjt:  FFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKS

Query:  EALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSR
        EALSPEDIICSPSQPWGHRG+GRPRKLKVQE LATSLSS A DGDQRYLASNVVD EAS  NT  GT HIDQQGLNLPRGRGRG GR RVVRQDQNS S+
Subjt:  EALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSR

Query:  TFSPSKHLNHRQSPGKRRGRP
          SPSKHLNHRQSPGK RGRP
Subjt:  TFSPSKHLNHRQSPGKRRGRP

A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X30.086.8Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        MEIS SQLSSI PPP+NL+S SS  PHSDHRHSLIAGR RDALFSAVAAKYS  NGTAHSLPF SDQFKSVIDCR+ ENFPSF+TPTHLPYASMI RAIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEESIS FI+NEY+DLPWAHSAYLRRHLGKLCENGE+VKLKCGRYNFKVEDKGVKRKK RRK+GGRSRYREV+SADEIEE FDRKKRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        +IGPRVEEVVTS G+EEQSD  REVTVGVE VDHVGEGQ+VV+EQK+VEVDEMVDKQHGEKSKH +GAKVFN K QS+NLVILGL APLANKE+EKQSGS
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
         GEEV EVE GDH KGGQIQVRGEV+EVQ DV+IHQ CEKEVKSR GFQDFD+KKQSQNV AGN+GAQEA TM WNEEKRGSPREEICGAKE GYD DRQ
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK
         IMIYELKEVNG D VEDFGGRKQSQDL++VGLH+KEALMTKGTED+CSSFRKNV DGVEGKHAQAGQ EVLDKFKEVQVE+IDEHPEEEKQGERMEEPK
Subjt:  VIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPK

Query:  ERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDD
        ERAS+GSIR+PVEEATLEFFDAMSYHSNAEEN VIDDAEGCKKLLEENEN EFFDAKSDHGYDGVNE IGAQSSK+ +LGEVSNKQNRLEE RPSKFSDD
Subjt:  ERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPSKFSDD

Query:  QTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRY
        QT+ R GCEAEDLQLTKEHSQVRWPS+IT TLAKHSKQ   R +EADKNEKSEALSPEDIICSPSQPWGHRG+GRPRKLKVQE LATSLSS A DGDQRY
Subjt:  QTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRY

Query:  LASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP
        LASNVVD EAS  NT  GT HIDQQGLNLPRGRGRG GR RVVRQDQNS S+  SPSKHLNHRQSPGK RGRP
Subjt:  LASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSPGKRRGRP

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.90e-30864.41Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ  LS+I  PP+N    SS  PHSDHR+SLIAGRFRDALFSA AAKY+  NG+AHSLPF S+QFKSVI+C +H+NFPSFRTPTHLPYASMI +AIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        E+GEEDGLSEE IS FI+NEYKDLPWAH A+LRRHLGKLCE+GE+VK KCG+YNFKVE K VKRKK RRKS GRSR REV+S DEIEEDF+R KRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        I GP  E VVTS G++EQ++  REV +G E  DH   G++V+ E +EV+ DEM+DK H E+ K+K+GA  FN  K+S+NLVI+GL AP+A KEI KQS S
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LG +V E E GDH KGGQIQV G+V EVQ DV+I Q CEKEVKSR   QD DEK+QSQ V A N+G QEA  M   E K GS REEI G  E      R+
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELKEVNGR--DGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME
        V MI +  +V  +  D  EDFG  KQSQDL++VGLH+K+AL TKGTEDQCSS RKNVD G EG   QAGQTEVL  FK  Q VE+IDEH EEE+QGE ME
Subjt:  VIMIYELKEVNGR--DGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME

Query:  EPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPS
        EPKERAS  S  +  P EEATL+FFDAM    +A+EN V+D A+GC+KL EENE+LEFFDAKSDHG +  NE  GAQ+SK  +LGEV NKQN LEE R S
Subjt:  EPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPS

Query:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKIT---------------ETLA-KHSKQVTPRAAEADKNEKSEALSPEDIICSP-SQPWGHRGRGRPRK
        K SDDQT   KGCEAE+ QL+ +H +VRWPS+IT               +T+A KHS+Q     +EADKNE SEAL  +D+ICSP SQP GHRGRGRP K
Subjt:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKIT---------------ETLA-KHSKQVTPRAAEADKNEKSEALSPEDIICSP-SQPWGHRGRGRPRK

Query:  LKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGR--PRVVRQDQNSGSRTFSPSKHLNHRQSPGKR-RGRP
        LK+QET ATSLSS A D DQ++L SNV DRE SGP+    T HIDQQ L LPRGRGRG GR  PR++RQD  S   TFSPS+HL H+ SP KR RGRP
Subjt:  LKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGR--PRVVRQDQNSGSRTFSPSKHLNHRQSPGKR-RGRP

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X11.09e-30564.29Show/hide
Query:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA
        ME SQ  LS+I  PP+N    SS  PHSDHR+SLIAGRFRDALFSA AAKY+  NG+AHSLPF S+QFKSVI+C +HENFPSFRTPTHLPYASMI +AIA
Subjt:  MEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIA

Query:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI
        EVGEEDGLSEE IS FI+NEYKDLPWAH A+LRRHLGKLCE+GE+VK KCG+YNFKVE K VKRKK RRKS GRSR REV+S DEIE D DR KRSKKL 
Subjt:  EVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLI

Query:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS
        I GP  EEVVTS GT+E++D   EV VG E  DH   GQ+++ E +EV+ DEM+DK H E+ K+K+GA  FN  K+S+NLVI+GL AP+A K IEKQS S
Subjt:  IIGPRVEEVVTSNGTEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGS

Query:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ
        LG +V E E GDH KGGQIQV G+V EVQ DV+I Q CEK+VKSR   QD DE +QSQ V A N+GAQEA  M   E K G  REEI G  +      R+
Subjt:  LGEEVREVE-GDHTKGGQIQVRGEVDEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQ

Query:  VIMIYELK--EVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME
        V MI +    EV   D  EDFG  KQSQDL++VGLH+K+AL TKGTEDQCSS RKNV  G EG   QAGQTEVL  FK  Q VE+IDEH EEE+QGE ME
Subjt:  VIMIYELK--EVNGRDGVEDFGGRKQSQDLVIVGLHSKEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQ-VEIIDEHPEEEKQGERME

Query:  EPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPS
        EPKERAS  S  +  P EEATL+FFD M    +A+EN VID A+GC+KL EENE+LEFFDAKSDHG +   E  GAQ+SK  +LGEV NKQNRLEE R S
Subjt:  EPKERASVGSIRD--PVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLLEENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEE-RPS

Query:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKIT----------------ETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSP-SQPWGHRGRGRPRK
        K SDDQT+  KGCEAE+ QL+ +H +VRWPS+IT                 T  KHS+Q     +EADKNE SEAL  +D+ICSP SQP GHRGRGRP K
Subjt:  KFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKIT----------------ETLAKHSKQVTPRAAEADKNEKSEALSPEDIICSP-SQPWGHRGRGRPRK

Query:  LKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGR--PRVVRQDQNSGSRTFSPSKHLNHRQSPGKR-RGRP
        LK+QET ATSLSS A D DQ++L S V DRE SGP+    T HIDQQ L LPRGRGRG GR  PR++RQD  S   TFSPS+HL+H+QSP KR RGRP
Subjt:  LKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGR--PRVVRQDQNSGSRTFSPSKHLNHRQSPGKR-RGRP

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A1.1e-0637.97Show/hide
Query:  PYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGR
        PY  MIL AI  + ++ G ++ +IS +I  +Y  LP AH++ L  HL ++ E+GE+V LK   +     D   KR +GR
Subjt:  PYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGR

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein8.4e-1031.58Show/hide
Query:  RTPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGR--YNFKVEDKGVKRKKGRRKSGGRS-RYREV
        RTP H  Y++MI  AI ++ +E G SE++IS FI ++YK+LP+AH+  L  HL KL E  EI+   C    Y+   E K V     +RKS   + R  + 
Subjt:  RTPTHLPYASMILRAIAEVGEEDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGR--YNFKVEDKGVKRKKGRRKSGGRS-RYREV

Query:  KSADEIEEDFDRKKRSKKLIIIGPRV----EEVVTSNGTEEQSDLCREVTVGVEKVDHVGEG------QIVVHEQKEVEVDEMVDKQHGE
        ++ADE+    ++++  + L    P+V    E+ +T + T  +   C  + V +E +D    G         V   ++  V E+VD ++ E
Subjt:  KSADEIEEDFDRKKRSKKLIIIGPRV----EEVVTSNGTEEQSDLCREVTVGVEKVDHVGEG------QIVVHEQKEVEVDEMVDKQHGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAATGCATCACCGCCGAAATGGAGATTTCCCAATCCCAACTCTCTTCCATTCATCCGCCACCCGATAATCTAACGTCTTCCTCTTCTAAGCCACCGCATTCTGATCACCG
TCATTCGCTCATTGCGGGGAGGTTCAGAGACGCCCTCTTCTCCGCCGTCGCTGCCAAATATTCGATCAATAATGGCACCGCCCACTCTTTGCCCTTCCGCTCCGACCAGT
TCAAGTCTGTTATCGACTGCCGAATTCATGAGAACTTTCCCTCCTTTCGAACTCCTACACATCTTCCTTATGCCTCGATGATACTGAGAGCAATTGCTGAAGTTGGAGAG
GAAGATGGGTTGAGTGAGGAGTCAATATCAGGGTTTATCATGAATGAGTACAAGGACTTGCCATGGGCACACTCAGCGTATTTGCGTCGCCATTTGGGGAAGCTCTGTGA
AAATGGGGAGATAGTGAAATTAAAATGTGGGAGGTATAACTTTAAGGTGGAGGATAAGGGAGTAAAGAGGAAGAAAGGGAGGAGGAAGTCGGGAGGAAGGAGTCGGTACC
GAGAAGTGAAGAGTGCTGATGAGATAGAAGAGGATTTTGATAGGAAAAAACGATCAAAGAAATTGATCATCATAGGACCCCGTGTGGAGGAGGTGGTCACAAGTAATGGG
ACTGAAGAACAAAGTGATTTGTGTAGAGAGGTGACTGTTGGAGTTGAAAAGGTAGATCACGTTGGAGAAGGGCAAATTGTGGTGCATGAACAAAAAGAAGTTGAAGTAGA
TGAAATGGTTGACAAACAGCATGGAGAGAAAAGCAAGCATAAACATGGGGCTAAAGTTTTTAATTGGAAAAAGCAATCTCAAAATTTGGTGATCTTAGGTCTTCGTGCTC
CATTAGCTAATAAGGAGATTGAGAAACAAAGTGGTTCATTGGGGGAAGAAGTTCGTGAAGTTGAAGGAGATCACACGAAAGGAGGCCAAATTCAAGTGCGTGGTGAAGTT
GATGAAGTTCAACCAGATGTATTGATTCACCAAACTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTTTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTGGTGGC
TGGAAATATCGGTGCACAGGAGGCACCAACAATGGCATGGAATGAAGAAAAACGTGGTTCGCCGAGAGAAGAAATTTGTGGAGCTAAAGAAAGTGGTTATGACCATGACA
GGCAAGTAATAATGATATATGAACTTAAAGAAGTTAATGGTAGAGATGGGGTTGAAGATTTTGGTGGGAGAAAACAATCACAGGATCTAGTGATTGTTGGACTTCATTCA
AAAGAGGCACTAATGACTAAAGGGACTGAAGACCAATGTAGTTCGTTCAGAAAAAATGTTGACGACGGGGTTGAAGGAAAACATGCACAAGCTGGCCAAACTGAAGTGCT
AGATAAATTCAAAGAAGTTCAAGTTGAAATAATTGACGAGCACCCTGAGGAGGAAAAACAAGGAGAAAGGATGGAAGAACCAAAAGAGAGAGCATCCGTGGGATCAATCA
GAGATCCTGTTGAAGAAGCCACTTTGGAGTTTTTTGATGCTATGTCGTACCATAGCAATGCTGAAGAAAATAGAGTGATTGATGATGCTGAAGGTTGCAAGAAGTTACTA
GAGGAAAATGAAAATTTGGAATTCTTTGACGCGAAGTCTGACCATGGATACGATGGGGTGAATGAAACAATTGGTGCTCAATCTTCTAAGGAGATGATACTAGGTGAAGT
TAGCAATAAGCAGAATAGACTGGAAGAACGACCATCCAAGTTTAGTGATGATCAAACTAAAACAAGGAAGGGTTGCGAGGCCGAGGACCTTCAACTAACCAAGGAGCATT
CCCAAGTTAGATGGCCATCTAAAATAACTGAAACCCTAGCGAAGCATTCAAAACAAGTGACGCCTAGAGCTGCTGAGGCAGACAAAAATGAAAAGTCTGAGGCATTATCG
CCTGAAGATATTATTTGTAGTCCAAGTCAACCTTGGGGGCATCGTGGTCGAGGGAGGCCTCGGAAGTTGAAAGTTCAAGAAACTTTGGCAACTTCATTATCCTCATTGGC
TCACGATGGTGACCAGCGATATCTGGCATCAAATGTTGTAGACAGAGAGGCATCTGGCCCAAATACAGGCGATGGCACCCGTCATATTGATCAGCAAGGACTCAACCTGC
CAAGAGGTCGGGGGAGAGGCTGGGGAAGGCCTCGAGTAGTTAGACAAGACCAGAATTCAGGATCACGGACATTCTCACCTTCCAAGCATTTGAACCATCGGCAATCTCCT
GGAAAGAGACGTGGAAGGCCT
mRNA sequenceShow/hide mRNA sequence
CAATGCATCACCGCCGAAATGGAGATTTCCCAATCCCAACTCTCTTCCATTCATCCGCCACCCGATAATCTAACGTCTTCCTCTTCTAAGCCACCGCATTCTGATCACCG
TCATTCGCTCATTGCGGGGAGGTTCAGAGACGCCCTCTTCTCCGCCGTCGCTGCCAAATATTCGATCAATAATGGCACCGCCCACTCTTTGCCCTTCCGCTCCGACCAGT
TCAAGTCTGTTATCGACTGCCGAATTCATGAGAACTTTCCCTCCTTTCGAACTCCTACACATCTTCCTTATGCCTCGATGATACTGAGAGCAATTGCTGAAGTTGGAGAG
GAAGATGGGTTGAGTGAGGAGTCAATATCAGGGTTTATCATGAATGAGTACAAGGACTTGCCATGGGCACACTCAGCGTATTTGCGTCGCCATTTGGGGAAGCTCTGTGA
AAATGGGGAGATAGTGAAATTAAAATGTGGGAGGTATAACTTTAAGGTGGAGGATAAGGGAGTAAAGAGGAAGAAAGGGAGGAGGAAGTCGGGAGGAAGGAGTCGGTACC
GAGAAGTGAAGAGTGCTGATGAGATAGAAGAGGATTTTGATAGGAAAAAACGATCAAAGAAATTGATCATCATAGGACCCCGTGTGGAGGAGGTGGTCACAAGTAATGGG
ACTGAAGAACAAAGTGATTTGTGTAGAGAGGTGACTGTTGGAGTTGAAAAGGTAGATCACGTTGGAGAAGGGCAAATTGTGGTGCATGAACAAAAAGAAGTTGAAGTAGA
TGAAATGGTTGACAAACAGCATGGAGAGAAAAGCAAGCATAAACATGGGGCTAAAGTTTTTAATTGGAAAAAGCAATCTCAAAATTTGGTGATCTTAGGTCTTCGTGCTC
CATTAGCTAATAAGGAGATTGAGAAACAAAGTGGTTCATTGGGGGAAGAAGTTCGTGAAGTTGAAGGAGATCACACGAAAGGAGGCCAAATTCAAGTGCGTGGTGAAGTT
GATGAAGTTCAACCAGATGTATTGATTCACCAAACTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTTTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTGGTGGC
TGGAAATATCGGTGCACAGGAGGCACCAACAATGGCATGGAATGAAGAAAAACGTGGTTCGCCGAGAGAAGAAATTTGTGGAGCTAAAGAAAGTGGTTATGACCATGACA
GGCAAGTAATAATGATATATGAACTTAAAGAAGTTAATGGTAGAGATGGGGTTGAAGATTTTGGTGGGAGAAAACAATCACAGGATCTAGTGATTGTTGGACTTCATTCA
AAAGAGGCACTAATGACTAAAGGGACTGAAGACCAATGTAGTTCGTTCAGAAAAAATGTTGACGACGGGGTTGAAGGAAAACATGCACAAGCTGGCCAAACTGAAGTGCT
AGATAAATTCAAAGAAGTTCAAGTTGAAATAATTGACGAGCACCCTGAGGAGGAAAAACAAGGAGAAAGGATGGAAGAACCAAAAGAGAGAGCATCCGTGGGATCAATCA
GAGATCCTGTTGAAGAAGCCACTTTGGAGTTTTTTGATGCTATGTCGTACCATAGCAATGCTGAAGAAAATAGAGTGATTGATGATGCTGAAGGTTGCAAGAAGTTACTA
GAGGAAAATGAAAATTTGGAATTCTTTGACGCGAAGTCTGACCATGGATACGATGGGGTGAATGAAACAATTGGTGCTCAATCTTCTAAGGAGATGATACTAGGTGAAGT
TAGCAATAAGCAGAATAGACTGGAAGAACGACCATCCAAGTTTAGTGATGATCAAACTAAAACAAGGAAGGGTTGCGAGGCCGAGGACCTTCAACTAACCAAGGAGCATT
CCCAAGTTAGATGGCCATCTAAAATAACTGAAACCCTAGCGAAGCATTCAAAACAAGTGACGCCTAGAGCTGCTGAGGCAGACAAAAATGAAAAGTCTGAGGCATTATCG
CCTGAAGATATTATTTGTAGTCCAAGTCAACCTTGGGGGCATCGTGGTCGAGGGAGGCCTCGGAAGTTGAAAGTTCAAGAAACTTTGGCAACTTCATTATCCTCATTGGC
TCACGATGGTGACCAGCGATATCTGGCATCAAATGTTGTAGACAGAGAGGCATCTGGCCCAAATACAGGCGATGGCACCCGTCATATTGATCAGCAAGGACTCAACCTGC
CAAGAGGTCGGGGGAGAGGCTGGGGAAGGCCTCGAGTAGTTAGACAAGACCAGAATTCAGGATCACGGACATTCTCACCTTCCAAGCATTTGAACCATCGGCAATCTCCT
GGAAAGAGACGTGGAAGGCCT
Protein sequenceShow/hide protein sequence
QCITAEMEISQSQLSSIHPPPDNLTSSSSKPPHSDHRHSLIAGRFRDALFSAVAAKYSINNGTAHSLPFRSDQFKSVIDCRIHENFPSFRTPTHLPYASMILRAIAEVGE
EDGLSEESISGFIMNEYKDLPWAHSAYLRRHLGKLCENGEIVKLKCGRYNFKVEDKGVKRKKGRRKSGGRSRYREVKSADEIEEDFDRKKRSKKLIIIGPRVEEVVTSNG
TEEQSDLCREVTVGVEKVDHVGEGQIVVHEQKEVEVDEMVDKQHGEKSKHKHGAKVFNWKKQSQNLVILGLRAPLANKEIEKQSGSLGEEVREVEGDHTKGGQIQVRGEV
DEVQPDVLIHQTCEKEVKSRDGFQDFDEKKQSQNVVAGNIGAQEAPTMAWNEEKRGSPREEICGAKESGYDHDRQVIMIYELKEVNGRDGVEDFGGRKQSQDLVIVGLHS
KEALMTKGTEDQCSSFRKNVDDGVEGKHAQAGQTEVLDKFKEVQVEIIDEHPEEEKQGERMEEPKERASVGSIRDPVEEATLEFFDAMSYHSNAEENRVIDDAEGCKKLL
EENENLEFFDAKSDHGYDGVNETIGAQSSKEMILGEVSNKQNRLEERPSKFSDDQTKTRKGCEAEDLQLTKEHSQVRWPSKITETLAKHSKQVTPRAAEADKNEKSEALS
PEDIICSPSQPWGHRGRGRPRKLKVQETLATSLSSLAHDGDQRYLASNVVDREASGPNTGDGTRHIDQQGLNLPRGRGRGWGRPRVVRQDQNSGSRTFSPSKHLNHRQSP
GKRRGRP