; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021517 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021517
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionH15 domain-containing protein
Genome locationscaffold208:406802..409622
RNA-Seq ExpressionMS021517
SyntenyMS021517
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579248.1 hypothetical protein SDJN03_23696, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19454.2Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAE+GEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED +  ++SKKL I GPRA EVVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
         EDG  A  G+V  + EL+EVQEDEMID+   EEIK   G  DF+    S+NLV++GL AP  IKEI +QS SLG +V  AEE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV ID+ CEKEV+ R  +QD D+ ++S+ + A  L  +E L   G E KCG LREEI     GG            L EV  V +IN  H+V++KS 
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A  TKGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  DEQRLQKRSRG-------------------------RGRGRGRGRGRARV-AQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
        D  +    S+G                         RGRGRGRGRGR R+  QD IS+ +TFSPS++L HQQS  KR  GRPPK+KFDE    KDIS
Subjt:  DEQRLQKRSRG-------------------------RGRGRGRGRGRARV-AQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

KAG7016763.1 hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-19654.51Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAE+GEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED +  ++SKKLKI GPRA EVVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
         EDGD A  G+V  + EL+EVQEDEMID+   EEIK   G  DF+    S+NLV++GL AP  IKEI +QS SLG +V  AEE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV ID+ CEKEV+ R  +QD D+ ++SQ + A  L  +E L   G E KCG LREEI     GG            L EV  V +IN  H+V++KS 
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A  TKGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS++L HQQS  KR  GRPPK+KFDE    KDIS
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

XP_022152264.1 uncharacterized protein LOC111020032, partial [Momordica charantia]0.0e+0099.53Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
        AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR
        VEDGDQAEGGQVEG+GELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR

Query:  CCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK
        CCEKEVECRDGVQDFDKKKRSQNL ATELCAKETLLREGTEEKCGL REEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK
Subjt:  CCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK

Query:  EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF
        EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF
Subjt:  EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF

Query:  DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD
        DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD
Subjt:  DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD

Query:  KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
        KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
Subjt:  KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]9.1e-19954.51Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL+  + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LH+  PSF TPTHLPYASMIQ+AI EVGEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED D  ++SKKL I GPRA EVVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
         EDGD A  GQV  + EL+E QEDEMID+   EEIK      DF+   +S+NLV++GL AP  IKEIE+QS SLG++V  AEE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV ID+ CEKEV+ R  +QD D+K++SQ + A  L A+E L   G E KCG  REEI     GG            L EV  V +IN  H+V++KS 
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QD++VVGLH  +A   KGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER   GSN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  ++DAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS+YLHHQQS  KR  GRPPK+KFDE    KDIS
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]1.4e-20758.05Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD RHSLV+ + RD LFSA+  KY+T+GS  S PF SE+ KS V+ R+HE  PSF TPTHLPYASMIQRAIAE G+EDGLSEESIS+FIVNEYEDLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKS CGRYNF+VE  GVKRK+RRRKS GR+R R +ES D+IEED D K++SKKL IIGPR  EVVTSKGTEEQ+ LLRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
          D D A+GGQV  + EL+E+QEDEMID+   E+IK N GP+DF  + QS  LV++GL AP  I EIE+QSGSLG++V+ AE+ +  KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV I + CEKEV+ RD VQDFD++K+SQN+ A  L A+E L      EKCG LREEID A+E    Q  Q I IY+LKEV  VG+IN HHEV+  SR
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        DG EDFGG +Q QDLVVVGLH  EA TTKGTE+QCSSLR+K+DG EG+ AQ GQTE L K KEV EVEMI  +HEEE Q  +MEEP ERP  GSN E  P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK---HQTDTLKHAEQG
         EEA LEFFDA  NH+N EE  +I DA+ CKKL+EENENLEFFDA+SDH  D  N +I  QSSK  V  EV+++QNRL+E+  SK   +QT   K  E  
Subjt:  SEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK---HQTDTLKHAEQG

Query:  APSTSEAH----------------------------------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------------EQ
         P  S+ H                                        II G  S P     GRGRPR L VQETLA S  +SAQD            +Q
Subjt:  APSTSEAH----------------------------------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------------EQ

Query:  RLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS
        +  K  RGRGRGRGR R    V QDQIS+S+ FSPSK+ HHQQS  K R GRPPK+KF+E    K IS S
Subjt:  RLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS

TrEMBL top hitse value%identityAlignment
A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X31.3e-18254.93Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD RHSL++ +LRD LFSA+  KY+T+G+  SLPF S++ KS ++ RL E  PSF TPTHLPYASMIQRAIAEVGEEDGLSEESIS+FIVNEYEDLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
        +A LRRHLGKLCE+GELVK KCGRYNF+VEDKGVKRK+RRRK+ GRSRYR VES D+IEE  D K++SKKLK+IGPR  EVVTSKG+EEQ+   RE   G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
        VE+ D    GQV  V E K+V+ DEM+D+   E+ K   G + F+ + QS+NLV+LGL AP   KE+E+QSGS G++V   EE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV I + CEKEV+ R G QDFD KK+SQN+ A  L A+E L     EEK G  REEI  A+E G+ Q  Q IMIYELKEV      NG  EV     
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKI-DGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEEL
           EDFGG++Q QDL+VVGLH  EA  TKGTE++CSS R+ + DG EG  AQ GQ EVLDK KEVQ VEMI  + EEE Q   MEEP ER   GS  E  
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKI-DGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEEL

Query:  PSEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTD----------
        P EEATLEFFDAM  H+NAEE  +I DA+ CKKL EENEN EFFDAKSDHG D  N +I  QSSK  V  EV+++QNRL+EQ  SK   D          
Subjt:  PSEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTD----------

Query:  ---------------------TLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-EQR---------
                               KH++Q    TSEA             IIC   SQP  G RG+GRPRKL VQE LA S SS A+D +QR         
Subjt:  ---------------------TLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-EQR---------

Query:  -------------LQKRSRGRGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
                     + ++     RGRGRGRGR RV  QDQ S S   SPSK+L+H+QS  K R GRP K+ FDE +  KDIS
Subjt:  -------------LQKRSRGRGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

A0A6J1DFJ3 uncharacterized protein LOC1110200320.0e+0099.53Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
        AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR
        VEDGDQAEGGQVEG+GELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDR

Query:  CCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK
        CCEKEVECRDGVQDFDKKKRSQNL ATELCAKETLLREGTEEKCGL REEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK
Subjt:  CCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGK

Query:  EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF
        EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF
Subjt:  EQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFF

Query:  DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD
        DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD
Subjt:  DAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGD

Query:  KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
        KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
Subjt:  KSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.1e-19253.45Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LH+  PSF TPTHLPYASMIQ+AIAE+GEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED +  ++SKKL I GP A  VVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
         EDGD A  G+V  + EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IKEI +QS SLG +V  AEE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV ID+ CEKEV+ R  +QD D+K++SQ + A  L  +E L   G E KCG  REEI     GG            L E+  V +IN  H+V++KS 
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A  TKGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  ++DAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QN L+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDI
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS++L HQ S  KR  GRPPK+KFDE    KDI
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDI

A0A6J1JZB4 uncharacterized protein LOC111489634 isoform X21.3e-16953.01Show/hide
Query:  MIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGK
        MIQ+AIAEVGEEDGLSEE IS+FIVNEY+DLPWAH A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IE D+D  
Subjt:  MIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGK

Query:  EQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIK
        ++SKKL I GP A EVVTSKGT+E+N  L E I G EDGD A  GQV  + EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IK
Subjt:  EQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIK

Query:  EIEQQSGSLGKQVRRAEERDHMKGGQL---------REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEE
         IE+QS SLG +V  AEE DH KGGQ+         + DV ID+ CEK+V+ R  +QD D+ ++SQ + A  L A+E L   G E KCGL REEI     
Subjt:  EIEQQSGSLGKQVRRAEERDHMKGGQL---------REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEE

Query:  GGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQ
        GG            L +V  VG+IN  H+V+ KS D AEDFG  +Q QDL+VVGLH  +A TTKGTE+QCSSLR+ + G EG   Q GQTEVL   K  Q
Subjt:  GGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQ

Query:  EVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGK
        EVEMI  +HEEE Q  +MEEP ER    SN EE P EEATL+FFD MPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG+++A  +   Q+SKGK
Subjt:  EVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGK

Query:  VPSEVNDQQNRLQEQLLSK-----------------------------------------------HQTDTLKHAEQGAPSTSEA------------HII
        V  EV ++QNRL+EQ +SK                                               HQT   KH+EQ    TSEA             +I
Subjt:  VPSEVNDQQNRLQEQLLSK-----------------------------------------------HQTDTLKHAEQGAPSTSEA------------HII

Query:  CGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------EQRLQKRSRG-------------------RGRGRGRGRGRARV-AQDQISLSDTFSPS
        C  KSQP +G RGRGRP KL +QET A S SS A D      E +++ R                      RGRGRGRGRGR R+  QD IS+ +TFSPS
Subjt:  CGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------EQRLQKRSRG-------------------RGRGRGRGRGRARV-AQDQISLSDTFSPS

Query:  KYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
        ++LHHQQS  KR  GRPPK+KFDE    KDIS
Subjt:  KYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X14.3e-19453.95Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAEVGEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IE D+D  ++SKKL I GP A EVVTSKGT+E+N  L E I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------
         EDGD A  GQV  + EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IK IE+QS SLG +V  AEE DH KGGQ+        
Subjt:  VEDGDQAEGGQVEGVGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQL--------

Query:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
         + DV ID+ CEK+V+ R  +QD D+ ++SQ + A  L A+E L   G E KCGL REEI     GG            L +V  VG+IN  H+V+ KS 
Subjt:  -REDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELCAKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A TTKGTE+QCSSLR+ + G EG   Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFD MPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG+++A  +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP +G RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEA------------HIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARV-AQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS
        D      E +++ R                      RGRGRGRGRGR R+  QD IS+ +TFSPS++LHHQQS  KR  GRPPK+KFDE    KDIS
Subjt:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARV-AQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDIS

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A5.0e-0640.51Show/hide
Query:  PYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRR
        PY  MI  AI  + ++ G ++ +IS +I  +Y  LP AHA+LL  HL ++ ESGELV  K   +     D   KR R R
Subjt:  PYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRR

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein3.2e-0830.16Show/hide
Query:  TPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELV---KSKCGRYNFEVEDKGVKRKRRRRKS-VGRSRYRGV
        TP H  Y++MI  AI ++ +E G SE++IS+FI ++Y++LP+AH  LL  HL KL E  E++    + C  Y+   E K V     +RKS +   R    
Subjt:  TPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELV---KSKCGRYNFEVEDKGVKRKRRRRKS-VGRSRYRGV

Query:  ESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREE---IGGVEDGDQAEGG-------------QVEGVGELKEVQEDE
         + D++    + +E  + LK   P+ V +     T+ + G  R+    I  +E  D  + G             + EGV E+ +V+  E
Subjt:  ESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREE---IGGVEDGDQAEGG-------------QVEGVGELKEVQEDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCGATCAGCGTCATTCATTAGTATCCAGAAAGCTCAGAGACGACCTCTTCTCCGCCATCACCACCAAATATGCAACCGACGGCAGCAACCCCTCGTTGCCTTTCCCCTC
CGAGAAGCTCAAGTCCGATGTCGAGCGGCGGCTTCATGAGTATCTCCCCTCCTTCCACACTCCTACTCACCTTCCATATGCCTCGATGATACAGAGGGCAATAGCTGAAG
TGGGAGAGGAAGATGGGTTGAGCGAGGAGTCTATTTCGGATTTTATCGTGAATGAATATGAGGATTTGCCATGGGCGCATGCAGCTCTTTTGCGTCGCCATTTGGGGAAG
CTTTGTGAAAGTGGGGAGCTTGTGAAATCGAAATGTGGGAGATATAACTTTGAGGTGGAGGATAAAGGAGTTAAAAGGAAGAGACGGCGGAGAAAGTCAGTGGGAAGGAG
TCGGTACCGAGGAGTGGAGAGTACAGATGATATTGAAGAAGATCTTGATGGGAAAGAGCAATCAAAGAAATTGAAAATCATAGGACCCCGAGCAGTGGAGGTAGTTACAA
GTAAAGGGACTGAAGAACAAAATGGTTTGCTGAGAGAAGAAATTGGTGGGGTTGAAGATGGAGATCAAGCAGAAGGAGGCCAAGTTGAAGGGGTAGGTGAACTGAAAGAA
GTTCAAGAAGATGAAATGATAGACGAGTGTCTTGAAGAGGAAATCAAGATTAACGATGGGCCAGAAGACTTTGATTGGCAGATGCAATCACAGAATCTGGTGGTATTAGG
TCTTTGTGCGCCAGGAAATATTAAAGAGATTGAACAACAAAGTGGTTCGTTGGGGAAACAAGTTCGCAGGGCTGAAGAAAGAGATCACATGAAAGGAGGCCAACTTCGAG
AAGATGTAACAATTGACCGATGTTGTGAAAAGGAAGTCGAGTGTAGAGATGGGGTTCAAGATTTTGATAAGAAAAAGCGATCACAGAATCTGGTGGCTACGGAGCTCTGC
GCAAAGGAGACACTATTGAGGGAAGGGACTGAAGAAAAATGTGGTTTGTTGAGAGAAGAAATTGATGTAGCTGAAGAAGGAGGTCACACACAAAAAAGCCAATTTATAAT
GATATATGAACTTAAAGAAGTTGGAAATGTTGGGTTGATTAATGGTCATCACGAAGTGAAAAGTAAGAGTAGAGATGGGGCTGAAGATTTTGGTGGGAAAGAACAACTGC
AGGATCTTGTGGTTGTTGGACTCCATGTAGGAGAGGCACCAACAACTAAAGGGACTGAAGAACAATGCAGTTCATTAAGAGAAAAAATTGATGGGACTGAAGGAGATGGT
GCACAAGTAGGCCAAACTGAAGTGCTAGATAAACTCAAAGAAGTTCAAGAAGTTGAAATGATTAAAAACTATCATGAGGAGGAAGGACAAAGAGTAGTGATGGAAGAACC
AACAGAGAGACCATACGGGGGATCAAACGTAGAAGAGTTACCTAGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTATGCCTAACCATGCCAATGCTGAAGAAACTTTAC
TGATTGATGCTCAATGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTTTTTGATGCAAAGTCTGACCACGGCAATGATGAGGCGAATGGAATGATTGATGAC
CAATCCTCGAAGGGGAAGGTACCAAGCGAAGTGAATGATCAACAAAATAGACTGCAAGAACAACTGCTATCAAAGCATCAGACCGATACACTGAAGCATGCAGAGCAAGG
GGCACCTAGCACATCTGAGGCACATATTATTTGTGGTGACAAGAGTCAACCTGGTCGGGGACGTCGTGGTCGAGGGAGACCTCGAAAGTTGAATGTACAAGAAACTTTGG
CAGCTTCATTTTCTTCATCTGCTCAAGATGAGCAGCGGCTCCAGAAGCGGTCGAGAGGGAGGGGGAGGGGGAGGGGGAGAGGTCGGGGAAGGGCTCGAGTAGCTCAAGAC
CAGATTTCATTGTCAGACACGTTCTCACCTTCCAAGTATTTGCATCACCAGCAATCTTCTGAAAAGAGACGCCCTGGAAGGCCTCCAAAACGAAAATTTGATGAATATGT
TTCACCTAAGGACATCTCGGCTTCT
mRNA sequenceShow/hide mRNA sequence
TCCGATCAGCGTCATTCATTAGTATCCAGAAAGCTCAGAGACGACCTCTTCTCCGCCATCACCACCAAATATGCAACCGACGGCAGCAACCCCTCGTTGCCTTTCCCCTC
CGAGAAGCTCAAGTCCGATGTCGAGCGGCGGCTTCATGAGTATCTCCCCTCCTTCCACACTCCTACTCACCTTCCATATGCCTCGATGATACAGAGGGCAATAGCTGAAG
TGGGAGAGGAAGATGGGTTGAGCGAGGAGTCTATTTCGGATTTTATCGTGAATGAATATGAGGATTTGCCATGGGCGCATGCAGCTCTTTTGCGTCGCCATTTGGGGAAG
CTTTGTGAAAGTGGGGAGCTTGTGAAATCGAAATGTGGGAGATATAACTTTGAGGTGGAGGATAAAGGAGTTAAAAGGAAGAGACGGCGGAGAAAGTCAGTGGGAAGGAG
TCGGTACCGAGGAGTGGAGAGTACAGATGATATTGAAGAAGATCTTGATGGGAAAGAGCAATCAAAGAAATTGAAAATCATAGGACCCCGAGCAGTGGAGGTAGTTACAA
GTAAAGGGACTGAAGAACAAAATGGTTTGCTGAGAGAAGAAATTGGTGGGGTTGAAGATGGAGATCAAGCAGAAGGAGGCCAAGTTGAAGGGGTAGGTGAACTGAAAGAA
GTTCAAGAAGATGAAATGATAGACGAGTGTCTTGAAGAGGAAATCAAGATTAACGATGGGCCAGAAGACTTTGATTGGCAGATGCAATCACAGAATCTGGTGGTATTAGG
TCTTTGTGCGCCAGGAAATATTAAAGAGATTGAACAACAAAGTGGTTCGTTGGGGAAACAAGTTCGCAGGGCTGAAGAAAGAGATCACATGAAAGGAGGCCAACTTCGAG
AAGATGTAACAATTGACCGATGTTGTGAAAAGGAAGTCGAGTGTAGAGATGGGGTTCAAGATTTTGATAAGAAAAAGCGATCACAGAATCTGGTGGCTACGGAGCTCTGC
GCAAAGGAGACACTATTGAGGGAAGGGACTGAAGAAAAATGTGGTTTGTTGAGAGAAGAAATTGATGTAGCTGAAGAAGGAGGTCACACACAAAAAAGCCAATTTATAAT
GATATATGAACTTAAAGAAGTTGGAAATGTTGGGTTGATTAATGGTCATCACGAAGTGAAAAGTAAGAGTAGAGATGGGGCTGAAGATTTTGGTGGGAAAGAACAACTGC
AGGATCTTGTGGTTGTTGGACTCCATGTAGGAGAGGCACCAACAACTAAAGGGACTGAAGAACAATGCAGTTCATTAAGAGAAAAAATTGATGGGACTGAAGGAGATGGT
GCACAAGTAGGCCAAACTGAAGTGCTAGATAAACTCAAAGAAGTTCAAGAAGTTGAAATGATTAAAAACTATCATGAGGAGGAAGGACAAAGAGTAGTGATGGAAGAACC
AACAGAGAGACCATACGGGGGATCAAACGTAGAAGAGTTACCTAGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTATGCCTAACCATGCCAATGCTGAAGAAACTTTAC
TGATTGATGCTCAATGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTTTTTGATGCAAAGTCTGACCACGGCAATGATGAGGCGAATGGAATGATTGATGAC
CAATCCTCGAAGGGGAAGGTACCAAGCGAAGTGAATGATCAACAAAATAGACTGCAAGAACAACTGCTATCAAAGCATCAGACCGATACACTGAAGCATGCAGAGCAAGG
GGCACCTAGCACATCTGAGGCACATATTATTTGTGGTGACAAGAGTCAACCTGGTCGGGGACGTCGTGGTCGAGGGAGACCTCGAAAGTTGAATGTACAAGAAACTTTGG
CAGCTTCATTTTCTTCATCTGCTCAAGATGAGCAGCGGCTCCAGAAGCGGTCGAGAGGGAGGGGGAGGGGGAGGGGGAGAGGTCGGGGAAGGGCTCGAGTAGCTCAAGAC
CAGATTTCATTGTCAGACACGTTCTCACCTTCCAAGTATTTGCATCACCAGCAATCTTCTGAAAAGAGACGCCCTGGAAGGCCTCCAAAACGAAAATTTGATGAATATGT
TTCACCTAAGGACATCTCGGCTTCT
Protein sequenceShow/hide protein sequence
SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGK
LCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDGDQAEGGQVEGVGELKE
VQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLVATELC
AKETLLREGTEEKCGLLREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDG
AQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDD
QSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRGRGRGRGRGRGRARVAQD
QISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS