; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0963 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0963
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionH15 domain-containing protein
Genome locationMC04:17490917..17496404
RNA-Seq ExpressionMC04g0963
SyntenyMC04g0963
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR017956 - AT hook, DNA-binding motif
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016763.1 hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyrosperma]8.23e-24354.95Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAE+GEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED +  ++SKKLKI GPRA EVVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
         EDGD A  G+V  L EL+EVQEDEMID+   EEIK   G  DF+    S+NLV++GL AP  IKEI +QS SLG +V  AEE DH KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNL-ATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV ID+ CEKEV+ R  +QD D+ ++SQ + A  L  +E L   G E KCG  REEI     GG            L EV  V +IN  H+V++KS 
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNL-ATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A  TKGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS++LH QQS  KR  GRPPK+KFDE    KDIS +
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS

Query:  LENEQPERKCHGRGRGRGR
        LEN+Q ERK  GRGRGRGR
Subjt:  LENEQPERKCHGRGRGRGR

XP_022152264.1 uncharacterized protein LOC111020032, partial [Momordica charantia]0.099.7Show/hide
Query:  MVEMKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES
        M  MKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES
Subjt:  MVEMKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES

Query:  ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS
        ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS
Subjt:  ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS

Query:  KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER
        KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER
Subjt:  KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER

Query:  DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE
        DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE
Subjt:  DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE

Query:  VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS
        VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS
Subjt:  VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS

Query:  NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE
        NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE
Subjt:  NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE

Query:  QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
        QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
Subjt:  QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG

XP_022993719.1 uncharacterized protein LOC111489634 isoform X1 [Cucurbita maxima]8.43e-24154.34Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAEVGEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IE D+D  ++SKKL I GP A EVVTSKGT+E+N  L E I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
         EDGD A  GQV  L EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IK IE+QS SLG +V  AEE DH KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV ID+ CEK+V+ R  +QD D+ ++SQ +A   L A+E L   G E KCGLSREEI     GG            L +V  VG+IN  H+V+ KS 
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A TTKGTE+QCSSLR+ + G EG   Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFD MPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG+++A  +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP +G RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASL
        D      E +++ R                      RGRGRGRGRGR R+  QD IS+ +TFSPS++LHHQQS  KR  GRPPK+KFDE    KDIS +L
Subjt:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASL

Query:  ENEQPERKCHGRGRGRG----RPPREREQE
        EN+Q ERK  GRGRGRG    RP R R++E
Subjt:  ENEQPERKCHGRGRGRG----RPPREREQE

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]2.35e-24755.07Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL+  + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LH+  PSF TPTHLPYASMIQ+AI EVGEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED D  ++SKKL I GPRA EVVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
         EDGD A  GQV  L EL+E QEDEMID+   EEIK      DF+   +S+NLV++GL AP  IKEIE+QS SLG++V  AEE DH KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV ID+ CEKEV+ R  +QD D+K++SQ +A   L A+E L   G E KCG SREEI     GG            L EV  V +IN  H+V++KS 
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QD++VVGLH  +A   KGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER   GSN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  ++DAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS+YLHHQQS  KR  GRPPK+KFDE    KDIS +
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS

Query:  LENEQPERKCHGRGRGRGR
        +EN+Q ERK  GRGRGRGR
Subjt:  LENEQPERKCHGRGRGRGR

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]1.26e-25958.4Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD RHSLV+ + RD LFSA+  KY+T+GS  S PF SE+ KS V+ R+HE  PSF TPTHLPYASMIQRAIAE G+EDGLSEESIS+FIVNEYEDLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKS CGRYNF+VE  GVKRK+RRRKS GR+R R +ES D+IEED D K++SKKL IIGPR  EVVTSKGTEEQ+ LLRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
          D D A+GGQV  L EL+E+QEDEMID+   E+IK N GP+DF  + QS  LV++GL AP  I EIE+QSGSLG++V+ AE+ +  KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV I + CEKEV+ RD VQDFD++K+SQN+A   L A+E L      EKCG  REEID A+E    Q  Q I IY+LKEV  VG+IN HHEV+  SR
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        DG EDFGG +Q QDLVVVGLH  EA TTKGTE+QCSSLR+K+DG EG+ AQ GQTE L K KEV EVEMI  +HEEE Q  +MEEP ERP  GSN E  P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK---HQTDTLKHAEQG
         EEA LEFFDA  NH+N EE  +I DA+ CKKL+EENENLEFFDA+SDH  D  N +I  QSSK  V  EV+++QNRL+E+  SK   +QT   K  E  
Subjt:  SEEATLEFFDAMPNHANAEETLLI-DAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK---HQTDTLKHAEQG

Query:  APSTSEAH----------------------------------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-------------E
         P  S+ H                                        II G  S P     GRGRPR L VQETLA S  +SAQD             +
Subjt:  APSTSEAH----------------------------------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-------------E

Query:  QRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQPERKCHGRGRGRGRPPREREQE
        QRL K  RGRGRGRGR R    V QDQIS+S+ FSPSK+ HHQQS  KR  GRPPK+KF+E    K IS SLENEQ E +  GRGRG GRP R+R++E
Subjt:  QRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQPERKCHGRGRGRGRPPREREQE

TrEMBL top hitse value%identityAlignment
A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X33.79e-22054.3Show/hide
Query:  KRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDF
        + ++S  S    SD RHSL++ +LRD LFSA+  KY+T+G+  SLPF S++ KS ++ RL E  PSF TPTHLPYASMIQRAIAEVGEEDGLSEESIS+F
Subjt:  KRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDF

Query:  IVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTE
        IVNEYEDLPWAH+A LRRHLGKLCE+GELVK KCGRYNF+VEDKGVKRK+RRRK+ GRSRYR VES D+IEE  D K++SKKLK+IGPR  EVVTSKG+E
Subjt:  IVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTE

Query:  EQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMK
        EQ+   RE   GVE+ D    GQV  + E K+V+ DEM+D+   E+ K   G + F+ + QS+NLV+LGL AP   KE+E+QSGS G++V   EE DH K
Subjt:  EQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMK

Query:  GGQLR---------EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGL
        GGQ++          DV I + CEKEV+ R G QDFD KK+SQN+A   L A+E L     EEK G  REEI  A+E G+ Q  Q IMIYELKEV     
Subjt:  GGQLR---------EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGL

Query:  INGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKI-DGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPT
         NG  EV        EDFGG++Q QDL+VVGLH  EA  TKGTE++CSS R+ + DG EG  AQ GQ EVLDK KEVQ VEMI  + EEE Q   MEEP 
Subjt:  INGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKI-DGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPT

Query:  ERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLID-AQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQ
        ER   GS  E  P EEATLEFFDAM  H+NAEE  +ID A+ CKKL EENEN EFFDAKSDHG D  N +I  QSSK  V  EV+++QNRL+EQ  SK  
Subjt:  ERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLID-AQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQ

Query:  TD------------------------------TL-KHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-
         D                              TL KH++Q    TSEA             IIC   SQP  G RG+GRPRKL VQE LA S SS A+D 
Subjt:  TD------------------------------TL-KHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD-

Query:  EQR----------------------LQKRSRGRGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQ
        +QR                      + ++     RGRGRGRGR RV  QDQ S S   SPSK+L+H+QS  K R GRP K+ FDE +  KDIS  LEN+ 
Subjt:  EQR----------------------LQKRSRGRGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQ

Query:  PERK-CHGRGRGRG
         E K   GRG G G
Subjt:  PERK-CHGRGRGRG

A0A6J1DFJ3 uncharacterized protein LOC1110200320.099.7Show/hide
Query:  MVEMKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES
        M  MKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES
Subjt:  MVEMKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEES

Query:  ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS
        ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS
Subjt:  ISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTS

Query:  KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER
        KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER
Subjt:  KGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEER

Query:  DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE
        DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE
Subjt:  DHMKGGQLREDVTIDRCCEKEVECRDGVQDFDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHE

Query:  VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS
        VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS
Subjt:  VKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGS

Query:  NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE
        NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE
Subjt:  NVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAE

Query:  QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
        QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG
Subjt:  QGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRG

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X12.87e-23853.97Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LH+  PSF TPTHLPYASMIQ+AIAE+GEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IEED +  ++SKKL I GP A  VVTSKG++EQN  LRE I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
         EDGD A  G+V  L EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IKEI +QS SLG +V  AEE DH KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNL-ATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV ID+ CEKEV+ R  +QD D+K++SQ + A  L  +E L   G E KCG SREEI     GG            L E+  V +IN  H+V++KS 
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNL-ATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A  TKGTE+QCSSLR+ +DG EGD  Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFDAMPN  +A+E  ++DAQ C+KL+EENE+LEFFDAKSDHG++EAN +   Q+SKGKV  EV ++QN L+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP RG RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS
        D                           +Q+  K  RGRGRGRGRGR R  + QD IS+ +TFSPS++LH Q S  KR  GRPPK+KFDE    KDI  +
Subjt:  D---------------------------EQRLQKRSRGRGRGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISAS

Query:  LENEQPERKCHGRGRGRGR
        LEN+Q ERK  G GRGRGR
Subjt:  LENEQPERKCHGRGRGRGR

A0A6J1JZB4 uncharacterized protein LOC111489634 isoform X21.53e-21053.46Show/hide
Query:  MIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGK
        MIQ+AIAEVGEEDGLSEE IS+FIVNEY+DLPWAH A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IE D+D  
Subjt:  MIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGK

Query:  EQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIK
        ++SKKL I GP A EVVTSKGT+E+N  L E I G EDGD A  GQV  L EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IK
Subjt:  EQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIK

Query:  EIEQQSGSLGKQVRRAEERDHMKGGQLR---------EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEE
         IE+QS SLG +V  AEE DH KGGQ++          DV ID+ CEK+V+ R  +QD D+ ++SQ +A   L A+E L   G E KCGLSREEI     
Subjt:  EIEQQSGSLGKQVRRAEERDHMKGGQLR---------EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEE

Query:  GGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQ
        GG            L +V  VG+IN  H+V+ KS D AEDFG  +Q QDL+VVGLH  +A TTKGTE+QCSSLR+ + G EG   Q GQTEVL   K  Q
Subjt:  GGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQ

Query:  EVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGK
        EVEMI  +HEEE Q  +MEEP ER    SN EE P EEATL+FFD MPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG+++A  +   Q+SKGK
Subjt:  EVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGK

Query:  VPSEVNDQQNRLQEQLLSK-----------------------------------------------HQTDTLKHAEQGAPSTSEAH------------II
        V  EV ++QNRL+EQ +SK                                               HQT   KH+EQ    TSEA             +I
Subjt:  VPSEVNDQQNRLQEQLLSK-----------------------------------------------HQTDTLKHAEQGAPSTSEAH------------II

Query:  CGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPS
        C  KSQP +G RGRGRP KL +QET A S SS A D      E +++ R                      RGRGRGRGRGR R+  QD IS+ +TFSPS
Subjt:  CGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQD------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPS

Query:  KYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQPERKCHGRGRGRG----RPPREREQE
        ++LHHQQS  KR  GRPPK+KFDE    KDIS +LEN+Q ERK  GRGRGRG    RP R R++E
Subjt:  KYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQPERKCHGRGRGRG----RPPREREQE

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X14.08e-24154.34Show/hide
Query:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH
        SD R+SL++ + RD LFSA   KYAT+GS  SLPFPSE+ KS +E  LHE  PSF TPTHLPYASMIQ+AIAEVGEEDGLSEE IS+FIVNEY+DLPWAH
Subjt:  SDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAH

Query:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG
         A LRRHLGKLCESGELVKSKCG+YNF+VE K VKRK+RRRKS GRSR R VES D+IE D+D  ++SKKL I GP A EVVTSKGT+E+N  L E I G
Subjt:  AALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGG

Query:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------
         EDGD A  GQV  L EL+EVQEDEMID+   EEIK   G  DF+   +S+NLV++GL AP  IK IE+QS SLG +V  AEE DH KGGQ++       
Subjt:  VEDGDQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLR-------

Query:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR
           DV ID+ CEK+V+ R  +QD D+ ++SQ +A   L A+E L   G E KCGLSREEI     GG            L +V  VG+IN  H+V+ KS 
Subjt:  --EDVTIDRCCEKEVECRDGVQDFDKKKRSQNLAT-ELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSR

Query:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP
        D AEDFG  +Q QDL+VVGLH  +A TTKGTE+QCSSLR+ + G EG   Q GQTEVL   K  QEVEMI  +HEEE Q  +MEEP ER    SN EE P
Subjt:  DGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQCSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELP

Query:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------
         EEATL+FFD MPN  +A+E  +IDAQ C+KL+EENE+LEFFDAKSDHG+++A  +   Q+SKGKV  EV ++QNRL+EQ +SK                
Subjt:  SEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDAKSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSK----------------

Query:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ
                                       HQT   KH+EQ    TSEA             +IC  KSQP +G RGRGRP KL +QET A S SS A 
Subjt:  -------------------------------HQTDTLKHAEQGAPSTSEAH------------IICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQ

Query:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASL
        D      E +++ R                      RGRGRGRGRGR R+  QD IS+ +TFSPS++LHHQQS  KR  GRPPK+KFDE    KDIS +L
Subjt:  D------EQRLQKRSRG-------------------RGRGRGRGRGRARVA-QDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASL

Query:  ENEQPERKCHGRGRGRG----RPPREREQE
        EN+Q ERK  GRGRGRG    RP R R++E
Subjt:  ENEQPERKCHGRGRGRG----RPPREREQE

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A5.3e-0640.51Show/hide
Query:  PYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRR
        PY  MI  AI  + ++ G ++ +IS +I  +Y  LP AHA+LL  HL ++ ESGELV  K   +     D   KR R R
Subjt:  PYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRR

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein3.4e-0829.84Show/hide
Query:  TPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELV---KSKCGRYNFEVEDKGVKRKRRRRKS-VGRSRYRGV
        TP H  Y++MI  AI ++ +E G SE++IS+FI ++Y++LP+AH  LL  HL KL E  E++    + C  Y+   E K V     +RKS +   R    
Subjt:  TPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYEDLPWAHAALLRRHLGKLCESGELV---KSKCGRYNFEVEDKGVKRKRRRRKS-VGRSRYRGV

Query:  ESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREE---IGGVEDGDQAEGGQVEGLGE-LKEVQEDEMIDECLEEEIKIND
         + D++    + +E  + LK   P+ V +     T+ + G  R+    I  +E  D  + G   GL +   ++   E + E ++ E   N+
Subjt:  ESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREE---IGGVEDGDQAEGGQVEGLGE-LKEVQEDEMIDECLEEEIKIND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAGATGAAGAGAGTAAACTCGGTTGTCTCGAGGCCTGATGTTTCCGATCAGCGTCATTCATTAGTATCCAGAAAGCTCAGAGACGACCTCTTCTCCGCCATCAC
CACCAAATATGCAACCGACGGCAGCAACCCCTCGTTGCCTTTCCCCTCCGAGAAGCTCAAGTCCGATGTCGAGCGGCGGCTTCATGAGTATCTCCCCTCCTTCCACACTC
CTACTCACCTTCCATATGCCTCGATGATACAGAGGGCAATAGCTGAAGTGGGAGAGGAAGATGGGTTGAGCGAGGAGTCTATTTCGGATTTTATCGTGAATGAATATGAG
GATTTGCCATGGGCGCATGCAGCTCTTTTGCGTCGCCATTTGGGGAAGCTTTGTGAAAGTGGGGAGCTTGTGAAATCGAAATGTGGGAGATATAACTTTGAGGTGGAGGA
TAAAGGAGTTAAAAGGAAGAGACGGCGGAGAAAGTCAGTGGGAAGGAGTCGGTACCGAGGAGTGGAGAGTACAGATGATATTGAAGAAGATCTTGATGGGAAAGAGCAAT
CAAAGAAATTGAAAATCATAGGACCCCGAGCAGTGGAGGTAGTTACAAGTAAAGGGACTGAAGAACAAAATGGTTTGCTGAGAGAAGAAATTGGTGGGGTTGAAGATGGA
GATCAAGCAGAAGGAGGCCAAGTTGAAGGGCTAGGTGAACTGAAAGAAGTTCAAGAAGATGAAATGATAGACGAGTGTCTTGAAGAGGAAATCAAGATTAACGATGGGCC
AGAAGACTTTGATTGGCAGATGCAATCACAGAATCTGGTGGTATTAGGTCTTTGTGCGCCAGGAAATATTAAAGAGATTGAACAACAAAGTGGTTCGTTGGGGAAACAAG
TTCGCAGGGCTGAAGAAAGAGATCACATGAAAGGAGGCCAACTTCGAGAAGATGTAACAATTGACCGATGTTGTGAAAAGGAAGTCGAGTGTAGAGATGGGGTTCAAGAT
TTTGATAAGAAAAAGCGATCACAGAATCTGGCTACGGAGCTCTGCGCAAAGGAGACACTATTGAGGGAAGGGACTGAAGAAAAATGTGGTTTGTCGAGAGAAGAAATTGA
TGTAGCCGAAGAAGGAGGTCACACACAAAAAAGCCAATTTATAATGATATATGAACTTAAAGAAGTTGGAAATGTTGGGTTGATTAATGGTCATCACGAAGTGAAAAGTA
AGAGTAGAGATGGGGCTGAAGATTTTGGTGGGAAAGAACAACTGCAGGATCTTGTGGTTGTTGGACTCCATGTAGGAGAGGCACCAACAACTAAAGGGACTGAAGAACAA
TGCAGTTCATTAAGAGAAAAAATTGATGGGACTGAAGGAGATGGTGCACAAGTAGGCCAAACTGAAGTGCTAGATAAACTCAAAGAAGTTCAAGAAGTTGAAATGATTAA
AAACTATCATGAGGAGGAAGGACAAAGAGTAGTGATGGAAGAACCAACTGAGAGACCATACGGGGGATCAAACGTAGAAGAGTTACCTAGTGAAGAAGCCACTTTGGAGT
TCTTTGATGCTATGCCTAACCATGCCAATGCTGAAGAAACTTTACTGATTGATGCTCAATGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTTTTTGATGCA
AAGTCTGACCACGGCAATGATGAGGCGAATGGAATGATTGATGACCAATCCTCGAAGGGGAAGGTACCAAGCGAAGTGAATGATCAACAAAATAGACTGCAAGAACAACT
GCTATCAAAGCATCAGACCGATACACTGAAGCATGCAGAGCAAGGGGCACCTAGCACATCTGAGGCACATATTATTTGTGGTGACAAGAGTCAACCTGGTCGGGGACGTC
GTGGTCGAGGGAGACCTCGAAAGTTGAATGTACAAGAAACTTTGGCAGCTTCATTTTCTTCATCTGCTCAAGATGAGCAGCGGCTCCAGAAGCGGTCGAGAGGGAGGGGG
AGGGGGAGGGGGAGAGGTCGGGGAAGGGCTCGAGTAGCTCAAGACCAGATTTCATTGTCAGACACGTTCTCACCTTCCAAGTATTTGCATCACCAGCAATCTTCTGAAAA
GAGACGCCCTGGAAGGCCTCCAAAACGAAAATTTGATGAATATGTTTCACCTAAGGACATCTCGGCTTCTTTAGAGAATGAGCAGCCAGAAAGGAAATGCCATGGCCGTG
GTCGTGGTCGTGGAAGACCTCCCAGAGAGAGAGAGCAAGAAAACTAA
mRNA sequenceShow/hide mRNA sequence
CTTCTCTCTCTAAAAAAAGGTCAGTCACCACGCACCTCCTCATCTTCTTTCTACGAACCTTGCCTCGGTCGCAATCCAATCGGCCCCAGTAAATCGAACTCCGACGGCGA
CGGCGACGGCGACGGTGGAACGCGGCGGCGACCAACCGACGTTCAGACCCGAATCTCCCTCCATTCTGTCGACATTCTCTATATAGGAGAGAGGGATTCTGAAGGATTGA
TTGAAGCTGAAGATATTTGTTTTTGAGTTGCTTTCTGGGAAAAGAAAAAAAAGTGGTGCAAAGATTATCACAGGTTTGCCGAGTCTTGAAGTTTGAATAAACACGTCTTC
CTCTAGAGAAAGCATATGAGTTGTTTGATGAGAATAGAAGCTGTTTGATGGGGGGCATATGAATTGTTTGGATCATCTTCCAATGTTCGGGATCTTGAGCGGCGTTGTAA
TGAGTCAGAACAACATCTTCAAGAACTTGAACAACAGAGGAAAGTTGATGTACAAGGATTCAAAGATCAAATTATTGAATTAAAAATTTGTTTTGAGCATCGATTTCAAA
TAATTATGGTTGAGATGAAGAGAGTAAACTCGGTTGTCTCGAGGCCTGATGTTTCCGATCAGCGTCATTCATTAGTATCCAGAAAGCTCAGAGACGACCTCTTCTCCGCC
ATCACCACCAAATATGCAACCGACGGCAGCAACCCCTCGTTGCCTTTCCCCTCCGAGAAGCTCAAGTCCGATGTCGAGCGGCGGCTTCATGAGTATCTCCCCTCCTTCCA
CACTCCTACTCACCTTCCATATGCCTCGATGATACAGAGGGCAATAGCTGAAGTGGGAGAGGAAGATGGGTTGAGCGAGGAGTCTATTTCGGATTTTATCGTGAATGAAT
ATGAGGATTTGCCATGGGCGCATGCAGCTCTTTTGCGTCGCCATTTGGGGAAGCTTTGTGAAAGTGGGGAGCTTGTGAAATCGAAATGTGGGAGATATAACTTTGAGGTG
GAGGATAAAGGAGTTAAAAGGAAGAGACGGCGGAGAAAGTCAGTGGGAAGGAGTCGGTACCGAGGAGTGGAGAGTACAGATGATATTGAAGAAGATCTTGATGGGAAAGA
GCAATCAAAGAAATTGAAAATCATAGGACCCCGAGCAGTGGAGGTAGTTACAAGTAAAGGGACTGAAGAACAAAATGGTTTGCTGAGAGAAGAAATTGGTGGGGTTGAAG
ATGGAGATCAAGCAGAAGGAGGCCAAGTTGAAGGGCTAGGTGAACTGAAAGAAGTTCAAGAAGATGAAATGATAGACGAGTGTCTTGAAGAGGAAATCAAGATTAACGAT
GGGCCAGAAGACTTTGATTGGCAGATGCAATCACAGAATCTGGTGGTATTAGGTCTTTGTGCGCCAGGAAATATTAAAGAGATTGAACAACAAAGTGGTTCGTTGGGGAA
ACAAGTTCGCAGGGCTGAAGAAAGAGATCACATGAAAGGAGGCCAACTTCGAGAAGATGTAACAATTGACCGATGTTGTGAAAAGGAAGTCGAGTGTAGAGATGGGGTTC
AAGATTTTGATAAGAAAAAGCGATCACAGAATCTGGCTACGGAGCTCTGCGCAAAGGAGACACTATTGAGGGAAGGGACTGAAGAAAAATGTGGTTTGTCGAGAGAAGAA
ATTGATGTAGCCGAAGAAGGAGGTCACACACAAAAAAGCCAATTTATAATGATATATGAACTTAAAGAAGTTGGAAATGTTGGGTTGATTAATGGTCATCACGAAGTGAA
AAGTAAGAGTAGAGATGGGGCTGAAGATTTTGGTGGGAAAGAACAACTGCAGGATCTTGTGGTTGTTGGACTCCATGTAGGAGAGGCACCAACAACTAAAGGGACTGAAG
AACAATGCAGTTCATTAAGAGAAAAAATTGATGGGACTGAAGGAGATGGTGCACAAGTAGGCCAAACTGAAGTGCTAGATAAACTCAAAGAAGTTCAAGAAGTTGAAATG
ATTAAAAACTATCATGAGGAGGAAGGACAAAGAGTAGTGATGGAAGAACCAACTGAGAGACCATACGGGGGATCAAACGTAGAAGAGTTACCTAGTGAAGAAGCCACTTT
GGAGTTCTTTGATGCTATGCCTAACCATGCCAATGCTGAAGAAACTTTACTGATTGATGCTCAATGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTTTTTG
ATGCAAAGTCTGACCACGGCAATGATGAGGCGAATGGAATGATTGATGACCAATCCTCGAAGGGGAAGGTACCAAGCGAAGTGAATGATCAACAAAATAGACTGCAAGAA
CAACTGCTATCAAAGCATCAGACCGATACACTGAAGCATGCAGAGCAAGGGGCACCTAGCACATCTGAGGCACATATTATTTGTGGTGACAAGAGTCAACCTGGTCGGGG
ACGTCGTGGTCGAGGGAGACCTCGAAAGTTGAATGTACAAGAAACTTTGGCAGCTTCATTTTCTTCATCTGCTCAAGATGAGCAGCGGCTCCAGAAGCGGTCGAGAGGGA
GGGGGAGGGGGAGGGGGAGAGGTCGGGGAAGGGCTCGAGTAGCTCAAGACCAGATTTCATTGTCAGACACGTTCTCACCTTCCAAGTATTTGCATCACCAGCAATCTTCT
GAAAAGAGACGCCCTGGAAGGCCTCCAAAACGAAAATTTGATGAATATGTTTCACCTAAGGACATCTCGGCTTCTTTAGAGAATGAGCAGCCAGAAAGGAAATGCCATGG
CCGTGGTCGTGGTCGTGGAAGACCTCCCAGAGAGAGAGAGCAAGAAAACTAACCATTCCATTATCACAATATGAATTAGTGTCTAGCTAGGCAGTTCTTAACATCTGATT
TTGTTGTGTTTTACTGAGTTAGCCTCTTTGCTTTTTACCATCACACCTCAGTAAAATTTCTAGTGCAACTCGTGAATCTAGTTTTTCAAGTCATGTAATTACAGGTAATA
CAGAGTATTAACAGCAAGAAGTGATGGCTAGTTACCCTTACAGACATGGTAAGTCTCTTCTTCCTCATTGTTTTACATGATCAAGTTAGAACACATTTTTTATTTCTTTG
CCAATTACTTATATTCTTAATAACATTTTGTTATTGCAGTTCTAGATCTATCCTATTTGTCTAAGAATAGGGCTTGAACAATTTTTCTCTATGCATGAATTTAATT
Protein sequenceShow/hide protein sequence
MVEMKRVNSVVSRPDVSDQRHSLVSRKLRDDLFSAITTKYATDGSNPSLPFPSEKLKSDVERRLHEYLPSFHTPTHLPYASMIQRAIAEVGEEDGLSEESISDFIVNEYE
DLPWAHAALLRRHLGKLCESGELVKSKCGRYNFEVEDKGVKRKRRRRKSVGRSRYRGVESTDDIEEDLDGKEQSKKLKIIGPRAVEVVTSKGTEEQNGLLREEIGGVEDG
DQAEGGQVEGLGELKEVQEDEMIDECLEEEIKINDGPEDFDWQMQSQNLVVLGLCAPGNIKEIEQQSGSLGKQVRRAEERDHMKGGQLREDVTIDRCCEKEVECRDGVQD
FDKKKRSQNLATELCAKETLLREGTEEKCGLSREEIDVAEEGGHTQKSQFIMIYELKEVGNVGLINGHHEVKSKSRDGAEDFGGKEQLQDLVVVGLHVGEAPTTKGTEEQ
CSSLREKIDGTEGDGAQVGQTEVLDKLKEVQEVEMIKNYHEEEGQRVVMEEPTERPYGGSNVEELPSEEATLEFFDAMPNHANAEETLLIDAQCCKKLREENENLEFFDA
KSDHGNDEANGMIDDQSSKGKVPSEVNDQQNRLQEQLLSKHQTDTLKHAEQGAPSTSEAHIICGDKSQPGRGRRGRGRPRKLNVQETLAASFSSSAQDEQRLQKRSRGRG
RGRGRGRGRARVAQDQISLSDTFSPSKYLHHQQSSEKRRPGRPPKRKFDEYVSPKDISASLENEQPERKCHGRGRGRGRPPREREQEN