; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G006570 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G006570
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionH15 domain-containing protein
Genome locationchr09:7345339..7350116
RNA-Seq ExpressionLsi09G006570
SyntenyLsi09G006570
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR017956 - AT hook, DNA-binding motif
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046320.1 transcription regulatory protein SNF2-like isoform X3 [Cucumis melo var. makuwa]0.0e+0069.52Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        ME S   LSSIRPPPEN SSPSS  PHSDHRHSL+AGR RDALFSAVAAKYSTN + HSLPF S+QFKSVIDC L ENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRY+FKVE KGVKRKKRRRK+ GR+R REVESADEIEE FDRKKRSKKL +
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
        IGPR EEVVTSKG+EEQSD  REV VG E+VDH   GQVV++E ++V+ DEM+DKQHGEK KH YG KVF+R  +SR +VI+GLHAP+A KE+++QSG F
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDR--
        GEEV E EEGDH KGGQIQV GEVNEVQADV+I QPCEKEVKSR G QDFD+KKQSQNVAAGNLGAQEALTMT N+EK GS REEI GAKERGYDQDR  
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDR--

Query:  LVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKV-DGAEENRPQAGQTEALGKFIEVQEVMIDEHH
        ++IYELKEV              N  D VEDFGGRKQ QDLMVVGLHAKEAL TKGTE++CSS RK V DG E    QAGQ E L KF EVQ  MIDEH 
Subjt:  LVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKV-DGAEENRPQAGQTEALGKFIEVQEVMIDEHH

Query:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID
        EEE+QGE MEEPKE                                                 RAS+GS  E  P EEATLEFFDA+S HSNAEENGVID
Subjt:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID

Query:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS
        DAEGCKKL EENEN EFFDAKSD+GYDG NEII AQSSK TVLGEVSNKQNRLE+QRPSK SD+QT I  G EAED QL+KEH QVRWPSEITG+L KHS
Subjt:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS

Query:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD-----PDMGDGT-----------HIDQQRLKLPRGRGRG
        + EM RTS+AD NE S  L P DIIC PS P GHRG+GRPRKLKVQE L TSLSS A+D D      ++ DG            HIDQQ L LPRGRGRG
Subjt:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD-----PDMGDGT-----------HIDQQRLKLPRGRGRG

Query:  RGRPRVVRQDQISASETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHK-RRGRGSGTGRPSGGRKQERELFDNQ
        RGR RVVRQDQ S S+  SPSKHL+H+QSP K RGRP K+ FDE  VSKDI T L+N+ QE K   GRG G G  S GR +ER  FDNQ
Subjt:  RGRPRVVRQDQISASETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHK-RRGRGSGTGRPSGGRKQERELFDNQ

KAG7016763.1 hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-30066.45Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQPHLS+I  PPEN   PSS+TPHSDHR+SL+AGRFRDALFSA AAKY+TN S HSLPF SEQFKSVI+C LHENFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIEEDF+R KRSKKL I
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
         GPRAEEVVTSKG++EQ++ LREVI+GAED DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  K SR +VIIGL APVAIKEI RQS   
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQPCEKEVKSR  +QD DE +QSQ V A NLG QEAL MTG + KCGS REEIGG            
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE
           L EV KV +I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+AL TKGTE+QCSSLRK VDGAE +  QAGQTE LG F   QEV MIDEHHEE
Subjt:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE

Query:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA
        ERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFDA+ N  +A+ENGVI DA
Subjt:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA

Query:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------
        +GC+KL+EENE+LEFFDAKSD+G + ANEI  AQ+SKG VLGEV NKQNRLE+QR SKVSD+QTGISKG EAE+ QLS +HP+VRWPSEITG+       
Subjt:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------

Query:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH
                  PKHSE  +R TS+AD NE S  L   D+IC P S PRGHRGRGRP KLK+QET  TSLSS A D D               PDM  D  H
Subjt:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH

Query:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--------RP
        IDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HL HQQSP KR RGRPPK+KFDE TVSKDI T L+N+ QE K RGRG G G        RP
Subjt:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--------RP

Query:  SGGRKQERELFD
        S GRK+E+E FD
Subjt:  SGGRKQERELFD

XP_022938936.1 uncharacterized protein LOC111444998 isoform X1 [Cucurbita moschata]3.9e-30065.97Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQPHLS+I  PPEN   PSS+TPHSDHR+SL+AGRFRDALFSA AAKY+TN S HSLPF SEQFKSVI+C LH+NFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIEEDF+R KRSKKL I
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
         GP AE VVTSKG++EQ++ LREVI+GAED DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  KKSR +VIIGLHAPVAIKEI +QS   
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQPCEKEVKSR  +QD DEK+QSQ V A NLG QEAL MTG + KCGSSREEIGG            
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE
           L E+ KV +I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+AL TKGTE+QCSSLRK VDGAE +  QAGQTE LG F   QEV MIDEHHEE
Subjt:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE

Query:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA
        ERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA
Subjt:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA

Query:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------
        +GC+KL+EENE+LEFFDAKSD+G + ANEI  AQ+SKG VLGEV NKQN LE+QR SKVSD+QTGISKG EAE+ QLS +HP+VRWPSEITG+       
Subjt:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------

Query:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH
                  PKHSE  +R TS+AD NE S  LL  D+IC P S PRGHRGRGRP KLK+QET  TSLSS A D D               PDM  D  H
Subjt:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH

Query:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----------
        IDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HL HQ SP KR RGRPPK+KFDE TVSKDILT L+N+ QE K RG G G G          
Subjt:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----------

Query:  RPSGGRKQERELFD
        RPS GRK+E+E FD
Subjt:  RPSGGRKQERELFD

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]1.3e-30366.48Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQPHLS+I  PPEN   PSS+TPHSDHR+SL+ GRFRDALFSA AAKY+TN S HSLPF SEQFKSVI+C LH+NFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIEEDFDR KRSKKL I
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
         GPRAEEVVTSKG++EQ++ LREVIVGAED DHA  GQVVLDELEE QEDEMIDK H E+IK+KY    F+  KKSR +VIIGLHAPVAIKEI++QS   
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQPCEKEVKSR  +QD DEK+QSQ VAA NLGAQEAL M G + KCGSSREEIGG            
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE
           L EV KV +I+D H+ E    D  EDFG  KQ QD+MVVGLHAK+AL  KGTE+QCSSLRK VDGAE +  QAGQTE LG F   QEV MIDEHHEE
Subjt:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE

Query:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA
        ERQGEMMEEPKE                                                 RAS GS EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA
Subjt:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA

Query:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------
        +GC+KL+EENE+LEFFDAKSD+G + ANEI  AQ+SKG VLGEV NKQNRLE+QR SKVSD+QTGISKG EAE+ QLS +HP+VRWPSEITG+       
Subjt:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------

Query:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH
                  PKHSE  +R TS+AD NE S  +L  D+IC P S PRGHRGRGRP KLK+QET  TSLSS A D D               PDM  D  H
Subjt:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH

Query:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG------RPSG
        IDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS++LHHQQSP KR RGRPPK+KFDE TVSKDI T ++N+ QE K RGRG G G      RPS 
Subjt:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG------RPSG

Query:  GRKQERELFD
        GRK+E+E FD
Subjt:  GRKQERELFD

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]0.0e+0077.46Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQP  SSI PPP   S PSSMTP SDHRHSLVAGRFRDALFSAVAAKYSTN S HS PFHSEQFKSV+DC +HENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
         G+EDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRY+FKVE  GVKRKKRRRKSAGRNRRRE+ESADEIEEDFDRKKRSKKLMI
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
        IGPR EEVVTSKG EEQSDLLREVIVGA DVDHAQGGQVVLDEL+E+QEDEMIDK+HGEKIK  YGPK F   K+S K+VIIGL APVAI EI++QSG  
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        GEEV EAE+G+  KGGQIQV GEVNEVQADV+I QPCEKEVKSRD VQDFDE+KQSQNVAAGNLGAQEALTMT N EKCGS REEI GAKER  DQDR V
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  --IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHH
          IY+LKEV KVG+I+D+HE E N RDG+EDFGG KQ QDL+VVGLH KEALTTKGTE+QCSSLRKKVDGAE N  QAGQTEALGKF EV EV MID+HH
Subjt:  --IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHH

Query:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID
        EEERQGEMMEEP E                                                 R SMGS EE  PGEEA LEFFDA SNHSN EENGVI 
Subjt:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID

Query:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS
        DAEGCKKL+EENENLEFFDA+SD+  D  NEII AQSSK  VLGEVSN+QNRLE++RPSKVSDNQTGI KGREAED QLSKEHPQVRWPSEITG+ PKHS
Subjt:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS

Query:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRG-HRGRGRPRKLKVQETLVTSLSSSAQDRDPDMGDGT-HIDQQRLKLPRGRGRGRGRPRVVRQDQISA
        E EM RT +AD NENS+ LLPADII GPSHP G H GRGRPR LKVQETL TSL +SAQD DPDMGDGT HIDQQRLKLPRGRGRGRGRPR+VRQDQIS 
Subjt:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRG-HRGRGRPRKLKVQETLVTSLSSSAQDRDPDMGDGT-HIDQQRLKLPRGRGRGRGRPRVVRQDQISA

Query:  SETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTGRPSGGRKQERELFDNQ
        SE FSPSKH HHQQSP KR GRPPK+KF+E T SK I TSL+NE QEH+  GRG GTGRPS  RK+E+  FDNQ
Subjt:  SETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTGRPSGGRKQERELFDNQ

TrEMBL top hitse value%identityAlignment
A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X30.0e+0069.52Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        ME S   LSSIRPPPEN SSPSS  PHSDHRHSL+AGR RDALFSAVAAKYSTN + HSLPF S+QFKSVIDC L ENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRY+FKVE KGVKRKKRRRK+ GR+R REVESADEIEE FDRKKRSKKL +
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
        IGPR EEVVTSKG+EEQSD  REV VG E+VDH   GQVV++E ++V+ DEM+DKQHGEK KH YG KVF+R  +SR +VI+GLHAP+A KE+++QSG F
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDR--
        GEEV E EEGDH KGGQIQV GEVNEVQADV+I QPCEKEVKSR G QDFD+KKQSQNVAAGNLGAQEALTMT N+EK GS REEI GAKERGYDQDR  
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDR--

Query:  LVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKV-DGAEENRPQAGQTEALGKFIEVQEVMIDEHH
        ++IYELKEV              N  D VEDFGGRKQ QDLMVVGLHAKEAL TKGTE++CSS RK V DG E    QAGQ E L KF EVQ  MIDEH 
Subjt:  LVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKV-DGAEENRPQAGQTEALGKFIEVQEVMIDEHH

Query:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID
        EEE+QGE MEEPKE                                                 RAS+GS  E  P EEATLEFFDA+S HSNAEENGVID
Subjt:  EEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVID

Query:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS
        DAEGCKKL EENEN EFFDAKSD+GYDG NEII AQSSK TVLGEVSNKQNRLE+QRPSK SD+QT I  G EAED QL+KEH QVRWPSEITG+L KHS
Subjt:  DAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHS

Query:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD-----PDMGDGT-----------HIDQQRLKLPRGRGRG
        + EM RTS+AD NE S  L P DIIC PS P GHRG+GRPRKLKVQE L TSLSS A+D D      ++ DG            HIDQQ L LPRGRGRG
Subjt:  EIEMRRTSKADTNENSNVLLPADIICGPSHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD-----PDMGDGT-----------HIDQQRLKLPRGRGRG

Query:  RGRPRVVRQDQISASETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHK-RRGRGSGTGRPSGGRKQERELFDNQ
        RGR RVVRQDQ S S+  SPSKHL+H+QSP K RGRP K+ FDE  VSKDI T L+N+ QE K   GRG G G  S GR +ER  FDNQ
Subjt:  RGRPRVVRQDQISASETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHK-RRGRGSGTGRPSGGRKQERELFDNQ

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.9e-30065.97Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQPHLS+I  PPEN   PSS+TPHSDHR+SL+AGRFRDALFSA AAKY+TN S HSLPF SEQFKSVI+C LH+NFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIEEDF+R KRSKKL I
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
         GP AE VVTSKG++EQ++ LREVI+GAED DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  KKSR +VIIGLHAPVAIKEI +QS   
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQPCEKEVKSR  +QD DEK+QSQ V A NLG QEAL MTG + KCGSSREEIGG            
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE
           L E+ KV +I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+AL TKGTE+QCSSLRK VDGAE +  QAGQTE LG F   QEV MIDEHHEE
Subjt:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE

Query:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA
        ERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA
Subjt:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA

Query:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------
        +GC+KL+EENE+LEFFDAKSD+G + ANEI  AQ+SKG VLGEV NKQN LE+QR SKVSD+QTGISKG EAE+ QLS +HP+VRWPSEITG+       
Subjt:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITGS-------

Query:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH
                  PKHSE  +R TS+AD NE S  LL  D+IC P S PRGHRGRGRP KLK+QET  TSLSS A D D               PDM  D  H
Subjt:  ---------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH

Query:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----------
        IDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HL HQ SP KR RGRPPK+KFDE TVSKDILT L+N+ QE K RG G G G          
Subjt:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----------

Query:  RPSGGRKQERELFD
        RPS GRK+E+E FD
Subjt:  RPSGGRKQERELFD

A0A6J1FFG2 eukaryotic translation initiation factor 5B-like isoform X22.4e-25564.36Show/hide
Query:  MIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRK
        MIQ+AIAE+GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIEEDF+R 
Subjt:  MIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRK

Query:  KRSKKLMIIGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKE
        KRSKKL I GP AE VVTSKG++EQ++ LREVI+GAED DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  KKSR +VIIGLHAPVAIKE
Subjt:  KRSKKLMIIGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKE

Query:  IKRQSGLFGEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKER
        I +QS   G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQPCEKEVKSR  +QD DEK+QSQ V A NLG QEAL MTG + KCGSSREEIGG    
Subjt:  IKRQSGLFGEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKER

Query:  GYDQDRLVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-
                   L E+ KV +I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+AL TKGTE+QCSSLRK VDGAE +  QAGQTE LG F   QEV 
Subjt:  GYDQDRLVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-

Query:  MIDEHHEEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAE
        MIDEHHEEERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFDA+ N  +A+
Subjt:  MIDEHHEEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAE

Query:  ENGVIDDAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG
        ENGV+ DA+GC+KL+EENE+LEFFDAKSD+G + ANEI  AQ+SKG VLGEV NKQN LE+QR SKVSD+QTGISKG EAE+ QLS +HP+VRWPSEITG
Subjt:  ENGVIDDAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG

Query:  S----------------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------P
        +                 PKHSE  +R TS+AD NE S  LL  D+IC P S PRGHRGRGRP KLK+QET  TSLSS A D D               P
Subjt:  S----------------LPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------P

Query:  DM-GDGTHIDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--
        DM  D  HIDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HL HQ SP KR RGRPPK+KFDE TVSKDILT L+N+ QE K RG G G G  
Subjt:  DM-GDGTHIDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--

Query:  --------RPSGGRKQERELFD
                RPS GRK+E+E FD
Subjt:  --------RPSGGRKQERELFD

A0A6J1JZB4 uncharacterized protein LOC111489634 isoform X22.2e-25364.58Show/hide
Query:  MIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRK
        MIQ+AIAE+GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIE D DR 
Subjt:  MIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRK

Query:  KRSKKLMIIGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKE
        KRSKKL I GP AEEVVTSKG +E++D L EVIVGAED DHA  GQV+LDELEEVQEDEMIDK H E+IK+KYG   F+  KKSR +VIIGLHAPVAIK 
Subjt:  KRSKKLMIIGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKE

Query:  IKRQSGLFGEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKER
        I++QS   G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQ CEK+VKSR  +QD DE +QSQ VAA NLGAQEAL MTG + KCG SREEIGG    
Subjt:  IKRQSGLFGEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKER

Query:  GYDQDRLVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-
                   L +V KVG+I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+ALTTKGTE+QCSSLRK V GAE    QAGQTE LG F   QEV 
Subjt:  GYDQDRLVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-

Query:  MIDEHHEEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAE
        MIDEHHEEERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFD + N  +A+
Subjt:  MIDEHHEEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAE

Query:  ENGVIDDAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG
        ENGVI DA+GC+KL+EENE+LEFFDAKSD+G + A EI  AQ+SKG VLGEV NKQNRLE+QR SKVSD+QT ISKG EAE+HQLS +HP+VRWPSEITG
Subjt:  ENGVIDDAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG

Query:  ----------------SLPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------P
                        + PKHSE  +  TS+AD NENS  LL  D+IC P S P+GHRGRGRP KLK+QET  TSLSS A D D               P
Subjt:  ----------------SLPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------P

Query:  DM-GDGTHIDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--
        DM  D  HIDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HLHHQQSP KR RGRPPK+KFDE TVSKDI T L+N+ QE K RGRG G G  
Subjt:  DM-GDGTHIDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG--

Query:  --RPSGGRKQERELFD
          RPS GRK+E+E  D
Subjt:  --RPSGGRKQERELFD

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X17.9e-29966.3Show/hide
Query:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE
        MENSQPHLS+I  PPEN   PSS+TPHSDHR+SL+AGRFRDALFSA AAKY+TN S HSLPF SEQFKSVI+C LHENFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAE

Query:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI
        +GEEDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+Y+FKVE K VKRKKRRRKSAGR+RRREVES DEIE D DR KRSKKL I
Subjt:  LGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMI

Query:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF
         GP AEEVVTSKG +E++D L EVIVGAED DHA  GQV+LDELEEVQEDEMIDK H E+IK+KYG   F+  KKSR +VIIGLHAPVAIK I++QS   
Subjt:  IGPRAEEVVTSKGNEEQSDLLREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLF

Query:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV
        G +VHEAEEGDH KGGQIQVLG+V EVQADV+IDQ CEK+VKSR  +QD DE +QSQ VAA NLGAQEAL MTG + KCG SREEIGG            
Subjt:  GEEVHEAEEGDHGKGGQIQVLGEVNEVQADVIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLV

Query:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE
           L +V KVG+I+D H+ E    D  EDFG  KQ QDLMVVGLHAK+ALTTKGTE+QCSSLRK V GAE    QAGQTE LG F   QEV MIDEHHEE
Subjt:  IYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLMVVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEV-MIDEHHEE

Query:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA
        ERQGEMMEEPKE                                                 RAS  S EE+ PGEEATL+FFD + N  +A+ENGVI DA
Subjt:  ERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAEESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDA

Query:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG--------
        +GC+KL+EENE+LEFFDAKSD+G + A EI  AQ+SKG VLGEV NKQNRLE+QR SKVSD+QT ISKG EAE+HQLS +HP+VRWPSEITG        
Subjt:  EGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSDNQTGISKGREAEDHQLSKEHPQVRWPSEITG--------

Query:  --------SLPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH
                + PKHSE  +  TS+AD NENS  LL  D+IC P S P+GHRGRGRP KLK+QET  TSLSS A D D               PDM  D  H
Subjt:  --------SLPKHSEIEMRRTSKADTNENSNVLLPADIICGP-SHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRD---------------PDM-GDGTH

Query:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----RPSGGR
        IDQQ+LKLP  RGRGRGRGRPR++RQD IS  ETFSPS+HLHHQQSP KR RGRPPK+KFDE TVSKDI T L+N+ QE K RGRG G G    RPS GR
Subjt:  IDQQRLKLP--RGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKR-RGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTG----RPSGGR

Query:  KQERELFD
        K+E+E  D
Subjt:  KQERELFD

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A4.0e-0543.86Show/hide
Query:  PYASMIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV
        PY  MI  AI  L ++ G ++ +IS++I  +Y  LP AH + L  HL ++ ESGELV
Subjt:  PYASMIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein2.1e-0941.9Show/hide
Query:  QTPTHLPYASMIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYSFKVEVKGVKRKKRRRKS-AGRNRRREVE
        +TP H  Y++MI  AI +L +E G SE++ISEFI ++Y++LP+AH   L  HL KL E  E L   N   YS   E K V     +RKS     R  +  
Subjt:  QTPTHLPYASMIQRAIAELGEEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYSFKVEVKGVKRKKRRRKS-AGRNRRREVE

Query:  SADEI
        +ADE+
Subjt:  SADEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACTCCCAACCCCATCTCTCTTCCATTCGACCGCCGCCGGAAAATTTCTCGTCTCCCTCTTCTATGACACCGCATTCCGATCACCGGCATTCACTCGTAGCCGG
AAGGTTCAGAGACGCCCTCTTCTCCGCCGTCGCCGCCAAATATTCGACCAATGCCAGCGGCCACTCTTTGCCTTTCCACTCCGAGCAGTTCAAGTCCGTTATTGATTGCT
TCCTTCACGAGAATTTCCCCTCCTTCCAGACTCCTACACATCTTCCCTATGCCTCGATGATACAGAGGGCAATAGCTGAATTGGGAGAGGAAGATGGGTTGAGTGAAGAG
TCAATATCAGAGTTTATCGTGAATGAGTATGAGGACTTGCCATGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGGAAGCTCTGTGAAAGTGGGGAGCTAGTGAAATC
GAACTGTGGGAGGTATAGCTTTAAGGTGGAGGTTAAGGGAGTAAAGAGGAAGAAGCGGAGGAGGAAGTCGGCAGGAAGAAATCGGCGCCGAGAAGTGGAGAGTGCTGATG
AGATAGAAGAGGATTTTGATAGGAAAAAGCGATCAAAGAAATTGATGATCATAGGACCCCGTGCGGAGGAGGTGGTAACAAGTAAAGGGAATGAAGAACAAAGTGATTTG
TTGAGGGAAGTAATTGTTGGGGCTGAAGATGTTGATCATGCTCAAGGAGGTCAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCAACA
TGGAGAGAAAATCAAGCATAAATATGGGCCTAAAGTTTTTGATCGGAACAAGAAATCACGAAAAATGGTGATTATAGGTCTTCATGCGCCAGTAGCTATTAAGGAGATTA
AAAGACAAAGTGGTTTATTTGGGGAAGAAGTTCATGAAGCTGAAGAAGGAGATCACGGGAAAGGCGGCCAAATTCAAGTGCTTGGTGAAGTTAATGAAGTTCAAGCAGAT
GTAATAATTGACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTGTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTTGCGGCTGGAAATCTCGGTGCACA
GGAGGCATTAACAATGACAGGGAATAAAGAAAAATGTGGTTCGTCGAGAGAAGAAATTGGTGGAGCCAAAGAAAGAGGTTATGACCAAGACAGGCTAGTGATATATGAAC
TTAAAGAAGTTGGTAAAGTTGGAATCATTAGTGATTATCACGAAGAGGAAGGTAATATTAGAGACGGGGTTGAAGATTTTGGTGGGAGAAAACAACCACAGGATCTAATG
GTTGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGGACTGAAAACCAATGTAGTTCGTTAAGAAAAAAAGTTGATGGGGCTGAAGAAAACCGTCCACAAGCAGG
CCAAACTGAAGCGCTAGGTAAATTCATAGAAGTTCAAGAAGTTATGATTGACGAGCATCATGAAGAGGAAAGGCAAGGAGAAATGATGGAAGAACCAAAAGAGCTCCTTT
CTTCCTCACCCACTGCTGGTGGAGGTGGTGTCCTGCTGCTGTTGTCAGCTGGCTCTGAAACAAAGCCAGACATAGGAATTCATCTGTGCTTGTTTGGAGATGAAGCAGAA
GAATCCTTCTCCTTTATGTTACAGAGAGGGAGAGCATCCATGGGATCAAAAGAAGAAAAGTCGCCTGGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTGTGTCGAACCA
TAGCAATGCTGAAGAAAATGGAGTGATTGATGATGCTGAAGGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTCTTTGACGCAAAGTCTGACAATGGCTATG
ATGGGGCGAATGAAATAATTGATGCCCAATCTTCTAAGGGGACGGTACTAGGTGAAGTTAGCAATAAACAAAATAGACTGGAAAAACAACGACCATCCAAGGTGAGTGAT
AATCAAACAGGAATAAGTAAGGGCCGCGAGGCTGAGGACCATCAACTATCCAAGGAGCATCCTCAAGTTAGATGGCCGTCTGAAATAACTGGAAGTCTACCGAAGCATTC
AGAGATTGAGATGCGTAGGACTTCTAAGGCAGACACAAATGAAAATTCCAACGTGTTATTGCCTGCAGATATTATTTGTGGTCCAAGTCATCCTCGGGGGCATCGTGGTC
GAGGGAGGCCTCGGAAGTTGAAAGTTCAAGAAACTTTGGTGACTTCATTATCTTCATCTGCTCAAGACCGTGACCCAGATATGGGCGATGGCACTCATATTGATCAGCAA
CGACTCAAGCTGCCAAGAGGGAGGGGGAGAGGTCGGGGAAGGCCTCGAGTAGTTAGACAAGACCAGATTTCAGCATCAGAGACGTTCTCACCTTCCAAGCATTTGCATCA
CCAGCAATCTCCTGAAAAGAGACGCGGGAGGCCTCCAAAACGAAAATTTGATGAAGGTACCGTATCGAAGGACATCTTGACTTCTTTAGATAATGAACTGCAAGAACATA
AGCGTCGTGGCCGTGGCAGTGGTACTGGAAGACCTTCTGGAGGAAGAAAGCAAGAAAGGGAATTATTTGATAATCAGTAA
mRNA sequenceShow/hide mRNA sequence
TTTAAATATAGAACGAAAGGGGGTTATCAAAAATAAAATTCACAAGGGAAATCAAAGACCTATGAATAAAGCGTTTGCCCTCTCCAACGACGATGCATCACCGCCGATAT
GGAGAACTCCCAACCCCATCTCTCTTCCATTCGACCGCCGCCGGAAAATTTCTCGTCTCCCTCTTCTATGACACCGCATTCCGATCACCGGCATTCACTCGTAGCCGGAA
GGTTCAGAGACGCCCTCTTCTCCGCCGTCGCCGCCAAATATTCGACCAATGCCAGCGGCCACTCTTTGCCTTTCCACTCCGAGCAGTTCAAGTCCGTTATTGATTGCTTC
CTTCACGAGAATTTCCCCTCCTTCCAGACTCCTACACATCTTCCCTATGCCTCGATGATACAGAGGGCAATAGCTGAATTGGGAGAGGAAGATGGGTTGAGTGAAGAGTC
AATATCAGAGTTTATCGTGAATGAGTATGAGGACTTGCCATGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGGAAGCTCTGTGAAAGTGGGGAGCTAGTGAAATCGA
ACTGTGGGAGGTATAGCTTTAAGGTGGAGGTTAAGGGAGTAAAGAGGAAGAAGCGGAGGAGGAAGTCGGCAGGAAGAAATCGGCGCCGAGAAGTGGAGAGTGCTGATGAG
ATAGAAGAGGATTTTGATAGGAAAAAGCGATCAAAGAAATTGATGATCATAGGACCCCGTGCGGAGGAGGTGGTAACAAGTAAAGGGAATGAAGAACAAAGTGATTTGTT
GAGGGAAGTAATTGTTGGGGCTGAAGATGTTGATCATGCTCAAGGAGGTCAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCAACATG
GAGAGAAAATCAAGCATAAATATGGGCCTAAAGTTTTTGATCGGAACAAGAAATCACGAAAAATGGTGATTATAGGTCTTCATGCGCCAGTAGCTATTAAGGAGATTAAA
AGACAAAGTGGTTTATTTGGGGAAGAAGTTCATGAAGCTGAAGAAGGAGATCACGGGAAAGGCGGCCAAATTCAAGTGCTTGGTGAAGTTAATGAAGTTCAAGCAGATGT
AATAATTGACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTGTTCAAGATTTTGATGAGAAAAAGCAATCACAGAATGTTGCGGCTGGAAATCTCGGTGCACAGG
AGGCATTAACAATGACAGGGAATAAAGAAAAATGTGGTTCGTCGAGAGAAGAAATTGGTGGAGCCAAAGAAAGAGGTTATGACCAAGACAGGCTAGTGATATATGAACTT
AAAGAAGTTGGTAAAGTTGGAATCATTAGTGATTATCACGAAGAGGAAGGTAATATTAGAGACGGGGTTGAAGATTTTGGTGGGAGAAAACAACCACAGGATCTAATGGT
TGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGGACTGAAAACCAATGTAGTTCGTTAAGAAAAAAAGTTGATGGGGCTGAAGAAAACCGTCCACAAGCAGGCC
AAACTGAAGCGCTAGGTAAATTCATAGAAGTTCAAGAAGTTATGATTGACGAGCATCATGAAGAGGAAAGGCAAGGAGAAATGATGGAAGAACCAAAAGAGCTCCTTTCT
TCCTCACCCACTGCTGGTGGAGGTGGTGTCCTGCTGCTGTTGTCAGCTGGCTCTGAAACAAAGCCAGACATAGGAATTCATCTGTGCTTGTTTGGAGATGAAGCAGAAGA
ATCCTTCTCCTTTATGTTACAGAGAGGGAGAGCATCCATGGGATCAAAAGAAGAAAAGTCGCCTGGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTGTGTCGAACCATA
GCAATGCTGAAGAAAATGGAGTGATTGATGATGCTGAAGGTTGCAAGAAGTTACGAGAGGAAAATGAAAATTTGGAGTTCTTTGACGCAAAGTCTGACAATGGCTATGAT
GGGGCGAATGAAATAATTGATGCCCAATCTTCTAAGGGGACGGTACTAGGTGAAGTTAGCAATAAACAAAATAGACTGGAAAAACAACGACCATCCAAGGTGAGTGATAA
TCAAACAGGAATAAGTAAGGGCCGCGAGGCTGAGGACCATCAACTATCCAAGGAGCATCCTCAAGTTAGATGGCCGTCTGAAATAACTGGAAGTCTACCGAAGCATTCAG
AGATTGAGATGCGTAGGACTTCTAAGGCAGACACAAATGAAAATTCCAACGTGTTATTGCCTGCAGATATTATTTGTGGTCCAAGTCATCCTCGGGGGCATCGTGGTCGA
GGGAGGCCTCGGAAGTTGAAAGTTCAAGAAACTTTGGTGACTTCATTATCTTCATCTGCTCAAGACCGTGACCCAGATATGGGCGATGGCACTCATATTGATCAGCAACG
ACTCAAGCTGCCAAGAGGGAGGGGGAGAGGTCGGGGAAGGCCTCGAGTAGTTAGACAAGACCAGATTTCAGCATCAGAGACGTTCTCACCTTCCAAGCATTTGCATCACC
AGCAATCTCCTGAAAAGAGACGCGGGAGGCCTCCAAAACGAAAATTTGATGAAGGTACCGTATCGAAGGACATCTTGACTTCTTTAGATAATGAACTGCAAGAACATAAG
CGTCGTGGCCGTGGCAGTGGTACTGGAAGACCTTCTGGAGGAAGAAAGCAAGAAAGGGAATTATTTGATAATCAGTAATCACAATATGTTCAAGTGCTAACTATATTAGT
TAGTATCTAGGAAAATTTAACATCTGATTTTGTTATGTGTTTCACTTAGTTTGCCTCTTTCCTTTTATCATCACTCCTTAATAAAATTTCCACTTCAACTCTTGAAGCTA
GTTTTTCAAGTTATGTAATTACAGGTATTATAGGAGGGGAATAAGAGCAAGAAGTTGCAAGATCTCTTCCCTGCATGACTCGCAGCTTTTCCATCCAATCTCATGTCTAT
CCTTATGATTGGCGACTCCGTCTTTGATACCGGAGACAACCATTTCATCAGAAAACACCCCTGTTCAGTCAGTTTTTCCAACTTTTTGTGGATTAAGCTTCTTCACAACC
CAACTGGTAGGTTCTCAGCAATGGCAGCACATTAGTTGAATTCAATTGTAAGTTGCAGAGTAGGGTTACATGATCAAATACATCCAATTCAAACACATTTTTAATTTCTC
TACTTTTAATAACAGTTGTGTTACTGCAGGTTTAGGTCCATGCTCTGTTGTCTCCAGAATAGGGCTTAAAAGGTAGGTAGGATATTACAATCTTGTGTTTCTTAAGATTC
ATTGATTATTTTGTGACATTAGTCGAGGTACATGTGCTTAAGTTCACACCTGTGACAAAATATACTTCGATTGATTTTACAAAAACTAACCACCGACGTCGATTGATCAA
TCCTCAACAGATTGTTGGTTGATTGATGTGCACACCTTGGCCCTCACAAAGTACTTTTTCCCTCCAAGCAACCAAACGAGAGTACTTTTTCCGTATTCCTTTTCCCCTAT
TAAGTATCCTTCTCCACCACCATCTCATACCAACTAAAAACAAATTCATATCCCTCCTTTCTCCAACTTTTACTCATTTTAGAGTAGTTAGGAACACAAACCAAATGATG
AATTGAAGAATCGATAGATATGGTTGAAGAATTTTCAATTAGTTAGGTGTATCAAAAGAATTTTTAGTAGTTATAATATGTTTTATTCCTAAATTTAATGCACATAATAT
GAATATAATTTTGGAAAATTTTCACAGATAGAAAAAATATCAAACTATTTAAAGAAAACAGCAAAAAAA
Protein sequenceShow/hide protein sequence
MENSQPHLSSIRPPPENFSSPSSMTPHSDHRHSLVAGRFRDALFSAVAAKYSTNASGHSLPFHSEQFKSVIDCFLHENFPSFQTPTHLPYASMIQRAIAELGEEDGLSEE
SISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYSFKVEVKGVKRKKRRRKSAGRNRRREVESADEIEEDFDRKKRSKKLMIIGPRAEEVVTSKGNEEQSDL
LREVIVGAEDVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGPKVFDRNKKSRKMVIIGLHAPVAIKEIKRQSGLFGEEVHEAEEGDHGKGGQIQVLGEVNEVQAD
VIIDQPCEKEVKSRDGVQDFDEKKQSQNVAAGNLGAQEALTMTGNKEKCGSSREEIGGAKERGYDQDRLVIYELKEVGKVGIISDYHEEEGNIRDGVEDFGGRKQPQDLM
VVGLHAKEALTTKGTENQCSSLRKKVDGAEENRPQAGQTEALGKFIEVQEVMIDEHHEEERQGEMMEEPKELLSSSPTAGGGGVLLLLSAGSETKPDIGIHLCLFGDEAE
ESFSFMLQRGRASMGSKEEKSPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLREENENLEFFDAKSDNGYDGANEIIDAQSSKGTVLGEVSNKQNRLEKQRPSKVSD
NQTGISKGREAEDHQLSKEHPQVRWPSEITGSLPKHSEIEMRRTSKADTNENSNVLLPADIICGPSHPRGHRGRGRPRKLKVQETLVTSLSSSAQDRDPDMGDGTHIDQQ
RLKLPRGRGRGRGRPRVVRQDQISASETFSPSKHLHHQQSPEKRRGRPPKRKFDEGTVSKDILTSLDNELQEHKRRGRGSGTGRPSGGRKQERELFDNQ