; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G06540 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G06540
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionH15 domain-containing protein
Genome locationClcChr01:6475829..6480305
RNA-Seq ExpressionClc01G06540
SyntenyClc01G06540
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR017956 - AT hook, DNA-binding motif
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046320.1 transcription regulatory protein SNF2-like isoform X3 [Cucumis melo var. makuwa]0.0e+0074.53Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        ME S  QLSSIRP PENLSSPS    HSDHRHSL+AGR RDALFSA+AAKYSTNG+ HSLPF S+QFKSVIDCRL ENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRYNFK+E KGVKRKKRRRK+ GRSR REVESADEIEE FDRKKRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKG+EEQS   REV VG E VDH   GQVV++E ++V+ DEM+DKQ+GEK KH YG KVF+RK Q RNLVI+GL AP+A+KE+E+Q GS 
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV E EEG+  +GGQIQV GE NEVQADVMIHQPCEKEVKSR G QDFD KKQSQNVAA NLGAQE LTMT NEEK GS REEI GAKERGYDQDRQ 
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH
        IMIYELK              EVN  D V+DFGGRKQSQDLMVVGLHAKEAL TKGTED+CSS R  V DG EG  AQA + EVL KFKEVQ VEMID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN
         E+E+ GE MEEP ERA + S  E  P EEATLEFFDA+S HSNAEENGVIDDAEGCKKLLEENENFEFFDA+SDHGYDG  EIIGAQSS  T+LGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN

Query:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET
        KQN+LEEQRPSK SDDQT IR G EAED QL+KEH QVRWPSEITGTL KHS+QEM RTSE+DKNE SEAL P DIIC PS P GHRG+GRP+KLKV E 
Subjt:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET

Query:  LATSLSSSAQDG---------------DPDMGDGSHHIDQQRLKLPRGRGKP----RIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A+DG               D +   G+HHIDQQ L LPRGRG+     R+VRQDQ S S+  SPSKHL+H+QSP K RGRP KQ FDED VS
Subjt:  LATSLSSSAQDG---------------DPDMGDGSHHIDQQRLKLPRGRGKP----RIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  KDFST
        KD ST
Subjt:  KDFST

KAE8648608.1 hypothetical protein Csa_009414 [Cucumis sativus]2.9e-29468.21Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIA
        ME SQ QLSSI P P+NL+S S    HSDHRHSL+AGRFRDALFSA+AAKYS  NG+ HSLPF S+QFKSVIDCR++ENFPSF+TPTHLPYASMI RAIA
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIA

Query:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM
        EVG EDGLSEESIS FI+NEY+DLPWAH A+LRRHLGKLCE+GE+VK  CGRYNFK+E KGVKRKK RRKS GRSR REV+SADEIEEDFDRKKRSKKL+
Subjt:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM

Query:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS
         IGP  EEVVTS GTEEQS L REV VG E VDH   GQ+V+ E +EV+ DEM+DKQ+GEK KHK+G KVF+ KKQ +NLVI+GLRAP+A+KEIE+Q GS
Subjt:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS

Query:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        +GEEV E E +  +GGQIQV GE +EVQ DV+IHQ CEKEVKSRDG QDFD KKQSQNV A N+GAQE  TM  NEEK GS REEI GAKE GYD DRQV
Subjt:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH
        IMIYELK              EVN RDGV+DFGGRKQSQDL++VGLH+KEAL TKGTEDQCSS R  V DG EG  AQA +TEVL KFKEVQ VE+ID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEE---------------------------------------KTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLL
         E+E+ GE MEEP E   + +                                          + P EEATLEFFDA+S HSNAEEN VIDDAEGCKKLL
Subjt:  HEDERLGEMMEEPIERAFMESKEE---------------------------------------KTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLL

Query:  EENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSE
        EENEN EFFDA+SDHGYDG  E IGAQSS   ILGEVSNKQN+LEE RPSK SDDQT+ RKG EAED QL+KEH QVRWPS+IT TL KHS+Q  PR +E
Subjt:  EENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSE

Query:  SDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLPRGR----GKPRIVRQ
        +DKNE SEAL P DIIC PS P GHRGRGRP+KLKV ETLATSLSS A DGD               P+ GDG+ HIDQQ L LPRGR    G+PR+VRQ
Subjt:  SDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLPRGR----GKPRIVRQ

Query:  DQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSK
        DQ S S T SPSKHL+H+QSP KRRGRP +QKFDEDTVS+
Subjt:  DQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSK

XP_011655461.2 uncharacterized protein LOC105435537 isoform X1 [Cucumis sativus]4.6e-30071.91Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIA
        ME SQ QLSSI P P+NL+S S    HSDHRHSL+AGRFRDALFSA+AAKYS  NG+ HSLPF S+QFKSVIDCR++ENFPSF+TPTHLPYASMI RAIA
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIA

Query:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM
        EVG EDGLSEESIS FI+NEY+DLPWAH A+LRRHLGKLCE+GE+VK  CGRYNFK+E KGVKRKK RRKS GRSR REV+SADEIEEDFDRKKRSKKL+
Subjt:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM

Query:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS
         IGP  EEVVTS GTEEQS L REV VG E VDH   GQ+V+ E +EV+ DEM+DKQ+GEK KHK+G KVF+ KKQ +NLVI+GLRAP+A+KEIE+Q GS
Subjt:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS

Query:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        +GEEV E E +  +GGQIQV GE +EVQ DV+IHQ CEKEVKSRDG QDFD KKQSQNV A N+GAQE  TM  NEEK GS REEI GAKE GYD DRQV
Subjt:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH
        IMIYELK              EVN RDGV+DFGGRKQSQDL++VGLH+KEAL TKGTEDQCSS R  V DG EG  AQA +TEVL KFKEVQ VE+ID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN
         E+E+ GE MEEP ERA + S  +  P EEATLEFFDA+S HSNAEEN VIDDAEGCKKLLEENEN EFFDA+SDHGYDG  E IGAQSS   ILGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN

Query:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET
        KQN+LEE RPSK SDDQT+ RKG EAED QL+KEH QVRWPS+IT TL KHS+Q  PR +E+DKNE SEAL P DIIC PS P GHRGRGRP+KLKV ET
Subjt:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET

Query:  LATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLPRGR----GKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A DGD               P+ GDG+ HIDQQ L LPRGR    G+PR+VRQDQ S S T SPSKHL+H+QSP KRRGRP +QKFDEDTVS
Subjt:  LATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLPRGR----GKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  K
        +
Subjt:  K

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]8.6e-29169.17Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+ GRFRDALFSA AAKY+TNGS HSLPF SEQFKSVI+C L++NFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFK+EGK VKRKKRRRKSAGRSRRREVES DEIEEDFDR KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AEEVVTSKG++EQ+  LREVIVGAE  DHA  GQVVLDELEE QEDEMIDK + E+IK+KY    F+  K+ RNLVIIGL APVA KEIE+Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ VAAANLGAQE L M   E KCGSSREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH
             L EV KV MIND H+VE  S D  +DFG  KQSQD+MVVGLHAK+AL  KGTEDQCSSLR  VDGAEG+  QA +TEVL  FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK
        E+ER GEMMEEP ERA   S EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA+GC+KL EENE+ EFFDA+SDHG + A EI GAQ+S G +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK

Query:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR
        QN+LEEQR SKVSDDQT I KG EAE+ QLS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEA+L  D+IC P S PR
Subjt:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR

Query:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GHRGRGRP KLK+ ET ATSLSS A D D               PDM   +HHIDQQ+LKLP      RGRG+PRI+RQD ISV ET SPS++LHHQQSP
Subjt:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFST
         KR RGRPPKQKFDEDTVSKD ST
Subjt:  EKR-RGRPPKQKFDEDTVSKDFST

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]0.0e+0082.43Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        MENSQPQ SSI P P  LS PS MT  SDHRHSLVAGRFRDALFSA+AAKYSTNGS HS PFHSEQFKSV+DCR++ENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
         G EDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFK+EG GVKRKKRRRKSAGR+RRRE+ESADEIEEDFDRKKRSKKLM 
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKGTEEQS LLREVIVGA  VDHAQGGQVVLDEL+E+QEDEMIDK++GEKIK  YG K F  KKQ   LVIIGL APVA  EIE+Q GS+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV EAE+GE  +GGQIQV GE NEVQADVMIHQPCEKEVKSRD VQDFD +KQSQNVAA NLGAQE LTMTRN EKCGS REEI GAKER  DQDRQV
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH
        I IY+LKEV KVGMINDHHEVEVNSRDG++DFGG KQSQDL+VVGLH KEALTTKGTEDQCSSLR KVDGAEGN AQA +TE L KFKEV EVEMIDKHH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK
        E+ER GEMMEEPIER  M S EE  PGEEA LEFFDA SNHSN EENGVI DAEGCKKL EENEN EFFDARSDH  D   EIIGAQSS   +LGEVSN+
Subjt:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK

Query:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRG-HRGRGRPQKLKVHET
        QN+LEE+RPSKVSD+QT IRKG EAED QLSKEHPQVRWPSEITGT PKHSEQEM RT E+DKNENS+ALLP DII GPSHP G H GRGRP+ LKV ET
Subjt:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRG-HRGRGRPQKLKVHET

Query:  LATSLSSSAQDGDPDMGDGSHHIDQQRLKLP----RGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSKDFSTS
        LATSL +SAQD DPDMGDG+HHIDQQRLKLP    RGRG+PRIVRQDQISVSE  SPSKH HHQQSP KR GRPPKQKF+EDT SK  STS
Subjt:  LATSLSSSAQDGDPDMGDGSHHIDQQRLKLP----RGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSKDFSTS

TrEMBL top hitse value%identityAlignment
A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X30.0e+0074.53Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        ME S  QLSSIRP PENLSSPS    HSDHRHSL+AGR RDALFSA+AAKYSTNG+ HSLPF S+QFKSVIDCRL ENFPSFQTPTHLPYASMIQRAIAE
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRYNFK+E KGVKRKKRRRK+ GRSR REVESADEIEE FDRKKRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKG+EEQS   REV VG E VDH   GQVV++E ++V+ DEM+DKQ+GEK KH YG KVF+RK Q RNLVI+GL AP+A+KE+E+Q GS 
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV E EEG+  +GGQIQV GE NEVQADVMIHQPCEKEVKSR G QDFD KKQSQNVAA NLGAQE LTMT NEEK GS REEI GAKERGYDQDRQ 
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH
        IMIYELK              EVN  D V+DFGGRKQSQDLMVVGLHAKEAL TKGTED+CSS R  V DG EG  AQA + EVL KFKEVQ VEMID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADRTEVLAKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN
         E+E+ GE MEEP ERA + S  E  P EEATLEFFDA+S HSNAEENGVIDDAEGCKKLLEENENFEFFDA+SDHGYDG  EIIGAQSS  T+LGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSN

Query:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET
        KQN+LEEQRPSK SDDQT IR G EAED QL+KEH QVRWPSEITGTL KHS+QEM RTSE+DKNE SEAL P DIIC PS P GHRG+GRP+KLKV E 
Subjt:  KQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESDKNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHET

Query:  LATSLSSSAQDG---------------DPDMGDGSHHIDQQRLKLPRGRGKP----RIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A+DG               D +   G+HHIDQQ L LPRGRG+     R+VRQDQ S S+  SPSKHL+H+QSP K RGRP KQ FDED VS
Subjt:  LATSLSSSAQDG---------------DPDMGDGSHHIDQQRLKLPRGRGKP----RIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  KDFST
        KD ST
Subjt:  KDFST

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.9e-28868.81Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+AGRFRDALFSA AAKY+TNGS HSLPF SEQFKSVI+C L++NFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        +G EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFK+EGK VKRKKRRRKSAGRSRRREVES DEIEEDF+R KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AE VVTSKG++EQ+  LREVI+GAE  DHA  G+VVLDELEEVQEDEMIDK + E+IK+KYG   F+  K+ RNLVIIGL APVA KEI +Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ V AANLG QE L MT  E KCGSSREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH
             L E+ KV MIND H+VE  S D  +DFG  KQSQDLMVVGLHAK+AL TKGTEDQCSSLR  VDGAEG+  QA +TEVL  FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK
        E+ER GEMMEEP ERA   S EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA+GC+KL EENE+ EFFDA+SDHG + A EI GAQ+S G +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK

Query:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR
        QN LEEQR SKVSDDQT I KG EAE+ QLS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEALL  D+IC P S PR
Subjt:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR

Query:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GHRGRGRP KLK+ ET ATSLSS A D D               PDM   +HHIDQQ+LKLP      RGRG+PRI+RQD ISV ET SPS+HL HQ SP
Subjt:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFST
         KR RGRPPKQKFDEDTVSKD  T
Subjt:  EKR-RGRPPKQKFDEDTVSKDFST

A0A6J1FFG2 eukaryotic translation initiation factor 5B-like isoform X21.4e-24667.9Show/hide
Query:  MIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK
        MIQ+AIAE+G EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFK+EGK VKRKKRRRKSAGRSRRREVES DEIEEDF+R 
Subjt:  MIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK

Query:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE
        KRSKKL   GP AE VVTSKG++EQ+  LREVI+GAE  DHA  G+VVLDELEEVQEDEMIDK + E+IK+KYG   F+  K+ RNLVIIGL APVA KE
Subjt:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE

Query:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKER
        I +Q  S+G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ V AANLG QE L MT  E KCGSSREEIGG    
Subjt:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKER

Query:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQE
                     L E+ KV MIND H+VE  S D  +DFG  KQSQDLMVVGLHAK+AL TKGTEDQCSSLR  VDGAEG+  QA +TEVL  FK  QE
Subjt:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQE

Query:  VEMIDKHHEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGT
        VEMID+HHE+ER GEMMEEP ERA   S EE+ PGEEATL+FFDA+ N  +A+ENGV+ DA+GC+KL EENE+ EFFDA+SDHG + A EI GAQ+S G 
Subjt:  VEMIDKHHEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGT

Query:  ILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDII
        +LGEV NKQN LEEQR SKVSDDQT I KG EAE+ QLS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEALL  D+I
Subjt:  ILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGT----------------LPKHSEQEMPRTSESDKNENSEALLPVDII

Query:  CGP-SHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSK
        C P S PRGHRGRGRP KLK+ ET ATSLSS A D D               PDM   +HHIDQQ+LKLP      RGRG+PRI+RQD ISV ET SPS+
Subjt:  CGP-SHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSK

Query:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFST
        HL HQ SP KR RGRPPKQKFDEDTVSKD  T
Subjt:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFST

A0A6J1JZB4 uncharacterized protein LOC111489634 isoform X21.2e-24868.58Show/hide
Query:  MIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK
        MIQ+AIAEVG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFK+EGK VKRKKRRRKSAGRSRRREVES DEIE D DR 
Subjt:  MIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK

Query:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE
        KRSKKL   GP AEEVVTSKGT+E++  L EVIVGAE  DHA  GQV+LDELEEVQEDEMIDK + E+IK+KYG   F+  K+ RNLVIIGL APVA K 
Subjt:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE

Query:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKER
        IE+Q  S+G +VHEAEEG+  +GGQIQVLG+  EVQADVMI Q CEK+VKSR  +QD D  +QSQ VAAANLGAQE L MT  E KCG SREEIGG    
Subjt:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKER

Query:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQE
                     L +V KVGMIND H+VEV S D  +DFG  KQSQDLMVVGLHAK+ALTTKGTEDQCSSLR  V GAEG   QA +TEVL  FK  QE
Subjt:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQE

Query:  VEMIDKHHEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGT
        VEMID+HHE+ER GEMMEEP ERA   S EE+ PGEEATL+FFD + N  +A+ENGVI DA+GC+KL EENE+ EFFDA+SDHG + A EI GAQ+S G 
Subjt:  VEMIDKHHEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGT

Query:  ILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITG----------------TLPKHSEQEMPRTSESDKNENSEALLPVDII
        +LGEV NKQN+LEEQR SKVSDDQTRI KG EAE+ QLS +HP+VRWPSEITG                T PKHSEQ +  TSE+DKNENSEALL  D+I
Subjt:  ILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITG----------------TLPKHSEQEMPRTSESDKNENSEALLPVDII

Query:  CGP-SHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSK
        C P S P+GHRGRGRP KLK+ ET ATSLSS A D D               PDM   +HHIDQQ+LKLP      RGRG+PRI+RQD ISV ET SPS+
Subjt:  CGP-SHPRGHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSK

Query:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFST
        HLHHQQSP KR RGRPPKQKFDEDTVSKD ST
Subjt:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFST

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X17.1e-29169.54Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+AGRFRDALFSA AAKY+TNGS HSLPF SEQFKSVI+C L+ENFPSF+TPTHLPYASMIQ+AIAE
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFK+EGK VKRKKRRRKSAGRSRRREVES DEIE D DR KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AEEVVTSKGT+E++  L EVIVGAE  DHA  GQV+LDELEEVQEDEMIDK + E+IK+KYG   F+  K+ RNLVIIGL APVA K IE+Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI Q CEK+VKSR  +QD D  +QSQ VAAANLGAQE L MT  E KCG SREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH
             L +V KVGMIND H+VEV S D  +DFG  KQSQDLMVVGLHAK+ALTTKGTEDQCSSLR  V GAEG   QA +TEVL  FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK
        E+ER GEMMEEP ERA   S EE+ PGEEATL+FFD + N  +A+ENGVI DA+GC+KL EENE+ EFFDA+SDHG + A EI GAQ+S G +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDDAEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNK

Query:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITG----------------TLPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR
        QN+LEEQR SKVSDDQTRI KG EAE+ QLS +HP+VRWPSEITG                T PKHSEQ +  TSE+DKNENSEALL  D+IC P S P+
Subjt:  QNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITG----------------TLPKHSEQEMPRTSESDKNENSEALLPVDIICGP-SHPR

Query:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GHRGRGRP KLK+ ET ATSLSS A D D               PDM   +HHIDQQ+LKLP      RGRG+PRI+RQD ISV ET SPS+HLHHQQSP
Subjt:  GHRGRGRPQKLKVHETLATSLSSSAQDGD---------------PDMGDGSHHIDQQRLKLP------RGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFST
         KR RGRPPKQKFDEDTVSKD ST
Subjt:  EKR-RGRPPKQKFDEDTVSKDFST

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A1.0e-0442.11Show/hide
Query:  PYASMIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV
        PY  MI  AI  +  + G ++ +IS++I  +Y  LP AH + L  HL ++ ESGELV
Subjt:  PYASMIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein1.6e-0840Show/hide
Query:  QTPTHLPYASMIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYNFKLEGKGVKRKKRRRKS-AGRSRRREVE
        +TP H  Y++MI  AI ++  E G SE++ISEFI ++Y++LP+AH   L  HL KL E  E L   N   Y+   E K V     +RKS     R  +  
Subjt:  QTPTHLPYASMIQRAIAEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYNFKLEGKGVKRKKRRRKS-AGRSRRREVE

Query:  SADEI
        +ADE+
Subjt:  SADEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACTCCCAACCTCAACTCTCTTCCATTCGACCGCTGCCGGAAAATCTCTCGTCTCCCTCTTATATGACATCGCATTCCGATCACCGACATTCCCTCGTAGCCGG
AAGGTTTAGAGACGCCCTCTTCTCCGCCCTCGCCGCCAAATATTCGACCAATGGCAGCGGCCACTCTTTGCCCTTCCACTCCGAGCAGTTCAAGTCCGTTATCGATTGCC
GCCTTTATGAGAATTTCCCTTCCTTCCAAACTCCTACGCATCTTCCCTATGCCTCGATGATACAGAGGGCAATAGCTGAAGTGGGAGGGGAAGATGGATTGAGCGAGGAG
TCAATATCAGAGTTTATCGTGAATGAGTATGAGGACTTGCCATGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGAAAGCTCTGTGAAAGTGGGGAGCTAGTAAAATC
GAACTGTGGGAGGTATAACTTTAAGTTGGAGGGTAAGGGAGTGAAGAGGAAGAAGCGAAGGAGGAAGTCAGCGGGAAGGAGTCGGCGCCGAGAAGTGGAGAGTGCTGATG
AGATAGAAGAGGATTTTGATAGGAAAAAGCGATCAAAGAAATTGATGACCATAGGACCCGGTGCGGAGGAGGTGGTAACAAGTAAAGGGACTGAAGAACAAAGTGCTTTG
TTGAGAGAAGTAATTGTTGGGGCTGAAGGTGTAGATCATGCTCAAGGAGGCCAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCAGTA
TGGAGAGAAAATCAAACATAAATATGGGCGTAAAGTTTTTGATCGGAAAAAGCAATTGCGGAATCTGGTGATTATAGGCCTTCGTGCACCAGTAGCTAGTAAGGAGATTG
AAAGACAAAGGGGTTCAATGGGGGAAGAAGTTCATGAAGCTGAAGAAGGAGAGCGCGGAGGCCAAATTCAAGTGCTTGGTGAAGCTAATGAAGTTCAAGCAGATGTAATG
ATTCACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGCGTTCAAGATTTTGATGGGAAAAAGCAATCACAGAATGTGGCGGCTGCAAATCTTGGTGCACAGGAGAC
ATTAACAATGACAAGGAATGAAGAAAAATGTGGTTCGTCAAGAGAAGAAATTGGTGGAGCCAAAGAAAGAGGTTATGATCAAGACAGGCAAGTTATAATGATATATGAAC
TTAAAGAAGTTGGAAAAGTTGGAATGATTAATGATCATCACGAAGTGGAAGTTAATAGTAGAGATGGGGTTAAAGATTTTGGTGGGAGAAAACAATCACAGGATCTGATG
GTTGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGGACTGAAGACCAATGTAGTTCGTTAAGAACAAAAGTCGATGGGGCTGAAGGAAACCGTGCACAAGCTGA
CCGAACTGAAGTGCTTGCTAAATTCAAAGAAGTTCAAGAAGTTGAAATGATTGACAAGCATCATGAAGATGAAAGGCTTGGAGAAATGATGGAAGAACCAATAGAGAGAG
CATTCATGGAATCAAAAGAAGAAAAGACGCCCGGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTGTGTCAAACCATAGCAATGCTGAAGAAAATGGAGTTATTGATGAT
GCTGAAGGTTGCAAGAAGTTACTAGAGGAAAATGAAAATTTTGAGTTCTTTGACGCAAGGTCTGACCATGGCTATGATGGGGCGAAAGAAATAATTGGTGCTCAATCTTC
GAATGGGACAATACTAGGTGAAGTTAGCAATAAACAAAATAAACTGGAAGAACAACGACCATCCAAGGTGAGTGATGATCAAACTAGAATAAGGAAGGGCCACGAGGCTG
AGGATCGTCAACTGTCCAAGGAGCATCCTCAAGTTAGATGGCCGTCTGAAATAACTGGAACTCTACCGAAGCATTCAGAGCAAGAGATGCCTAGGACTTCCGAGTCTGAC
AAAAATGAAAATTCCGAGGCATTATTGCCTGTAGATATTATTTGTGGTCCAAGTCATCCTCGGGGGCATCGTGGTCGAGGGAGGCCTCAAAAGTTGAAAGTTCACGAAAC
TTTGGCAACTTCATTATCTTCATCTGCTCAAGACGGTGACCCAGATATGGGTGATGGCAGTCACCATATTGATCAGCAACGACTCAAGCTGCCGAGAGGTCGGGGAAAGC
CTCGAATAGTTAGACAAGACCAGATTTCAGTGTCAGAGACGTTGTCACCTTCCAAGCATTTGCATCACCAGCAATCTCCAGAAAAGAGACGTGGGAGGCCTCCAAAACAA
AAATTTGATGAAGATACTGTATCGAAGGACTTCTCGACTTCGTGA
mRNA sequenceShow/hide mRNA sequence
AAGGGAAATCAAAGACCATATATGAATAAAGCGTTTGCCCTCTCCAACGACGATGCATCGCCGCCGATATGGAGAACTCCCAACCTCAACTCTCTTCCATTCGACCGCTG
CCGGAAAATCTCTCGTCTCCCTCTTATATGACATCGCATTCCGATCACCGACATTCCCTCGTAGCCGGAAGGTTTAGAGACGCCCTCTTCTCCGCCCTCGCCGCCAAATA
TTCGACCAATGGCAGCGGCCACTCTTTGCCCTTCCACTCCGAGCAGTTCAAGTCCGTTATCGATTGCCGCCTTTATGAGAATTTCCCTTCCTTCCAAACTCCTACGCATC
TTCCCTATGCCTCGATGATACAGAGGGCAATAGCTGAAGTGGGAGGGGAAGATGGATTGAGCGAGGAGTCAATATCAGAGTTTATCGTGAATGAGTATGAGGACTTGCCA
TGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGAAAGCTCTGTGAAAGTGGGGAGCTAGTAAAATCGAACTGTGGGAGGTATAACTTTAAGTTGGAGGGTAAGGGAGT
GAAGAGGAAGAAGCGAAGGAGGAAGTCAGCGGGAAGGAGTCGGCGCCGAGAAGTGGAGAGTGCTGATGAGATAGAAGAGGATTTTGATAGGAAAAAGCGATCAAAGAAAT
TGATGACCATAGGACCCGGTGCGGAGGAGGTGGTAACAAGTAAAGGGACTGAAGAACAAAGTGCTTTGTTGAGAGAAGTAATTGTTGGGGCTGAAGGTGTAGATCATGCT
CAAGGAGGCCAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCAGTATGGAGAGAAAATCAAACATAAATATGGGCGTAAAGTTTTTGA
TCGGAAAAAGCAATTGCGGAATCTGGTGATTATAGGCCTTCGTGCACCAGTAGCTAGTAAGGAGATTGAAAGACAAAGGGGTTCAATGGGGGAAGAAGTTCATGAAGCTG
AAGAAGGAGAGCGCGGAGGCCAAATTCAAGTGCTTGGTGAAGCTAATGAAGTTCAAGCAGATGTAATGATTCACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGC
GTTCAAGATTTTGATGGGAAAAAGCAATCACAGAATGTGGCGGCTGCAAATCTTGGTGCACAGGAGACATTAACAATGACAAGGAATGAAGAAAAATGTGGTTCGTCAAG
AGAAGAAATTGGTGGAGCCAAAGAAAGAGGTTATGATCAAGACAGGCAAGTTATAATGATATATGAACTTAAAGAAGTTGGAAAAGTTGGAATGATTAATGATCATCACG
AAGTGGAAGTTAATAGTAGAGATGGGGTTAAAGATTTTGGTGGGAGAAAACAATCACAGGATCTGATGGTTGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGG
ACTGAAGACCAATGTAGTTCGTTAAGAACAAAAGTCGATGGGGCTGAAGGAAACCGTGCACAAGCTGACCGAACTGAAGTGCTTGCTAAATTCAAAGAAGTTCAAGAAGT
TGAAATGATTGACAAGCATCATGAAGATGAAAGGCTTGGAGAAATGATGGAAGAACCAATAGAGAGAGCATTCATGGAATCAAAAGAAGAAAAGACGCCCGGTGAAGAAG
CCACTTTGGAGTTCTTTGATGCTGTGTCAAACCATAGCAATGCTGAAGAAAATGGAGTTATTGATGATGCTGAAGGTTGCAAGAAGTTACTAGAGGAAAATGAAAATTTT
GAGTTCTTTGACGCAAGGTCTGACCATGGCTATGATGGGGCGAAAGAAATAATTGGTGCTCAATCTTCGAATGGGACAATACTAGGTGAAGTTAGCAATAAACAAAATAA
ACTGGAAGAACAACGACCATCCAAGGTGAGTGATGATCAAACTAGAATAAGGAAGGGCCACGAGGCTGAGGATCGTCAACTGTCCAAGGAGCATCCTCAAGTTAGATGGC
CGTCTGAAATAACTGGAACTCTACCGAAGCATTCAGAGCAAGAGATGCCTAGGACTTCCGAGTCTGACAAAAATGAAAATTCCGAGGCATTATTGCCTGTAGATATTATT
TGTGGTCCAAGTCATCCTCGGGGGCATCGTGGTCGAGGGAGGCCTCAAAAGTTGAAAGTTCACGAAACTTTGGCAACTTCATTATCTTCATCTGCTCAAGACGGTGACCC
AGATATGGGTGATGGCAGTCACCATATTGATCAGCAACGACTCAAGCTGCCGAGAGGTCGGGGAAAGCCTCGAATAGTTAGACAAGACCAGATTTCAGTGTCAGAGACGT
TGTCACCTTCCAAGCATTTGCATCACCAGCAATCTCCAGAAAAGAGACGTGGGAGGCCTCCAAAACAAAAATTTGATGAAGATACTGTATCGAAGGACTTCTCGACTTCG
TGAGAGAATGGACAACAAGAACATAAGGGCCGTGGTACTGGAAGGCATTCCAGAGGAAGAAAGCAAAAAAGGGAATCATTTGATAATCAGTAATCACCATATGTTCAAGT
GCTAACTATATTAATTAGTATCTAGGAAAAGTTAACGTCTGATTTTGTTGCGTGTCTTTCCTTTTATGATCACACGTTAATAAAATTTCCACTTAACTCTTGAAGCTGGT
TTTTCAAGTTATGTAATTACAGGTATTATAG
Protein sequenceShow/hide protein sequence
MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLPFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIAEVGGEDGLSEE
SISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKLEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMTIGPGAEEVVTSKGTEEQSAL
LREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQYGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSMGEEVHEAEEGERGGQIQVLGEANEVQADVM
IHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTRNEEKCGSSREEIGGAKERGYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDGVKDFGGRKQSQDLM
VVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADRTEVLAKFKEVQEVEMIDKHHEDERLGEMMEEPIERAFMESKEEKTPGEEATLEFFDAVSNHSNAEENGVIDD
AEGCKKLLEENENFEFFDARSDHGYDGAKEIIGAQSSNGTILGEVSNKQNKLEEQRPSKVSDDQTRIRKGHEAEDRQLSKEHPQVRWPSEITGTLPKHSEQEMPRTSESD
KNENSEALLPVDIICGPSHPRGHRGRGRPQKLKVHETLATSLSSSAQDGDPDMGDGSHHIDQQRLKLPRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQ
KFDEDTVSKDFSTS