; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G006510 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G006510
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionH15 domain-containing protein
Genome locationCicolChr01:6796166..6804495
RNA-Seq ExpressionCcUC01G006510
SyntenyCcUC01G006510
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005818 - Linker histone H1/H5, domain H15
IPR017956 - AT hook, DNA-binding motif
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046320.1 transcription regulatory protein SNF2-like isoform X3 [Cucumis melo var. makuwa]0.0e+0073.21Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        ME S  QLSSIRP PENLSSPS    HSDHRHSL+AGR RDALFSA+AAKYSTNG+ HSL F S+QFKSVIDCRL ENFPSFQTPTHLPYASMIQRAI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRYNFKVE KGVKRKKRRRK+ GRSR REVESADEIEE FDRKKRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKG+EEQS   REV VG E VDH   GQVV++E ++V+ DEM+DKQHGEK KH YG KVF+RK Q RNLVI+GL AP+A+KE+E+Q GS 
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV E EEG+  +GGQIQV GE NEVQADVMIHQPCEKEVKSR G QDFD KKQSQNVAA NLGAQE LTMT NEEK GS REEI GAKERGYDQDRQ 
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH
        IMIYELK              EVN  D VEDFGGRKQSQDLMVVGLHAKEAL TKGTED+CSS R  V DG EG  AQA Q EVL KFKEVQ VEMID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN
         E+E+ GE MEEP ERA + S   + P EEATLEFFDA+S HSNAEENGVIDDAE CKKLLEENENFEFFD +SDHGYDG  EIIGAQSSK T+LGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN

Query:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET
        KQN+LEEQ+ SK SDDQT IR G EAED +L+KEH QVRWPSEITGT+ KHS+QEM RTSE+DKNE SEAL P DIIC PS P GH G+GRP+KLKV E 
Subjt:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET

Query:  LATSLSSSAQDG---------------DPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A+DG               D +   GTHHI QQ L LPRGRGRGRG+ R+VRQDQ S S+  SPSKHL+H+QSP K RGRP KQ FDED VS
Subjt:  LATSLSSSAQDG---------------DPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  KDFSTSLENGQQEHK-----GRGTGRPSRGRKQKKG
        KD ST LEN  QE K     G G G  S GR +++G
Subjt:  KDFSTSLENGQQEHK-----GRGTGRPSRGRKQKKG

XP_011655461.2 uncharacterized protein LOC105435537 isoform X1 [Cucumis sativus]4.4e-29871.66Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIV
        ME SQ QLSSI P P+NL+S S    HSDHRHSL+AGRFRDALFSA+AAKYS  NG+ HSL F S+QFKSVIDCR++ENFPSF+TPTHLPYASMI RAI 
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYS-TNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIV

Query:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM
        EVG EDGLSEESIS FI+NEY+DLPWAH A+LRRHLGKLCE+GE+VK  CGRYNFKVE KGVKRKK RRKS GRSR REV+SADEIEEDFDRKKRSKKL+
Subjt:  EVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLM

Query:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS
         IGP  EEVVTS GTEEQS L REV VG E VDH   GQ+V+ E +EV+ DEM+DKQHGEK KHK+G KVF+ KKQ +NLVI+GLRAP+A+KEIE+Q GS
Subjt:  TIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGS

Query:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        +GEEV E E +  +GGQIQV GE +EVQ DV+IHQ CEKEVKSRDG QDFD KKQSQNV A N+GAQE  TM  NEEK GS REEI GAKE GYD DRQV
Subjt:  MGEEVHEAE-EGERGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH
        IMIYELK              EVN RD VEDFGGRKQSQDL++VGLH+KEAL TKGTEDQCSS R  V DG EG  AQA QTEVL KFKEVQ VE+ID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN
         E+E+ GE MEEP ERA + S   + P EEATLEFFDA+S HSNAEEN VIDDAE CKKLLEENEN EFFD +SDHGYDG  E IGAQSSK  ILGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN

Query:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET
        KQN+LEE+  SK SDDQT+ RKG EAED +L+KEH QVRWPS+IT T+ KHS+Q  PR +E+DKNE SEAL P DIIC PS P GH GRGRP+KLKV ET
Subjt:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET

Query:  LATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A DGD               P+ GDGT HI QQ L LPRGRGRG G+PR+VRQDQ S S T SPSKHL+H+QSP KRRGRP +QKFDEDTVS
Subjt:  LATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  K
        +
Subjt:  K

XP_022993719.1 uncharacterized protein LOC111489634 isoform X1 [Cucurbita maxima]4.4e-29869.31Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+AGRFRDALFSA AAKY+TNGS HSL F SEQFKSVI+C L+ENFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIE D DR KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AEEVVTSKGT+E++  L EVIVGAE  DHA  GQV+LDELEEVQEDEMIDK H E+IK+KYG   F+  K+ RNLVIIGL APVA K IE+Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI Q CEK+VKSR  +QD D  +QSQ VAAANLGAQE L MTG E KCG SREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH
             L +V KVGMIND H+VEV S DR EDFG  KQSQDLMVVGLHAK+ALTTKGTEDQCSSLR  V GAEG   QA QTEVLG FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK
        E+ER GEMMEEP ERA   S E + PGEEATL+FFD + N  +A+ENGVI DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK

Query:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR
        QN+LEEQ++SK+SDDQTRI KG EAE+ +LS +HP+VRWPSEITG                T PKHSEQ +  TSE+DKNENSEALL  D+IC P S P+
Subjt:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR

Query:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS+HLHHQQSP
Subjt:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK
         KR RGRPPKQKFDEDTVSKD ST LEN QQE KGRG G        RPSRGRK++K
Subjt:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK

XP_023549578.1 uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-29868.8Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+ GRFRDALFSA AAKY+TNGS HSL F SEQFKSVI+C L++NFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIEEDFDR KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AEEVVTSKG++EQ+  LREVIVGAE  DHA  GQVVLDELEE QEDEMIDK H E+IK+KY    F+  K+ RNLVIIGL APVA KEIE+Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ VAAANLGAQE L M G E KCGSSREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH
             L EV KV MIND H+VE  S DR EDFG  KQSQD+MVVGLHAK+AL  KGTEDQCSSLR  VDGAEG+  QA QTEVLG FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK
        E+ER GEMMEEP ERA   S E + PGEEATL+FFDA+ N  +A+ENGV+ DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK

Query:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR
        QN+LEEQ++SK+SDDQT I KG EAE+ +LS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEA+L  D+IC P S PR
Subjt:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR

Query:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS++LHHQQSP
Subjt:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG----------RPSRGRKQKK
         KR RGRPPKQKFDEDTVSKD ST +EN QQE KGRG G          RPSRGRK++K
Subjt:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG----------RPSRGRKQKK

XP_038907055.1 uncharacterized protein LOC120092885 [Benincasa hispida]0.0e+0081.83Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        MENSQPQ SSI P P  LS PS MT  SDHRHSLVAGRFRDALFSA+AAKYSTNGS HS  FHSEQFKSV+DCR++ENFPSFQTPTHLPYASMIQRAI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
         G EDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEG GVKRKKRRRKSAGR+RRRE+ESADEIEEDFDRKKRSKKLM 
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKGTEEQS LLREVIVGA  VDHAQGGQVVLDEL+E+QEDEMIDK+HGEKIK  YG K F  KKQ   LVIIGL APVA  EIE+Q GS+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV EAE+GE  +GGQIQV GE NEVQADVMIHQPCEKEVKSRD VQDFD +KQSQNVAA NLGAQE LTMT N EKCGS REEI GAKER  DQDRQV
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH
        I IY+LKEV KVGMINDHHEVEVNSRD +EDFGG KQSQDL+VVGLH KEALTTKGTEDQCSSLR KVDGAEGN AQA QTE LGKFKEV EVEMIDKHH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK
        E+ER GEMMEEPIER  M S E   PGEEA LEFFDA SNHSN EENGVI DAE CKKL EENEN EFFD RSDH  D   EIIGAQSSK  +LGEVSN+
Subjt:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK

Query:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRG-HLGRGRPQKLKVHET
        QN+LEE++ SK+SD+QT IRKG EAED +LSKEHPQVRWPSEITGT PKHSEQEM RT E+DKNENS+ALLPADII GPSHP G H GRGRP+ LKV ET
Subjt:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRG-HLGRGRPQKLKVHET

Query:  LATSLSSSAQDGDPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSKDFSTSLENGQQEH-
        LATSL +SAQD DPD+GDGTHHI QQRLKLPRGRGRGRG+PRIVRQDQISVSE  SPSKH HHQQSP KR GRPPKQKF+EDT SK  STSLEN QQEH 
Subjt:  LATSLSSSAQDGDPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSKDFSTSLENGQQEH-

Query:  ---KGRGTGRPSRGRKQKKG
           +GRGTGRPSR RK++KG
Subjt:  ---KGRGTGRPSRGRKQKKG

TrEMBL top hitse value%identityAlignment
A0A5D3E3L6 Transcription regulatory protein SNF2-like isoform X30.0e+0073.21Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        ME S  QLSSIRP PENLSSPS    HSDHRHSL+AGR RDALFSA+AAKYSTNG+ HSL F S+QFKSVIDCRL ENFPSFQTPTHLPYASMIQRAI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEESISEFIVNEYEDLPWAH A+LRRHLGKLCE+GELVK  CGRYNFKVE KGVKRKKRRRK+ GRSR REVESADEIEE FDRKKRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
        IGP  EEVVTSKG+EEQS   REV VG E VDH   GQVV++E ++V+ DEM+DKQHGEK KH YG KVF+RK Q RNLVI+GL AP+A+KE+E+Q GS 
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        GEEV E EEG+  +GGQIQV GE NEVQADVMIHQPCEKEVKSR G QDFD KKQSQNVAA NLGAQE LTMT NEEK GS REEI GAKERGYDQDRQ 
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH
        IMIYELK              EVN  D VEDFGGRKQSQDLMVVGLHAKEAL TKGTED+CSS R  V DG EG  AQA Q EVL KFKEVQ VEMID+H
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKV-DGAEGNRAQADQTEVLGKFKEVQEVEMIDKH

Query:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN
         E+E+ GE MEEP ERA + S   + P EEATLEFFDA+S HSNAEENGVIDDAE CKKLLEENENFEFFD +SDHGYDG  EIIGAQSSK T+LGEVSN
Subjt:  HEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSN

Query:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET
        KQN+LEEQ+ SK SDDQT IR G EAED +L+KEH QVRWPSEITGT+ KHS+QEM RTSE+DKNE SEAL P DIIC PS P GH G+GRP+KLKV E 
Subjt:  KQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHET

Query:  LATSLSSSAQDG---------------DPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS
        LATSLSS A+DG               D +   GTHHI QQ L LPRGRGRGRG+ R+VRQDQ S S+  SPSKHL+H+QSP K RGRP KQ FDED VS
Subjt:  LATSLSSSAQDG---------------DPDIGDGTHHIGQQRLKLPRGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVS

Query:  KDFSTSLENGQQEHK-----GRGTGRPSRGRKQKKG
        KD ST LEN  QE K     G G G  S GR +++G
Subjt:  KDFSTSLENGQQEHK-----GRGTGRPSRGRKQKKG

A0A6J1FEI4 uncharacterized protein LOC111444998 isoform X11.7e-29568.13Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+AGRFRDALFSA AAKY+TNGS HSL F SEQFKSVI+C L++NFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        +G EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIEEDF+R KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AE VVTSKG++EQ+  LREVI+GAE  DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  K+ RNLVIIGL APVA KEI +Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ V AANLG QE L MTG E KCGSSREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH
             L E+ KV MIND H+VE  S DR EDFG  KQSQDLMVVGLHAK+AL TKGTEDQCSSLR  VDGAEG+  QA QTEVLG FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK
        E+ER GEMMEEP ERA   S E + PGEEATL+FFDA+ N  +A+ENGV+ DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK

Query:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR
        QN LEEQ++SK+SDDQT I KG EAE+ +LS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEALL  D+IC P S PR
Subjt:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR

Query:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS+HL HQ SP
Subjt:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------------RPSRGRKQKK
         KR RGRPPKQKFDEDTVSKD  T LEN QQE KGRG G              RPSRGRK++K
Subjt:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------------RPSRGRKQKK

A0A6J1FFG2 eukaryotic translation initiation factor 5B-like isoform X28.5e-25567.32Show/hide
Query:  MIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK
        MIQ+AI E+G EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIEEDF+R 
Subjt:  MIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK

Query:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE
        KRSKKL   GP AE VVTSKG++EQ+  LREVI+GAE  DHA  G+VVLDELEEVQEDEMIDK H E+IK+KYG   F+  K+ RNLVIIGL APVA KE
Subjt:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE

Query:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKER
        I +Q  S+G +VHEAEEG+  +GGQIQVLG+  EVQADVMI QPCEKEVKSR  +QD D K+QSQ V AANLG QE L MTG E KCGSSREEIGG    
Subjt:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKER

Query:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQE
                     L E+ KV MIND H+VE  S DR EDFG  KQSQDLMVVGLHAK+AL TKGTEDQCSSLR  VDGAEG+  QA QTEVLG FK  QE
Subjt:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQE

Query:  VEMIDKHHEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGT
        VEMID+HHE+ER GEMMEEP ERA   S E + PGEEATL+FFDA+ N  +A+ENGV+ DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG 
Subjt:  VEMIDKHHEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGT

Query:  ILGEVSNKQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADII
        +LGEV NKQN LEEQ++SK+SDDQT I KG EAE+ +LS +HP+VRWPSEITGT                 PKHSEQ +  TSE+DKNE SEALL  D+I
Subjt:  ILGEVSNKQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITGT----------------VPKHSEQEMPRTSESDKNENSEALLPADII

Query:  CGP-SHPRGHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSK
        C P S PRGH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS+
Subjt:  CGP-SHPRGHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSK

Query:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------------RPSRGRKQKK
        HL HQ SP KR RGRPPKQKFDEDTVSKD  T LEN QQE KGRG G              RPSRGRK++K
Subjt:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------------RPSRGRKQKK

A0A6J1JZB4 uncharacterized protein LOC111489634 isoform X21.8e-25768.5Show/hide
Query:  MIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK
        MIQ+AI EVG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIE D DR 
Subjt:  MIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRK

Query:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE
        KRSKKL   GP AEEVVTSKGT+E++  L EVIVGAE  DHA  GQV+LDELEEVQEDEMIDK H E+IK+KYG   F+  K+ RNLVIIGL APVA K 
Subjt:  KRSKKLMTIGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKE

Query:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKER
        IE+Q  S+G +VHEAEEG+  +GGQIQVLG+  EVQADVMI Q CEK+VKSR  +QD D  +QSQ VAAANLGAQE L MTG E KCG SREEIGG    
Subjt:  IERQRGSMGEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKER

Query:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQE
                     L +V KVGMIND H+VEV S DR EDFG  KQSQDLMVVGLHAK+ALTTKGTEDQCSSLR  V GAEG   QA QTEVLG FK  QE
Subjt:  GYDQDRQVIMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQE

Query:  VEMIDKHHEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGT
        VEMID+HHE+ER GEMMEEP ERA   S E + PGEEATL+FFD + N  +A+ENGVI DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG 
Subjt:  VEMIDKHHEDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGT

Query:  ILGEVSNKQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADII
        +LGEV NKQN+LEEQ++SK+SDDQTRI KG EAE+ +LS +HP+VRWPSEITG                T PKHSEQ +  TSE+DKNENSEALL  D+I
Subjt:  ILGEVSNKQNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADII

Query:  CGP-SHPRGHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSK
        C P S P+GH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS+
Subjt:  CGP-SHPRGHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSK

Query:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK
        HLHHQQSP KR RGRPPKQKFDEDTVSKD ST LEN QQE KGRG G        RPSRGRK++K
Subjt:  HLHHQQSPEKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK

A0A6J1K0W5 uncharacterized protein LOC111489634 isoform X12.1e-29869.31Show/hide
Query:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE
        MENSQP LS+I   PEN   PS +T HSDHR+SL+AGRFRDALFSA AAKY+TNGS HSL F SEQFKSVI+C L+ENFPSF+TPTHLPYASMIQ+AI E
Subjt:  MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVE

Query:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT
        VG EDGLSEE ISEFIVNEY+DLPWAHPAFLRRHLGKLCESGELVKS CG+YNFKVEGK VKRKKRRRKSAGRSRRREVES DEIE D DR KRSKKL  
Subjt:  VGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMT

Query:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM
         GP AEEVVTSKGT+E++  L EVIVGAE  DHA  GQV+LDELEEVQEDEMIDK H E+IK+KYG   F+  K+ RNLVIIGL APVA K IE+Q  S+
Subjt:  IGPGAEEVVTSKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSM

Query:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV
        G +VHEAEEG+  +GGQIQVLG+  EVQADVMI Q CEK+VKSR  +QD D  +QSQ VAAANLGAQE L MTG E KCG SREEIGG            
Subjt:  GEEVHEAEEGE--RGGQIQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQV

Query:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH
             L +V KVGMIND H+VEV S DR EDFG  KQSQDLMVVGLHAK+ALTTKGTEDQCSSLR  V GAEG   QA QTEVLG FK  QEVEMID+HH
Subjt:  IMIYELKEVGKVGMINDHHEVEVNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHH

Query:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK
        E+ER GEMMEEP ERA   S E + PGEEATL+FFD + N  +A+ENGVI DA+ C+KL EENE+ EFFD +SDHG + A EI GAQ+SKG +LGEV NK
Subjt:  EDERLGEMMEEPIERAFMESKEGKSPGEEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNK

Query:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR
        QN+LEEQ++SK+SDDQTRI KG EAE+ +LS +HP+VRWPSEITG                T PKHSEQ +  TSE+DKNENSEALL  D+IC P S P+
Subjt:  QNKLEEQQLSKMSDDQTRIRKGHEAED-RLSKEHPQVRWPSEITG----------------TVPKHSEQEMPRTSESDKNENSEALLPADIICGP-SHPR

Query:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP
        GH GRGRP KLK+ ET ATSLSS A D D               PD+   THHI QQ+LKLP  RGRGRGRG+PRI+RQD ISV ET SPS+HLHHQQSP
Subjt:  GHLGRGRPQKLKVHETLATSLSSSAQDGD---------------PDIGDGTHHIGQQRLKLP--RGRGRGRGKPRIVRQDQISVSETLSPSKHLHHQQSP

Query:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK
         KR RGRPPKQKFDEDTVSKD ST LEN QQE KGRG G        RPSRGRK++K
Subjt:  EKR-RGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTG--------RPSRGRKQKK

SwissProt top hitse value%identityAlignment
Q9FYS5 HMG-Y-related protein A1.1e-0442.11Show/hide
Query:  PYASMIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV
        PY  MI  AI  +  + G ++ +IS++I  +Y  LP AH + L  HL ++ ESGELV
Subjt:  PYASMIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELV

Arabidopsis top hitse value%identityAlignment
AT5G08780.1 winged-helix DNA-binding transcription factor family protein7.4e-0940Show/hide
Query:  QTPTHLPYASMIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYNFKVEGKGVKRKKRRRKS-AGRSRRREVE
        +TP H  Y++MI  AI+++  E G SE++ISEFI ++Y++LP+AH   L  HL KL E  E L   N   Y+   E K V     +RKS     R  +  
Subjt:  QTPTHLPYASMIQRAIVEVGGEDGLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGE-LVKSNCGRYNFKVEGKGVKRKKRRRKS-AGRSRRREVE

Query:  SADEI
        +ADE+
Subjt:  SADEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACTCCCAACCTCAACTCTCTTCCATTCGACCGCTGCCGGAAAATCTCTCGTCTCCCTCTTATATGACATCGCATTCCGATCACCGACATTCCCTCGTA
GCCGGAAGGTTTAGAGACGCCCTCTTCTCCGCCCTCGCCGCCAAATATTCGACCAATGGCAGCGGCCACTCTTTGTCCTTCCACTCCGAGCAGTTCAAGTCCGTT
ATCGATTGCCGCCTTTATGAGAATTTCCCTTCCTTCCAAACTCCTACGCATCTTCCCTATGCCTCGATGATACAGAGGGCAATAGTTGAAGTGGGAGGGGAAGAT
GGATTGAGCGAGGAGTCAATATCAGAGTTTATCGTGAATGAGTATGAGGACTTGCCATGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGAAAGCTCTGTGAA
AGTGGGGAGCTAGTAAAATCGAACTGTGGGAGGTATAACTTTAAGGTGGAGGGTAAGGGAGTGAAGAGGAAGAAGCGGAGGAGGAAGTCGGCGGGAAGGAGTCGG
CGCCGAGAAGTGGAGAGTGCTGATGAGATAGAAGAGGATTTTGATAGGAAAAAGCGATCAAAGAAATTGATGACCATAGGACCCGGTGCGGAGGAGGTGGTAACA
AGTAAAGGGACTGAAGAACAAAGTGCTTTGTTGAGAGAAGTAATTGTTGGGGCTGAAGGTGTAGATCATGCTCAAGGAGGCCAAGTTGTGCTGGATGAACTTGAA
GAAGTTCAAGAAGATGAAATGATTGACAAGCAGCATGGAGAGAAAATCAAACATAAATATGGGCGTAAAGTTTTTGATCGGAAAAAGCAATTGCGGAATCTGGTG
ATTATAGGCCTTCGTGCACCAGTAGCTAGTAAGGAGATTGAAAGACAAAGGGGTTCAATGGGGGAAGAAGTTCATGAAGCTGAAGAAGGAGAGCGCGGAGGCCAA
ATTCAAGTGCTTGGTGAAGCTAATGAAGTTCAAGCAGATGTAATGATTCACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTGTTCAAGATTTTGATGGG
AAAAAGCAATCACAGAATGTGGCGGCTGCAAATCTTGGTGCACAGGAGACACTAACAATGACAGGGAATGAAGAAAAATGTGGTTCGTCAAGAGAAGAAATTGGT
GGAGCCAAAGAAAGAGGTTATGATCAAGACAGGCAAGTTATAATGATATATGAACTTAAAGAAGTTGGAAAAGTTGGAATGATTAATGATCATCACGAAGTGGAA
GTCAATAGTAGAGATAGGGTTGAAGATTTTGGTGGGAGAAAACAATCACAGGATCTGATGGTGGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGGACT
GAAGACCAATGTAGTTCGTTAAGAACAAAAGTCGATGGGGCTGAAGGAAACCGTGCACAAGCTGACCAAACTGAAGTGCTTGGTAAATTCAAAGAAGTTCAAGAA
GTTGAAATGATTGACAAGCATCATGAAGATGAAAGGCTAGGAGAAATGATGGAAGAACCAATAGAGAGAGCATTCATGGAATCAAAAGAAGGAAAGTCGCCCGGT
GAAGAAGCCACTTTGGAGTTCTTTGATGCTGTGTCAAACCATAGCAATGCTGAAGAAAATGGAGTGATTGATGATGCTGAATGTTGCAAGAAGTTACTAGAGGAA
AATGAAAATTTTGAGTTCTTTGACACAAGGTCTGACCATGGCTATGATGGGGCGAAAGAAATAATTGGTGCTCAATCTTCGAAGGGGACAATACTAGGTGAAGTT
AGCAATAAACAAAATAAACTGGAAGAACAACAACTATCCAAGATGAGTGATGATCAAACTAGAATAAGGAAGGGCCACGAGGCTGAGGATCGTCTGTCCAAGGAG
CATCCTCAAGTTAGATGGCCGTCTGAAATAACTGGAACTGTACCCAAGCATTCAGAGCAAGAGATGCCTAGGACTTCTGAGTCTGACAAAAATGAAAATTCCGAG
GCATTATTGCCTGCAGATATTATTTGTGGTCCAAGTCATCCTCGGGGGCATCTTGGTCGAGGGAGGCCTCAAAAGTTGAAAGTTCACGAAACTTTGGCAACTTCA
TTATCTTCATCTGCTCAAGACGGTGACCCAGATATTGGTGATGGCACTCACCATATTGGTCAGCAACGACTCAAGCTGCCGAGAGGGAGGGGGAGAGGTCGGGGA
AAGCCTCGAATAGTTAGACAAGACCAGATTTCAGTGTCAGAGACGTTGTCACCTTCCAAGCATTTGCATCACCAGCAATCTCCAGAAAAGAGACGTGGGAGGCCT
CCAAAACAAAAATTTGATGAAGATACTGTATCGAAGGACTTCTCGACTTCGTTAGAGAATGGACAACAAGAACATAAGGGCCGTGGTACTGGAAGGCCTTCCAGA
GGAAGAAAGCAAAAAAAGGGAATCATTTGA
mRNA sequenceShow/hide mRNA sequence
GTACTAAGGACGGAAATAGAGAGCAGTCGTTTACTAAAATTGTAGCGATGAAAAGGGAAGGAGAAGAGTTTTTAATATCGCATGGATATACAACGAAACGGGGTT
GCCAAAATTAAATACCATATATGAATAAAGCGTTTGCCTTCTCCAACGACGATGCATCGCCGCCGATATGGAGAACTCCCAACCTCAACTCTCTTCCATTCGACC
GCTGCCGGAAAATCTCTCGTCTCCCTCTTATATGACATCGCATTCCGATCACCGACATTCCCTCGTAGCCGGAAGGTTTAGAGACGCCCTCTTCTCCGCCCTCGC
CGCCAAATATTCGACCAATGGCAGCGGCCACTCTTTGTCCTTCCACTCCGAGCAGTTCAAGTCCGTTATCGATTGCCGCCTTTATGAGAATTTCCCTTCCTTCCA
AACTCCTACGCATCTTCCCTATGCCTCGATGATACAGAGGGCAATAGTTGAAGTGGGAGGGGAAGATGGATTGAGCGAGGAGTCAATATCAGAGTTTATCGTGAA
TGAGTATGAGGACTTGCCATGGGCGCACCCGGCGTTTTTGCGTCGCCATTTGGGAAAGCTCTGTGAAAGTGGGGAGCTAGTAAAATCGAACTGTGGGAGGTATAA
CTTTAAGGTGGAGGGTAAGGGAGTGAAGAGGAAGAAGCGGAGGAGGAAGTCGGCGGGAAGGAGTCGGCGCCGAGAAGTGGAGAGTGCTGATGAGATAGAAGAGGA
TTTTGATAGGAAAAAGCGATCAAAGAAATTGATGACCATAGGACCCGGTGCGGAGGAGGTGGTAACAAGTAAAGGGACTGAAGAACAAAGTGCTTTGTTGAGAGA
AGTAATTGTTGGGGCTGAAGGTGTAGATCATGCTCAAGGAGGCCAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCAGCATGG
AGAGAAAATCAAACATAAATATGGGCGTAAAGTTTTTGATCGGAAAAAGCAATTGCGGAATCTGGTGATTATAGGCCTTCGTGCACCAGTAGCTAGTAAGGAGAT
TGAAAGACAAAGGGGTTCAATGGGGGAAGAAGTTCATGAAGCTGAAGAAGGAGAGCGCGGAGGCCAAATTCAAGTGCTTGGTGAAGCTAATGAAGTTCAAGCAGA
TGTAATGATTCACCAACCTTGTGAAAAGGAAGTCAAGAGTAGAGATGGTGTTCAAGATTTTGATGGGAAAAAGCAATCACAGAATGTGGCGGCTGCAAATCTTGG
TGCACAGGAGACACTAACAATGACAGGGAATGAAGAAAAATGTGGTTCGTCAAGAGAAGAAATTGGTGGAGCCAAAGAAAGAGGTTATGATCAAGACAGGCAAGT
TATAATGATATATGAACTTAAAGAAGTTGGAAAAGTTGGAATGATTAATGATCATCACGAAGTGGAAGTCAATAGTAGAGATAGGGTTGAAGATTTTGGTGGGAG
AAAACAATCACAGGATCTGATGGTGGTTGGACTTCATGCAAAAGAGGCACTAACGACTAAAGGGACTGAAGACCAATGTAGTTCGTTAAGAACAAAAGTCGATGG
GGCTGAAGGAAACCGTGCACAAGCTGACCAAACTGAAGTGCTTGGTAAATTCAAAGAAGTTCAAGAAGTTGAAATGATTGACAAGCATCATGAAGATGAAAGGCT
AGGAGAAATGATGGAAGAACCAATAGAGAGAGCATTCATGGAATCAAAAGAAGGAAAGTCGCCCGGTGAAGAAGCCACTTTGGAGTTCTTTGATGCTGTGTCAAA
CCATAGCAATGCTGAAGAAAATGGAGTGATTGATGATGCTGAATGTTGCAAGAAGTTACTAGAGGAAAATGAAAATTTTGAGTTCTTTGACACAAGGTCTGACCA
TGGCTATGATGGGGCGAAAGAAATAATTGGTGCTCAATCTTCGAAGGGGACAATACTAGGTGAAGTTAGCAATAAACAAAATAAACTGGAAGAACAACAACTATC
CAAGATGAGTGATGATCAAACTAGAATAAGGAAGGGCCACGAGGCTGAGGATCGTCTGTCCAAGGAGCATCCTCAAGTTAGATGGCCGTCTGAAATAACTGGAAC
TGTACCCAAGCATTCAGAGCAAGAGATGCCTAGGACTTCTGAGTCTGACAAAAATGAAAATTCCGAGGCATTATTGCCTGCAGATATTATTTGTGGTCCAAGTCA
TCCTCGGGGGCATCTTGGTCGAGGGAGGCCTCAAAAGTTGAAAGTTCACGAAACTTTGGCAACTTCATTATCTTCATCTGCTCAAGACGGTGACCCAGATATTGG
TGATGGCACTCACCATATTGGTCAGCAACGACTCAAGCTGCCGAGAGGGAGGGGGAGAGGTCGGGGAAAGCCTCGAATAGTTAGACAAGACCAGATTTCAGTGTC
AGAGACGTTGTCACCTTCCAAGCATTTGCATCACCAGCAATCTCCAGAAAAGAGACGTGGGAGGCCTCCAAAACAAAAATTTGATGAAGATACTGTATCGAAGGA
CTTCTCGACTTCGTTAGAGAATGGACAACAAGAACATAAGGGCCGTGGTACTGGAAGGCCTTCCAGAGGAAGAAAGCAAAAAAAGGGAATCATTTGATAATCAGT
AATCACCATATGTTCAAGTGCTAACTATATTAATTAGTATCTAGGAAAAGTTAACATCTGATTTTGTTGCGTGTTTCACTTAGTTTGCTTCTTTCCTTTTATGAT
CACACGTTAATAAAATTTCCACTTAACTCTTGAAGCTGGTTTTTCAAGTTATGTAATTACAGGTATTATAGGAGGGGAATAAGAGCAAGAAGTTGCAAGATCTCT
TCCCTGCATGACTTGCAGCTTTTCCATCTATTCTCATGTCTATCCTTATGATCAGTGACTCCGTTTTTTATACAGGAGATAATCGTTTCATCAGAAAACACCCGT
GTTCATGTAGATTTTCCAACTCTATGTGGACTAAGCTTCTTCACGACTCAACTGGTTTAGGTTCATGCTCTTTTGTCTCCAAAATAGGGGCTTAAAAGGTAGGTA
GGATATAATCCCTCTCTCTAACTTTTCTCTCAAATGTGAGACATTCTTAGAGATGGGAAAAGTGAACGATAGATAGGAAACTGTTCTTGATCTGCAAATCCGGGA
TTGGGAGGAAGATTTCTATCAAAGAAAGGCTCGGCAAGAAATCGGGCAAGCTGGATTTAGAGTTGGGTACTTCGGCCTGGGTAAGAGATTGCTCAGTGTTTGCTG
CAGACCTGAGTAACCCTTTGGGTTTTTGGCGGAGAAGACGGATGGATCGTGCGATTATATTCTTTCAAGTGCTTGAAAATTCTAGGGAAGATTTGTTTTGTTATC
TCTGGAACTGTTCAAGGGGCAAAGGACGAGGTTCTTTATTCTGAAGGCAAGAAAGGAAGTGGATGGAGGTTATTGGCGAGGGAGGTTTCTGGACTTTTTCCATCC
CTTCGAAGACTCTTCCGTCTGCACATACAAGAGCACAGTTGACAAAACGTCTGGAAGGGGAAGCCAAAATAGTCACCTCGGTTAAGGCTACTGAGGCAAGTTTGG
AGGGGAAGCTGGTGTTCGGAAAGGTATTAGGCTCTGTGATGGAGGTCGAAAGGAAAAAGGCAGAGCTAGAGAAGTTTGCAAAAGATAAGAAGTGGGAGTGGAATT
TGCATTCGATTGTGAAGTGGCAGTGATCATTAAGAAGGTCAATGCAGAAGTGGACAAGGAGGTGCTAAAGAAAGTCTTCGATAGGCCATGGTTTCTCAAATGCGG
CCTCAGAATTTTCTGTGCAAACTTAGCTGTGGGAATTTGCGATTCAAAGTTGGAGATAGAGGATCTAATTTGTTTAGGCTCTTCCCTCAATTCTCAAGATTTTAT
CTTTCAATAGTGGGATTTCCAAACGGTGGAAGAGGAAAGGAAACAGGACTTCAAAGGAGGTTGGGTACCGGCGTTGGATGTTCCCCCCTTCCTCAGAACTAAGGA
GTTTAGCCTCCATGGCAAAGTGGTGTGGAGGATTGATCGACGAAGAGTTAGACTGTTATAGATCGAATTGGTCGGAGGAAGTATGCTTCAAAGGAAAGGTAATGT
GGACGCTTTTATCCCGATGGGTTGTTATGTTAATCACTCAGAGTGCGTTTCTGGTGAGGTTCTATTTTGGTCGAAAAAGGGAAGAAGACGATCAGAGGCGGTCAG
AGAGTCCTTCGATTAAAGCTATTTTGTGGTCCTTTGGTCTGTCGTGTGGTCAAACAGAAAAGGAGATAGGAGGTCAGGGATTAATTTGGGGAAAAGGGAACGAGG
TAGGGTTGATGAGCATGATGGTCAAGAGGATGGGTTGGTTCGTAGCAGTTCCCAAAAGCTCGATTCGGGCACCAAGGCCACAGCTGAAGGTAAAAAACGGGCTAT
TCTGGAGAGCGATTTCTTAAAATCTGGGCCGCCTTCGTTTGCACACCAGGGCCCAAGAAGGGATTCAAGCCCATGAGTCGGTTATTCAGCTCGCTTCTACTTTAG
GAGGGTTGGAGACCCATTCTACGAGAGTTGGGGAAGAGTATCATGGAATGAGATGGGAGCTAGAGGTGAGTGTCAGGCCGTCGGATCTTTTAGCTGAAGATGGGG
ATCTCGGATTTGACCAATTGATTGTTAGTTATTCTAACAATGAATCGTTTCTCTCGACTTTGATGTTGAACAAAGCGACAGAGGAGGGTGTCAAGGGGCTGATTT
TGGAAGACGCAGATGCTCAAGGAGATTCAAATTCGAATACACTTTCGAAGGTGTTGTTTGAGTAAGTGAGGTGGGATGGGTCAGTAAGCAACGGTGATGTCACTC
AGGTGAAGGTGCCGAAGGCTGTGGATGGAGCTGCTGACACAAAAGTAATGAATAAAACGGGCTTTGAGAACCAAAAGGCTTCGTCTTTCGATAGAGGTTATCCCT
CAGGTCCTGCTATTGGGTCTTGCTCCTACTCCTATCTGGTGGTAAGTACGGTTGGCAATACGAGCCAAGTAAGTCCTCTTTCTTTGGTTTCTTATCCTCTGTCCG
GGTTGGGGTTGCTTGAGAGTGGGGCTGGTGTCATTTCGACCAATTCGCTGTTTTTGTTTCCTTGTAATGTGGGTGGGAGAGACGGGCAGGTCGATGTTGAGAGTT
TTTATCACCCTTCGATCTTTCTACAACCCTTTTCTCATCTGGGGTGTGCCTTACCCTTTTCGCCCACGTTTTCTCTGGGAGATAGGTGGTGTTTAATCCTCTAGT
TCCTGTTGCAAGGGACGGGTCAAGGGGTGGCGGTATGCTTCTTACCAACGGGGGTGTTGTTGGAGCTCTTTTAGTACTCCCAGTTCTCTTGATTCTCAGACCTCA
CGTCCAAGTATTGGCATTGAAAACCCAGGGTGGGTCGCTCTTAGCCTCGGTGGGAATATAAAAACTAGAGGTTAAGGGAGGTTGGGGGGAAAGGGGGGCTCGGGC
CCTGAGAGAAAGTGGTAGTGGAGAAGAGGAAACGAAAAGAGGGTGGTGATAAATAAGCCAGGGAGTTAAAGAAGTTGGATAGTTCGATTAACTATGAGAGTAGGA
AAGCCTCAAAGGAGGTGGGGCCATCCAGGTCTGCTAAAAATCATTTCCTGGAATGTTAGGGGGTTGAAAGGACAAGTTGAAAGAAGGCTTGTTAAGGAGGTTCTT
AAACAAGAGAGTCTGGATATTGTCATTCTTATGGAAACTAAAAGTTCTTTTAGTATGAGGTGTGTTCGAACATTTGGAGGAGTAAAGGGTGGAGTGAGAGGTCCT
TGATTGGATTGGGTCCTTCGAAGGAATTCTCATGATGTGGAACACTAGGTCCTGTATTACCAAAGAAATTGTTAGAGGCAATTTTTCTCTTTCCATTTATTTTTT
TGGATAATGAAGGTGAGGGGTTTTGGATTGCTTGTGGGGCCTTTTTGTTGTATTCTTTTTCGGAAGGTGGAGGAAGATTTGGATCATCTCTTTTGGGACTGCCAG
TATGCGAGGATGATGCAGAGTCTCTTCTTGCAAAAGTTCAATATTAGCTTTGCTAGGCAGAGAGATGTTCGTGCAACAATTGAGGAGTTTCTCTTCCATCCGCCT
TTCAAAGAGAAAAGTCATTTTTTTGTGGCTTGTTGGGGTGAGTGCGGTTGTTTGGGATATTTGGGGGAAGAGGAATGATAGTGTTTAGAGGTAGGGACAGGGACC
CTTGTGAGGTTTGGTCTTTGGTGAGATTTCATGTGTCCTTTTCGAAGCTTTTTTATAACTATTCCTTAGGAAACATTTTACTTAGGTGGAACCCTTTCTTTTAGT
TGGAGTGTTTTGGTGGGCTGGTTTTTTTTGCATGCCCTCGTATCCTTTCACTTTTTTTTTTTTCTCAATGAAAGCAGTTTTTATTATTTAAAAAAAAAAATCCAC
Protein sequenceShow/hide protein sequence
MENSQPQLSSIRPLPENLSSPSYMTSHSDHRHSLVAGRFRDALFSALAAKYSTNGSGHSLSFHSEQFKSVIDCRLYENFPSFQTPTHLPYASMIQRAIVEVGGED
GLSEESISEFIVNEYEDLPWAHPAFLRRHLGKLCESGELVKSNCGRYNFKVEGKGVKRKKRRRKSAGRSRRREVESADEIEEDFDRKKRSKKLMTIGPGAEEVVT
SKGTEEQSALLREVIVGAEGVDHAQGGQVVLDELEEVQEDEMIDKQHGEKIKHKYGRKVFDRKKQLRNLVIIGLRAPVASKEIERQRGSMGEEVHEAEEGERGGQ
IQVLGEANEVQADVMIHQPCEKEVKSRDGVQDFDGKKQSQNVAAANLGAQETLTMTGNEEKCGSSREEIGGAKERGYDQDRQVIMIYELKEVGKVGMINDHHEVE
VNSRDRVEDFGGRKQSQDLMVVGLHAKEALTTKGTEDQCSSLRTKVDGAEGNRAQADQTEVLGKFKEVQEVEMIDKHHEDERLGEMMEEPIERAFMESKEGKSPG
EEATLEFFDAVSNHSNAEENGVIDDAECCKKLLEENENFEFFDTRSDHGYDGAKEIIGAQSSKGTILGEVSNKQNKLEEQQLSKMSDDQTRIRKGHEAEDRLSKE
HPQVRWPSEITGTVPKHSEQEMPRTSESDKNENSEALLPADIICGPSHPRGHLGRGRPQKLKVHETLATSLSSSAQDGDPDIGDGTHHIGQQRLKLPRGRGRGRG
KPRIVRQDQISVSETLSPSKHLHHQQSPEKRRGRPPKQKFDEDTVSKDFSTSLENGQQEHKGRGTGRPSRGRKQKKGII