; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G2772 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G2772
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome locationctg1041:1190045..1193425
RNA-Seq ExpressionCucsat.G2772
SyntenyCucsat.G2772
Gene Ontology termsGO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061306.1 putative nuclease HARBI1 isoform X1 [Cucumis melo var. makuwa]2.45e-29397.25Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE+DNLSPL FNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

XP_008461482.1 PREDICTED: uncharacterized protein LOC103500068 [Cucumis melo]1.16e-29197Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKN SLSGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE+DNLSPL FNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

XP_011659137.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Cucumis sativus]3.52e-302100Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

XP_023513674.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita pepo subsp. pepo]2.37e-27591Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR +DSKKLKK KNLSVVPMEPRAS+PDWWEIFWHKNCS SGS G NDEA  FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
          NYSMFLQGIVDHQMRF+DIVTGWPG MTTSRLLKCS+ FKLC+ GERLNGN +K SGGSEIREYLVGGVGYPLLPWLITPYE+D+LSPL  NFN VQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGN  RENLA H HQNKE++CSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

XP_038896181.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Benincasa hispida]2.82e-28393.25Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKL KRKNL VVPMEPRAS+PDWWE+FWH+NC +SGS G NDEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGN++KFSGGSEIREYLVGGVGYPLLPWLITPYE+DNLSPL FNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
         AK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDEL+PDVALSGHHD+GYQEHCCKQLDPLGN LRENLAKHL+QNKER+CSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

TrEMBL top hitse value%identityAlignment
A0A0A0K4D8 DDE Tnp4 domain-containing protein1.71e-302100Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

A0A1S3CEQ5 uncharacterized protein LOC1035000685.64e-29297Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKN SLSGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE+DNLSPL FNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

A0A5A7V486 Putative nuclease HARBI1 isoform X11.19e-29397.25Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE+DNLSPL FNFNAVQG
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

A0A6J1H7G5 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X22.59e-27290Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR +DSKKLKK KNLSVVPMEPRA++PDWWEIFWHKNCS SGS G NDEA  FKYFFRTSK+TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
          NYSMFLQGIVDHQMRF+DIVTGWPG MTTSRLLKCS+ FKLC+ GERLNGN +K SGGSEIREYLVGGVGYPLLPWLITPYE+D+LSPL  NFN V G
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGN  RENLA H HQNKE++ SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X25.45e-27490.25Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR +DSKKLKK KNLSVVPMEPR S+PDWWEIFWHKNCS SGS G NDEA  FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG
        N NYSMFLQGIVDHQMRF+DIVTGWPG MTTSRLLKCS+ FKLC+ GERLNGN +K SGGSEIREYLVGGVGYPLLPWLITPYE+D+L PL  NFN V G
Subjt:  NNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGN  RENLA H HQNKE++CSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS

SwissProt top hitse value%identityAlignment
Q6AZB8 Putative nuclease HARBI19.0e-2124.64Show/hide
Query:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEAFFGL
        Y+  L+++ L+ R          R +S + Q+  A+    SG  Q  +G A G+ Q+++S+      +AL ++A   + +    +  ++ K +F    G+
Subjt:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEAFFGL

Query:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVG
        PN  G +D  HI +  P    S  + +    +S+  Q + D +   +   T WPG++T   + K S + KL +  E            ++   +L+G   
Subjt:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVG

Query:  YPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG
        YPL  WL+TP ++   SP ++ +N        +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  YPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.0e-15367.25Show/hide
Query:  MAPTKKSKKRNK----DSKKL---KKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK  K     +KKL   K++K ++ VP++P A D DWW+ FW +N S S     +DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRNK----DSKKL---KKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNF
        SDDWCD   NYSMFLQG+ DH+MRF+++VTGWPG MT S+LLK S  FKLC+  + L+GN K  S G++IREY+VGG+ YPLLPWLITP+++D+ S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNF

Query:  NFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL
         FN      + +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+ LR  L +HL
Subjt:  NFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL

Q9M2U3 Protein ALP1-like6.1e-9444.85Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRAS-----------------DPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDL
        M P K  KK+ +  KK+ +   L+       AS                   DWW+ F  +        G + +   F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRAS-----------------DPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRF+D++ GWPG++    +LK S  +KL + G+RLNG     S  +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITP

Query:  YENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNL
        Y+    S     FN     A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   + L
Subjt:  YENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNL

Query:  RENLAKHL
        R+ L+  L
Subjt:  RENLAKHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)7.1e-2125.95Show/hide
Query:  SKKLKKRKNLSVVPM--EPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRR
        S+ L+    +S +P+   P +S        W      S +   +D    +  +FR SK TF  + S++     S  PS                A  + R
Subjt:  SKKLKKRKNLSVVPM--EPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRR

Query:  LASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGI
        LA G S   +   FG    + SQ +  F    +      +    S +L++ K  F     LPNC G +      +    +             S+ +Q +
Subjt:  LASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGI

Query:  VDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE-NDNLSPLNFNFNAVQGAAKLLAVRAF
        VD   RF+DI  GWP  M    + + +++F +  A E L+G   K   G  +  Y++G    PLLPWL+TPY+   +       FN V          AF
Subjt:  VDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYE-NDNLSPLNFNFNAVQGAAKLLAVRAF

Query:  SQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE
        ++++  WRIL+K  W+P+  + +P +I   CLL N ++++GD+
Subjt:  SQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases4.3e-9544.85Show/hide
Query:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRAS-----------------DPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDL
        M P K  KK+ +  KK+ +   L+       AS                   DWW+ F  +        G + +   F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRAS-----------------DPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRF+D++ GWPG++    +LK S  +KL + G+RLNG     S  +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITP

Query:  YENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNL
        Y+    S     FN     A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   + L
Subjt:  YENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNL

Query:  RENLAKHL
        R+ L+  L
Subjt:  RENLAKHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.4e-15467.25Show/hide
Query:  MAPTKKSKKRNK----DSKKL---KKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK  K     +KKL   K++K ++ VP++P A D DWW+ FW +N S S     +DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRNK----DSKKL---KKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNF
        SDDWCD   NYSMFLQG+ DH+MRF+++VTGWPG MT S+LLK S  FKLC+  + L+GN K  S G++IREY+VGG+ YPLLPWLITP+++D+ S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNF

Query:  NFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL
         FN      + +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+ LR  L +HL
Subjt:  NFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL

AT4G29780.1 unknown protein1.7e-3026.52Show/hide
Query:  DWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV
        DWW+        +S      DE   F+  FR SK TF+ IC  + +  +++  + L +     +   K+V + + RLA+G     V   FG+G ST  ++
Subjt:  DWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV

Query:  TWRFVEAL-EQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFIDIVTGWPGA
              A+ +     +L WPS S +   K++FE+   +PN  G+I  THI +  P V  +  +       +   +YS+ +QG+V+    F D+  G PG+
Subjt:  TWRFVEAL-EQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFIDIVTGWPGA

Query:  MTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPD
        +T  ++L+ S + +            ++ + G     ++VG  G+PL  +L+ PY   NL+     FN   G  + +A  AF +LKG W  L K      
Subjt:  MTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPD

Query:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL
         + LP ++  CC+L NI     +E+ P++      D+   E+  +    +  N R++++ +L
Subjt:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHL

AT5G12010.1 unknown protein4.1e-3727.62Show/hide
Query:  WWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT
        WWE      CS    P  +     FK  FR SK TF+ IC  +    +++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++ 
Subjt:  WWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT

Query:  WRFVEALEQ-RAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAM
            +A++      +LQWP    L  I+ +FE+  G+PN  G++  THI +  P +  +  +       +   +YS+ +Q +V+ +  F D+  GWPG+M
Subjt:  WRFVEALEQ-RAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAM

Query:  TTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDK
           ++L+ S +++  + G  L G             ++ GG G+PLL W++ PY   NL+     FN      + +A  AF +LKG W  L K       
Subjt:  TTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDK

Query:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNNLRENLAKH
        + LP+++  CC+L NI     ++++P++ +    D    E+  + ++ +   + +  NL  H
Subjt:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNNLRENLAKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCTACAAAGAAATCGAAGAAGCGCAACAAGGATTCCAAGAAACTGAAGAAACGTAAAAATTTGAGCGTTGTTCCCATGGAGCCCAGAGCATCAGACCCTGATTG
GTGGGAAATTTTCTGGCACAAGAACTGTTCCCTCTCAGGTTCTCCTGGACGTAATGATGAAGCAGCTGGATTCAAGTATTTCTTTCGAACGTCGAAAAAAACTTTCGACT
ACATTTGTTCCCTCGTACGAGAGGATCTCATTTCAAGGCCACCGTCTGGGCTTATCAATATTGAAGGGAGACTTCTTAGTGTAGAGAAGCAGGTTGCAATTGCTATGCGA
AGATTGGCATCGGGTGAATCGCAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGC
GAAGCACCATCTTCAGTGGCCGAGTTCCTCTAGATTGGAGGAAATCAAATCACAGTTTGAAGCTTTCTTTGGTTTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACACTTCCAGCTGTACAAACATCAGATGATTGGTGTGATACCAACAATAATTACAGTATGTTCTTGCAGGGAATTGTTGATCACCAGATGAGATTTATTGAT
ATTGTAACTGGTTGGCCTGGGGCCATGACGACTAGTAGGTTATTAAAGTGTTCACGAATTTTCAAACTATGCGATGCCGGTGAACGTTTGAATGGGAATGTAAAGAAGTT
CTCTGGAGGGTCAGAGATCAGAGAATACTTAGTTGGTGGAGTTGGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTACGAAAATGATAACCTATCGCCGTTGAATTTCA
ACTTCAATGCTGTGCAAGGAGCTGCAAAATTGCTTGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGG
AAGCTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGATTTGGGATA
TCAGGAGCATTGTTGTAAACAGTTAGATCCATTAGGGAACAATCTAAGGGAAAACTTAGCCAAGCACTTGCATCAAAATAAAGAGAGAGTTTGTTCTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCTACAAAGAAATCGAAGAAGCGCAACAAGGATTCCAAGAAACTGAAGAAACGTAAAAATTTGAGCGTTGTTCCCATGGAGCCCAGAGCATCAGACCCTGATTG
GTGGGAAATTTTCTGGCACAAGAACTGTTCCCTCTCAGGTTCTCCTGGACGTAATGATGAAGCAGCTGGATTCAAGTATTTCTTTCGAACGTCGAAAAAAACTTTCGACT
ACATTTGTTCCCTCGTACGAGAGGATCTCATTTCAAGGCCACCGTCTGGGCTTATCAATATTGAAGGGAGACTTCTTAGTGTAGAGAAGCAGGTTGCAATTGCTATGCGA
AGATTGGCATCGGGTGAATCGCAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGC
GAAGCACCATCTTCAGTGGCCGAGTTCCTCTAGATTGGAGGAAATCAAATCACAGTTTGAAGCTTTCTTTGGTTTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACACTTCCAGCTGTACAAACATCAGATGATTGGTGTGATACCAACAATAATTACAGTATGTTCTTGCAGGGAATTGTTGATCACCAGATGAGATTTATTGAT
ATTGTAACTGGTTGGCCTGGGGCCATGACGACTAGTAGGTTATTAAAGTGTTCACGAATTTTCAAACTATGCGATGCCGGTGAACGTTTGAATGGGAATGTAAAGAAGTT
CTCTGGAGGGTCAGAGATCAGAGAATACTTAGTTGGTGGAGTTGGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTACGAAAATGATAACCTATCGCCGTTGAATTTCA
ACTTCAATGCTGTGCAAGGAGCTGCAAAATTGCTTGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGG
AAGCTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGATTTGGGATA
TCAGGAGCATTGTTGTAAACAGTTAGATCCATTAGGGAACAATCTAAGGGAAAACTTAGCCAAGCACTTGCATCAAAATAAAGAGAGAGTTTGTTCTTCGTAA
Protein sequenceShow/hide protein sequence
MAPTKKSKKRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCSLSGSPGRNDEAAGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFID
IVTGWPGAMTTSRLLKCSRIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAVQGAAKLLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKERVCSS