; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G022620 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G022620
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome locationchr02:29188835..29191114
RNA-Seq ExpressionLsi02G022620
SyntenyLsi02G022620
Gene Ontology termsGO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061306.1 putative nuclease HARBI1 isoform X1 [Cucumis melo var. makuwa]2.5e-22595.25Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKNCS+SGSPGG+DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY+SDNLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

XP_008461482.1 PREDICTED: uncharacterized protein LOC103500068 [Cucumis melo]4.6e-22495Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKN S+SGSPGG+DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY+SDNLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

XP_011659137.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Cucumis sativus]1.6e-22494.75Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKNCS+SGSPG NDEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY++DNLSPL FNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

XP_023004332.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita maxima]2.0e-21992.75Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRK+DSKKLKK KNLSVVPMEPR SEPDWWEIFWHKNCS SGS G NDEAE FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSR EEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        N NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLC+ GERLNGN RK SGG EIREYLVGGVGYPLLPWL+TPY+SD+L PLK NFN VHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE+ICSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

XP_038896181.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Benincasa hispida]5.5e-22595.25Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKKDSKKL KRKNL VVPMEPRASEPDWWE+FWH+NC VSGS G NDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL WPSSSR EEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGN+RKFSGG EIREYLVGGVGYPLLPWL+TPY+SDNLSPLKFNFNAVHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
         AKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDEL+PDVALSGHHD+GYQEHCCKQLDPLGNT RENLAKHL+QNKERICSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

TrEMBL top hitse value%identityAlignment
A0A0A0K4D8 DDE Tnp4 domain-containing protein7.7e-22594.75Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKNCS+SGSPG NDEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY++DNLSPL FNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

A0A1S3CEQ5 uncharacterized protein LOC1035000682.2e-22495Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKN S+SGSPGG+DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY+SDNLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

A0A5A7V486 Putative nuclease HARBI1 isoform X11.2e-22595.25Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKR KDSKKLKKRKNLSVVPMEPRAS+PDWWEIFWHKNCS+SGSPGG+DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSR EEIKS FEA F LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLCDAGERLNGNV+KFSGG EIREYLVGGVGYPLLPWL+TPY+SDNLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

A0A6J1H7G5 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X21.8e-21892.5Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRK+DSKKLKK KNLSVVPMEPRA+EPDWWEIFWHKNCS SGS G NDEAE FKYFFRTSK+TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSR EEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
          NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLC+ GERLNGN RK SGG EIREYLVGGVGYPLLPWL+TPY+SD+LSPLK NFN VHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE+I SS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X29.8e-22092.75Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRK+DSKKLKK KNLSVVPMEPR SEPDWWEIFWHKNCS SGS G NDEAE FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSR EEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG
        N NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLC+ GERLNGN RK SGG EIREYLVGGVGYPLLPWL+TPY+SD+L PLK NFN VHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE+ICSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS

SwissProt top hitse value%identityAlignment
Q6AZB8 Putative nuclease HARBI14.8e-2226.86Show/hide
Query:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASF---
        Y+  L+++ L+ R          R +S + Q+  A+    SG  Q  +G A G+ Q+++S+      +AL ++A   + +   +R E  K QF+  F   
Subjt:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASF---

Query:  -GLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVG
         G+PN  G +D  HI +  P    S  + +    +S+  Q + D +   L   T WPG++T   + K S + KL +  E             +   +L+G
Subjt:  -GLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVG

Query:  GVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG
           YPL  WL+TP +S   SP  + +N  H     +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  GVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 15.3e-15467.76Show/hide
Query:  MAPTKKSKKRKK----DSKKL---KKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK KK     +KKL   K++K ++ VP++P A + DWW+ FW +N S S     +DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKK----DSKKL---KKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHLRWP S R EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKF
        SDDWCD   NYSMFLQG+ DH+MRFL++VTGWPG MT S+LLK S  FKLC+  + L+GN +  S G +IREY+VGG+ YPLLPWL+TP+ SD+ S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKF

Query:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
         FN  H   +S+A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +HL
Subjt:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

Q9M2U3 Protein ALP1-like2.5e-9545.34Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDL
        M P K  KK+K+  KK+ +   L+       AS                   DWW+ F  +        GG+ + + F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+ +EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG++    +LK S  +KL + G+RLNG     S   E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTP

Query:  YKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTT
        Y+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTT

Query:  RENLAKHL
        R+ L+  L
Subjt:  RENLAKHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.7e-2229.11Show/hide
Query:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIK
        +FR SK TF  + S++     S  PS                A  + RLA G S   +   FG    + SQ +  F    +      +    S + ++ K
Subjt:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIK

Query:  SQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEI
          F  +  LPNC G +      +    +             S+ +Q +VD   RF+DI  GWP  M    + + +K+F +  A E L+G   K   G+ +
Subjt:  SQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEI

Query:  REYLVGGVGYPLLPWLVTPY--KSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE
          Y++G    PLLPWLVTPY   SD  S  +   N VH    S+ + AF++++  WRIL+K  W+P+  + +P +I   CLL N ++++GD+
Subjt:  REYLVGGVGYPLLPWLVTPY--KSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.8e-9645.34Show/hide
Query:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDL
        M P K  KK+K+  KK+ +   L+       AS                   DWW+ F  +        GG+ + + F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+ +EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG++    +LK S  +KL + G+RLNG     S   E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTP

Query:  YKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTT
        Y+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTT

Query:  RENLAKHL
        R+ L+  L
Subjt:  RENLAKHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.7e-15567.76Show/hide
Query:  MAPTKKSKKRKK----DSKKL---KKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK KK     +KKL   K++K ++ VP++P A + DWW+ FW +N S S     +DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKK----DSKKL---KKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHLRWP S R EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKF
        SDDWCD   NYSMFLQG+ DH+MRFL++VTGWPG MT S+LLK S  FKLC+  + L+GN +  S G +IREY+VGG+ YPLLPWL+TP+ SD+ S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKF

Query:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
         FN  H   +S+A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +HL
Subjt:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

AT4G29780.1 unknown protein3.4e-3126.8Show/hide
Query:  DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV
        DWW+        VS      DE   F+  FR SK TF+ IC  + +  +++  + L +     +   K+V + + RLA+G     V   FG+G ST  ++
Subjt:  DWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV

Query:  TWRFVEAL-EQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGA
              A+ +     +L WPS S     K++FE+   +PN  G+I  THI +  P V  +  +       +   +YS+ +QG+V+    F D+  G PG+
Subjt:  TWRFVEAL-EQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGA

Query:  MTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPD
        +T  ++L+ S + +            ++ + G+    ++VG  G+PL  +L+ PY   NL+  +  FN   G  + +A  AF +LKG W  L K      
Subjt:  MTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPD

Query:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
         + LP ++  CC+L NI     +E+ P++      D+   E+  +    +   TR++++ +L
Subjt:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

AT5G12010.1 unknown protein3.2e-3727.9Show/hide
Query:  WWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT
        WWE      CS    P      E FK  FR SK TF+ IC  +    +++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++ 
Subjt:  WWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT

Query:  WRFVEALEQ-RAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAM
            +A++      +L+WP       I+ +FE+  G+PN  G++  THI +  P +  +  +       +   +YS+ +Q +V+ +  F D+  GWPG+M
Subjt:  WRFVEALEQ-RAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAM

Query:  TTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDK
           ++L+ S +++            R  +GGL    ++ GG G+PLL W++ PY   NL+  +  FN      + +A  AF +LKG W  L K       
Subjt:  TTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDK

Query:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNTTRENLAKH
        + LP+++  CC+L NI     ++++P++ +    D    E+  + ++ +   +T   NL  H
Subjt:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNTTRENLAKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCTACAAAGAAATCGAAGAAGCGCAAGAAGGATTCCAAGAAACTGAAGAAACGTAAAAACTTGAGCGTTGTTCCCATGGAACCCAGAGCTTCAGAGCCTGATTG
GTGGGAAATTTTCTGGCACAAGAACTGTTCCGTCTCAGGTTCTCCTGGAGGTAATGATGAAGCAGAAGGATTCAAGTATTTCTTTCGAACATCGAAGAAAACTTTCGACT
ACATTTGTTCCCTCGTACGAGAAGATCTCATTTCGAGGCCGCCGTCTGGGCTTATCAATATTGAAGGGAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCTATGCGA
AGATTGGCATCTGGTGAATCACAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGC
GAAGCACCATCTTCGGTGGCCGAGTTCGTCTAGATTTGAAGAAATCAAGTCACAGTTTGAGGCTTCTTTTGGGCTTCCCAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACACTTCCAGCCGTACAAACATCAGATGATTGGTGTGATACCAACAATAATTACAGTATGTTCTTGCAGGGAATTGTTGATCACCAGATGAGATTTCTTGAC
ATTGTAACTGGTTGGCCCGGGGCCATGACAACTAGTAGGTTATTGAAGTGCTCAAAAATTTTCAAGCTATGTGACGCGGGTGAACGTTTGAATGGGAATGTAAGGAAGTT
TTCTGGAGGGTTAGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCCTTGGTTGGTTACTCCTTACAAAAGTGACAACCTATCGCCGTTGAAGTTCA
ACTTCAATGCTGTGCACGGAGCTGCAAAATCGCTTGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGG
AAATTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGATA
TCAGGAACATTGTTGTAAACAGTTAGATCCATTGGGGAACACTACAAGGGAAAACTTAGCCAAGCACTTGCATCAAAATAAAGAGAGAATTTGTTCTTCGTAA
mRNA sequenceShow/hide mRNA sequence
TGATTTAGTTTTTGGTTGGCTCCAAAATCGACAAAACCAAATCGATTATTGACCTTATAATAAGCATAAGTCATAAAACATTTAATCTATGTTAATATCAAATTGAAAAA
ATAATGTACTTAAAAAAATTGAAAAAATAATAAAAAAATGATCCAAGGGAGAGCAGAAGGTTAGTGCGAGGTCAAGCTTCAAGCTTCATTCTCCAGTGGATTCATTCGTT
TTTTCAATTTCCAAATATTTTTCATTTTTCAGGGCAATCGCAAACACCTTAATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCATTCACATTAGTAGCGACTTCTC
CCCTCAAACGAACGTCGGTCGTTGGACGGTTGATTCCTTGTTGAATCAGAATGGCGCCTACAAAGAAATCGAAGAAGCGCAAGAAGGATTCCAAGAAACTGAAGAAACGT
AAAAACTTGAGCGTTGTTCCCATGGAACCCAGAGCTTCAGAGCCTGATTGGTGGGAAATTTTCTGGCACAAGAACTGTTCCGTCTCAGGTTCTCCTGGAGGTAATGATGA
AGCAGAAGGATTCAAGTATTTCTTTCGAACATCGAAGAAAACTTTCGACTACATTTGTTCCCTCGTACGAGAAGATCTCATTTCGAGGCCGCCGTCTGGGCTTATCAATA
TTGAAGGGAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCTATGCGAAGATTGGCATCTGGTGAATCACAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCC
ACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGCGAAGCACCATCTTCGGTGGCCGAGTTCGTCTAGATTTGAAGAAATCAAGTCACAGTTTGA
GGCTTCTTTTGGGCTTCCCAATTGTTGTGGAGCCATAGATGCAACACACATCATTATGACACTTCCAGCCGTACAAACATCAGATGATTGGTGTGATACCAACAATAATT
ACAGTATGTTCTTGCAGGGAATTGTTGATCACCAGATGAGATTTCTTGACATTGTAACTGGTTGGCCCGGGGCCATGACAACTAGTAGGTTATTGAAGTGCTCAAAAATT
TTCAAGCTATGTGACGCGGGTGAACGTTTGAATGGGAATGTAAGGAAGTTTTCTGGAGGGTTAGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCC
TTGGTTGGTTACTCCTTACAAAAGTGACAACCTATCGCCGTTGAAGTTCAACTTCAATGCTGTGCACGGAGCTGCAAAATCGCTTGCTGTGAGGGCATTCTCTCAGTTGA
AGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGGAAATTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGA
GATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGATATCAGGAACATTGTTGTAAACAGTTAGATCCATTGGGGAACACTACAAGGGAAAACTTAGC
CAAGCACTTGCATCAAAATAAAGAGAGAATTTGTTCTTCGTAAGGCTTCGAAATTGTATCAAACTCCGAAAACTTGCCAACGATCTGGTAAGACTTAGATTTGTGTCTAC
ACTGATGAAGTAATGCATATTTTGTTCAAAATTTCCCTCATTGTGAAGATGAGTATTTTAGCCAACATGTCATTGTTAGTTTCTGTAAGTATATATTCTTTCACTGAAAA
CTCCTATTTGCTGCATTTATGTAAGAGACGAAATGAAATTACGGTATGGTACGATAGGACGAACAGAGTATCTC
Protein sequenceShow/hide protein sequence
MAPTKKSKKRKKDSKKLKKRKNLSVVPMEPRASEPDWWEIFWHKNCSVSGSPGGNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSRFEEIKSQFEASFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLD
IVTGWPGAMTTSRLLKCSKIFKLCDAGERLNGNVRKFSGGLEIREYLVGGVGYPLLPWLVTPYKSDNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERICSS