; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G07620 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G07620
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome locationClcChr09:6142042..6143692
RNA-Seq ExpressionClc09G07620
SyntenyClc09G07620
Gene Ontology termsGO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061306.1 putative nuclease HARBI1 isoform X1 [Cucumis melo var. makuwa]3.1e-22093.75Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKNCS+SGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEACF LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYES NLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

XP_008461482.1 PREDICTED: uncharacterized protein LOC103500068 [Cucumis melo]5.9e-21993.5Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKN S+SGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEACF LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYES NLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

XP_011659137.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Cucumis sativus]3.4e-21993.25Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKNCS+SGSPG NDEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYE+ NLSPL FNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

XP_023004332.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita maxima]3.6e-21691.5Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKRK+DSKKLKK KNL+VVPMEPR SEPDWW IFWHKNCS SGS GPNDEAE FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        N NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLCN  ERLNGN RK+SGG EIREYLVGGV YPLLPWLITPYES +L PLK NFN VHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE++CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

XP_038896181.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Benincasa hispida]1.8e-22093.75Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKRKKDSKKL KRKNL VVPMEPR SEPDWW +FWH+NC VSGS G NDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL WPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLC+A ERLNGN+RK SGG EIREYLVGGV YPLLPWLITPYES NLSPLKFNFNAVHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
         AKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDEL+PDVALSGHHD+GYQEHCCKQLDPLGNT RENLAKHL+QNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

TrEMBL top hitse value%identityAlignment
A0A0A0K4D8 DDE Tnp4 domain-containing protein1.7e-21993.25Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKNCS+SGSPG NDEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRF+DIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYE+ NLSPL FNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

A0A1S3CEQ5 uncharacterized protein LOC1035000682.8e-21993.5Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKN S+SGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEACF LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYES NLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

A0A5A7V486 Putative nuclease HARBI1 isoform X11.5e-22093.75Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKR KDSKKLKKRKNL+VVPMEPR S+PDWW IFWHKNCS+SGSPG +DEA GFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKS FEACF LPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCS+IFKLC+A ERLNGNV+K SGG EIREYLVGGV YPLLPWLITPYES NLSPLKFNFNAV G
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGN  RENLAKHLHQNKER+CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X21.7e-21691.5Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        M  TKKSKKRK+DSKKLKK KNL+VVPMEPR SEPDWW IFWHKNCS SGS GPNDEAE FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG
        N NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLCN  ERLNGN RK+SGG EIREYLVGGV YPLLPWLITPYES +L PLK NFN VHG
Subjt:  NNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHG

Query:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS
        AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE++CSS
Subjt:  AAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS

A0A6J1KRU3 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X17.2e-21590.82Show/hide
Query:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVS---GSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR
        M  TKKSKKRK+DSKKLKK KNL+VVPMEPR SEPDWW IFWHKNCS S   GS GPNDEAE FKYFFRTSKKTFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVS---GSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEA FGLPNCCGAIDATHIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDW

Query:  CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNA
        CDTN NYSMFLQGIVDHQMRFLDIVTGWPG MTTSRLLKCSK FKLCN  ERLNGN RK+SGG EIREYLVGGV YPLLPWLITPYES +L PLK NFN 
Subjt:  CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNA

Query:  VHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERL
        VHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNT+RENLA H HQNKE++
Subjt:  VHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERL

Query:  CSS
        CSS
Subjt:  CSS

SwissProt top hitse value%identityAlignment
Q6AZB8 Putative nuclease HARBI18.2e-2225.71Show/hide
Query:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEACFGL
        Y+  L+++ L+ R          R +S + Q+  A+    SG  Q  +G A G+ Q+++S+      +AL ++A   + +    +  ++ K +F    G+
Subjt:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEACFGL

Query:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVS
        PN  G +D  HI +  P    S  + +    +S+  Q + D +   L   T WPG++T   + K S + KL    E             +   +L+G   
Subjt:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVS

Query:  YPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG
        YPL  WL+TP +S   SP  + +N  H     +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  YPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.9e-15267.25Show/hide
Query:  MVATKKSKKRKK----DSKKL---KKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        M   K+ KK KK     +KKL   K++K +  VP++P   + DWW  FW +N S S    P+DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MVATKKSKKRKK----DSKKL---KKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKF
        SDDWCD   NYSMFLQG+ DH+MRFL++VTGWPG MT S+LLK S  FKLC   + L+GN + +S G +IREY+VGG+SYPLLPWLITP++S + S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKF

Query:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
         FN  H   +S+A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +HL
Subjt:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

Q9M2U3 Protein ALP1-like1.0e-9348.88Show/hide
Query:  DWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV
        DWW  F  +     GS  P    + F+  F+ S+KTFDYICSLV+ D  ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+
Subjt:  DWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV

Query:  TWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL
        TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID THI+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG++    +
Subjt:  TWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL

Query:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS
        LK S  +KL    +RLNG    +S   E+REY+VG   +PLLPWL+TPY+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP 
Subjt:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS

Query:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
        II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  R+ L+  L
Subjt:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)7.6e-2328.42Show/hide
Query:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIK
        +FR SK TF  + S++     S  PS                A  + RLA G S   +   FG    + SQ +  F    +      +    S +L++ K
Subjt:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIK

Query:  SQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEI
          F     LPNC G +      +    +             S+ +Q +VD   RF+DI  GWP  M    + + +K+F +  A+E L+G   K+  G+ +
Subjt:  SQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEI

Query:  REYLVGGVSYPLLPWLITPYE-SGNLSPLKFNF-NAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE
          Y++G    PLLPWL+TPY+ + +    +  F N VH    S+ + AF++++  WRIL+K  W+P+  + +P +I   CLL N ++++GD+
Subjt:  REYLVGGVSYPLLPWLITPYE-SGNLSPLKFNF-NAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE

AT1G72270.2 LOCATED IN: mitochondrion7.6e-2328.42Show/hide
Query:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIK
        +FR SK TF  + S++     S  PS                A  + RLA G S   +   FG    + SQ +  F    +      +    S +L++ K
Subjt:  FFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIK

Query:  SQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEI
          F     LPNC G +      +    +             S+ +Q +VD   RF+DI  GWP  M    + + +K+F +  A+E L+G   K+  G+ +
Subjt:  SQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEI

Query:  REYLVGGVSYPLLPWLITPYE-SGNLSPLKFNF-NAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE
          Y++G    PLLPWL+TPY+ + +    +  F N VH    S+ + AF++++  WRIL+K  W+P+  + +P +I   CLL N ++++GD+
Subjt:  REYLVGGVSYPLLPWLITPYE-SGNLSPLKFNF-NAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases7.4e-9548.88Show/hide
Query:  DWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV
        DWW  F  +     GS  P    + F+  F+ S+KTFDYICSLV+ D  ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+
Subjt:  DWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQV

Query:  TWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL
        TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID THI+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG++    +
Subjt:  TWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDD-WCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL

Query:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS
        LK S  +KL    +RLNG    +S   E+REY+VG   +PLLPWL+TPY+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP 
Subjt:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS

Query:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
        II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  R+ L+  L
Subjt:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.1e-15367.25Show/hide
Query:  MVATKKSKKRKK----DSKKL---KKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN
        M   K+ KK KK     +KKL   K++K +  VP++P   + DWW  FW +N S S    P+DE   FK+FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MVATKKSKKRKK----DSKKL---KKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  +GLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKF
        SDDWCD   NYSMFLQG+ DH+MRFL++VTGWPG MT S+LLK S  FKLC   + L+GN + +S G +IREY+VGG+SYPLLPWLITP++S + S    
Subjt:  SDDWCDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKF

Query:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL
         FN  H   +S+A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +HL
Subjt:  NFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHL

AT5G12010.1 unknown protein1.1e-3728.01Show/hide
Query:  WHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVE
        W + CS    P      E FK  FR SK TF+ IC  +    +++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++     +
Subjt:  WHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVE

Query:  ALEQ-RAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL
        A++      +LQWP    L  I+ +FE+  G+PN  G++  THI +  P +  +  +       +   +YS+ +Q +V+ +  F D+  GWPG+M   ++
Subjt:  ALEQ-RAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMFLQGIVDHQMRFLDIVTGWPGAMTTSRL

Query:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS
        L+ S +++  N            +GGL    ++ GG  +PLL W++ PY   NL+  +  FN      + +A  AF +LKG W  L K       + LP+
Subjt:  LKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKRKLPS

Query:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNTTRENLAKH
        ++  CC+L NI     ++++P++ +    D    E+  + ++ +   +T   NL  H
Subjt:  IILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPL--GNTTRENLAKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCTACAAAGAAATCGAAGAAGCGCAAGAAGGATTCCAAGAAACTGAAGAAACGTAAAAACTTGACCGTTGTTCCTATGGAACCCAGAACTTCAGAGCCTGATTG
GTGGGGAATTTTCTGGCACAAAAATTGTTCCGTCTCAGGTTCTCCTGGACCTAATGATGAAGCAGAAGGATTCAAGTATTTCTTTCGAACTTCAAAGAAAACTTTCGACT
ACATTTGTTCCCTCGTACGAGAAGATCTCATTTCGAGGCCGCCATCTGGGCTTATCAATATTGAAGGGAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCCATGCGA
AGATTGGCATCAGGTGAATCACAAGTTTCTGTGGGAGCTGCTTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGC
GAAGCACCATCTTCAGTGGCCGAGTTCCTCTAGATTGGAGGAAATCAAGTCACAATTTGAAGCTTGCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACACTTCCAGCCGTACAAACATCGGATGATTGGTGTGATACCAACAATAATTACAGTATGTTCTTGCAAGGAATTGTTGATCATCAGATGAGATTTCTTGAC
ATTGTAACTGGGTGGCCTGGGGCCATGACGACTAGTAGGTTATTGAAGTGCTCCAAAATTTTCAAGCTATGCAATGCGGATGAACGTTTGAATGGGAATGTAAGGAAGGT
TTCTGGAGGGTTAGAAATCAGAGAATACTTGGTTGGTGGAGTTAGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTACGAAAGTGGCAACCTATCTCCGTTGAAGTTCA
ACTTCAATGCTGTGCACGGAGCTGCGAAATCGCTTGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGG
AAGCTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGATA
TCAGGAACATTGTTGTAAACAGTTGGATCCATTGGGGAACACTACAAGGGAAAATTTAGCCAAGCACTTGCATCAAAATAAAGAGAGACTTTGTTCTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCTACAAAGAAATCGAAGAAGCGCAAGAAGGATTCCAAGAAACTGAAGAAACGTAAAAACTTGACCGTTGTTCCTATGGAACCCAGAACTTCAGAGCCTGATTG
GTGGGGAATTTTCTGGCACAAAAATTGTTCCGTCTCAGGTTCTCCTGGACCTAATGATGAAGCAGAAGGATTCAAGTATTTCTTTCGAACTTCAAAGAAAACTTTCGACT
ACATTTGTTCCCTCGTACGAGAAGATCTCATTTCGAGGCCGCCATCTGGGCTTATCAATATTGAAGGGAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCCATGCGA
AGATTGGCATCAGGTGAATCACAAGTTTCTGTGGGAGCTGCTTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTTACTTGGAGATTTGTCGAAGCTTTGGAGCAACGTGC
GAAGCACCATCTTCAGTGGCCGAGTTCCTCTAGATTGGAGGAAATCAAGTCACAATTTGAAGCTTGCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACACTTCCAGCCGTACAAACATCGGATGATTGGTGTGATACCAACAATAATTACAGTATGTTCTTGCAAGGAATTGTTGATCATCAGATGAGATTTCTTGAC
ATTGTAACTGGGTGGCCTGGGGCCATGACGACTAGTAGGTTATTGAAGTGCTCCAAAATTTTCAAGCTATGCAATGCGGATGAACGTTTGAATGGGAATGTAAGGAAGGT
TTCTGGAGGGTTAGAAATCAGAGAATACTTGGTTGGTGGAGTTAGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTACGAAAGTGGCAACCTATCTCCGTTGAAGTTCA
ACTTCAATGCTGTGCACGGAGCTGCGAAATCGCTTGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTGATGTGGAGACCCGATAAGCGG
AAGCTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGATA
TCAGGAACATTGTTGTAAACAGTTGGATCCATTGGGGAACACTACAAGGGAAAATTTAGCCAAGCACTTGCATCAAAATAAAGAGAGACTTTGTTCTTCATAA
Protein sequenceShow/hide protein sequence
MVATKKSKKRKKDSKKLKKRKNLTVVPMEPRTSEPDWWGIFWHKNCSVSGSPGPNDEAEGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEACFGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFLD
IVTGWPGAMTTSRLLKCSKIFKLCNADERLNGNVRKVSGGLEIREYLVGGVSYPLLPWLITPYESGNLSPLKFNFNAVHGAAKSLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQLDPLGNTTRENLAKHLHQNKERLCSS