CuGenDBv2

Sgr012473 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr012473
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome location: tig00153403:60652..65089
RNA-Seq Expression: Sgr012473
Synteny: Sgr012473
Gene Ontology terms:
GO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
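The genome location above is written in a contig:start..end form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the string can be split into its parts and the gene span computed. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location: str):
    """Split 'contig:start..end' into (contig, start, end, span_bp).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. Adjust if the source uses another convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("tig00153403:60652..65089")
```

Under the inclusive-coordinate assumption, `span` is the genomic length of the gene model on the contig, which can be longer than the CDS if introns are present.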


Homology
GenBank top hits (e value, % identity, alignment)
KAG7025509.1 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 6.4e-215 | % identity: 89.16
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRA+EPDWWEIFWHKNCS SGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
          NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN  KLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHINCSFFNSNCN
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H     ++ N  
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHINCSFFNSNCN

Query:  RPIFSDPLRACSEIR
          I+S    +CS+ R
Subjt:  RPIFSDPLRACSEIR

XP_022960391.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita moschata] | e value: 4.4e-216 | % identity: 94.09
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRA+EPDWWEIFWHKNCSTSGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
          NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

XP_023004332.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita maxima] | e value: 9.8e-216 | % identity: 93.83
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPR SEPDWWEIFWHKNCSTSGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
        N NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDL PLK NFN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

XP_023513671.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.2e-215 | % identity: 93.88
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS   GS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGR
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR

Query:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW
        LLSVEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA
        CDT  NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA

Query:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        VQ A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

XP_023513674.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 5.2e-217 | % identity: 94.6
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
          NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN VQ 
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
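The alignments above follow the usual BLAST pairwise layout: the middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch or gap. As a rough sketch under that assumption, the identity of a single alignment block can be estimated from the query line and the middle line. The helper name `block_identity` is hypothetical, and the reported % identity values are computed over the whole alignment, so a per-block figure will generally differ from them.

```python
def block_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    query   -- the aligned query residues (gap characters included)
    midline -- the BLAST-style middle line for the same block

    Assumption: letters in the midline mark identical positions,
    '+' marks conservative substitutions, blanks mark mismatches/gaps.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment block")
    # Trailing blanks are often stripped from the midline, so pad it
    # back to the width of the query line before counting.
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / length

# Last block of the first GenBank hit shown above.
pct = block_identity("RPIFSDPLRACSEIR", "  I+S    +CS+ R")
```

Summing identical positions and block lengths across all blocks of a hit, then dividing once, gives a whole-alignment figure that is more comparable to the reported column.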

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DWX1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | e value: 1.3e-213 | % identity: 92.07
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDS KLKK K LSVVPM PRASEPDWWEIFWHKNCSTS SPGPNDEAEGFK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSS+LEEIKSQFEAS+GLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
        NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLCDVGERLNGNVRKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP K +FNAV +
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHIN
        A RLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H++
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHIN

A0A6J1H7G5 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 | e value: 2.1e-216 | % identity: 94.09
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRA+EPDWWEIFWHKNCSTSGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
          NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

A0A6J1H8X9 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X1 | e value: 9.0e-215 | % identity: 93.37
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPRA+EPDWWEIFWHKNCSTS   GS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGR
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR

Query:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW
        LLSVEKQVAIA+RRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA
        CDT  NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLK NFN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA

Query:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        V  A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 | e value: 4.8e-216 | % identity: 93.83
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPR SEPDWWEIFWHKNCSTSGS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGRLLS
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLS

Query:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT
        VEKQVAIA+RRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE
        N NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDL PLK NFN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQE

Query:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  AGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

A0A6J1KRU3 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X1 | e value: 2.0e-214 | % identity: 93.11
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR
        MAP KKSKKRKRDSKKLKKCKNLSVVPMEPR SEPDWWEIFWHKNCSTS   GS GPNDEAE FKYFFRTSK TFDYICSLVREDLVSRPPSGLINIEGR
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTS---GSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGR

Query:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW
        LLSVEKQVAIA+RRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHL+WPSSSRLEEIKSQFEAS+GLPNCCGAIDATHIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA
        CDTN NYSM LQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKLSGGSEIREYLVGGVGYPLLPWLITPYESDDL PLK NFN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNA

Query:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH
        V  A + LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQ DPLGNTSRENL  H
Subjt:  VQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKH

SwissProt top hits (e value, % identity, alignment)
Q6AZB8 Putative nuclease HARBI1 | e value: 1.5e-20 | % identity: 25.71
Query:  YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEASYGL
        Y+  L+++ L+ R          R +S + Q+  AL    SG  Q  +G A G+ Q+++S+      +AL ++A   + +    +  ++ K +F    G+
Subjt:  YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWP-SSSRLEEIKSQFEASYGL

Query:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVG
        PN  G +D  HI +  P    S  + +    +S+  Q + D +   L   T WPG +T   + K S   KL +  E            ++   +L+G   
Subjt:  PNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVG

Query:  YPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG
        YPL  WL+TP +S + SP  + +N        +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  YPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | e value: 1.2e-155 | % identity: 68.26
Query:  MAPAKKSKKRKR----DSKKL---KKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLIN
        MAP K+ KK K+     +KKL   K+ K ++ VP++P A + DWW+ FW +N S S    P+DE   FK+FFR SK TF YICSLVREDL+SRPPSGLIN
Subjt:  MAPAKKSKKRKR----DSKKL---KKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLIN

Query:  IEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIALRRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  YGLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKF
        SDDWCD   NYSM LQG+ DH+MRFL++VTGWPGGMT S+LLK S FFKLC+  + L+GN + LS G++IREY+VGG+ YPLLPWLITP++SD  S    
Subjt:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKF

Query:  NFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHI
         FN   E  R +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +H+
Subjt:  NFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHI

Q9M2U3 Protein ALP1-like | e value: 9.0e-95 | % identity: 46.04
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDL
        M P K  KK+KR  KK+ +   L+       AS                   DWW+ F  +    S  P      + F+  F+ S+ TFDYICSLV+ D 
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDL

Query:  VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+ALRRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID TH
Subjt:  VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG +    +LK S F+KL + G+RLNG    LS  +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITP

Query:  YESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTS
        Y+    S  +  FN         A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTS

Query:  RENL
        R+ L
Subjt:  RENL

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) | e value: 3.8e-24 | % identity: 28.12
Query:  WHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVE
        W     TS +   +D    +  +FR SK TF  + S++     S  PS                A  + RLA G S   +   FG    + SQ +  F  
Subjt:  WHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVE

Query:  ALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFF
          +      +    S +L++ K  F  +  LPNC G +      +    +             S+L+Q +VD   RF+DI  GWP  M    + + +K F
Subjt:  ALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFF

Query:  KLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYE-SDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVC
         + +  E L+G   KL  G  +  Y++G    PLLPWL+TPY+ + D    +  FN V   G      AF++++  WRIL+K  W+P+  + +P +I   
Subjt:  KLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYE-SDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVC

Query:  CLLQNIIIDNGDE
        CLL N ++++GD+
Subjt:  CLLQNIIIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases | e value: 6.4e-96 | % identity: 46.04
Query:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDL
        M P K  KK+KR  KK+ +   L+       AS                   DWW+ F  +    S  P      + F+  F+ S+ TFDYICSLV+ D 
Subjt:  MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEP-----------------DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDL

Query:  VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+ALRRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  S+L+EIKS+FE   GLPNCCGAID TH
Subjt:  VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATH

Query:  IIMTLPAVQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D   N+SM LQ +VD  MRFLD++ GWPG +    +LK S F+KL + G+RLNG    LS  +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITP

Query:  YESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTS
        Y+    S  +  FN         A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTS

Query:  RENL
        R+ L
Subjt:  RENL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 8.4e-157 | % identity: 68.26
Query:  MAPAKKSKKRKR----DSKKL---KKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLIN
        MAP K+ KK K+     +KKL   K+ K ++ VP++P A + DWW+ FW +N S S    P+DE   FK+FFR SK TF YICSLVREDL+SRPPSGLIN
Subjt:  MAPAKKSKKRKR----DSKKL---KKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLIN

Query:  IEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQT
        IEGRLLSVEKQVAIALRRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S R+EEIKS+FE  YGLPNCCGAID THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQT

Query:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKF
        SDDWCD   NYSM LQG+ DH+MRFL++VTGWPGGMT S+LLK S FFKLC+  + L+GN + LS G++IREY+VGG+ YPLLPWLITP++SD  S    
Subjt:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKF

Query:  NFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHI
         FN   E  R +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +H+
Subjt:  NFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHI

AT4G29780.1 unknown protein | e value: 3.9e-29 | % identity: 27.19
Query:  DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQV
        DWW+         S    P DE   F+  FR SK TF+ IC  + +  V++  + L +     +   K+V + + RLA+G     V   FG+G ST  ++
Subjt:  DWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQV

Query:  TWRFVEAL-EQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGG
              A+ +     +L WPS S +   K++FE+ + +PN  G+I  THI +  P V  +  +       +   +YS+ +QG+V+    F D+  G PG 
Subjt:  TWRFVEAL-EQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGG

Query:  MTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPD
        +T  ++L+ S   +            ++ + G     ++VG  G+PL  +L+ PY   +L+  +  FN      + +A  AF +LKG W  L K      
Subjt:  MTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPD

Query:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEH
         + LP ++  CC+L NI     +E+ P++      D+   E+
Subjt:  KRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEH

AT5G12010.1 unknown protein | e value: 5.1e-37 | % identity: 28.18
Query:  WWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVT
        WWE      CS    P      E FK  FR SK TF+ IC  +    V++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++ 
Subjt:  WWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVT

Query:  WRFVEALEQ-RAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGM
            +A++      +LQWP    L  I+ +FE+  G+PN  G++  THI +  P +  +  +       +   +YS+ +Q +V+ +  F D+  GWPG M
Subjt:  WRFVEALEQ-RAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGM

Query:  TTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDK
           ++L+ S  ++  + G  L G             ++ GG G+PLL W++ PY   +L+  +  FN      + +A  AF +LKG W  L K       
Subjt:  TTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDK

Query:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPL--GNTSRENLVKH
        + LP+++  CC+L NI     ++++P++ +    D    E+  +  + +   +T   NL+ H
Subjt:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPL--GNTSRENLVKH


Sequences
CDS sequence
ATGGCGCCTGCAAAGAAATCGAAGAAGCGCAAAAGGGATTCGAAGAAACTAAAGAAATGTAAAAACTTGAGTGTTGTTCCTATGGAACCCAGAGCCTCGGAGCCTGATTG
GTGGGAAATTTTCTGGCACAAGAATTGTTCGACCTCAGGTTCTCCTGGACCTAATGATGAAGCAGAAGGATTCAAGTACTTCTTTCGAACTTCGAAGATAACTTTTGATT
ACATTTGTTCCCTTGTAAGAGAAGATCTTGTGTCGAGGCCACCGTCTGGGCTTATCAATATCGAAGGGAGACTTCTTAGCGTTGAGAAGCAGGTTGCAATTGCTTTGAGA
AGATTGGCATCTGGTGAGTCACAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAATCCACAGTCTCTCAGGTTACTTGGAGATTCGTCGAAGCTTTGGAGCAACGTGC
AAAGCACCATCTTCAGTGGCCAAGTTCTTCTAGATTGGAGGAAATCAAGTCCCAATTTGAAGCTTCCTATGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACGCACA
TCATTATGACCCTTCCAGCAGTACAAACATCCGATGATTGGTGCGATACCAACAATAATTACAGTATGTTGTTGCAGGGAATCGTTGATCACCAGATGAGATTTCTTGAT
ATTGTAACAGGTTGGCCTGGGGGCATGACGACTAGTAGATTGTTGAAGTGCTCAAAATTCTTCAAATTATGTGATGTCGGAGAGCGTTTGAATGGAAATGTAAGGAAGTT
GTCTGGAGGGTCAGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCCATGGTTGATTACTCCTTATGAAAGTGATGACCTATCACCGTTGAAGTTCA
ACTTCAATGCCGTACAAGAAGCTGGAAGGTTGCTTGCTGTGAGGGCATTCTCCCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGACAAGCGG
AAACTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGGTA
TCAGGAGCATTGTTGTAAGCAGTTTGATCCATTGGGGAACACTTCAAGGGAAAACTTAGTTAAGCACATAAACTGTAGCTTCTTCAACTCCAACTGCAACCGCCCGATCT
TCTCGGATCCCCTTCGTGCTTGTTCTGAAATTCGCCTACTTTGGACACTGTCATTTTCCTCTGGCCCCAGTGTGGATGCTCCATCAGCAGCCAGAAAACCATCTTGGACA
TTTTTTGTCAGTTTATGGTTCATTTCAAACAATTTGGTGATGGCTTCTTCGGCTTCTTCTACTTGCCCCTTCACTTTATCTGAAGGTTTGCCAGTTTCTGGGCATCAGAA
TCAAGTCTTTCTAGAATCCTTCTCTTGTTTCCTTCTTGGGGGAGCTCAGAAAGTCTCCTGGAGATCTCTAGTTTATCCACACTCAACTCCTTCTCGAACAACGATTCGTT
TGAAGGGTGCTTGCTATTTCGTCTCCTGGTTGACTCAACCCGGTGATATCCGATCAAGTAGAATGTCTTTCGTCAGAACTTCATTTCCTGCTTCAGAAATTTCAGACTTT
GAATGCGACCTATTAAAACGCTCCTGTCCTCGAAGTGCTTTGTCCTTGGTCGCACGACCACTGTCTCGTGACCAGCCATTTCCAGATTTTACCTCTTCGATCTCCTTCAT
TACTTTCTCTAGCTTGGATGCTGGCAACTTGCAAAATCAGAATCCTGCAAATGTAAAGGTTAAAGGCCATGAGATTATGGAATTAAAACTCAGACAGCCTCCAATGCTAC
AATACAACTCTTTGATATTTGGGTACTGGCACCTCAGTCTATCCTATTTGATTGAAGAGAAATGGATGGGACAATAG
mRNA sequence
ATGGCGCCTGCAAAGAAATCGAAGAAGCGCAAAAGGGATTCGAAGAAACTAAAGAAATGTAAAAACTTGAGTGTTGTTCCTATGGAACCCAGAGCCTCGGAGCCTGATTG
GTGGGAAATTTTCTGGCACAAGAATTGTTCGACCTCAGGTTCTCCTGGACCTAATGATGAAGCAGAAGGATTCAAGTACTTCTTTCGAACTTCGAAGATAACTTTTGATT
ACATTTGTTCCCTTGTAAGAGAAGATCTTGTGTCGAGGCCACCGTCTGGGCTTATCAATATCGAAGGGAGACTTCTTAGCGTTGAGAAGCAGGTTGCAATTGCTTTGAGA
AGATTGGCATCTGGTGAGTCACAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAATCCACAGTCTCTCAGGTTACTTGGAGATTCGTCGAAGCTTTGGAGCAACGTGC
AAAGCACCATCTTCAGTGGCCAAGTTCTTCTAGATTGGAGGAAATCAAGTCCCAATTTGAAGCTTCCTATGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACGCACA
TCATTATGACCCTTCCAGCAGTACAAACATCCGATGATTGGTGCGATACCAACAATAATTACAGTATGTTGTTGCAGGGAATCGTTGATCACCAGATGAGATTTCTTGAT
ATTGTAACAGGTTGGCCTGGGGGCATGACGACTAGTAGATTGTTGAAGTGCTCAAAATTCTTCAAATTATGTGATGTCGGAGAGCGTTTGAATGGAAATGTAAGGAAGTT
GTCTGGAGGGTCAGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCCATGGTTGATTACTCCTTATGAAAGTGATGACCTATCACCGTTGAAGTTCA
ACTTCAATGCCGTACAAGAAGCTGGAAGGTTGCTTGCTGTGAGGGCATTCTCCCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGACAAGCGG
AAACTGCCAAGCATTATACTGGTATGCTGTTTACTTCAAAACATTATAATTGACAATGGAGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGGTA
TCAGGAGCATTGTTGTAAGCAGTTTGATCCATTGGGGAACACTTCAAGGGAAAACTTAGTTAAGCACATAAACTGTAGCTTCTTCAACTCCAACTGCAACCGCCCGATCT
TCTCGGATCCCCTTCGTGCTTGTTCTGAAATTCGCCTACTTTGGACACTGTCATTTTCCTCTGGCCCCAGTGTGGATGCTCCATCAGCAGCCAGAAAACCATCTTGGACA
TTTTTTGTCAGTTTATGGTTCATTTCAAACAATTTGGTGATGGCTTCTTCGGCTTCTTCTACTTGCCCCTTCACTTTATCTGAAGGTTTGCCAGTTTCTGGGCATCAGAA
TCAAGTCTTTCTAGAATCCTTCTCTTGTTTCCTTCTTGGGGGAGCTCAGAAAGTCTCCTGGAGATCTCTAGTTTATCCACACTCAACTCCTTCTCGAACAACGATTCGTT
TGAAGGGTGCTTGCTATTTCGTCTCCTGGTTGACTCAACCCGGTGATATCCGATCAAGTAGAATGTCTTTCGTCAGAACTTCATTTCCTGCTTCAGAAATTTCAGACTTT
GAATGCGACCTATTAAAACGCTCCTGTCCTCGAAGTGCTTTGTCCTTGGTCGCACGACCACTGTCTCGTGACCAGCCATTTCCAGATTTTACCTCTTCGATCTCCTTCAT
TACTTTCTCTAGCTTGGATGCTGGCAACTTGCAAAATCAGAATCCTGCAAATGTAAAGGTTAAAGGCCATGAGATTATGGAATTAAAACTCAGACAGCCTCCAATGCTAC
AATACAACTCTTTGATATTTGGGTACTGGCACCTCAGTCTATCCTATTTGATTGAAGAGAAATGGATGGGACAATAG
Protein sequence
MAPAKKSKKRKRDSKKLKKCKNLSVVPMEPRASEPDWWEIFWHKNCSTSGSPGPNDEAEGFKYFFRTSKITFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEASYGLPNCCGAIDATHIIMTLPAVQTSDDWCDTNNNYSMLLQGIVDHQMRFLD
IVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLSGGSEIREYLVGGVGYPLLPWLITPYESDDLSPLKFNFNAVQEAGRLLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQEHCCKQFDPLGNTSRENLVKHINCSFFNSNCNRPIFSDPLRACSEIRLLWTLSFSSGPSVDAPSAARKPSWT
FFVSLWFISNNLVMASSASSTCPFTLSEGLPVSGHQNQVFLESFSCFLLGGAQKVSWRSLVYPHSTPSRTTIRLKGACYFVSWLTQPGDIRSSRMSFVRTSFPASEISDF
ECDLLKRSCPRSALSLVARPLSRDQPFPDFTSSISFITFSSLDAGNLQNQNPANVKVKGHEIMELKLRQPPMLQYNSLIFGYWHLSLSYLIEEKWMGQ
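The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base, the relationship can be checked with a small translator. The names `CODON_TABLE` and `translate` are illustrative only and do not come from CuGenDB.

```python
# Standard genetic code (NCBI table 1), indexed in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines from the page can be pasted in directly.
    Codons containing non-ACGT characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First codons of the CDS listed above.
start_of_protein = translate("ATGGCGCCTGCAAAGAAATCG")
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick consistency check on the gene model.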