; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020193 (gene) of Chayote v1 genome

Gene IDSed0020193
OrganismSechium edule (Chayote v1)
Descriptionprotein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome locationLG09:34977249..34980293
RNA-Seq ExpressionSed0020193
SyntenySed0020193
Gene Ontology termsGO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025509.1 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-21090.48Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR ++ DWWEIFWHKNCS SG SG N+E E FK+FFRTSK+TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
         KNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN  KL GGSEIREYLVGGVGYPLLPWLITPYES+DLSP + NFN VHG
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I S
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRS

XP_022960391.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita moschata]1.2e-21190.75Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR ++ DWWEIFWHKNCS+SG SG N+E E FK+FFRTSK+TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
         KNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DLSP + NFN VHG
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

XP_023004330.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X1 [Cucurbita maxima]2.9e-21090.32Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR S+ DWWEIFWHKNCS+S   G SG N+E E FK+FFRTSKKTFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW

Query:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND
        CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DL P + NFN 
Subjt:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND

Query:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI
        VHGAAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I
Subjt:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI

Query:  RSS
         SS
Subjt:  RSS

XP_023004332.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita maxima]7.0e-21291Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR S+ DWWEIFWHKNCS+SG SG N+E E FK+FFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
        NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DL P + NFN VHG
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

XP_023513674.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita pepo subsp. pepo]2.6e-21191Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR S+ DWWEIFWHKNCS+SG SG N+E E FK+FFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
         KNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DLSP + NFN V G
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

TrEMBL top hitse value%identityAlignment
A0A6J1DWX1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 13.9e-20889.25Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK KTL+VVP  PR S+ DWWEIFWHKNCS+S   G N+E EGFKFFFRTSK TFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S +LEEIKSQFEASFGLPNCCGAID+THIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
        N NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLCDVGERLNGNVRKL G SEIREYLVGG  YPLLPWLITPYE++DLSPS+ +FN VH 
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AA+LLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLA H+HQNKERIRSS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

A0A6J1H7G5 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X25.7e-21290.75Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR ++ DWWEIFWHKNCS+SG SG N+E E FK+FFRTSK+TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
         KNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DLSP + NFN VHG
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

A0A6J1H8X9 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X12.4e-21090.07Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR ++ DWWEIFWHKNCS+S   G SG N+E E FK+FFRTSK+TFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW

Query:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND
        CDT KNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DLSP + NFN 
Subjt:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND

Query:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI
        VHGAAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I
Subjt:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI

Query:  RSS
         SS
Subjt:  RSS

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X23.4e-21291Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR S+ DWWEIFWHKNCS+SG SG N+E E FK+FFRTSKKTFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDT

Query:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG
        NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DL P + NFN VHG
Subjt:  NKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHG

Query:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS
        AAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I SS
Subjt:  AAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS

A0A6J1KRU3 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X11.4e-21090.32Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDSKKLKK K L+VVP EPR S+ DWWEIFWHKNCS+S   G SG N+E E FK+FFRTSKKTFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSS---GFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEA+EQRAKHHLRWP+S RLEEIKSQFEASFGLPNCCGAID+THIIMTLPAVQTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDW

Query:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND
        CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLC+VGERLNGN RKL GGSEIREYLVGGVGYPLLPWLITPYES+DL P + NFN 
Subjt:  CDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFND

Query:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI
        VHGAAK LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQE+CCKQ D LGNTSRENLANH HQNKE+I
Subjt:  VHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERI

Query:  RSS
         SS
Subjt:  RSS

SwissProt top hitse value%identityAlignment
Q6AZB8 Putative nuclease HARBI16.2e-2228.22Show/hide
Query:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASF---
        Y+  L+++ L+ R          R +S + Q+  A+    SG  Q  +G A G+ Q+++S+      +A+ ++A   + +    R E  K QF+  F   
Subjt:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASF---

Query:  -GLPNCCGAIDSTHIIMTLPAVQTSDDWCDTNKN--YSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIRE--
         G+PN  G +D  HI +  P    +DD    NK   +S+  Q + D +   L   T WPG +T   + K S              NV KLF   E  +  
Subjt:  -GLPNCCGAIDSTHIIMTLPAVQTSDDWCDTNKN--YSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIRE--

Query:  YLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG
        +L+G   YPL  WL+TP +S + SP+++ +N  H     +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  YLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNIIIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 17.6e-15367.76Show/hide
Query:  MAPTKKSKKRKR----DSKKL---KKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK K+     +KKL   K+ K +  VP +P   D DWW+ FW +N S S  S   +E+  FK FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKR----DSKKL---KKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EA+E+RAKHHLRWP+S R+EEIKS+FE  +GLPNCCGAID+THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQT

Query:  SDDWCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEF
        SDDWCD  KNYSMFLQG+ DH+MRFL++VTGWPGGMT S+LLK S FFKLC+  + L+GN + L  G++IREY+VGG+ YPLLPWLITP++S+  S S  
Subjt:  SDDWCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEF

Query:  NFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHL
         FN+ H   + +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ++ LG+  R  L  HL
Subjt:  NFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHL

Q9M2U3 Protein ALP1-like3.5e-9746.57Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPS------------DS-----DWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDL
        M P K  KK+KR  KK+ +   L    +    S            DS     DWW+ F      S    G + + + F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPS------------DS-----DWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE+ME+RA HHL WP+  +L+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTH

Query:  IIMTLPAVQTSDD-WCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D  KN+SM LQ +VD  MRFLD++ GWPG +    +LK S F+KL + G+RLNG    L   +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITP

Query:  YESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTS
        Y+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK +D   +  
Subjt:  YESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTS

Query:  RENLANHL
        R+ L++ L
Subjt:  RENLANHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)2.9e-2227.25Show/hide
Query:  SKKLKKPKTLTVVPSEPRPSDSD--WWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRR
        S+ L+    ++ +P  P PS S        W     +S  +  + ++  +  +FR SK TF  + S++     S  PS                A  + R
Subjt:  SKKLKKPKTLTVVPSEPRPSDSD--WWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRR

Query:  LASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDTNKNYSMFLQGI
        LA G S   +   FG    + SQ +  F    +      +    S +L++ K  F  +  LPNC G +      +    +             S+ +Q +
Subjt:  LASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDTNKNYSMFLQGI

Query:  VDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPS---EFNFNDVHGAAKLLAVR
        VD   RF+DI  GWP  M    + + +K F + +  E L+G   KL  G  +  Y++G    PLLPWL+TPY+      S   EFN N VH     + + 
Subjt:  VDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPS---EFNFNDVHGAAKLLAVR

Query:  AFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE
        AF++++  WRIL+K  W+P+  + +P +I   CLL N ++++GD+
Subjt:  AFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNIIIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases2.5e-9846.57Show/hide
Query:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPS------------DS-----DWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDL
        M P K  KK+KR  KK+ +   L    +    S            DS     DWW+ F      S    G + + + F+  F+ S+KTFDYICSLV+ D 
Subjt:  MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPS------------DS-----DWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE+ME+RA HHL WP+  +L+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTH

Query:  IIMTLPAVQTSDD-WCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITP
        I+M LPAV+ S+  W D  KN+SM LQ +VD  MRFLD++ GWPG +    +LK S F+KL + G+RLNG    L   +E+REY+VG  G+PLLPWL+TP
Subjt:  IIMTLPAVQTSDD-WCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITP

Query:  YESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTS
        Y+    S  +  FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NIIID  D+   D  LS  HD+ Y++  CK +D   +  
Subjt:  YESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTS

Query:  RENLANHL
        R+ L++ L
Subjt:  RENLANHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.4e-15467.76Show/hide
Query:  MAPTKKSKKRKR----DSKKL---KKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK K+     +KKL   K+ K +  VP +P   D DWW+ FW +N S S  S   +E+  FK FFR SK TF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKR----DSKKL---KKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EA+E+RAKHHLRWP+S R+EEIKS+FE  +GLPNCCGAID+THIIMTLPAVQ 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQT

Query:  SDDWCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEF
        SDDWCD  KNYSMFLQG+ DH+MRFL++VTGWPGGMT S+LLK S FFKLC+  + L+GN + L  G++IREY+VGG+ YPLLPWLITP++S+  S S  
Subjt:  SDDWCDTNKNYSMFLQGIVDHQMRFLDIVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEF

Query:  NFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHL
         FN+ H   + +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NIIID GD LQ DV LSGHHD GY +  CKQ++ LG+  R  L  HL
Subjt:  NFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHL

AT4G29780.1 unknown protein1.9e-2926.23Show/hide
Query:  SDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQ
        +DWW+            S  +  E+ F+  FR SK TF+ IC  + +  +++  + L +     +   K+V + + RLA+G     V   FG+G ST  +
Subjt:  SDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQ

Query:  VTWRFVEAM-EQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSD--DWCDTNKN----YSMFLQGIVDHQMRFLDIVTGWPG
        +      A+ +     +L WP+   +   K++FE+   +PN  G+I +THI +  P V  +   +   T +N    YS+ +QG+V+    F D+  G PG
Subjt:  VTWRFVEAM-EQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSD--DWCDTNKN----YSMFLQGIVDHQMRFLDIVTGWPG

Query:  GMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRP
         +T  ++L+ S   +            ++   G     ++VG  G+PL  +L+ PY   +L+ ++  FN+  G  + +A  AF +LKG W  L K     
Subjt:  GMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRP

Query:  DKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQN
          + LP ++  CC+L NI     +E+ P++      D+   EN      ++ + S  N  +H+  N
Subjt:  DKRKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQN

AT5G12010.1 unknown protein3.2e-3728.45Show/hide
Query:  WWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT
        WWE      CS   +      EE FK  FR SK TF+ IC  +    +++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++ 
Subjt:  WWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT

Query:  WRFVEAMEQ-RAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSD--DWCDTNKN----YSMFLQGIVDHQMRFLDIVTGWPGGM
            +A++      +L+WP+   L  I+ +FE+  G+PN  G++ +THI +  P +  +   +   T +N    YS+ +Q +V+ +  F D+  GWPG M
Subjt:  WRFVEAMEQ-RAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSD--DWCDTNKN----YSMFLQGIVDHQMRFLDIVTGWPGGM

Query:  TTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDK
           ++L+ S  ++  + G  L G             ++ GG G+PLL W++ PY   +L+ ++  FN+     + +A  AF +LKG W  L K       
Subjt:  TTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDK

Query:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSL--GNTSRENLANH
        + LP+++  CC+L NI     ++++P++ +    D    EN  +  +++   +T   NL +H
Subjt:  RKLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSL--GNTSRENLANH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCCACAAAGAAATCGAAGAAGCGCAAAAGGGATTCCAAGAAACTCAAGAAACCCAAAACCCTAACTGTTGTTCCTTCCGAGCCCAGACCCTCCGACTCCGATTG
GTGGGAAATTTTCTGGCACAAGAACTGTTCATCTTCAGGTTTTTCTGGACATAATAATGAGGAAGAAGGATTCAAGTTCTTCTTTCGAACTTCGAAGAAAACTTTCGACT
ACATTTGTTCCCTTGTAAGAGAAGATCTCATATCGAGGCCGCCCTCCGGGCTTATCAATATTGAGGGCAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCTATGAGA
AGATTAGCATCTGGTGAGTCTCAAGTTTCTGTGGGAGCTGCCTTTGGAGTCGGCCAGTCCACAGTCTCTCAAGTTACTTGGAGATTCGTCGAAGCAATGGAGCAACGTGC
GAAGCATCATCTTCGATGGCCGAATTCCCCTAGATTGGAGGAAATAAAGTCACAGTTTGAAGCTTCCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATTCAACACACA
TCATTATGACTCTTCCAGCCGTACAAACATCAGATGATTGGTGTGATACCAACAAAAATTACAGCATGTTCTTGCAGGGAATCGTCGATCACCAGATGAGATTTCTTGAC
ATTGTAACTGGTTGGCCCGGGGGCATGACGACTAGTAGGTTATTGAAGTGCTCAAAGTTTTTCAAGCTATGCGATGTCGGCGAGCGGTTGAATGGAAATGTAAGGAAGTT
GTTTGGAGGGTCTGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCCTTGGCTGATTACTCCTTATGAAAGCAATGATCTATCGCCATCCGAGTTCA
ACTTCAATGACGTGCACGGTGCTGCAAAATTGCTTGCTGTGAGAGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGATAAGCGA
AAATTGCCAAGCATTATATTGGTATGTTGTTTGCTTCAAAACATCATTATTGACAATGGCGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGATTTGGGATA
TCAGGAGAATTGTTGTAAGCAGTCTGATTCATTGGGGAACACTTCAAGGGAAAACTTGGCCAATCACTTGCATCAAAACAAAGAGAGAATTCGTTCTTCGTAA
mRNA sequenceShow/hide mRNA sequence
CTGGAAAGTTAGTGAAAAAAGGTCCAAGCTTCAATGATTCTTTGGTTTTCTCTTCTTCTTCTCTGAAACCAACATCGGGCAGTGGGCGATTGATTGTTGGCTGAATCGCA
ATGGCGCCCACAAAGAAATCGAAGAAGCGCAAAAGGGATTCCAAGAAACTCAAGAAACCCAAAACCCTAACTGTTGTTCCTTCCGAGCCCAGACCCTCCGACTCCGATTG
GTGGGAAATTTTCTGGCACAAGAACTGTTCATCTTCAGGTTTTTCTGGACATAATAATGAGGAAGAAGGATTCAAGTTCTTCTTTCGAACTTCGAAGAAAACTTTCGACT
ACATTTGTTCCCTTGTAAGAGAAGATCTCATATCGAGGCCGCCCTCCGGGCTTATCAATATTGAGGGCAGACTTCTTAGTGTGGAGAAGCAGGTTGCAATTGCTATGAGA
AGATTAGCATCTGGTGAGTCTCAAGTTTCTGTGGGAGCTGCCTTTGGAGTCGGCCAGTCCACAGTCTCTCAAGTTACTTGGAGATTCGTCGAAGCAATGGAGCAACGTGC
GAAGCATCATCTTCGATGGCCGAATTCCCCTAGATTGGAGGAAATAAAGTCACAGTTTGAAGCTTCCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATTCAACACACA
TCATTATGACTCTTCCAGCCGTACAAACATCAGATGATTGGTGTGATACCAACAAAAATTACAGCATGTTCTTGCAGGGAATCGTCGATCACCAGATGAGATTTCTTGAC
ATTGTAACTGGTTGGCCCGGGGGCATGACGACTAGTAGGTTATTGAAGTGCTCAAAGTTTTTCAAGCTATGCGATGTCGGCGAGCGGTTGAATGGAAATGTAAGGAAGTT
GTTTGGAGGGTCTGAAATCAGAGAATACTTGGTTGGTGGAGTTGGTTATCCTCTTCTTCCTTGGCTGATTACTCCTTATGAAAGCAATGATCTATCGCCATCCGAGTTCA
ACTTCAATGACGTGCACGGTGCTGCAAAATTGCTTGCTGTGAGAGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGATAAGCGA
AAATTGCCAAGCATTATATTGGTATGTTGTTTGCTTCAAAACATCATTATTGACAATGGCGATGAGTTACAACCAGATGTTGCTTTATCTGGTCATCATGATTTGGGATA
TCAGGAGAATTGTTGTAAGCAGTCTGATTCATTGGGGAACACTTCAAGGGAAAACTTGGCCAATCACTTGCATCAAAACAAAGAGAGAATTCGTTCTTCGTAAGGCTTCA
AAATTGCATCGGACTCTCGAAACTTGCCACCGATCTGGTAAGACTTAGTTTTGTATCTACACTGATGAATTAATGTATCTTTTGTTTAACATTTCCCCATTATGAAATGG
AGCATTTTAACTACATATCATTTTTAGTTTTGTAAGTAAATGTTCTTCATTAAAAAGTCCTATATGTTCCATGTTTGTAAGAAGCAAGATGAAGTTTCGATGTGGGCAAC
GTGGCATGGTAGGACGAACAGAATATCTTGTATTTGAGCACATTTCTTGAATACGATTTGTTCTGTAAGTTACTTGTGTTCTCGAGTTCTGAGCATCCATTAATGGGCAG
CACTTTCTGCTTGTGGATGTAAGTTAGAACTATCTCTCTCTTTCATTTGTTTCCGAGGGGTGATCGTGATGCAGTAGTCGAAGGCTTGATCTTTGAAGGTGTCTCTCCCC
TCCAGGTCTTAGATTCGAGACTCAGACTCAACTTTAACATTAATTTATAGACACCTTTCGTGCCTCCAATTCTCTGATGTATTGTGTCCTAGAGACGGACCAGTGGTACC
CTCGAGTTTAGAGGAGTGTAAGCTCCCACTCAAAGTTCTCGAGTGAACAAAAAATTCTCTCTCTCATTTTGATTACATAAATGGAGGAAATGAGTAGAGTGAGAAGTTTT
Protein sequenceShow/hide protein sequence
MAPTKKSKKRKRDSKKLKKPKTLTVVPSEPRPSDSDWWEIFWHKNCSSSGFSGHNNEEEGFKFFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEAMEQRAKHHLRWPNSPRLEEIKSQFEASFGLPNCCGAIDSTHIIMTLPAVQTSDDWCDTNKNYSMFLQGIVDHQMRFLD
IVTGWPGGMTTSRLLKCSKFFKLCDVGERLNGNVRKLFGGSEIREYLVGGVGYPLLPWLITPYESNDLSPSEFNFNDVHGAAKLLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNIIIDNGDELQPDVALSGHHDLGYQENCCKQSDSLGNTSRENLANHLHQNKERIRSS