; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018360 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018360
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1
Genome locationscaffold342:217636..219220
RNA-Seq ExpressionMS018360
SyntenyMS018360
Gene Ontology termsGO:0060967 - negative regulation of gene silencing by RNA (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158723.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 [Momordica charantia]2.6e-23599.75Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
        NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLA HMHQNKERIRSS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

XP_022960389.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X1 [Cucurbita moschata]3.7e-21390.57Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDS KLKK K LSVVPM PRA+EPDWWEIFWHKNCSTS    S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA
        CDT  NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP KL+FN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA

Query:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI
        VH AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I
Subjt:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI

Query:  RSS
         SS
Subjt:  RSS

XP_022960391.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita moschata]8.8e-21591.25Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK K LSVVPM PRA+EPDWWEIFWHKNCSTS S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
          NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP KL+FN VH 
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I SS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

XP_023004332.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita maxima]2.6e-21491Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK K LSVVPM PR SEPDWWEIFWHKNCSTS S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
        N NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDL P KL+FN VH 
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I SS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

XP_023513674.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X2 [Cucurbita pepo subsp. pepo]4.4e-21491.25Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK K LSVVPM PRASEPDWWEIFWHKNCSTS S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
          NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP KL+FN V  
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I SS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

TrEMBL top hitse value%identityAlignment
A0A6J1DWX1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.3e-23599.75Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
        NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLA HMHQNKERIRSS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

A0A6J1H7G5 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X24.2e-21591.25Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK K LSVVPM PRA+EPDWWEIFWHKNCSTS S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
          NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP KL+FN VH 
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I SS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

A0A6J1H8X9 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X11.8e-21390.57Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDS KLKK K LSVVPM PRA+EPDWWEIFWHKNCSTS    S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA
        CDT  NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDLSP KL+FN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA

Query:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI
        VH AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I
Subjt:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI

Query:  RSS
         SS
Subjt:  RSS

A0A6J1KQ50 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X21.2e-21491Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS
        MAPTKKSKKRKRDS KLKK K LSVVPM PR SEPDWWEIFWHKNCSTS S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGRLLS
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLS

Query:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT
        VEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDWCDT
Subjt:  VEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDT

Query:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK
        N NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDL P KL+FN VH 
Subjt:  NNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHK

Query:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS
        AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I SS
Subjt:  AARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS

A0A6J1KRU3 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 isoform X15.2e-21390.32Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR
        MAPTKKSKKRKRDS KLKK K LSVVPM PR SEPDWWEIFWHKNCSTS    S GPNDEAE FK+FFRTSK TFDYICSLVREDL+SRPPSGLINIEGR
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTS---DSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGR

Query:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW
        LLSVEKQVAIAMRRLASGESQVSVGA+FGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSS+LEEIKSQFEASFGLPNCCGAIDATHIIMTLPA+QTSDDW
Subjt:  LLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW

Query:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA
        CDTN NYSM LQGIVDHQMRFLDIVTGWPGGMTT+RLLKCSKFFKLC+VGERLNGN RKLSG SEIREYLVGG  YPLLPWLITPYE+DDL P KL+FN 
Subjt:  CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNA

Query:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI
        VH AA+ LAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNI+IDNGDELQPDVALSGHHDLGYQEHCCKQ+DPLGNTSRENLA H HQNKE+I
Subjt:  VHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERI

Query:  RSS
         SS
Subjt:  RSS

SwissProt top hitse value%identityAlignment
Q6AZB8 Putative nuclease HARBI11.4e-2125.36Show/hide
Query:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWP-SSSKLEEIKSQFEASFGL
        Y+  L+++ L+ R          R +S + Q+  A+    SG  Q  +G A G+ Q+++S+      +AL ++A   + +    +  ++ K +F    G+
Subjt:  YICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWP-SSSKLEEIKSQFEASFGL

Query:  PNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGC
        PN  G +D  HI +  P    S  + +    +S+  Q + D +   L   T WPG +T   + K S   KL +              E++   +L+G   
Subjt:  PNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGC

Query:  YPLLPWLITPYENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNILIDNG
        YPL  WL+TP ++ + SP+   +N  H     +  R F  ++  +R L+  K   +    K   II  CC+L NI + +G
Subjt:  YPLLPWLITPYENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILN--KVMWRPDKRKLPSIILVCCLLQNILIDNG

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 15.3e-15466.75Show/hide
Query:  MAPTKKSKKRKRDS-------NKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK K+          K K+ K ++ VP+ P A + DWW+ FW +N S S    P+DE   FK FFR SKTTF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKRDS-------NKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHLRWP S ++EEIKS+FE  +GLPNCCGAID THIIMTLPA+Q 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQT

Query:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKL
        SDDWCD   NYSM LQG+ DH+MRFL++VTGWPGGMT ++LLK S FFKLC+  + L+GN + LS  ++IREY+VGG  YPLLPWLITP+++D  S S +
Subjt:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKL

Query:  DFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHM
         FN  H+  R +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NI+ID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +H+
Subjt:  DFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHM

Q9M2U3 Protein ALP1-like4.5e-9745.1Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEP-----------------DWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDL
        M P K  KK+KR   K+ ++  L+       AS                   DWW+ F  +    S  P      + F+  F+ S+ TFDYICSLV+ D 
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEP-----------------DWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  SKL+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATH

Query:  IIMTLPAIQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITP
        I+M LPA++ S+  W D   N+SM LQ +VD  MRFLD++ GWPG +  + +LK S F+KL + G+RLNG    LS  +E+REY+VG   +PLLPWL+TP
Subjt:  IIMTLPAIQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITP

Query:  YENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTS
        Y+    S  + +FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NI+ID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTS

Query:  RENLAKHM
        R+ L+  +
Subjt:  RENLAKHM

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)3.6e-2527.22Show/hide
Query:  SNKLKKSKTLSVVPMGPRASEPD-------WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVA
        S  L+    +S +P+ P  S          W+  F        D P        +  +FR SK+TF  + S++     S  PS                A
Subjt:  SNKLKKSKTLSVVPMGPRASEPD-------WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVA

Query:  IAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSM
          + RLA G S   +   FG    + SQ +  F    +      +    S +L++ K  F  +  LPNC G +      +    +             S+
Subjt:  IAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSM

Query:  LLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYE--NDDLSPSKLDFNAVHKAARL
        L+Q +VD   RF+DI  GWP  M    + + +K F + +  E L+G   KL     +  Y++G  C PLLPWL+TPY+  +D+ S  +   N VH     
Subjt:  LLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYE--NDDLSPSKLDFNAVHKAARL

Query:  LAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNILIDNGDE
        + + AF++++  WRIL+K  W+P+  + +P +I   CLL N L+++GD+
Subjt:  LAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNILIDNGDE

AT1G72270.2 LOCATED IN: mitochondrion3.6e-2527.22Show/hide
Query:  SNKLKKSKTLSVVPMGPRASEPD-------WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVA
        S  L+    +S +P+ P  S          W+  F        D P        +  +FR SK+TF  + S++     S  PS                A
Subjt:  SNKLKKSKTLSVVPMGPRASEPD-------WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVA

Query:  IAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSM
          + RLA G S   +   FG    + SQ +  F    +      +    S +L++ K  F  +  LPNC G +      +    +             S+
Subjt:  IAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSM

Query:  LLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYE--NDDLSPSKLDFNAVHKAARL
        L+Q +VD   RF+DI  GWP  M    + + +K F + +  E L+G   KL     +  Y++G  C PLLPWL+TPY+  +D+ S  +   N VH     
Subjt:  LLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYE--NDDLSPSKLDFNAVHKAARL

Query:  LAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNILIDNGDE
        + + AF++++  WRIL+K  W+P+  + +P +I   CLL N L+++GD+
Subjt:  LAVRAFSQLKGSWRILNKVMWRPDKRK-LPSIILVCCLLQNILIDNGDE

AT3G55350.1 PIF / Ping-Pong family of plant transposases3.2e-9845.1Show/hide
Query:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEP-----------------DWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDL
        M P K  KK+KR   K+ ++  L+       AS                   DWW+ F  +    S  P      + F+  F+ S+ TFDYICSLV+ D 
Subjt:  MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEP-----------------DWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDL

Query:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATH
         ++ P+   +  G  LS+  +VA+A+RRL SGES   +G  FG+ QSTVSQ+TWRFVE++E+RA HHL WP  SKL+EIKS+FE   GLPNCCGAID TH
Subjt:  ISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATH

Query:  IIMTLPAIQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITP
        I+M LPA++ S+  W D   N+SM LQ +VD  MRFLD++ GWPG +  + +LK S F+KL + G+RLNG    LS  +E+REY+VG   +PLLPWL+TP
Subjt:  IIMTLPAIQTSDD-WCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITP

Query:  YENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTS
        Y+    S  + +FN  H  A   A  A S+LK  WRI+N VMW PD+ +LP II VCCLL NI+ID  D+   D  LS  HD+ Y++  CK  D   +  
Subjt:  YENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTS

Query:  RENLAKHM
        R+ L+  +
Subjt:  RENLAKHM

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.7e-15566.75Show/hide
Query:  MAPTKKSKKRKRDS-------NKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLIN
        MAP K+ KK K+          K K+ K ++ VP+ P A + DWW+ FW +N S S    P+DE   FK FFR SKTTF YICSLVREDLISRPPSGLIN
Subjt:  MAPTKKSKKRKRDS-------NKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLIN

Query:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQT
        IEGRLLSVEKQVAIA+RRLASG+SQVSVGAAFGVGQSTVSQVTWRF+EALE+RAKHHLRWP S ++EEIKS+FE  +GLPNCCGAID THIIMTLPA+Q 
Subjt:  IEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQT

Query:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKL
        SDDWCD   NYSM LQG+ DH+MRFL++VTGWPGGMT ++LLK S FFKLC+  + L+GN + LS  ++IREY+VGG  YPLLPWLITP+++D  S S +
Subjt:  SDDWCDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKL

Query:  DFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHM
         FN  H+  R +A  AF QLKGSWRIL+KVMWRPD+RKLPSIILVCCLL NI+ID GD LQ DV LSGHHD GY +  CKQ +PLG+  R  L +H+
Subjt:  DFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHM

AT5G12010.1 unknown protein7.1e-3728.18Show/hide
Query:  WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT
        WWE      CS  D P      E FK  FR SK+TF+ IC  +    +++  + L N     + V ++VA+ + RLA+GE    V   FG+G ST  ++ 
Subjt:  WWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQVSVGAAFGVGQSTVSQVT

Query:  WRFVEALEQ-RAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGM
            +A++      +L+WP    L  I+ +FE+  G+PN  G++  THI +  P I  +  +       +   +YS+ +Q +V+ +  F D+  GWPG M
Subjt:  WRFVEALEQ-RAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDW------CDTNNNYSMLLQGIVDHQMRFLDIVTGWPGGM

Query:  TTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDK
          +++L+ S  ++  + G  L G             ++ GG  +PLL W++ PY   +L+ ++  FN      + +A  AF +LKG W  L K       
Subjt:  TTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDK

Query:  RKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPL--GNTSRENLAKH
        + LP+++  CC+L NI     ++++P++ +    D    E+  + V+ +   +T   NL  H
Subjt:  RKLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPL--GNTSRENLAKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCCACAAAGAAATCGAAGAAGCGCAAAAGGGATTCCAACAAACTCAAGAAATCTAAAACCTTAAGCGTTGTTCCTATGGGACCCAGAGCCTCGGAGCCCGATTG
GTGGGAAATTTTCTGGCACAAGAATTGTTCAACCTCAGATTCTCCTGGACCCAATGATGAAGCAGAAGGATTCAAGTTCTTCTTTCGAACCTCGAAGACGACTTTCGACT
ACATTTGTTCCCTTGTAAGAGAAGATCTGATCTCAAGGCCACCGTCTGGGCTTATCAACATTGAGGGAAGACTTCTCAGTGTAGAGAAGCAAGTTGCAATTGCTATGAGA
AGATTGGCATCTGGTGAGTCCCAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTCACTTGGAGATTTGTTGAAGCTTTGGAGCAACGGGC
GAAGCACCATCTTCGGTGGCCGAGTTCATCGAAACTTGAGGAAATCAAGTCCCAATTTGAAGCCTCCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACCCTTCCAGCCATACAAACATCTGATGATTGGTGTGATACTAACAATAACTACAGTATGTTATTGCAGGGAATCGTAGATCACCAGATGCGATTTCTTGAC
ATTGTAACAGGTTGGCCCGGGGGCATGACGACTAACCGATTGCTGAAGTGCTCGAAATTCTTCAAACTATGCGATGTCGGTGAGCGTTTGAATGGAAATGTAAGGAAGTT
GTCTGGAGAGTCAGAGATCAGAGAGTACTTGGTTGGTGGAGGTTGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTATGAAAATGACGACCTATCGCCATCGAAGCTCG
ACTTCAACGCCGTGCACAAAGCTGCAAGGTTGCTCGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGATAAGCGG
AAATTACCGAGCATCATACTAGTATGCTGTTTACTTCAGAACATTCTAATTGACAATGGAGATGAACTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGTTA
TCAGGAGCATTGTTGCAAGCAAGTTGATCCATTGGGGAACACTTCAAGGGAAAACTTAGCCAAGCATATGCATCAAAACAAAGAGAGAATTCGTTCTTCA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCCACAAAGAAATCGAAGAAGCGCAAAAGGGATTCCAACAAACTCAAGAAATCTAAAACCTTAAGCGTTGTTCCTATGGGACCCAGAGCCTCGGAGCCCGATTG
GTGGGAAATTTTCTGGCACAAGAATTGTTCAACCTCAGATTCTCCTGGACCCAATGATGAAGCAGAAGGATTCAAGTTCTTCTTTCGAACCTCGAAGACGACTTTCGACT
ACATTTGTTCCCTTGTAAGAGAAGATCTGATCTCAAGGCCACCGTCTGGGCTTATCAACATTGAGGGAAGACTTCTCAGTGTAGAGAAGCAAGTTGCAATTGCTATGAGA
AGATTGGCATCTGGTGAGTCCCAAGTTTCTGTGGGAGCTGCCTTTGGAGTTGGCCAGTCCACAGTCTCTCAGGTCACTTGGAGATTTGTTGAAGCTTTGGAGCAACGGGC
GAAGCACCATCTTCGGTGGCCGAGTTCATCGAAACTTGAGGAAATCAAGTCCCAATTTGAAGCCTCCTTTGGGCTGCCTAATTGTTGTGGAGCCATAGATGCAACACACA
TCATTATGACCCTTCCAGCCATACAAACATCTGATGATTGGTGTGATACTAACAATAACTACAGTATGTTATTGCAGGGAATCGTAGATCACCAGATGCGATTTCTTGAC
ATTGTAACAGGTTGGCCCGGGGGCATGACGACTAACCGATTGCTGAAGTGCTCGAAATTCTTCAAACTATGCGATGTCGGTGAGCGTTTGAATGGAAATGTAAGGAAGTT
GTCTGGAGAGTCAGAGATCAGAGAGTACTTGGTTGGTGGAGGTTGTTATCCTCTTCTTCCTTGGTTGATTACTCCTTATGAAAATGACGACCTATCGCCATCGAAGCTCG
ACTTCAACGCCGTGCACAAAGCTGCAAGGTTGCTCGCTGTGAGGGCATTCTCTCAGTTGAAGGGCAGCTGGAGAATCCTCAACAAGGTTATGTGGAGACCCGATAAGCGG
AAATTACCGAGCATCATACTAGTATGCTGTTTACTTCAGAACATTCTAATTGACAATGGAGATGAACTACAACCAGATGTTGCTTTATCTGGTCATCATGACTTGGGTTA
TCAGGAGCATTGTTGCAAGCAAGTTGATCCATTGGGGAACACTTCAAGGGAAAACTTAGCCAAGCATATGCATCAAAACAAAGAGAGAATTCGTTCTTCA
Protein sequenceShow/hide protein sequence
MAPTKKSKKRKRDSNKLKKSKTLSVVPMGPRASEPDWWEIFWHKNCSTSDSPGPNDEAEGFKFFFRTSKTTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMR
RLASGESQVSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLRWPSSSKLEEIKSQFEASFGLPNCCGAIDATHIIMTLPAIQTSDDWCDTNNNYSMLLQGIVDHQMRFLD
IVTGWPGGMTTNRLLKCSKFFKLCDVGERLNGNVRKLSGESEIREYLVGGGCYPLLPWLITPYENDDLSPSKLDFNAVHKAARLLAVRAFSQLKGSWRILNKVMWRPDKR
KLPSIILVCCLLQNILIDNGDELQPDVALSGHHDLGYQEHCCKQVDPLGNTSRENLAKHMHQNKERIRSS