; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G15333 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G15333
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionDDE Tnp4 domain-containing protein
Genome locationctg2009:1340142..1341872
RNA-Seq ExpressionCucsat.G15333
SyntenyCucsat.G15333
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.82e-27791.75Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.89e-24785.5Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRVH
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSS+FED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN+YLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKA +IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]4.17e-28393.4Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]2.35e-24886Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRVH
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSSAFED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN+YLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]9.20e-27192.93Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDS+FYANLF HFLFSQDFAASLPFLSVSRKRKRTN SDHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC
        FEDLAGLPNCCGVV+CTSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGEYPLLPWL++PFAGAVSGSTEESFN+AHRLMC
Subjt:  FEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC

Query:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        IPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYVE  LN DSTNEKAS+IQRALA+RARELHS
Subjt:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein2.02e-28393.4Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC1034961691.07e-24090.32Show/hide
Query:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT                            SIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLNVDSTNEKASVIQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI18.79e-27891.75Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like3.12e-24685.01Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRVH
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSS+FED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN+YLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKA+++Q+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

A0A6J1J0M5 protein ALP1-like6.29e-24685.5Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRVH
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSSAFED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN+YLFGHG+YPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL
        ESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 13.6e-2226.62Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T I                                     V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like1.6e-2225.91Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL
            P++L+   S FE ++GLPNCCG +  T IV                                      AG+ G+ +D  VL +S  +K +E+G+ L
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL

Query:  NSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        N   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  NSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

Arabidopsis top hitse value%identityAlignment
AT3G19120.1 PIF / Ping-Pong family of plant transposases2.7e-1222.96Show/hide
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF----AHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHH------LFRTRTPD---
        +++S LL L   L P+S   S  S S+  S+  ++L     A  L     A+ L FL+V+R    ++ S      S    +         FR  T D   
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF----AHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHH------LFRTRTPD---

Query:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEF
                  +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +    +  + R+L T  +  +++ 
Subjt:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEF

Query:  PC-PNELELTSSAFEDLAGLPNCCGVVSCT---------------------------SIVAGFR-----------GNKDDSTVLMSSTLFKDIEQGRLLN
        P     L  T+  FE+L  LPN CG +  T                            +VA  +           G +DDS+    S L+K +  G ++ 
Subjt:  PC-PNELELTSSAFEDLAGLPNCCGVVSCT---------------------------SIVAGFR-----------GNKDDSTVLMSSTLFKDIEQGRLLN

Query:  SPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  SPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.2e-2325.91Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL
            P++L+   S FE ++GLPNCCG +  T IV                                      AG+ G+ +D  VL +S  +K +E+G+ L
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL

Query:  NSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        N   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  NSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.6e-2326.62Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T I                                     V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein5.2e-1625.09Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTSI--------VAGF-------RGNKDDSTVLMSST-----LFKDI---EQGRLLN---------SPPVYLH
        P  +E+  T + FE +  +PN  G +  T I        VA +       R  K   ++ +        +F D+     G L +         S      
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTSI--------VAGF-------RGNKDDSTVLMSST-----LFKDI---EQGRLLN---------SPPVYLH

Query:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED
        G+  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTC
CGCCCCAGATTCCAGTTTCTATGCCAATCTCTTCGCCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCGCTCCGACCATCTCGAATTGGGGTCTTCCCATGGACGAGTTCATCATCTGTTTCGGACCCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGTATTGTT
GCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGG
GGTGGCTGTGAATAAGTACTTGTTTGGACATGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATG
AAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTAC
ATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAAATCTCAGTATGT
TGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAAGCTTCTGTTATACAGAGAGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTC
CGCCCCAGATTCCAGTTTCTATGCCAATCTCTTCGCCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCGCTCCGACCATCTCGAATTGGGGTCTTCCCATGGACGAGTTCATCATCTGTTTCGGACCCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGTATTGTT
GCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGG
GGTGGCTGTGAATAAGTACTTGTTTGGACATGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATG
AAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTAC
ATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAAATCTCAGTATGT
TGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAAGCTTCTGTTATACAGAGAGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAG
Protein sequenceShow/hide protein sequence
MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV
AGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAY
IGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS