; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0023240 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0023240
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationchr06:10394699..10396166
RNA-Seq ExpressionPI0023240
SyntenyPI0023240
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]2.2e-21088.68Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLFTHFLFSQDF ASLPFLSVSRKRKRTNP DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCP+ELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA

Query:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCC                                      GFRGNKDDSTVLMSSTLFKDIEQG+LL+SPPVYLHGVAVN+YLFG GEYPL
Subjt:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVE GLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-18683.29Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ   ASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CPSELELTSS+FED+AGLPNCC          GFRG+KDDSTVLMS+TLFKDIE+G+LL SPPVYLHG+AVNQYLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKA +IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]4.3e-21188.92Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF HFLFSQDF ASLPFLSVSRKRKRTN SDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCP+ELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA

Query:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCC                                      GFRGNKDDSTVLMSSTLFKDIEQG+LL+SPPVYLHGVAVN+YLFGHGEYPL
Subjt:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVE GLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]1.5e-18783.78Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ   ASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CPSELELTSSAFED+AGLPNCC          GFRG+KDDSTVLMS+TLFKDIE+G+LL SPPVYLHG+AVNQYLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]8.5e-20791.16Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDS+FYANLFTHFLFSQDF ASLPFLSVSRKRKRTNPSDHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCP+ELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA

Query:  FEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC
        FEDLAGLPNCC          GFRG+KDDSTVLMSSTLFKDIEQG+LLD+PPVYLHGVAVNQYLFGHGEYPLLPWL++PFAGAVSGSTEESFN+AHRLMC
Subjt:  FEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC

Query:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALALRARELHS
        IPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN DSTNEKAS+IQRALA+RARELHS
Subjt:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein2.1e-21188.92Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF HFLFSQDF ASLPFLSVSRKRKRTN SDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCP+ELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA

Query:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCC                                      GFRGNKDDSTVLMSSTLFKDIEQG+LL+SPPVYLHGVAVN+YLFGHGEYPL
Subjt:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVE GLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI11.0e-21088.68Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLFTHFLFSQDF ASLPFLSVSRKRKRTNP DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCP+ELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSA

Query:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCC                                      GFRGNKDDSTVLMSSTLFKDIEQG+LL+SPPVYLHGVAVN+YLFG GEYPL
Subjt:  FEDLAGLPNCC--------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVE GLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A6J1D7F1 protein ALP1-like2.7e-18282.02Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF---THFLFSQDFTASLPFLSVSRKRKRTNPSDHLEL-------GSSHGRVHH
        MDS  LAAL+SSLISQLLL LFLLFPSSNPHSL SN   DS FYAN F   THFLFSQ+  +SL FLSVSRKRKRT+  + LEL       G   GRVH 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF---THFLFSQDFTASLPFLSVSRKRKRTNPSDHLEL-------GSSHGRVHH

Query:  LFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPC
        L+ TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEE
        P+ELE TSSAFE LAGLPNCC          GFRG+KDDSTVLMSSTLFKDIE+G+LLDSPPVYLHG+AVNQY FGHGEYPLLPWL+VPF+GAVSGSTEE
Subjt:  PSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALALR
        SFN+AHRLMCIPALKAIVSLRNWGVLSQP+ EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L+SLDH SQY+  GLN DST+EKASVIQRALALR
Subjt:  SFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALALR

Query:  ARELHS
        ARELHS
Subjt:  ARELHS

A0A6J1FNZ2 protein ALP1-like5.2e-18682.8Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ   ASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CPSELELTSS+FED+AGLPNCC          GFRG+KDDSTVLMS+TLFKDIE+ +LL SPPVYLHG+AVNQYLFGHGEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKA+++Q+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

A0A6J1J0M5 protein ALP1-like8.9e-18683.29Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ   ASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYAN---LFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE
        CPSELELTSSAFED+AGLPNCC          GFRG+KDDSTVLMS+TLFKDIE+ +LL SPPVYLHGVAVNQYLFGHG+YPLLPWL+VPFAGAVSGSTE
Subjt:  CPSELELTSSAFEDLAGLPNCC----------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL
        ESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALAL
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALAL

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 19.6e-2025.6Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P    +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSEL

Query:  ELTSSAFEDLAGLPNCC-----------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYL-H
        E   S FE++ GLPNCC                                               G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCC-----------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like2.8e-1924.58Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSAFEDLAGLPNCC------------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLL
            PS+L+   S FE ++GLPNCC                                                G+ G+ +D  VL +S  +K +E+GK L
Subjt:  EFPCPSELELTSSAFEDLAGLPNCC------------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLL

Query:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        +   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

Arabidopsis top hitse value%identityAlignment
AT3G19120.1 PIF / Ping-Pong family of plant transposases2.9e-1122.62Show/hide
Query:  SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEFPC-PSELELTSSAFEDLAGLPNCCG---------------
        S L L  +  + + L RLA GC   T++ ++ +   +    +  + R+L T  +  +++ P     L  T+  FE+L  LPN CG               
Subjt:  SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEFPC-PSELELTSSAFEDLAGLPNCCG---------------

Query:  ---------------------------------FRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE
                                           G +DDS+    S L+K +  G ++    + + G  V  Y+ G   YPLL +L+ PF+   SG+  
Subjt:  ---------------------------------FRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  ESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases2.0e-2024.58Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSAFEDLAGLPNCC------------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLL
            PS+L+   S FE ++GLPNCC                                                G+ G+ +D  VL +S  +K +E+GK L
Subjt:  EFPCPSELELTSSAFEDLAGLPNCC------------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLL

Query:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        +   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  DSPPVYL-HGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.8e-2125.6Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P    +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSEL

Query:  ELTSSAFEDLAGLPNCC-----------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYL-H
        E   S FE++ GLPNCC                                               G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCC-----------------------------------------------GFRGNKDDSTVLMSSTLFKDIEQGKLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein2.5e-1524.4Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF

Query:  PCPSELELTSSAFEDLAGLPNCCGF-------------------------RGNKDDSTVLMSST-----LFKDI---EQGKLLD---------SPPVYLH
        P  SE+  T + FE +  +PN  G                          R  K   ++ +        +F D+     G L D         S      
Subjt:  PCPSELELTSSAFEDLAGLPNCCGF-------------------------RGNKDDSTVLMSST-----LFKDI---EQGKLLD---------SPPVYLH

Query:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED
        G+  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  GVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCCTCGTTTAGCTGCTTTACTCTCTTCCTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCGCATTCCCTTTTCTCCAATTC
CGCTCCGGATTCCAGTTTCTATGCCAATCTCTTCACCCACTTCCTCTTTTCCCAGGATTTTACCGCTTCGCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCCCTCCGACCATCTCGAATTGGGGTCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTTGATCTCTCCGTTGAAATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCAGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAGTGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGATTTCGTGGCAATAAGGACGACTCG
ACGGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAAGCTTCTGGATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACA
TGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTC
TGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCT
TTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAGATCTCAGTATGTTGAACCTGGATTGAATGTGGACTCAACTAA
TGAGAAGGCTTCTGTTATACAGAGAGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCCTCGTTTAGCTGCTTTACTCTCTTCCTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCGCATTCCCTTTTCTCCAATTC
CGCTCCGGATTCCAGTTTCTATGCCAATCTCTTCACCCACTTCCTCTTTTCCCAGGATTTTACCGCTTCGCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCCCTCCGACCATCTCGAATTGGGGTCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTTGATCTCTCCGTTGAAATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCAGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAGTGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGATTTCGTGGCAATAAGGACGACTCG
ACGGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAAGCTTCTGGATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACA
TGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTC
TGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCT
TTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAGATCTCAGTATGTTGAACCTGGATTGAATGTGGACTCAACTAA
TGAGAAGGCTTCTGTTATACAGAGAGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAGGATTTCAATCACAAGAATGCAGTTGTTTAGTGAAATTAATAAGTACAGC
CCATTTGAAAGAGATTTCCATCTCTTAGGATATTAATTGAAGGCAGCTCATCCAGCTCCATTCAACTATTACTATTGTAGAGGTAATTGATGATTCTTTTACAAATCTAT
ATAACAACATTTTCTCCCAAATTTTGGGTGTTCA
Protein sequenceShow/hide protein sequence
MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFTHFLFSQDFTASLPFLSVSRKRKRTNPSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDLAGLPNCCGFRGNKDDS
TVLMSSTLFKDIEQGKLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNA
LLMREDFSAMADEWESLSSLDHRSQYVEPGLNVDSTNEKASVIQRALALRARELHS