; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027420 (gene) of Chayote v1 genome

Gene IDSed0027420
OrganismSechium edule (Chayote v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG11:1486258..1487589
RNA-Seq ExpressionSed0027420
SyntenySed0027420
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]4.8e-20683.52Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  SSSS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        +EEQ  ++GASCSSEEQKFPLYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]4.1e-20583.3Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  SSSS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        +EEQ  ++GASCSSEEQKF LYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

XP_022965738.1 protein ALP1-like [Cucurbita maxima]2.4e-20583.74Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  SSSS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPFVVLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        +EEQ  E+GAS SSEEQKFPLYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

XP_023536803.1 protein ALP1-like [Cucurbita pepo subsp. pepo]7.0e-20583.3Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  S SS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
         EEQ  ++GASCSSEEQKFPLYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]5.3e-20580.74Show/hide
Query:  MAAGGHGGDKRITRSS-----AVAVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP----P
        MA  G GGDKR TRSS     A A   TTRSKAKK DR++HL HQLVTLI+TTISSAHSFLS+NDLHLLPSQTLALE+L+ S SSSL+ALSP LP    P
Subjt:  MAAGGHGGDKRITRSS-----AVAVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP----P

Query:  PPPPLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYT
        PPPP  P PPRQCWFQR LSATS++DCDPRWNLSFRMSKSSFSLLLRLLSPI  SSSSS++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY 
Subjt:  PPPPLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYT

Query:  VCKAINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGEL
        VCKAIN+KLG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G  G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY+EIEKS EL
Subjt:  VCKAINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGEL

Query:  LNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNF
        L G  YNLDD+ PI QYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAM L+NTAF RLRARWKLLSKPWKEGCRDFFPF+VLTGCLLHNF
Subjt:  LNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNF

Query:  LIKCSEKVDEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        LIKCSEK+DEEQ  EE A CSSE+QKFPLYDG++ DDRGKDIRDALALHLS LS+RR
Subjt:  LIKCSEKVDEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein2.5e-20080.09Show/hide
Query:  MAAGGHGGDKRITRSSAV--AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP----PPPP
        MA  G  GDKR TRSSA+  A A  TRSKAKK D++NHL HQL+TLIETTISSAHSFLS+NDLHLLPSQTLALE+L+CS SSSLHALSP LP    PPP 
Subjt:  MAAGGHGGDKRITRSSAV--AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP----PPPP

Query:  PLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCK
        P    PPRQCWFQR LSATS++DCDPRWNLSFRMSKSSFSLLLRLLSPI  SS SS++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCK
Subjt:  PLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCK

Query:  AINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGVGVE---GDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSA
        AIN+KLG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFG   E   G LLVQALVDAEGRFLDVSAGWPS MKP TIL+QSKLY EIEKS ELL G  
Subjt:  AINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGVGVE---GDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSA

Query:  YNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCS
        YNLD+E PI QYLIGDSCFPLLPWLLTPYM+LNEED SGF  RAFNSTH RAM L+NTAF RLRARWKLLSKPWKEGCRDFFPF++LTGCLL NFLIKCS
Subjt:  YNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCS

Query:  EKVDEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        EK+DEEQ  EEGASCSSEEQKFPL+DGE+ D RGKDIRDALALHLS L++RR
Subjt:  EKVDEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

A0A5D3BH79 Putative nuclease HARBI11.4e-19879.87Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MA  G  GDKR TRSSA+ A A  TRSKAKK D++NHL HQL+TLIETTISSA SFLS+NDLHLLPSQTLALE+L+CS SSSLHALSP LP    P    
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSATS++DCDPRWNLSFRMSKSSFSLLLRLLSPI  S SSS++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAIN+K
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGVGVE---GDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDD
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFG   E   G LLVQALVDAEGRFLDVSAGWPS MKP TIL+QSKLY+EIEKS ELL G  YNLDD
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGVGVE---GDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDD

Query:  ENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDE
        E PI QYLIGDSCFPL PWLLTPY++LNEED SGF  RAFNSTH RAM L+NTAF RLRARWKLLSKPWKEGCRDFFPF++LTGCLL NFLIKCSEK+DE
Subjt:  ENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDE

Query:  EQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        EQ  EEGASCSSEEQKFP +DGE+ D RGKDIRDALALHLS LS+RR
Subjt:  EQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

A0A6J1CNL4 protein ALP1-like4.7e-19978.82Show/hide
Query:  MAAGGHGGDKRITRSSAVAVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP---------P
        MAAGG GGD+R+TR++AV  A  T+SKAKKSDR++HL  QLVTLIETTISSAHSFLS NDLHLLPSQTLALE+LI S SSSL AL+P LP         P
Subjt:  MAAGGHGGDKRITRSSAVAVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLP---------P

Query:  PPPPLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYT
        PP    P PPRQCWFQR LSATSE+DCDPRWN SFRMSKSSFSLLLRLLSPI  SSSS +++ DCALAAA++RLAHGASY  +GR+FGID ADACRSFY 
Subjt:  PPPPLSPSPPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYT

Query:  VCKAINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGEL
        VCKAIN+KLG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFG+     G  G LLVQALVDAEGRFLDVSAGWPS + PETIL+QSKLY EIEKSGEL
Subjt:  VCKAINDKLGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGEL

Query:  LNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNF
        L G  YNLDDE PI QYLIGDSCFPLLPWLLTPY+KL EE+ SGF  RAFNSTHNRAMGL+NTAF RL+ARWKLLSKPWKEGCRDFFPF+VLTGCLLHNF
Subjt:  LNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNF

Query:  LIKCSEKVD-EEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        LIKCSEK++ E++ GEE A+C SEEQKFPLYDGE+ DDRGKDIRDALA+HLSRLSFRR
Subjt:  LIKCSEKVD-EEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

A0A6J1FIY2 protein ALP1-like2.0e-20583.3Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  SSSS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        +EEQ  ++GASCSSEEQKF LYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

A0A6J1HRT9 protein ALP1-like1.2e-20583.74Show/hide
Query:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS
        MAAGG  GDKR TRSSA+ A AVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLS+NDLHLLPSQTLALE+ I S SSSL ALSP L  P   L P 
Subjt:  MAAGGHGGDKRITRSSAV-AVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPS

Query:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK
        PPRQCWFQR LSAT+E+DCDPRWNL FRMSKSSFSLLLRLLSPI  SSSS+++  DCALAAA++RLAHGASY  +GR+FGID ADACRSFY VCKAINDK
Subjt:  PPRQCWFQRLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK

Query:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL
        LG+LLELRSDIDRIVVGFGWISLPNCCGVLG+RRFGV     G +G LLVQALVDAEGRFLDVSAGWPS MKPETIL+QSKLY EIEKSGELL G  YNL
Subjt:  LGYLLELRSDIDRIVVGFGWISLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNL

Query:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV
        DD  PISQYLIGDSCFPLLPWLLTPYMKLNEED SGF  RAFNSTHNRAMGL+NTAF ++RARWKLLSKPWKE CRDFFPFVVLTGCLLHNFLIKCSEK+
Subjt:  DDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKV

Query:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
        +EEQ  E+GAS SSEEQKFPLYDGE  DDRGKDIRDALALHLSRLSFRR
Subjt:  DEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI11.5e-0824.25Show/hide
Query:  SSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSD-----------------------IDRIVVGFGWISLPNC
        S  IS +  + AA+     G+    +G   GI  A   R    V +A+ ++    +   +D                       +D I V    I  PN 
Subjt:  SSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSD-----------------------IDRIVVGFGWISLPNC

Query:  CGVLGIRRFGVGVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNE
          +  + R G+     L+V    D  G  + V   WP  ++   +LQQS L  + E                 P   +L+GDS F L  WLLTP + + E
Subjt:  CGVLGIRRFGVGVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNE

Query:  EDGSGFSNRAFNSTHNRAMGLLNTAFSRLR----ARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIK
               NRA ++TH+     L T   R R    ++  L   P K         ++L  C+LHN  ++
Subjt:  EDGSGFSNRAFNSTHNRAMGLLNTAFSRLR----ARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIK

Q6AZB8 Putative nuclease HARBI11.3e-0721.11Show/hide
Query:  SFRMSKSSFSLLLRLL--SPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK----LGYLLELRSDIDRIVVGFG
        +F   +     L+ LL  S +  +  S  IS D  + AA+     G+  + +G   GI  A   R    V KA+ +K    +G+  +  +        + 
Subjt:  SFRMSKSSFSLLLRLL--SPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDK----LGYLLELRSDIDRIVVGFG

Query:  WISLPNCCGVLGIRRFGVGVEG-------------DLLVQALVDAEGRFLDVSAGWPSFMKPETILQQS---KLYDEIEKSGELLNGSAYNLDDENPISQ
           +PN  GV+      +                  +  Q + DA G  L     WP  +    + +QS   KL++E E             DDE     
Subjt:  WISLPNCCGVLGIRRFGVGVEG-------------DLLVQALVDAEGRFLDVSAGWPSFMKPETILQQS---KLYDEIEKSGELLNGSAYNLDDENPISQ

Query:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLL--SKPWKEGCRDFFPFVVLTGCLLHNFLIK
        +L+GD+ +PL  WL+TP      +     ++  +N  H     +++  F  ++ R++ L  +K + +   +    ++   C+LHN  ++
Subjt:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLL--SKPWKEGCRDFFPFVVLTGCLLHNFLIK

Q8BR93 Putative nuclease HARBI11.3e-0722.69Show/hide
Query:  SSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVG------FGWISLPNCCGVLGIRRFGVGVEG--
        S  IS +  + AA+     G+    +G   GI  A   R    V +A+ ++    +     +D   V       +G   +P   GV       +      
Subjt:  SSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVG------FGWISLPNCCGVLGIRRFGVGVEG--

Query:  -----------DLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGS
                    L    + D  G  + V   WP  ++   +LQ+S L  + E                 P   +L+GDS F L  WLLTP + + E    
Subjt:  -----------DLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGS

Query:  GFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFP----FVVLTGCLLHN
           NRA ++TH+     L T   R R           +G   + P     ++L  C+LHN
Subjt:  GFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFP----FVVLTGCLLHN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 13.2e-2727.37Show/hide
Query:  PLSPSPPRQCWFQ----RLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSS--------TISSDCALAAAIYRLAHGASYATIGRQFGIDP
        PL P      W+     R  S +   D D  +   FR SK++FS +  L+    IS   S         +S +  +A A+ RLA G S  ++G  FG+  
Subjt:  PLSPSPPRQCWFQ----RLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSS--------TISSDCALAAAIYRLAHGASYATIGRQFGIDP

Query:  ADACRSFYTVCKAINDKLGYLLELRSD--IDRIVVGF-GWISLPNCCGVLGIRRF-----GVGVEGD---------LLVQALVDAEGRFLDVSAGWPSFM
        +   +  +   +A+ ++  + L       I+ I   F     LPNCCG +           V    D         + +Q + D E RFL++  GWP  M
Subjt:  ADACRSFYTVCKAINDKLGYLLELRSD--IDRIVVGF-GWISLPNCCGVLGIRRF-----GVGVEGD---------LLVQALVDAEGRFLDVSAGWPSFM

Query:  KPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPW
            +L+ S  + ++ ++ ++L+G+   L     I +Y++G   +PLLPWL+TP+    + D    S  AFN  H +   +  TAF +L+  W++LSK  
Subjt:  KPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPW

Query:  KEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE-------QGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSR
            R   P ++L  CLLHN +I C + + E+         G     C   E   PL         G ++R  L  HL R
Subjt:  KEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE-------QGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSR

Q9M2U3 Protein ALP1-like1.8e-3330.41Show/hide
Query:  FRMSKSSFSLLLRLL------SPIHIS-SSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFG
        F++S+ +F  +  L+       P + S S+ + +S +  +A A+ RL  G S + IG  FG++ +   +  +   +++ ++  + L   S +D I   F 
Subjt:  FRMSKSSFSLLLRLL------SPIHIS-SSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFG

Query:  WIS-LPNCCGVLGIRRF-----------GVGVEGD----LLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQ
         IS LPNCCG + I               V ++G+    + +QA+VD + RFLDV AGWP  +  + +L+ S  Y  +EK G+ LNG    L +   + +
Subjt:  WIS-LPNCCGVLGIRRF-----------GVGVEGD----LLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQ

Query:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEEQ
        Y++GDS FPLLPWLLTPY    +   +      FN  H+ A      A S+L+ RW++++       R+  P ++   CLLHN +I   ++  ++Q
Subjt:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEEQ

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)4.0e-7341.2Show/hide
Query:  KKSDR-------KNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPSPPRQCWFQRLLSATSELDCDPR
        +KSDR       K  LK  L+  + +  +  +SFL  NDL L PSQTL LE+LI S+  S         P P   S +     WF R L++ +E + DPR
Subjt:  KKSDR-------KNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPSPPRQCWFQRLLSATSELDCDPR

Query:  WNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGID-PADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFGWI
        W L FRMSKS+F  L  +LS           SS  + AA I+RLAHGASY  +  +FG D  + A RSF+TVCK IN+KL         +D     F   
Subjt:  WNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGID-PADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFGWI

Query:  SLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPW
         LPNC GV+G  RF V     G +G +LVQALVD+ GRF+D+SAGWPS MKPE I +Q+KL+   E   E+L+G+   L +   + +Y++GDSC PLLPW
Subjt:  SLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPW

Query:  LLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVD--EE-----QGGEEG--ASC
        L+TPY   ++E+        FN+  +  +  +  AF+++RARW++L K WK    +F PFV+ TGCLLHNFL+   +  D  EE     + G+ G     
Subjt:  LLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVD--EE-----QGGEEG--ASC

Query:  SSEEQKFPLYDGEMEDDRGKDIRDALALHLSR
          +E++   ++GE   +  K IRDA+A +LSR
Subjt:  SSEEQKFPLYDGEMEDDRGKDIRDALALHLSR

AT1G72270.2 LOCATED IN: mitochondrion1.0e-7341.19Show/hide
Query:  KKSDR-------KNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPSPPRQCWFQRLLSATSELDCDPR
        +KSDR       K  LK  L+  + +  +  +SFL  NDL L PSQTL LE+LI S+  S         P P   S +     WF R L++ +E + DPR
Subjt:  KKSDR-------KNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPSPPRQCWFQRLLSATSELDCDPR

Query:  WNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGID-PADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFGWI
        W L FRMSKS+F  L  +LS           SS  + AA I+RLAHGASY  +  +FG D  + A RSF+TVCK IN+KL         +D     F   
Subjt:  WNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGID-PADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFGWI

Query:  SLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPW
         LPNC GV+G  RF V     G +G +LVQALVD+ GRF+D+SAGWPS MKPE I +Q+KL+   E   E+L+G+   L +   + +Y++GDSC PLLPW
Subjt:  SLPNCCGVLGIRRFGV-----GVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPW

Query:  LLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVD--EE-----QGGEEG--ASC
        L+TPY   ++E+        FN+  +  +  +  AF+++RARW++L K WK    +F PFV+ TGCLLHNFL+   +  D  EE     + G+ G     
Subjt:  LLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVD--EE-----QGGEEG--ASC

Query:  SSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR
          +E++   ++GE   +  K IRDA+A +LSR+S  R
Subjt:  SSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLSFRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.2e-3430.41Show/hide
Query:  FRMSKSSFSLLLRLL------SPIHIS-SSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFG
        F++S+ +F  +  L+       P + S S+ + +S +  +A A+ RL  G S + IG  FG++ +   +  +   +++ ++  + L   S +D I   F 
Subjt:  FRMSKSSFSLLLRLL------SPIHIS-SSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFG

Query:  WIS-LPNCCGVLGIRRF-----------GVGVEGD----LLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQ
         IS LPNCCG + I               V ++G+    + +QA+VD + RFLDV AGWP  +  + +L+ S  Y  +EK G+ LNG    L +   + +
Subjt:  WIS-LPNCCGVLGIRRF-----------GVGVEGD----LLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQ

Query:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEEQ
        Y++GDS FPLLPWLLTPY    +   +      FN  H+ A      A S+L+ RW++++       R+  P ++   CLLHN +I   ++  ++Q
Subjt:  YLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEEQ

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.3e-2827.37Show/hide
Query:  PLSPSPPRQCWFQ----RLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSS--------TISSDCALAAAIYRLAHGASYATIGRQFGIDP
        PL P      W+     R  S +   D D  +   FR SK++FS +  L+    IS   S         +S +  +A A+ RLA G S  ++G  FG+  
Subjt:  PLSPSPPRQCWFQ----RLLSATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSS--------TISSDCALAAAIYRLAHGASYATIGRQFGIDP

Query:  ADACRSFYTVCKAINDKLGYLLELRSD--IDRIVVGF-GWISLPNCCGVLGIRRF-----GVGVEGD---------LLVQALVDAEGRFLDVSAGWPSFM
        +   +  +   +A+ ++  + L       I+ I   F     LPNCCG +           V    D         + +Q + D E RFL++  GWP  M
Subjt:  ADACRSFYTVCKAINDKLGYLLELRSD--IDRIVVGF-GWISLPNCCGVLGIRRF-----GVGVEGD---------LLVQALVDAEGRFLDVSAGWPSFM

Query:  KPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPW
            +L+ S  + ++ ++ ++L+G+   L     I +Y++G   +PLLPWL+TP+    + D    S  AFN  H +   +  TAF +L+  W++LSK  
Subjt:  KPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPW

Query:  KEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE-------QGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSR
            R   P ++L  CLLHN +I C + + E+         G     C   E   PL         G ++R  L  HL R
Subjt:  KEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE-------QGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSR

AT5G12010.1 unknown protein1.1e-1927Show/hide
Query:  SFRMSKSSFSLLLRLLSPIHISSSS---STISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKL--GYL--------------LE
        +FRMSKS+F L+   L+       +   + I     +A  I+RLA G     + ++FG+  +   +    VCKAI D L   YL               E
Subjt:  SFRMSKSSFSLLLRLLSPIHISSSS---STISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKL--GYL--------------LE

Query:  LRSDIDRIVVGFGWISLPNCCGVLGIRRF--------GVGVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDE
          S I  +V       +P     + +  +               + +QA+V+ +G F D+  GWP  M  + +L++S LY      G LL G        
Subjt:  LRSDIDRIVVGFGWISLPNCCGVLGIRRF--------GVGVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDE

Query:  NPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE
             ++ G    PLL W+L PY + N      ++  AFN   +   G+   AF RL+ RW  L K  +   +D  P V+   C+LHN      EK++ E
Subjt:  NPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSGFSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGGAGGACACGGCGGCGATAAGAGAATCACTCGTAGCTCCGCCGTCGCCGTCGCCGTCACCACCAGAAGCAAGGCGAAGAAATCCGACCGGAAAAACCATCT
CAAACACCAACTGGTAACCCTAATCGAAACCACCATTTCTTCAGCTCACTCATTTCTCTCCATCAACGATCTCCACCTTCTCCCCTCTCAAACCCTAGCTCTCGAAACCC
TAATTTGTTCCATTTCATCCTCTCTCCACGCTCTTTCTCCCAATCTGCCACCGCCGCCGCCGCCGCTGTCGCCGTCACCGCCGCGACAATGCTGGTTCCAACGCCTCCTC
TCCGCGACGTCGGAACTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGCATGTCGAAATCGTCATTCTCCCTTCTCCTTCGCCTACTTTCCCCAATCCACATCTCATC
ATCATCCTCTACCATTTCATCCGATTGTGCTTTGGCCGCTGCGATTTACCGATTGGCACATGGAGCGAGCTATGCCACGATTGGGAGGCAATTCGGGATCGATCCCGCCG
ACGCTTGCCGCTCATTCTATACTGTTTGTAAGGCTATTAATGATAAATTGGGGTATTTGCTCGAGCTTAGATCTGATATTGATAGGATTGTTGTGGGGTTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTGGGGATTAGAAGATTTGGGGTTGGGGTTGAAGGTGATCTTCTTGTTCAAGCATTGGTTGATGCTGAAGGGAGGTTTCTTGATGT
CTCTGCTGGATGGCCGAGCTTCATGAAACCTGAAACAATCTTGCAGCAGAGTAAGCTCTATGATGAAATTGAGAAATCTGGTGAATTACTGAATGGTTCTGCTTATAATC
TCGATGATGAAAACCCCATTTCTCAATACTTGATTGGCGATTCTTGTTTTCCACTATTGCCATGGCTTTTGACACCATATATGAAACTGAATGAGGAAGATGGCTCTGGT
TTTTCTAACAGAGCTTTCAATTCCACGCATAACCGTGCAATGGGGTTGCTTAATACCGCGTTTAGCAGACTCCGAGCTCGGTGGAAGCTTCTGTCAAAGCCATGGAAGGA
AGGGTGTAGAGATTTTTTCCCTTTTGTTGTATTGACTGGGTGTTTGCTGCATAATTTCCTTATTAAATGCAGTGAGAAAGTAGATGAAGAACAAGGTGGAGAAGAAGGAG
CGAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGTGAGATGGAAGATGATAGAGGAAAGGATATCAGAGATGCGCTTGCATTGCACTTGAGTAGGCTGAGC
TTTAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGGAGGACACGGCGGCGATAAGAGAATCACTCGTAGCTCCGCCGTCGCCGTCGCCGTCACCACCAGAAGCAAGGCGAAGAAATCCGACCGGAAAAACCATCT
CAAACACCAACTGGTAACCCTAATCGAAACCACCATTTCTTCAGCTCACTCATTTCTCTCCATCAACGATCTCCACCTTCTCCCCTCTCAAACCCTAGCTCTCGAAACCC
TAATTTGTTCCATTTCATCCTCTCTCCACGCTCTTTCTCCCAATCTGCCACCGCCGCCGCCGCCGCTGTCGCCGTCACCGCCGCGACAATGCTGGTTCCAACGCCTCCTC
TCCGCGACGTCGGAACTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGCATGTCGAAATCGTCATTCTCCCTTCTCCTTCGCCTACTTTCCCCAATCCACATCTCATC
ATCATCCTCTACCATTTCATCCGATTGTGCTTTGGCCGCTGCGATTTACCGATTGGCACATGGAGCGAGCTATGCCACGATTGGGAGGCAATTCGGGATCGATCCCGCCG
ACGCTTGCCGCTCATTCTATACTGTTTGTAAGGCTATTAATGATAAATTGGGGTATTTGCTCGAGCTTAGATCTGATATTGATAGGATTGTTGTGGGGTTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTGGGGATTAGAAGATTTGGGGTTGGGGTTGAAGGTGATCTTCTTGTTCAAGCATTGGTTGATGCTGAAGGGAGGTTTCTTGATGT
CTCTGCTGGATGGCCGAGCTTCATGAAACCTGAAACAATCTTGCAGCAGAGTAAGCTCTATGATGAAATTGAGAAATCTGGTGAATTACTGAATGGTTCTGCTTATAATC
TCGATGATGAAAACCCCATTTCTCAATACTTGATTGGCGATTCTTGTTTTCCACTATTGCCATGGCTTTTGACACCATATATGAAACTGAATGAGGAAGATGGCTCTGGT
TTTTCTAACAGAGCTTTCAATTCCACGCATAACCGTGCAATGGGGTTGCTTAATACCGCGTTTAGCAGACTCCGAGCTCGGTGGAAGCTTCTGTCAAAGCCATGGAAGGA
AGGGTGTAGAGATTTTTTCCCTTTTGTTGTATTGACTGGGTGTTTGCTGCATAATTTCCTTATTAAATGCAGTGAGAAAGTAGATGAAGAACAAGGTGGAGAAGAAGGAG
CGAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGTGAGATGGAAGATGATAGAGGAAAGGATATCAGAGATGCGCTTGCATTGCACTTGAGTAGGCTGAGC
TTTAGAAGATGA
Protein sequenceShow/hide protein sequence
MAAGGHGGDKRITRSSAVAVAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSINDLHLLPSQTLALETLICSISSSLHALSPNLPPPPPPLSPSPPRQCWFQRLL
SATSELDCDPRWNLSFRMSKSSFSLLLRLLSPIHISSSSSTISSDCALAAAIYRLAHGASYATIGRQFGIDPADACRSFYTVCKAINDKLGYLLELRSDIDRIVVGFGWI
SLPNCCGVLGIRRFGVGVEGDLLVQALVDAEGRFLDVSAGWPSFMKPETILQQSKLYDEIEKSGELLNGSAYNLDDENPISQYLIGDSCFPLLPWLLTPYMKLNEEDGSG
FSNRAFNSTHNRAMGLLNTAFSRLRARWKLLSKPWKEGCRDFFPFVVLTGCLLHNFLIKCSEKVDEEQGGEEGASCSSEEQKFPLYDGEMEDDRGKDIRDALALHLSRLS
FRR