; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006611 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006611
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationChr07:20303814..20305169
RNA-Seq ExpressionHG10006611
SyntenyHG10006611
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]6.0e-22889.8Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MA  G  G+KRTTRSSA+N A A TTRSKAKK D+++HL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKL+L    
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
          PPPRQCWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPI+SSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        +KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKS ELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDD KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAM LVNTAFC++RARWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KL+EEQD+++GASCSSEEQKFPLYDGE GDDRGKDIRDALALHLS LSFRR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

XP_004139403.1 protein ALP1-like [Cucumis sativus]1.9e-23793.79Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MATRGL G+KRTTRSSAMNAAAAA TRSKAKKLDQE+HLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL ALSPRLPKL+L  PP
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
        P PPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+SS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKSSELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLD+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KLDEEQDQEEGASCSSEEQKFPL+DGEIGD RGKDIRDALALHLSSL++RR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]2.1e-23392.9Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MATRGL G+KRTTRSSAMN AAAA TRSKAKKLDQE+HLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL ALSPRLPKL+L  P 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
        P PPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+S SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKSSELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDDEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KLDEEQDQEEGASCSSEEQKFP +DGEIGD RGKDIRDALALHLSSLS+RR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]5.1e-22789.58Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MA  G  G+KRTTRSSA+N A A TTRSKAKK D+++HL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKL+L    
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
          PPPRQCWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPI+SSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        +KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKS ELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDD KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAM LVNTAFC++RARWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KL+EEQD+++GASCSSEEQKF LYDGE GDDRGKDIRDALALHLS LSFRR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]1.4e-24093.86Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAA---ATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLAL-
        MATRG+GG+KRTTRSS++NA AA   ATTRSKAKKLD+ESHL HQLVTLI+TTISSAHSFLSLNDLHLLPSQTLALESLL STSSSL ALSPRLPKL L 
Subjt:  MATRGLGGEKRTTRSSAMNAAAA---ATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLAL-

Query:  -PPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
         PPPPPPPPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+SSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
Subjt:  -PPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVE ELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY EIEKS+ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELL

Query:  KGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFL
        KGPVYNLDD+KPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAF RLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFL
Subjt:  KGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFL

Query:  IKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        IKCSEKLDEEQDQEE A CSSE+QKFPLYDG+IGDDRGKDIRDALALHLSSLS+RR
Subjt:  IKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein9.0e-23893.79Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MATRGL G+KRTTRSSAMNAAAAA TRSKAKKLDQE+HLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL ALSPRLPKL+L  PP
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
        P PPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+SS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKSSELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLD+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KLDEEQDQEEGASCSSEEQKFPL+DGEIGD RGKDIRDALALHLSSL++RR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

A0A1S3C5W6 putative nuclease HARBI11.0e-23392.9Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MATRGL G+KRTTRSSAMN AAAA TRSKAKKLDQE+HLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL ALSPRLPKL+L  P 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
        P PPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+S SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKSSELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDDEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KLDEEQDQEEGASCSSEEQKFP +DGEIGD RGKDIRDALALHLSSLS+RR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

A0A5D3BH79 Putative nuclease HARBI11.0e-23392.9Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MATRGL G+KRTTRSSAMN AAAA TRSKAKKLDQE+HLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL ALSPRLPKL+L  P 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
        P PPPRQCWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI+S SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKSSELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDDEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KLDEEQDQEEGASCSSEEQKFP +DGEIGD RGKDIRDALALHLSSLS+RR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

A0A6J1FIY2 protein ALP1-like2.4e-22789.58Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MA  G  G+KRTTRSSA+N A A TTRSKAKK D+++HL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKL+L    
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
          PPPRQCWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPI+SSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        +KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKS ELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDD KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAM LVNTAFC++RARWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KL+EEQD+++GASCSSEEQKF LYDGE GDDRGKDIRDALALHLS LSFRR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

A0A6J1HRT9 protein ALP1-like2.4e-22789.58Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP
        MA  G  G+KRTTRSSA+N A A TTRSKAKK D+++HL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKL+L    
Subjt:  MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPP

Query:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
          PPPRQCWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPI+SSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN
Subjt:  PPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN

Query:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY
        +KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKS ELLKGPVY
Subjt:  EKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVY

Query:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE
        NLDD KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAM LVNTAFC++RARWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSE
Subjt:  NLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSE

Query:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR
        KL+EEQD+E+GAS SSEEQKFPLYDGE GDDRGKDIRDALALHLS LSFRR
Subjt:  KLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSSLSFRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI13.3e-1124.05Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGVEG------
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +D   I       +G   +P   G +      ++       
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGVEG------

Query:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF
          + + G  SL    + D  G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF

Query:  PERAFNSTHNRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIK
          RA ++TH+     + T  CR R    ++  L   P K         I+L  C+LHN  ++
Subjt:  PERAFNSTHNRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIK

Q8BR93 Putative nuclease HARBI12.1e-1023.85Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGVEG----
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +     +D   V       +G   +P   GV       ++     
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGVEG----

Query:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS
            + + G  SL    + D  G  + V   WP S++   +L++S L ++ E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS

Query:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIVLTGCLLHN
            RA ++TH+     + T  CR R           +G   + P     I+L  C+LHN
Subjt:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIVLTGCLLHN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.7e-2829.22Show/hide
Query:  PPPPPPRQC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS
        P  P    C W+  F     S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  
Subjt:  PPPPPPRQC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS

Query:  ADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGVEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM
        +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +      +    +          KN S+ +Q + D E RFL++  GWP  M
Subjt:  ADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGVEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM

Query:  KPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPW
            +L+ S  + ++ +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK  
Subjt:  KPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPW

Query:  KEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE
            R   P I+L  CLLHN +I C + L E+
Subjt:  KEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE

Q9M2U3 Protein ALP1-like1.7e-3628.93Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIESSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIESSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +  ++GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIV
          +EK    L G    L +   + +Y++GDS FPLLPWLLTPY    +   +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Subjt:  AEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA
           CLLHN +I       E+Q  ++       +  +     ++ D+    +RD L+
Subjt:  LTGCLLHNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.4e-7843.65Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S           LP    P           WF RFL++ +E + DPRW L FRMSKS+F 
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFS

Query:  LLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRF
         L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF
Subjt:  LLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRF

Query:  GVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS
         V+G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    L +   +P+Y++GDSC PLLPWL+TPY   ++E+S 
Subjt:  GVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS

Query:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLYDGEI
         F E  FN+  +  +  V  AF ++RARW++L K WK    +F PF++ TGCLLHNFL+   +  D         E  D  E      +E++   ++GE 
Subjt:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLYDGEI

Query:  GDDRGKDIRDALALHLS
          +  K IRDA+A +LS
Subjt:  GDDRGKDIRDALALHLS

AT1G72270.2 LOCATED IN: mitochondrion3.8e-7943.5Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S           LP    P           WF RFL++ +E + DPRW L FRMSKS+F 
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPPPPPPPRQCWFQRFLSATSEVDCDPRWNLSFRMSKSSFS

Query:  LLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRF
         L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF
Subjt:  LLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRF

Query:  GVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS
         V+G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    L +   +P+Y++GDSC PLLPWL+TPY   ++E+S 
Subjt:  GVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS

Query:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLYDGEI
         F E  FN+  +  +  V  AF ++RARW++L K WK    +F PF++ TGCLLHNFL+   +  D         E  D  E      +E++   ++GE 
Subjt:  GFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLYDGEI

Query:  GDDRGKDIRDALALHLSSLSFRR
          +  K IRDA+A +LS +S  R
Subjt:  GDDRGKDIRDALALHLSSLSFRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.2e-3728.93Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIESSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIESSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +  ++GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIV
          +EK    L G    L +   + +Y++GDS FPLLPWLLTPY    +   +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Subjt:  AEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA
           CLLHN +I       E+Q  ++       +  +     ++ D+    +RD L+
Subjt:  LTGCLLHNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.2e-2929.22Show/hide
Query:  PPPPPPRQC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS
        P  P    C W+  F     S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  
Subjt:  PPPPPPRQC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS

Query:  ADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGVEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM
        +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +      +    +          KN S+ +Q + D E RFL++  GWP  M
Subjt:  ADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGVEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM

Query:  KPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPW
            +L+ S  + ++ +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK  
Subjt:  KPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPW

Query:  KEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE
            R   P I+L  CLLHN +I C + L E+
Subjt:  KEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE

AT5G12010.1 unknown protein8.0e-2127Show/hide
Query:  SFRMSKSSFSLLLRLLSPI----ESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL-GHLLELRSD--IDRIVVGFG
        +FRMSKS+F L+   L+      +++  +++P    +A  ++RLA G   + V ++FG+  +   +    VCKAI + L    L+   D  +  I   F 
Subjt:  SFRMSKSSFSLLLRLLSPI----ESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL-GHLLELRSD--IDRIVVGFG

Query:  WIS-LPNCCGVLGLRRFGVEGELLG---------------KNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDE
         +S +PN  G +      +    +                 + S+ +QA+V+ +G F D+  GWP SM  + +L +S LY        LLKG        
Subjt:  WIS-LPNCCGVLGLRRFGVEGELLG---------------KNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE
             ++ G    PLL W+L PY + N      + + AFN   +    +   AF RL+ RW  L K  +   +D  P ++   C+LHN      EK++ E
Subjt:  KPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAGAGGACTCGGCGGCGAAAAGAGGACCACCAGAAGCTCTGCCATGAACGCCGCCGCCGCCGCCACTACCAGAAGCAAGGCCAAGAAACTCGACCAGGAGAG
CCATCTCAACCACCAGCTGGTAACCCTCATCGAAACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCTTCACAAACCCTCGCCCTTG
AATCCCTCCTCTGTTCCACTTCATCCTCTCTTTGCGCTCTCTCTCCTCGTCTCCCAAAACTTGCTCTACCTCCGCCTCCGCCTCCGCCTCCGCCGCGGCAATGCTGGTTC
CAACGCTTCCTTTCTGCGACATCCGAGGTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCTAT
TGAGAGCTCCTCATCCTCTTCAGTTCCGCCCGATTGTGCGTTAGCCGCTGCGCTTTTCCGATTGGCACATGGCGCGAGCTACAAGGCGGTTGGGAGGCGATTTGGGATCG
ATTCCGCTGATGCTTGCCGCTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTT
GGGTGGATTTCGCTTCCGAATTGTTGTGGGGTGTTAGGTCTTAGAAGATTTGGGGTTGAGGGTGAGCTGCTAGGCAAAAATGGATCGCTTTTGGTTCAAGCATTAGTCGA
TGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCCAGTGAAT
TACTCAAAGGCCCTGTTTATAATCTTGATGATGAAAAACCCATTCCCCAATACTTAATTGGCGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATACATGAAA
CTGAACGAGGAAGATAGCTCTGGCTTTCCTGAAAGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAA
GCTTCTGTCAAAACCATGGAAGGAAGGATGCAGAGATTTTTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATG
AAGAGCAAGATCAAGAAGAAGGAGCAAGTTGTTCAAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGTGAGATAGGAGATGATAGAGGAAAGGATATCAGAGATGCGCTT
GCCTTGCACTTGAGTAGCCTGAGCTTCAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCAGAGGACTCGGCGGCGAAAAGAGGACCACCAGAAGCTCTGCCATGAACGCCGCCGCCGCCGCCACTACCAGAAGCAAGGCCAAGAAACTCGACCAGGAGAG
CCATCTCAACCACCAGCTGGTAACCCTCATCGAAACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCTTCACAAACCCTCGCCCTTG
AATCCCTCCTCTGTTCCACTTCATCCTCTCTTTGCGCTCTCTCTCCTCGTCTCCCAAAACTTGCTCTACCTCCGCCTCCGCCTCCGCCTCCGCCGCGGCAATGCTGGTTC
CAACGCTTCCTTTCTGCGACATCCGAGGTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCTAT
TGAGAGCTCCTCATCCTCTTCAGTTCCGCCCGATTGTGCGTTAGCCGCTGCGCTTTTCCGATTGGCACATGGCGCGAGCTACAAGGCGGTTGGGAGGCGATTTGGGATCG
ATTCCGCTGATGCTTGCCGCTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTT
GGGTGGATTTCGCTTCCGAATTGTTGTGGGGTGTTAGGTCTTAGAAGATTTGGGGTTGAGGGTGAGCTGCTAGGCAAAAATGGATCGCTTTTGGTTCAAGCATTAGTCGA
TGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCCAGTGAAT
TACTCAAAGGCCCTGTTTATAATCTTGATGATGAAAAACCCATTCCCCAATACTTAATTGGCGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATACATGAAA
CTGAACGAGGAAGATAGCTCTGGCTTTCCTGAAAGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAA
GCTTCTGTCAAAACCATGGAAGGAAGGATGCAGAGATTTTTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATG
AAGAGCAAGATCAAGAAGAAGGAGCAAGTTGTTCAAGTGAGGAGCAGAAGTTTCCTCTTTATGATGGTGAGATAGGAGATGATAGAGGAAAGGATATCAGAGATGCGCTT
GCCTTGCACTTGAGTAGCCTGAGCTTCAGAAGATGA
Protein sequenceShow/hide protein sequence
MATRGLGGEKRTTRSSAMNAAAAATTRSKAKKLDQESHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLCALSPRLPKLALPPPPPPPPPRQCWF
QRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIESSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGF
GWISLPNCCGVLGLRRFGVEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMK
LNEEDSSGFPERAFNSTHNRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLYDGEIGDDRGKDIRDAL
ALHLSSLSFRR