; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G215230 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G215230
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationCiama_Chr11:28476198..28477541
RNA-Seq ExpressionCaUC11G215230
SyntenyCaUC11G215230
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.7e-22288.39Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L    P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAIN+KL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKFPLYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_004139403.1 protein ALP1-like [Cucumis sativus]1.2e-23191.54Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNAAAA  TRSK KKLD+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK
        PPPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEK
Subjt:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYN+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        DEEQD  EGASCSSEEQKFPL+DGEIGD RGKDIRD LA HLSSL++RR
Subjt:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]1.7e-23091.07Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEKL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]3.1e-22188.17Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L    P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAIN+KL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKF LYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]1.3e-23090.35Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAA----ATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLP
        MATRG+GG+KRTTRSS++NA A    ATTRSK KKLDRE+HL HQLVTLI+TTISSAHSFLSLNDLHLLPSQTLALESLL STSSSLYALSPRLPK+ LP
Subjt:  MATRGLGGEKRTTRSSAMNAAA----ATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLP

Query:  PPPPPPRP----CWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAV
        PPPPPP P    CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAV
Subjt:  PPPPPPRP----CWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG E ELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY EIE S ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELL

Query:  KGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFL
        KGPVYN+DD+KPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAMALVNTAF  LRARWKLLSKPWKEGCRD+FPFIVLTGCLLHNFL
Subjt:  KGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFL

Query:  IKCSEKLDEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        IKCSEKLDEEQD  E A CSSE+QKFPLYDG+IGD+RGKDIRD LA HLSSLS+RR
Subjt:  IKCSEKLDEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein5.6e-23291.54Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNAAAA  TRSK KKLD+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK
        PPPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEK
Subjt:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYN+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        DEEQD  EGASCSSEEQKFPL+DGEIGD RGKDIRD LA HLSSL++RR
Subjt:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A1S3C5W6 putative nuclease HARBI18.1e-23191.07Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEKL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A5D3BH79 Putative nuclease HARBI18.1e-23191.07Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEKL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A6J1FIY2 protein ALP1-like1.5e-22188.17Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L    P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAIN+KL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKF LYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A6J1HRT9 protein ALP1-like1.5e-22188.17Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L    P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL
        PPR CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAIN+KL
Subjt:  PPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPF+VLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDLE-GASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD E GAS SSEEQKFPLYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDLE-GASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI12.1e-1023.28Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGFEG------
        S ++ P+  + AAL     G+    +G   GI  A        V +A+ E+    +   +D   I       +G   +P   G +       +       
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGFEG------

Query:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF
          + + G  SL    + D  G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF

Query:  SERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHNFLIK
          RA ++TH+     + T  C  R           +G   Y P     I+L  C+LHN  ++
Subjt:  SERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHNFLIK

Q8BR93 Putative nuclease HARBI11.4e-0923.46Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGFEG----
        S ++ P+  + AAL     G+    +G   GI  A        V +A+ E+    +     +D   V       +G   +P   GV        +     
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGFEG----

Query:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS
            + + G  SL    + D  G  + V   WP S++   +L++S L ++ E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS

Query:  GFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHN
            RA ++TH+     + T  C  R           +G   Y P     I+L  C+LHN
Subjt:  GFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 16.3e-3129.25Show/hide
Query:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  +      +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPI
         F     LPNCCG +           +          KN S+ +Q + D E RFL++  GWP  M    +L+ S  +   EN+ ++L G    +     I
Subjt:  GF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDL
         +Y++G   +PLLPWL+TP+    + D    S  AFN  H +  ++  TAF  L+  W++LSK      R   P I+L  CLLHN +I C + L E+  L
Subjt:  PQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDL

Query:  EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
         G   S    ++      +    G ++R  L  HL
Subjt:  EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

Q9M2U3 Protein ALP1-like3.2e-3528.21Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +      +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV
          +E   + L G    + +   + +Y++GDS FPLLPWLLTPY    +   +   +  FN  H+ A      A   L+ RW++++       R+  P I+
Subjt:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
           CLLHN +I   ++  ++Q L        +  +     ++ D     +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.7e-7641.94Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S                LP  P P           WF RFL++ +E + DPRW L FRMS
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS

Query:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL
        KS+F  L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A  SF+ VCK INEKL         +D     F    LPNC GV+
Subjt:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL

Query:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN
        G  RF  +G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    + +   +P+Y++GDSC PLLPWL+TPY   +
Subjt:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN

Query:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL
        +E+S  F E  FN+  +  +  V  AF  +RARW++L K WK    ++ PF++ TGCLLHNFL+   +  D  E+ + G              +E++   
Subjt:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL

Query:  YDGEIGDNRGKDIRDTLASHLS
        ++GE      K IRD +A +LS
Subjt:  YDGEIGDNRGKDIRDTLASHLS

AT1G72270.2 LOCATED IN: mitochondrion4.6e-7741.82Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S                LP  P P           WF RFL++ +E + DPRW L FRMS
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS

Query:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL
        KS+F  L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A  SF+ VCK INEKL         +D     F    LPNC GV+
Subjt:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL

Query:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN
        G  RF  +G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    + +   +P+Y++GDSC PLLPWL+TPY   +
Subjt:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN

Query:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL
        +E+S  F E  FN+  +  +  V  AF  +RARW++L K WK    ++ PF++ TGCLLHNFL+   +  D  E+ + G              +E++   
Subjt:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL

Query:  YDGEIGDNRGKDIRDTLASHLSSLSFRR
        ++GE      K IRD +A +LS +S  R
Subjt:  YDGEIGDNRGKDIRDTLASHLSSLSFRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases2.3e-3628.21Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +      +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV
          +E   + L G    + +   + +Y++GDS FPLLPWLLTPY    +   +   +  FN  H+ A      A   L+ RW++++       R+  P I+
Subjt:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
           CLLHN +I   ++  ++Q L        +  +     ++ D     +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)4.5e-3229.25Show/hide
Query:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  +      +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPI
         F     LPNCCG +           +          KN S+ +Q + D E RFL++  GWP  M    +L+ S  +   EN+ ++L G    +     I
Subjt:  GF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDL
         +Y++G   +PLLPWL+TP+    + D    S  AFN  H +  ++  TAF  L+  W++LSK      R   P I+L  CLLHN +I C + L E+  L
Subjt:  PQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDL

Query:  EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
         G   S    ++      +    G ++R  L  HL
Subjt:  EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

AT5G12010.1 unknown protein1.4e-2026.8Show/hide
Query:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSF-YAVCKAINE----------------KLGHLL
        +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+     CH     VCKAI +                 +    
Subjt:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSF-YAVCKAINE----------------KLGHLL

Query:  ELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNG----SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        E  S I  +V       +P     + +  + F      +N     S+ +QA+V+ +G F D+  GWP SM  + +L +S LY    N   LLKG      
Subjt:  ELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNG----SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
               ++ G    PLL W+L PY + N      +++ AFN   +    +   AF  L+ RW  L K  +   +D  P ++   C+LHN      EK++
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDLE
         E  +E
Subjt:  EEQDLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAGAGGACTCGGCGGCGAGAAGAGGACAACCAGAAGCTCCGCCATGAACGCCGCCGCCGCCACTACCAGAAGCAAGACCAAGAAACTTGACAGAGAGAACCA
TCTCAACCATCAACTGGTAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCCTCTCTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCACCGCCGCCGCCTCCGCCGCGACCATGCTGGTTCCAACGCTTC
CTCTCTGCGACATCCGAGGTCGATTGCGATCCGAGGTGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCAGAGCTC
CTCATCCTCTTCAGTTCCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTTGGGATCGATTCCGCTG
ATGCTTGCCACTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGTTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGG
GAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTCAAAG
GTCCTGTTTACAATGTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAACTGAACGAG
GAAGATAGCTCTGGCTTTTCTGAACGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGGTGGAAGCTTCTGTC
AAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACCGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATGAAGAGCAAG
ATCTTGAAGGAGCAAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGATATCAGAGATACGCTTGCCTCGCACTTG
AGTAGCCTGAGCTTCAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCAGAGGACTCGGCGGCGAGAAGAGGACAACCAGAAGCTCCGCCATGAACGCCGCCGCCGCCACTACCAGAAGCAAGACCAAGAAACTTGACAGAGAGAACCA
TCTCAACCATCAACTGGTAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCCTCTCTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCACCGCCGCCGCCTCCGCCGCGACCATGCTGGTTCCAACGCTTC
CTCTCTGCGACATCCGAGGTCGATTGCGATCCGAGGTGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCAGAGCTC
CTCATCCTCTTCAGTTCCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTTGGGATCGATTCCGCTG
ATGCTTGCCACTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGTTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGG
GAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTCAAAG
GTCCTGTTTACAATGTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAACTGAACGAG
GAAGATAGCTCTGGCTTTTCTGAACGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGGTGGAAGCTTCTGTC
AAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACCGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATGAAGAGCAAG
ATCTTGAAGGAGCAAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGATATCAGAGATACGCTTGCCTCGCACTTG
AGTAGCCTGAGCTTCAGAAGATGA
Protein sequenceShow/hide protein sequence
MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPPRPCWFQRF
LSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWI
SLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNE
EDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
SSLSFRR