; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G03700 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G03700
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDDE Tnp4 domain-containing protein
Genome locationClcChr11:3553557..3554900
RNA-Seq ExpressionClc11G03700
SyntenyClc11G03700
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.1e-22188.17Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L PPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
          R CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAIN+KL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKFPLYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_004139403.1 protein ALP1-like [Cucumis sativus]4.8e-23091.09Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNAAAA  TRSK KKLD+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEK
        PP R CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAINEK
Subjt:  PPSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYN+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        DEEQD  EGASCSSEEQKFPL+DGEIGD RGKDIRD LA HLSSL++RR
Subjt:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]7.0e-22990.62Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
        P R CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAINEKL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]2.7e-22087.95Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L PPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
          R CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAIN+KL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKF LYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]5.4e-22989.91Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAA----ATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLP
        MATRG+GG+KRTTRSS++NA A    ATTRSK KKLDRE+HL HQLVTLI+TTISSAHSFLSLNDLHLLPSQTLALESLL STSSSLYALSPRLPK+ LP
Subjt:  MATRGLGGEKRTTRSSAMNAAA----ATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLP

Query:  PPPPPSRP----CWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAV
        PPPPP  P    CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAV
Subjt:  PPPPPSRP----CWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG E ELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY EIE S ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELL

Query:  KGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFL
        KGPVYN+DD+KPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAMALVNTAF  LRARWKLLSKPWKEGCRD+FPFIVLTGCLLHNFL
Subjt:  KGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFL

Query:  IKCSEKLDEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        IKCSEKLDEEQD  E A CSSE+QKFPLYDG+IGD+RGKDIRD LA HLSSLS+RR
Subjt:  IKCSEKLDEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein2.3e-23091.09Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNAAAA  TRSK KKLD+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAAT-TRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEK
        PP R CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAINEK
Subjt:  PPSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYN+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNV

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        DEEQD  EGASCSSEEQKFPL+DGEIGD RGKDIRD LA HLSSL++RR
Subjt:  DEEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A1S3C5W6 putative nuclease HARBI13.4e-22990.62Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
        P R CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAINEKL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A5D3BH79 Putative nuclease HARBI13.4e-22990.62Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MATRGL G+KRTTRSSAMNAAAA TRSK KKLD+ENHLNHQL+TLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LP P P
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
        P R CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAINEKL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIE S+ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        DEKPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKLD
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  EGASCSSEEQKFP +DGEIGD RGKDIRD LA HLSSLS+RR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A6J1FIY2 protein ALP1-like1.3e-22087.95Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L PPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
          R CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAIN+KL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPFIVLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD  +GASCSSEEQKF LYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDL-EGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

A0A6J1HRT9 protein ALP1-like1.3e-22087.95Show/hide
Query:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP
        MA  G  G+KRTTRSSA+NA A TTRSK KK DR+NHL HQLVTLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPK+ L PPP 
Subjt:  MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPP

Query:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL
          R CWFQRFLSAT+EVDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSA AC SFYAVCKAIN+KL
Subjt:  PSRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+LLGK+GSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIE S ELLKGPVYN+D
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
        D KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGF ERAFNSTHNRAM LVNTAFC +RARWKLLSKPWKE CRD+FPF+VLTGCLLHNFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDLE-GASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR
        EEQD E GAS SSEEQKFPLYDGE GD+RGKDIRD LA HLS LSFRR
Subjt:  EEQDLE-GASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLSFRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI14.7e-1023.28Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGFEG------
        S ++ P+  + AAL     G+    +G   GI  A        V +A+ E+    +   +D   I       +G   +P   G +       +       
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVLGLRRFGFEG------

Query:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF
          + + G  SL    + D  G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGF

Query:  SERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHNFLIK
          RA ++TH+     + T  C  R           +G   Y P     I+L  C+LHN  ++
Subjt:  SERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHNFLIK

Q8BR93 Putative nuclease HARBI13.0e-0923.46Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGFEG----
        S ++ P+  + AAL     G+    +G   GI  A        V +A+ E+    +     +D   V       +G   +P   GV        +     
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVLGLRRFGFEG----

Query:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS
            + + G  SL    + D  G  + V   WP S++   +L++S L ++ E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  --ELLGKNG--SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSS

Query:  GFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHN
            RA ++TH+     + T  C  R           +G   Y P     I+L  C+LHN
Subjt:  GFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFP----FIVLTGCLLHN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.8e-3028.65Show/hide
Query:  PPPPPSRPC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS
        P  P +  C W+  F     S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  
Subjt:  PPPPPSRPC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS

Query:  AHACHSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM
        +      +   +A+ E+  H L       I+ I   F     LPNCCG +           +          KN S+ +Q + D E RFL++  GWP  M
Subjt:  AHACHSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM

Query:  KPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPW
            +L+ S  +   EN+ ++L G    +     I +Y++G   +PLLPWL+TP+    + D    S  AFN  H +  ++  TAF  L+  W++LSK  
Subjt:  KPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPW

Query:  KEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
            R   P I+L  CLLHN +I C + L E+  L G   S    ++      +    G ++R  L  HL
Subjt:  KEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

Q9M2U3 Protein ALP1-like7.2e-3528.21Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +      +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV
          +E   + L G    + +   + +Y++GDS FPLLPWLLTPY    +   +   +  FN  H+ A      A   L+ RW++++       R+  P I+
Subjt:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
           CLLHN +I   ++  ++Q L        +  +     ++ D     +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)3.5e-7742.18Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPS-------RPCWFQRFLSATSEVDCDPRWNLSFRMS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S                LP  P PS          WF RFL++ +E + DPRW L FRMS
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPS-------RPCWFQRFLSATSEVDCDPRWNLSFRMS

Query:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-AHACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL
        KS+F  L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A  SF+ VCK INEKL         +D     F    LPNC GV+
Subjt:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-AHACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL

Query:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN
        G  RF  +G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    + +   +P+Y++GDSC PLLPWL+TPY   +
Subjt:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN

Query:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL
        +E+S  F E  FN+  +  +  V  AF  +RARW++L K WK    ++ PF++ TGCLLHNFL+   +  D  E+ + G              +E++   
Subjt:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL

Query:  YDGEIGDNRGKDIRDTLASHLS
        ++GE      K IRD +A +LS
Subjt:  YDGEIGDNRGKDIRDTLASHLS

AT1G72270.2 LOCATED IN: mitochondrion9.2e-7842.06Show/hide
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPS-------RPCWFQRFLSATSEVDCDPRWNLSFRMS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S                LP  P PS          WF RFL++ +E + DPRW L FRMS
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPS-------RPCWFQRFLSATSEVDCDPRWNLSFRMS

Query:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-AHACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL
        KS+F  L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A  SF+ VCK INEKL         +D     F    LPNC GV+
Subjt:  KSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-AHACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL

Query:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN
        G  RF  +G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    + +   +P+Y++GDSC PLLPWL+TPY   +
Subjt:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN

Query:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL
        +E+S  F E  FN+  +  +  V  AF  +RARW++L K WK    ++ PF++ TGCLLHNFL+   +  D  E+ + G              +E++   
Subjt:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDE-EQDLEGASC---------SSEEQKFPL

Query:  YDGEIGDNRGKDIRDTLASHLSSLSFRR
        ++GE      K IRD +A +LS +S  R
Subjt:  YDGEIGDNRGKDIRDTLASHLSSLSFRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases5.1e-3628.21Show/hide
Query:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +      +   +++
Subjt:  WFQRFLSATSEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE   KN S+ +QA+VD + RFLDV AGWP S+  + +L+ S  Y
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY

Query:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV
          +E   + L G    + +   + +Y++GDS FPLLPWLLTPY    +   +   +  FN  H+ A      A   L+ RW++++       R+  P I+
Subjt:  AEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIV

Query:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
           CLLHN +I   ++  ++Q L        +  +     ++ D     +RD L+  L
Subjt:  LTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.3e-3128.65Show/hide
Query:  PPPPPSRPC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS
        P  P +  C W+  F     S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  
Subjt:  PPPPPSRPC-WFQRF----LSATSEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS

Query:  AHACHSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM
        +      +   +A+ E+  H L       I+ I   F     LPNCCG +           +          KN S+ +Q + D E RFL++  GWP  M
Subjt:  AHACHSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRFGFEGELL---------GKNGSLLVQALVDAEGRFLDVSAGWPSSM

Query:  KPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPW
            +L+ S  +   EN+ ++L G    +     I +Y++G   +PLLPWL+TP+    + D    S  AFN  H +  ++  TAF  L+  W++LSK  
Subjt:  KPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPW

Query:  KEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
            R   P I+L  CLLHN +I C + L E+  L G   S    ++      +    G ++R  L  HL
Subjt:  KEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL

AT5G12010.1 unknown protein3.0e-2026.8Show/hide
Query:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSF-YAVCKAINE----------------KLGHLL
        +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+     CH     VCKAI +                 +    
Subjt:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSF-YAVCKAINE----------------KLGHLL

Query:  ELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNG----SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD
        E  S I  +V       +P     + +  + F      +N     S+ +QA+V+ +G F D+  GWP SM  + +L +S LY    N   LLKG      
Subjt:  ELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNG----SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVD

Query:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD
               ++ G    PLL W+L PY + N      +++ AFN   +    +   AF  L+ RW  L K  +   +D  P ++   C+LHN      EK++
Subjt:  DEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLD

Query:  EEQDLE
         E  +E
Subjt:  EEQDLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAGAGGACTCGGCGGCGAGAAGAGGACAACCAGAAGCTCCGCCATGAACGCCGCCGCCGCCACTACCAGAAGCAAGACCAAGAAACTTGACAGAGAGAACCA
TCTCAACCATCAACTGGTAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCCTCTCTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCACCGCCGCCGCCTCCGTCGCGACCATGCTGGTTCCAACGCTTC
CTCTCTGCGACATCCGAGGTCGATTGCGATCCGAGGTGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCAGAGCTC
CTCATCCTCTTCAGTTCCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTTGGGATCGATTCCGCTC
ATGCTTGCCACTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGTTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGG
GAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTCAAAG
GTCCTGTTTACAATGTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAACTGAACGAG
GAAGATAGCTCTGGCTTTTCTGAACGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGGTGGAAGCTTCTGTC
AAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACCGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATGAAGAGCAAG
ATCTTGAAGGAGCAAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGATATCAGAGATACGCTTGCCTCGCACTTG
AGTAGCCTGAGCTTCAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCAGAGGACTCGGCGGCGAGAAGAGGACAACCAGAAGCTCCGCCATGAACGCCGCCGCCGCCACTACCAGAAGCAAGACCAAGAAACTTGACAGAGAGAACCA
TCTCAACCATCAACTGGTAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCCTCTCTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCACCGCCGCCGCCTCCGTCGCGACCATGCTGGTTCCAACGCTTC
CTCTCTGCGACATCCGAGGTCGATTGCGATCCGAGGTGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCAGAGCTC
CTCATCCTCTTCAGTTCCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTTGGGATCGATTCCGCTC
ATGCTTGCCACTCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGTTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGG
GAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTCAAAG
GTCCTGTTTACAATGTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAACTGAACGAG
GAAGATAGCTCTGGCTTTTCTGAACGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGGTGGAAGCTTCTGTC
AAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACCGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAGAAACTAGATGAAGAGCAAG
ATCTTGAAGGAGCAAGTTGTTCGAGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGATATCAGAGATACGCTTGCCTCGCACTTG
AGTAGCCTGAGCTTCAGAAGATGA
Protein sequenceShow/hide protein sequence
MATRGLGGEKRTTRSSAMNAAAATTRSKTKKLDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPPSRPCWFQRF
LSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSAHACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWI
SLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNVDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNE
EDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDLEGASCSSEEQKFPLYDGEIGDNRGKDIRDTLASHL
SSLSFRR