CuGenDBv2

IVF0001473 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0001473
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: chr05:25458361..25459988
RNA-Seq Expression: IVF0001473
Synteny: IVF0001473
Gene Ontology terms: GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
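
The genome location above uses a chromosome:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the span can be parsed and measured as follows. The names parse_location and span_length are hypothetical helpers, not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr05:25458361..25459988")
print(chrom, span_length(start, end))  # chr05 1628
```

Under a 0-based, half-open convention the length would instead be end - start, so the convention should be confirmed before relying on the number.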


Homology
GenBank top hits (e value, % identity, alignment)
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.65e-279; % identity: 86.61)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MA  G +GDKRTTRSSA+NA A  TRSKAKK D++NHL HQL+TLIETTISSA SFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL    P
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQS SS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+KL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD

Query:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD
        D KPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD

Query:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EEQD+++GASCSSEEQKFP +DGE GD RGKDIRDALALHLS LS+RR
Subjt:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_004139403.1 protein ALP1-like [Cucumis sativus] (e value: 1.89e-314; % identity: 97.09)
Query:  MATRGLAGDKRTTRSSAMNAAAA-ITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPL
        MATRGLAGDKRTTRSSAMNAAAA ITRSKAKKLDQENHLNHQLITLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAA-ITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQEEGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo] (e value: 0.0; % identity: 100)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
        KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
Subjt:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

Query:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_023536803.1 protein ALP1-like [Cucurbita pepo subsp. pepo] (e value: 3.09e-278; % identity: 86.61)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MA  G +GDKRTTRSSA+NA A  TRSKAKK D++NHL HQL+TLIETTISSA SFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL    P
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQS SS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+KL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD

Query:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD
        D KPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL 
Subjt:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD

Query:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EEQD+++GASCSSEEQKFP +DGE GD RGKDIRDALALHLS LS+RR
Subjt:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_038890100.1 protein ALP1-like [Benincasa hispida] (e value: 3.04e-290; % identity: 89.47)
Query:  MATRGLAGDKRTTRSSAMNAAAAIT----RSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP
        MATRG+ GDKRTTRSS++NA AA T    RSKAKKLD+E+HL HQL+TLI+TTISSA SFLSLNDLHLLPSQTLALESLL STSSSL+ALSPRLPKL+LP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAIT----RSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP

Query:  SPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
         P PPP    RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
Subjt:  SPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG E EL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYEEIEKS+ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELL

Query:  KGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFL
        KGPVYNLDD+KPIPQYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAMALVNTAF RLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFL
Subjt:  KGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFL

Query:  IKCSEKLDEEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        IKCSEKLDEEQDQEE A CSSE+QKFP +DG+IGD RGKDIRDALALHLSSLSYRR
Subjt:  IKCSEKLDEEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LFB5 DDE Tnp4 domain-containing protein (e value: 6.8e-246; % identity: 97.09)
Query:  MATRGLAGDKRTTRSSAMN-AAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPL
        MATRGLAGDKRTTRSSAMN AAAAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMN-AAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQEEGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A1S3C5W6 putative nuclease HARBI1 (e value: 1.6e-255; % identity: 100)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
        KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
Subjt:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

Query:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A5D3BH79 Putative nuclease HARBI1 (e value: 1.6e-255; % identity: 100)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
        KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
Subjt:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

Query:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  QDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A6J1FIY2 protein ALP1-like (e value: 3.5e-218; % identity: 86.38)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MA  G +GDKRTTRSSA+NA A  TRSKAKK D++NHL HQL+TLIETTISSA SFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL    P
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQS SS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+KL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD

Query:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD
        D KPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD

Query:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EEQD+++GASCSSEEQKF  +DGE GD RGKDIRDALALHLS LS+RR
Subjt:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A6J1HRT9 protein ALP1-like (e value: 3.5e-218; % identity: 86.38)
Query:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP
        MA  G +GDKRTTRSSA+NA A  TRSKAKK D++NHL HQL+TLIETTISSA SFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL    P
Subjt:  MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLP

Query:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL
        PPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQS SS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+KL
Subjt:  PPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKL

Query:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD
        GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Subjt:  GHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD

Query:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD
        D KPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF ERAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPF++LTGCLL NFLIKCSEKL+
Subjt:  DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD

Query:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR
        EEQD+E+GAS SSEEQKFP +DGE GD RGKDIRDALALHLS LS+RR
Subjt:  EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR

SwissProt top hits (e value, % identity, alignment)
B0BN95 Putative nuclease HARBI1 (e value: 2.1e-10; % identity: 23.66)
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRF
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +D   I       +G   +P   G +               
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRF

Query:  GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF
         +       SL    + D  G  + V   WP S++   +L+QS L  + E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF

Query:  RERAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK
          RA ++TH      + T  CR R    ++  L   P K         IIL  C+L N  ++
Subjt:  RERAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK

Q8BR93 Putative nuclease HARBI1 (e value: 3.0e-09; % identity: 23.46)
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLR
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +     +D   V       +G   +P   GV              
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLR

Query:  RFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSS
           +       SL    + D  G  + V   WP S++   +L++S L  + E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  RFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSS

Query:  GFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIILTGCLLQN
            RA ++TH      + T  CR R           +G   + P     IIL  C+L N
Subjt:  GFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIILTGCLLQN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (e value: 4.5e-29; % identity: 30.98)
Query:  FRMSKSSFSLLLRLLSP--IQSPSSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+    I  P S  +  +  L       A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLLSP--IQSPSSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  ++  E ++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPI

Query:  PQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
         +Y++G   +PL PWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Subjt:  PQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

Q9M2U3 Protein ALP1-like (e value: 1.8e-33; % identity: 29.78)
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSPSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P     S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSPSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y+ 
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEE

Query:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT
        +EK    L G    L +   + +Y++GDS FPL PWLLTPY    +   +   +  FN  H  A      A  +L+ RW++++       R+  P II  
Subjt:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT

Query:  GCLLQNFLIKCSEKLDEEQ
         CLL N +I   ++  ++Q
Subjt:  GCLLQNFLIKCSEKLDEEQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) (e value: 1.6e-74; % identity: 42.89)
Query:  LNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +   SFL  NDL L PSQTL LESL+ S   S    SP     ++ +        WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL+   E   E+L G    L +   +P+Y++GDSC PL PWL+TPY   ++E+S  F
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF

Query:  RERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPPFDGEIGD
        RE  FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   +  D         E  D  E      +E++   F+GE   
Subjt:  RERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPPFDGEIGD

Query:  GRGKDIRDALALHLS
           K IRDA+A +LS
Subjt:  GRGKDIRDALALHLS

AT1G72270.2 LOCATED IN: mitochondrion (e value: 4.3e-75; % identity: 42.76)
Query:  LNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +   SFL  NDL L PSQTL LESL+ S   S    SP     ++ +        WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL+   E   E+L G    L +   +P+Y++GDSC PL PWL+TPY   ++E+S  F
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGF

Query:  RERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPPFDGEIGD
        RE  FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   +  D         E  D  E      +E++   F+GE   
Subjt:  RERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPPFDGEIGD

Query:  GRGKDIRDALALHLSSLSYRR
           K IRDA+A +LS +S  R
Subjt:  GRGKDIRDALALHLSSLSYRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases (e value: 1.3e-34; % identity: 29.78)
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSPSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P     S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSPSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y+ 
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEE

Query:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT
        +EK    L G    L +   + +Y++GDS FPL PWLLTPY    +   +   +  FN  H  A      A  +L+ RW++++       R+  P II  
Subjt:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT

Query:  GCLLQNFLIKCSEKLDEEQ
         CLL N +I   ++  ++Q
Subjt:  GCLLQNFLIKCSEKLDEEQ

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value: 3.2e-30; % identity: 30.98)
Query:  FRMSKSSFSLLLRLLSP--IQSPSSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+    I  P S  +  +  L       A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLLSP--IQSPSSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  ++  E ++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPI

Query:  PQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
         +Y++G   +PL PWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Subjt:  PQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

AT5G12010.1 unknown protein (e value: 6.7e-20; % identity: 25.33)
Query:  SFRMSKSSFSLLLRLLSPIQSPSS----SSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE
        +FRMSKS+F L+   L+   +       +++P    +A  ++RLA G   + V ++FG+  +   +    VCKAI +                 +    E
Subjt:  SFRMSKSSFSLLLRLLSPIQSPSS----SSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE

Query:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE
          S I  +V       +P     + +     +R     +  + S+ +QA+V+ +G F D+  GWP SM    +L +S LY+       LLKG        
Subjt:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
             ++ G    PL  W+L PY + N      + + AFN        +   AF RL+ RW  L K  +   +D  P ++   C+L N      EK++ E
Subjt:  KPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
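
In the alignments above, the unlabeled middle line of each block is the match line. In standard BLAST-style output, a letter there marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch, assuming that convention applies to this page, counts identities for a single block (the last block of the KAG6586365.1 alignment). The function name midline_stats is hypothetical, and the result describes only this one block, not the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline: str, aligned_length: int) -> tuple[int, int]:
    """Return (identical, positive) counts for one alignment block.

    The midline is padded with spaces because trailing mismatches
    may have been stripped when the page was copied.
    """
    padded = midline.ljust(aligned_length)
    identical = sum(ch.isalpha() for ch in padded)
    positive = identical + padded.count("+")  # positives include identities
    return identical, positive


query = "EEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLSSLSYRR"
midline = "EEQD+++GASCSSEEQKFP +DGE GD RGKDIRDALALHLS LS+RR"

identical, positive = midline_stats(midline, len(query))
percent_identity = 100 * identical / len(query)
print(identical, positive, len(query), percent_identity)  # 39 44 48 81.25
```

Summing the identical counts and aligned lengths over every block of a hit, then dividing, would give a figure comparable to the per-hit % identity, provided gap columns are counted in the aligned length.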


Sequences
CDS sequence
ATGGCCACCAGAGGACTCGCCGGCGACAAGAGAACCACCAGAAGCTCCGCCATGAACGCGGCCGCGGCTATTACTAGAAGCAAGGCCAAGAAACTCGACCAAGAGAACCA
TCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCGCTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCCAAACTTTCCCTACCTTCGCCGCTACCTCCACCGCGCCAATGCTGGTTCCAACGCTTC
CTCTCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTTTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTTCTTTCTCCGATTCAAAGTCC
CTCATCCTCTTCAGTTCCTCCCGATTGTGCTTTGGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTATAAGGCGGTCGGGAGACGGTTTGGGATCGATTCTGCTG
ATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGAATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTT
TCTGGATGTCTCTGCAGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGAAGAAATCGAGAAATCTAGTGAATTACTCAAAGGTCCTG
TTTATAATCTTGACGATGAAAAACCCATTCCCCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTTCCATGGCTTTTGACACCTTATATAGAATTGAATGAAGAAGAT
AGCTCTGGCTTTCGTGAGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACC
ATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAG
AAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAGAAGTTTCCTCCTTTTGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGT
AGCCTGAGCTACAGAAGATGA
mRNA sequence
ATGGCCACCAGAGGACTCGCCGGCGACAAGAGAACCACCAGAAGCTCCGCCATGAACGCGGCCGCGGCTATTACTAGAAGCAAGGCCAAGAAACTCGACCAAGAGAACCA
TCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCGCTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAAT
CCCTCCTCTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCCAAACTTTCCCTACCTTCGCCGCTACCTCCACCGCGCCAATGCTGGTTCCAACGCTTC
CTCTCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTTTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTTCTTTCTCCGATTCAAAGTCC
CTCATCCTCTTCAGTTCCTCCCGATTGTGCTTTGGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTATAAGGCGGTCGGGAGACGGTTTGGGATCGATTCTGCTG
ATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGAATTGTTGTGGGATTTGGGTGGATT
TCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTT
TCTGGATGTCTCTGCAGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGAAGAAATCGAGAAATCTAGTGAATTACTCAAAGGTCCTG
TTTATAATCTTGACGATGAAAAACCCATTCCCCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTTCCATGGCTTTTGACACCTTATATAGAATTGAATGAAGAAGAT
AGCTCTGGCTTTCGTGAGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACC
ATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAG
AAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAGAAGTTTCCTCCTTTTGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGT
AGCCTGAGCTACAGAAGATGATTGCTTTGAACACCTTGGTAATTTTAATCTTTCATCAGCTGTATATATTCCTCCCTGATCCTTTAGAACATTTTGTTAAAACTGCAGAT
TGTAAACCCTGATTCTCTAGAAGTAGATCTATTTTTTTGTCAGCCATTAGTTTCCTAGTGATTTTATACTTCTTATTGTTAGAGAAGGGATGTTTACACTTATCGAACAT
GAAAGTGACTTGAAATCATTCACTCAGTATGTAAGTTTAAAAAAATCTAGTTCATGATTTGGGTCCCTCTGAATGCAATGATTTTGAC
Protein sequence
MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRF
LSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWI
SLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLFPWLLTPYIELNEED
SSGFRERAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPFDGEIGDGRGKDIRDALALHLS
SLSYRR
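
The CDS and protein sequences above can be cross-checked by translation. The sketch below assumes the standard genetic code (NCBI translation table 1) and, to stay short, translates only the first and last 21 nucleotides of the CDS as copied from this page. The names CODON_TABLE and translate are illustrative, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCCACCAGAGGACTCGCC"  # first 21 nt of the CDS
cds_end = "AGCCTGAGCTACAGAAGATGA"    # last 21 nt, ending in the TGA stop

print(translate(cds_start))  # MATRGLA
print(translate(cds_end))    # SLSYRR
```

Passing the full CDS to translate and comparing the result with the listed protein sequence would extend the same check to the whole gene.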