; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G002290 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G002290
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationGy14Chr2:1506280..1508749
RNA-Seq ExpressionCsGy2G002290
SyntenyCsGy2G002290
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]5.79e-27986.86Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL
        D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL

Query:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        +EEQD+++GASCSSEEQKFPL+DGE GD RGKDIRDALALHLS L++RR
Subjt:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

XP_004139403.1 protein ALP1-like [Cucumis sativus]0.0100Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
Subjt:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]5.51e-31497.09Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        EQDQEEGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]9.58e-27886.64Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL
        D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL

Query:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        +EEQD+++GASCSSEEQKF L+DGE GD RGKDIRDALALHLS L++RR
Subjt:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]9.87e-29289.91Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAI---TRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP
        MATRG+ GDKRTTRSS++NA AAA    TRSKAKKLD+E+HL HQL+TLI+TTISSAHSFLSLNDLHLLPSQTLALESLL STSSSL+ALSPRLPKL+LP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAI---TRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP

Query:  PPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
        PP PPP    RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
Subjt:  PPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG E EL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS+ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELL

Query:  KGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFL
        KGPVYNLD++KPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAF RLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFL
Subjt:  KGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFL

Query:  IKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        IKCSEKLDEEQDQEE A CSSE+QKFPL+DG+IGD RGKDIRDALALHLSSL+YRR
Subjt:  IKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein0.0100Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
Subjt:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

A0A1S3C5W6 putative nuclease HARBI12.67e-31497.09Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        EQDQEEGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

A0A5D3BH79 Putative nuclease HARBI12.67e-31497.09Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDE

Query:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        EQDQEEGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

A0A6J1FIY2 protein ALP1-like4.64e-27886.64Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL
        D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL

Query:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        +EEQD+++GASCSSEEQKF L+DGE GD RGKDIRDALALHLS L++RR
Subjt:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

A0A6J1HRT9 protein ALP1-like4.64e-27886.64Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSLNDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL
        D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPF++LTGCLL NFLIKCSEKL
Subjt:  DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKL

Query:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
        +EEQD+E+GAS SSEEQKFPL+DGE GD RGKDIRDALALHLS L++RR
Subjt:  DEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI11.2e-1024.19Show/hide
Query:  LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL
        L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +D   I       +G   +P   G +
Subjt:  LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL

Query:  ----------GLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWL
                        +       SL    + D  G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F L  WL
Subjt:  ----------GLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWL

Query:  LTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK
        LTP + + E  +     RA ++TH      + T  CR R    ++  L   P K         IIL  C+L N  ++
Subjt:  LTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK

Q8BR93 Putative nuclease HARBI11.0e-0924Show/hide
Query:  LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCG
        L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +     +D   V       +G   +P   G
Subjt:  LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCG

Query:  VL----------GLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLP
        V                 +       SL    + D  G  + V   WP S++   +L++S L ++ E                 P   +L+GDS F L  
Subjt:  VL----------GLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLP

Query:  WLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIILTGCLLQN
        WLLTP + + E  +     RA ++TH      + T  CR R           +G   + P     IIL  C+L N
Subjt:  WLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FIILTGCLLQN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.7e-2930.98Show/hide
Query:  FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++ +++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
         +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Subjt:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

Q9M2U3 Protein ALP1-like1.0e-3328.81Show/hide
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y  
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE

Query:  IEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT
        +EK    L G    L     + +Y++GDS FPLLPWLLTPY    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Subjt:  IEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT

Query:  GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLN
         CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   N
Subjt:  GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLN

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)7.3e-7542.65Show/hide
Query:  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++          WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +LS       SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL++  E   E+L G    L N   +P+Y++GDSC PLLPWL+TPY   ++E+S   
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF

Query:  CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGD
            FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   +  D         E  D  E      +E++   F+GE   
Subjt:  CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGD

Query:  GRGKDIRDALALHLS
           K IRDA+A +LS
Subjt:  GRGKDIRDALALHLS

AT1G72270.2 LOCATED IN: mitochondrion3.3e-7542.28Show/hide
Query:  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++          WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +LS       SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL++  E   E+L G    L N   +P+Y++GDSC PLLPWL+TPY   ++E+S   
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF

Query:  CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGD
            FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   +  D         E  D  E      +E++   F+GE   
Subjt:  CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGD

Query:  GRGKDIRDALALHLSSLNYRR
           K IRDA+A +LS ++  R
Subjt:  GRGKDIRDALALHLSSLNYRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases7.4e-3528.81Show/hide
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y  
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE

Query:  IEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT
        +EK    L G    L     + +Y++GDS FPLLPWLLTPY    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Subjt:  IEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT

Query:  GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLN
         CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   N
Subjt:  GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLN

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.9e-3030.98Show/hide
Query:  FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++ +++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
         +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Subjt:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE

AT5G12010.1 unknown protein2.0e-1925.67Show/hide
Query:  SFRMSKSSFSLLLRLLSPI----QSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE
        +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+  +   +    VCKAI +                 +    E
Subjt:  SFRMSKSSFSLLLRLLSPI----QSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE

Query:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNE
          S I  +V       +P     + +     +R     +  + S+ +QA+V+ +G F D+  GWP SM    +L +S LY        LLKG        
Subjt:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNE

Query:  KPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE
             ++ G    PLL W+L PY + N      +   AFN        +   AF RL+ RW  L K  +   +D  P ++   C+L N      EK++ E
Subjt:  KPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAA
CCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTG
AATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGC
TTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAG
CTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTG
CTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGG
ATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAG
GTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTC
CTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAA
GATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAA
ACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATC
AAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTG
AGTAGCCTGAACTACAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
TCGTCTAATATCTAATGTATGCTGACATGGTGAAAAATTCAAAATAATTGATAAATGGAAAATTTTAATGTTCTAAAAAAATCTCTTTAGTTTATTTTTGTTAAGAATGA
AGGAGAAAAGAAATCTATTTTATGAGTTTTAAAAGTCCTTAGTTGTGTAAATTTCCTATAACACTTCCTACAAAGGGACAACACCTCAACAAAGTAAGTTGAACAAGTTC
AATTTATTATGGTATCGTTTGAATAATCCTCCCGAGAGTTTCTTGGCTCCTTCTCAATTTGACCATTGCTATCCTACCAAAACCTTCTAAGTCCAATTTACTACCAAACT
CATCTTTATTATTTATATTTTCTACTAAAATTGATAGCTATATATATATATATTTTATTAATTTTCGTAAATATATCGAAATATTTAAGATCAAAGTCTACTAAACTATG
GTTTCTAAATATAAAGTAGGGTTATCAAATCTAAAACTATAGTTTTTCAAATAAAAACATAAATCCAAGAATTGGAGTTTTTTCCATTTTTGAAATGGTTGGGGTAAATA
TTTTGGTTGTATTTTTTAAATAAAGAGAATTGTTTTTTCTAATAATTTAGAAAAAAAAAAAAAAACTTATTTTTGTTCACAAAGACAATGATGGCATTCACAGTGTGTGC
TCCCCTACCCTGTTCGCCATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAA
GAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCT
CTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGC
CAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCT
CCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGAC
GGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATT
GTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATT
AGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTA
GTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATAT
ATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCG
GTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAAC
TAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGAT
GCCCTTGCCTTGCACTTGAGTAGCCTGAACTACAGAAGATGATTGCTTTGAACACCTTGGTAATTTTAATCTTTCATCAGCTGTATATATTCCTCTCTGATCCTTTAGAA
CATTTTGTTAAAACTGCAGATTGTAAGCCCTAATTCTCTAGAAGTAGATCTTTTTTTTTTTTTTGTCAGCCATTAGTTTCCTAGTGATTTGATTCTTATTGTTAGAGAAG
GGATGTTTACACTTATCGAACATGAAAGTGACTTGAAATCATTCACTCACTATGTAAATTTAAAAGATCTAGTTCATGATTTGGGTCCCTCTGAATGCAATGATTTTGAC
ATATAAATATTCTATCAATTGAGTAGGAACTGCCTTTTGAATCAAGTCATCTTAATAAATTAATCTGAATCCCATTTTTGGAATTGTTCTAGTATAACTCGAACATGAAT
TTCAGTTGAAGTGAAATAACTATGGATGGTCATGCTTATAGAAAAAGCTC
Protein sequenceShow/hide protein sequence
MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQR
FLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGW
ISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEE
DSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL
SSLNYRR