; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy5G103120 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy5G103120
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationchrH05:16995718..16997058
RNA-Seq ExpressionChy5G103120
SyntenyChy5G103120
Gene Ontology termsGO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586365.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.86e-27787.31Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLS NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL
        DD KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL

Query:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        +EEQD++ GASCSSEEQKFPL+DGE GD RGKDIRDALALHLS LS+RR
Subjt:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_004139403.1 protein ALP1-like [Cucumis sativus]0.098.43Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLS NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF GRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE

Query:  EQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQE GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_008457314.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo]4.29e-31397.32Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLS NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLDD
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGFR RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE

Query:  EQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQE GASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  EQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_022938170.1 protein ALP1-like [Cucurbita moschata]3.07e-27687.08Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLS NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP 
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
          PRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL
        DD KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL

Query:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        +EEQD++ GASCSSEEQKF L+DGE GD RGKDIRDALALHLS LS+RR
Subjt:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

XP_038890100.1 protein ALP1-like [Benincasa hispida]9.06e-29090.13Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAI---TRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP
        MATRG+ GDKRTTRSS++NA AAA    TRSKAKKLD+E+HL HQL+TLI+TTISSAHSFLS NDLHLLPSQTLALESLL STSSSL+ALSPRLPKL+LP
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAI---TRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP

Query:  PPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
        PP PPP    RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV
Subjt:  PPLPPP----RQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV

Query:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELL
        CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG E EL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS+ELL
Subjt:  CKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELL

Query:  KGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFL
        KGPVYNLDD+KPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAF RLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFL
Subjt:  KGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFL

Query:  IKCSEKLDEEQDQEG-ASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        IKCSEKLDEEQDQE  A CSSE+QKFPL+DG+IGD RGKDIRDALALHLSSLSYRR
Subjt:  IKCSEKLDEEQDQEG-ASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

TrEMBL top hitse value%identityAlignment
A0A0A0LFB5 DDE Tnp4 domain-containing protein5.4e-25198.43Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLS NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLD+
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF GRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE

Query:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQ EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL+YRR
Subjt:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A1S3C5W6 putative nuclease HARBI16.8e-24697.32Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMN AAAAITRSKAKKLDQENHLNHQLITLIETTISSA SFLS NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLDD
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGFR RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE

Query:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQ EGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A5D3BH79 Putative nuclease HARBI16.8e-24697.32Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MATRGLAGDKRTTRSSAMN AAAAITRSKAKKLDQENHLNHQLITLIETTISSA SFLS NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PL
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLDD
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDD

Query:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE
        EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGFR RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFI+LTGCLLQNFLIKCSEKLDE
Subjt:  EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDE

Query:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        EQDQ EGASCSSEEQKFP FDGEIGDGRGKDIRDALALHLSSLSYRR
Subjt:  EQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A6J1FIY2 protein ALP1-like7.8e-21887.08Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLS NDLHLLPSQTLALES + STSSSL ALSP LPKLSL    
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL
        DD KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL

Query:  DEEQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        +EEQD+ +GASCSSEEQKF L+DGE GD RGKDIRDALALHLS LS+RR
Subjt:  DEEQDQ-EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

A0A6J1HRT9 protein ALP1-like7.8e-21887.08Show/hide
Query:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL
        MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLS NDLHLLPSQTLALES + STSSSL ALSP LPKLSL    
Subjt:  MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPL

Query:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK
        PPPRQCWFQRFLSAT++VDCDPRWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAIN+K
Subjt:  PPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL
        DD KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++RARWKLLSKPWKE CRDFFPF++LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKL

Query:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR
        +EEQD+E GAS SSEEQKFPL+DGE GD RGKDIRDALALHLS LS+RR
Subjt:  DEEQDQE-GASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLSYRR

SwissProt top hitse value%identityAlignment
B0BN95 Putative nuclease HARBI16.1e-1023.28Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRF
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +D   I       +G   +P   G +               
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRF

Query:  GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF
         +       SL    + D  G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F L  WLLTP + + E  +   
Subjt:  GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF

Query:  RGRAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIK
          RA ++TH      + T  CR R    ++  L   P K         I+L  C+L N  ++
Subjt:  RGRAFNSTHGRAMALVNTAFCRLR----ARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIK

Q8BR93 Putative nuclease HARBI14.0e-0923.08Show/hide
Query:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLR
        S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +     +D   V       +G   +P   GV              
Subjt:  SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLR

Query:  RFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSS
           +       SL    + D  G  + V   WP S++   +L++S L ++ E                 P   +L+GDS F L  WLLTP + + E  + 
Subjt:  RFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSS

Query:  GFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FILLTGCLLQN
            RA ++TH      + T  CR R           +G   + P     I+L  C+L N
Subjt:  GFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFP----FILLTGCLLQN

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.3e-2828.36Show/hide
Query:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++ +++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEEQDQ
         +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P I+L  CLL N +I C + L E+   
Subjt:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEEQDQ

Query:  EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL
         G   S    ++     +  +  G ++R  L  HL
Subjt:  EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL

Q9M2U3 Protein ALP1-like1.8e-3330.09Show/hide
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y  
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE

Query:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLT
        +EK    L G    L +   + +Y++GDS FPLLPWLLTPY    +   +      FN  H  A      A  +L+ RW++++       R+  P I+  
Subjt:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLT

Query:  GCLLQNFLIKCSEKLDEEQ
         CLL N +I   ++  ++Q
Subjt:  GCLLQNFLIKCSEKLDEEQ

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)6.2e-7442.17Show/hide
Query:  LNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++          WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL++  E   E+L G    L +   +P+Y++GDSC PLLPWL+TPY   ++E+S  F
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF

Query:  RGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLI----------KCSEKLDEEQDQEGASCSSEEQKFPLFDGEIGD
        R   FN+     +  V  AF ++RARW++L K WK    +F PF++ TGCLL NFL+          +C    +   + E      +E++   F+GE   
Subjt:  RGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLI----------KCSEKLDEEQDQEGASCSSEEQKFPLFDGEIGD

Query:  GRGKDIRDALALHLS
           K IRDA+A +LS
Subjt:  GRGKDIRDALALHLS

AT1G72270.2 LOCATED IN: mitochondrion1.2e-7442.04Show/hide
Query:  LNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++          WF RFL++ ++ + DPRW L FRMSKS+F  L
Subjt:  LNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLL

Query:  LRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF
          +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     F    LPNC GV+G  RF  
Subjt:  LRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGF

Query:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF
        +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I RQ+KL++  E   E+L G    L +   +P+Y++GDSC PLLPWL+TPY   ++E+S  F
Subjt:  EGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF

Query:  RGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLI----------KCSEKLDEEQDQEGASCSSEEQKFPLFDGEIGD
        R   FN+     +  V  AF ++RARW++L K WK    +F PF++ TGCLL NFL+          +C    +   + E      +E++   F+GE   
Subjt:  RGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLI----------KCSEKLDEEQDQEGASCSSEEQKFPLFDGEIGD

Query:  GRGKDIRDALALHLSSLSYRR
           K IRDA+A +LS +S  R
Subjt:  GRGKDIRDALALHLSSLSYRR

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.3e-3430.09Show/hide
Query:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI
        W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P      +A AL RL  G S   +G  FG++ +   +  +   +++
Subjt:  WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC--ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAI

Query:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE
         E+  H L   S +D I   F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV AGWP S+    +L+ S  Y  
Subjt:  NEKLGHLLELRSDIDRIVVGFGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAE

Query:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLT
        +EK    L G    L +   + +Y++GDS FPLLPWLLTPY    +   +      FN  H  A      A  +L+ RW++++       R+  P I+  
Subjt:  IEKSSELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLT

Query:  GCLLQNFLIKCSEKLDEEQ
         CLL N +I   ++  ++Q
Subjt:  GCLLQNFLIKCSEKLDEEQ

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)9.3e-3028.36Show/hide
Query:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV
        FR SK++FS +  L+         S + +     +  +  +A AL RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I  
Subjt:  FRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVV

Query:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPI
         F     LPNCCG +                 +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++ +++++L G    L     I
Subjt:  GF-GWISLPNCCGVLGLRRF-----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPI

Query:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEEQDQ
         +Y++G   +PLLPWL+TP+   +  DS      AFN  H +  ++  TAF +L+  W++LSK      R   P I+L  CLL N +I C + L E+   
Subjt:  PQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEEQDQ

Query:  EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL
         G   S    ++     +  +  G ++R  L  HL
Subjt:  EGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL

AT5G12010.1 unknown protein1.5e-1926Show/hide
Query:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE
        +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+  +   +    VCKAI +                 +    E
Subjt:  SFRMSKSSFSLLLRLLSPI----QSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINE----------------KLGHLLE

Query:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDE
          S I  +V       +P     + +     +R     +  + S+ +QA+V+ +G F D+  GWP SM    +L +S LY        LLKG        
Subjt:  LRSDIDRIVVGFGWISLPNCCGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDE

Query:  KPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEE
             ++ G    PLL W+L PY + N      +   AFN        +   AF RL+ RW  L K  +   +D  P +L   C+L N      EK++ E
Subjt:  KPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGCTCCGCTATGAATGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAG
GAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCGCAACGATCTTCACCTTCTTCCCTCTCAAACC
CTCGCCCTTGAATCCCTACTCTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCGCTACCTCCACCGCGCCAA
TGCTGGTTCCAACGCTTCCTATCCGCGACATCGGATGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGT
CTCCTTTCTCCGATTCAAAGCTCCTCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTT
GGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGAT
ATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCG
CTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTA
TATGCAGAAATTGAGAAATCTAGTGAATTACTCAAAGGTCCTGTTTATAATCTCGACGATGAAAAACCCATTCCCCAATACTTGATTGGTGATTCTTGCTTCCCC
CTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTCGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTG
GTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGACTTTTTCCCATTTATTTTATTGACTGGATGT
CTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAGAAGTTTCCTCTTTTTGAT
GGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAGCTACAGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGCTCCGCTATGAATGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAG
GAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCGCAACGATCTTCACCTTCTTCCCTCTCAAACC
CTCGCCCTTGAATCCCTACTCTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCGCTACCTCCACCGCGCCAA
TGCTGGTTCCAACGCTTCCTATCCGCGACATCGGATGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGT
CTCCTTTCTCCGATTCAAAGCTCCTCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTT
GGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGAT
ATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCG
CTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTA
TATGCAGAAATTGAGAAATCTAGTGAATTACTCAAAGGTCCTGTTTATAATCTCGACGATGAAAAACCCATTCCCCAATACTTGATTGGTGATTCTTGCTTCCCC
CTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTCGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTG
GTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGACTTTTTCCCATTTATTTTATTGACTGGATGT
CTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAGAAGTTTCCTCTTTTTGAT
GGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAGCTACAGAAGATGA
Protein sequenceShow/hide protein sequence
MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSRNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQ
CWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD
IDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDDEKPIPQYLIGDSCFP
LLPWLLTPYMELNEEDSSGFRGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFILLTGCLLQNFLIKCSEKLDEEQDQEGASCSSEEQKFPLFD
GEIGDGRGKDIRDALALHLSSLSYRR