; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017333 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017333
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDNA-dependent metalloprotease WSS1-like isoform X1
Genome locationtig00153042:243269..252358
RNA-Seq ExpressionSgr017333
SyntenySgr017333
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008237 - metallopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR013536 - WLM domain
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151616.1 DNA-dependent metalloprotease WSS1-like isoform X1 [Momordica charantia]5.2e-16286.85Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKR+WKVETLSEFYPANP LMGVNIGGGQEIKLRIRRPNNEWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGIT TG+GFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTL-NAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTL
        DDLWCGSKSLE NS   KN+ SSTG S+T  NA  P VISQIPSA S  P+QEA+D +E W CSACTLLN+PLALICEACGNRKN  NNT+TW CKFCT 
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTL-NAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTL

Query:  NNSANTERCLACGEWRYSYGPPISTQG
        NNSA+T+RCLACGEWRYSYGPPIST+G
Subjt:  NNSANTERCLACGEWRYSYGPPISTQG

XP_022928615.1 DNA-dependent metalloprotease WSS1-like isoform X1 [Cucurbita moschata]1.7e-15784.97Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP V SQIPSAASFP  QEA   LE W CSACTLLNQPLALICEACGNRKN   NT+TW CKFCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPISTQG
        NS + +RCLACGEWRYSYGPP++TQG
Subjt:  NSANTERCLACGEWRYSYGPPISTQG

XP_022967821.1 DNA-dependent metalloprotease WSS1-like isoform X1 [Cucurbita maxima]7.0e-15984.29Show/hide
Query:  QLFSHMDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHE
        +L S MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHE
Subjt:  QLFSHMDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHE

Query:  LCHIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAA
        LCHI HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAA
Subjt:  LCHIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAA

Query:  ERRLQDDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACK
        ERRL+DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP + SQIPSAASFP  QEA+  LE W CSACTLLNQPLALICEACGNRKN   NT+TWACK
Subjt:  ERRLQDDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACK

Query:  FCTLNNSANTERCLACGEWRYSYGPPISTQG
        FCTL+NS + +RCLACGEWRYSYGPP++TQG
Subjt:  FCTLNNSANTERCLACGEWRYSYGPPISTQG

XP_022967822.1 DNA-dependent metalloprotease WSS1-like isoform X2 [Cucurbita maxima]4.5e-15884.97Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP + SQIPSAASFP  QEA+  LE W CSACTLLNQPLALICEACGNRKN   NT+TWACKFCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPISTQG
        NS + +RCLACGEWRYSYGPP++TQG
Subjt:  NSANTERCLACGEWRYSYGPPISTQG

XP_023543567.1 DNA-dependent metalloprotease WSS1-like isoform X2 [Cucurbita pepo subsp. pepo]9.5e-15684.05Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNI GGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP V SQIPSAASF   QEA+  LE W CSACTLLNQPL L CEACGNRKN   NT+TWACKFCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPISTQG
        NS + +RCLACGEWRYSYGPP++TQG
Subjt:  NSANTERCLACGEWRYSYGPPISTQG

TrEMBL top hitse value%identityAlignment
A0A1S4DV14 LOW QUALITY PROTEIN: DNA-dependent metalloprotease WSS1-like1.9e-15484.57Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW V PL+KIGEDD+RKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPNNEWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFDL GRRLGGIS QPPLSSLRQT LAAAENRARGGPSGPKRLGG SN+KAVLSPIQAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKS E NSD  KNM SSTG S+  NA TP VISQIPSA S P +QEA D LE W CS CTLLN+ LALICEACGN KN   NTRTWAC FCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPIST
        NS++ E+CLACGEWRYSYGP IST
Subjt:  NSANTERCLACGEWRYSYGPPIST

A0A6J1DF71 DNA-dependent metalloprotease WSS1-like isoform X12.5e-16286.85Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKR+WKVETLSEFYPANP LMGVNIGGGQEIKLRIRRPNNEWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGIT TG+GFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTL-NAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTL
        DDLWCGSKSLE NS   KN+ SSTG S+T  NA  P VISQIPSA S  P+QEA+D +E W CSACTLLN+PLALICEACGNRKN  NNT+TW CKFCT 
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTL-NAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTL

Query:  NNSANTERCLACGEWRYSYGPPISTQG
        NNSA+T+RCLACGEWRYSYGPPIST+G
Subjt:  NNSANTERCLACGEWRYSYGPPISTQG

A0A6J1EKF1 DNA-dependent metalloprotease WSS1-like isoform X18.3e-15884.97Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP V SQIPSAASFP  QEA   LE W CSACTLLNQPLALICEACGNRKN   NT+TW CKFCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPISTQG
        NS + +RCLACGEWRYSYGPP++TQG
Subjt:  NSANTERCLACGEWRYSYGPPISTQG

A0A6J1HRV8 DNA-dependent metalloprotease WSS1-like isoform X22.2e-15884.97Show/hide
Query:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE
        MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHELCHI 
Subjt:  MDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIE

Query:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ
        HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAAERRL+
Subjt:  HGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQ

Query:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN
        DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP + SQIPSAASFP  QEA+  LE W CSACTLLNQPLALICEACGNRKN   NT+TWACKFCTL+
Subjt:  DDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACKFCTLN

Query:  NSANTERCLACGEWRYSYGPPISTQG
        NS + +RCLACGEWRYSYGPP++TQG
Subjt:  NSANTERCLACGEWRYSYGPPISTQG

A0A6J1HT53 DNA-dependent metalloprotease WSS1-like isoform X13.4e-15984.29Show/hide
Query:  QLFSHMDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHE
        +L S MDLNDLNKVW +KPL+KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYP NP LMGVNIGGGQEIKLRIRRPN+EWDFFPYEQ+LDTMLHE
Subjt:  QLFSHMDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHE

Query:  LCHIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAA
        LCHI HGPHNADFY LLDELRKECEELM KGITGTG+GFD+ GRRLGGIS Q PLSSLRQTTL AAENRARG PSGPKRLGGDS +KAVLSP+QAAAMAA
Subjt:  LCHIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAA

Query:  ERRLQDDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACK
        ERRL+DDLWCGSKSLE NSD  KNMSSSTGPS+T N  TP + SQIPSAASFP  QEA+  LE W CSACTLLNQPLALICEACGNRKN   NT+TWACK
Subjt:  ERRLQDDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTWACK

Query:  FCTLNNSANTERCLACGEWRYSYGPPISTQG
        FCTL+NS + +RCLACGEWRYSYGPP++TQG
Subjt:  FCTLNNSANTERCLACGEWRYSYGPPISTQG

SwissProt top hitse value%identityAlignment
O94580 DNA-dependent metalloprotease WSS1 homolog 21.6e-0930.13Show/hide
Query:  EDDARKVLEKIAKQ--VQPIMRKRRWKVETLSEFYPA-----NPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLL
        +D A + LE++     ++ IM   RW V  LSE  PA     +   +G+N   G  I+LR+R    +  F  Y+ V  T++HEL H  HG H++ F+ L 
Subjt:  EDDARKVLEKIAKQ--VQPIMRKRRWKVETLSEFYPA-----NPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLL

Query:  DELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGG
         +L KE +     G  G+                +   +  R   LAAAE R + G
Subjt:  DELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGG

P38838 DNA-dependent metalloprotease WSS11.1e-2636.49Show/hide
Query:  KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLLDELR
        K  ++DA  ++++IA +V  +M++  +KV  L EFYP +  L+G+N+  G +I LR+R   +E+ F P E ++ TMLHEL H   GPH+  FY  LDEL 
Subjt:  KIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLLDELR

Query:  KECEELMFKGITGTGKGFDLHGRRLGG----ISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQDDLWCGSKSLET
             +  +G+  T  G   +G+RLGG     S + P++ +  T       R +G   G     G S+I    SP + AA AAERR +DD WCG      
Subjt:  KECEELMFKGITGTGKGFDLHGRRLGG----ISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIKAVLSPIQAAAMAAERRLQDDLWCGSKSLET

Query:  NSDGVKNMSSS
        +     N+SSS
Subjt:  NSDGVKNMSSS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.9e-0723.43Show/hide
Query:  TQGEKRGLTVPQYLAQIKDIADKFSAIREPLSYRDHLGYILEGLGTEYNAFVTSIQNHTDVSSLADVRSLLLAYEARLEKQS----------AVDQLNMI
        T+G K   T+  Y+  +    D+ + + +P+ + + +  +LE L  EY   +  I       +L ++   LL +E+++   S          AV   N  
Subjt:  TQGEKRGLTVPQYLAQIKDIADKFSAIREPLSYRDHLGYILEGLGTEYNAFVTSIQNHTDVSSLADVRSLLLAYEARLEKQS----------AVDQLNMI

Query:  QANIANMNLSQSNKKFQKSSNGKNAYQKWPFPSSANRP----------QCLICGKLGHTTLVCYNRNNPIYQASSTQSSQAYFNNFQSSQPTVASTSNYL
          N  N N    N ++   +N  N+ + W   S+   P          +C ICG  GH+   C                Q + ++  S QP    T    
Subjt:  QANIANMNLSQSNKKFQKSSNGKNAYQKWPFPSSANRP----------QCLICGKLGHTTLVCYNRNNPIYQASSTQSSQAYFNNFQSSQPTVASTSNYL

Query:  EPISTQNQQSNILDDAWFVDSGTTHHVTPDIANLQQRPP
        +P +     S    + W +DSG THH+T D  NL    P
Subjt:  EPISTQNQQSNILDDAWFVDSGTTHHVTPDIANLQQRPP

Q9P7B5 DNA-dependent metalloprotease WSS1 homolog1.7e-1135.2Show/hide
Query:  KVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNAD
        K+  +  ++    D +   L++IA    PIM++  + V +L E    N    G N   G+ I+L +R  +N W   P+E V+D  LHELCHI  GPH+  
Subjt:  KVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNAD

Query:  FYRLLDELRKECEELMFKGITGTGK
        F+  L  LR     L  KG  G GK
Subjt:  FYRLLDELRKECEELMFKGITGTGK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-0723.94Show/hide
Query:  DKFSAIREPLSYRDHLGYILEGLGTEYNAFVTSIQNHTDVSSLADVRSLLLAYEARLEKQSAVDQL----NMIQANIANMNLSQSNKKFQKS-SNGKNAY
        D+ + + +P+ + + +  +LE L  +Y   +  I       SL ++   L+  E++L   ++ + +    N++     N N +Q+N+   ++ +N  N  
Subjt:  DKFSAIREPLSYRDHLGYILEGLGTEYNAFVTSIQNHTDVSSLADVRSLLLAYEARLEKQSAVDQL----NMIQANIANMNLSQSNKKFQKS-SNGKNAY

Query:  QKWPFPSSANRP----------QCLICGKLGHTTLVCYNRNNPIYQASSTQSSQAYFNNFQSSQPTVASTSNYLEPISTQNQQSNILDDAWFVDSGTTHH
          W   SS +R           +C IC   GH+   C      ++Q  ST + Q   + F   QP      N   P +  N         W +DSG THH
Subjt:  QKWPFPSSANRP----------QCLICGKLGHTTLVCYNRNNPIYQASSTQSSQAYFNNFQSSQPTVASTSNYLEPISTQNQQSNILDDAWFVDSGTTHH

Query:  VTPDIANLQQRPP
        +T D  NL    P
Subjt:  VTPDIANLQQRPP

Arabidopsis top hitse value%identityAlignment
AT1G55915.1 zinc ion binding4.5e-9548.61Show/hide
Query:  SHMDLNDLNKVWAVKPL-QKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELC
        S  +L DLNKVW +K L +K  ED+ARK+LEK+A QVQPIM +R+W+V+ LSEF P NP L+GVN+  G ++KLR+RR N++ DF  Y ++LDTMLHELC
Subjt:  SHMDLNDLNKVWAVKPL-QKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELC

Query:  HIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGG---PSGPKRLGGDSNIKAVLSPIQAAAMA
        H  HGPHNA FY+L DELRKECEELM KGITGTG+GFD+ G+RLGG+S+QP LS LR T   AAE R R G   PSGP+RLGGDS+I + LSPIQAAAMA
Subjt:  HIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGG---PSGPKRLGGDSNIKAVLSPIQAAAMA

Query:  AERRLQDDLWCGSKSLE------------------------TNSDGVKNMSS---------------------------------------STGPSKTLN
        AERRL DD+WCGS+S +                         N   VK  +S                                         GPS   +
Subjt:  AERRLQDDLWCGSKSLE------------------------TNSDGVKNMSS---------------------------------------STGPSKTLN

Query:  APTPSVI-SQIP-SAASFPPNQEAVDGLETWPCSACTLLNQPLALICEAC--GNRKNINNNTRTWACKFCTLNNSANTERCLACGEWRYSYGPPIST
         P   V+ S IP  + S+  NQ   +    W C+ CTLLN  LA ICE C     K      + W+CKFCTL N    E+C ACG+WRYSYG P+ST
Subjt:  APTPSVI-SQIP-SAASFPPNQEAVDGLETWPCSACTLLNQPLALICEAC--GNRKNINNNTRTWACKFCTLNNSANTERCLACGEWRYSYGPPIST

AT5G35690.1 CONTAINS InterPro DOMAIN/s: WLM (InterPro:IPR013536), PUB domain (InterPro:IPR018997), PUG domain (InterPro:IPR006567)1.1e-1131.79Show/hide
Query:  IMRKRRWKVETLSEFYPAN------PALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLLDELRKECEELMF---KGI
        +M K RW+V  ++E  P          L+G N   G+EI LR+R  + +  F  Y+ +  T+LHEL H+ +  H+  FY L  +L KE E L +   +G 
Subjt:  IMRKRRWKVETLSEFYPAN------PALMGVNIGGGQEIKLRIRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLLDELRKECEELMF---KGI

Query:  TGTGKGF--------------DLHGRRLGGISQQPPLSSLRQTTLAAAENR
        T  G  F              +   +RLGG +Q   L + R++++AAA  R
Subjt:  TGTGKGF--------------DLHGRRLGGISQQPPLSSLRQTTLAAAENR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTGGACTCCTTCAGAACTGCAACGCCAGTAGTTTCAGCTCCTCCCTTGGCGGTTCTCTTAGCCCAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGTGCGAACGA
GGGTTTAAGGTGGGAGAGGCCTTCGTCAGTTTTCTTACCGAAACAGAGTAGCGGGAGGATGGTGGTGGGGCATACTGTAAAACGAAGCGCTTCAATGGACTTTGGCGTCG
GCTTTCTTCATCTCCAAGGTGCCTCTCTCTCTCTCTCTCTCTCTATCGCTCTTAACACTATTGGACATTGGAAATTGAAGTCTCATCGTTTCTGGAACAATAGTACAGTT
TATATTGAACTGCTTCTTCGCGCTGCAACTTTCTCCATTTTGGATTGCATTCTTCTAGGTAATGATTTGGGAGATGATGATTTTAAACACAGATTTTTGCAGCTCTTTTC
TCATATGGATCTCAACGATCTTAACAAGGTTTGGGCAGTTAAACCTTTGCAGAAGATTGGGGAAGATGATGCTCGGAAAGTTCTTGAAAAAATAGCTAAACAGGTGCAAC
CTATCATGCGCAAACGCAGATGGAAAGTGGAAACTCTTTCTGAGTTCTATCCTGCCAATCCAGCTCTCATGGGGGTGAACATAGGGGGAGGTCAGGAAATCAAACTTAGA
ATTCGAAGGCCAAATAACGAGTGGGATTTCTTCCCTTACGAGCAGGTTCTTGACACAATGCTCCATGAGCTTTGCCACATTGAACATGGTCCTCACAATGCCGATTTCTA
CCGCCTTCTAGATGAACTCAGAAAGGAATGTGAAGAACTTATGTTTAAAGGTATCACGGGCACCGGGAAAGGATTTGATCTTCATGGGAGACGTTTGGGTGGGATCTCTC
AGCAACCACCATTGTCATCCCTCCGGCAAACTACCTTAGCTGCTGCTGAAAATAGAGCTCGAGGTGGTCCATCAGGACCTAAGCGCCTAGGCGGTGACAGCAATATCAAG
GCAGTGCTTAGTCCTATACAAGCAGCTGCCATGGCAGCAGAAAGAAGACTGCAAGATGATTTGTGGTGTGGGTCCAAGTCTTTGGAGACTAATTCAGATGGAGTAAAAAA
CATGAGTTCCTCCACAGGACCCTCCAAAACTTTAAATGCTCCAACACCTTCTGTTATATCCCAAATTCCATCTGCTGCTTCCTTTCCTCCTAATCAAGAAGCTGTGGATG
GTCTGGAAACTTGGCCGTGCAGTGCATGTACCCTTCTAAATCAGCCATTGGCTCTGATTTGTGAAGCTTGTGGGAATCGTAAGAACATAAACAACAACACAAGGACTTGG
GCTTGTAAGTTTTGTACGCTCAACAACAGTGCTAATACAGAGAGATGCTTAGCTTGTGGGGAGTGGAGATATTCGTATGGCCCTCCCATCTCCACACAAGGTGAAAAGAG
AGGACTCACTGTACCTCAATACTTGGCTCAAATCAAAGACATAGCTGATAAGTTCTCTGCTATTAGGGAACCATTGTCATACCGTGATCATCTGGGATATATTCTTGAAG
GGCTTGGCACAGAATATAATGCCTTCGTCACTTCCATTCAGAATCACACTGATGTATCTTCCCTTGCTGATGTTCGCAGTTTATTGCTTGCGTATGAAGCCCGGTTGGAG
AAACAATCGGCTGTAGATCAACTCAACATGATTCAAGCCAATATTGCCAATATGAATCTGTCTCAATCCAACAAAAAATTTCAGAAATCATCAAATGGGAAGAATGCATA
TCAGAAATGGCCTTTCCCTTCCTCTGCCAATCGCCCTCAATGTCTGATTTGTGGTAAACTTGGACACACAACTCTCGTCTGCTACAACCGCAATAACCCAATTTACCAAG
CCTCTTCCACCCAGTCCTCTCAGGCATATTTCAACAATTTTCAGTCCTCTCAACCCACTGTTGCCTCAACTTCAAATTACCTTGAACCGATATCAACCCAAAACCAACAA
TCCAACATTCTCGATGATGCTTGGTTCGTGGATTCTGGCACCACTCACCACGTGACTCCTGACATAGCAAATTTACAGCAGCGTCCCCCTACTATGGTGGTGAGCAAGTT
GTCATCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTGGACTCCTTCAGAACTGCAACGCCAGTAGTTTCAGCTCCTCCCTTGGCGGTTCTCTTAGCCCAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGTGCGAACGA
GGGTTTAAGGTGGGAGAGGCCTTCGTCAGTTTTCTTACCGAAACAGAGTAGCGGGAGGATGGTGGTGGGGCATACTGTAAAACGAAGCGCTTCAATGGACTTTGGCGTCG
GCTTTCTTCATCTCCAAGGTGCCTCTCTCTCTCTCTCTCTCTCTATCGCTCTTAACACTATTGGACATTGGAAATTGAAGTCTCATCGTTTCTGGAACAATAGTACAGTT
TATATTGAACTGCTTCTTCGCGCTGCAACTTTCTCCATTTTGGATTGCATTCTTCTAGGTAATGATTTGGGAGATGATGATTTTAAACACAGATTTTTGCAGCTCTTTTC
TCATATGGATCTCAACGATCTTAACAAGGTTTGGGCAGTTAAACCTTTGCAGAAGATTGGGGAAGATGATGCTCGGAAAGTTCTTGAAAAAATAGCTAAACAGGTGCAAC
CTATCATGCGCAAACGCAGATGGAAAGTGGAAACTCTTTCTGAGTTCTATCCTGCCAATCCAGCTCTCATGGGGGTGAACATAGGGGGAGGTCAGGAAATCAAACTTAGA
ATTCGAAGGCCAAATAACGAGTGGGATTTCTTCCCTTACGAGCAGGTTCTTGACACAATGCTCCATGAGCTTTGCCACATTGAACATGGTCCTCACAATGCCGATTTCTA
CCGCCTTCTAGATGAACTCAGAAAGGAATGTGAAGAACTTATGTTTAAAGGTATCACGGGCACCGGGAAAGGATTTGATCTTCATGGGAGACGTTTGGGTGGGATCTCTC
AGCAACCACCATTGTCATCCCTCCGGCAAACTACCTTAGCTGCTGCTGAAAATAGAGCTCGAGGTGGTCCATCAGGACCTAAGCGCCTAGGCGGTGACAGCAATATCAAG
GCAGTGCTTAGTCCTATACAAGCAGCTGCCATGGCAGCAGAAAGAAGACTGCAAGATGATTTGTGGTGTGGGTCCAAGTCTTTGGAGACTAATTCAGATGGAGTAAAAAA
CATGAGTTCCTCCACAGGACCCTCCAAAACTTTAAATGCTCCAACACCTTCTGTTATATCCCAAATTCCATCTGCTGCTTCCTTTCCTCCTAATCAAGAAGCTGTGGATG
GTCTGGAAACTTGGCCGTGCAGTGCATGTACCCTTCTAAATCAGCCATTGGCTCTGATTTGTGAAGCTTGTGGGAATCGTAAGAACATAAACAACAACACAAGGACTTGG
GCTTGTAAGTTTTGTACGCTCAACAACAGTGCTAATACAGAGAGATGCTTAGCTTGTGGGGAGTGGAGATATTCGTATGGCCCTCCCATCTCCACACAAGGTGAAAAGAG
AGGACTCACTGTACCTCAATACTTGGCTCAAATCAAAGACATAGCTGATAAGTTCTCTGCTATTAGGGAACCATTGTCATACCGTGATCATCTGGGATATATTCTTGAAG
GGCTTGGCACAGAATATAATGCCTTCGTCACTTCCATTCAGAATCACACTGATGTATCTTCCCTTGCTGATGTTCGCAGTTTATTGCTTGCGTATGAAGCCCGGTTGGAG
AAACAATCGGCTGTAGATCAACTCAACATGATTCAAGCCAATATTGCCAATATGAATCTGTCTCAATCCAACAAAAAATTTCAGAAATCATCAAATGGGAAGAATGCATA
TCAGAAATGGCCTTTCCCTTCCTCTGCCAATCGCCCTCAATGTCTGATTTGTGGTAAACTTGGACACACAACTCTCGTCTGCTACAACCGCAATAACCCAATTTACCAAG
CCTCTTCCACCCAGTCCTCTCAGGCATATTTCAACAATTTTCAGTCCTCTCAACCCACTGTTGCCTCAACTTCAAATTACCTTGAACCGATATCAACCCAAAACCAACAA
TCCAACATTCTCGATGATGCTTGGTTCGTGGATTCTGGCACCACTCACCACGTGACTCCTGACATAGCAAATTTACAGCAGCGTCCCCCTACTATGGTGGTGAGCAAGTT
GTCATCATAG
Protein sequenceShow/hide protein sequence
MSLDSFRTATPVVSAPPLAVLLAQRERERERERSANEGLRWERPSSVFLPKQSSGRMVVGHTVKRSASMDFGVGFLHLQGASLSLSLSIALNTIGHWKLKSHRFWNNSTV
YIELLLRAATFSILDCILLGNDLGDDDFKHRFLQLFSHMDLNDLNKVWAVKPLQKIGEDDARKVLEKIAKQVQPIMRKRRWKVETLSEFYPANPALMGVNIGGGQEIKLR
IRRPNNEWDFFPYEQVLDTMLHELCHIEHGPHNADFYRLLDELRKECEELMFKGITGTGKGFDLHGRRLGGISQQPPLSSLRQTTLAAAENRARGGPSGPKRLGGDSNIK
AVLSPIQAAAMAAERRLQDDLWCGSKSLETNSDGVKNMSSSTGPSKTLNAPTPSVISQIPSAASFPPNQEAVDGLETWPCSACTLLNQPLALICEACGNRKNINNNTRTW
ACKFCTLNNSANTERCLACGEWRYSYGPPISTQGEKRGLTVPQYLAQIKDIADKFSAIREPLSYRDHLGYILEGLGTEYNAFVTSIQNHTDVSSLADVRSLLLAYEARLE
KQSAVDQLNMIQANIANMNLSQSNKKFQKSSNGKNAYQKWPFPSSANRPQCLICGKLGHTTLVCYNRNNPIYQASSTQSSQAYFNNFQSSQPTVASTSNYLEPISTQNQQ
SNILDDAWFVDSGTTHHVTPDIANLQQRPPTMVVSKLSS