; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002142 (gene) of Snake gourd v1 genome

Gene IDTan0002142
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-dependent metalloprotease WSS1 isoform X2
Genome locationLG01:85359776..85362259
RNA-Seq ExpressionTan0002142
SyntenyTan0002142
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008237 - metallopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR013536 - WLM domain
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607757.1 hypothetical protein SDJN03_01099, partial [Cucurbita argyrosperma subsp. sororia]3.1e-21589.37Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M+VGDLNK+WEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPS+ VHSSH  KSGPSSNLS  VD LHQKR+RESEKSSNK SRGHLKP FVDLSEDVLIPGSSA+YD E NKR+KM
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID SCSSSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKCSAC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQP ST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

KAG7037334.1 WSS1 [Cucurbita argyrosperma subsp. argyrosperma]9.1e-21589.61Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M+VGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+Q PLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPSE  HSSH  KSGPSSNLS  VDALHQKR+RESEKSSNK SRGHLKP FVDLSEDVLIPGSSA+YD E NKR+KM
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID SCSSSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKCSAC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQP ST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

XP_022940664.1 uncharacterized protein LOC111446189 isoform X1 [Cucurbita moschata]1.2e-21489.37Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M+VGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPS+ VHSSH  KSGPSSNLS  VD LHQKR+RESEKSSNK SRGHLKP FVDLSEDVLIPGSSA+YD E NKR+KM
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID SCSSSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKC AC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQP ST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

XP_022981482.1 uncharacterized protein LOC111480590 isoform X1 [Cucurbita maxima]1.3e-21389.13Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNANFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIM ALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPSE  HSSH  KSGPSSNLS  VDALHQKR+RESEKSSN  SRGHL+PDFVDLSEDVLIPGSSA+YD E  KR+K+
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID  C SSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKCSAC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQPVST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

XP_023525391.1 uncharacterized protein LOC111789010 [Cucurbita pepo subsp. pepo]3.0e-21891.06Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNANFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPSE  HSSH  KSGPSSNLS  VDALHQKR+RESEKSS K SRGHLKPDFVDLSEDVLIPGSSA+YD ESNKRHKM
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETSSID SCSSSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKCSAC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQP ST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

TrEMBL top hitse value%identityAlignment
A0A0A0K3J6 Uncharacterized protein9.8e-20785.99Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAK++LERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
          PHNANFYKLWDELRKECEEL+AKG+SG+AQGFDLPGRRLGGN RQP LSSLRKS+LAAAEGRRRLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCAS Q MPVDE+CC  FPSEA HSS   KSGP  NLS  VDALHQKR RESE+S NK S G L+PDFVDLS+D  IPGSSA Y  ESNKRHK+
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
        PDR +FP+  AETSSIDLSCSSSN M  +DGT+ PG+LSMWECGNCTLLN PLAPICELCFSQKP D+DT+YKFWSCKFCTLEN+VK EKC+ACDQWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQPVST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

A0A1S4E403 uncharacterized protein LOC1035012341.6e-20183.81Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDV DLNKVWEIKALKKAGEKEAK+ILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
          PHNANFYKLWDELRKECEEL+AKGISG+AQGFDLPGRRLGGN RQPPLSSLRKS+LAAAEGRRRL SLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEM-------PVDEECCSDFPSEAVHSSHTDKSGPSSNLSGVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVES
        RLQDDIWCAS Q M       PVDE+CC  FPSE  HSS       ++   GVDALHQKRSRESE+SSNK S GHL  DFVDLS+D  IPGSSA Y  ES
Subjt:  RLQDDIWCASSQEM-------PVDEECCSDFPSEAVHSSHTDKSGPSSNLSGVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVES

Query:  NKRHKMPDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSAC
        NKRHK+PDR +FP+  AE SSIDLSCSSSN M  HDGT+ PG+LSMWECGNCTLLN PLAPICELCFSQKPKD+DT+YKFWSCKFCTLEN+VK EKC+AC
Subjt:  NKRHKMPDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSAC

Query:  DQWRYSHGQPVSTQGPNLGT
         QWRYSHGQPVST+GPNLGT
Subjt:  DQWRYSHGQPVSTQGPNLGT

A0A6J1CDP8 uncharacterized protein LOC111010596 isoform X13.6e-20185.61Show/hide
Query:  MDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHN
        MDVGDLNKVWEIKAL KKAGEKEA+EILERIAKQVQPIMR+HKWRVKVLSEFCPKN ALLGLNVGRGIHVKLRLRRPNRD DF PFNQVLDTMLHELCHN
Subjt:  MDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHN

Query:  LISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAE
        L  PHNANFYKLWDELRKECEELMAKGISG+AQGFDL GRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAE
Subjt:  LISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAE

Query:  RRLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNL-SGVDALHQKRSRE-SEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRH
        RRLQDDIWCAS QE+PVDEECC D PSEAVHS    K GPSSNL +GVDALH KR R+    S+NK S GHLKPDFVDLSEDV I GS+A YD ESNKR 
Subjt:  RRLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNL-SGVDALHQKRSRE-SEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRH

Query:  KMPDRATFPRPCAETSSIDLSCSSSNSMQCHDGT-LRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQW
        KM +R  FP+ CAETSS  LSCSSSN MQ HDGT   PG+LSMWECGNCTLLN PLAP+CELC+SQKPKDADTKY+ WSCKFCTLEN+VK EKCSACDQW
Subjt:  KMPDRATFPRPCAETSSIDLSCSSSNSMQCHDGT-LRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQW

Query:  RYSHGQPVSTQGPNLGT
        RYSHGQPVST+ PNLGT
Subjt:  RYSHGQPVSTQGPNLGT

A0A6J1FKX5 uncharacterized protein LOC111446189 isoform X15.7e-21589.37Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M+VGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIMVALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPS+ VHSSH  KSGPSSNLS  VD LHQKR+RESEKSSNK SRGHLKP FVDLSEDVLIPGSSA+YD E NKR+KM
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSG-VDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID SCSSSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKC AC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQP ST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

A0A6J1IU34 uncharacterized protein LOC111480590 isoform X16.3e-21489.13Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRK KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER
        ISPHNANFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGP RLGGDSNIM ALSPVQAAAMAAER
Subjt:  ISPHNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM
        RLQDDIWCASSQEMPVDEECCSDFPSE  HSSH  KSGPSSNLS  VDALHQKR+RESEKSSN  SRGHL+PDFVDLSEDVLIPGSSA+YD E  KR+K+
Subjt:  RLQDDIWCASSQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLS-GVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKM

Query:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS
         DR  FP+PCAETS+ID  C SSNSM  HDGTL P + SMWECGNCTLLN PLAPICELC SQK KDADTKY+FWSCKFCTLEN+VK EKCSAC QWRYS
Subjt:  PDRATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYS

Query:  HGQPVSTQGPNLGT
        HGQPVST+GPNLGT
Subjt:  HGQPVSTQGPNLGT

SwissProt top hitse value%identityAlignment
O94580 DNA-dependent metalloprotease WSS1 homolog 21.6e-1232.53Show/hide
Query:  EIKALKKAGEKEAKEILERIAKQ--VQPIMRKHKWRVKVLSEFCP-----KNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISP
        E+  L    +  A   LER+     ++ IM  H+W V +LSE  P      +   LGLN  +G H++LRLR    DG F  +  V  T++HEL HN+   
Subjt:  EIKALKKAGEKEAKEILERIAKQ--VQPIMRKHKWRVKVLSEFCP-----KNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISP

Query:  HNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGS
        H+++F++L+ +L KE +     G  GS             N  +   +  R   LAAAE R++ GS
Subjt:  HNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGS

P38838 DNA-dependent metalloprotease WSS12.9e-2234.65Show/hide
Query:  KAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISPHNANFYKLWDELR
        K  +++A  +++ IA +V  +M+++ ++V  L EF P++  LLG+NV  G  + LRLR    +  F P   ++ TMLHEL HNL  PH+  FY   DEL 
Subjt:  KAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISPHNANFYKLWDELR

Query:  KECEELMAKGISGSAQGFDLPGRRLGG-----NSRQPPLSSLRKSALAAAEGRR-RLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS
             +  +G+  +  G    G+RLGG     ++R P       + +    G+  +LGSL P       G S+I    SP + AA AAERR +DD WC  
Subjt:  KECEELMAKGISGSAQGFDLPGRRLGG-----NSRQPPLSSLRKSALAAAEGRR-RLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS

Query:  SQ
        ++
Subjt:  SQ

Q9P7B5 DNA-dependent metalloprotease WSS1 homolog1.1e-0834.43Show/hide
Query:  KVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISPHNAN
        K+  I A+K      + + L+RIA    PIM++H + V  L E    N    G N  +G  ++L LR  +    + PF  V+D  LHELCH    PH+  
Subjt:  KVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISPHNAN

Query:  FYKLWDELRKECEELMAKGISG
        F+     LR     L AKG  G
Subjt:  FYKLWDELRKECEELMAKGISG

Arabidopsis top hitse value%identityAlignment
AT1G55915.1 zinc ion binding4.5e-11955.4Show/hide
Query:  DLNKVWEIKALK-KAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISP
        DLNKVWEIKALK K  E EA++ILE++A QVQPIM + KWRVK+LSEFCP NP LLG+NV RG+ VKLRLRR N D DF  ++++LDTMLHELCHN   P
Subjt:  DLNKVWEIKALK-KAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISP

Query:  HNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAERRLQ
        HNA+FYKLWDELRKECEELM+KGI+G+ QGFD+PG+RLGG SRQP LS LR +A  AAE R R G+LLPSGP+RLGGDS+IM  LSP+QAAAMAAERRL 
Subjt:  HNANFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAERRLQ

Query:  DDIWCAS-SQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSGVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKMPD-
        DDIWC S S +   DEE  SD   E V    T  S    ++   ++     S     S  + S      D +DL+E+         +++   KR++ P  
Subjt:  DDIWCAS-SQEMPVDEECCSDFPSEAVHSSHTDKSGPSSNLSGVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKMPD-

Query:  -----RATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQW
                 P      SSI L  +S N+ Q  +      + +MWEC  CTLLN  LAPICELC + KPK+ + K+K WSCKFCTLEN VK EKC AC QW
Subjt:  -----RATFPRPCAETSSIDLSCSSSNSMQCHDGTLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQW

Query:  RYSHGQPVSTQGPNLGT
        RYS+G P+ST  PN+GT
Subjt:  RYSHGQPVSTQGPNLGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTGGGCGACCTTAACAAAGTTTGGGAAATTAAAGCCCTGAAGAAGGCTGGGGAGAAAGAAGCAAAGGAGATTCTGGAGAGAATTGCCAAACAAGTCCAACCCAT
TATGCGGAAACACAAATGGCGAGTCAAGGTTCTCTCGGAATTCTGCCCAAAAAATCCAGCACTTTTAGGGTTAAATGTGGGACGTGGAATTCATGTGAAGTTGAGGCTTC
GAAGGCCAAACAGGGATGGAGATTTCTTCCCTTTCAATCAAGTTTTGGATACAATGCTACATGAGCTTTGCCACAATCTCATTAGTCCTCACAATGCCAATTTCTACAAG
CTTTGGGATGAACTTAGAAAGGAATGCGAGGAGTTGATGGCTAAGGGAATTAGTGGCTCCGCACAAGGATTTGATCTCCCGGGGAGGCGTTTGGGTGGTAATTCGCGACA
ACCCCCTCTCTCTTCCCTCCGCAAATCTGCCCTAGCTGCCGCAGAAGGAAGAAGACGTTTGGGATCTCTACTTCCATCTGGTCCTAAGCGGCTTGGTGGTGATAGCAACA
TCATGGTTGCTCTAAGTCCTGTACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTTCAGGATGATATTTGGTGTGCTTCATCTCAAGAAATGCCTGTGGATGAGGAATGT
TGCTCTGATTTTCCATCAGAAGCTGTGCATTCCTCCCACACAGATAAATCAGGGCCCTCTAGCAATTTAAGTGGTGTGGATGCATTACACCAGAAAAGAAGCCGTGAATC
AGAAAAGAGTTCCAACAAGTTTTCCCGTGGTCATTTGAAACCTGATTTTGTTGATTTATCTGAAGACGTTTTAATCCCTGGGTCTTCTGCCATCTATGATGTGGAATCAA
ATAAGCGACATAAAATGCCAGATAGAGCTACATTTCCTCGACCTTGTGCAGAAACTAGCTCGATAGATTTGTCTTGTTCATCCTCTAATTCAATGCAATGTCACGATGGA
ACTCTTCGTCCTGGAAAACTTTCCATGTGGGAGTGTGGAAATTGCACCTTACTGAATCAACCACTAGCTCCAATATGTGAGCTTTGTTTCTCACAAAAGCCAAAAGATGC
AGATACCAAGTACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAATGTGAAGTTTGAGAAATGCTCAGCATGTGATCAATGGAGATATTCTCATGGCCAGCCAG
TGTCGACTCAAGGACCGAATCTCGGTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTGGGCGACCTTAACAAAGTTTGGGAAATTAAAGCCCTGAAGAAGGCTGGGGAGAAAGAAGCAAAGGAGATTCTGGAGAGAATTGCCAAACAAGTCCAACCCAT
TATGCGGAAACACAAATGGCGAGTCAAGGTTCTCTCGGAATTCTGCCCAAAAAATCCAGCACTTTTAGGGTTAAATGTGGGACGTGGAATTCATGTGAAGTTGAGGCTTC
GAAGGCCAAACAGGGATGGAGATTTCTTCCCTTTCAATCAAGTTTTGGATACAATGCTACATGAGCTTTGCCACAATCTCATTAGTCCTCACAATGCCAATTTCTACAAG
CTTTGGGATGAACTTAGAAAGGAATGCGAGGAGTTGATGGCTAAGGGAATTAGTGGCTCCGCACAAGGATTTGATCTCCCGGGGAGGCGTTTGGGTGGTAATTCGCGACA
ACCCCCTCTCTCTTCCCTCCGCAAATCTGCCCTAGCTGCCGCAGAAGGAAGAAGACGTTTGGGATCTCTACTTCCATCTGGTCCTAAGCGGCTTGGTGGTGATAGCAACA
TCATGGTTGCTCTAAGTCCTGTACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTTCAGGATGATATTTGGTGTGCTTCATCTCAAGAAATGCCTGTGGATGAGGAATGT
TGCTCTGATTTTCCATCAGAAGCTGTGCATTCCTCCCACACAGATAAATCAGGGCCCTCTAGCAATTTAAGTGGTGTGGATGCATTACACCAGAAAAGAAGCCGTGAATC
AGAAAAGAGTTCCAACAAGTTTTCCCGTGGTCATTTGAAACCTGATTTTGTTGATTTATCTGAAGACGTTTTAATCCCTGGGTCTTCTGCCATCTATGATGTGGAATCAA
ATAAGCGACATAAAATGCCAGATAGAGCTACATTTCCTCGACCTTGTGCAGAAACTAGCTCGATAGATTTGTCTTGTTCATCCTCTAATTCAATGCAATGTCACGATGGA
ACTCTTCGTCCTGGAAAACTTTCCATGTGGGAGTGTGGAAATTGCACCTTACTGAATCAACCACTAGCTCCAATATGTGAGCTTTGTTTCTCACAAAAGCCAAAAGATGC
AGATACCAAGTACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAATGTGAAGTTTGAGAAATGCTCAGCATGTGATCAATGGAGATATTCTCATGGCCAGCCAG
TGTCGACTCAAGGACCGAATCTCGGTACTTGA
Protein sequenceShow/hide protein sequence
MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMRKHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLISPHNANFYK
LWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPKRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSQEMPVDEEC
CSDFPSEAVHSSHTDKSGPSSNLSGVDALHQKRSRESEKSSNKFSRGHLKPDFVDLSEDVLIPGSSAIYDVESNKRHKMPDRATFPRPCAETSSIDLSCSSSNSMQCHDG
TLRPGKLSMWECGNCTLLNQPLAPICELCFSQKPKDADTKYKFWSCKFCTLENNVKFEKCSACDQWRYSHGQPVSTQGPNLGT