; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036331 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036331
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionzingipain-2-like
Genome locationscaffold5:48379379..48382163
RNA-Seq ExpressionSpg036331
SyntenySpg036331
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
3QJ3_A Structure of digestive procathepsin L2 proteinase from Tenebrio molitor larval midgut [Tenebrio molitor]7.4e-1154.39Show/hide
Query:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS
        G +Y+ PT +    NK  HA+LI+G+GNENGQ YW++KNSWG+ WG DGY +I++N+
Subjt:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS

AAR05023.1 cathepsin L-like protein [Tenebrio molitor]7.4e-1154.39Show/hide
Query:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS
        G +Y+ PT +    NK  HA+LI+G+GNENGQ YW++KNSWG+ WG DGY +I++N+
Subjt:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS

XP_022949074.1 zingipain-2-like [Cucurbita moschata]5.3e-1750Show/hide
Query:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSD
        E +++   IY+GP + + L  IN   HA+L++GFG  N QKYWIIKNSWGE W D GY +ISQN + T +G + L+E+LSYP I+  +
Subjt:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSD

XP_023524541.1 zingipain-2-like [Cucurbita pepo subsp. pepo]3.6e-1353.52Show/hide
Query:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKG
        E K++   IY+GP + + L  IN   HA+L++GFG  N QKYWIIKNSWGE W D GY +ISQN + T +G
Subjt:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKG

XP_030939752.1 ervatamin-B-like [Quercus lobata]7.4e-1145.05Show/hide
Query:  EIKNIVGMIYHGPTNQDDL---INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI
        E   + G IY GP  QD L   I K +HAIL++G+  ENG+ YW+I+NSWGE WG +GY +I ++   SI+T K +     L+ ++SYP I
Subjt:  EIKNIVGMIYHGPTNQDDL---INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI

TrEMBL top hitse value%identityAlignment
A0A6A1WS89 Ervatamin-B6.1e-1142.05Show/hide
Query:  LLQSEIKNIVGMIYHGPTNQDDLINKSLHAILIIGFG--NENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSK--GQKYLLEKLSYP
        ++ +E  N+ G IY GP +    + + +HAIL++GFG   E G+ +WIIKNSWG+ WG +GYA+I+++S ++S     K L+ + SYP
Subjt:  LLQSEIKNIVGMIYHGPTNQDDLINKSLHAILIIGFG--NENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSK--GQKYLLEKLSYP

A0A6J1GBS9 zingipain-2-like2.6e-1750Show/hide
Query:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSD
        E +++   IY+GP + + L  IN   HA+L++GFG  N QKYWIIKNSWGE W D GY +ISQN + T +G + L+E+LSYP I+  +
Subjt:  EIKNIVGMIYHGPTNQDDL--INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSD

A0A7N2MRH3 Pept_C1 domain-containing protein3.6e-1145.05Show/hide
Query:  EIKNIVGMIYHGPTNQDDL---INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI
        E   + G IY GP  QD L   I K +HAIL++G+  ENG+ YW+I+NSWGE WG +GY +I ++   SI+T K +     L+ ++SYP I
Subjt:  EIKNIVGMIYHGPTNQDDL---INKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI

A0A7N2RC65 Pept_C1 domain-containing protein4.7e-1143.48Show/hide
Query:  SEIKNIVGMIYHGPTNQDDLI---NKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI
        +E   + G IY GP  QD LI    K +HAIL++G+  ENG+ YW+I+NSWG+ WG +GY +I ++   SI+T K +     L+ ++SYP I
Subjt:  SEIKNIVGMIYHGPTNQDDLI---NKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN---SIMTSKGQKY---LLEKLSYPKI

Q69G21 C1 family cathepsin L53.6e-1154.39Show/hide
Query:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS
        G +Y+ PT +    NK  HA+LI+G+GNENGQ YW++KNSWG+ WG DGY +I++N+
Subjt:  GMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNS

SwissProt top hitse value%identityAlignment
F4JNL3 Probable cysteine protease RDL63.1e-1249.37Show/hide
Query:  IYHGP--TNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNS
        IY+GP  TN D       HA++I+G+G+ENGQ YWI++NSWG  WGD GY +I++N     KG   +    SYP IKNS
Subjt:  IYHGP--TNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNS

O35186 Cathepsin K3.8e-1046.3Show/hide
Query:  IYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN
        +Y+      D +N   HA+L++G+G + G KYWIIKNSWGE WG+ GY  +++N
Subjt:  IYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQN

P25251 Cysteine proteinase COT44 (Fragment)1.3e-1044.44Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        HA++ +G+G+ENG  YWI++NSWG  WG+DGY R+ +N + +  G+  +  + SYP +K S N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

P43297 Cysteine proteinase RD21A2.7e-1146.03Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        H ++ +G+G ENG+ YWI++NSWG+ WG+ GY R+++N I +S G+  +  + SYP IKN +N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

Q94B08 Germination-specific cysteine protease 15.9e-1142.86Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        HA++ +G+G+ENG  YWI++NSWG  WG++GY R+ +N   +  G+  +  + SYP +K S N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

Arabidopsis top hitse value%identityAlignment
AT1G47128.1 Granulin repeat cysteine protease family protein1.9e-1246.03Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        H ++ +G+G ENG+ YWI++NSWG+ WG+ GY R+++N I +S G+  +  + SYP IKN +N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

AT3G19390.1 Granulin repeat cysteine protease family protein1.4e-1039.68Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        H ++ +G+G+E GQ YWI++NSWG  WG+ GY ++ +N I  S G+  +    SYP   +  N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

AT4G23520.1 Cysteine proteinases superfamily protein2.2e-1349.37Show/hide
Query:  IYHGP--TNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNS
        IY+GP  TN D       HA++I+G+G+ENGQ YWI++NSWG  WGD GY +I++N     KG   +    SYP IKNS
Subjt:  IYHGP--TNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNS

AT4G36880.1 cysteine proteinase14.2e-1242.86Show/hide
Query:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN
        HA++ +G+G+ENG  YWI++NSWG  WG++GY R+ +N   +  G+  +  + SYP +K S N
Subjt:  HAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN

AT5G45890.1 senescence-associated gene 127.9e-1152.54Show/hide
Query:  HAILIIGFG-NENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKI
        HA+  IG+G + NG KYWIIKNSWG +WG+ GY RI Q  +   +G   L  K SYP I
Subjt:  HAILIIGFG-NENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATCGGCGTACTTACTCCAGTCCGAGATCAAGAACATTGTAGGAATGATATACCATGGTCCAACAAACCAGGATGATTTGATAAACAAATCTCTACATGCCATTTT
GATAATAGGATTTGGAAATGAAAATGGTCAAAAGTATTGGATAATCAAGAATTCGTGGGGTGAAGAATGGGGAGATGATGGTTATGCAAGAATATCCCAGAACAGTATTA
TGACCAGTAAAGGACAAAAATATTTGTTGGAGAAGTTATCGTATCCCAAGATCAAAAATAGTGATAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGATCGGCGTACTTACTCCAGTCCGAGATCAAGAACATTGTAGGAATGATATACCATGGTCCAACAAACCAGGATGATTTGATAAACAAATCTCTACATGCCATTTT
GATAATAGGATTTGGAAATGAAAATGGTCAAAAGTATTGGATAATCAAGAATTCGTGGGGTGAAGAATGGGGAGATGATGGTTATGCAAGAATATCCCAGAACAGTATTA
TGACCAGTAAAGGACAAAAATATTTGTTGGAGAAGTTATCGTATCCCAAGATCAAAAATAGTGATAATTAA
Protein sequenceShow/hide protein sequence
MRSAYLLQSEIKNIVGMIYHGPTNQDDLINKSLHAILIIGFGNENGQKYWIIKNSWGEEWGDDGYARISQNSIMTSKGQKYLLEKLSYPKIKNSDN