CuGenDBv2

Spg022583 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022583
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Thiol protease aleurain-like
Genome location: scaffold2:9655968..9656765
RNA-Seq Expression: Spg022583
Synteny: Spg022583
Gene Ontology terms:
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domains:
IPR000668 - Peptidase C1A, papain C-terminal
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
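
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("scaffold2:9655968..9656765")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the locus spans 798 bp, whereas the CDS listed on this page is 285 bp. The difference would be consistent with non-coding sequence inside the gene model, although the page does not describe the gene structure.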


Homology
GenBank top hits    e value    % identity    Alignment
KAG4942418.1 hypothetical protein JHK85_047064 [Glycine max]    2.8e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

KHN17725.1 Thiol protease aleurain-like [Glycine soja]    2.8e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

XP_003551114.1 pro-cathepsin H [Glycine max]    2.8e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

XP_026441009.1 thiol protease aleurain-like [Papaver somniferum]    8.3e-23    58.51
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AV FVRPV + F++ +  RLY+EGV++S   GT P  D  H VLAVGYGVE +GT YW++KNSWGA WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

XP_028211162.1 thiol protease aleurain-like [Glycine soja]    2.8e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

TrEMBL top hits    e value    % identity    Alignment
A0A0R4J5M9 Uncharacterized protein    1.4e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

A0A445G211 Thiol protease aleurain-like    1.4e-23    59.57
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

A0A5N6QK05 Uncharacterized protein    6.8e-23    58.51
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AV FVRPV V F++  D RLY+EGVY+S   GT P  D  H VLAVGYGVE +G  YW++KNSWG  WG+NG+FKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

A0A7C9DHZ8 Uncharacterized protein (Fragment)    6.8e-23    57.45
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AV F+RPV V F++  D R Y+EGVY+S + G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG++GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

A0A7C9DIJ4 Pept_C1 domain-containing protein (Fragment)    8.9e-23    57.45
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AV F+RPV V F++  D R Y+EGVY+S + G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG++GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

SwissProt top hits    e value    % identity    Alignment
A0A072UTP9 Pro-cathepsin H    1.2e-24    55.32
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AV F RPV V F++ +D RLY++GVY+S   G+ P  D  H VLAVGYG+ED G  YW++KNSWG  WG++GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

P25778 Oryzain gamma chain    2.7e-24    57.45
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV V F++    R+Y+ GVY+S   GT P  D  H VLAVGYGVE +G  YW++KNSWGA WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

Q10717 Cysteine proteinase 2    3.5e-24    58.51
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV V F++    RLY+ GVY+S   GT P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

Q8H166 Thiol protease aleurain    6.0e-24    55.32
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

Q8RWQ9 Thiol protease aleurain-like    3.0e-23    54.26
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV V F++  + R Y++GV++S   G  P  D  H VLAVGYGVED    YW++KNSWG  WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

Arabidopsis top hits    e value    % identity    Alignment
AT3G45310.1 Cysteine proteinases superfamily protein    2.1e-24    54.26
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV V F++  + R Y++GV++S   G  P  D  H VLAVGYGVED    YW++KNSWG  WG+NGYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

AT3G45310.2 Cysteine proteinases superfamily protein    1.4e-23    54.35
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV V F++  + R Y++GV++S   G  P  D  H VLAVGYGVED    YW++KNSWG  WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT5G60360.1 aleurain-like protease    4.2e-25    55.32
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+CG+
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL

AT5G60360.2 aleurain-like protease    3.6e-24    55.43
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT5G60360.3 aleurain-like protease    9.4e-25    55.91
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCG
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+CG
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCG
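
The % identity values reported for the hits above can be reproduced from an alignment's middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The following is a minimal sketch using the first GenBank hit (KAG4942418.1). It assumes an ungapped alignment covering the whole query, so the denominator is simply the query length; `percent_identity` is a hypothetical helper name.

```python
QUERY = (
    "MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGV"
    "EDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL"
)

# Middle line of the first GenBank alignment, copied from the record.
MIDLINE = (
    "+G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGV"
    "ED G  YWI+KNSWG+ WG+NGYFKM  G N+CG+"
)

def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) as a percentage of query length.

    ASSUMPTION: the alignment is ungapped and spans the full query.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(query), 2)

print(percent_identity(QUERY, MIDLINE))
```

Counting letters rather than positions keeps the calculation independent of any whitespace lost when the alignment was copied from the page.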


Sequences
CDS sequence
ATGGGGGATGAAGATGAATGGTTTCGTGCAGTTGGTTTTGTTCGACCAGTAATTGTAGTATTCAAGATGAATGAAGATTTAAGATTATACGAAGAAGGTGTTTACAGTAG
TTGCATCTCTGGCACCGTTCCTGAGAAGGACGAATACCATGTCGTGCTTGCAGTTGGTTATGGGGTTGAAGACAGTGGAACTCAATACTGGATTTTAAAGAATTCATGGG
GAGCATGCTGGGGCGAAAATGGCTACTTCAAGATGTTGAGGGGTGTGAACTTGTGTGGTTTGTAA
mRNA sequence
ATGGGGGATGAAGATGAATGGTTTCGTGCAGTTGGTTTTGTTCGACCAGTAATTGTAGTATTCAAGATGAATGAAGATTTAAGATTATACGAAGAAGGTGTTTACAGTAG
TTGCATCTCTGGCACCGTTCCTGAGAAGGACGAATACCATGTCGTGCTTGCAGTTGGTTATGGGGTTGAAGACAGTGGAACTCAATACTGGATTTTAAAGAATTCATGGG
GAGCATGCTGGGGCGAAAATGGCTACTTCAAGATGTTGAGGGGTGTGAACTTGTGTGGTTTGTAA
Protein sequence
MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL
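
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)

# CDS and protein sequence copied from the record above.
CDS = (
    "ATGGGGGATGAAGATGAATGGTTTCGTGCAGTTGGTTTTGTTCGACCAGTAATTGTAGTATTCAAGATGAATGAAGATTTAAGATTATACGAAGAAGGTGTTTACAGTAG"
    "TTGCATCTCTGGCACCGTTCCTGAGAAGGACGAATACCATGTCGTGCTTGCAGTTGGTTATGGGGTTGAAGACAGTGGAACTCAATACTGGATTTTAAAGAATTCATGGG"
    "GAGCATGCTGGGGCGAAAATGGCTACTTCAAGATGTTGAGGGGTGTGAACTTGTGTGGTTTGTAA"
)
PROTEIN = (
    "MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGV"
    "EDSGTQYWILKNSWGACWGENGYFKMLRGVNLCGL"
)

print(translate(CDS) == PROTEIN)
```

The CDS is 285 nt long, which corresponds to 94 residues plus a terminal stop codon, in agreement with the 94-residue protein sequence.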