; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022019 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022019
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionThiol protease aleurain-like
Genome locationscaffold2:9988666..9989382
RNA-Seq ExpressionSpg022019
SyntenySpg022019
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG4942418.1 hypothetical protein JHK85_047064 [Glycine max]1.4e-2259.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

KHN17725.1 Thiol protease aleurain-like [Glycine soja]1.4e-2259.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

XP_003551114.1 pro-cathepsin H [Glycine max]1.4e-2259.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

XP_026441009.1 thiol protease aleurain-like [Papaver somniferum]5.3e-2258.7Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AV FVRPV + F++ +  RLY+EGV++S   GT P  D  H VLAVGYGVE +GT YW++KNSWGA WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

XP_028211162.1 thiol protease aleurain-like [Glycine soja]1.4e-2259.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

TrEMBL top hitse value%identityAlignment
A0A0R4J5M9 Uncharacterized protein6.7e-2359.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

A0A445G211 Thiol protease aleurain-like6.7e-2359.78Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE  +AV FVRPV V F++ +D R Y  GVY+S I G+ P  D  H VLAVGYGVED G  YWI+KNSWG+ WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

A0A5N6QK05 Uncharacterized protein3.3e-2258.7Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AV FVRPV V F++  D RLY+EGVY+S   GT P  D  H VLAVGYGVE +G  YW++KNSWG  WG+NG+FKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

A0A7C9DHZ8 Uncharacterized protein (Fragment)3.3e-2257.61Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AV F+RPV V F++  D R Y+EGVY+S + G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG++GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

A0A7C9DIJ4 Pept_C1 domain-containing protein (Fragment)4.3e-2257.61Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AV F+RPV V F++  D R Y+EGVY+S + G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG++GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

SwissProt top hitse value%identityAlignment
A0A072UTP9 Pro-cathepsin H5.8e-2455.43Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AV F RPV V F++ +D RLY++GVY+S   G+ P  D  H VLAVGYG+ED G  YW++KNSWG  WG++GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

P05167 Thiol protease aleurain6.4e-2358.43Show/hide
Query:  EDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        EDE   AVG VRPV V F++ +  R Y+ GVY+S   GT P+ D  H VLAVGYGVE +G  YW++KNSWGA WG+NGYFKM  G N+C
Subjt:  EDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

P25778 Oryzain gamma chain1.7e-2357.61Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV V F++    R+Y+ GVY+S   GT P  D  H VLAVGYGVE +G  YW++KNSWGA WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

Q10717 Cysteine proteinase 21.7e-2358.7Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV V F++    RLY+ GVY+S   GT P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

Q8H166 Thiol protease aleurain3.8e-2355.43Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

Arabidopsis top hitse value%identityAlignment
AT3G45310.1 Cysteine proteinases superfamily protein1.3e-2354.35Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV V F++  + R Y++GV++S   G  P  D  H VLAVGYGVED    YW++KNSWG  WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT3G45310.2 Cysteine proteinases superfamily protein1.3e-2354.35Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV V F++  + R Y++GV++S   G  P  D  H VLAVGYGVED    YW++KNSWG  WG+NGYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT5G60360.1 aleurain-like protease2.7e-2455.43Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT5G60360.2 aleurain-like protease2.7e-2455.43Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC

AT5G60360.3 aleurain-like protease2.7e-2455.43Show/hide
Query:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC
        +G EDE   AVG VRPV + F++    RLY+ GVY+    G+ P  D  H VLAVGYGVED G  YW++KNSWGA WG+ GYFKM  G N+C
Subjt:  MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGATGAAGATGAATGGTTTCGTGCAGTTGGTTTTGTTCGACCAGTAATTGTAGTATTCAAGATGAATGAAGATTTAAGATTATACGAAGAAGGTGTTTACAGTAG
TTGCATCTCTGGCACCGTTCCTGAGAAGGACGAATACCATGTCGTGCTTGCAGTTGGTTATGGGGTTGAAGACAGTGGAACTCAATACTGGATTTTAAAGAATTCATGGG
GAGCATGCTGGGGCGAAAATGGCTACTTCAAGATGTTGAGGGGTGTGAACTTGTGT
mRNA sequenceShow/hide mRNA sequence
ATGGGGGATGAAGATGAATGGTTTCGTGCAGTTGGTTTTGTTCGACCAGTAATTGTAGTATTCAAGATGAATGAAGATTTAAGATTATACGAAGAAGGTGTTTACAGTAG
TTGCATCTCTGGCACCGTTCCTGAGAAGGACGAATACCATGTCGTGCTTGCAGTTGGTTATGGGGTTGAAGACAGTGGAACTCAATACTGGATTTTAAAGAATTCATGGG
GAGCATGCTGGGGCGAAAATGGCTACTTCAAGATGTTGAGGGGTGTGAACTTGTGT
Protein sequenceShow/hide protein sequence
MGDEDEWFRAVGFVRPVIVVFKMNEDLRLYEEGVYSSCISGTVPEKDEYHVVLAVGYGVEDSGTQYWILKNSWGACWGENGYFKMLRGVNLC