CuGenDBv2

Spg007945 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg007945
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: cysteine protease XCP2
Genome location: scaffold4:6425774..6427078
RNA-Seq Expression: Spg007945
Synteny: Spg007945
Gene Ontology terms:
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
GO:0097655 - serpin family protein binding (molecular function)
InterPro domains:
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAE8057439.1 hypothetical protein FH972_014132 [Carpinus fangiana] | e value: 2.9e-05 | % identity: 41.46
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQRHLHLVHPRRISVEDEDED
        ELF+SWM ++ +SYES EE L+RF IF + +K +DE+NKK       GL  FAD++ +E   Q HL L+ P  +      ++
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQRHLHLVHPRRISVEDEDED

KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus] | e value: 3.7e-05 | % identity: 43.48
Query:  MEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +E+AE S++W+ F SWM E+ + YES+EE L RF IF   +K + + NK+  G  TFGL  ++D+   E
Subjt:  MEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

XP_008451198.1 PREDICTED: xylem cysteine proteinase 1-like [Cucumis melo] | e value: 4.1e-04 | % identity: 45.76
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        ELF+SWM ++ ++Y S EE L+RF+IF + +K +DE+NKK       GL  FAD++ +E
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

XP_038878031.1 uncharacterized protein LOC120070224 [Benincasa hispida] | e value: 1.7e-05 | % identity: 33.91
Query:  LLRWVCSSSRRSVSYSYRYHLKSNIIVTPLPALIPN------LLRRAFSEPTRAPSKSSGGEVGKLSMEEAEGSENWELFKSWMLEND---RSYESEEEM
        +L W+CSSS                  +PL  LI N      LL  AFS  T AP       +  +++E  +  ENWE FKSW+L+ D   +SY+SE+E+
Subjt:  LLRWVCSSSRRSVSYSYRYHLKSNIIVTPLPALIPN------LLRRAFSEPTRAPSKSSGGEVGKLSMEEAEGSENWELFKSWMLEND---RSYESEEEM

Query:  LNRFQIFCNRVKMVD
        L +F++F  R++ ++
Subjt:  LNRFQIFCNRVKMVD

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida] | e value: 1.4e-07 | % identity: 47.56
Query:  KSSGGEVGKLSMEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVP
        KS G E    S+ +A  SE+WE FKSWM  +++ Y SEEEML RF +F   +K +++ NK Y G  TFG   F+D+  DEVP
Subjt:  KSSGGEVGKLSMEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4P3 Inhibitor_I29 domain-containing protein | e value: 1.8e-05 | % identity: 43.48
Query:  MEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +E+AE S++W+ F SWM E+ + YES+EE L RF IF   +K + + NK+  G  TFGL  ++D+   E
Subjt:  MEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

A0A2N9EW73 Uncharacterized protein | e value: 9.0e-05 | % identity: 47.46
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        ELF+SWM ++ ++YES EE L+RF+IF + +K +DE+NKK       GL  FAD++ +E
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

A0A2N9IUN2 Uncharacterized protein | e value: 9.0e-05 | % identity: 47.46
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        ELF+SWM ++ ++YES EE L+RF+IF + +K +DE+NKK       GL  FAD++ +E
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

A0A5A7UUL9 Xylem cysteine proteinase 1-like | e value: 2.0e-04 | % identity: 45.76
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        ELF+SWM ++ ++Y S EE L+RF+IF + +K +DE+NKK       GL  FAD++ +E
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

A0A5N6R8Z3 Inhibitor_I29 domain-containing protein | e value: 1.4e-05 | % identity: 41.46
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQRHLHLVHPRRISVEDEDED
        ELF+SWM ++ +SYES EE L+RF IF + +K +DE+NKK       GL  FAD++ +E   Q HL L+ P  +      ++
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQRHLHLVHPRRISVEDEDED

SwissProt top hits (e value, % identity, alignment)
O23791 Fruit bromelain | e value: 1.3e-05 | % identity: 37.7
Query:  FKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQ
        F+ WM E  R Y+ ++E + RFQIF N VK ++  N +     T G+  F DM + E   Q
Subjt:  FKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDEVPCQ

P00784 Papain | e value: 7.1e-07 | % identity: 44.07
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +LF+SWML++++ Y++ +E + RF+IF + +K +DE+NKK       GL  FADM+ DE
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

P05994 Papaya proteinase 4 | e value: 2.7e-06 | % identity: 42.37
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +LF SWML+++++Y++ +E L RF+IF + +K +DE NK   G    GL  F+D++ DE
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

P10056 Caricain | e value: 3.5e-06 | % identity: 44.07
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +LF SWML +++ YE+ +E L RF+IF + +  +DE+NKK       GL  FAD++ DE
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

P14080 Chymopapain | e value: 3.5e-06 | % identity: 44.07
Query:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +LF SWML++++ YES +E + RF+IF + +  +DE+NKK       GL  FAD++ DE
Subjt:  ELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE

Arabidopsis top hits (e value, % identity, alignment)
AT3G19390.1 Granulin repeat cysteine protease family protein | e value: 5.7e-04 | % identity: 33.33
Query:  SENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE
        +E   +++ W++EN ++Y    E   RF+IF + +K V+E +         GLT FAD+  DE
Subjt:  SENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVDESNKKYRGRPTFGLTCFADMNRDE


Sequences
CDS sequence
ATGGCGGCGTCGTTACTCCGTTGGGTGTGCTCGTCATCTCGCAGATCTGTATCGTATTCCTATCGTTATCATCTTAAATCTAATATTATCGTTACTCCGTTACCTGCCTT
GATCCCGAATCTTCTCCGTCGCGCCTTCTCAGAGCCAACCAGGGCGCCGTCCAAGTCGAGTGGTGGAGAAGTTGGTAAGTTATCAATGGAGGAAGCAGAAGGTTCGGAGA
ATTGGGAGTTGTTCAAGTCATGGATGTTGGAGAACGATAGGAGTTACGAGAGCGAGGAAGAGATGTTGAATAGGTTTCAGATATTCTGTAACAGGGTGAAGATGGTTGAT
GAGTCGAACAAGAAGTATCGTGGCCGTCCGACATTTGGGTTGACTTGCTTTGCAGACATGAACCGTGATGAGGTCCCGTGCCAGCGTCACCTCCATTTGGTACATCCCCG
CCGGATAAGTGTGGAGGACGAGGACGAGGACGAGATCGAGATCGAGGTCGAGGAGGAGGAGGAGGACGAGGACTAG
mRNA sequence
ATGGCGGCGTCGTTACTCCGTTGGGTGTGCTCGTCATCTCGCAGATCTGTATCGTATTCCTATCGTTATCATCTTAAATCTAATATTATCGTTACTCCGTTACCTGCCTT
GATCCCGAATCTTCTCCGTCGCGCCTTCTCAGAGCCAACCAGGGCGCCGTCCAAGTCGAGTGGTGGAGAAGTTGGTAAGTTATCAATGGAGGAAGCAGAAGGTTCGGAGA
ATTGGGAGTTGTTCAAGTCATGGATGTTGGAGAACGATAGGAGTTACGAGAGCGAGGAAGAGATGTTGAATAGGTTTCAGATATTCTGTAACAGGGTGAAGATGGTTGAT
GAGTCGAACAAGAAGTATCGTGGCCGTCCGACATTTGGGTTGACTTGCTTTGCAGACATGAACCGTGATGAGGTCCCGTGCCAGCGTCACCTCCATTTGGTACATCCCCG
CCGGATAAGTGTGGAGGACGAGGACGAGGACGAGATCGAGATCGAGGTCGAGGAGGAGGAGGAGGACGAGGACTAG
Protein sequence
MAASLLRWVCSSSRRSVSYSYRYHLKSNIIVTPLPALIPNLLRRAFSEPTRAPSKSSGGEVGKLSMEEAEGSENWELFKSWMLENDRSYESEEEMLNRFQIFCNRVKMVD
ESNKKYRGRPTFGLTCFADMNRDEVPCQRHLHLVHPRRISVEDEDEDEIEIEVEEEEEDED
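
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1) applies, and the function name `translate` is hypothetical. It reads codons in frame from the first base and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated with base order T, C, A, G.
# Assumption: the standard nuclear code is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGGCGGCGTCGTTACTCCGTTGGGTGTGC"))  # MAASLLRWVC
```

Passing the full CDS, with its line breaks, to `translate` gives a string that can be compared directly with the protein sequence above.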