CuGenDBv2

Lag0014921 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014921
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Xylem cysteine proteinase 1
Genome location: chr12:5903816..5904148
RNA-Seq Expression: Lag0014921
Synteny: Lag0014921
Gene Ontology terms:
GO:0110165 - cellular anatomical structure (cellular component)
GO:0008233 - peptidase activity (molecular function)
GO:0097655 - serpin family protein binding (molecular function)
InterPro domains:
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits
3TNX_A Structure of the precursor of a thermostable variant of papain at 2.6 Angstroem resolution [Carica papaya] (e value 3.5e-04, identity 44.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF+SWMLK+++ Y++  E +YR++IF   +K +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

4QRG_A Crystal structure of I86L mutant of papain [Carica papaya] (e value 3.5e-04, identity 44.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF+SWMLK+++ Y++  E +YR++IF   +K +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

KAE8057439.1 hypothetical protein FH972_014132 [Carpinus fangiana] (e value 3.1e-05, identity 45.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRLRHLHLVHP
        ELF+SWM K+ +SYES +E L+R+ IF   +K +DE+NKK       GL  FAD + +E  ++ HL L+ P
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRLRHLHLVHP

KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus] (e value 1.8e-05, identity 41.1%)
Query:  MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL
        +E+AE S++W+ F SWM ++ + YES++E LYR+ IF   +K + + NK+  G  TFGL  ++D  + E  RL
Subjt:  MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida] (e value 2.0e-07, identity 48.48%)
Query:  SENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPR
        SE+WE FKSWM  +++ Y SE+EMLYR+ +F K +K +++ NK   G  TFG   F+D   DEVP+
Subjt:  SENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPR

TrEMBL top hits
A0A0A0L4P3 Inhibitor_I29 domain-containing protein (e value 8.9e-06, identity 41.1%)
Query:  MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL
        +E+AE S++W+ F SWM ++ + YES++E LYR+ IF   +K + + NK+  G  TFGL  ++D  + E  RL
Subjt:  MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL

A0A5N6R8Z3 Inhibitor_I29 domain-containing protein (e value 1.5e-05, identity 45.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRLRHLHLVHP
        ELF+SWM K+ +SYES +E L+R+ IF   +K +DE+NKK       GL  FAD + +E  ++ HL L+ P
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRLRHLHLVHP

F6KSW9 Chymopapain (Fragment) (e value 4.9e-04, identity 45.76%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWMLK+++ YES  E +YR++IF   +  +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

Q9SMI0 Chymopapain isoform III (e value 4.9e-04, identity 45.76%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWMLK+++ YES  E +YR++IF   +  +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

Q9SMI1 Chymopapain isoform II (e value 4.9e-04, identity 45.76%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWMLK+++ YES  E +YR++IF   +  +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

SwissProt top hits
P00784 Papain (e value 4.5e-07, identity 44.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF+SWMLK+++ Y++  E +YR++IF   +K +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

P05994 Papaya proteinase 4 (e value 1.3e-06, identity 44.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWMLK++++Y++  E LYR++IF   +K +DE NK   G    GL  F+D ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

P10056 Caricain (e value 3.8e-06, identity 44.07%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWML +++ YE+  E LYR++IF   +  +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

P14080 Chymopapain (e value 1.3e-06, identity 45.76%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE
        +LF SWMLK+++ YES  E +YR++IF   +  +DE+NKK       GL  FAD ++DE
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDE

Q9LM66 Cysteine protease XCP2 (e value 4.7e-04, identity 33.33%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL
        ELF++W+   +++YE+ +E   R+++F   +K +DE+NKK +     GL  FAD + +E  ++
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL

Arabidopsis top hits
AT1G20850.1 xylem cysteine peptidase 2 (e value 3.3e-05, identity 33.33%)
Query:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL
        ELF++W+   +++YE+ +E   R+++F   +K +DE+NKK +     GL  FAD + +E  ++
Subjt:  ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL
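
The % identity values listed with each hit can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below recomputes the value for the AT1G20850.1 hit above. The function name `percent_identity` is hypothetical, and the calculation assumes that identity is reported as identical positions divided by alignment length (here the length of the Query line), which matches the listed figure for this hit.

```python
# Query line and midline copied from the AT1G20850.1 alignment above.
QUERY = "ELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRL"
MIDLINE = "ELF++W+   +++YE+ +E   R+++F   +K +DE+NKK +     GL  FAD + +E  ++"


def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over alignment length.

    Assumption: the Query line spans the full alignment, so its length
    (including any gap characters) is the alignment length.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query), 2)


print(percent_identity(QUERY, MIDLINE))  # 33.33
```

The alignment length is taken from the Query line rather than the midline, so trailing whitespace lost from the midline during copying does not change the result.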


Sequences
CDS sequence
ATGGAGGAAGCAGAAGGTTCGGAGAATTGGGAATTGTTCAAGTCATGGATGTTGAAGAACGATAGGAGTTACGAGAGCGAGAAAGAGATGTTGTATAGGTATCAGATATT
CTGTAAGAGGGTGAAGATGGTTGATGAGTCGAACAAGAAGTGTCGTGGCCGTCCGACATTTGGGTTGACTTGCTTTGCAGACAAGAACTCGGATGAGGTCCCGCGCCTGC
GTCACCTCCATTTGGTACATCCCCGCGGAGGACGAGGACGAGGACGAGGCCTAGCTAATTGCAGCATTCATTCCAATTCTAATCCTTTTAATTTTATTATATTTGAACTA
TAG
mRNA sequence
ATGGAGGAAGCAGAAGGTTCGGAGAATTGGGAATTGTTCAAGTCATGGATGTTGAAGAACGATAGGAGTTACGAGAGCGAGAAAGAGATGTTGTATAGGTATCAGATATT
CTGTAAGAGGGTGAAGATGGTTGATGAGTCGAACAAGAAGTGTCGTGGCCGTCCGACATTTGGGTTGACTTGCTTTGCAGACAAGAACTCGGATGAGGTCCCGCGCCTGC
GTCACCTCCATTTGGTACATCCCCGCGGAGGACGAGGACGAGGACGAGGCCTAGCTAATTGCAGCATTCATTCCAATTCTAATCCTTTTAATTTTATTATATTTGAACTA
TAG
Protein sequence
MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEVPRLRHLHLVHPRGGRGRGRGLANCSIHSNSNPFNFIIFEL
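
As a consistency check on the record, the CDS can be translated and compared with the listed protein sequence, and its length compared with the genome coordinates. The sketch below is a minimal example that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS and protein copied from the record above.
CDS = (
    "ATGGAGGAAGCAGAAGGTTCGGAGAATTGGGAATTGTTCAAGTCATGGATGTTGAAGAACGATAGGAGTTACGAGAGCGAGAAAGAGATGTTGTATAGGTATCAGATATT"
    "CTGTAAGAGGGTGAAGATGGTTGATGAGTCGAACAAGAAGTGTCGTGGCCGTCCGACATTTGGGTTGACTTGCTTTGCAGACAAGAACTCGGATGAGGTCCCGCGCCTGC"
    "GTCACCTCCATTTGGTACATCCCCGCGGAGGACGAGGACGAGGACGAGGCCTAGCTAATTGCAGCATTCATTCCAATTCTAATCCTTTTAATTTTATTATATTTGAACTA"
    "TAG"
)
PROTEIN = (
    "MEEAEGSENWELFKSWMLKNDRSYESEKEMLYRYQIFCKRVKMVDESNKKCRGRPTFGLTCFADKNSDEV"
    "PRLRHLHLVHPRGGRGRGRGLANCSIHSNSNPFNFIIFEL"
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Genome location chr12:5903816..5904148 uses inclusive coordinates.
span = 5904148 - 5903816 + 1

print(len(CDS), span)             # 333 333
print(translate(CDS) == PROTEIN)  # True
```

The genomic span and the CDS are both 333 bp, and the CDS and mRNA sequences listed above are identical, which is consistent with a gene model that has a single exon and no annotated UTRs.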