; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014954 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014954
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCysteine proteinase
Genome locationchr12:6165624..6166270
RNA-Seq ExpressionLag0014954
SyntenyLag0014954
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3971741.1 hypothetical protein CMV_004687 [Castanea mollissima]1.3e-1142.31Show/hide
Query:  TYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLK
        +Y++++F+ +F ++   L A   L   +    Y P       +  ELF+SW+ KH K YR  +E L+RFEIF D +K+I+E+NKE+ SY LGLN+F+DL 
Subjt:  TYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLK

Query:  DEEF
         EEF
Subjt:  DEEF

XP_002273243.2 PREDICTED: cysteine protease XCP1 [Vitis vinifera]1.8e-1142.86Show/hide
Query:  FIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR
        F+ F++ + + A S     +    Y P D     +  +LF+SWM KH K YR  +E L+RFE+F D +K+I+E NK++ SY LGLN+F+DL  EEF R
Subjt:  FIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR

XP_023885642.1 cysteine protease XCP1-like [Quercus suber]1.8e-1138.46Show/hide
Query:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL
        +++++++F+ IF ++   L A   L   +    Y P       +  ELF+SW+ KH K YR  +E L+RFEIF D +K+I+E+NKE+ SY LGLN+F+DL
Subjt:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL

Query:  KDEEF--------PRGGELRSLLQDSRLRD
          EEF        P   E R   +D   RD
Subjt:  KDEEF--------PRGGELRSLLQDSRLRD

XP_030973913.1 cysteine protease XCP1-like [Quercus lobata]1.8e-1137.69Show/hide
Query:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL
        +++++++F+ +F ++   L A   L   +    Y P       +  ELF+SW+ KH K YR  +E L+RFEIF D +K+I+E+NKE+ SY LGLN+F+DL
Subjt:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL

Query:  KDEEF--------PRGGELRSLLQDSRLRD
          EEF        P   E R   +D   RD
Subjt:  KDEEF--------PRGGELRSLLQDSRLRD

XP_038888276.1 cysteine protease XCP1-like [Benincasa hispida]1.8e-1143.27Show/hide
Query:  LAFIFIFFLTCLLLDA----ESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLK
        +AF   F  T L+L A       +   +    Y P       +  ELF+SWMKKH K+Y+  +E L+RFEIF D +K+I+E NK++ SY LGLN+F+DL 
Subjt:  LAFIFIFFLTCLLLDA----ESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLK

Query:  DEEF
         EEF
Subjt:  DEEF

TrEMBL top hitse value%identityAlignment
A0A2N9EW73 Uncharacterized protein5.0e-1246.39Show/hide
Query:  IFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        + I F  CLL  A S L   +    Y P D     +  ELF+SWM KH K Y   +E L+RFEIF D +K+I+E NK++ +Y LGLN+F+DL  EEF
Subjt:  IFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

A0A2N9IUN2 Uncharacterized protein5.0e-1246.39Show/hide
Query:  IFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        + I F  CLL  A S L   +    Y P D     +  ELF+SWM KH K Y   +E L+RFEIF D +K+I+E NK++ +Y LGLN+F+DL  EEF
Subjt:  IFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

A0A5A7UUL9 Xylem cysteine proteinase 1-like1.1e-1152.78Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y P       +  ELF+SWM KH K YR  +E L+RFEIF D +K+I+E NK++ SY LGLN+F+DL  EEF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

A0A7N2LWE4 Uncharacterized protein8.5e-1237.69Show/hide
Query:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL
        +++++++F+ +F ++   L A   L   +    Y P       +  ELF+SW+ KH K YR  +E L+RFEIF D +K+I+E+NKE+ SY LGLN+F+DL
Subjt:  TTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDL

Query:  KDEEF--------PRGGELRSLLQDSRLRD
          EEF        P   E R   +D   RD
Subjt:  KDEEF--------PRGGELRSLLQDSRLRD

F6I6V5 Uncharacterized protein8.5e-1251.35Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR
        Y P D     +  +LF+SWM KH K YR  +E L+RFE+F D +K+I+E NK++ SY LGLN+F+DL  EEF R
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR

SwissProt top hitse value%identityAlignment
O65493 Cysteine protease XCP11.1e-1143.06Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y P     + +  ELF+SWM +H K Y+  +E ++RFE+F + + +I+++N E++SY LGLN+F+DL  EEF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

P00784 Papain6.3e-1248.61Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y   D   +    +LF+SWM KH K Y+   E +YRFEIF D +KYI+E NK+ +SY LGLN F+D+ ++EF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

P05994 Papaya proteinase 41.1e-1351.39Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y   D   +    +LF SWM KH K Y+   E LYRFEIF D +KYI+E+NK ++ Y LGLN+FSDL ++EF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

P10056 Caricain1.8e-1148.61Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y   D   +    +LF SWM  H K Y    E LYRFEIF D + YI+E NK+ +SY LGLN+F+DL ++EF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

P14080 Chymopapain2.2e-1242.27Show/hide
Query:  IFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR
        IF  TCL++       + Y    Y   D        +LF SWM KH K Y    E +YRFEIF D + YI+E NK+ +SY LGLN F+DL ++EF +
Subjt:  IFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 23.2e-1141.77Show/hide
Query:  YPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR
        Y    Y P D     +  ELF++W+   +K Y   +E   RFE+F D +K+I+E NK+  SY LGLN+F+DL  EEF +
Subjt:  YPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEFPR

AT3G49340.1 Cysteine proteinases superfamily protein2.4e-0641.67Show/hide
Query:  ELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELD-SYQLGLNDFSDLKDEEF
        E  + WM +  + Y  + E   RFEIFT+ +K++E  N   + +Y L +N+FSDL DEEF
Subjt:  ELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELD-SYQLGLNDFSDLKDEEF

AT4G23520.1 Cysteine proteinases superfamily protein8.1e-0737.37Show/hide
Query:  IFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWE-LFQSWMKKHKKRYR---GEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        +F L   +L A S   ++    P        S+E  E +FQ WM KH K Y    GEKE   RF+ F D +++I++ N +  SYQLGL  F+DL  +E+
Subjt:  IFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWE-LFQSWMKKHKKRYR---GEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

AT4G35350.1 xylem cysteine peptidase 17.6e-1343.06Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y P     + +  ELF+SWM +H K Y+  +E ++RFE+F + + +I+++N E++SY LGLN+F+DL  EEF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF

AT4G35350.2 xylem cysteine peptidase 17.6e-1343.06Show/hide
Query:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF
        Y P     + +  ELF+SWM +H K Y+  +E ++RFE+F + + +I+++N E++SY LGLN+F+DL  EEF
Subjt:  YDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKELDSYQLGLNDFSDLKDEEF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGGTGAGGACTGGGGAGAGCAAAACGAAACCCTAAACCCTAGGCGCTGCGAAAATGGCGGGACGACTTATACTCGCTTGGCGTTCATCTTCATCTTCTTCCT
CACCTGCCTTTTGCTTGATGCAGAATCGCCTCTACCGGAGATTTACCCGTTTCATCCATATGATCCTGCTGATGCTGGAGAGTCTTCGGAGCACTGGGAGTTGTTCCAGT
CGTGGATGAAGAAGCACAAAAAGCGTTACAGGGGTGAGAAAGAGATGCTCTATAGGTTTGAGATATTCACTGACTGTGTGAAGTACATTGAGGAGAAGAACAAGGAGCTA
GATAGCTATCAACTGGGCTTGAATGATTTTTCAGACTTGAAAGATGAGGAGTTTCCTCGTGGAGGGGAGCTCCGGAGCCTCCTGCAGGATTCGAGATTGAGGGATACAGG
ATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGGGTGAGGACTGGGGAGAGCAAAACGAAACCCTAAACCCTAGGCGCTGCGAAAATGGCGGGACGACTTATACTCGCTTGGCGTTCATCTTCATCTTCTTCCT
CACCTGCCTTTTGCTTGATGCAGAATCGCCTCTACCGGAGATTTACCCGTTTCATCCATATGATCCTGCTGATGCTGGAGAGTCTTCGGAGCACTGGGAGTTGTTCCAGT
CGTGGATGAAGAAGCACAAAAAGCGTTACAGGGGTGAGAAAGAGATGCTCTATAGGTTTGAGATATTCACTGACTGTGTGAAGTACATTGAGGAGAAGAACAAGGAGCTA
GATAGCTATCAACTGGGCTTGAATGATTTTTCAGACTTGAAAGATGAGGAGTTTCCTCGTGGAGGGGAGCTCCGGAGCCTCCTGCAGGATTCGAGATTGAGGGATACAGG
ATGA
Protein sequenceShow/hide protein sequence
MEKGEDWGEQNETLNPRRCENGGTTYTRLAFIFIFFLTCLLLDAESPLPEIYPFHPYDPADAGESSEHWELFQSWMKKHKKRYRGEKEMLYRFEIFTDCVKYIEEKNKEL
DSYQLGLNDFSDLKDEEFPRGGELRSLLQDSRLRDTG