; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002505 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002505
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPapaya proteinase 4
Genome locationchr4:43409368..43409976
RNA-Seq ExpressionLag0002505
SyntenyLag0002505
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus]5.8e-0945.88Show/hide
Query:  ITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEF
        I IY F    S   G L  A+ S+ W+ F SWM + KK+Y  ++E L RF IF   +K+I++ NKE  G   GLN YSDLT+ EF
Subjt:  ITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEF

KAF3963543.1 hypothetical protein CMV_012083 [Castanea mollissima]2.2e-0849.23Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SW+ K  K YR  +E L RFEIF + +K+I+E+NKE+  Y LGLN+++DL+  EF +  +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

P05994.3 RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl endopeptidase; AltName: Full=Papaya peptidase B; AltName: Full=Papaya proteinase IV; Short=PPIV; Flags: Precursor [Carica papaya]2.6e-0950.75Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF SWM K  K Y+   E L RFEIF + +KYI+E+NK + GY LGLN++SDL++ EF    VG +
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

XP_021888999.1 papaya proteinase 4 [Carica papaya]2.6e-0950.75Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF SWM K  K Y+   E L RFEIF + +KYI+E+NK + GY LGLN++SDL++ EF    VG +
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

XP_038716878.1 cysteine protease XCP2-like [Tripterygium wilfordii]1.7e-0837.29Show/hide
Query:  SSSSSSSPAFCLMQNRLYRRLPRHREPITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLG
        SSSS++SP                    +I  F P D    G L         ELF+SW++K +K Y   +E L RFEIF + +K+IEE NK+   Y LG
Subjt:  SSSSSSSPAFCLMQNRLYRRLPRHREPITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLG

Query:  LNDYSDLTDVEFPHGLVG
        LN+Y+DL+  EF +  +G
Subjt:  LNDYSDLTDVEFPHGLVG

TrEMBL top hitse value%identityAlignment
A0A0A0L4P3 Inhibitor_I29 domain-containing protein2.8e-0945.88Show/hide
Query:  ITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEF
        I IY F    S   G L  A+ S+ W+ F SWM + KK+Y  ++E L RF IF   +K+I++ NKE  G   GLN YSDLT+ EF
Subjt:  ITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEF

A0A2C9UG35 Uncharacterized protein1.8e-0844.59Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGVRFVGHRG
        ELF+SWM K  K YR  +E L RFE+F + +K+I+ +N++L  Y LGLN+++DLT  EF    +G  R+   +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGVRFVGHRG

A0A2N9EIW6 Uncharacterized protein1.8e-0847.69Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SW+ K  K YR  +E L RFEIF + +K+I+E+NKE+  Y LGLN+++D++  EF +  +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

A0A6P9EI45 cysteine protease XCP1-like1.1e-0849.23Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SWM K  K YR  +E L RFE+F + +K+I+++NKE   Y LGLN+++DLT  EF +  +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

A0A7J7CYU5 Xylem cysteine proteinase 28.2e-0937.29Show/hide
Query:  SSSSSSSPAFCLMQNRLYRRLPRHREPITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLG
        SSSS++SP                    +I  F P D    G L         ELF+SW++K +K Y   +E L RFEIF + +K+IEE NK+   Y LG
Subjt:  SSSSSSSPAFCLMQNRLYRRLPRHREPITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLG

Query:  LNDYSDLTDVEFPHGLVG
        LN+Y+DL+  EF +  +G
Subjt:  LNDYSDLTDVEFPHGLVG

SwissProt top hitse value%identityAlignment
O65493 Cysteine protease XCP13.5e-0941.54Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SWM +  K Y+  +E + RFE+F   + +I+++N E+  Y LGLN+++DLT  EF    +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

P00784 Papain4.6e-0943.28Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF+SWM K  K Y+   E + RFEIF + +KYI+E NK+   Y LGLN ++D+++ EF     G +
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

P05994 Papaya proteinase 43.4e-1250.75Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF SWM K  K Y+   E L RFEIF + +KYI+E+NK + GY LGLN++SDL++ EF    VG +
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

P10056 Caricain1.0e-0844.78Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF SWM    K Y    E L RFEIF + + YI+E NK+   Y LGLN+++DL++ EF    VG +
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

P14080 Chymopapain1.8e-0846.27Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV
        +LF SWM K  K Y    E + RFEIF + + YI+E NK+   Y LGLN ++DL++ EF    VG V
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVGGV

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 24.7e-0940Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF++W+   +K Y   +E   RFE+F + +K+I+E NK+ + Y LGLN+++DL+  EF    +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

AT1G29090.1 Cysteine proteinases superfamily protein5.4e-0538.33Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELE-GYQLGLNDYSDLTDVEF
        E  Q WM +  + Y  E E   RF++F   +K+IE+ NK+ +  Y+LG+N+++D T  EF
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELE-GYQLGLNDYSDLTDVEF

AT3G49340.1 Cysteine proteinases superfamily protein7.6e-0741.67Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKEL-EGYQLGLNDYSDLTDVEF
        E  + WM +  + Y  + E   RFEIFTN +K++E  N    + Y L +N++SDLTD EF
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKEL-EGYQLGLNDYSDLTDVEF

AT4G35350.1 xylem cysteine peptidase 12.5e-1041.54Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SWM +  K Y+  +E + RFE+F   + +I+++N E+  Y LGLN+++DLT  EF    +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG

AT4G35350.2 xylem cysteine peptidase 12.5e-1041.54Show/hide
Query:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG
        ELF+SWM +  K Y+  +E + RFE+F   + +I+++N E+  Y LGLN+++DLT  EF    +G
Subjt:  ELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLGLNDYSDLTDVEFPHGLVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGGCGGCTTATACTCGCTTGGCGTTCATCTTCATCTTCTTCCTCGCCTGCCTTTTGCTTGATGCAGAATCGCCTCTACCGGAGGTTGCCTCGACACCGGGAGCC
CATCACGATTTACCCGTTTCATCCATATGATTCCGCTGATGCTGGAGAGTTGGCTAAAGCACAGTGTTCGGAGCACTGGGAGCTGTTCCAGTCGTGGATGAAGAAGACCA
AAAAGCGTTACAGGGGTGAGAAAGAGATGCTCTGTAGGTTTGAGATATTCACTAACCGTGTGAAGTATATTGAGGAGAAGAACAAGGAGCTAGAAGGCTATCAACTGGGG
TTGAATGATTACTCAGACTTGACAGATGTGGAATTTCCTCATGGGCTCGTGGGAGGAGTACGTTTCGTCGGGCACAGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGGCGGCTTATACTCGCTTGGCGTTCATCTTCATCTTCTTCCTCGCCTGCCTTTTGCTTGATGCAGAATCGCCTCTACCGGAGGTTGCCTCGACACCGGGAGCC
CATCACGATTTACCCGTTTCATCCATATGATTCCGCTGATGCTGGAGAGTTGGCTAAAGCACAGTGTTCGGAGCACTGGGAGCTGTTCCAGTCGTGGATGAAGAAGACCA
AAAAGCGTTACAGGGGTGAGAAAGAGATGCTCTGTAGGTTTGAGATATTCACTAACCGTGTGAAGTATATTGAGGAGAAGAACAAGGAGCTAGAAGGCTATCAACTGGGG
TTGAATGATTACTCAGACTTGACAGATGTGGAATTTCCTCATGGGCTCGTGGGAGGAGTACGTTTCGTCGGGCACAGAGGATGA
Protein sequenceShow/hide protein sequence
MAGRLILAWRSSSSSSSPAFCLMQNRLYRRLPRHREPITIYPFHPYDSADAGELAKAQCSEHWELFQSWMKKTKKRYRGEKEMLCRFEIFTNRVKYIEEKNKELEGYQLG
LNDYSDLTDVEFPHGLVGGVRFVGHRG