; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022132 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022132
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPept_C1 domain-containing protein
Genome locationchr7:19156362..19158889
RNA-Seq ExpressionLag0022132
SyntenyLag0022132
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR012599 - Peptidase C1A, propeptide
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus]1.3e-5692.62Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSH Y SLSLLFLAA+CTFHH QVYAEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

XP_008465335.1 PREDICTED: cathepsin B-like isoform X1 [Cucumis melo]3.4e-5791.8Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAA+CTFHHQQV+AEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        +LP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

XP_011652326.1 cathepsin B-like protease 2 isoform X1 [Cucumis sativus]1.4e-5893.44Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSH Y SLSLLFLAA+CTFHHQQVYAEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]1.2e-5995.08Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSHLY SLSLLFLAA+CTFHHQQVYAEEQVL+FKFNADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]1.2e-5794.26Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSHLY SLSLLFLAA+CTFHH QVYAEEQVL+FKFNADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein6.7e-5993.44Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSH Y SLSLLFLAA+CTFHHQQVYAEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

A0A1S3CNJ5 cathepsin B-like isoform X11.6e-5791.8Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAA+CTFHHQQV+AEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        +LP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

A0A1S3CNM3 cathepsin B-like isoform X21.5e-5590.98Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAA+CTFHH QV+AEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        +LP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

A0A5A7U7U4 Cathepsin B-like isoform X21.5e-5590.98Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAA+CTFHH QV+AEEQVLKFK +ADILQESIVRHVNEHP+AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        +LP+SFDAREAWPQCISIGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

A0A6J1I7T5 cathepsin B-like protease 26.5e-5486.89Show/hide
Query:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        MASSH + SLSLLFL A C  HH QVYAEEQVLKFK NADILQESIVRHVNEHP AGWKA MNP FSNYSVSQFKH+LGVKQ+PEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPQSFDAREAWPQCISIGTIL
        KLP+SFDAREAWPQCI+IGTIL
Subjt:  KLPQSFDAREAWPQCISIGTIL

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 11.1e-2148.67Show/hide
Query:  LSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAR
        L+ +FL    +F+ Q + A E + K K  + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SLKLP+ FDAR
Subjt:  LSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAR

Query:  EAWPQCISIGTIL
         AW  C SI  IL
Subjt:  EAWPQCISIGTIL

P25792 Cathepsin B-like cysteine proteinase2.7e-0931.86Show/hide
Query:  SLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGV-KQTPEKDLKSTPVLSHPK-SLKLPQSFDA
        S+L +A++ TF    +  + +    KF  + L + I+ ++NEHP AGW+A  + RF  +S+   +  +G  ++ P+   K  P + H   ++++P +FD+
Subjt:  SLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGV-KQTPEKDLKSTPVLSHPK-SLKLPQSFDA

Query:  REAWPQCISIGTI
        R+ WP C SI TI
Subjt:  REAWPQCISIGTI

P43157 Cathepsin B-like cysteine proteinase8.8e-0835.37Show/hide
Query:  LQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGV-KQTPEKDLKSTPVLS-HPKSLKLPQSFDAREAWPQCISIGTI
        L + ++  +NEHP AGWKA  + RF  +S+   + L+G  K+  E      P +  H  ++++P  FD+R+ WP C SI  I
Subjt:  LQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGV-KQTPEKDLKSTPVLS-HPKSLKLPQSFDAREAWPQCISIGTI

Q93VC9 Cathepsin B-like protease 22.7e-2550.83Show/hide
Query:  SSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKL
        S+ ++F L LL    I +F+  Q  A E + K K  + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SLKL
Subjt:  SSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKL

Query:  PQSFDAREAWPQCISIGTIL
        P+ FDAR AW QC SIG IL
Subjt:  PQSFDAREAWPQCISIGTIL

Q94K85 Cathepsin B-like protease 34.2e-2653.64Show/hide
Query:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW
        L L  +  F  + + A E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLP++FDAR AW
Subjt:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW

Query:  PQCISIGTIL
        PQC SIG IL
Subjt:  PQCISIGTIL

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein7.6e-2348.67Show/hide
Query:  LSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAR
        L+ +FL    +F+ Q + A E + K K  + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SLKLP+ FDAR
Subjt:  LSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAR

Query:  EAWPQCISIGTIL
         AW  C SI  IL
Subjt:  EAWPQCISIGTIL

AT1G02305.1 Cysteine proteinases superfamily protein1.9e-2650.83Show/hide
Query:  SSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKL
        S+ ++F L LL    I +F+  Q  A E + K K  + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SLKL
Subjt:  SSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKL

Query:  PQSFDAREAWPQCISIGTIL
        P+ FDAR AW QC SIG IL
Subjt:  PQSFDAREAWPQCISIGTIL

AT4G01610.1 Cysteine proteinases superfamily protein3.0e-2753.64Show/hide
Query:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW
        L L  +  F  + + A E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLP++FDAR AW
Subjt:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW

Query:  PQCISIGTIL
        PQC SIG IL
Subjt:  PQCISIGTIL

AT4G01610.2 Cysteine proteinases superfamily protein6.0e-2854.05Show/hide
Query:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW
        L L  +  F  + + A E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLP++FDAR AW
Subjt:  LFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNEHPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAW

Query:  PQCISIGTILG
        PQC SIG ILG
Subjt:  PQCISIGTILG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAAAATACTGGGGAAACTTGGGATTGGTCCATTTTCTCAAATCTCAACTCTCTACCTTTTATTTGCAACCCATCTCTTCCCATTTGCCCCATCCACTCTCTCTTC
ATCCTTCTTCATCGCCTCTGTTCCTCCACCTGATTCTGATTCTGCCTTCCAAAATAGCAAGGAGATGGCATCATCTCACTTGTATTTTTCCCTTTCCTTGCTATTTTTGG
CCGCCATCTGCACCTTTCATCATCAGCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAATTTAACGCTGATATTCTTCAGGAGTCTATCGTTCGCCACGTAAATGAA
CACCCAAAGGCTGGCTGGAAAGCTACAATGAACCCACGTTTTTCGAACTATTCTGTTAGCCAATTCAAGCACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGATTTAAA
AAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAAGTTGCCACAAAGTTTTGATGCAAGAGAAGCTTGGCCTCAGTGTATCTCGATTGGAACCATTCTAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAAAATACTGGGGAAACTTGGGATTGGTCCATTTTCTCAAATCTCAACTCTCTACCTTTTATTTGCAACCCATCTCTTCCCATTTGCCCCATCCACTCTCTCTTC
ATCCTTCTTCATCGCCTCTGTTCCTCCACCTGATTCTGATTCTGCCTTCCAAAATAGCAAGGAGATGGCATCATCTCACTTGTATTTTTCCCTTTCCTTGCTATTTTTGG
CCGCCATCTGCACCTTTCATCATCAGCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAATTTAACGCTGATATTCTTCAGGAGTCTATCGTTCGCCACGTAAATGAA
CACCCAAAGGCTGGCTGGAAAGCTACAATGAACCCACGTTTTTCGAACTATTCTGTTAGCCAATTCAAGCACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGATTTAAA
AAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAAGTTGCCACAAAGTTTTGATGCAAGAGAAGCTTGGCCTCAGTGTATCTCGATTGGAACCATTCTAGGTTAA
Protein sequenceShow/hide protein sequence
MSKILGKLGIGPFSQISTLYLLFATHLFPFAPSTLSSSFFIASVPPPDSDSAFQNSKEMASSHLYFSLSLLFLAAICTFHHQQVYAEEQVLKFKFNADILQESIVRHVNE
HPKAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLKLPQSFDAREAWPQCISIGTILG