; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018094 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018094
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptioncathepsin B-like protease 2
Genome locationtig00153092:1388858..1391782
RNA-Seq ExpressionSgr018094
SyntenySgr018094
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR012599 - Peptidase C1A, propeptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus]9.8e-4486.67Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASSH Y S+SLLFLA +CTFHHQVYAEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]2.4e-4284.76Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

XP_011652326.1 cathepsin B-like protease 2 isoform X1 [Cucumis sativus]2.4e-4285.85Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASSH Y S+SLLFLA +CTFHH QVYAEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KVPLRF
        K+P  F
Subjt:  KVPLRF

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]3.2e-4285.85Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASSHLY S+SLLFLA +CTFHH QVYAEEQVL+FK NADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KVPLRF
        K+P  F
Subjt:  KVPLRF

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]1.3e-4386.67Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASSHLY S+SLLFLA +CTFHHQVYAEEQVL+FK NADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein1.2e-4285.85Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASSH Y S+SLLFLA +CTFHH QVYAEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KVPLRF
        K+P  F
Subjt:  KVPLRF

A0A1S3CNJ5 cathepsin B-like isoform X12.9e-4183.96Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASS LY S+SLLFLA +CTFHH QV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KVPLRF
        ++P  F
Subjt:  KVPLRF

A0A1S3CNM3 cathepsin B-like isoform X21.2e-4284.76Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

A0A5A7U7U4 Cathepsin B-like isoform X21.2e-4284.76Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

A0A6J1DZC8 cathepsin B-like protease 21.3e-4183.81Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS  YFS+SLLF A + +FHHQVYAEEQVLKFKLNADILQESIV+QVNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPE+DL+ST VVSHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  VPLRF
        +P  F
Subjt:  VPLRF

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 13.1e-1645.54Show/hide
Query:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTV
        ++ +FL    +F+ Q  A E + K KL + ILQ  IV++VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P+V H  SLK+P  F    
Subjt:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTV

Query:  A
        A
Subjt:  A

Q93VC9 Cathepsin B-like protease 23.1e-1645.87Show/hide
Query:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKV
        S+ ++F + LL    I +F+  Q  A E + K KL + ILQ  IV++VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P+VSH  SLK+
Subjt:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKV

Query:  PLRFRGTVA
        P  F    A
Subjt:  PLRFRGTVA

Q94K85 Cathepsin B-like protease 39.6e-1847.96Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLK+P  F    A
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein2.2e-1745.54Show/hide
Query:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTV
        ++ +FL    +F+ Q  A E + K KL + ILQ  IV++VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P+V H  SLK+P  F    
Subjt:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTV

Query:  A
        A
Subjt:  A

AT1G02305.1 Cysteine proteinases superfamily protein2.2e-1745.87Show/hide
Query:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKV
        S+ ++F + LL    I +F+  Q  A E + K KL + ILQ  IV++VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P+VSH  SLK+
Subjt:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKV

Query:  PLRFRGTVA
        P  F    A
Subjt:  PLRFRGTVA

AT4G01610.1 Cysteine proteinases superfamily protein6.8e-1947.96Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLK+P  F    A
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA

AT4G01610.2 Cysteine proteinases superfamily protein6.8e-1947.96Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLK+P  F    A
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCATCTCACTTGTATTTTTCCATTTCCTTGCTATTTTTGGCAACCATCTGCACTTTCCATCACCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCAA
CGCTGATATTCTTCAGGAGTCAATCGTTCAGCAGGTAAATGAACACCCACTGGCTGGATGGAAAGCAACCATGAATCCACGTTTTTCGAATTATTCTGTTAGCCAATTCA
AGCACCTGCTTGGTGTAAAACAAACTCCTGAAAAGGATTTAAAAAGTACTCCTGTTGTATCCCATCCCAAGTCGTTAAAAGTGCCTTTGCGTTTTAGGGGCACTGTGGCT
CTTGCTGGGCATTTGGTGCTGTCGAATCACTATCAGATCGCTTCTGCATTCATTTTGACATGGTTTGCCGACTACACTGATTTTCTTGTTCACCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCATCTCACTTGTATTTTTCCATTTCCTTGCTATTTTTGGCAACCATCTGCACTTTCCATCACCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCAA
CGCTGATATTCTTCAGGAGTCAATCGTTCAGCAGGTAAATGAACACCCACTGGCTGGATGGAAAGCAACCATGAATCCACGTTTTTCGAATTATTCTGTTAGCCAATTCA
AGCACCTGCTTGGTGTAAAACAAACTCCTGAAAAGGATTTAAAAAGTACTCCTGTTGTATCCCATCCCAAGTCGTTAAAAGTGCCTTTGCGTTTTAGGGGCACTGTGGCT
CTTGCTGGGCATTTGGTGCTGTCGAATCACTATCAGATCGCTTCTGCATTCATTTTGACATGGTTTGCCGACTACACTGATTTTCTTGTTCACCATTAA
Protein sequenceShow/hide protein sequence
MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKVPLRFRGTVA
LAGHLVLSNHYQIASAFILTWFADYTDFLVHH