; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr010509 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr010509
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCathepsin B
Genome locationtig00009637:149170..155668
RNA-Seq ExpressionSgr010509
SyntenySgr010509
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR012599 - Peptidase C1A, propeptide
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus]2.7e-15090.44Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASSH Y S+SLLFLA +CTFHHQVYAEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGV+AYR+K+DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]7.7e-15090.44Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+KKDP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

XP_022159267.1 cathepsin B-like protease 2 [Momordica charantia]1.3e-14991.18Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS  YFS+SLLF A + +FHHQVYAEEQVLKFKLNADILQESIV+QVNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPE+DL+ST VVSHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LP NFDAREAWPQC +IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD+TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEP+YPTPRCV+KCV+KNQLWSRSKHYGVNAYR+ KD YDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]2.2e-14990.84Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASSHLY S+SLLFLA +CTFHH QVYAEEQVL+FK NADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG
        KLPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG

Query:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        CSHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+K DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]9.1e-15191.18Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASSHLY S+SLLFLA +CTFHHQVYAEEQVL+FK NADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+K DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein3.2e-14990.11Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASSH Y S+SLLFLA +CTFHH QVYAEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG
        KLPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG

Query:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        CSHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGV+AYR+K+DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

A0A1S3CNJ5 cathepsin B-like isoform X19.2e-14990.11Show/hide
Query:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL
        MASS LY S+SLLFLA +CTFHH QV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL
Subjt:  MASSHLYFSISLLFLATICTFHH-QVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSL

Query:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG
        +LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTG

Query:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        CSHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+KKDP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  CSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

A0A1S3CNM3 cathepsin B-like isoform X23.7e-15090.44Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+KKDP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

A0A5A7U7U4 Cathepsin B-like isoform X23.7e-15090.44Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS LY S+SLLFLA +CTFHHQV+AEEQVLKFKL+ADILQESIV+ VNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPV+SHPKSL+
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LPK+FDAREAWPQCI+IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEPAYPTPRCVR CV+KNQ+W ++KHYGVNAYR+KKDP DIMAEVYKNGPVEV+FTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

A0A6J1DZC8 cathepsin B-like protease 26.4e-15091.18Show/hide
Query:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK
        MASS  YFS+SLLF A + +FHHQVYAEEQVLKFKLNADILQESIV+QVNEHP AGWKATMNPRFSNYSVSQFK+LLGVKQTPE+DL+ST VVSHPKSLK
Subjt:  MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLK

Query:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC
        LP NFDAREAWPQC +IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD+TGC
Subjt:  LPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGC

Query:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        SHPGCEP+YPTPRCV+KCV+KNQLWSRSKHYGVNAYR+ KD YDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  SHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 18.1e-10261.13Show/hide
Query:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDARE
        ++ +FL    +F+ Q  A E + K KL + ILQ  IV++VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P+V H  SLKLPK FDAR 
Subjt:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDARE

Query:  AWPQC-----ITIGTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C     I +G IL+                GHCGSCWAFGAVESLSDRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQC-----ITIGTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDDTGCSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        +CDPYFD+TGCSHPGCEP YPTP+C RKCV++NQLW  SKHYGV AYR+  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  QCDPYFDDTGCSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

Q4R5M2 Cathepsin B1.4e-5346.03Show/hide
Query:  LQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHL----LGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVE
        L + +V  VN+     W+A  N  F N  VS  K L    LG  + P++       V   + LKLP++FDARE WPQC TI  I DQG CGSCWAFGAVE
Subjt:  LQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHL----LGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDDTGCSH------PGCEPAYPTPRCVRKC-VN
        ++SDR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C   
Subjt:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDDTGCSH------PGCEPAYPTPRCVRKC-VN

Query:  KNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
         +  + + KHYG N+Y +     DIMAE+YKNGPVE AF+VY DF  YKSGV
Subjt:  KNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

Q5R6D1 Cathepsin B3.1e-5346.03Show/hide
Query:  LQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHL----LGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVE
        L + +V  VN+     W+A  N  F N  VS  K L    LG  + P++       V   + LKLP++FDARE WPQC TI  I DQG CGSCWAFGAVE
Subjt:  LQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHL----LGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDDTGCSH------PGCEPAYPTPRCVRKC-VN
        ++SDR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C   
Subjt:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDDTGCSH------PGCEPAYPTPRCVRKC-VN

Query:  KNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
         +  + + KHYG N+Y +     DIMAE+YKNGPVE AF+VY DF  YKSGV
Subjt:  KNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

Q93VC9 Cathepsin B-like protease 22.3e-11269.37Show/hide
Query:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKL
        S+ ++F + LL    I +F+  Q  A E + K KL + ILQ  IV++VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P+VSH  SLKL
Subjt:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKL

Query:  PKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCS
        PK FDAR AW QC +IG ILDQGHCGSCWAFGAVESLSDRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD+TGCS
Subjt:  PKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCS

Query:  HPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        HPGCEPAYPTP+C RKCV+ NQLW  SKHYGV+AY+++  P DIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  HPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

Q94K85 Cathepsin B-like protease 39.3e-11465.44Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLKLPK FDAR AWP
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP

Query:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP
        QC +IG ILDQGHCGSCWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD+TGCSHPGCEPAYPTP
Subjt:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP

Query:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG
        +C RKCV+ N+LWS SKHY V+ Y +K +P DIMAEVYKNGPVEV+FTVYEDFAHYKSGV      +NI    + ++ W    E   YW+  N + RG
Subjt:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein5.8e-10361.13Show/hide
Query:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDARE
        ++ +FL    +F+ Q  A E + K KL + ILQ  IV++VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P+V H  SLKLPK FDAR 
Subjt:  ISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDARE

Query:  AWPQC-----ITIGTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C     I +G IL+                GHCGSCWAFGAVESLSDRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQC-----ITIGTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDDTGCSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        +CDPYFD+TGCSHPGCEP YPTP+C RKCV++NQLW  SKHYGV AYR+  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  QCDPYFDDTGCSHPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

AT1G02305.1 Cysteine proteinases superfamily protein1.6e-11369.37Show/hide
Query:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKL
        S+ ++F + LL    I +F+  Q  A E + K KL + ILQ  IV++VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P+VSH  SLKL
Subjt:  SSHLYFSISLLFLATICTFH-HQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKL

Query:  PKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCS
        PK FDAR AW QC +IG ILDQGHCGSCWAFGAVESLSDRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD+TGCS
Subjt:  PKNFDAREAWPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCS

Query:  HPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
        HPGCEPAYPTP+C RKCV+ NQLW  SKHYGV+AY+++  P DIMAEVYKNGPVEVAFTVYEDFAHYKSGV
Subjt:  HPGCEPAYPTPRCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

AT3G45310.1 Cysteine proteinases superfamily protein7.8e-1527.71Show/hide
Query:  EQVLKFKLNADILQES--IVQQVNEHPLAGWKATMNPRFSNYSVSQF-KHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGH
        + V + KL   + +E+  +++  N+  L+ +K ++N +F++ +  +F ++ LG  Q     LK +  ++      +P   D    W +   +  + +QGH
Subjt:  EQVLKFKLNADILQES--IVQQVNEHPLAGWKATMNPRFSNYSVSQF-KHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWPQCITIGTILDQGH

Query:  CGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDDTGCSHPGCE-PAYPTPRCVRKCVNKNQ
        CGSCW F    +L   +   F   ISLS   L+ C G     GC GG P  A+ Y     G+ TE+  PY    G    GC+  A      VR  VN   
Subjt:  CGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDDTGCSHPGCE-PAYPTPRCVRKCVNKNQ

Query:  LWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV
               + V   R                PV VAF V  +F  YK GV
Subjt:  LWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV

AT4G01610.1 Cysteine proteinases superfamily protein6.6e-11565.44Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLKLPK FDAR AWP
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP

Query:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP
        QC +IG ILDQGHCGSCWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD+TGCSHPGCEPAYPTP
Subjt:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP

Query:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG
        +C RKCV+ N+LWS SKHY V+ Y +K +P DIMAEVYKNGPVEV+FTVYEDFAHYKSGV      +NI    + ++ W    E   YW+  N + RG
Subjt:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG

AT4G01610.2 Cysteine proteinases superfamily protein2.8e-11364.77Show/hide
Query:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP
        L L  +  F  +    E + K KL++ ILQ+ IV++VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P+VSH  SLKLPK FDAR AWP
Subjt:  LFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREAWP

Query:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP
        QC +IG IL  GHCGSCWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD+TGCSHPGCEPAYPTP
Subjt:  QCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTP

Query:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG
        +C RKCV+ N+LWS SKHY V+ Y +K +P DIMAEVYKNGPVEV+FTVYEDFAHYKSGV      +NI    + ++ W    E   YW+  N + RG
Subjt:  RCVRKCVNKNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGV------TNI----LLVMKWEACCEA--YWMGNNGFWRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCATCTCACTTGTATTTTTCCATTTCCTTGCTATTTTTGGCAACCATCTGCACTTTCCATCACCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCAA
CGCTGATATTCTTCAGGAGTCAATCGTTCAGCAGGTAAATGAACACCCACTGGCTGGATGGAAAGCAACCATGAATCCACGTTTTTCGAATTATTCTGTTAGCCAATTCA
AGCACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGATTTAAAAAGTACTCCTGTTGTATCCCATCCCAAGTCGTTAAAGTTGCCAAAAAATTTTGATGCAAGAGAAGCT
TGGCCTCAGTGTATCACCATTGGAACCATTCTAGATCAGGGGCACTGTGGCTCTTGCTGGGCATTTGGTGCTGTCGAATCACTATCAGATCGCTTCTGCATTCATTTTGA
CATGAACATTTCTCTGTCTGTTAATGATCTCTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGCC
ATGGAGTTGTTACTGAACAGTGTGACCCATATTTTGACGATACTGGTTGCTCCCACCCTGGTTGTGAACCTGCATATCCTACTCCTAGATGTGTCAGGAAGTGTGTAAAT
AAAAACCAGCTTTGGAGTAGATCAAAGCACTATGGTGTTAATGCTTATAGGATGAAGAAGGATCCTTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAGGT
TGCCTTCACGGTGTATGAGGACTTTGCTCACTATAAATCTGGGGTTACAAACATATTACTGGTGATGAAATGGGAGGCATGCTGTGAAGCTTATTGGATGGGGAACAACG
GATTCTGGAGAGGATTATTGGCTTTTGGCAAATCAGTGGAACAGAAGCTGGGGCGAT
mRNA sequenceShow/hide mRNA sequence
ATGGCATCATCTCACTTGTATTTTTCCATTTCCTTGCTATTTTTGGCAACCATCTGCACTTTCCATCACCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCAA
CGCTGATATTCTTCAGGAGTCAATCGTTCAGCAGGTAAATGAACACCCACTGGCTGGATGGAAAGCAACCATGAATCCACGTTTTTCGAATTATTCTGTTAGCCAATTCA
AGCACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGATTTAAAAAGTACTCCTGTTGTATCCCATCCCAAGTCGTTAAAGTTGCCAAAAAATTTTGATGCAAGAGAAGCT
TGGCCTCAGTGTATCACCATTGGAACCATTCTAGATCAGGGGCACTGTGGCTCTTGCTGGGCATTTGGTGCTGTCGAATCACTATCAGATCGCTTCTGCATTCATTTTGA
CATGAACATTTCTCTGTCTGTTAATGATCTCTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGCC
ATGGAGTTGTTACTGAACAGTGTGACCCATATTTTGACGATACTGGTTGCTCCCACCCTGGTTGTGAACCTGCATATCCTACTCCTAGATGTGTCAGGAAGTGTGTAAAT
AAAAACCAGCTTTGGAGTAGATCAAAGCACTATGGTGTTAATGCTTATAGGATGAAGAAGGATCCTTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAGGT
TGCCTTCACGGTGTATGAGGACTTTGCTCACTATAAATCTGGGGTTACAAACATATTACTGGTGATGAAATGGGAGGCATGCTGTGAAGCTTATTGGATGGGGAACAACG
GATTCTGGAGAGGATTATTGGCTTTTGGCAAATCAGTGGAACAGAAGCTGGGGCGAT
Protein sequenceShow/hide protein sequence
MASSHLYFSISLLFLATICTFHHQVYAEEQVLKFKLNADILQESIVQQVNEHPLAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVVSHPKSLKLPKNFDAREA
WPQCITIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDDTGCSHPGCEPAYPTPRCVRKCVN
KNQLWSRSKHYGVNAYRMKKDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVTNILLVMKWEACCEAYWMGNNGFWRGLLAFGKSVEQKLGRX