CuGenDBv2

IVF0026063 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0026063
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Cathepsin B
Genome location: chr04:24682628..24688736
RNA-Seq Expression: IVF0026063
Synteny: IVF0026063
Gene Ontology terms:
GO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domains:
IPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR012599 - Peptidase C1A, propeptide
IPR038765 - Papain-like cysteine peptidase superfamily
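
The genome location above is given as a chromosome name followed by a start..end range. As a minimal sketch, the snippet below splits such a string and reports the length of the span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr04:24682628..24688736")
print(chrom, span_length(start, end))
```

The span covers the whole gene locus, so it is expected to be longer than the mRNA and CDS sequences listed further down the page.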


Homology
GenBank top hits (e value, %identity, alignment)
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus] | e value: 4.47e-174 | %identity: 95.47
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS  YLSLSLLFLAAVCTFHHQ V+AEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +KR P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

XP_008465335.1 PREDICTED: cathepsin B-like isoform X1 [Cucumis melo] | e value: 6.22e-179 | %identity: 97.12
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo] | e value: 2.33e-176 | %identity: 96.71
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSQLYLSLSLLFLAAVCTFHHQ VHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

XP_011652326.1 cathepsin B-like protease 2 isoform X1 [Cucumis sativus] | e value: 1.20e-176 | %identity: 95.88
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS  YLSLSLLFLAAVCTFHHQQV+AEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +KR P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida] | e value: 7.66e-173 | %identity: 93.83
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAAVCTFHHQQV+AEEQVL+FK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K  P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

TrEMBL top hits (e value, %identity, alignment)
A0A0A0LFN4 Pept_C1 domain-containing protein | e value: 2.0e-139 | %identity: 95.88
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS  YLSLSLLFLAAVCTFHHQQV+AEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +KR P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

A0A1S3CNJ5 cathepsin B-like isoform X1 | e value: 3.7e-141 | %identity: 97.12
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

A0A1S3CNM3 cathepsin B-like isoform X2 | e value: 3.5e-139 | %identity: 96.71
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSQLYLSLSLLFLAAVCTFHH QVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

A0A5A7U7U4 Cathepsin B-like isoform X2 | e value: 3.5e-139 | %identity: 96.71
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSQLYLSLSLLFLAAVCTFHH QVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

A0A6J1F2K3 cathepsin B-like protease 2 | e value: 4.4e-126 | %identity: 86.01
Query:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS  + SLSLLFL A C  HH QV+AEEQVLKFKL+ADILQESIVRHVNEHP AGWKA MNP FSNYSVSQFK++LGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCI+IGTILDQGHCGSCWAF AVESLSDRFCIH++MNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDT G
Subjt:  RLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        CSHPGCEPAY TP+CVRHCVDKNQIWRK+KHYGV    +K+ P
Subjt:  CSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

SwissProt top hits (e value, %identity, alignment)
F4HVZ1 Cathepsin B-like protease 1 | e value: 9.6e-78 | %identity: 55.1
Query:  LSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAR
        L+ +FL    +F+ Q + A E + K KL + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SL+LPK FDAR
Subjt:  LSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAR

Query:  EAWPQCISI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT
         AW  C SI     G IL+                GHCGSCWAFGAVESLSDRFCI +++N++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT
Subjt:  EAWPQCISI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT

Query:  EQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV
        ++CDPYFD TGCSHPGCEP YPTP+C R CV +NQ+W ++KHYGV
Subjt:  EQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV

Q4R5M2 Cathepsin B | e value: 5.7e-38 | %identity: 41.51
Query:  LQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVE
        L + +V +VN+     W+A  N  F N  VS  K L    LG  + P++       +   + L+LP+SFDARE WPQC +I  I DQG CGSCWAFGAVE
Subjt:  LQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPRCVRHC-VD
        ++SDR CIH + ++++ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C   
Subjt:  SLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPRCVRHC-VD

Query:  KNQIWRKTKHYG
         +  +++ KHYG
Subjt:  KNQIWRKTKHYG

Q5R6D1 Cathepsin B | e value: 9.7e-38 | %identity: 41.51
Query:  LQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVE
        L + +V +VN+     W+A  N  F N  VS  K L    LG  + P++       +   + L+LP+SFDARE WPQC +I  I DQG CGSCWAFGAVE
Subjt:  LQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPRCVRHC-VD
        ++SDR CIH + ++++ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C   
Subjt:  SLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPRCVRHC-VD

Query:  KNQIWRKTKHYG
         +  +++ KHYG
Subjt:  KNQIWRKTKHYG

Q93VC9 Cathepsin B-like protease 2 | e value: 1.2e-91 | %identity: 63.07
Query:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL
        S+ ++  L LL    + +F+  Q  A E + K KL + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SL+L
Subjt:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL

Query:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK FDAR AW QC SIG ILDQGHCGSCWAFGAVESLSDRFCI ++MN++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        HPGCEPAYPTP+C R CV  NQ+WR++KHYGV    ++  P
Subjt:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

Q94K85 Cathepsin B-like protease 3 | e value: 1.4e-92 | %identity: 64.32
Query:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL
        +++L L+   L L  +  F  + + A E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL+L
Subjt:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL

Query:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK+FDAR AWPQC SIG ILDQGHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        HPGCEPAYPTP+C R CV  N++W ++KHY V    +K  P
Subjt:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

Arabidopsis top hits (e value, %identity, alignment)
AT1G02300.1 Cysteine proteinases superfamily protein | e value: 6.8e-79 | %identity: 55.1
Query:  LSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAR
        L+ +FL    +F+ Q + A E + K KL + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SL+LPK FDAR
Subjt:  LSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAR

Query:  EAWPQCISI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT
         AW  C SI     G IL+                GHCGSCWAFGAVESLSDRFCI +++N++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT
Subjt:  EAWPQCISI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT

Query:  EQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV
        ++CDPYFD TGCSHPGCEP YPTP+C R CV +NQ+W ++KHYGV
Subjt:  EQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGV

AT1G02305.1 Cysteine proteinases superfamily protein | e value: 8.3e-93 | %identity: 63.07
Query:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL
        S+ ++  L LL    + +F+  Q  A E + K KL + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SL+L
Subjt:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL

Query:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK FDAR AW QC SIG ILDQGHCGSCWAFGAVESLSDRFCI ++MN++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        HPGCEPAYPTP+C R CV  NQ+WR++KHYGV    ++  P
Subjt:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

AT3G45310.1 Cysteine proteinases superfamily protein | e value: 5.0e-13 | %identity: 27.75
Query:  FHHQQVHAEEQVLKFKLDADILQES--IVRHVNEHPQAGWKATMNPRFSNYSVSQF-KYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCIS
        F H+     + V + KL   + +E+  ++R  N+     +K ++N +F++ +  +F +Y LG  Q     LK +  ++      +P + D    W +   
Subjt:  FHHQQVHAEEQVLKFKLDADILQES--IVRHVNEHPQAGWKATMNPRFSNYSVSQF-KYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDAREAWPQCIS

Query:  IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDTTGCSHPGCE
        +  + +QGHCGSCW F    +L   +   F   I+LS   L+ C G     GC GG P  A+ Y     G+ TE+  PY    G    GC+
Subjt:  IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDTTGCSHPGCE

AT4G01610.1 Cysteine proteinases superfamily protein | e value: 9.8e-94 | %identity: 64.32
Query:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL
        +++L L+   L L  +  F  + + A E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL+L
Subjt:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL

Query:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK+FDAR AWPQC SIG ILDQGHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        HPGCEPAYPTP+C R CV  N++W ++KHY V    +K  P
Subjt:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP

AT4G01610.2 Cysteine proteinases superfamily protein | e value: 4.1e-92 | %identity: 63.49
Query:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL
        +++L L+   L L  +  F  + + A E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL+L
Subjt:  SSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRL

Query:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK+FDAR AWPQC SIG IL  GHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
        HPGCEPAYPTP+C R CV  N++W ++KHY V    +K  P
Subjt:  HPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVMLIGLKRIP
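
In the alignment blocks above, the unlabelled line between Query and Subjt appears to follow the usual BLAST midline convention: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. Under that assumption, a rough percent identity for any displayed block can be computed as below. The function name `percent_identity` is hypothetical, and the result for a displayed fragment may differ from the %identity reported for the hit, which is presumably calculated over the full alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline shows an identical residue."""
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often stripped when a page is copied, so pad them back.
    midline = midline.ljust(len(query))
    # Letters mark identities; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First eight columns of the first GenBank hit shown above.
print(percent_identity("MASSQLYL", "MASS  YL"))
```

Applied to the first eight columns of the XP_004141146.1 alignment, six of eight positions are identical.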


Sequences
CDS sequence
ATGGCATCATCTCAGTTATATTTGTCCCTTTCCTTGCTATTTTTGGCTGCCGTCTGTACCTTCCATCATCAGCAGGTCCATGCAGAGGAACAAGTTCTAAAGTTCAAACT
CGATGCTGATATTCTTCAGGAGTCTATCGTTCGGCATGTAAACGAACACCCACAGGCTGGCTGGAAAGCTACCATGAACCCTCGTTTTTCGAACTATTCTGTTAGCCAAT
TCAAGTACCTGCTTGGCGTCAAACAAACTCCTGAAAAGGATTTGAAAAGTACCCCTGTTTTATCCCATCCCAAGTCTTTACGGTTGCCAAAAAGCTTTGATGCAAGAGAA
GCTTGGCCTCAGTGTATCAGCATTGGAACCATTCTAGATCAGGGGCACTGTGGTTCTTGCTGGGCATTTGGTGCTGTTGAGTCACTTTCAGATCGATTCTGCATTCATTT
TGACATGAACATTACTTTGTCTGTTAATGACCTTTTGGCATGCTGTGGCTTCATGTGTGGTGATGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTC
GTCATGGAGTTGTTACTGAACAGTGTGATCCATATTTTGACACTACTGGTTGTTCTCACCCTGGCTGTGAACCTGCATATCCTACTCCTAGATGTGTCAGGCATTGTGTA
GATAAGAACCAGATTTGGAGAAAAACAAAGCACTATGGTGTTATGCTTATAGGATTAAAAAGGATCCCAATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTCGAG
GTTTCCTTCACAGTGTATGAGGATTTTGCTCACTATAAATCTGGCGTTTACAAATATATTACCGGCGATGTAA
mRNA sequence
GCTATTACGAAGATAATTGTGAAGCCCGAGAAAGTGAATTTCTGTTGTTCGCCGATGGCAGCAGCATGGCGTGCGAACATGGATGACGTTAATTCGGACGTTGAATCAAA
GGGAACGTCTGGATTGGTCCTTTCTCCCAAATCTTAACACTCCATTTTTTATACGGAACTCATCTCTTCCATTTTCCCCACCCACTCTCTCTTCTTCCTCCATTTTCATC
GCCTTTGTTTCTGTTTCTCTTTCTGATTTCGCCCTTCCAAAATAGCAAGGAAATGGCATCATCTCAGTTATATTTGTCCCTTTCCTTGCTATTTTTGGCTGCCGTCTGTA
CCTTCCATCATCAGCAGGTCCATGCAGAGGAACAAGTTCTAAAGTTCAAACTCGATGCTGATATTCTTCAGGAGTCTATCGTTCGGCATGTAAACGAACACCCACAGGCT
GGCTGGAAAGCTACCATGAACCCTCGTTTTTCGAACTATTCTGTTAGCCAATTCAAGTACCTGCTTGGCGTCAAACAAACTCCTGAAAAGGATTTGAAAAGTACCCCTGT
TTTATCCCATCCCAAGTCTTTACGGTTGCCAAAAAGCTTTGATGCAAGAGAAGCTTGGCCTCAGTGTATCAGCATTGGAACCATTCTAGATCAGGGGCACTGTGGTTCTT
GCTGGGCATTTGGTGCTGTTGAGTCACTTTCAGATCGATTCTGCATTCATTTTGACATGAACATTACTTTGTCTGTTAATGACCTTTTGGCATGCTGTGGCTTCATGTGT
GGTGATGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGTCATGGAGTTGTTACTGAACAGTGTGATCCATATTTTGACACTACTGGTTGTTCTCA
CCCTGGCTGTGAACCTGCATATCCTACTCCTAGATGTGTCAGGCATTGTGTAGATAAGAACCAGATTTGGAGAAAAACAAAGCACTATGGTGTTATGCTTATAGGATTAA
AAAGGATCCCAATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTCGAGGTTTCCTTCACAGTGTATGAGGATTTTGCTCACTATAAATCTGGCGTTTACAAATATA
TTACCGGCGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATGGAGAAGATTATTGGCTTTTGGCGAATCAGTGGAACAGAGGCTGGGGC
GAGGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGGATTGAGGAAGATGTTGTTGCTGGTTTGCCCTCAACAAGAAATATTGCCAGGGAGGCTGCCATATG
ATCCAGATGCTGCTGTTTCACAATCAAGCTTTGCTCAACCGAGGATATGTTTTATGTGTCCTGCATTTGTATTAGAACTATTTATGCATATGAAGTTGGTTAATGCTTTG
CTGAATGTTTCCAAGTATCTGTGTTTTAAAAAATTTGTAGGATCTCTGAAAGGAGATTCTATAATCAAATAATGTGAATATTCTTGGAGAACGATGGTCCTTATAATATT
TACCTTTCCAGTTTTTTTCTTTTG
Protein sequence
MASSQLYLSLSLLFLAAVCTFHHQQVHAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLRLPKSFDARE
AWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCV
DKNQIWRKTKHYGVMLIGLKRIPMISWQKFIRMDQSRFPSQCMRILLTINLAFTNILPAM
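
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper, and unknown or ambiguous codons are reported as "X".

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS sequence listed above.
print(translate("ATGGCATCATCTCAGTTATATTTG"))
```

Running the full CDS through `translate` and comparing the result with the protein sequence is a quick consistency check on the record.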