; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022133 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022133
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptioncathepsin B-like
Genome locationchr7:19158974..19164448
RNA-Seq ExpressionLag0022133
SyntenyLag0022133
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597714.1 Cathepsin B-like protease 2, partial [Cucurbita argyrosperma subsp. sororia]1.3e-13391.81Show/hide
Query:  KFPHKDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR
        K+PHKDQGHCGSCWAF AVESLSDRFCIH++MNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDT GCSHPGCEPAY TP+CVR
Subjt:  KFPHKDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR

Query:  QCVNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGY
         CV+KNQIWKKSKHYGVNAYRI+ DPYDIMAEVYKNGPVEV FTVYEDFAHYKSGVYK+ITGD +GGHAVKLIGWGT+DDGEDYWLLANQWNRGWGDDGY
Subjt:  QCVNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGY

Query:  FKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        FKI+RGTNECGIEEDVVAGLPSPRNIAREA+I
Subjt:  FKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_008465335.1 PREDICTED: cathepsin B-like isoform X1 [Cucumis melo]7.6e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+DGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]7.6e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+DGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]5.8e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI++DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]5.8e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI++DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein7.0e-13393.83Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGV+AYR++ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS +NIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNJ5 cathepsin B-like isoform X13.7e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+DGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNM3 cathepsin B-like isoform X23.7e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+DGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

A0A5A7U7U4 Cathepsin B-like isoform X23.7e-13495.15Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQIW+K+KHYGVNAYRI+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+DGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

A0A6J1DZC8 cathepsin B-like protease 21.1e-13092.07Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFD TGCSHPGCEP+YPTPRCV++CV+K
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQ+W +SKHYGVNAYR+  D YDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKI+R
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAREAAI
        GTNECGIEEDVVAGLPS RNIA EAAI
Subjt:  GTNECGIEEDVVAGLPSPRNIAREAAI

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 12.4e-11479.91Show/hide
Query:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ
        GHCGSCWAFGAVESLSDRFCI +++N++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+ECDPYFD TGCSHPGCEP YPTP+C R+CV++NQ
Subjt:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ

Query:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT
        +W +SKHYGV AYRI  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGDDGYFKIRRGT
Subjt:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT

Query:  NECGIEEDVVAGLPSPRNI
        NECGIE+ VVAGLPS +N+
Subjt:  NECGIEEDVVAGLPSPRNI

P07858 Cathepsin B3.1e-6951.5Show/hide
Query:  KDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGC
        +DQG CGSCWAFGAVE++SDR CIH + ++++ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C
Subjt:  KDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGC

Query:  EPAYPTPRCVRQC-VNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLA
             TP+C + C    +  +K+ KHYG N+Y + +   DIMAE+YKNGPVE AF+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YWL+A
Subjt:  EPAYPTPRCVRQC-VNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLA

Query:  NQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP
        N WN  WGD+G+FKI RG + CGIE +VVAG+P
Subjt:  NQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP

Q4R5M2 Cathepsin B1.8e-6951.5Show/hide
Query:  KDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGC
        +DQG CGSCWAFGAVE++SDR CIH + ++++ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C
Subjt:  KDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGC

Query:  EPAYPTPRCVRQC-VNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLA
             TP+C + C    +  +K+ KHYG N+Y + +   DIMAE+YKNGPVE AF+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YWL+A
Subjt:  EPAYPTPRCVRQC-VNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLA

Query:  NQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP
        N WN  WGD+G+FKI RG + CGIE +VVAG+P
Subjt:  NQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP

Q93VC9 Cathepsin B-like protease 21.7e-12083.41Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCI ++MN++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV+ 
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQ+W++SKHYGV+AY++RS P DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGDDGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAR
        GTNECGIE  VVAGLPS RN+ +
Subjt:  GTNECGIEEDVVAGLPSPRNIAR

Q94K85 Cathepsin B-like protease 37.4e-11680.27Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV+ 
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        N++W +SKHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF IRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAR
        GTNECGIE++ VAGLPS +N+ R
Subjt:  GTNECGIEEDVVAGLPSPRNIAR

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein1.7e-11579.91Show/hide
Query:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ
        GHCGSCWAFGAVESLSDRFCI +++N++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+ECDPYFD TGCSHPGCEP YPTP+C R+CV++NQ
Subjt:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ

Query:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT
        +W +SKHYGV AYRI  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGDDGYFKIRRGT
Subjt:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT

Query:  NECGIEEDVVAGLPSPRNI
        NECGIE+ VVAGLPS +N+
Subjt:  NECGIEEDVVAGLPSPRNI

AT1G02305.1 Cysteine proteinases superfamily protein1.2e-12183.41Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCI ++MN++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV+ 
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        NQ+W++SKHYGV+AY++RS P DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGDDGYFKIRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAR
        GTNECGIE  VVAGLPS RN+ +
Subjt:  GTNECGIEEDVVAGLPSPRNIAR

AT3G45310.1 Cysteine proteinases superfamily protein5.1e-2735.78Show/hide
Query:  PHKDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEECDPYFDTTGCSHPGCE-PAYPTPRCVR
        P K+QGHCGSCW F    +L   +   F   I+LS   L+ C G     GC GG P  A+ Y     G+ TEE  PY    G    GC+  A      VR
Subjt:  PHKDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEECDPYFDTTGCSHPGCE-PAYPTPRCVR

Query:  QCVNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMG------GHAVKLIGWGTTDDGEDYWLLANQWNRG
          VN     +    + V   R                PV VAF V  +F  YK GV+   T +  G       HAV  +G+G  DD   YWL+ N W   
Subjt:  QCVNKNQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMG------GHAVKLIGWGTTDDGEDYWLLANQWNRG

Query:  WGDDGYFKIRRGTNECGI
        WGD+GYFK+  G N CG+
Subjt:  WGDDGYFKIRRGTNECGI

AT4G01610.1 Cysteine proteinases superfamily protein5.3e-11780.27Show/hide
Query:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK
        DQGHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV+ 
Subjt:  DQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNK

Query:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR
        N++W +SKHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF IRR
Subjt:  NQIWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRR

Query:  GTNECGIEEDVVAGLPSPRNIAR
        GTNECGIE++ VAGLPS +N+ R
Subjt:  GTNECGIEEDVVAGLPSPRNIAR

AT4G01610.2 Cysteine proteinases superfamily protein1.0e-11580.09Show/hide
Query:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ
        GHCGSCWAFGAVESLSDRFCI F MNI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV+ N+
Subjt:  GHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ

Query:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT
        +W +SKHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF IRRGT
Subjt:  IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGT

Query:  NECGIEEDVVAGLPSPRNIAR
        NECGIE++ VAGLPS +N+ R
Subjt:  NECGIEEDVVAGLPSPRNIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGATAAATTTCCTCATAAAGATCAGGGGCACTGTGGTTCTTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGACATGAACAT
TACTCTGTCTGTTAATGATCTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGTCATGGAGTTG
TTACTGAAGAGTGTGATCCATATTTTGACACTACTGGTTGTTCCCACCCTGGTTGTGAACCTGCATATCCGACTCCTAGATGTGTCAGGCAGTGTGTAAATAAGAACCAG
ATTTGGAAAAAATCAAAGCACTATGGTGTTAATGCTTATAGGATTCGAAGCGATCCCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAGGTTGCCTTCAC
GGTGTATGAGGATTTTGCTCACTATAAATCTGGGGTTTACAAATATATTACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATG
GAGAGGATTATTGGCTTTTGGCAAATCAGTGGAACAGAGGCTGGGGTGATGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGCATTGAGGAAGATGTTGTT
GCTGGTTTGCCCTCACCTAGAAATATTGCTAGGGAGGCTGCCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGATAAATTTCCTCATAAAGATCAGGGGCACTGTGGTTCTTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGACATGAACAT
TACTCTGTCTGTTAATGATCTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGTCATGGAGTTG
TTACTGAAGAGTGTGATCCATATTTTGACACTACTGGTTGTTCCCACCCTGGTTGTGAACCTGCATATCCGACTCCTAGATGTGTCAGGCAGTGTGTAAATAAGAACCAG
ATTTGGAAAAAATCAAAGCACTATGGTGTTAATGCTTATAGGATTCGAAGCGATCCCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAGGTTGCCTTCAC
GGTGTATGAGGATTTTGCTCACTATAAATCTGGGGTTTACAAATATATTACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATG
GAGAGGATTATTGGCTTTTGGCAAATCAGTGGAACAGAGGCTGGGGTGATGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGCATTGAGGAAGATGTTGTT
GCTGGTTTGCCCTCACCTAGAAATATTGCTAGGGAGGCTGCCATATGA
Protein sequenceShow/hide protein sequence
MLDKFPHKDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVNKNQ
IWKKSKHYGVNAYRIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV
AGLPSPRNIAREAAI