; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028483 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028483
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCysteine protease
Genome locationtig00153204:1464592..1472134
RNA-Seq ExpressionSgr028483
SyntenySgr028483
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585563.1 Cysteine protease RD19A, partial [Cucurbita argyrosperma subsp. sororia]9.2e-9962.94Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------
        MERLTVIA  F +LLS TV YG SS++     DD++LIRQVVSGADDRLLTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D      
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------

Query:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---
                                P DA  A ILPTD+L SDFDWRDHGAV PV+NQGSC SCWSFSAVGALEGA+FL+ G+L+SLS+QQLVDCDHE   
Subjt:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---

Query:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC
                    GLM  AF YI K GG+EREEDYPY G D+GPCKFQN+K+AASV+N+SVIS +ADQIAANL+K+GPLAI IN+A+M TY  GVSCP IC
Subjt:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC

Query:  SE-SIDHGVLLVG
        S+ +++HGVLLVG
Subjt:  SE-SIDHGVLLVG

XP_022144382.1 cysteine protease RD19A-like [Momordica charantia]1.4e-9964.52Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------
        ME  TVIA FF +LLS TVAYG+SS++ D D++LIRQVVSGADDRLLTAEQ FENFKLKFGK Y S EEHDYRFRVF+ANLR A R+QK+D         
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------

Query:  ---------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------
                             P DA  A ILPT++LPSDFDWRD GAV PV++QGSC SCWSFSAVGA+EGA+FL+ G L+SLS+QQLVDCDHE      
Subjt:  ---------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------

Query:  --------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-
                 GLM  AF YI K GG+EREEDYPY GTD+GPCKFQN KVAASVAN+SVIS +A+QIAANLVK+GPLAIAIN+ +M TY GGVSCP ICS+ 
Subjt:  --------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-

Query:  SIDHGVLLVG
        +++HGVLLVG
Subjt:  SIDHGVLLVG

XP_022951324.1 cysteine protease RD19A-like [Cucurbita moschata]1.8e-9963.26Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------
        MERLTVIA  F +LLS TVAYG SS++     DD++LIRQVVSGADDRLLTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D      
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------

Query:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---
                                P DA  A ILPTD+L SDFDWRDHGAV PV+NQGSC SCWSFSAVGALEGA+FL+ G+L+SLS+QQLVDCDHE   
Subjt:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---

Query:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC
                    GLM  AF YI K GG+EREEDYPY G D+GPCKFQN+K+AASV+N+SVIS +ADQIAANL+K+GPLAI IN+A+M TY  GVSCP IC
Subjt:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC

Query:  SE-SIDHGVLLVG
        S+ +++HGVLLVG
Subjt:  SE-SIDHGVLLVG

XP_023002391.1 cysteine protease RD19A-like [Cucurbita maxima]1.1e-9963.26Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------
        MERLTVIA  F +LLS TVAYG+SS++     DD++LIRQVVSG DDRLLTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D      
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------

Query:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---
                                P DA  A ILPTD+L SDFDWRDHGAV PV+NQGSC SCWSFSAVGALEGA+FL+ G+L+SLS+QQLVDCDHE   
Subjt:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---

Query:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC
                    GLMT AF YI K GG+EREEDYPY G D+GPCKFQN+K+AASV+N+SVIS +ADQIAANL+K+GPLAI IN+A+M TY  GVSCP IC
Subjt:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC

Query:  SE-SIDHGVLLVG
        S+ +++HGVLLVG
Subjt:  SE-SIDHGVLLVG

XP_038884968.1 cysteine protease RD19A-like [Benincasa hispida]7.0e-9962.22Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----
        M+RL V A FFT+LLS TV YG+SS++      +D+++LIRQVVSGADDR LTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D    
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----

Query:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-
                                  P DA  A ILPTDDL  DFDWRD GAV PV++QGSC SCWSFSAVGA+EGA+FL+ G+L+SLS+QQLVDCDHE 
Subjt:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-

Query:  -------------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPV
                      GLM  AF+YI K GG+E+EEDYPY GTD+GPCKFQNSKVAASVAN+SVISN+ADQIAANLVK+GPLAI IN+ +M TY  GVSCP 
Subjt:  -------------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPV

Query:  ICSE-SIDHGVLLVG
        ICS+ ++DHGVLLVG
Subjt:  ICSE-SIDHGVLLVG

TrEMBL top hitse value%identityAlignment
A0A0A0LPD9 Cysteine protease5.4e-9761.27Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----
        MER   I  FF +LLS TVAYG+SS++      +++++LIRQVVSGADDR LTAEQ F++FKLKFGK Y + EEHDYRFRVFKANLR A+R+QK+D    
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----

Query:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-
                                  P DA  A ILPTD+L SDFDWRD GAV PV++QGSC SCWSFSAVGALEGA+FL+ G+LISLS+QQLVDCDHE 
Subjt:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-

Query:  -------------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPV
                      GLMT AF YI K GG+EREEDYPY GTD+G CKFQN K+AAS AN+SVISN+ADQIAANLVK+GPLAI IN+ +M TY  G+SCP 
Subjt:  -------------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPV

Query:  ICSE-SIDHGVLLVG
        ICS+ ++DHGVLLVG
Subjt:  ICSE-SIDHGVLLVG

A0A5A7VA47 Cysteine proteinase RD19a-like1.9e-9762.7Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----
        M R   IA FF +LLS TVAYG+ S++      + +++LIRQVVSGADDRLLTAEQ F+NFKLKFGK Y + EEHDYRFRVFKANLR A+R+QK+D    
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE------NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----

Query:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-
                                  P DA  A ILPTD+L SDFDWRD GAV PV++QGSC SCWSFSAVGALEGA+FL+ G+LISLS+QQLVDCDHE 
Subjt:  --------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-

Query:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE
                  GLMT AF YI K GG+EREEDYPY GTD+G CKFQN K+AAS AN+SVISN+ADQIAANLVK+GPLAI IN+ +M TY  GVSCP ICS+
Subjt:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE

Query:  -SIDHGVLLVG
         ++DHGVLLVG
Subjt:  -SIDHGVLLVG

A0A6J1CTJ1 cysteine protease RD19A-like6.9e-10064.52Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------
        ME  TVIA FF +LLS TVAYG+SS++ D D++LIRQVVSGADDRLLTAEQ FENFKLKFGK Y S EEHDYRFRVF+ANLR A R+QK+D         
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------

Query:  ---------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------
                             P DA  A ILPT++LPSDFDWRD GAV PV++QGSC SCWSFSAVGA+EGA+FL+ G L+SLS+QQLVDCDHE      
Subjt:  ---------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------

Query:  --------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-
                 GLM  AF YI K GG+EREEDYPY GTD+GPCKFQN KVAASVAN+SVIS +A+QIAANLVK+GPLAIAIN+ +M TY GGVSCP ICS+ 
Subjt:  --------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-

Query:  SIDHGVLLVG
        +++HGVLLVG
Subjt:  SIDHGVLLVG

A0A6J1GIG1 cysteine protease RD19A-like9.0e-10063.26Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------
        MERLTVIA  F +LLS TVAYG SS++     DD++LIRQVVSGADDRLLTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D      
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------

Query:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---
                                P DA  A ILPTD+L SDFDWRDHGAV PV+NQGSC SCWSFSAVGALEGA+FL+ G+L+SLS+QQLVDCDHE   
Subjt:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---

Query:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC
                    GLM  AF YI K GG+EREEDYPY G D+GPCKFQN+K+AASV+N+SVIS +ADQIAANL+K+GPLAI IN+A+M TY  GVSCP IC
Subjt:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC

Query:  SE-SIDHGVLLVG
        S+ +++HGVLLVG
Subjt:  SE-SIDHGVLLVG

A0A6J1KL64 cysteine protease RD19A-like5.2e-10063.26Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------
        MERLTVIA  F +LLS TVAYG+SS++     DD++LIRQVVSG DDRLLTAEQ F+NFKLKFGK Y + EEHDYRFR+FKANLR A+R+QK+D      
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNE----NDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------

Query:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---
                                P DA  A ILPTD+L SDFDWRDHGAV PV+NQGSC SCWSFSAVGALEGA+FL+ G+L+SLS+QQLVDCDHE   
Subjt:  ------------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE---

Query:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC
                    GLMT AF YI K GG+EREEDYPY G D+GPCKFQN+K+AASV+N+SVIS +ADQIAANL+K+GPLAI IN+A+M TY  GVSCP IC
Subjt:  -----------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVIC

Query:  SE-SIDHGVLLVG
        S+ +++HGVLLVG
Subjt:  SE-SIDHGVLLVG

SwissProt top hitse value%identityAlignment
P25804 Cysteine proteinase 15A1.5e-7550.33Show/hide
Query:  FFFTVLLSTTVAYGISSNENDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------------------
        F F + L   VA  ++ + N+D+ +IRQVV   +D LL AE  F +FK KF K Y +KEEHDYRF VFK+NL  A+ +Q  D                  
Subjt:  FFFTVLLSTTVAYGISSNENDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD------------------

Query:  -------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDH--------------E
                     P  A  A ILPT +LP DFDWR+ GAV PV++QGSC SCW+FS  GALEGAH+LA G+L+SLS+QQLVDCDH               
Subjt:  -------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDH--------------E

Query:  RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSES-IDHGVLL
         GLM  AF Y+ + GG+ +E+DY Y G D G CKF  SKV ASV+N+SV++ + DQIAANLVK+GPLA+AIN+A+M TY  GVSCP +C++S +DHGVLL
Subjt:  RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSES-IDHGVLL

Query:  VG
        VG
Subjt:  VG

P43295 Probable cysteine protease RD19B3.4e-8056.58Show/hide
Query:  DDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD-------------------------------PVDAPHA
        D++VLIRQVV   + ++L++E  F  FK KFGK Y S EEH YRF VFKANL  A R+QKMD                               P DA  A
Subjt:  DDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD-------------------------------PVDAPHA

Query:  SILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYISKVGGIERE
         ILPT +LP +FDWRD GAV PV+NQGSC SCWSFS  GALEGAHFLA G+L+SLS+QQLVDCDHE               GLM  AF Y  K GG+ RE
Subjt:  SILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYISKVGGIERE

Query:  EDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGVLLVG
        +DYPY GTD G CK   SK+ ASV+N+SV+S N DQIAANL+K+GPLA+AIN+AYM TY GGVSCP ICS  ++HGVLLVG
Subjt:  EDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGVLLVG

P43296 Cysteine protease RD19A7.3e-8354.52Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------
        M+RL +  +F   +LS  +    SS+ ND D+++IRQVV GA+ ++LT+E  F  FK KFGK Y S EEHDYRF VFKANLR A R+QK+D         
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------

Query:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----
                              P DA  A ILPT++LP DFDWRDHGAV PV+NQGSC SCWSFSA GALEGA+FLA G+L+SLS+QQLVDCDHE     
Subjt:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----

Query:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE
                  GLM  AF Y  K GG+ +EEDYPY G D   CK   SK+ ASV+N+SVIS + +QIAANLVK+GPLA+AIN+ YM TY GGVSCP IC+ 
Subjt:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE

Query:  SIDHGVLLVG
         ++HGVLLVG
Subjt:  SIDHGVLLVG

Q10716 Cysteine proteinase 16.2e-7450.99Show/hide
Query:  VLLSTTVAYGISSNENDDEVLIRQVVSGADDR--LLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMDP-------------------
        +LLS   A  +++  + ++ LIRQVV G DD    L AE  F +F  +FGK Y+  +EH YR  VFK NLR A R+Q +DP                   
Subjt:  VLLSTTVAYGISSNENDDEVLIRQVVSGADDR--LLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMDP-------------------

Query:  -----------------VDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----------
                           A  A +LPTD LP DFDWRDHGAV PV+NQGSC SCWSFSA GALEGAH+LA G+L  LS+QQ VDCDHE           
Subjt:  -----------------VDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----------

Query:  ---RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGV
            GLMT AF Y+ K GG+E E+DYPY G+D G CKF  SK+ ASV N+SV+S +  QI+ANL+K GPLAI IN+AYM TY GGVSCP IC   +DHGV
Subjt:  ---RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGV

Query:  LLVG
        LLVG
Subjt:  LLVG

Q9SUL1 Probable cysteine protease RD19C1.7e-7651.94Show/hide
Query:  IAFFFTV---LLSTTVAYGISSNENDDEVL--IRQVV-SGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----------
        + FFF +   LL+ ++   + S E  D  +  IRQVV    D++LL AE  F  FK K+ K Y ++ EHD+RFRVFKANLR A RNQ +D          
Subjt:  IAFFFTV---LLSTTVAYGISSNENDDEVL--IRQVV-SGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----------

Query:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----
                              P D   A ILPT DLP++FDWR+ GAV PV+NQG C SCWSFSA+GALEGAHFLA  EL+SLS+QQLVDCDHE     
Subjt:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----

Query:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE
                  GLM  AF Y  K GG+ +EEDYPY G D   CKF  SK+ ASV+N+SV+S++ DQIAANLV+ GPLAIAIN+ +M TY GGVSCP +CS+
Subjt:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE

Query:  SIDHGVLLVG
        S DHGVLLVG
Subjt:  SIDHGVLLVG

Arabidopsis top hitse value%identityAlignment
AT2G21430.1 Papain family cysteine protease2.4e-8156.58Show/hide
Query:  DDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD-------------------------------PVDAPHA
        D++VLIRQVV   + ++L++E  F  FK KFGK Y S EEH YRF VFKANL  A R+QKMD                               P DA  A
Subjt:  DDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD-------------------------------PVDAPHA

Query:  SILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYISKVGGIERE
         ILPT +LP +FDWRD GAV PV+NQGSC SCWSFS  GALEGAHFLA G+L+SLS+QQLVDCDHE               GLM  AF Y  K GG+ RE
Subjt:  SILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYISKVGGIERE

Query:  EDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGVLLVG
        +DYPY GTD G CK   SK+ ASV+N+SV+S N DQIAANL+K+GPLA+AIN+AYM TY GGVSCP ICS  ++HGVLLVG
Subjt:  EDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGVLLVG

AT3G19390.1 Granulin repeat cysteine protease family protein7.1e-3334.33Show/hide
Query:  AEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERN-----------------------------QKMD----PVDAPHASILPTDDLPSDFDWRDH
        A + +E + ++  K Y    E + RF +FK NL+  E +                              KM+    PV          D LP   DWR  
Subjt:  AEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERN-----------------------------QKMD----PVDAPHASILPTDDLPSDFDWRDH

Query:  GAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKF--QNSKVAAS
        GAV PV++QGSC SCW+FSA+GA+EG + +  GELISLS+Q+LVDCD         GLM  AF++I + GGI+ EEDYPYI TD   C    +N++V  +
Subjt:  GAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKF--QNSKVAAS

Query:  VANYSVISNNADQIAANLVKSGPLAIAINSA--YMMTYKGGVSCPVICSESIDHGVLLVGTLGEQNEE
        +  Y  +  N ++     + + P+++AI +       Y  GV     C  S+DHGV+ VG   E  ++
Subjt:  VANYSVISNNADQIAANLVKSGPLAIAINSA--YMMTYKGGVSCPVICSESIDHGVLLVGTLGEQNEE

AT3G54940.2 Papain family cysteine protease9.6e-5441.24Show/hide
Query:  DEVLIRQVVSGADDRLLT-------AEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMDPVDAPH--------------------------
        +++ IRQV   AD+R +         E +F  F   +GK Y ++EE+ +R  +F  N+  A  +Q MDP  A H                          
Subjt:  DEVLIRQVVSGADDRLLT-------AEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMDPVDAPH--------------------------

Query:  --------ASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYI
                A ++  D LP DFDWR+ G V  V+NQG+C SCW+FS  GA EGAHF++ G+L+SLS+QQLVDCD                 GLMT A+ Y+
Subjt:  --------ASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE--------------RGLMTRAFRYI

Query:  SKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-SIDHGVLLVG
         + GG+E E  YPY G  +G CKF   KVA  V N++ I  + +QIAANLV+ GPLA+ +N+ +M TY GGVSCP+ICS+ +++HGVLLVG
Subjt:  SKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE-SIDHGVLLVG

AT4G16190.1 Papain family cysteine protease1.2e-7751.94Show/hide
Query:  IAFFFTV---LLSTTVAYGISSNENDDEVL--IRQVV-SGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----------
        + FFF +   LL+ ++   + S E  D  +  IRQVV    D++LL AE  F  FK K+ K Y ++ EHD+RFRVFKANLR A RNQ +D          
Subjt:  IAFFFTV---LLSTTVAYGISSNENDDEVL--IRQVV-SGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD----------

Query:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----
                              P D   A ILPT DLP++FDWR+ GAV PV+NQG C SCWSFSA+GALEGAHFLA  EL+SLS+QQLVDCDHE     
Subjt:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----

Query:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE
                  GLM  AF Y  K GG+ +EEDYPY G D   CKF  SK+ ASV+N+SV+S++ DQIAANLV+ GPLAIAIN+ +M TY GGVSCP +CS+
Subjt:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE

Query:  SIDHGVLLVG
        S DHGVLLVG
Subjt:  SIDHGVLLVG

AT4G39090.1 Papain family cysteine protease5.2e-8454.52Show/hide
Query:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------
        M+RL +  +F   +LS  +    SS+ ND D+++IRQVV GA+ ++LT+E  F  FK KFGK Y S EEHDYRF VFKANLR A R+QK+D         
Subjt:  MERLTVIAFFFTVLLSTTVAYGISSNEND-DEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMD---------

Query:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----
                              P DA  A ILPT++LP DFDWRDHGAV PV+NQGSC SCWSFSA GALEGA+FLA G+L+SLS+QQLVDCDHE     
Subjt:  ----------------------PVDAPHASILPTDDLPSDFDWRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHE-----

Query:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE
                  GLM  AF Y  K GG+ +EEDYPY G D   CK   SK+ ASV+N+SVIS + +QIAANLVK+GPLA+AIN+ YM TY GGVSCP IC+ 
Subjt:  ---------RGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQIAANLVKSGPLAIAINSAYMMTYKGGVSCPVICSE

Query:  SIDHGVLLVG
         ++HGVLLVG
Subjt:  SIDHGVLLVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCGCCTCACTGTCATTGCCTTCTTTTTCACCGTTCTGCTATCGACGACGGTAGCGTACGGCATATCTTCCAATGAAAACGACGATGAGGTTCTGATTCGTCAAGT
GGTATCCGGTGCGGATGATCGTCTCCTAACCGCAGAGCAACAGTTCGAGAACTTCAAACTCAAGTTCGGAAAGAGGTACGAGAGCAAGGAGGAGCACGATTACAGGTTCC
GCGTGTTCAAGGCTAACCTGCGCACAGCCGAGCGCAACCAGAAGATGGATCCCGTCGATGCTCCCCACGCTTCCATCCTTCCTACCGACGACTTACCCTCCGACTTCGAT
TGGCGAGACCACGGAGCCGTCAAGCCGGTCGAGAATCAGGGTTCTTGTGAGTCGTGCTGGTCTTTTAGCGCTGTTGGAGCGCTGGAGGGAGCTCATTTCTTGGCAAATGG
AGAGCTTATTAGTTTGAGCAAGCAACAGCTTGTGGATTGTGATCACGAGCGGGGGTTAATGACTAGAGCCTTTCGATACATTTCAAAAGTTGGTGGAATAGAGCGAGAGG
AGGACTATCCTTATATTGGGACAGATCAAGGTCCTTGCAAATTTCAAAATAGCAAAGTCGCTGCTTCTGTAGCCAACTATAGTGTCATTTCTAACAATGCCGACCAAATT
GCAGCAAACTTGGTTAAGAGTGGCCCTCTAGCAATTGCAATCAATTCAGCTTACATGATGACCTACAAAGGGGGAGTTTCGTGCCCAGTCATATGCTCTGAAAGTATAGA
TCATGGAGTGTTGCTTGTGGGAACTCTTGGGGAGCAAAATGAGGAGAGAATGGCTACTACAAAATCTGCAAAGGAAGAGATATCTGTGGAGTGGAGTCTGAGGATCCCTT
TTTTACATCTCCATGTCCCTCCCGCATTCGAGTACATTTTAAAAGTTGGTGGAGTTGAGCAAGAGGAGGACTATCCTTGCGCGACAGATCATGGCCCTTGCAAATTTCAA
AATGGCAAAGTTGCTGCTTCTGTAGCCAACTTCAGTGTCATTTCAAAGATGCCGACCAGATTGCAGCAAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCGCCTCACTGTCATTGCCTTCTTTTTCACCGTTCTGCTATCGACGACGGTAGCGTACGGCATATCTTCCAATGAAAACGACGATGAGGTTCTGATTCGTCAAGT
GGTATCCGGTGCGGATGATCGTCTCCTAACCGCAGAGCAACAGTTCGAGAACTTCAAACTCAAGTTCGGAAAGAGGTACGAGAGCAAGGAGGAGCACGATTACAGGTTCC
GCGTGTTCAAGGCTAACCTGCGCACAGCCGAGCGCAACCAGAAGATGGATCCCGTCGATGCTCCCCACGCTTCCATCCTTCCTACCGACGACTTACCCTCCGACTTCGAT
TGGCGAGACCACGGAGCCGTCAAGCCGGTCGAGAATCAGGGTTCTTGTGAGTCGTGCTGGTCTTTTAGCGCTGTTGGAGCGCTGGAGGGAGCTCATTTCTTGGCAAATGG
AGAGCTTATTAGTTTGAGCAAGCAACAGCTTGTGGATTGTGATCACGAGCGGGGGTTAATGACTAGAGCCTTTCGATACATTTCAAAAGTTGGTGGAATAGAGCGAGAGG
AGGACTATCCTTATATTGGGACAGATCAAGGTCCTTGCAAATTTCAAAATAGCAAAGTCGCTGCTTCTGTAGCCAACTATAGTGTCATTTCTAACAATGCCGACCAAATT
GCAGCAAACTTGGTTAAGAGTGGCCCTCTAGCAATTGCAATCAATTCAGCTTACATGATGACCTACAAAGGGGGAGTTTCGTGCCCAGTCATATGCTCTGAAAGTATAGA
TCATGGAGTGTTGCTTGTGGGAACTCTTGGGGAGCAAAATGAGGAGAGAATGGCTACTACAAAATCTGCAAAGGAAGAGATATCTGTGGAGTGGAGTCTGAGGATCCCTT
TTTTACATCTCCATGTCCCTCCCGCATTCGAGTACATTTTAAAAGTTGGTGGAGTTGAGCAAGAGGAGGACTATCCTTGCGCGACAGATCATGGCCCTTGCAAATTTCAA
AATGGCAAAGTTGCTGCTTCTGTAGCCAACTTCAGTGTCATTTCAAAGATGCCGACCAGATTGCAGCAAACTTGA
Protein sequenceShow/hide protein sequence
MERLTVIAFFFTVLLSTTVAYGISSNENDDEVLIRQVVSGADDRLLTAEQQFENFKLKFGKRYESKEEHDYRFRVFKANLRTAERNQKMDPVDAPHASILPTDDLPSDFD
WRDHGAVKPVENQGSCESCWSFSAVGALEGAHFLANGELISLSKQQLVDCDHERGLMTRAFRYISKVGGIEREEDYPYIGTDQGPCKFQNSKVAASVANYSVISNNADQI
AANLVKSGPLAIAINSAYMMTYKGGVSCPVICSESIDHGVLLVGTLGEQNEERMATTKSAKEEISVEWSLRIPFLHLHVPPAFEYILKVGGVEQEEDYPCATDHGPCKFQ
NGKVAASVANFSVISKMPTRLQQT