; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G20470 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G20470
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCysteine protease
Genome locationChr2:18133292..18134745
RNA-Seq ExpressionCSPI02G20470
SyntenyCSPI02G20470
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065198.1 zingipain-2 [Cucumis melo var. makuwa]5.0e-4370Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGYT IP NDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

XP_004152671.1 zingipain-2 [Cucumis sativus]1.3e-4370.77Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGY  IPSNDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT TNPNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

XP_008444761.1 PREDICTED: zingipain-2 [Cucumis melo]5.0e-4370Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGYT IP NDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

XP_023538152.1 zingipain-2 [Cucurbita pepo subsp. pepo]4.4e-3965.38Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L R VVTIDGY+ +P N+E KLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+ WGMDGYIH
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGIN LASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

XP_038885598.1 zingipain-2 [Benincasa hispida]6.8e-4066.92Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +LKRN VTIDGYT IP N+EGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+ 
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSG++EG+CGIN LASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

TrEMBL top hitse value%identityAlignment
A0A0A0LNP7 Uncharacterized protein6.4e-4470.77Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGY  IPSNDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT TNPNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

A0A1S3BBY8 zingipain-22.4e-4370Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGYT IP NDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

A0A5A7VC45 Zingipain-22.4e-4370Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L+RNVVTIDGYT IP NDEGKLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+SWGMDGY+H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGINKLASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

A0A6J1GH92 zingipain-22.1e-3965.38Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +L R VVTIDGY+ +P N+E KLLQAVA QPVSV ICGS+RAFQLYSK       +F+   S  +D  + +      NGVDYWIVKNSWG+ WGMDGYIH
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRNSGN+EG+CGIN LASYPT T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

A0A6P6AW24 low-temperature-induced cysteine proteinase4.7e-3967.2Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNS
        +LKR VVTIDGYT IP+N+E +LLQAVATQPVSV ICGS+RAFQLYSK G F     N      +       NG+DYWIVKNSWG  WGM+GYIHM RN 
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNS

Query:  GNAEGLCGINKLASYPTTTNPNPPP
        GN+EG+CGIN LASYPT T+PNPPP
Subjt:  GNAEGLCGINKLASYPTTTNPNPPP

SwissProt top hitse value%identityAlignment
P12412 Vignain3.0e-2241.18Show/hide
Query:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV------------DYWIVKNSWGRSWGMDGYI
        V+IDG+  +P NDE  LL+AVA QPVSV+I      FQ YS+            G +  DC   L +GV            +YWIV+NSWG  WG  GYI
Subjt:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV------------DYWIVKNSWGRSWGMDGYI

Query:  HMQRNSGNAEGLCGINKLASYPTTTNPNPPPGFFSVKSAYHLAISLNEPKEAL
         MQRN    EGLCGI  +ASYP   + + P G            SL+ PK+ L
Subjt:  HMQRNSGNAEGLCGINKLASYPTTTNPNPPPGFFSVKSAYHLAISLNEPKEAL

P25251 Cysteine proteinase COT44 (Fragment)1.9e-2449.24Show/hide
Query:  SLQLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQR
        SL     VVTIDGY  +PS DE  L +AV+ QPVSV+I    RAFQ Y   G F            V       NGVDYWIV+NSWG  WG DGYI M+R
Subjt:  SLQLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQR

Query:  NSGNAEGLCGINKLASYPTTTNPNPPPGFFSV
        N  +  G CGI   ASYP   +PNP  G  SV
Subjt:  NSGNAEGLCGINKLASYPTTTNPNPPPGFFSV

P25777 Oryzain beta chain1.7e-2247.9Show/hide
Query:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG
        VV+IDG+  +P NDE  L +AVA QPVSV+I    R FQLY   G F            V       NG DYWIV+NSWG  WG  GY+ M+RN     G
Subjt:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG

Query:  LCGINKLASYPTTTNPNPP
         CGI  +ASYPT +  NPP
Subjt:  LCGINKLASYPTTTNPNPP

P25803 Vignain2.3e-2244.44Show/hide
Query:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV------------DYWIVKNSWGRSWGMDGYI
        V+IDG+  +P+NDE  LL+AVA QPVSV+I      FQ YS+            G +  DC   L +GV            +YWIV+NSWG  WG  GYI
Subjt:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV------------DYWIVKNSWGRSWGMDGYI

Query:  HMQRNSGNAEGLCGINKLASYPTTTNPNPPPGFFS
         MQRN    EGLCGI  L SYP   + + P G FS
Subjt:  HMQRNSGNAEGLCGINKLASYPTTTNPNPPPGFFS

Q9LT78 Probable cysteine protease RD21C1.7e-2245Show/hide
Query:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG
        VVTIDGY  +P NDE  L +A+A QP+SV+I    RAFQLY+  G F  +         V        G DYWIV+NSWG +WG  GY  ++RN   + G
Subjt:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG

Query:  LCGINKLASYPTTTNPNPPP
         CG+  +ASYPT ++ + PP
Subjt:  LCGINKLASYPTTTNPNPPP

Arabidopsis top hitse value%identityAlignment
AT1G09850.1 xylem bark cysteine peptidase 31.8e-3858.46Show/hide
Query:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH
        +LK+ VVTID Y  + SNDE  L++AVA QPVSV ICGS+RAFQLYS        +F+   S  +D  + +     +NGVDYWIVKNSWG+SWGMDG++H
Subjt:  QLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWL-----RNGVDYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP
        MQRN+ N++G+CGIN LASYP  T+PNPPP
Subjt:  MQRNSGNAEGLCGINKLASYPTTTNPNPPP

AT1G20850.1 xylem cysteine peptidase 21.0e-2246.77Show/hide
Query:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV-----------DYWIVKNSWGRSWGMDGYIH
        VTI+G+  +P+NDE  LL+A+A QP+SV+I  S R FQ YS             G +   C + L +GV           DY IVKNSWG  WG  GYI 
Subjt:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGV-----------DYWIVKNSWGRSWGMDGYIH

Query:  MQRNSGNAEGLCGINKLASYPTTT
        ++RN+G  EGLCGINK+AS+PT T
Subjt:  MQRNSGNAEGLCGINKLASYPTTT

AT1G47128.1 Granulin repeat cysteine protease family protein6.1e-2345.74Show/hide
Query:  QLKRN--VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQR
        Q+++N  VVTID Y  +P+  E  L +AVA QP+S++I    RAFQLY   G F  S         V       NG DYWIV+NSWG+SWG  GY+ M R
Subjt:  QLKRN--VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQR

Query:  NSGNAEGLCGINKLASYPTTTNPNPP-PG
        N  ++ G CGI    SYP     NPP PG
Subjt:  NSGNAEGLCGINKLASYPTTTNPNPP-PG

AT3G19390.1 Granulin repeat cysteine protease family protein1.2e-2345Show/hide
Query:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG
        VVTIDGY  +P NDE  L +A+A QP+SV+I    RAFQLY+  G F  +         V        G DYWIV+NSWG +WG  GY  ++RN   + G
Subjt:  VVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEG

Query:  LCGINKLASYPTTTNPNPPP
         CG+  +ASYPT ++ + PP
Subjt:  LCGINKLASYPTTTNPNPPP

AT4G35350.1 xylem cysteine peptidase 18.0e-2350.44Show/hide
Query:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEGL
        VTI GY  +P ND+  L++A+A QPVSV+I  S R FQ Y K G F                     G DY IVKNSWG  WG  G+I M+RN+G  EGL
Subjt:  VTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYIHMQRNSGNAEGL

Query:  CGINKLASYPTTT
        CGINK+ASYPT T
Subjt:  CGINKLASYPTTT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAGATTGCTTGCTGACTTGCATTTTATGTTTATTGAGCTTGCAGCTGAAGAGGAATGTTGTTACCATTGATGGTTATACTTATATTCCTTCAAATGATGAGGG
AAAATTACTGCAAGCAGTAGCAACTCAACCTGTGAGTGTTAGCATATGTGGCAGTGATAGAGCCTTTCAATTATACTCAAAGGTTGGGAATTTTCTCCGGTCCGTGTTCA
ACTTCTTTGGATCATGGTGTGTTGATTGTAGGATATGGCTCAGAAATGGAGTTGATTATTGGATTGTGAAGAACTCATGGGGTAGAAGTTGGGGAATGGATGGTTACATC
CACATGCAGCGCAACAGCGGCAATGCGGAAGGCCTTTGTGGAATTAACAAGCTTGCTTCATATCCAACTACAACAAATCCTAACCCTCCTCCAGGCTTCTTTTCGGTCAA
GAGTGCTTACCATTTGGCTATTAGTCTCAACGAGCCTAAAGAGGCTTTGGCATCTCGTCTGATTGGTCGTGTCACTTTTGAGGTCACTTGTACAGGGCAGTTTGATATTA
CTCTTTTAGTGAAGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAAGATTGCTTGCTGACTTGCATTTTATGTTTATTGAGCTTGCAGCTGAAGAGGAATGTTGTTACCATTGATGGTTATACTTATATTCCTTCAAATGATGAGGG
AAAATTACTGCAAGCAGTAGCAACTCAACCTGTGAGTGTTAGCATATGTGGCAGTGATAGAGCCTTTCAATTATACTCAAAGGTTGGGAATTTTCTCCGGTCCGTGTTCA
ACTTCTTTGGATCATGGTGTGTTGATTGTAGGATATGGCTCAGAAATGGAGTTGATTATTGGATTGTGAAGAACTCATGGGGTAGAAGTTGGGGAATGGATGGTTACATC
CACATGCAGCGCAACAGCGGCAATGCGGAAGGCCTTTGTGGAATTAACAAGCTTGCTTCATATCCAACTACAACAAATCCTAACCCTCCTCCAGGCTTCTTTTCGGTCAA
GAGTGCTTACCATTTGGCTATTAGTCTCAACGAGCCTAAAGAGGCTTTGGCATCTCGTCTGATTGGTCGTGTCACTTTTGAGGTCACTTGTACAGGGCAGTTTGATATTA
CTCTTTTAGTGAAGAGTTAG
Protein sequenceShow/hide protein sequence
MGKDCLLTCILCLLSLQLKRNVVTIDGYTYIPSNDEGKLLQAVATQPVSVSICGSDRAFQLYSKVGNFLRSVFNFFGSWCVDCRIWLRNGVDYWIVKNSWGRSWGMDGYI
HMQRNSGNAEGLCGINKLASYPTTTNPNPPPGFFSVKSAYHLAISLNEPKEALASRLIGRVTFEVTCTGQFDITLLVKS