; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg13511 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg13511
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionLOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP2
Genome locationCarg_Chr02:920727..921269
RNA-Seq ExpressionCarg13511
SyntenyCarg13511
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AKO60151.1 cysteine proteinase 1, partial [Citrullus lanatus]2.5e-2854.78Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDT-----------------------------------------------TMT----EKVPVIDEKSIKDAVAN
        A AAVEGI+KIKTG L+S SEQELVD D                                                T+T    EKVPV DEKS+K AVAN
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDT-----------------------------------------------TMT----EKVPVIDEKSIKDAVAN

Query:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        QPVSVAI  GGYDFQFYSGGVFSGNCGK+ NHGVA+VGYGEASN+   LVKNSWGTD
Subjt:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

KAG6604877.1 Thiol protease 102, partial [Cucurbita argyrosperma subsp. sororia]1.6e-5973.02Show/hide
Query:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTT--
        MDLNGSSSNTLLNGASEPKLQLVIGASRKA+ESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD    
Subjt:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTT--

Query:  -------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF
                  K     +K+        P   AI       +++     + NCGKE NHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF
Subjt:  -------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF

KAG7034988.1 KDEL-tailed cysteine endopeptidase CEP2, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-92100Show/hide
Query:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTTMT
        MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTTMT
Subjt:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTTMT

Query:  EKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF
        EKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF
Subjt:  EKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF

XP_022947302.1 LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP2 [Cucurbita moschata]2.5e-7677.1Show/hide
Query:  LNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD------
        LNGSSSNTLLNGASEPKLQL+IGASRKA+ESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD      
Subjt:  LNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD------

Query:  -------------------TT--------------------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG
                           TT                     TEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKE NHGVAVVGYG
Subjt:  -------------------TT--------------------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG

Query:  EASNRLVKNSWGTD
        EASNRLVKNSWGTD
Subjt:  EASNRLVKNSWGTD

XP_023533790.1 LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP2 [Cucurbita pepo subsp. pepo]2.2e-6469.77Show/hide
Query:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDT---
        MDLNGSSSNTLLNGASEPKLQL+ GASRKA+ESLTDQPCLANKKLHKSSPNKAW      RSSSEQNQA+AAVE ISKIKTGTLVSQSEQELVD D    
Subjt:  MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDT---

Query:  ------------------------------------------TMTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVG
                                                  TMTEKVPVIDEKSIKDAVANQP SVAIHTGGYDFQFYSGGVFS NCGKE NHGVAVVG
Subjt:  ------------------------------------------TMTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVG

Query:  YGEASNR---LVKNS
        YGEASN+   LVKNS
Subjt:  YGEASNR---LVKNS

TrEMBL top hitse value%identityAlignment
A0A0A0LJV6 Uncharacterized protein1.1e-2449.68Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN
        A AAVEGI+KIK G L+S SEQELVD D T                                                     EKVPV DEKS+K AVAN
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN

Query:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        QPVSVAI   G +FQFYSGG+FSGNCG + NHGVA+VGYGE SN+   LVKNSWGTD
Subjt:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

A0A1S3C828 ervatamin-B-like1.8e-2450Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN
        A AAVEGI+KIK G L+S SEQELVD D T                                                     EKVPV DEKS++ AVA 
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN

Query:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGT
        QPVSVAI  GG DFQFYSGG+FSGNCGK+ NHGVA+VGYGE SN+   LVKNSWGT
Subjt:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGT

A0A384S0D9 Cysteine proteinase 1 (Fragment)1.2e-2854.78Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDT-----------------------------------------------TMT----EKVPVIDEKSIKDAVAN
        A AAVEGI+KIKTG L+S SEQELVD D                                                T+T    EKVPV DEKS+K AVAN
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDT-----------------------------------------------TMT----EKVPVIDEKSIKDAVAN

Query:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        QPVSVAI  GGYDFQFYSGGVFSGNCGK+ NHGVA+VGYGEASN+   LVKNSWGTD
Subjt:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

A0A5A7SQK0 Ervatamin-B-like1.8e-2450Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN
        A AAVEGI+KIK G L+S SEQELVD D T                                                     EKVPV DEKS++ AVA 
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTM---------------------------------------------------TEKVPVIDEKSIKDAVAN

Query:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGT
        QPVSVAI  GG DFQFYSGG+FSGNCGK+ NHGVA+VGYGE SN+   LVKNSWGT
Subjt:  QPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGT

A0A6J1G629 LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP21.2e-7677.1Show/hide
Query:  LNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD------
        LNGSSSNTLLNGASEPKLQL+IGASRKA+ESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD      
Subjt:  LNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRD------

Query:  -------------------TT--------------------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG
                           TT                     TEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKE NHGVAVVGYG
Subjt:  -------------------TT--------------------MTEKVPVIDEKSIKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG

Query:  EASNRLVKNSWGTD
        EASNRLVKNSWGTD
Subjt:  EASNRLVKNSWGTD

SwissProt top hitse value%identityAlignment
O65039 Vignain6.5e-1941.94Show/hide
Query:  AVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQPV
        AVEGI++IKT  LVS SEQELVD DT                                                      E VP  DE ++  AVANQPV
Subjt:  AVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQPV

Query:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD
        SVAI  GG DFQFYS GVF+G+CG E +HGVA+VGYG   +      VKNSWG +
Subjt:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD

P12412 Vignain1.9e-1842.58Show/hide
Query:  AVEGISKIKTGTLVSQSEQELVDRD-------------------------TTMT--------------------------EKVPVIDEKSIKDAVANQPV
        AVEGI++IKT  LVS SEQELVD D                         TT +                          E VPV DE ++  AVANQPV
Subjt:  AVEGISKIKTGTLVSQSEQELVDRD-------------------------TTMT--------------------------EKVPVIDEKSIKDAVANQPV

Query:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG---EASNR-LVKNSWGTD
        SVAI  GG DFQFYS GVF+G+C  + NHGVA+VGYG   + +N  +V+NSWG +
Subjt:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYG---EASNR-LVKNSWGTD

P43156 Thiol protease SEN1022.9e-1940Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMTE--------------------------------------------------KVPVIDEKSIKDAVANQPV
        A+VEGI++IKTG LVS SEQELVD DT+  E                                                   VP  +E ++  AVANQP+
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMTE--------------------------------------------------KVPVIDEKSIKDAVANQPV

Query:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD
        SV+I   GY FQFYS GVF+G CG E +HGVA+VGYG   +     +VKNSWG +
Subjt:  SVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD

Q9LM66 Cysteine protease XCP22.5e-1839.87Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP
        AAVEGI+KI TG L + SEQEL+D DTT                                                     + VP  DEKS+  A+A+QP
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP

Query:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG
        +SVAI   G +FQFYSGGVF G CG + +HGVA VGYG +      +VKNSWG
Subjt:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG

Q9STL4 KDEL-tailed cysteine endopeptidase CEP25.9e-2042.58Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP
        AAVEGI+KIKT  LVS SEQELVD DT                                                      E VP  DE ++  AVANQP
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP

Query:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        VSVAI  G  DFQFYS GVF+G+CG E NHGVA VGYG    +   +V+NSWG +
Subjt:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 21.7e-1939.87Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP
        AAVEGI+KI TG L + SEQEL+D DTT                                                     + VP  DEKS+  A+A+QP
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP

Query:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG
        +SVAI   G +FQFYSGGVF G CG + +HGVA VGYG +      +VKNSWG
Subjt:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG

AT3G19390.1 Granulin repeat cysteine protease family protein2.3e-1938.61Show/hide
Query:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTMT----------------------------------------------------EKVPVIDEKSIKDAVA
        A  AVEGI++IKTG L+S SEQELVD DT+                                                      E VP  DEKS+K A+A
Subjt:  ARAAVEGISKIKTGTLVSQSEQELVDRDTTMT----------------------------------------------------EKVPVIDEKSIKDAVA

Query:  NQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        NQP+SVAI  GG  FQ Y+ GVF+G CG   +HGV  VGYG    +   +V+NSWG++
Subjt:  NQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

AT3G48340.1 Cysteine proteinases superfamily protein4.2e-2142.58Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP
        AAVEGI+KIKT  LVS SEQELVD DT                                                      E VP  DE ++  AVANQP
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP

Query:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD
        VSVAI  G  DFQFYS GVF+G+CG E NHGVA VGYG    +   +V+NSWG +
Subjt:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWGTD

AT3G48350.1 Cysteine proteinases superfamily protein2.3e-1941.4Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDT------------------------------------------------TMT----EKVPVIDEKSIKDAVANQ
        AAVEGI+KI+T  LVS SEQELVD DT                                                T+T    E VP  DE+ +  AVA+Q
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDT------------------------------------------------TMT----EKVPVIDEKSIKDAVANQ

Query:  PVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD
        PVSVAI  G  DFQ YS GVF G CG + NHGV +VGYGE  N     +V+NSWG +
Subjt:  PVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR----LVKNSWGTD

AT4G35350.1 xylem cysteine peptidase 12.3e-1939.87Show/hide
Query:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP
        AAVEGI++I TG L S SEQEL+D DTT                                                     E VP  D++S+  A+A+QP
Subjt:  AAVEGISKIKTGTLVSQSEQELVDRDTTMT---------------------------------------------------EKVPVIDEKSIKDAVANQP

Query:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG
        VSVAI   G DFQFY GGVF+G CG + +HGVA VGYG +      +VKNSWG
Subjt:  VSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNR---LVKNSWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTAAATGGCTCCTCATCAAACACATTATTGAATGGTGCATCTGAGCCTAAGCTTCAACTCGTCATCGGTGCCTCAAGGAAGGCGATGGAGTCATTAACAGATCA
ACCCTGTCTAGCCAACAAGAAGCTGCACAAGTCATCACCCAATAAGGCGTGGCTCCACCCAAGGCGGAAGCGATCTTCTAGTGAGCAAAATCAGGCCAGAGCAGCTGTGG
AAGGCATTAGCAAAATAAAAACAGGCACATTGGTCTCTCAATCAGAACAAGAGCTTGTCGACCGTGATACCACTATGACAGAAAAAGTACCTGTAATTGATGAGAAAAGC
ATAAAAGATGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTCATACAGGGGGATATGATTTCCAGTTCTATTCTGGTGGAGTTTTCTCAGGGAATTGTGGAAAGGAATT
CAATCATGGAGTGGCAGTAGTTGGGTATGGGGAAGCTAGCAATAGGCTTGTCAAGAATTCATGGGGCACTGACGGGGTGAATCTGGTTACACGAGAATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTAAATGGCTCCTCATCAAACACATTATTGAATGGTGCATCTGAGCCTAAGCTTCAACTCGTCATCGGTGCCTCAAGGAAGGCGATGGAGTCATTAACAGATCA
ACCCTGTCTAGCCAACAAGAAGCTGCACAAGTCATCACCCAATAAGGCGTGGCTCCACCCAAGGCGGAAGCGATCTTCTAGTGAGCAAAATCAGGCCAGAGCAGCTGTGG
AAGGCATTAGCAAAATAAAAACAGGCACATTGGTCTCTCAATCAGAACAAGAGCTTGTCGACCGTGATACCACTATGACAGAAAAAGTACCTGTAATTGATGAGAAAAGC
ATAAAAGATGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTCATACAGGGGGATATGATTTCCAGTTCTATTCTGGTGGAGTTTTCTCAGGGAATTGTGGAAAGGAATT
CAATCATGGAGTGGCAGTAGTTGGGTATGGGGAAGCTAGCAATAGGCTTGTCAAGAATTCATGGGGCACTGACGGGGTGAATCTGGTTACACGAGAATTCTGA
Protein sequenceShow/hide protein sequence
MDLNGSSSNTLLNGASEPKLQLVIGASRKAMESLTDQPCLANKKLHKSSPNKAWLHPRRKRSSSEQNQARAAVEGISKIKTGTLVSQSEQELVDRDTTMTEKVPVIDEKS
IKDAVANQPVSVAIHTGGYDFQFYSGGVFSGNCGKEFNHGVAVVGYGEASNRLVKNSWGTDGVNLVTREF