; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010079 (gene) of Chayote v1 genome

Gene IDSed0010079
OrganismSechium edule (Chayote v1)
DescriptionCysteine proteinase
Genome locationLG04:44304636..44305410
RNA-Seq ExpressionSed0010079
SyntenySed0010079
Gene Ontology termsGO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFO83614.1 papain-like cysteine protease [Fagopyrum esculentum]6.6e-0831.85Show/hide
Query:  GRRVRSSA-AWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEA-SENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFG
        G RVRSSA A L    + +   S+    II     + D  S      +  E   +F+SW++ H +NY + GE   R+++F +  RF+D  N E    + G
Subjt:  GRRVRSSA-AWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEA-SENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFG

Query:  LNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFEQH
        LN+F+D++ EE+R  +    + ARR L + R  ++
Subjt:  LNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFEQH

KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus]8.4e-1142.68Show/hide
Query:  ESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFTIPPSI
        + L+++AE S++W  F SWM EH + Y+S  E LYR+ IF    + +   NKE  GC FGLN++SD+T  EF R   +P  +
Subjt:  ESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFTIPPSI

PHT41774.1 Cysteine proteinase RD21a [Capsicum baccatum]5.1e-0838.89Show/hide
Query:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        S+    II     +GDL +        E  GL++SW+++H +NY + GE   R+ IF +  RF+D  N E    + GLNRFSD+T EE+R
Subjt:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR

PHU10453.1 Cysteine proteinase RD21a [Capsicum chinense]5.1e-0838.89Show/hide
Query:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        S+    II     +GDL +        E  GL++SW+++H +NY + GE   R++IF +  RF+D  N E    + GLNRFSD+T EE+R
Subjt:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida]2.2e-1137.07Show/hide
Query:  CGRRVRSSAAWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGL
        C   V SS AWL   R +    +    P+ +      D    + +A  SE+W  FKSWM  HN+ Y S+ E LYR+ +F +T + ++  NK  TGC FG 
Subjt:  CGRRVRSSAAWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGL

Query:  NRFSDMTMEEFRRDFT
        N FSD+T +E  + +T
Subjt:  NRFSDMTMEEFRRDFT

TrEMBL top hitse value%identityAlignment
A0A0A0L4P3 Inhibitor_I29 domain-containing protein1.2e-1045.95Show/hide
Query:  ESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRR
        + L+++AE S++W  F SWM EH + Y+S  E LYR+ IF    + +   NKE  GC FGLN++SD+T  EF R
Subjt:  ESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRR

A0A1S3YIC8 cysteine proteinase RD21a-like7.1e-0846.03Show/hide
Query:  GLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF
        GL++SW+++H +NY + GE   R++IF +  RF+D  N E+   + GLNRFSD+T EE+R  F
Subjt:  GLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF

A0A2G2W964 Cysteine proteinase RD21a2.5e-0838.89Show/hide
Query:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        S+    II     +GDL +        E  GL++SW+++H +NY + GE   R+ IF +  RF+D  N E    + GLNRFSD+T EE+R
Subjt:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR

A0A2G3BVF2 Cysteine proteinase RD21a2.5e-0838.89Show/hide
Query:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        S+    II     +GDL +        E  GL++SW+++H +NY + GE   R++IF +  RF+D  N E    + GLNRFSD+T EE+R
Subjt:  SSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR

T1P6Y9 Papain-like cysteine protease3.2e-0831.85Show/hide
Query:  GRRVRSSA-AWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEA-SENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFG
        G RVRSSA A L    + +   S+    II     + D  S      +  E   +F+SW++ H +NY + GE   R+++F +  RF+D  N E    + G
Subjt:  GRRVRSSA-AWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEA-SENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFG

Query:  LNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFEQH
        LN+F+D++ EE+R  +    + ARR L + R  ++
Subjt:  LNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFEQH

SwissProt top hitse value%identityAlignment
A5HII1 Actinidain9.0e-0836.25Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKEST-GCRFGLNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFE
        +++SW++++ ++Y S GE   R++IF ET RF+D  N ++    + GLN+F+D+T EEFR  +    S + +T   +R+E
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKEST-GCRFGLNRFSDMTMEEFRRDFTIPPSIARRTLRRHRFE

P00784 Papain1.3e-0941.27Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFT
        LF+SWML+HN+ Y++  E +YR++IF +  +++D +NK++     GLN F+DM+ +EF+  +T
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFT

P05994 Papaya proteinase 41.9e-1045.16Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF
        LF SWML+HN+NY++  E LYR++IF +  +++D  NK   G   GLN FSD++ +EF+  +
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF

P10056 Caricain8.1e-0940.32Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF
        LF SWML HN+ Y++  E LYR++IF +   ++D +NK++     GLN F+D++ +EF   +
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF

P14080 Chymopapain6.2e-0940.32Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF
        LF SWML+HN+ Y+S  E +YR++IF +   ++D +NK++     GLN F+D++ +EF++ +
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDF

Arabidopsis top hitse value%identityAlignment
AT3G19400.2 Cysteine proteinases superfamily protein1.3e-0640Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNK-ESTGCRFGLNRFSDMTMEEFR
        +++ W++E+ +NY   GE   R+KIF +  +FVD  N         GL RF+D+T EEFR
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNK-ESTGCRFGLNRFSDMTMEEFR

AT4G11320.1 Papain family cysteine protease3.5e-0739.66Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEF
        +F+SWM++H + Y S  E   R  IF +  RF+   N E+   R GLNRF+D+++ E+
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEF

AT4G23520.1 Cysteine proteinases superfamily protein4.6e-0733.33Show/hide
Query:  LFKSWMLEHNRNY-QSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFTIPPSIARRTLRRHR
        +F+ WM +H + Y  + GE   R++ F +  RF+D  N ++   + GL RF+D+T++E+R  F   P   +R L+  R
Subjt:  LFKSWMLEHNRNY-QSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFRRDFTIPPSIARRTLRRHR

AT4G35350.1 xylem cysteine peptidase 11.9e-0842.37Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        LF+SWM EH++ Y+S  E ++R+++F E    +D  N E      GLN F+D+T EEF+
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR

AT4G35350.2 xylem cysteine peptidase 11.9e-0842.37Show/hide
Query:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR
        LF+SWM EH++ Y+S  E ++R+++F E    +D  N E      GLN F+D+T EEF+
Subjt:  LFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRFSDMTMEEFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTTTGTTAGTTTGGTGCGGTCGTCGGGTGCGGTCGTCGGCGGCTTGGTTGAACCGGAATCGTGTGGTATCACGCGGCTTCTCAAGCCAACAAGCCCCAATCAT
CGATCAGATGGGAGTGAATGGCGATTTGGAATCATTGATGAAGGAAGCAGAAGCTTCTGAGAATTGGGGTTTGTTCAAGTCGTGGATGTTAGAACACAATAGGAATTACC
AGAGCCAAGGAGAGACGCTGTATAGGTATAAGATATTCTGTGAAACGAAGAGGTTTGTTGATATCTCCAACAAGGAGAGTACTGGGTGTAGGTTCGGCTTGAATCGGTTT
TCGGACATGACCATGGAAGAGTTTCGCCGTGACTTTACCATTCCCCCATCAATCGCCAGAAGGACGCTTCGTCGTCATCGTTTTGAGCAACACTGA
mRNA sequenceShow/hide mRNA sequence
AAAGGAAAGAACCCTAAATTAAACGCCGGCTGTGCGTGGAGTTGAGAAAGGAATGGAGGGTTTGTTAGTTTGGTGCGGTCGTCGGGTGCGGTCGTCGGCGGCTTGGTTGA
ACCGGAATCGTGTGGTATCACGCGGCTTCTCAAGCCAACAAGCCCCAATCATCGATCAGATGGGAGTGAATGGCGATTTGGAATCATTGATGAAGGAAGCAGAAGCTTCT
GAGAATTGGGGTTTGTTCAAGTCGTGGATGTTAGAACACAATAGGAATTACCAGAGCCAAGGAGAGACGCTGTATAGGTATAAGATATTCTGTGAAACGAAGAGGTTTGT
TGATATCTCCAACAAGGAGAGTACTGGGTGTAGGTTCGGCTTGAATCGGTTTTCGGACATGACCATGGAAGAGTTTCGCCGTGACTTTACCATTCCCCCATCAATCGCCA
GAAGGACGCTTCGTCGTCATCGTTTTGAGCAACACTGAAGGGTGTTTACAACAACTAAATATTCATTATAGTCTTACCATGTTTTGGATTTGGTGTGCCCTTTTCCTATC
GGACTTCTTTGATATGCATTAACTATGTTAGTATCCAGAATTTGAATTGTTTCTTCAGATTGATTTTCAGTTCAAGGTCTCTCTTGATCTTTTATTATGAATCAATTTAA
CTGCTTCAATATATTCACTTTCACATA
Protein sequenceShow/hide protein sequence
MEGLLVWCGRRVRSSAAWLNRNRVVSRGFSSQQAPIIDQMGVNGDLESLMKEAEASENWGLFKSWMLEHNRNYQSQGETLYRYKIFCETKRFVDISNKESTGCRFGLNRF
SDMTMEEFRRDFTIPPSIARRTLRRHRFEQH