CuGenDBv2

Lsi05G020710 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G020710
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Xylem cysteine proteinase 1
Genome location: chr05:27620481..27622099
RNA-Seq Expression: Lsi05G020710
Synteny: Lsi05G020710
Gene Ontology terms: GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR013201 - Cathepsin propeptide inhibitor domain (I29)
                  IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus] | 5.4e-11 | 54.17
Query:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        K L+++ +RS+DW+ F SWM E++K YES +E ++RF IFR  L+ I+K N+E   GCTFGLN YSDLT  E
Subjt:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

XP_037446770.1 cysteine protease XCP1-like [Triticum dicoccoides] | 9.0e-06 | 36.36
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC  +   R++       SE+   S D RLI         E F+ W+ ++EK+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
            Y   +P
Subjt:  VPTGYSDRTP

XP_037473969.1 cysteine protease XCP1-like [Triticum dicoccoides] | 9.0e-06 | 36.36
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC  +   R++       SE+   S D RLI         E F+ W+ ++EK+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
            Y   +P
Subjt:  VPTGYSDRTP

XP_038878031.1 uncharacterized protein LOC120070224 [Benincasa hispida] | 9.0e-06 | 43.27
Query:  VCSSSWSGGGMIRNRN----LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENE---KSYESKKEMMHRFEIFRERLRTIEK
        +CSSS     +IRNRN    LLC A S    APE      E+  R  +  L +  Q  E+WESFKSW+L+ +   KSY+S+KE++++FE+F +RLR+IE 
Subjt:  VCSSSWSGGGMIRNRN----LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENE---KSYESKKEMMHRFEIFRERLRTIEK

Query:  SNRE
           E
Subjt:  SNRE

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida] | 4.7e-15 | 50
Query:  PRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYS
        P  A  P +N++  Q +SD    I +   SEDWESFKSWM  + K Y S++EM++RF +F++ L+ IEK N+  T GCTFG N++SDLT+DEVP GY+
Subjt:  PRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYS

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0L4P3 Inhibitor_I29 domain-containing protein | 2.6e-11 | 54.17
Query:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        K L+++ +RS+DW+ F SWM E++K YES +E ++RF IFR  L+ I+K N+E   GCTFGLN YSDLT  E
Subjt:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

A0A452XLR1 Uncharacterized protein | 9.7e-06 | 35.45
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC  +   R++       SE+   S D RLI         E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
            Y   +P
Subjt:  VPTGYSDRTP

B4ESE7 Papain-like cysteine proteinase | 9.7e-06 | 35.45
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC      R++       SE+   S D RL+         E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
          T Y   +P
Subjt:  VPTGYSDRTP

M7ZZ67 Xylem cysteine proteinase 2 | 9.7e-06 | 35.45
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC  +   R++       SE+   S D RLI         E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
            Y   +P
Subjt:  VPTGYSDRTP

N1QVR8 Xylem cysteine proteinase 1 | 9.7e-06 | 35.45
Query:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        LLC  +   R++       SE+   S D RLI         E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE
Subjt:  LLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

Query:  VPTGYSDRTP
            Y   +P
Subjt:  VPTGYSDRTP

SwissProt top hits | e value | % identity (alignment follows each hit)
O65493 Cysteine protease XCP1 | 6.4e-07 | 45.31
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P00785 Actinidain | 4.6e-05 | 39.73
Query:  TQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        TQR+ D     ++SW+++  KSY S  E   RFEIF+E LR I++ N ++      GLN ++DLT +E  + Y
Subjt:  TQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P05994 Papaya proteinase 4 | 5.5e-06 | 40.3
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        F SWML++ K+Y++  E ++RFEIF++ L+ I++ N+    G   GLN +SDL+ DE    Y    P
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

P10056 Caricain | 3.5e-05 | 40.32
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        F SWML + K YE+  E ++RFEIF++ L  I+++N+++      GLN ++DL+ DE    Y
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P14080 Chymopapain | 1.6e-05 | 43.55
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        F SWML++ K YES  E ++RFEIFR+ L  I+++N+++      GLN ++DL+ DE    Y
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT1G20850.1 xylem cysteine peptidase 2 | 8.1e-05 | 34.38
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F++W+   EK+YE+ +E   RFE+F++ L+ I+++N++       GLN ++DL+++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT3G43960.1 Cysteine proteinases superfamily protein | 4.3e-06 | 37.84
Query:  ETQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E+QR+E      ++ W++EN K+Y    E   RF+IF++ L+ IE+ N +       GLN +SDLT DE    Y
Subjt:  ETQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT4G35350.1 xylem cysteine peptidase 1 | 4.6e-08 | 45.31
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT4G35350.2 xylem cysteine peptidase 1 | 4.6e-08 | 45.31
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
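
The % identity values listed for the hits above can be related to the alignment midline (the unlabeled line between Query and Subjt). The following is a minimal sketch, assuming the usual BLAST pairwise convention that a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The function name `percent_identity` is illustrative and not part of CuGenDB. It uses the first GenBank hit (KAE8650303.1), whose alignment contains no gaps, so the alignment length equals the query span.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identical positions as a percentage of the alignment length.

    Assumes `query` and `midline` are the aligned query row and the
    midline row of one BLAST pairwise alignment block (same length).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Letters in the midline are identities; '+' and ' ' are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Alignment rows copied from the KAE8650303.1 hit (72 aligned positions).
query = "KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE"
midline = "K L+++ +RS+DW+ F SWM E++K YES +E ++RF IFR  L+ I+K N+E   GCTFGLN YSDLT  E"

print(round(percent_identity(query, midline), 2))
```

For this hit the midline carries 39 identities over 72 positions, which rounds to the 54.17 reported in the table. For alignments that span several Query/Subjt blocks, the rows would need to be concatenated before counting.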


Sequences
CDS sequence
ATGTATAATTGTGCGTGGGCGTGGAAGGAAAAAGAAAGAGGGGATGAAAATGGTGGGTCCGGTACTCCGTTGGCTGTGTGTTCATCATCTTGGTCAGGTGGTGGGATGAT
TCGGAATCGGAATCTCCTTTGCCGTGCATTGTCAACACCCAGATCTGCACCTGAACCACGCAGCAATGTAAGCGAACAGCAGTGCAGGTCGGATGATAAGAGATTAATAG
ATGAAACACAACGTTCGGAGGATTGGGAGTCGTTCAAGTCATGGATGTTGGAGAACGAGAAGAGTTATGAGAGCAAGAAAGAGATGATGCATAGGTTTGAGATATTCAGG
GAGAGATTGAGGACAATTGAAAAAAGTAACAGGGAGAGTACTTGTGGGTGTACGTTCGGGTTGAATTACTATTCAGACTTGACCTATGATGAGGTTCCTACAGGCTATTC
CGATCGGACACCCCGCTACGTATGGGATTGA
mRNA sequence
ATGTATAATTGTGCGTGGGCGTGGAAGGAAAAAGAAAGAGGGGATGAAAATGGTGGGTCCGGTACTCCGTTGGCTGTGTGTTCATCATCTTGGTCAGGTGGTGGGATGAT
TCGGAATCGGAATCTCCTTTGCCGTGCATTGTCAACACCCAGATCTGCACCTGAACCACGCAGCAATGTAAGCGAACAGCAGTGCAGGTCGGATGATAAGAGATTAATAG
ATGAAACACAACGTTCGGAGGATTGGGAGTCGTTCAAGTCATGGATGTTGGAGAACGAGAAGAGTTATGAGAGCAAGAAAGAGATGATGCATAGGTTTGAGATATTCAGG
GAGAGATTGAGGACAATTGAAAAAAGTAACAGGGAGAGTACTTGTGGGTGTACGTTCGGGTTGAATTACTATTCAGACTTGACCTATGATGAGGTTCCTACAGGCTATTC
CGATCGGACACCCCGCTACGTATGGGATTGATTCAATTATCTACATGAACCCTATGCCGGCCTACACTCTGATCTTCTACCTTCGAGACTTCTTTTTGTTCTTGTGAGAG
ATCAGAAAAGGCATGTAGTATTTTTTTAATCTCCTTATTATGCAACCTAGCTACCACCCCCTTCACTTGTTCACTATTTTTCTTCTCCATCTATATTTTTAAACGTTTGT
AATTTTGATTGGTGGACTACCAACCTCTCCAATGGTTTTGTCTTTGCATCTCTGCTAATGAGAGGGACTTGTTTCCATTGTGCTATCCTATTTGAAACTAAC
Protein sequence
MYNCAWAWKEKERGDENGGSGTPLAVCSSSWSGGGMIRNRNLLCRALSTPRSAPEPRSNVSEQQCRSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFR
ERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTPRYVWD
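
As a consistency check between the CDS and protein sequences above, the CDS can be translated with the standard genetic code. This is a minimal sketch, assuming the standard (nuclear) codon table applies and that the CDS starts in frame at its first base. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first 108 bases and the last 15 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 108 bases (36 codons) of the CDS sequence listed above.
cds_head = (
    "ATGTATAATTGTGCGTGGGCGTGGAAGGAAAAAGAAAGAGGGGATGAAAATGGTGGG"
    "TCCGGTACTCCGTTGGCTGTGTGTTCATCATCTTGGTCAGGTGGTGGGATG"
)
# Last 15 bases of the CDS, ending in the TGA stop codon.
cds_tail = "TACGTATGGGATTGA"

print(translate(cds_head))
print(translate(cds_tail))
```

The head fragment should reproduce the start of the protein sequence (`MYNCAWAWKEKERGDENGGSGTPLAVCSSSWSGGGM`) and the tail fragment its last four residues (`YVWD`). Joining the wrapped CDS lines into one string and passing it to `translate` extends the same check to the full sequence.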