; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013124 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013124
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionXylem cysteine proteinase 1
Genome locationChr01:27095257..27098973
RNA-Seq ExpressionHG10013124
SyntenyHG10013124
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAQ00104.1 papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]3.5e-0542.03Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE  T Y   +P
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus]9.4e-1154.17Show/hide
Query:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        K L+++ +RS+DW+ F SWM E++K YES +E ++RF IFR  L+ I+K N+E   GCTFGLN YSDLT  E
Subjt:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

KAF3328568.1 cysteine protease [Carex littledalei]3.5e-0540.58Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        E F++W++E+ K+Y S  E  HR E+F + ++ I+K NRE   G T  +N++SDLT +E    +S  TP
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

XP_038878031.1 uncharacterized protein LOC120070224 [Benincasa hispida]8.2e-0744.9Show/hide
Query:  VCSSSWSGGGMIRNRN----LLCRALSTPRSAPEPRSNSDDKRLIDETQRS-EDWESFKSWMLENE---KSYESKKEMMHRFEIFRERLRTIEKSNRE
        +CSSS     +IRNRN    LLC A S    APE   ++    +  ETQ++ E+WESFKSW+L+ +   KSY+S+KE++++FE+F +RLR+IE    E
Subjt:  VCSSSWSGGGMIRNRN----LLCRALSTPRSAPEPRSNSDDKRLIDETQRS-EDWESFKSWMLENE---KSYESKKEMMHRFEIFRERLRTIEKSNRE

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida]4.1e-1449.49Show/hide
Query:  RALSTPRSAPEPRSN---SDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYS
        R + TP   P    N   SD    I +   SEDWESFKSWM  + K Y S++EM++RF +F++ L+ IEK N+  T GCTFG N++SDLT+DEVP GY+
Subjt:  RALSTPRSAPEPRSN---SDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYS

TrEMBL top hitse value%identityAlignment
A0A0A0L4P3 Inhibitor_I29 domain-containing protein4.5e-1154.17Show/hide
Query:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE
        K L+++ +RS+DW+ F SWM E++K YES +E ++RF IFR  L+ I+K N+E   GCTFGLN YSDLT  E
Subjt:  KRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDE

A0A287EHW3 Uncharacterized protein1.7e-0542.03Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE  T Y   +P
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

A0A2H5P1C1 Uncharacterized protein2.2e-0546.97Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRE--STCGCTFGLNYYSDLTYDEVPTGY
        E F+SWML++ KSYES +E +HR EIF++ L+ I+  NRE   T     GLN +SD++Y+E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRE--STCGCTFGLNYYSDLTYDEVPTGY

B4ESE7 Papain-like cysteine proteinase1.7e-0542.03Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE  T Y   +P
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

M0Y4R9 Uncharacterized protein1.7e-0542.03Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        E F+ W+ +++K+Y S +E +HRFE+F++ L+ I++ NRE T     GLN ++DLT+DE  T Y   +P
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

SwissProt top hitse value%identityAlignment
O65493 Cysteine protease XCP11.1e-0645.31Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P00785 Actinidain8.0e-0539.73Show/hide
Query:  TQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        TQR+ D     ++SW+++  KSY S  E   RFEIF+E LR I++ N ++      GLN ++DLT +E  + Y
Subjt:  TQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P05994 Papaya proteinase 49.4e-0640.3Show/hide
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP
        F SWML++ K+Y++  E ++RFEIF++ L+ I++ N+    G   GLN +SDL+ DE    Y    P
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTP

P10056 Caricain6.1e-0540.32Show/hide
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        F SWML + K YE+  E ++RFEIF++ L  I+++N+++      GLN ++DL+ DE    Y
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

P14080 Chymopapain2.7e-0543.55Show/hide
Query:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        F SWML++ K YES  E ++RFEIFR+ L  I+++N+++      GLN ++DL+ DE    Y
Subjt:  FKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 21.4e-0434.38Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F++W+   EK+YE+ +E   RFE+F++ L+ I+++N++       GLN ++DL+++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT3G43960.1 Cysteine proteinases superfamily protein7.4e-0637.84Show/hide
Query:  ETQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E+QR+E      ++ W++EN K+Y    E   RF+IF++ L+ IE+ N +       GLN +SDLT DE    Y
Subjt:  ETQRSED--WESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT4G35350.1 xylem cysteine peptidase 17.9e-0845.31Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY

AT4G35350.2 xylem cysteine peptidase 17.9e-0845.31Show/hide
Query:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY
        E F+SWM E+ K+Y+S +E +HRFE+FRE L  I++ N E       GLN ++DLT++E    Y
Subjt:  ESFKSWMLENEKSYESKKEMMHRFEIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAGTAGCTGCACCAAAGAGTGGCATCTATTTATATGCAGACACCTGTTGGGCCTATTCGGCTGTACCTAGGATAAATCAAATCGTGACAGAAGAACTTTTGAA
GTTGTCGGAAGAAGATGTGATTAATCATCATTACCCAGACCCTGGATATGTTGGCGGCGGGGAAGGGGTTGGCTATATGAGGATGGCAAGAGGAGCTGCACCAAAGAGTG
ATATCTATTTATATGCAGACATCTATGGGGCCTATTTGGTTGTCCCTACCATTGAAGAGATAAATCAAATCGTGACAGGAGAACTTCTGAAGTTGTCGAAAGAAGATGTG
ATTAATCATCATTACCTGTACCCAGGATATGTTGGCGGTAATTGTGCGTGGGCGTGGAAGGAAAAAGAAAGAGGGGATGAAAATGGTGGGTCCGGTACTCCGTTGGCTGT
GTGTTCATCATCTTGGTCAGGTGGTGGGATGATTCGGAATCGGAATCTCCTTTGCCGTGCATTGTCAACACCCAGATCTGCACCTGAACCACGCAGCAATTCGGATGATA
AGAGATTAATAGATGAAACACAACGTTCGGAGGATTGGGAGTCGTTCAAGTCATGGATGTTGGAGAACGAGAAGAGTTATGAGAGCAAGAAAGAGATGATGCATAGGTTT
GAGATATTCAGGGAGAGATTGAGGACAATTGAAAAAAGTAACAGGGAGAGTACTTGTGGGTGTACGTTCGGGTTGAATTACTATTCAGACTTGACCTATGATGAGGTTCC
TACAGGCTATTCCGATCGGACACCCCGCTACGTATGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAGTAGCTGCACCAAAGAGTGGCATCTATTTATATGCAGACACCTGTTGGGCCTATTCGGCTGTACCTAGGATAAATCAAATCGTGACAGAAGAACTTTTGAA
GTTGTCGGAAGAAGATGTGATTAATCATCATTACCCAGACCCTGGATATGTTGGCGGCGGGGAAGGGGTTGGCTATATGAGGATGGCAAGAGGAGCTGCACCAAAGAGTG
ATATCTATTTATATGCAGACATCTATGGGGCCTATTTGGTTGTCCCTACCATTGAAGAGATAAATCAAATCGTGACAGGAGAACTTCTGAAGTTGTCGAAAGAAGATGTG
ATTAATCATCATTACCTGTACCCAGGATATGTTGGCGGTAATTGTGCGTGGGCGTGGAAGGAAAAAGAAAGAGGGGATGAAAATGGTGGGTCCGGTACTCCGTTGGCTGT
GTGTTCATCATCTTGGTCAGGTGGTGGGATGATTCGGAATCGGAATCTCCTTTGCCGTGCATTGTCAACACCCAGATCTGCACCTGAACCACGCAGCAATTCGGATGATA
AGAGATTAATAGATGAAACACAACGTTCGGAGGATTGGGAGTCGTTCAAGTCATGGATGTTGGAGAACGAGAAGAGTTATGAGAGCAAGAAAGAGATGATGCATAGGTTT
GAGATATTCAGGGAGAGATTGAGGACAATTGAAAAAAGTAACAGGGAGAGTACTTGTGGGTGTACGTTCGGGTTGAATTACTATTCAGACTTGACCTATGATGAGGTTCC
TACAGGCTATTCCGATCGGACACCCCGCTACGTATGGGATTGA
Protein sequenceShow/hide protein sequence
MARVAAPKSGIYLYADTCWAYSAVPRINQIVTEELLKLSEEDVINHHYPDPGYVGGGEGVGYMRMARGAAPKSDIYLYADIYGAYLVVPTIEEINQIVTGELLKLSKEDV
INHHYLYPGYVGGNCAWAWKEKERGDENGGSGTPLAVCSSSWSGGGMIRNRNLLCRALSTPRSAPEPRSNSDDKRLIDETQRSEDWESFKSWMLENEKSYESKKEMMHRF
EIFRERLRTIEKSNRESTCGCTFGLNYYSDLTYDEVPTGYSDRTPRYVWD