; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G018420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G018420
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionxylem cysteine proteinase 1-like
Genome locationchr05:25685497..25687576
RNA-Seq ExpressionLsi05G018420
SyntenyLsi05G018420
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650303.1 hypothetical protein Csa_010836 [Cucumis sativus]2.3e-0864Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE
        +HKKKY S EE LYRFGIFR +LK I+  N++ +GCTFGLN +SDLT+ E
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE

KAG8369302.1 hypothetical protein BUALT_Bualt15G0137200 [Buddleja alternifolia]1.2e-0450Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        INL      KH KKY S EE L+RF IF+D LK I+ KN+  S    GLN F+DL+ +E  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

XP_012838990.1 PREDICTED: xylem cysteine proteinase 2 [Erythranthe guttata]6.9e-0548.39Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        INL     +KH KKY + EE L+RF IF+D LK I+ KN+  +    GLN F+DL+ DE  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

XP_021889000.1 papaya proteinase 4-like [Carica papaya]4.1e-0546.03Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDF
        KH K Y + +E LYRF IF+D LK+I+ +N+  +G   GLN FSDL++DE  K  V     D+
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDF

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida]3.9e-1648.76Show/hide
Query:  MIGAVFRGC---------WSLIWKHNILPTPAHTRRTNINL----------------ETKQFKK----HKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQR
        M  A+FR C         W LI K NI PTPAHT  TNIN                 + + FK     H KKYGS+EE+LYRFG+F+  LK IE  N+  
Subjt:  MIGAVFRGC---------WSLIWKHNILPTPAHTRRTNINL----------------ETKQFKK----HKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQR

Query:  SGCTFGLNYFSDLTSDEVPKG
        +GCTFG N+FSDLT DEVPKG
Subjt:  SGCTFGLNYFSDLTSDEVPKG

TrEMBL top hitse value%identityAlignment
A0A022R8K4 Uncharacterized protein3.3e-0548.39Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        INL     +KH KKY + EE L+RF IF+D LK I+ KN+  +    GLN F+DL+ DE  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

A0A0A0L4P3 Inhibitor_I29 domain-containing protein1.1e-0864Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE
        +HKKKY S EE LYRFGIFR +LK I+  N++ +GCTFGLN +SDLT+ E
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE

A0A1S3YNP8 xylem cysteine proteinase 1-like9.7e-0550Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        INL     +KH K Y S EE L+RF IFRD LK I+ +N+  S    GLN F+DL+ DE  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

A0A2K2DCU7 Uncharacterized protein9.7e-0540.7Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK---GCVPP-----WRPDFKWNRIS
        I L  K   KH+K Y S EE L+RF +F+D LK I+  NR+ +    GLN F+DLT +E      G  PP      R  FK+  +S
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK---GCVPP-----WRPDFKWNRIS

S8CL74 Uncharacterized protein9.7e-0546.77Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        +NL     +KH +KY + EE L RF +FRD LK IE +N+  S    GLN F+D+T DE  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

SwissProt top hitse value%identityAlignment
O65493 Cysteine protease XCP11.1e-0534.48Show/hide
Query:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR
        TP H   T+  LE  +    +H K Y S EE ++RF +FR+ L  I+ +N + +    GLN F+DLT +E     +   +P F   R
Subjt:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR

P00784 Papain1.1e-0544Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE
        KH K Y + +E +YRF IF+D LK+I+  N++ +    GLN F+D+++DE
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE

P05994 Papaya proteinase 42.0e-0747.46Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE
        I L      KH K Y + +E LYRF IF+D LK+I+ +N+  +G   GLN FSDL++DE
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDE

P14080 Chymopapain2.9e-0644.44Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDF
        KH K Y S +E +YRF IFRD L +I+  N++ +    GLN F+DL++DE  K  V     DF
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDF

Q40143 Cysteine proteinase 31.9e-0547.17Show/hide
Query:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        +H+K+Y S EE+  RF IF D LK I   NR+      G+N F+DLT DE  K
Subjt:  KHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 28.2e-0435.48Show/hide
Query:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK
        I L        +K Y + EE   RF +F+D LK I+  N++      GLN F+DL+ +E  K
Subjt:  INLETKQFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPK

AT4G35350.1 xylem cysteine peptidase 17.9e-0734.48Show/hide
Query:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR
        TP H   T+  LE  +    +H K Y S EE ++RF +FR+ L  I+ +N + +    GLN F+DLT +E     +   +P F   R
Subjt:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR

AT4G35350.2 xylem cysteine peptidase 17.9e-0734.48Show/hide
Query:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR
        TP H   T+  LE  +    +H K Y S EE ++RF +FR+ L  I+ +N + +    GLN F+DLT +E     +   +P F   R
Subjt:  TPAHTRRTNINLETKQ--FKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATCGCAATCGGATGAAAATGGCGATTACGGCATTCCGTTCGCGTTTGCTCCGTTCATCATCTTCTTCAGCTGCCTTAATCCGGAATCGAGATCTCCTCTGCCG
CCGCGCATTCTCAGCAGCTAAAGAACATACCAAGATCAATCTTGAAAGCAAGCCGTCTAAGTCGGATGGTGTGGGAGAAGCACGAAATTGGAAGGATTTTGAGTCCTTCA
AGTCAATGATTGGTGCAGTATTCCGTGGTTGTTGGTCCTTGATCTGGAAACATAATATTCTACCCACCCCTGCACATACCCGACGCACCAACATCAATCTAGAAACCAAG
CAGTTCAAGAAGCACAAGAAGAAGTACGGGAGCAAGGAAGAGGTTTTGTATAGGTTTGGGATATTCAGAGACAAATTGAAGTTTATTGAGATGAAGAACAGGCAGAGATC
TGGGTGTACCTTTGGGTTGAATTACTTTTCAGACTTGACCAGTGATGAAGTTCCCAAAGGCTGCGTTCCCCCTTGGAGACCCGACTTCAAGTGGAATAGAATTTCAAAAT
ATTCGCGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATCGCAATCGGATGAAAATGGCGATTACGGCATTCCGTTCGCGTTTGCTCCGTTCATCATCTTCTTCAGCTGCCTTAATCCGGAATCGAGATCTCCTCTGCCG
CCGCGCATTCTCAGCAGCTAAAGAACATACCAAGATCAATCTTGAAAGCAAGCCGTCTAAGTCGGATGGTGTGGGAGAAGCACGAAATTGGAAGGATTTTGAGTCCTTCA
AGTCAATGATTGGTGCAGTATTCCGTGGTTGTTGGTCCTTGATCTGGAAACATAATATTCTACCCACCCCTGCACATACCCGACGCACCAACATCAATCTAGAAACCAAG
CAGTTCAAGAAGCACAAGAAGAAGTACGGGAGCAAGGAAGAGGTTTTGTATAGGTTTGGGATATTCAGAGACAAATTGAAGTTTATTGAGATGAAGAACAGGCAGAGATC
TGGGTGTACCTTTGGGTTGAATTACTTTTCAGACTTGACCAGTGATGAAGTTCCCAAAGGCTGCGTTCCCCCTTGGAGACCCGACTTCAAGTGGAATAGAATTTCAAAAT
ATTCGCGATAG
Protein sequenceShow/hide protein sequence
MENRNRMKMAITAFRSRLLRSSSSSAALIRNRDLLCRRAFSAAKEHTKINLESKPSKSDGVGEARNWKDFESFKSMIGAVFRGCWSLIWKHNILPTPAHTRRTNINLETK
QFKKHKKKYGSKEEVLYRFGIFRDKLKFIEMKNRQRSGCTFGLNYFSDLTSDEVPKGCVPPWRPDFKWNRISKYSR