; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G017890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G017890
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPapain family cysteine protease
Genome locationchr05:25240567..25245480
RNA-Seq ExpressionLsi05G017890
SyntenyLsi05G017890
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAF2071095.1 unnamed protein product [Brassica napus]5.6e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

CAF2149391.1 unnamed protein product [Brassica napus]5.6e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

KAF2542396.1 hypothetical protein F2Q68_00029910 [Brassica cretica]4.3e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

KAF3528335.1 hypothetical protein DY000_02038025 [Brassica cretica]4.3e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

XP_038887495.1 pro-cathepsin H-like [Benincasa hispida]1.1e-1143.12Show/hide
Query:  IPAPEESIRLSNMNLASQQQSKCDGVRSMEEAQSSENWNVFKRWMSRMKRKYRSEEEMLERFEIFNDTVNTIKEWKKKNLGCSSALNCFADTKDDEVPRG
        IP P  +  L+N+N     Q K DGV S+ +A  SE+W  FK WMS   +KY SEEEML RF +F  T+  I++  K   GC+   N F+D   DEVP+G
Subjt:  IPAPEESIRLSNMNLASQQQSKCDGVRSMEEAQSSENWNVFKRWMSRMKRKYRSEEEMLERFEIFNDTVNTIKEWKKKNLGCSSALNCFADTKDDEVPRG

Query:  HVSYRRLRF
        + S   + F
Subjt:  HVSYRRLRF

TrEMBL top hitse value%identityAlignment
A0A0D3A5L6 Uncharacterized protein1.6e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

A0A1J3GGP8 Putative cysteine proteinase2.3e-1048.1Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGA+T VK Q   R CW ++AVGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  YK  +  G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

A0A1S3E7S7 LOW QUALITY PROTEIN: zingipain-21.8e-1045.88Show/hide
Query:  HPHSKVPKYVDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG--GGYAKIVYKHAIRFG
        H H  VPK +DWR EGAVT VK Q ++ CW ++AV A+EGI KI TG+L  LS +++I  + + G+EG  GG   I + +  + G
Subjt:  HPHSKVPKYVDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG--GGYAKIVYKHAIRFG

A0A6D2HQ48 Uncharacterized protein3.9e-1046.84Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y++ +  G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

M4DB45 Uncharacterized protein3.6e-1129.9Show/hide
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI
                                     +K +  +P  Y GP G Q+DH ++       +G  +  V++S        G G++     + AGI
Subjt:  ----------------------------FRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESS--------GLGQILPTADEFAGI

SwissProt top hitse value%identityAlignment
P00784 Papain5.0e-1042.17Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQYK
        +P+YVDWR +GAVT VK Q     CW ++AV  IEGI KI TG L   S  E++  + R     GGY     +   ++GI Y+
Subjt:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQYK

P60994 Ervatamin-B1.1e-0936Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAI-RFGIQYKADRFRKRIKEEPKPYR
        +P +VDWR++GAV S+K QK+   CW ++AV A+E I KI TG+L  LS  E++  +       GG+    +++ I   GI  + +     ++   KPYR
Subjt:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAI-RFGIQYKADRFRKRIKEEPKPYR

Q8HY82 Cathepsin S3.2e-0935.71Show/hide
Query:  AHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG--GGYAKIVYKHAI
        ++P+  +P  VDWR +G VT VK Q     CW ++AVGA+E   K+ TG+L  LS   ++  + ++G++G  GG+    +++ I
Subjt:  AHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG--GGYAKIVYKHAI

Q9SUS9 Probable cysteine protease RDL55.3e-1246.84Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  +  G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

Q9SUT0 Probable cysteine protease RDL43.1e-1246.84Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  ++ G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein6.6e-1047.06Show/hide
Query:  VHGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEII
        +H +      P   VP  VDWRT+GAVT ++ Q K   CW ++AV AIEGI KI TG L  LS  ++I
Subjt:  VHGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEII

AT3G19390.1 Granulin repeat cysteine protease family protein3.9e-1041.25Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIRFG
        +P  +DWR +GAV  VK Q     CW ++A+GA+EGI +I TGEL  LS  E++  +  + D  GGG     +K  I  G
Subjt:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIRFG

AT4G11310.1 Papain family cysteine protease2.2e-1346.84Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  ++ G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

AT4G11320.1 Papain family cysteine protease3.8e-1346.84Show/hide
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  +  G
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG

AT4G23520.1 Cysteine proteinases superfamily protein1.1e-0941.03Show/hide
Query:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQN-PRHGDEGGGYAKIVYKHAI
        ++P+ VDWR EGAV+ +K Q   N CW ++ V A+EG+ KI+TGEL  LS  E++  N   +G  G G     ++  I
Subjt:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQN-PRHGDEGGGYAKIVYKHAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATATAGAATCGGCAGCTTCTCTTCTCAGCCCATTTTTGTTTCCGGGAGAAAGATGTATTCATTCGCTGGCTATCGGAATCTCCTCGGCCGTGCATTTTCAACCGT
CCACGGCCGTGCATTTTCAACCGCCCATCCGCATTCCAAGGTGCCGAAATATGTGGATTGGAGAACCGAAGGTGCTGTCACTTCGGTGAAGCTCCAAAAGAAAAATAGAT
GCTGGGTTTATGCTGCTGTAGGAGCAATTGAAGGAATATACAAAATAATGACTGGAGAGCTACCTATACTATCAGTAGATGAAATCATCCAACAAAACCCCCGACATGGT
GATGAAGGTGGTGGTTATGCGAAGATCGTGTACAAACACGCAATACGCTTTGGAATACAATACAAAGCAGATAGATTCAGAAAACGGATAAAAGAAGAGCCGAAACCATA
TCGGGGACCATTTGGAGAGCAAATGGACCATGAAATGCTTGCGTTTTGTGAAGTTGAAAGTTCTGGTACGAAGTTTTGTGAAGTTGAAAGTTCTGGGTTAGGCCAAATCT
TGCCCACTGCCGACGAATTTGCTGGGATTCAATCTGATGACGAAGATCTAAGTAATATGAAAATGGTACTGTATTCATCATCTTCCTTAGCTACCTTGATCCGAAACCCG
AATCGGAGTTTACTCTCCCGCCTATTCTCAACAGCCATCCCTGCACCTGAAGAATCTATCCGACTCAGCAACATGAATCTTGCAAGTCAGCAGCAGTCCAAGTGTGATGG
TGTGAGATCAATGGAAGAAGCACAAAGTTCAGAGAATTGGAACGTGTTCAAGAGATGGATGTCGAGGATGAAAAGGAAGTACCGGAGCGAGGAAGAGATGTTGGAGAGGT
TTGAGATATTCAATGATACAGTGAATACAATTAAGGAGTGGAAGAAGAAGAATCTTGGGTGTTCCTCCGCATTGAATTGCTTTGCAGACACGAAAGATGACGAGGTTCCC
AGGGGCCACGTCTCTTATCGTCGCCTTCGTTTTGGGAGAAGCCGAATCCTAAAGCGTTGGACTACGCCTAATAAGGACTCCATTGTTAATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATATAGAATCGGCAGCTTCTCTTCTCAGCCCATTTTTGTTTCCGGGAGAAAGATGTATTCATTCGCTGGCTATCGGAATCTCCTCGGCCGTGCATTTTCAACCGT
CCACGGCCGTGCATTTTCAACCGCCCATCCGCATTCCAAGGTGCCGAAATATGTGGATTGGAGAACCGAAGGTGCTGTCACTTCGGTGAAGCTCCAAAAGAAAAATAGAT
GCTGGGTTTATGCTGCTGTAGGAGCAATTGAAGGAATATACAAAATAATGACTGGAGAGCTACCTATACTATCAGTAGATGAAATCATCCAACAAAACCCCCGACATGGT
GATGAAGGTGGTGGTTATGCGAAGATCGTGTACAAACACGCAATACGCTTTGGAATACAATACAAAGCAGATAGATTCAGAAAACGGATAAAAGAAGAGCCGAAACCATA
TCGGGGACCATTTGGAGAGCAAATGGACCATGAAATGCTTGCGTTTTGTGAAGTTGAAAGTTCTGGTACGAAGTTTTGTGAAGTTGAAAGTTCTGGGTTAGGCCAAATCT
TGCCCACTGCCGACGAATTTGCTGGGATTCAATCTGATGACGAAGATCTAAGTAATATGAAAATGGTACTGTATTCATCATCTTCCTTAGCTACCTTGATCCGAAACCCG
AATCGGAGTTTACTCTCCCGCCTATTCTCAACAGCCATCCCTGCACCTGAAGAATCTATCCGACTCAGCAACATGAATCTTGCAAGTCAGCAGCAGTCCAAGTGTGATGG
TGTGAGATCAATGGAAGAAGCACAAAGTTCAGAGAATTGGAACGTGTTCAAGAGATGGATGTCGAGGATGAAAAGGAAGTACCGGAGCGAGGAAGAGATGTTGGAGAGGT
TTGAGATATTCAATGATACAGTGAATACAATTAAGGAGTGGAAGAAGAAGAATCTTGGGTGTTCCTCCGCATTGAATTGCTTTGCAGACACGAAAGATGACGAGGTTCCC
AGGGGCCACGTCTCTTATCGTCGCCTTCGTTTTGGGAGAAGCCGAATCCTAAAGCGTTGGACTACGCCTAATAAGGACTCCATTGTTAATCCTTGATATGCAACCACTTT
CTTTTTTATCATTCTTTTTCTTTAATTTCTTTCCAGGAAGCTATTATCTTTCTTCAACTACTCATTTTATTAGTACCAATTTCTTTGTTTCCGTTTGATTATTAGTACCA
ATTTCATCCAAAAAAAATAATAATAAATAAAATAATAACTTTTCTCATTTTCAAAATTACCATTAAAAGAAAAAATATTAACTTCGCTTCGTTTCAG
Protein sequenceShow/hide protein sequence
MPYRIGSFSSQPIFVSGRKMYSFAGYRNLLGRAFSTVHGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHG
DEGGGYAKIVYKHAIRFGIQYKADRFRKRIKEEPKPYRGPFGEQMDHEMLAFCEVESSGTKFCEVESSGLGQILPTADEFAGIQSDDEDLSNMKMVLYSSSSLATLIRNP
NRSLLSRLFSTAIPAPEESIRLSNMNLASQQQSKCDGVRSMEEAQSSENWNVFKRWMSRMKRKYRSEEEMLERFEIFNDTVNTIKEWKKKNLGCSSALNCFADTKDDEVP
RGHVSYRRLRFGRSRILKRWTTPNKDSIVNP