; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012911 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012911
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCysteine protease
Genome locationChr01:25243632..25244205
RNA-Seq ExpressionHG10012911
SyntenyHG10012911
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADQ27799.1 mitogenic proteinase, partial [Vasconcellea cundinamarcensis]1.7e-1642.99Show/hide
Query:  GVCLRKERRTV-LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  K+++ + ++I GY  +  N+E  LIK ++ QP+ VLI      F  YRGGIY GP G  +DHA+ A+G+ ++YI++KNS    WGE GY+RI R 
Subjt:  GVCLRKERRTV-LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASGI
        + ++ GI
Subjt:  AAEASGI

AFJ15104.1 mexicain-like cystein protease, partial [Jacaratia mexicana]3.7e-1642.45Show/hide
Query:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  K+++ + +KI GY  +  NNE  LI+A++ QP+ V++      F  Y+GGI+ GP G  VDHA+ AVG+ + YI++KNS    WGE GY+RI R 
Subjt:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASG
        + ++ G
Subjt:  AAEASG

KAF3446927.1 hypothetical protein FNV43_RR12107 [Rhamnella rubrinervis]1.3e-1641.13Show/hide
Query:  MTIGVC-LRKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTR----EYIILKNS----WGENG
        M  G C ++KE + V+ I G+H + RN+E   +KAL+ QP+ V I  +   F  Y+GGIY G  G +VDH + AVG+      +YII+KNS    WGENG
Subjt:  MTIGVC-LRKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTR----EYIILKNS----WGENG

Query:  YMRICRRAAEASGI--YTDMMMYP
        Y+R+ R+  +  G+     M  YP
Subjt:  YMRICRRAAEASGI--YTDMMMYP

KAG6572278.1 Cysteine protease XCP2, partial [Cucurbita argyrosperma subsp. sororia]9.1e-2341.61Show/hide
Query:  GGCPDVVFRDAMTIGVCLRKE-------RRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYII
        GG  D+VF+ AM   +  + +       R   ++I  +  ++   E +LI AL   P+C+LIA  H  F  YRGGIY+GPFG  VDH++LAVG+T +YII
Subjt:  GGCPDVVFRDAMTIGVCLRKE-------RRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYII

Query:  LKN----SWGENGYMRICRRAAEASGIYTDMMMYPII
        +KN    SWG+ GYM + R+A + SGI+     YP++
Subjt:  LKN----SWGENGYMRICRRAAEASGIYTDMMMYPII

pir|S46476| cysteine proteinase (EC 3.4.22.-) III - mountain papaya [Vasconcellea cundinamarcensis]4.8e-1642.86Show/hide
Query:  CLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRRAA
        C  K+++  ++KI GY  +  N+E  LIKA++KQP+ VL+    + F  Y+ GI+ GP G  VDHA+ AVG+ ++YI++KNS    WGE GY++I R + 
Subjt:  CLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRRAA

Query:  EASGI
           GI
Subjt:  EASGI

TrEMBL top hitse value%identityAlignment
A0A4S4DRM0 Pept_C1 domain-containing protein8.9e-1639.16Show/hide
Query:  MHTIWLCGGCPD--VVFRDAMTIGVCLRKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTR--
        M +IW C G  D  V F D M         +  V+ I GYH +  NNE  L+KAL+ QP+ V I  +   F  Y GG++ G  G  +DH + AVG+    
Subjt:  MHTIWLCGGCPD--VVFRDAMTIGVCLRKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTR--

Query:  --EYIILKNS----WGENGYMRICRRAAEASGI--YTDMMMYP
          +YII+KNS    WGE GY+R+ R   +A GI     M  YP
Subjt:  --EYIILKNS----WGENGYMRICRRAAEASGI--YTDMMMYP

A0A6J1JFT7 cysteine protease XCP1-like1.2e-1540.62Show/hide
Query:  MTIGVCLR-KERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF----TREYIILKNS----WGENG
        M  G C+R KE+  V+ I GY  +  N+E  L+KALS QP+ V I  +   F  Y+GGI+ G  G  +DH + AVG+      +YII+KNS    WGENG
Subjt:  MTIGVCLR-KERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF----TREYIILKNS----WGENG

Query:  YMRICRRAAEASGI--YTDMMMYPIIDN
        Y+R+ R   +  G+     M  YP  +N
Subjt:  YMRICRRAAEASGI--YTDMMMYPIIDN

E5LBE8 Mitogenic proteinase (Fragment)8.1e-1742.99Show/hide
Query:  GVCLRKERRTV-LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  K+++ + ++I GY  +  N+E  LIK ++ QP+ VLI      F  YRGGIY GP G  +DHA+ A+G+ ++YI++KNS    WGE GY+RI R 
Subjt:  GVCLRKERRTV-LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASGI
        + ++ GI
Subjt:  AAEASGI

I1Z743 Mexicain-like cystein protease (Fragment)1.2e-1540.57Show/hide
Query:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  KE++ T ++I GY  +  N+E  LI+A++ QP+ VL+      F  Y+GGI+ GP G  +DHA+ A+G+ + YI++KNS    WGE GY++I R 
Subjt:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASG
        + ++ G
Subjt:  AAEASG

I1Z744 Mexicain-like cystein protease (Fragment)1.8e-1642.45Show/hide
Query:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  K+++ + +KI GY  +  NNE  LI+A++ QP+ V++      F  Y+GGI+ GP G  VDHA+ AVG+ + YI++KNS    WGE GY+RI R 
Subjt:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASG
        + ++ G
Subjt:  AAEASG

SwissProt top hitse value%identityAlignment
P00784 Papain5.9e-1742.45Show/hide
Query:  KIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRRAAEA---SGIYTD
        K DG   +Q  NE  L+ +++ QP+ V++    + F  YRGGI+ GP G  VDHA+ AVG+   YI++KNS    WGENGY+RI R    +    G+YT 
Subjt:  KIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRRAAEA---SGIYTD

Query:  MMMYPI
           YP+
Subjt:  MMMYPI

P05994 Papaya proteinase 42.5e-1540.4Show/hide
Query:  LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMRICRRAAEASGI
        +K +G   +Q NNE  L+ A++ QP+ V++      F +Y+GGI+ G  G  VDHA+ AVG+ +     YI++KNS    WGENGY+RI R +  + G+
Subjt:  LKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMRICRRAAEASGI

P10056 Caricain2.9e-1639.64Show/hide
Query:  GVCLRKE-RRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMR
        G C  K+    ++K  G   +Q NNE  L+ A++KQP+ V++      F  Y+GGI+ GP G  VDHA+ AVG+ +     YI++KNS    WGE GY+R
Subjt:  GVCLRKE-RRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMR

Query:  ICRRAAEASGI
        I R    + G+
Subjt:  ICRRAAEASGI

P84346 Mexicain5.9e-1740.34Show/hide
Query:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICR-
        G C  K+++   + I GY  +  N+E  LI+A++ QP+ V+       F  Y+GGIY GP G   DHA+ AVG+ + Y++LKNS    WGE GY+RI R 
Subjt:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICR-

Query:  --RAAEASGIYTDMMMYPI
          R+    G+YT    +PI
Subjt:  --RAAEASGIYTDMMMYPI

P84347 Chymomexicain1.6e-1437.74Show/hide
Query:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR
        G C  KE++ T ++I GY  +  N+E  LI+ +  QP+ VL       F  Y+GGI+ GP G   DHA+ A+G+ +  ++ KNS    WGE GY++I R 
Subjt:  GVCLRKERR-TVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNS----WGENGYMRICRR

Query:  AAEASG
        + ++ G
Subjt:  AAEASG

Arabidopsis top hitse value%identityAlignment
AT1G29090.1 Cysteine proteinases superfamily protein4.1e-1338.18Show/hide
Query:  IDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPF-GKYVDHAMLAVGFTR-----EYIILKNS----WGENGYMRICRRAAEASGI-
        I G+  +  NNE  L++A+SKQP+ V I  +   F+ Y GG+Y  P+ G  V+HA+  VG+       +Y + KNS    WGENGY+RI R  A   G+ 
Subjt:  IDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPF-GKYVDHAMLAVGFTR-----EYIILKNS----WGENGYMRICRRAAEASGI-

Query:  -YTDMMMYPI
               YP+
Subjt:  -YTDMMMYPI

AT2G34080.1 Cysteine proteinases superfamily protein2.8e-1434.75Show/hide
Query:  RKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF-----TREYIILKNS----WGENGYMRICRR
        R   R   +I G+  +  NNE  L++A+S+QP+ V +    + F+ Y GG+Y GP G   +HA+  VG+       +Y + KNS    WGE GY+RI R 
Subjt:  RKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF-----TREYIILKNS----WGENGYMRICRR

Query:  AAEASGI--YTDMMMYPI
         A   G+        YP+
Subjt:  AAEASGI--YTDMMMYPI

AT3G49340.1 Cysteine proteinases superfamily protein7.5e-1538.53Show/hide
Query:  IDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF-----TREYIILKN----SWGENGYMRICRRAAEASGI--
        I GY  + +N+E  L+KA+S+QP+ V I  +  +F+ Y GGI+ G  G  + HA+  VG+       +Y +LKN    SWGENGYMRI R      G+  
Subjt:  IDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGF-----TREYIILKN----SWGENGYMRICRRAAEASGI--

Query:  YTDMMMYPI
           +  YP+
Subjt:  YTDMMMYPI

AT4G23520.1 Cysteine proteinases superfamily protein3.3e-1536.8Show/hide
Query:  TIGVCLRKERRT--VLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENG
        T G C RK+  +  V+ ID Y  +  N+E  L KA++ QP+ V +    ++F+ YR  IY GP G  +DHA++ VG+  E    Y I++NS    WG+ G
Subjt:  TIGVCLRKERRT--VLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENG

Query:  YMRICRRAAEASGIYTDMMM--YPI
        Y++I R   +  G+    M+  YPI
Subjt:  YMRICRRAAEASGIYTDMMM--YPI

AT4G36880.1 cysteine proteinase14.1e-1342Show/hide
Query:  VLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMRICRR-AAEASG
        V+ IDGY  +   +E  L KA+S QP+ V I      F  Y+ GI+ G  G  +DHA++AVG+  E    Y I++NS    WGE GY+R+ R  AA  SG
Subjt:  VLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTRE----YIILKNS----WGENGYMRICRR-AAEASG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATACGATCTGGCTATGCGGCGGTTGTCCGGACGTGGTTTTCAGAGACGCCATGACAATAGGCGTATGTCTCAGAAAGGAACGACGCACGGTATTAAAAATAGACGG
GTATCATATAATTCAACGCAACAATGAATGGGAGCTCATAAAAGCTTTATCTAAACAACCTATTTGCGTATTAATCGCGTGTAATCATGAGCAATTTCTATCATACCGAG
GGGGAATCTATTATGGACCTTTCGGAAAATATGTTGATCATGCTATGCTGGCGGTTGGTTTCACTCGAGAGTATATAATTTTGAAAAATTCGTGGGGAGAGAATGGATAT
ATGAGAATTTGCAGACGCGCCGCAGAAGCCTCAGGGATTTATACTGACATGATGATGTATCCAATTATTGATAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATACGATCTGGCTATGCGGCGGTTGTCCGGACGTGGTTTTCAGAGACGCCATGACAATAGGCGTATGTCTCAGAAAGGAACGACGCACGGTATTAAAAATAGACGG
GTATCATATAATTCAACGCAACAATGAATGGGAGCTCATAAAAGCTTTATCTAAACAACCTATTTGCGTATTAATCGCGTGTAATCATGAGCAATTTCTATCATACCGAG
GGGGAATCTATTATGGACCTTTCGGAAAATATGTTGATCATGCTATGCTGGCGGTTGGTTTCACTCGAGAGTATATAATTTTGAAAAATTCGTGGGGAGAGAATGGATAT
ATGAGAATTTGCAGACGCGCCGCAGAAGCCTCAGGGATTTATACTGACATGATGATGTATCCAATTATTGATAATTAA
Protein sequenceShow/hide protein sequence
MHTIWLCGGCPDVVFRDAMTIGVCLRKERRTVLKIDGYHIIQRNNEWELIKALSKQPICVLIACNHEQFLSYRGGIYYGPFGKYVDHAMLAVGFTREYIILKNSWGENGY
MRICRRAAEASGIYTDMMMYPIIDN