CuGenDBv2

HG10005095 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10005095
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Ervatamin-B-like
Genome location: Chr08:22844946..22845768
RNA-Seq Expression: HG10005095
Synteny: HG10005095
Gene Ontology terms:
    GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
    GO:0005615 - extracellular space (cellular component)
    GO:0005764 - lysosome (cellular component)
    GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domains:
    IPR000668 - Peptidase C1A, papain C-terminal
    IPR013201 - Cathepsin propeptide inhibitor domain (I29)
    IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0055927.1 ervatamin-B-like [Cucumis melo var. makuwa]; e value: 6.7e-15; % identity: 30.77
Query:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------
        M+ RFKVFKDNAK+V + NQM ++ K  LNQFADM +DEF ++++ SNITYYKNLHAK                                          
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------

Query:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW
                 T+F                                                                      GYGI+EDG DYWII N +
Subjt:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW

Query:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
           WE++GYMKMQRG  NP G+ G+A   +YPVK
Subjt:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

KAG6598248.1 hypothetical protein SDJN03_08026, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.1e-14; % identity: 29.6
Query:  YNRFKVFKDNAKYVLEVNQMNKTY--KLNQFADMLNDEFVNLYARSNITYYKNLHAKKT-SFGY------------------------------------
        + RF VFK+N  +V  VNQMNK Y  KLN+FADM N EFV+ YARSNI++Y+ LH K++  F Y                                    
Subjt:  YNRFKVFKDNAKYVLEVNQMNKTY--KLNQFADMLNDEFVNLYARSNITYYKNLHAKKT-SFGY------------------------------------

Query:  ------------------------------------------------GIEED------------------------GTDYWIISNSWRVGWELEGYMKM
                                                        GI  +                        GTDYWI+ NSW VGW  EGY++M
Subjt:  ------------------------------------------------GIEED------------------------GTDYWIISNSWRVGWELEGYMKM

Query:  QRGVENPGGVYGLAMNSSYPVKH
        +RGVE   G+ G+ M +SYP+K+
Subjt:  QRGVENPGGVYGLAMNSSYPVKH

TYK28132.1 ervatamin-B-like [Cucumis melo var. makuwa]; e value: 6.7e-15; % identity: 30.77
Query:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------
        M+ RFKVFKDNAK+V + NQM ++ K  LNQFADM +DEF ++++ SNITYYKNLHAK                                          
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------

Query:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW
                 T+F                                                                      GYGI+EDG DYWII N +
Subjt:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW

Query:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
           WE++GYMKMQRG  NP G+ G+A   +YPVK
Subjt:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

XP_038885798.1 vignain-like [Benincasa hispida]; e value: 6.0e-16; % identity: 27.27
Query:  MYNRFKVFKDNAKYVLEVNQMNKT--YKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTS---------------------------------------
        M+ RFKVFKDNAKYV +VNQMNK+   KLNQFADM +DEF+N +  SNITYYKNLHAKK                                         
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKT--YKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTS---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------FGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKH
                                                    GYG EEDG DYWII NSW   W LEGYMKMQRG   P  V GLAMN SYP+K+
Subjt:  -------------------------------------------FGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKH

XP_038896226.1 vignain-like [Benincasa hispida]; e value: 4.9e-18; % identity: 28.04
Query:  MYNRFKVFKDNAKYVLEVNQMNKT--YKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTS---------------------------------------
        M+NRFKVFK+NAKYV +VNQMNK+   KLNQFADM +DEF+N +  SNITYYKNLHAKK                                         
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKT--YKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTS---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------FGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKH
                                                   GYG EEDGTDYWII NSW   W LEGYMKMQRG + P GV GLAMN SYP+K+
Subjt:  ------------------------------------------FGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKH

TrEMBL top hits (e value, % identity, alignment)
A0A0B2PNS5 KDEL-tailed cysteine endopeptidase CEP1; e value: 3.6e-14; % identity: 35.82
Query:  RFKVFKDNAKYVLEVNQ-MNKTYKL--NQFADMLNDEFV-----------------------NLYARSNITYYKNLHAKKTSFGYGIEEDGTDYWIISNS
        RF++FK+N  Y+   N   NK YKL  NQFAD+  +EF+                       N+ A  +I  ++      T+ GYG+ +DGT YW++ NS
Subjt:  RFKVFKDNAKYVLEVNQ-MNKTYKL--NQFADMLNDEFV-----------------------NLYARSNITYYKNLHAKKTSFGYGIEEDGTDYWIISNS

Query:  WRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV
        W   W  EGY++MQRGV+   G+  +AM +SY +
Subjt:  WRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV

A0A0R0HF84 Uncharacterized protein; e value: 1.0e-13; % identity: 36.84
Query:  RFKVFKDNAKYVLEVNQ-MNKTYKL--NQFADMLNDEFV-----------NLYARSNITYYKN-----------LHAKKTSFGYGIEEDGTDYWIISNSW
        RF++FK+N  Y+   N   NK YKL  NQFAD+  +EF+           +   R+    Y+N           L    T+ GYG+ +DGT YW++ NSW
Subjt:  RFKVFKDNAKYVLEVNQ-MNKTYKL--NQFADMLNDEFV-----------NLYARSNITYYKN-----------LHAKKTSFGYGIEEDGTDYWIISNSW

Query:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV
           W  EGY++MQRGV+   G+  +AM +SY +
Subjt:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV

A0A444DJC4 Uncharacterized protein; e value: 1.8e-13; % identity: 33.53
Query:  RFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNIT---------------YYKN-------------------------------LHA
        RF +FK+NAKYV   N+  K YK  LN+F DM  +EF   YA + I                 YKN                               L  
Subjt:  RFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNIT---------------YYKN-------------------------------LHA

Query:  KKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSISIL
             GYG  EDG  YWI+ NSW   W  EGY++MQRG+    G+ G+AM +SYP+K S +     + +L
Subjt:  KKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSISIL

A0A5A7UMV5 Ervatamin-B-like; e value: 3.2e-15; % identity: 30.77
Query:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------
        M+ RFKVFKDNAK+V + NQM ++ K  LNQFADM +DEF ++++ SNITYYKNLHAK                                          
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------

Query:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW
                 T+F                                                                      GYGI+EDG DYWII N +
Subjt:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW

Query:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
           WE++GYMKMQRG  NP G+ G+A   +YPVK
Subjt:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

A0A5D3DX84 Ervatamin-B-like; e value: 3.2e-15; % identity: 30.77
Query:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------
        M+ RFKVFKDNAK+V + NQM ++ K  LNQFADM +DEF ++++ SNITYYKNLHAK                                          
Subjt:  MYNRFKVFKDNAKYVLEVNQMNKTYK--LNQFADMLNDEFVNLYARSNITYYKNLHAK------------------------------------------

Query:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW
                 T+F                                                                      GYGI+EDG DYWII N +
Subjt:  --------KTSF----------------------------------------------------------------------GYGIEEDGTDYWIISNSW

Query:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
           WE++GYMKMQRG  NP G+ G+A   +YPVK
Subjt:  RVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

SwissProt top hits (e value, % identity, alignment)
O65039 Vignain; e value: 3.9e-10; % identity: 48.15
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS
        GYG   DGT YW + NSW   W  +GY++M+RG+ +  G+ G+AM +SYP+K S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS

P12412 Vignain; e value: 1.4e-10; % identity: 46.03
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSIS
        GYG   DGT+YWI+ NSW   W  +GY++MQR +    G+ G+AM +SYP+K+S    + S+S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSIS

P25803 Vignain; e value: 5.1e-10; % identity: 50
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS
        GYG   DGT+YWI+ NSW   W   GY++MQR +    G+ G+AM  SYP+K+S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS

P43156 Thiol protease SEN102; e value: 7.9e-11; % identity: 47.54
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYS
        GYG   DGT YWI+ NSW   W   GY++MQRG+ +  G  G+AM +SYP+K S +  + S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYS

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP1; e value: 4.7e-11; % identity: 51.85
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS
        GYG   DGT YWI+ NSW   W  +GY++MQRG+ +  G+ G+AM +SYP+K+S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS

Arabidopsis top hits (e value, % identity, alignment)
AT1G06260.1 Cysteine proteinases superfamily protein; e value: 1.3e-11; % identity: 45.95
Query:  LYARSNITYY--KNLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
        LY+    T Y   NL+   T  GYG+E D   YWI+ NSW  GW  EGY++M+RGV    G  G+AM +SYP++
Subjt:  LYARSNITYY--KNLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

AT1G09850.1 xylem bark cysteine peptidase 3; e value: 4.8e-11; % identity: 51.92
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK
        GYG  ++G DYWI+ NSW   W ++G+M MQR  EN  GV G+ M +SYP+K
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVK

AT2G27420.1 Cysteine proteinases superfamily protein; e value: 3.7e-11; % identity: 45
Query:  NLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV
        +LH   T  GYG+ E+GT YW++ NSW   W   GYM+++R V+ P G+ GLA+ + YP+
Subjt:  NLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPV

AT4G11310.1 Papain family cysteine protease; e value: 3.7e-11; % identity: 44.44
Query:  NLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSIS
        NL+      GYG  E+G DYW++ NS  + W   GYMKM R + NP G+ G+AM +SYP+K+S      SI+
Subjt:  NLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHSIHAASYSIS

AT5G50260.1 Cysteine proteinases superfamily protein; e value: 3.3e-12; % identity: 51.85
Query:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS
        GYG   DGT YWI+ NSW   W  +GY++MQRG+ +  G+ G+AM +SYP+K+S
Subjt:  GYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAMNSSYPVKHS


Sequences
CDS sequence
ATGTACAATCGTTTCAAAGTTTTTAAAGATAATGCTAAATATGTGTTGGAAGTGAACCAAATGAACAAAACTTACAAGCTGAACCAGTTCGCTGATATGTTGAAT
GATGAGTTTGTTAACTTATATGCTAGATCCAATATTACCTACTACAAAAACCTACATGCTAAGAAAACGTCGTTTGGATACGGAATCGAGGAAGACGGAACAGAT
TATTGGATCATAAGTAACTCATGGAGAGTTGGATGGGAATTGGAAGGTTATATGAAGATGCAACGAGGAGTGGAGAATCCAGGAGGTGTATATGGATTGGCAATG
AATTCTTCATATCCCGTAAAGCATAGCATCCACGCAGCTTCCTATTCTATTTCCATTCTTTTTAGCATTCTTTAG
mRNA sequence
ATGTACAATCGTTTCAAAGTTTTTAAAGATAATGCTAAATATGTGTTGGAAGTGAACCAAATGAACAAAACTTACAAGCTGAACCAGTTCGCTGATATGTTGAAT
GATGAGTTTGTTAACTTATATGCTAGATCCAATATTACCTACTACAAAAACCTACATGCTAAGAAAACGTCGTTTGGATACGGAATCGAGGAAGACGGAACAGAT
TATTGGATCATAAGTAACTCATGGAGAGTTGGATGGGAATTGGAAGGTTATATGAAGATGCAACGAGGAGTGGAGAATCCAGGAGGTGTATATGGATTGGCAATG
AATTCTTCATATCCCGTAAAGCATAGCATCCACGCAGCTTCCTATTCTATTTCCATTCTTTTTAGCATTCTTTAG
Protein sequence
MYNRFKVFKDNAKYVLEVNQMNKTYKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAM
NSSYPVKHSIHAASYSISILFSIL
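
The CDS and protein sequences listed for HG10005095 can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE`, `translate`, `CDS` and `PROTEIN` are illustrative and are not part of CuGenDB.

```python
# Sketch: translate the HG10005095 CDS and compare it with the listed protein.
# Assumption: standard genetic code (NCBI translation table 1).

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
# Build the codon -> amino acid map in TCAG order (64 entries).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# CDS sequence as listed in the record above.
CDS = (
    "ATGTACAATCGTTTCAAAGTTTTTAAAGATAATGCTAAATATGTGTTGGAAGTGAACCAAATGAACAAAACTTACAAGCTGAACCAGTTCGCTGATATGTTGAAT"
    "GATGAGTTTGTTAACTTATATGCTAGATCCAATATTACCTACTACAAAAACCTACATGCTAAGAAAACGTCGTTTGGATACGGAATCGAGGAAGACGGAACAGAT"
    "TATTGGATCATAAGTAACTCATGGAGAGTTGGATGGGAATTGGAAGGTTATATGAAGATGCAACGAGGAGTGGAGAATCCAGGAGGTGTATATGGATTGGCAATG"
    "AATTCTTCATATCCCGTAAAGCATAGCATCCACGCAGCTTCCTATTCTATTTCCATTCTTTTTAGCATTCTTTAG"
)

# Protein sequence as listed in the record above.
PROTEIN = (
    "MYNRFKVFKDNAKYVLEVNQMNKTYKLNQFADMLNDEFVNLYARSNITYYKNLHAKKTSFGYGIEEDGTDYWIISNSWRVGWELEGYMKMQRGVENPGGVYGLAM"
    "NSSYPVKHSIHAASYSISILFSIL"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    print(translate(CDS) == PROTEIN)
```

Under that assumption, the 390-nucleotide CDS yields 129 residues followed by a TAG stop codon, consistent with the protein sequence listed in this record.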