; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030942 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030942
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionaspartic proteinase PCS1-like
Genome locationchr11:3036475..3036879
RNA-Seq ExpressionLag0030942
SyntenyLag0030942
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581233.1 Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. sororia]1.2e-5986.57Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EE+V+LVGP+MKKGYEYAAVADMCF+ G  AEVGRRI DMSFEFENGVEI VGKGEGVLTEVEKGVKCVG GRSGRLGI 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQ+N WVEYDLANRRIGFGGADCS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

XP_011651212.2 aspartic proteinase PCS1 [Cucumis sativus]6.4e-6187.31Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EEVVRLVG MMKKGY YAAVADMCFD G   EVGRRIGDMSFEF+NGVEI VG+GEGVLTEVEKGVKCVGIGRSGRLGIG
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQQNMWVEYDLAN+R+GFGGA+CS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

XP_022934878.1 aspartic proteinase PCS1-like [Cucurbita moschata]1.6e-5985.82Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EE+V+LVGP+MKKGYEYAAVADMCF+ G  AEVGRRI DMSFEFENGVEI VGKGEGVLTEVEKGVKCVG GRSGRLGI 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQ+N W+EYDLANRRIGFGGADCS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

XP_022983616.1 aspartic proteinase PCS1-like [Cucurbita maxima]1.2e-5986.57Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EE+V+LVGP+MKKGYEYAAVADMCF+ G  AEVGRRI DMSFEFENGVEI VGKGEGVLTEVEKGVKCVG GRSGRLGI 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQ+N WVEYDLANRRIGFGGADCS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

XP_038878004.1 aspartic proteinase PCS1 [Benincasa hispida]1.9e-6087.31Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YL DEAY+KV+EE+VRLVGPMMKKGYE+AAVADMCF+ G AA VGRRIG+MSFEFENGVEILVGKGEGVL EVEKGVKCVGIGRSGRLGIG
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQQNMWVEYDLA +RIGFGGA+CS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

TrEMBL top hitse value%identityAlignment
A0A0A0LBQ0 Peptidase A1 domain-containing protein3.1e-6187.31Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EEVVRLVG MMKKGY YAAVADMCFD G   EVGRRIGDMSFEF+NGVEI VG+GEGVLTEVEKGVKCVGIGRSGRLGIG
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQQNMWVEYDLAN+R+GFGGA+CS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

A0A6J1EWZ6 aspartic proteinase PCS1-like8.4e-5986.57Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY KVREE+VRLVGPMMKKGYEYAAVADMCFDG  AA VGRRIGDM F+FENGVEILVGKGEG+LTEVE+GVKCVGIGRS RL   
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIGNVHQQNMWVEYDL+N+RIGFG A CSGLK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

A0A6J1F8Z2 aspartic proteinase PCS1-like7.6e-6085.82Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EE+V+LVGP+MKKGYEYAAVADMCF+ G  AEVGRRI DMSFEFENGVEI VGKGEGVLTEVEKGVKCVG GRSGRLGI 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQ+N W+EYDLANRRIGFGGADCS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

A0A6J1J2U3 aspartic proteinase PCS1-like5.8e-6086.57Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGSDL+YLVDEAY+KV+EE+V+LVGP+MKKGYEYAAVADMCF+ G  AEVGRRI DMSFEFENGVEI VGKGEGVLTEVEKGVKCVG GRSGRLGI 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SNIIG VHQ+N WVEYDLANRRIGFGGADCS LK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

A0A6J1K7E8 aspartic proteinase PCS1-like5.0e-5984.33Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        MIDSGS+L+YLVDEAY+KVR E+VRLVGPMMKKGYEYA+V+DMCFD   AA  GRRIGDM F+FENGVEILVGKGEG+LTEVEKGVKCVGIGRSGRLG  
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK
        SN+IGNVHQQNMWVEYDLAN+R+GFGGA CSGLK
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCSGLK

SwissProt top hitse value%identityAlignment
Q6F4N5 Aspartyl protease 255.2e-0525Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGR-LGI
        ++DSG+ ++      Y  +REE  R V      GY      D CF+    A  G     ++   + GV++ +     ++      + C+ +  + + +  
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGR-LGI

Query:  GSNIIGNVHQQNMWVEYDLANRRIGFGGADCS
          N+I N+ QQN+ V +D+AN R+GF    C+
Subjt:  GSNIIGNVHQQNMWVEYDLANRRIGFGGADCS

Q766C3 Aspartic proteinase nepenthesin-15.0e-0830.77Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        +IDSG+ L+Y V+ AY  VR+E +  +   +  G   ++  D+CF   +      +I      F+ G   L    E        G+ C+ +G S +   G
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC
         +I GN+ QQNM V YD  N  + F  A C
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.0e-0525.38Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        ++D G+ ++ L  +AY+ +R+  ++L    +KKG    ++ D C+D  + + V  ++  ++F F  G  + +     ++   + G  C     +      
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC
         +IIGNV QQ   + YDL+   IG  G  C
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC

Q9LTW4 Aspartic proteinase NANA, chloroplast1.0e-0526.72Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        ++DSG+ L+ L D AY +V   + R +  + +   E   + + CF   +   V  ++  ++F  + G      + +  L +   GVKC+G   +G     
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCS
        +N+IGN+ QQN   E+DL    + F  + C+
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCS

Q9LZL3 Aspartic proteinase PCS11.7e-0827.97Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMM----KKGYEYAAVADMCFDGGAA---AEVGRRIGDMSFEFENGVEILVGKGEGVLTEV------EKGVK
        M+DSG+  ++L+   Y  +R   +     ++       + +    D+C+        + +  R+  +S  FE G EI V  G+ +L  V         V 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMM----KKGYEYAAVADMCFDGGAA---AEVGRRIGDMSFEFENGVEILVGKGEGVLTEV------EKGVK

Query:  CVGIGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGGADC
        C   G S  +G+ + +IG+ HQQNMW+E+DL   RIG    +C
Subjt:  CVGIGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGGADC

Arabidopsis top hitse value%identityAlignment
AT1G66180.1 Eukaryotic aspartyl protease family protein5.1e-4061.83Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        M+DSGS+ ++LVD AYDKVR E++  VG  +KKGY Y   ADMCFDG  A  + R IGD+ F F  GVEILV K E VL  V  G+ CVGIGRS  LG  
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCS
        SNIIGNVHQQN+WVE+D+ NRR+GF  ADCS
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADCS

AT2G39710.1 Eukaryotic aspartyl protease family protein8.5e-1127.74Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMK----KGYEYAAVADMCFDGGAAAE---VGRRIGDMSF---EFENGVEILVGKGEGVLTEVEKGVKCVG
        M+DSG+  ++L+   Y  ++ E +     +++      + +    D+C+  G+       G  +  + F   E     + L+ +  G  +E ++ V C  
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMK----KGYEYAAVADMCFDGGAAAE---VGRRIGDMSF---EFENGVEILVGKGEGVLTEVEKGVKCVG

Query:  IGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGG
         G S  LGI + +IG+ HQQN+W+E+DLA  R+GF G
Subjt:  IGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGG

AT3G18490.1 Eukaryotic aspartyl protease family protein7.4e-0725.38Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        ++D G+ ++ L  +AY+ +R+  ++L    +KKG    ++ D C+D  + + V  ++  ++F F  G  + +     ++   + G  C     +      
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC
         +IIGNV QQ   + YDL+   IG  G  C
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC

AT5G02190.1 Eukaryotic aspartyl protease family protein1.2e-0927.97Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMM----KKGYEYAAVADMCFDGGAA---AEVGRRIGDMSFEFENGVEILVGKGEGVLTEV------EKGVK
        M+DSG+  ++L+   Y  +R   +     ++       + +    D+C+        + +  R+  +S  FE G EI V  G+ +L  V         V 
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMM----KKGYEYAAVADMCFDGGAA---AEVGRRIGDMSFEFENGVEILVGKGEGVLTEV------EKGVK

Query:  CVGIGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGGADC
        C   G S  +G+ + +IG+ HQQNMW+E+DL   RIG    +C
Subjt:  CVGIGRSGRLGIGSNIIGNVHQQNMWVEYDLANRRIGFGGADC

AT5G37540.1 Eukaryotic aspartyl protease family protein9.9e-4463.08Show/hide
Query:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG
        M+DSGS+ ++LVD AYDKV+EE+VRLVG  +KKGY Y + ADMCFDG  + E+GR IGD+ FEF  GVEILV K + +L  V  G+ CVGIGRS  LG  
Subjt:  MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIG

Query:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC
        SNIIGNVHQQN+WVE+D+ NRR+GF  A+C
Subjt:  SNIIGNVHQQNMWVEYDLANRRIGFGGADC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGACTCCGGTTCGGACCTGAGCTATCTAGTGGACGAAGCGTACGACAAGGTGAGAGAAGAGGTGGTGAGACTGGTGGGGCCGATGATGAAGAAGGGGTACGAATA
CGCCGCCGTGGCCGACATGTGTTTCGACGGCGGAGCGGCGGCGGAGGTGGGGCGGAGGATCGGCGACATGTCGTTCGAGTTCGAGAACGGGGTGGAGATTTTGGTGGGGA
AAGGGGAAGGGGTATTGACGGAAGTGGAAAAAGGAGTGAAGTGCGTGGGGATCGGACGGTCAGGAAGGCTCGGGATTGGGAGTAATATAATCGGGAACGTTCATCAGCAG
AATATGTGGGTCGAGTACGATTTGGCCAATAGGAGAATCGGGTTTGGTGGAGCTGACTGTAGTGGATTGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGACTCCGGTTCGGACCTGAGCTATCTAGTGGACGAAGCGTACGACAAGGTGAGAGAAGAGGTGGTGAGACTGGTGGGGCCGATGATGAAGAAGGGGTACGAATA
CGCCGCCGTGGCCGACATGTGTTTCGACGGCGGAGCGGCGGCGGAGGTGGGGCGGAGGATCGGCGACATGTCGTTCGAGTTCGAGAACGGGGTGGAGATTTTGGTGGGGA
AAGGGGAAGGGGTATTGACGGAAGTGGAAAAAGGAGTGAAGTGCGTGGGGATCGGACGGTCAGGAAGGCTCGGGATTGGGAGTAATATAATCGGGAACGTTCATCAGCAG
AATATGTGGGTCGAGTACGATTTGGCCAATAGGAGAATCGGGTTTGGTGGAGCTGACTGTAGTGGATTGAAATGA
Protein sequenceShow/hide protein sequence
MIDSGSDLSYLVDEAYDKVREEVVRLVGPMMKKGYEYAAVADMCFDGGAAAEVGRRIGDMSFEFENGVEILVGKGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGNVHQQ
NMWVEYDLANRRIGFGGADCSGLK