CuGenDBv2

Lag0029240 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029240
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Aspartic proteinase nepenthesin-1
Genome location: chr8:36825142..36825750
RNA-Seq Expression: Lag0029240
Synteny: Lag0029240
Gene Ontology terms:
    GO:0005576 - extracellular region (cellular component)
    GO:0008233 - peptidase activity (molecular function)
InterPro domains:
    IPR021109 - Aspartic peptidase domain superfamily
    IPR032799 - Xylanase inhibitor, C-terminal
    IPR033121 - Peptidase family A1 domain
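
The genome location field can be unpacked into a chromosome name and a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


chrom, start, end = parse_location("chr8:36825142..36825750")
print(chrom, span_length(start, end))
```

Under that assumption the gene spans 609 bp on chr8.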


Homology
GenBank top hits (e value, % identity, alignment)
CBI19032.3 unnamed protein product, partial [Vitis vinifera] (e value 3.4e-31, identity 61.61%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D E GL+CL + A G+LSIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

RVX22508.1 Aspartic proteinase nepenthesin-1 [Vitis vinifera] (e value 3.4e-31, identity 61.61%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D E GL+CL + A G+LSIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

XP_010664223.1 PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera] (e value 3.4e-31, identity 61.61%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D E GL+CL + A G+LSIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

XP_034676400.1 aspartic proteinase nepenthesin-1-like [Vitis riparia] (e value 4.8e-30, identity 59.82%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D + GL+CL + A G+ SIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

XP_038886455.1 aspartic proteinase nepenthesin-2-like [Benincasa hispida] (e value 2.7e-41, identity 78.57%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK
        ED FDVLKP F +QTNL V  S SIGL LCF+LPSHN SKLDVPDL+FHFE LDLKLPVENYMV +EE G+VCL MGAAG LSIFGSMQHQNMLVLHDL+
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVCS
        K+VVSF+ T CS
Subjt:  KEVVSFVLTVCS

TrEMBL top hits (e value, % identity, alignment)
A0A1U7YIZ8 aspartic proteinase nepenthesin-1 (e value 7.5e-29, identity 57.66%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK
        E AF +LK  F++Q NLPV  S+S GL LCF LPS N + ++VP LVFHFEG DL LP +NYM+ D   G+ CL MG +  +SIFG++Q QNMLV+HDL 
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVC
        KE +SFV T C
Subjt:  KEVVSFVLTVC

A0A2N9HF56 Peptidase A1 domain-containing protein (e value 1.5e-32, identity 62.61%)
Query:  SSHREDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVL
        +S  EDAFDVLK  F AQT L V +  + GL +CF+LPS +   + VP LVFHFEGLDL+LPVENYM+ D + G+VCL MGA G LSIFG++Q QNMLVL
Subjt:  SSHREDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVL

Query:  HDLKKEVVSFVLTVC
        HDL+KE VSF+ T C
Subjt:  HDLKKEVVSFVLTVC

A0A438KMS9 Aspartic proteinase nepenthesin-1 (e value 1.6e-31, identity 61.61%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D E GL+CL + A G+LSIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

A0A6N2L8M9 Peptidase A1 domain-containing protein (e value 7.5e-29, identity 57.66%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK
        + AFD++   FT+Q NLPV  S S GL +CF LPS   S ++VP LVFHF G DL+LP ENYM+ DE  GL CL MG++  +SIFG++Q QNMLVLHDL+
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVC
        KE +SF+ T C
Subjt:  KEVVSFVLTVC

F6H0S5 Peptidase A1 domain-containing protein (e value 1.6e-31, identity 61.61%)
Query:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL
        +EDAFDVLK  F +QT L V +S++ GL LCF LP  N +++ VP L+FHF+GLDL LPVENYMV+D E GL+CL + A G+LSIFG++Q QNMLVLHDL
Subjt:  REDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDL

Query:  KKEVVSFVLTVC
        KK  +S V T C
Subjt:  KKEVVSFVLTVC

SwissProt top hits (e value, % identity, alignment)
O04496 Aspartyl protease AED3 (e value 1.5e-05, identity 25.42%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIG-LHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-----GAAGALSIFGSMQHQNML
        +  ++ ++  F  Q N  V   +++G    CF   + N +    P +  H   LDLKLP+EN ++      L CL M      A   L++  ++Q QN+ 
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIG-LHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-----GAAGALSIFGSMQHQNML

Query:  VLHDLKKEVVSFVLTVCS
        +L D+    +      C+
Subjt:  VLHDLKKEVVSFVLTVCS

Q766C2 Aspartic proteinase nepenthesin-2 (e value 1.2e-15, identity 40.87%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGAL--SIFGSMQHQNMLVLHD
        +DA++ +   FT Q NLP    +S GL  CF+ PS + S + VP++   F+G  L L  +N +++  E G++CL MG++  L  SIFG++Q Q   VL+D
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGAL--SIFGSMQHQNMLVLHD

Query:  LKKEVVSFVLTVCSA
        L+   VSFV T C A
Subjt:  LKKEVVSFVLTVCSA

Q766C3 Aspartic proteinase nepenthesin-1 (e value 9.5e-21, identity 45.13%)
Query:  DAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAG-ALSIFGSMQHQNMLVLHDLK
        +A+  ++  F +Q NLPV + +S G  LCF+ PS + S L +P  V HF+G DL+LP ENY ++    GL+CL MG++   +SIFG++Q QNMLV++D  
Subjt:  DAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAG-ALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVCSA
          VVSF    C A
Subjt:  KEVVSFVLTVCSA

Q7XV21 Aspartyl protease 37 (e value 2.3e-11, identity 36.28%)
Query:  FDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNK-SKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMG--AAGALSIFGSMQHQNMLVLHDLK
        +D L      +  LP    +S+GL LCF LP      ++ VP +   F+G  L+L        D E G++CL +G   AG++SI G+ Q QNM VL++L+
Subjt:  FDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNK-SKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMG--AAGALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVCSA
        +  V+FV + C A
Subjt:  KEVVSFVLTVCSA

Q9LNJ3 Aspartyl protease family protein 2 (e value 3.7e-09, identity 38.1%)
Query:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS
        CF+L + N+ K  VP +V HF G D+ LP  NY++  +  G  C    G  G LSI G++Q Q   V++DL    V F    C+
Subjt:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS

Arabidopsis top hits (e value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein (e value 2.7e-10, identity 38.1%)
Query:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS
        CF+L + N+ K  VP +V HF G D+ LP  NY++  +  G  C    G  G LSI G++Q Q   V++DL    V F    C+
Subjt:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS

AT1G09750.1 Eukaryotic aspartyl protease family protein (e value 1.0e-06, identity 25.42%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIG-LHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-----GAAGALSIFGSMQHQNML
        +  ++ ++  F  Q N  V   +++G    CF   + N +    P +  H   LDLKLP+EN ++      L CL M      A   L++  ++Q QN+ 
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIG-LHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-----GAAGALSIFGSMQHQNML

Query:  VLHDLKKEVVSFVLTVCS
        +L D+    +      C+
Subjt:  VLHDLKKEVVSFVLTVCS

AT2G03200.1 Eukaryotic aspartyl protease family protein (e value 8.8e-30, identity 55.86%)
Query:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK
        E AF VLK  FT++ +LPV  S S GL LCF+LP   K+ + VP ++FHF+G DL+LP ENYMV D   G++CL MG++  +SIFG++Q QN  VLHDL+
Subjt:  EDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAAGALSIFGSMQHQNMLVLHDLK

Query:  KEVVSFVLTVC
        KE VSFV T C
Subjt:  KEVVSFVLTVC

AT3G54400.1 Eukaryotic aspartyl protease family protein (e value 2.2e-04, identity 31.08%)
Query:  PDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAA-----GALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS
        P + F F G+++ LP +N ++      L CL M AA       L++  SMQ QN  VL D+    +      C+
Subjt:  PDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAA-----GALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCS

AT3G61820.1 Eukaryotic aspartyl protease family protein (e value 8.6e-09, identity 37.35%)
Query:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVC
        CF+L      K  VP +VFHF G ++ LP  NY++     G  C    G  G+LSI G++Q Q   V +DL    V F+   C
Subjt:  CFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPM-GAAGALSIFGSMQHQNMLVLHDLKKEVVSFVLTVC
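
The % identity values reported for the hits follow the usual BLAST convention: identical positions divided by alignment length. As a rough sketch of that arithmetic, the hypothetical helper below counts letters in the middle line of an alignment block, where a letter marks an identity, `+` a conservative substitution, and a space a mismatch or gap. Whitespace in the alignments on this page may have been altered during extraction, so the function accepts an explicit alignment length rather than trusting the string length.

```python
def percent_identity(match_line, aligned_length=None):
    """Percent identity from a BLAST-style middle (match) line.

    Letters mark identical residues; '+' and spaces do not count.
    aligned_length defaults to the length of the match line.
    """
    if aligned_length is None:
        aligned_length = len(match_line)
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in match_line if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Toy middle line (not taken from this page): 3 identities over 5 columns.
print(percent_identity("AC+ E"))
```

For an alignment printed in several blocks, the middle lines would be concatenated (or their letter counts summed) before dividing by the total alignment length.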


Sequences
CDS sequence
ATGACATTCACAAATTTTCTTACTGTATGGAACTGGGAGTCTGAATTTAGAATCTTCAACAAGTATCGACAACACTACGAATCGAGCCATCGAGAGGATGCTTTTGATGT
GCTGAAACCATTTTTCACTGCCCAGACAAACCTCCCAGTGAAACATTCAGCAAGCATTGGTCTTCATCTCTGCTTTGAGCTGCCTTCCCACAACAAAAGCAAACTTGATG
TGCCTGATTTGGTATTTCATTTCGAGGGGCTCGACTTGAAGCTTCCAGTCGAGAACTATATGGTCACCGATGAGGAGCGGGGGTTGGTGTGCTTGCCAATGGGAGCTGCA
GGAGCTTTGTCAATTTTCGGCAGCATGCAACACCAGAATATGTTGGTTCTTCATGATCTCAAGAAGGAAGTTGTATCATTTGTTCTCACAGTGTGCTCGGCTTGGTAA
mRNA sequence
ATGACATTCACAAATTTTCTTACTGTATGGAACTGGGAGTCTGAATTTAGAATCTTCAACAAGTATCGACAACACTACGAATCGAGCCATCGAGAGGATGCTTTTGATGT
GCTGAAACCATTTTTCACTGCCCAGACAAACCTCCCAGTGAAACATTCAGCAAGCATTGGTCTTCATCTCTGCTTTGAGCTGCCTTCCCACAACAAAAGCAAACTTGATG
TGCCTGATTTGGTATTTCATTTCGAGGGGCTCGACTTGAAGCTTCCAGTCGAGAACTATATGGTCACCGATGAGGAGCGGGGGTTGGTGTGCTTGCCAATGGGAGCTGCA
GGAGCTTTGTCAATTTTCGGCAGCATGCAACACCAGAATATGTTGGTTCTTCATGATCTCAAGAAGGAAGTTGTATCATTTGTTCTCACAGTGTGCTCGGCTTGGTAA
Protein sequence
MTFTNFLTVWNWESEFRIFNKYRQHYESSHREDAFDVLKPFFTAQTNLPVKHSASIGLHLCFELPSHNKSKLDVPDLVFHFEGLDLKLPVENYMVTDEERGLVCLPMGAA
GALSIFGSMQHQNMLVLHDLKKEVVSFVLTVCSAW
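
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a dependency-free illustration using the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and it assumes the CDS is in frame from its first base.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACATTCACAAAT"))  # first five codons of the CDS above
print(translate("TCGGCTTGGTAA"))     # last four codons, ending in the stop codon
```

The two fragments translate to `MTFTN` and `SAW`, matching the start and end of the protein sequence listed above. Passing the full CDS (with its line breaks) to `translate` would allow the whole protein to be compared in the same way.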