CuGenDBv2

Carg16016 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg16016
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: aspartic proteinase-like protein 1 isoform X1
Genome location: Carg_Chr07:2339063..2340075
RNA-Seq Expression: Carg16016
Synteny: Carg16016
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain
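
The genome location field follows a `sequence:start..end` layout. A minimal Python sketch for splitting it into its parts is given here. It assumes the coordinates are 1-based and inclusive (the record itself does not state this), and `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr07:2339063..2340075")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1,013 bp on Carg_Chr07.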


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594861.1 Aspartic proteinase-like protein 1, partial [Cucurbita argyrosperma subsp. sororia] | e value 1.6e-74 | identity 98.64%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMS AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

KAG7026822.1 Aspartic proteinase-like protein 1 [Cucurbita argyrosperma subsp. argyrosperma] | e value 4.6e-85 | identity 100%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLVGLLFGICSFYIAINYF
        WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLVGLLFGICSFYIAINYF
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLVGLLFGICSFYIAINYF

XP_022963037.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata] | e value 2.1e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMS AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

XP_022963039.1 aspartic proteinase-like protein 1 isoform X3 [Cucurbita moschata] | e value 2.1e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMS AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

XP_023003199.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima] | e value 3.6e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQA+S AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
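
The % identity column can be related to the middle line of each alignment, where identical residues appear as letters, conservative substitutions as `+`, and other mismatches as spaces. The Python sketch below counts these. `midline_identity` is a hypothetical helper, and it assumes the midline has been extracted without its leading indentation and that identity is reported as identical positions divided by alignment length.

```python
def midline_identity(midline):
    """Return (identities, positives, length) for a BLAST-style midline."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Second block of the XP_022963037.1 alignment from the hits listed above.
midline = "WLHYAWIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSL"
identities, positives, length = midline_identity(midline)
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
```

To reproduce the value reported for a whole hit, the midline blocks of that hit would need to be concatenated before counting.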

TrEMBL top hits (e value, % identity, alignment)
A0A5D3CLH5 Aspartic proteinase-like protein 1 isoform X1 | e value 3.8e-69 | identity 90.48%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLV++LLM+I  HQA+S  FTSRILHRFSEE+KALRVS STNTSVRVSWPEKGSMEYYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHY WIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

A0A6J1HE55 aspartic proteinase-like protein 1 isoform X1 | e value 1.0e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMS AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

A0A6J1HIV4 aspartic proteinase-like protein 1 isoform X3 | e value 1.0e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQAMS AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

A0A6J1KNG7 aspartic proteinase-like protein 1 isoform X3 | e value 1.8e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQA+S AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

A0A6J1KSL9 aspartic proteinase-like protein 1 isoform X1 | e value 1.8e-74 | identity 97.96%
Query:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLVLVLLMMISAHQA+S AFTSRILHRFSEE+KALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
Subjt:  WLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2 | e value 9.2e-04 | identity 35.14%
Query:  ELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCD-CIQC
        EL+    +R + ++ S   +L  S G +T     D  +L    + IGTP  SF   +D GSDL+W  C+ C QC
Subjt:  ELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCD-CIQC

Q8VYV9 Aspartyl protease family protein 1 | e value 1.0e-10 | identity 32.62%
Query:  SLRNLVLVLLMMISAHQAMSTA-----FTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIAL
        S R L L LL+++++   +        F     HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+G++T+ +
Subjt:  SLRNLVLVLLMMISAHQAMSTA-----FTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIAL

Query:  GNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQC
         +  G+LHYA + +GTPS  F+VALD GSDL W+PCDC  C
Subjt:  GNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQC

Q9LX20 Aspartic proteinase-like protein 1 | e value 4.2e-41 | identity 55.32%
Query:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAW
        +L  ++ ++  + +++ F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHY W
Subjt:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL

Arabidopsis top hits (e value, % identity, alignment)
AT3G51330.1 Eukaryotic aspartyl protease family protein | e value 8.5e-13 | identity 39.66%
Query:  STAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFL
        S  F+  + H FS+     RV +S      V  PEKGS+EY++ L   D   +   L S  +   + F   G++TI++ +  G+LHYA + +GTP+  FL
Subjt:  STAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFL

Query:  VALDAGSDLLWVPCDC
        VALD GSDL W+PC+C
Subjt:  VALDAGSDLLWVPCDC

AT3G51340.1 Eukaryotic aspartyl protease family protein | e value 1.4e-12 | identity 44.44%
Query:  PEKGSMEYYQELVSGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDC
        PE GS+EY++ L   D F R +    +  +    S GS      N  G+LHYA + +GTP+  FLVALD GSDL W+PC+C
Subjt:  PEKGSMEYYQELVSGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFLVALDAGSDLLWVPCDC

AT3G51360.1 Eukaryotic aspartyl protease family protein | e value 3.4e-14 | identity 36.07%
Query:  MMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGT
        M +    ++S + +  I HRFSE++K +              PE GS++YY+ LV  D  RQ     +    +  ++G+ T     +  +LHYA + IGT
Subjt:  MMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGT

Query:  PSVSFLVALDAGSDLLWVPCDC
        P+  FLVALD GSDL W+PC+C
Subjt:  PSVSFLVALDAGSDLLWVPCDC

AT4G35880.1 Eukaryotic aspartyl protease family protein | e value 2.1e-19 | identity 38.62%
Query:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGW
        ++ +LM++S        FT  + HRFS+E+K      S +T     +P KGS EY+  LV  D+  +  +L      S   L F S+G+ T  + +  G+
Subjt:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGW

Query:  LHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGS
        LHY  + +GTP + F+VALD GSDL WVPCDC +CAP   + Y S
Subjt:  LHYAWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGS

AT5G10080.1 Eukaryotic aspartyl protease family protein | e value 3.0e-42 | identity 55.32%
Query:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAW
        +L  ++ ++  + +++ F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHY W
Subjt:  VLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL


Sequences
CDS sequence
ATGTCGCTTCGGAATCTGGTTTTGGTCCTGCTGATGATGATTTCCGCTCACCAGGCTATGTCGACTGCGTTTACATCGAGAATACTTCACCGGTTCTCTGAGGAGTTGAA
GGCGCTTAGGGTTTCAAGGAGTACGAATACGAGTGTACGAGTCTCGTGGCCTGAGAAAGGGAGCATGGAGTATTATCAGGAGCTTGTGAGCGGTGACTTCCAGAGGCAGA
AGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGCAAAACCATTGCACTGGGGAATGACTTTGGCTGGTTGCATTACGCCTGGATCGACATCGGG
ACGCCAAGTGTTTCATTTCTGGTTGCATTGGATGCCGGGAGTGATCTGCTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCTTTGTCTGCAAGTTACTATGGCAGTCT
GGTAGGCCTGCTTTTCGGAATCTGTTCTTTCTACATAGCCATTAATTACTTCTAA
mRNA sequence
TCTGTCTGCTTCTTTACACTTCACAACTCTTCTTCTTCCTAACTTCCGGCTGAACTCGTCGGTTTCTGTGATCTCTCCTCCATAATTTCGGCGGTCCGTAAACTCGAAAG
CTTGCTGTTTCTGTTTCTCATTTTGGTCCCGGGCGGAAATTGAGGGTCTGAATTGTTGGCAGTTTCTTCTGAAGCTCCAGATCTCGACTTCGAGCTGCTTCGTTCGTCCT
TTGAGGATTGGTTGCAATGTCGCTTCGGAATCTGGTTTTGGTCCTGCTGATGATGATTTCCGCTCACCAGGCTATGTCGACTGCGTTTACATCGAGAATACTTCACCGGT
TCTCTGAGGAGTTGAAGGCGCTTAGGGTTTCAAGGAGTACGAATACGAGTGTACGAGTCTCGTGGCCTGAGAAAGGGAGCATGGAGTATTATCAGGAGCTTGTGAGCGGT
GACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGCAAAACCATTGCACTGGGGAATGACTTTGGCTGGTTGCATTACGC
CTGGATCGACATCGGGACGCCAAGTGTTTCATTTCTGGTTGCATTGGATGCCGGGAGTGATCTGCTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCTTTGTCTGCAA
GTTACTATGGCAGTCTGGTAGGCCTGCTTTTCGGAATCTGTTCTTTCTACATAGCCATTAATTACTTCTAA
Protein sequence
MSLRNLVLVLLMMISAHQAMSTAFTSRILHRFSEELKALRVSRSTNTSVRVSWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIG
TPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLVGLLFGICSFYIAINYF
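
The protein sequence is expected to correspond to a translation of the CDS. A small Python sketch of that check follows. It assumes the standard genetic code applies to this gene; `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGCTTCGGAATCTG"))  # first six codons of the CDS
print(translate("ATTAATTACTTCTAA"))     # last five codons, including the stop
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence would complete the check.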