; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039276 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039276
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr2:40331609..40332410
RNA-Seq ExpressionLag0039276
SyntenyLag0039276
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]2.3e-3240.44Show/hide
Query:  NFLRSEDGLYKELSSG--VHPRDLT-YEWT-KTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELK
        NFLRS DG+Y  + S   +  R  + Y+W  +  ++L Y    HS ++  W  +DAVYLP+N+ G+HW+++CID   GE++V DS   +     +EQELK
Subjt:  NFLRSEDGLYKELSSG--VHPRDLT-YEWT-KTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELK

Query:  ALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN
         +  ++P+++ + G    +  + +  W +R   S  Q+   GDCG+F   F EYDV   SFDTLTQ  M  FRRQ+ VQLWAN
Subjt:  ALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]1.4e-2943.54Show/hide
Query:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS
        + Y  + HS + + W  ++AVYLPFN++G HWV++CID   GE+VV DSLRA+     +E++LK +  V+PS+L K   +  R  L +  W +R   S  
Subjt:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS

Query:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN
        ++  SGDCG+F  K+ EYDV  +S +TL Q+ M  FRRQ+  QLW+N
Subjt:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]3.3e-3139.34Show/hide
Query:  NFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCH
        N LR  DG Y  +  GV P   TY+W +   + RY     S ++  W+  D VY   N+ G HWV++ IDL  G++ V DSL+A+   E +E+ LK +C 
Subjt:  NFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCH

Query:  VVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF
        ++P++L   G +  R  L +  W +R   +  Q+    DC +F  +F EYDVIGS  DTL Q  +  FRRQY VQ+WA  P F
Subjt:  VVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]4.3e-3945.41Show/hide
Query:  DENFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKAL
        DENFLR +                T +W+    VL+Y   +H+ +++PW+ +DAVY+PFNL G+HWVLVC D QV E+++ DSL AL+    +E E++ +
Subjt:  DENFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKAL

Query:  CHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF
        C   P +L     M+S N L + RW LR +  RSQ+ +SGDCG+F  KF EYDV GS   TLTQD    FRRQY +Q+WAN  LF
Subjt:  CHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]3.3e-3944.86Show/hide
Query:  DENFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKAL
        DENFLR +                T +W+KT+NV++Y   +H+ +++PW+ +DA+Y+PFNL  +HWVLVC+D QV E++V DSL  L+    +E E+++L
Subjt:  DENFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKAL

Query:  CHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF
        C     +L     M+S N L + RW LR +    Q+  SGDCG+F CKF EYDV GS  DTLTQD M  +RRQY +Q+ AN  LF
Subjt:  CHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF

TrEMBL top hitse value%identityAlignment
A0A6J1CJT2 uncharacterized protein LOC1110120673.7e-2842.18Show/hide
Query:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS
        + Y    HS + + W  ++AVYLPFN++G +WV++CID   GE+VV DSLRA+  +  +E++L+ +  V+PS+L K   +     L +  W +R   S  
Subjt:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS

Query:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN
        Q+  S DC +F  K+ EYDV G+S +TL Q+ MP FRRQ+  QLW+N
Subjt:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN

A0A6J1DID7 uncharacterized protein LOC1110207821.1e-2740Show/hide
Query:  YEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRW
        Y+W +   + RY     S ++ PW+  D VY P N+ G HWV++ IDL  G++ V DSL+A+   E +E+ LK +C ++P +L   G +  R  L    W
Subjt:  YEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRW

Query:  PLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF
         +R   +  Q+    DCG+F  +F EYDV GS  DTL Q  +  FRRQY VQ+WA  P F
Subjt:  PLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF

A0A6J1DLV0 uncharacterized protein LOC1110216461.1e-3240.44Show/hide
Query:  NFLRSEDGLYKELSSG--VHPRDLT-YEWT-KTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELK
        NFLRS DG+Y  + S   +  R  + Y+W  +  ++L Y    HS ++  W  +DAVYLP+N+ G+HW+++CID   GE++V DS   +     +EQELK
Subjt:  NFLRSEDGLYKELSSG--VHPRDLT-YEWT-KTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELK

Query:  ALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN
         +  ++P+++ + G    +  + +  W +R   S  Q+   GDCG+F   F EYDV   SFDTLTQ  M  FRRQ+ VQLWAN
Subjt:  ALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN

A0A6J1DQZ3 uncharacterized protein LOC1110234426.7e-3043.54Show/hide
Query:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS
        + Y  + HS + + W  ++AVYLPFN++G HWV++CID   GE+VV DSLRA+     +E++LK +  V+PS+L K   +  R  L +  W +R   S  
Subjt:  LRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRS

Query:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN
        ++  SGDCG+F  K+ EYDV  +S +TL Q+ M  FRRQ+  QLW+N
Subjt:  QRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWAN

A0A6J1DY60 uncharacterized protein LOC1110252731.6e-3139.34Show/hide
Query:  NFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCH
        N LR  DG Y  +  GV P   TY+W +   + RY     S ++  W+  D VY   N+ G HWV++ IDL  G++ V DSL+A+   E +E+ LK +C 
Subjt:  NFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCH

Query:  VVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF
        ++P++L   G +  R  L +  W +R   +  Q+    DC +F  +F EYDVIGS  DTL Q  +  FRRQY VQ+WA  P F
Subjt:  VVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF

SwissProt top hitse value%identityAlignment
O65278 Putative ubiquitin-like-specific protease 1B6.1e-0426.05Show/hide
Query:  DAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEY
        D +++P ++D +HW L  I+ +  + V  DSL       ++    K L   V           S+  + V  W +   + R Q+ +  DCG+F+ K++++
Subjt:  DAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEY

Query:  DVIGSSFDTLTQDMMPEFR
           G S    +Q  MP FR
Subjt:  DVIGSSFDTLTQDMMPEFR

Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases7.9e-0728.21Show/hide
Query:  DAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSR-----SQRCDSGDCGVF--
        D VY+PFN D  HWV +C+DL+  ++ + DS   L ++  +  EL+ L  ++P +        S +   +   P  L++       S   DSG   VF  
Subjt:  DAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVLWKFGAMDSRNELSVGRWPLRLEKSR-----SQRCDSGDCGVF--

Query:  -------VCKFLEYDVI
               V + +E+DV+
Subjt:  -------VCKFLEYDVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGATGAGAATTTTTTGAGAAGTGAAGACGGTCTGTACAAAGAGCTAAGCAGCGGCGTACACCCCCGAGACTTGACATACGAATGGACCAAGACATCAAACGTCTT
GAGGTACGCAAGGGATGAGCATTCATACCACAACATCCCATGGACCACTATTGATGCGGTGTACTTGCCTTTTAATCTCGATGGTCTCCATTGGGTTTTGGTGTGCATTG
ATTTGCAGGTCGGTGAGGTGGTCGTGTCAGATTCGCTTAGGGCATTGAACAAGGAAGAGGTGGTCGAACAGGAGTTGAAGGCCCTTTGCCACGTCGTGCCCAGTGTACTT
TGGAAGTTCGGGGCTATGGATTCAAGGAATGAACTCTCTGTTGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGTCACAACGGTGTGATAGTGGTGACTGTGGGGTGTT
TGTATGTAAATTTTTAGAGTACGATGTAATAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCCTGAATTTCGAAGGCAATATGTTGTACAATTGTGGGCCAATA
TGCCACTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGATGAGAATTTTTTGAGAAGTGAAGACGGTCTGTACAAAGAGCTAAGCAGCGGCGTACACCCCCGAGACTTGACATACGAATGGACCAAGACATCAAACGTCTT
GAGGTACGCAAGGGATGAGCATTCATACCACAACATCCCATGGACCACTATTGATGCGGTGTACTTGCCTTTTAATCTCGATGGTCTCCATTGGGTTTTGGTGTGCATTG
ATTTGCAGGTCGGTGAGGTGGTCGTGTCAGATTCGCTTAGGGCATTGAACAAGGAAGAGGTGGTCGAACAGGAGTTGAAGGCCCTTTGCCACGTCGTGCCCAGTGTACTT
TGGAAGTTCGGGGCTATGGATTCAAGGAATGAACTCTCTGTTGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGTCACAACGGTGTGATAGTGGTGACTGTGGGGTGTT
TGTATGTAAATTTTTAGAGTACGATGTAATAGGGTCGTCATTCGACACCCTTACTCAAGATATGATGCCTGAATTTCGAAGGCAATATGTTGTACAATTGTGGGCCAATA
TGCCACTTTTTTAG
Protein sequenceShow/hide protein sequence
MTDENFLRSEDGLYKELSSGVHPRDLTYEWTKTSNVLRYARDEHSYHNIPWTTIDAVYLPFNLDGLHWVLVCIDLQVGEVVVSDSLRALNKEEVVEQELKALCHVVPSVL
WKFGAMDSRNELSVGRWPLRLEKSRSQRCDSGDCGVFVCKFLEYDVIGSSFDTLTQDMMPEFRRQYVVQLWANMPLF