; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035535 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035535
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr3:23697219..23698723
RNA-Seq ExpressionLag0035535
SyntenyLag0035535
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia]1.2e-2838.12Show/hide
Query:  MEASIFVGRHGLDGKRKKLRCTIPCASYHLNLIISSRRGWTTRDWMVIDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDDVLEAL----------
        +E  +   R  + G  +K   +I   ++  NL+      W T +  V+D LFM +RKKL+ RPDLC RKF TGD+V+A++ RR D + A           
Subjt:  MEASIFVGRHGLDGKRKKLRCTIPCASYHLNLIISSRRGWTTRDWMVIDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDDVLEAL----------

Query:  -------DGRFDRLDDY----------EWIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTL
               DGR + +  Y           W+E D  YIPFN+ G HWV++C DLE GE+VV+DS   + T+  +E  LK +HT++ ++L KC VMKV+P L
Subjt:  -------DGRFDRLDDY----------EWIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTL

Query:  PL
        P+
Subjt:  PL

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]1.1e-4040.76Show/hide
Query:  MFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD----------------------------VLEALDGRFDRLDDYEWIEAD--YIPFNLGGNHWVLLCA
        MF+  KL  RP+LC+RKF TGDV++++FLR  D                            +L  +DG     +D  W++ D  Y+P+N+GG HW+++C 
Subjt:  MFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD----------------------------VLEALDGRFDRLDDYEWIEAD--YIPFNLGGNHWVLLCA

Query:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR
        D + GEL+V+DSFM +     +E++LKP+ T++  L+ + GV   KP +PL  WR++R    PQQ   GDCG+F   F EYDVT    DTL+Q  M F R
Subjt:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR

Query:  LQFVVQLWANR
         QF VQLWAN+
Subjt:  LQFVVQLWANR

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]5.9e-3147.33Show/hide
Query:  IEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLE
        +EA Y+PFN+ GNHWV++C D   GE+VV+DS   + +   +E+QLK +HT++  LL K  V+ V+P LP+  WR++R    P+Q  SGDCG+F  K+ E
Subjt:  IEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLE

Query:  YDVTKSKLDTLSQDSMDFCRLQFVVQLWANR
        YDVT + L+TL Q++M + R QF  QLW+N+
Subjt:  YDVTKSKLDTLSQDSMDFCRLQFVVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]4.5e-3138.79Show/hide
Query:  IDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD---------VLEA---LDGRFDRL---------DDYE--WIEAD--YIPFNLGGNHWVLLCA
        ID+L M   +K++    L + +F  GDV++++ LRR D         VL +    D R +R           DY+  W EAD  Y   N+GGNHWV++  
Subjt:  IDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD---------VLEA---LDGRFDRL---------DDYE--WIEAD--YIPFNLGGNHWVLLCA

Query:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR
        DL  G+L V+DS   +    D+EK LKP+ T++  +L   G++ ++P LP+  WR++R   VPQQ    DC +F  +F EYDV  SK+DTL Q ++   R
Subjt:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR

Query:  LQFVVQLWANRPFF
         Q+ VQ+WA RPFF
Subjt:  LQFVVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]1.6e-2840.66Show/hide
Query:  LDARPDLCQRK---FVTGDVVVADFLRRDDVLEALDGRFDRLD-DYEWIEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTL
        +D  PD+  RK   F+  D    D+     VL+ + G+    D  +  ++A Y+PFNL G HWVL+CAD +  EL++FDS + L+ N D+E +++ V   
Subjt:  LDARPDLCQRK---FVTGDVVVADFLRRDDVLEALDGRFDRLD-DYEWIEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTL

Query:  MSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCRLQFVVQLWANRPFF
           LL    VM+    L ++ W L+R+    QQ +SGDCG+FT KF EYDVT SK+ TL+QD   + R Q+ +Q+WANR  F
Subjt:  MSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCRLQFVVQLWANRPFF

TrEMBL top hitse value%identityAlignment
A0A6J1D3R7 uncharacterized protein LOC1110169934.6e-2938.61Show/hide
Query:  MEASIFVGRHGLDGKRKKLRCTIPCASYHLNLIISSRRGWTTRDWMVIDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDDVLEAL----------
        +E  +   R  + G  +K   +I   ++  NL+      W T +  V+D LFM +RKKL+ RPDLC RKF TGD+V+A++ RR D L A           
Subjt:  MEASIFVGRHGLDGKRKKLRCTIPCASYHLNLIISSRRGWTTRDWMVIDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDDVLEAL----------

Query:  -------DGRFDRLDDY----------EWIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTL
               DGR + +  Y           W+E D  YIPFN+ G HWV++C DLE GE+VV+DS   + T+  +E  LK +HT++ ++L KC VMKV+P L
Subjt:  -------DGRFDRLDDY----------EWIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTL

Query:  PL
        P+
Subjt:  PL

A0A6J1DID7 uncharacterized protein LOC1110207821.3e-2846.21Show/hide
Query:  RLDDYE--WIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSG
        R  DY+  W EAD  Y P N+GGNHWV++  DL  G+L V+DS   +    D+EK LKP+ T++ ++L   G++ ++P L    WR++R   VPQQ    
Subjt:  RLDDYE--WIEAD--YIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSG

Query:  DCGVFTAKFLEYDVTKSKLDTLSQDSMDFCRLQFVVQLWANRPFF
        DCG+F  +F EYDVT SK+DTL Q ++   R Q+ VQ+WA RPFF
Subjt:  DCGVFTAKFLEYDVTKSKLDTLSQDSMDFCRLQFVVQLWANRPFF

A0A6J1DLV0 uncharacterized protein LOC1110216465.2e-4140.76Show/hide
Query:  MFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD----------------------------VLEALDGRFDRLDDYEWIEAD--YIPFNLGGNHWVLLCA
        MF+  KL  RP+LC+RKF TGDV++++FLR  D                            +L  +DG     +D  W++ D  Y+P+N+GG HW+++C 
Subjt:  MFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD----------------------------VLEALDGRFDRLDDYEWIEAD--YIPFNLGGNHWVLLCA

Query:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR
        D + GEL+V+DSFM +     +E++LKP+ T++  L+ + GV   KP +PL  WR++R    PQQ   GDCG+F   F EYDVT    DTL+Q  M F R
Subjt:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR

Query:  LQFVVQLWANR
         QF VQLWAN+
Subjt:  LQFVVQLWANR

A0A6J1DQZ3 uncharacterized protein LOC1110234422.9e-3147.33Show/hide
Query:  IEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLE
        +EA Y+PFN+ GNHWV++C D   GE+VV+DS   + +   +E+QLK +HT++  LL K  V+ V+P LP+  WR++R    P+Q  SGDCG+F  K+ E
Subjt:  IEADYIPFNLGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLE

Query:  YDVTKSKLDTLSQDSMDFCRLQFVVQLWANR
        YDVT + L+TL Q++M + R QF  QLW+N+
Subjt:  YDVTKSKLDTLSQDSMDFCRLQFVVQLWANR

A0A6J1DY60 uncharacterized protein LOC1110252732.2e-3138.79Show/hide
Query:  IDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD---------VLEA---LDGRFDRL---------DDYE--WIEAD--YIPFNLGGNHWVLLCA
        ID+L M   +K++    L + +F  GDV++++ LRR D         VL +    D R +R           DY+  W EAD  Y   N+GGNHWV++  
Subjt:  IDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDD---------VLEA---LDGRFDRL---------DDYE--WIEAD--YIPFNLGGNHWVLLCA

Query:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR
        DL  G+L V+DS   +    D+EK LKP+ T++  +L   G++ ++P LP+  WR++R   VPQQ    DC +F  +F EYDV  SK+DTL Q ++   R
Subjt:  DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCR

Query:  LQFVVQLWANRPFF
         Q+ VQ+WA RPFF
Subjt:  LQFVVQLWANRPFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein3.7e-0727.5Show/hide
Query:  LGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKV-KPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKL
        + GNHWV L  DL    + V+DS   L T+T++  Q   V T++  +L+     K  + +    EW  KR   +P+  D+ DC +++ K++E        
Subjt:  LGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKV-KPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKL

Query:  DTLSQDSMDFCRLQFVVQLW
        D L  ++M     +  V+++
Subjt:  DTLSQDSMDFCRLQFVVQLW

AT5G45570.1 Ulp1 protease family protein8.9e-0929.17Show/hide
Query:  LGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKV-KPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKL
        + GNHWV L  DL    + V+DS   L T+T++  Q   V T++  +L+     K  + +    EW  KR   +P+  D GDC +++ K++E        
Subjt:  LGGNHWVLLCADLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKV-KPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKL

Query:  DTLSQDSMDFCRLQFVVQLW
        D L  ++M   R +  V+++
Subjt:  DTLSQDSMDFCRLQFVVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATGGTCCTTCTAAAGGAGGTGATGAGGGTCCTTTGGTGCGTCCGAACACCCAGAAAAAGGAGATGCGCCATCAGGAAGATGAGCCAACTAAAGAGACTCCGAA
GGAAGTGATGGGTCAAACTTCTACTGTTGAGCCATCACAAAGTATTGAAGAGATAGCGGCATTTGATGGAGTCACGCACTCTTTGGTACCCAAGACTAAGCCTGTTGAAT
TTGAGTCACAAAGTGTTGAAGAGACAACAACCCATGACCTGTTGAACGTCGGGGAAACGATAGCGTCAGATCTCATGGAAGCTTCAATCTTTGTGGGCAGACACGGGCTA
GATGGAAAAAGAAAAAAGTTAAGGTGTACAATCCCATGTGCGTCATACCACCTAAACTTGATCATCAGTTCCAGACGTGGTTGGACGACCCGTGACTGGATGGTCATTGA
CGCACTATTCATGTTTATTCGAAAAAAACTGGATGCTCGTCCAGACTTATGCCAACGTAAATTTGTGACAGGTGATGTGGTTGTTGCGGACTTTTTACGACGAGACGATG
TACTAGAAGCACTAGATGGTAGATTTGATCGTCTCGATGACTATGAATGGATTGAAGCGGACTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGTTGTGTGCT
GACTTAGAGGCCGGTGAGTTGGTGGTGTTTGATTCCTTTATGCAGTTGAATACAAACACTGACATAGAGAAGCAGTTGAAGCCTGTTCACACACTGATGTCGATTTTGCT
GGCTAAGTGTGGTGTCATGAAGGTGAAGCCTACACTTCCACTGAATGAATGGAGGTTGAAGAGGAACAAACTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGGGTAT
TCACTGCCAAATTTTTGGAATATGATGTAACGAAGTCCAAACTGGACACCCTTAGTCAGGATAGCATGGACTTTTGTAGGCTTCAATTCGTTGTTCAACTTTGGGCCAAT
AGGCCATTCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGATGGTCCTTCTAAAGGAGGTGATGAGGGTCCTTTGGTGCGTCCGAACACCCAGAAAAAGGAGATGCGCCATCAGGAAGATGAGCCAACTAAAGAGACTCCGAA
GGAAGTGATGGGTCAAACTTCTACTGTTGAGCCATCACAAAGTATTGAAGAGATAGCGGCATTTGATGGAGTCACGCACTCTTTGGTACCCAAGACTAAGCCTGTTGAAT
TTGAGTCACAAAGTGTTGAAGAGACAACAACCCATGACCTGTTGAACGTCGGGGAAACGATAGCGTCAGATCTCATGGAAGCTTCAATCTTTGTGGGCAGACACGGGCTA
GATGGAAAAAGAAAAAAGTTAAGGTGTACAATCCCATGTGCGTCATACCACCTAAACTTGATCATCAGTTCCAGACGTGGTTGGACGACCCGTGACTGGATGGTCATTGA
CGCACTATTCATGTTTATTCGAAAAAAACTGGATGCTCGTCCAGACTTATGCCAACGTAAATTTGTGACAGGTGATGTGGTTGTTGCGGACTTTTTACGACGAGACGATG
TACTAGAAGCACTAGATGGTAGATTTGATCGTCTCGATGACTATGAATGGATTGAAGCGGACTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGTTGTGTGCT
GACTTAGAGGCCGGTGAGTTGGTGGTGTTTGATTCCTTTATGCAGTTGAATACAAACACTGACATAGAGAAGCAGTTGAAGCCTGTTCACACACTGATGTCGATTTTGCT
GGCTAAGTGTGGTGTCATGAAGGTGAAGCCTACACTTCCACTGAATGAATGGAGGTTGAAGAGGAACAAACTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGGGTAT
TCACTGCCAAATTTTTGGAATATGATGTAACGAAGTCCAAACTGGACACCCTTAGTCAGGATAGCATGGACTTTTGTAGGCTTCAATTCGTTGTTCAACTTTGGGCCAAT
AGGCCATTCTTTTAG
Protein sequenceShow/hide protein sequence
MDDGPSKGGDEGPLVRPNTQKKEMRHQEDEPTKETPKEVMGQTSTVEPSQSIEEIAAFDGVTHSLVPKTKPVEFESQSVEETTTHDLLNVGETIASDLMEASIFVGRHGL
DGKRKKLRCTIPCASYHLNLIISSRRGWTTRDWMVIDALFMFIRKKLDARPDLCQRKFVTGDVVVADFLRRDDVLEALDGRFDRLDDYEWIEADYIPFNLGGNHWVLLCA
DLEAGELVVFDSFMQLNTNTDIEKQLKPVHTLMSILLAKCGVMKVKPTLPLNEWRLKRNKLVPQQKDSGDCGVFTAKFLEYDVTKSKLDTLSQDSMDFCRLQFVVQLWAN
RPFF