CuGenDBv2

Lag0041603 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041603
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr13:21579858..21582075
RNA-Seq Expression: Lag0041603
Synteny: Lag0041603
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily
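
The genome location field above follows a `chromosome:start..end` pattern. A minimal sketch of how such a string might be split into its parts is shown below. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive; this is not
    stated in the record itself.
    """
    return end - start + 1


chrom, start, end = parse_location("chr13:21579858..21582075")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 2218 bp.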


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] | e value: 1.5e-44 | % identity: 43.6
Query:  MFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDE----VLEALDGRFDRI-DDYDW-SRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCA
        MF+  KL  RP+LC+RKF   D++++ NFLR  +    ++++ +    R+  DYDW  R   ++ Y+ G HSD++  W  V+AVY+P+N+GG HW++ C 
Subjt:  MFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDE----VLEALDGRFDRI-DDYDW-SRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCA

Query:  DLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCR
        D +  EL+V DSF+ +     LE   KP+ T++P L+ + GV   K  +PL  WR++R    PQQ   GDCG+F   F EYDVT    DTL+Q  M+F R
Subjt:  DLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCR

Query:  RQFAVQLWANR
        RQFAVQLWAN+
Subjt:  RQFAVQLWANR

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia] | e value: 1.2e-33 | % identity: 46.62
Query:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP
        M Y+  +HSD+ + W  VEAVY+PFN+ GNHWV+ C D    E+VV DS   + +   LE   K + T++P LL K  V+ V+  LP+  WR++R    P
Subjt:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP

Query:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR
        +Q  SGDCG+F  K+ EYDVT + L+TL Q++M++ RRQFA QLW+N+
Subjt:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] | e value: 5.6e-36 | % identity: 39.53
Query:  IEALFMFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDEVLEAL--DGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPC
        I++L M   RK++    L + +F I D++++ N LRR +   A    G       YDW + + +  YV+G  SD++ LWS  + VY   N+GGNHWV+  
Subjt:  IEALFMFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDEVLEAL--DGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPC

Query:  ADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFC
         DL   +L V DS   +    DLE   KP+ T++P +L   G++ ++  LP+  WR++R   VPQQ    DC +F  +F EYDV  SK+DTL Q +++  
Subjt:  ADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFC

Query:  RRQFAVQLWANRPFF
        RRQ+AVQ+WA RPFF
Subjt:  RRQFAVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] | e value: 2.1e-38 | % identity: 44.55
Query:  LDARPDLCQRKFVIEDLVVAVNFLRRDEVLEALDGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDS
        +D  PD+  RK          NFLRRD                DWS  K V+ YV G H+D+++ WS+V+AVY+PFNL G HWVL CAD +  EL++ DS
Subjt:  LDARPDLCQRKFVIEDLVVAVNFLRRDEVLEALDGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDS

Query:  FLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRP
         + L+ + DLE   + V    P LL    VM+  H L ++ W L+R+    QQ +SGDCG+FT KF EYDVT SK+ TL+QD   + RRQ+A+Q+WANR 
Subjt:  FLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRP

Query:  FF
         F
Subjt:  FF

XP_038885861.1 sentrin-specific protease [Benincasa hispida] | e value: 2.3e-37 | % identity: 45.95
Query:  DEVLEA---LDGRFD----RIDDY--DWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPV
        D+V++    +DGR D    R D +  DWS+   V+ YV G H+D+++ WS+V+A+Y+PFNL   HWVL C D +  EL+V DS + L+ + DLE+  + +
Subjt:  DEVLEA---LDGRFD----RIDDY--DWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPV

Query:  RTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRPFF
              LL    VM+  H L ++ W L+R+  VPQQ  SGDCG+FT KF EYDVT SK+DTL+QD M + RRQ+A+Q+ ANR  F
Subjt:  RTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRPFF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CJT2 uncharacterized protein LOC111012067 | e value: 3.4e-31 | % identity: 45.27
Query:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP
        M Y+  +HS++ + W  VEAVY+PFN+ GN+WV+ C D    E+VV DS   +  D  LE   + +RT++P LL K  V+ V   LP+  WR++R    P
Subjt:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP

Query:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR
        QQ  S DC +F  K+ EYDVT + L+TL Q++M + RRQFA QLW+N+
Subjt:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR

A0A6J1DID7 uncharacterized protein LOC111020782 | e value: 2.2e-33 | % identity: 44.38
Query:  YDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEW
        YDW + + +  YV+G  SD++  WS  + VY P N+GGNHWV+   DL   +L V DS   +    DLE   KP+ T++P++L   G++ ++  L    W
Subjt:  YDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEW

Query:  RLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRPFF
        R++R   VPQQ    DCG+F  +F EYDVT SK+DTL Q +++  RRQ+AVQ+WA RPFF
Subjt:  RLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRPFF

A0A6J1DLV0 uncharacterized protein LOC111021646 | e value: 7.1e-45 | % identity: 43.6
Query:  MFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDE----VLEALDGRFDRI-DDYDW-SRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCA
        MF+  KL  RP+LC+RKF   D++++ NFLR  +    ++++ +    R+  DYDW  R   ++ Y+ G HSD++  W  V+AVY+P+N+GG HW++ C 
Subjt:  MFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDE----VLEALDGRFDRI-DDYDW-SRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCA

Query:  DLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCR
        D +  EL+V DSF+ +     LE   KP+ T++P L+ + GV   K  +PL  WR++R    PQQ   GDCG+F   F EYDVT    DTL+Q  M+F R
Subjt:  DLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCR

Query:  RQFAVQLWANR
        RQFAVQLWAN+
Subjt:  RQFAVQLWANR

A0A6J1DQZ3 uncharacterized protein LOC111023442 | e value: 5.7e-34 | % identity: 46.62
Query:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP
        M Y+  +HSD+ + W  VEAVY+PFN+ GNHWV+ C D    E+VV DS   + +   LE   K + T++P LL K  V+ V+  LP+  WR++R    P
Subjt:  MDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVP

Query:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR
        +Q  SGDCG+F  K+ EYDVT + L+TL Q++M++ RRQFA QLW+N+
Subjt:  QQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANR

A0A6J1DY60 uncharacterized protein LOC111025273 | e value: 2.7e-36 | % identity: 39.53
Query:  IEALFMFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDEVLEAL--DGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPC
        I++L M   RK++    L + +F I D++++ N LRR +   A    G       YDW + + +  YV+G  SD++ LWS  + VY   N+GGNHWV+  
Subjt:  IEALFMFIRRKLDARPDLCQRKFVIEDLVVAVNFLRRDEVLEAL--DGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPC

Query:  ADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFC
         DL   +L V DS   +    DLE   KP+ T++P +L   G++ ++  LP+  WR++R   VPQQ    DC +F  +F EYDV  SK+DTL Q +++  
Subjt:  ADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFC

Query:  RRQFAVQLWANRPFF
        RRQ+AVQ+WA RPFF
Subjt:  RRQFAVQLWANRPFF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G45570.1 Ulp1 protease family protein | e value: 3.3e-10 | % identity: 29.46
Query:  VEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKV-KHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFL
        V+ +Y    + GNHWV    DL    + V DS   L TDT++      V T++P +L+     K  + +    EW  KR   +P+  D GDC +++ K++
Subjt:  VEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVMKV-KHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFL

Query:  EYDVTKSKLDTLSQDSMNFCRRQFAVQLW
        E        D L  ++M   R + AV+++
Subjt:  EYDVTKSKLDTLSQDSMNFCRRQFAVQLW


Sequences
CDS sequence
ATGAATCGGATAATGGAACCCCCACGTCCACCAACACCTTCTCCACCCCCTCCAGCAGTTGACGTAGATGAAGGTCCTTCTAAAGAAGGTGATGAAGGTTCGTTTAAAGG
AGGTGACGTGTCTGAAAACCCAGAAAAAGATGATGGGGATGATGGTCCTTCTGAAGGAGGTGATGAGGGTCCTTTGGGCGGGTCTGAACATCCAGAAAAAGGAGATGAGC
CAGCTGAAGAGAACCCAAAAAGAGGTGATGAGGGTCCTTTGGGCGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAACTGAAGAGACCCCAAAAGGAGATGATGGGTCC
GACTTCCAGTTGAAAGAAACTGACGAGCCTGTTGAGCCGTTACAGAGTATTGAAGAGATAGTGGCATTTGATGGAGTCACACACTCTTTGGTACCCAAAACTGAGAGTGT
TGAAGAGACAACAACCCATAGACCTGTTGAACATCGGGGGAAACGAAAGCGTCAGATCTCATGGAAGCTTCGATCTCCGTGGGCAGACACGAGGCCAGATGGAAAAAGAA
GAAAAGTTAAGGTCATTGAAGCGCTATTCATGTTTATTCGGAGAAAACTAGATGCTCGTCCGGACTTATGCCAACGAAAATTTGTGATAGAAGACTTGGTTGTCGCGGTC
AATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATCGATGACTATGACTGGAGTAGATGGAAGATCGTCATGGATTATGTTATGGGCAA
CCATTCAGACCATAACATTCTTTGGAGTTCAGTTGAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGCCGTGTGCTGACTTAGAGGCCAGCGAGT
TGGTGGTGTCTGATTCCTTTTTGCAGTTGAATACAGACACCGACTTAGAGAATCATTTCAAACCTGTTCGCACACTGATGCCAATCTTGCTCGCTAAGTGTGGTGTAATG
AAAGTGAAGCATACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGGGTATTCACTACCAAATTTTTGGA
ATATGATGTAACGAAATCCAAACTGGACACCCTTAGCCAGGATAGCATGAACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGGGCCAATAGGCCATTCTTTTAG
mRNA sequence
ATGAATCGGATAATGGAACCCCCACGTCCACCAACACCTTCTCCACCCCCTCCAGCAGTTGACGTAGATGAAGGTCCTTCTAAAGAAGGTGATGAAGGTTCGTTTAAAGG
AGGTGACGTGTCTGAAAACCCAGAAAAAGATGATGGGGATGATGGTCCTTCTGAAGGAGGTGATGAGGGTCCTTTGGGCGGGTCTGAACATCCAGAAAAAGGAGATGAGC
CAGCTGAAGAGAACCCAAAAAGAGGTGATGAGGGTCCTTTGGGCGGGTCCGAACACCCAGAAAAAGGAGATGAGCCAACTGAAGAGACCCCAAAAGGAGATGATGGGTCC
GACTTCCAGTTGAAAGAAACTGACGAGCCTGTTGAGCCGTTACAGAGTATTGAAGAGATAGTGGCATTTGATGGAGTCACACACTCTTTGGTACCCAAAACTGAGAGTGT
TGAAGAGACAACAACCCATAGACCTGTTGAACATCGGGGGAAACGAAAGCGTCAGATCTCATGGAAGCTTCGATCTCCGTGGGCAGACACGAGGCCAGATGGAAAAAGAA
GAAAAGTTAAGGTCATTGAAGCGCTATTCATGTTTATTCGGAGAAAACTAGATGCTCGTCCGGACTTATGCCAACGAAAATTTGTGATAGAAGACTTGGTTGTCGCGGTC
AATTTTCTACGACGAGATGAAGTACTAGAAGCACTGGATGGTAGGTTTGATCGTATCGATGACTATGACTGGAGTAGATGGAAGATCGTCATGGATTATGTTATGGGCAA
CCATTCAGACCATAACATTCTTTGGAGTTCAGTTGAAGCGGTCTACATACCCTTCAACCTTGGTGGGAACCATTGGGTGTTGCCGTGTGCTGACTTAGAGGCCAGCGAGT
TGGTGGTGTCTGATTCCTTTTTGCAGTTGAATACAGACACCGACTTAGAGAATCATTTCAAACCTGTTCGCACACTGATGCCAATCTTGCTCGCTAAGTGTGGTGTAATG
AAAGTGAAGCATACCCTTCCACTAAATGAATGGAGGTTGAAGAGGAACAAGCTAGTGCCACAACAAAAGGATAGTGGGGATTGTGGGGTATTCACTACCAAATTTTTGGA
ATATGATGTAACGAAATCCAAACTGGACACCCTTAGCCAGGATAGCATGAACTTTTGTAGACGTCAATTCGCTGTTCAACTTTGGGCCAATAGGCCATTCTTTTAG
Protein sequence
MNRIMEPPRPPTPSPPPPAVDVDEGPSKEGDEGSFKGGDVSENPEKDDGDDGPSEGGDEGPLGGSEHPEKGDEPAEENPKRGDEGPLGGSEHPEKGDEPTEETPKGDDGS
DFQLKETDEPVEPLQSIEEIVAFDGVTHSLVPKTESVEETTTHRPVEHRGKRKRQISWKLRSPWADTRPDGKRRKVKVIEALFMFIRRKLDARPDLCQRKFVIEDLVVAV
NFLRRDEVLEALDGRFDRIDDYDWSRWKIVMDYVMGNHSDHNILWSSVEAVYIPFNLGGNHWVLPCADLEASELVVSDSFLQLNTDTDLENHFKPVRTLMPILLAKCGVM
KVKHTLPLNEWRLKRNKLVPQQKDSGDCGVFTTKFLEYDVTKSKLDTLSQDSMNFCRRQFAVQLWANRPFF
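
The protein sequence above is consistent with a direct translation of the CDS. A minimal sketch of that check on the first 30 nucleotides of the CDS is shown below, assuming the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical and written here without external libraries; it is not the pipeline the database used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
cds_start = "ATGAATCGGATAATGGAACCCCCACGTCCA"
print(translate(cds_start))
```

Run on the opening of the CDS, this yields `MNRIMEPPRP`, which matches the first ten residues of the protein sequence listed in the record.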