; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015475 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015475
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr12:13803805..13805405
RNA-Seq ExpressionLag0015475
SyntenyLag0015475
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153247.1 uncharacterized protein LOC111020782 [Momordica charantia]2.8e-1833.12Show/hide
Query:  FDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTW
        +DW + +++ +YV G  ++Y+ PWS                         G++ V DSL A+T   +LEK +  +CTI+P +L   G++ ++  L    W
Subjt:  FDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTW

Query:  TFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF
           RR  VPQQ    DC I   +F EYDV G  M +L +  ++  RR+Y VQ+W   PFF
Subjt:  TFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]8.7e-2031.58Show/hide
Query:  MFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGGN------PESFDW-SRFKSVLKYVRGEHTNYNVPWSTV--------------------
        MF+C K+     L   +    D++++ FLR  D VY  +   N         +DW  R  S+L Y+ G H++ +  W  V                    
Subjt:  MFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGGN------PESFDW-SRFKSVLKYVRGEHTNYNVPWSTV--------------------

Query:  ---GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRR
           GE+ V DS + +T   +LE+E+  + TI+P L+   GV   K ++P T W  +R    PQQ  +GDC I    F EYDV      +L++ RM+F RR
Subjt:  ---GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRR

Query:  KYVVQLWCN
        ++ VQLW N
Subjt:  KYVVQLWCN

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]9.9e-2431.31Show/hide
Query:  IDSLFMFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGG---NPESFDWSRFKSVLKYVRGEHTNYNVPWSTV-------------------
        IDSL M   +K++K   L+ ++    D++++  LRR D  Y  +  G   +  ++DW + +++ +YV G  ++Y+  WS                     
Subjt:  IDSLFMFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGG---NPESFDWSRFKSVLKYVRGEHTNYNVPWSTV-------------------

Query:  ----GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCR
            G++ V DSL A+T   +LEK +  +CTI+P +L   G++ ++ +LP   W   RR  VPQQ    DC I   +F EYDV G  + +L +  ++  R
Subjt:  ----GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCR

Query:  RKYVVQLWCNTPFF
        R+Y VQ+W   PFF
Subjt:  RKYVVQLWCNTPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]4.4e-2438.82Show/hide
Query:  EQLVGGNPESFDWSRFKSVLKYVRGEHTNYNVPWSTVG-----------------------EVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVME
        E  +  +  + DWS  K VLKYV G+HT+Y+VPWS V                        E+ + DSL+AL  + +LE EM +VC   P LL  G VME
Subjt:  EQLVGGNPESFDWSRFKSVLKYVRGEHTNYNVPWSTVG-----------------------EVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVME

Query:  VKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF
           +L    WT +R     QQ ++GDC +   KF EYDV G  MG+L++DR  + RR+Y +Q+W N   F
Subjt:  VKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]6.0e-2138.51Show/hide
Query:  SFDWSRFKSVLKYVRGEHTNYNVPWSTVG-----------------------EVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTT
        + DWS+  +V+KYV G+HT+Y+VPWS V                        E+ V DSL+ L  + +LE EM  +C     LL A  VME   +L    
Subjt:  SFDWSRFKSVLKYVRGEHTNYNVPWSTVG-----------------------EVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTT

Query:  WTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF
        WT +R   VPQQ  +GDC +   KF EYDV G  M +L++DRM + RR+Y +Q+  N   F
Subjt:  WTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF

TrEMBL top hitse value%identityAlignment
A0A6J1CJT2 uncharacterized protein LOC1110120673.7e-1631.97Show/hide
Query:  LKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVP
        + Y+   H+NY + W  V                       GE+ V DSL A+T D  LE+++ ++ T++P LL    V+ V  +LP   W  +R    P
Subjt:  LKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVP

Query:  QQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCN
        QQ  + DC+I   K+ EYDV G  + +L ++ M + RR++  QLW N
Subjt:  QQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCN

A0A6J1D492 uncharacterized protein LOC1110168904.3e-1734.67Show/hide
Query:  SFDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTT
        ++ W    ++ +YV G  +++NVPWS                         G++ V DSL   T   ELEKE+  +CTILP LL  GG+  V+  LP   
Subjt:  SFDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTT

Query:  WTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKY
        W   RR+ VPQQ    DC I   ++ EYD  G +M +L++D + + RR+Y
Subjt:  WTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKY

A0A6J1DID7 uncharacterized protein LOC1110207821.3e-1833.12Show/hide
Query:  FDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTW
        +DW + +++ +YV G  ++Y+ PWS                         G++ V DSL A+T   +LEK +  +CTI+P +L   G++ ++  L    W
Subjt:  FDWSRFKSVLKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTW

Query:  TFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF
           RR  VPQQ    DC I   +F EYDV G  M +L +  ++  RR+Y VQ+W   PFF
Subjt:  TFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF

A0A6J1DLV0 uncharacterized protein LOC1110216464.2e-2031.58Show/hide
Query:  MFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGGN------PESFDW-SRFKSVLKYVRGEHTNYNVPWSTV--------------------
        MF+C K+     L   +    D++++ FLR  D VY  +   N         +DW  R  S+L Y+ G H++ +  W  V                    
Subjt:  MFICKKMDKSSRLMSLEVRYCDIVVTEFLRRWD-VYEQLVGGN------PESFDW-SRFKSVLKYVRGEHTNYNVPWSTV--------------------

Query:  ---GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRR
           GE+ V DS + +T   +LE+E+  + TI+P L+   GV   K ++P T W  +R    PQQ  +GDC I    F EYDV      +L++ RM+F RR
Subjt:  ---GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRR

Query:  KYVVQLWCN
        ++ VQLW N
Subjt:  KYVVQLWCN

A0A6J1DQZ3 uncharacterized protein LOC1110234426.3e-1631.29Show/hide
Query:  LKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVP
        + Y+   H++Y + W  V                       GE+ V DSL A+TS   LE+++ ++ T++P LL    V+ V+ +LP T W  +R    P
Subjt:  LKYVRGEHTNYNVPWSTV-----------------------GEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKRRIGVP

Query:  QQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCN
        +Q  +GDC I   K+ EYDV    + +L ++ M++ RR++  QLW N
Subjt:  QQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45570.1 Ulp1 protease family protein2.6e-0629.13Show/hide
Query:  VAVTDSLVALTSDIELEKEMNIVCTILPWLLEAG-GVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVV
        V V DS+ +LT+D E+  +   V T++P +L +     + + S  +  W  KR   +P+ +D GDC I + K++E    G     L ++ M   R K  V
Subjt:  VAVTDSLVALTSDIELEKEMNIVCTILPWLLEAG-GVMEVKSSLPRTTWTFKRRIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVV

Query:  QLW
        +++
Subjt:  QLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGGAGGTTCTATTAGGGATGGAGGCCGAGTTACACGACATGCACAAGATAAAGGTGTAGACCCGACTAAGTATATTGGACCTGACGATGATGTCGATTATGGGGG
TCCACCCACCAAAAAACATGACGACGCCACCCGGGAGCGTTACGCCGAGGACAACATACCTAGTAGTGGGAAGGGGCTTATTGAGTTTGAGGACAGGGAGGACAAAGTGG
ACGTAACGGGACTGGACGAACCCTTCGTGCGTCGTGGAAAGTGTGTAATTGACTCTCTTTTCATGTTCATCTGTAAGAAGATGGACAAATCGTCCCGACTTATGTCGTTG
GAGGTTCGTTATTGTGACATAGTTGTTACGGAGTTCTTAAGGCGATGGGACGTATATGAACAACTTGTAGGAGGCAACCCTGAGTCCTTCGACTGGAGTAGGTTCAAGTC
CGTCCTCAAATATGTTCGGGGCGAGCACACGAATTATAATGTTCCATGGAGTACGGTAGGTGAGGTGGCCGTCACAGATTCACTAGTGGCTCTTACGTCCGACATCGAGT
TGGAGAAGGAAATGAATATAGTATGCACCATCCTTCCATGGCTTCTAGAAGCAGGTGGTGTCATGGAGGTGAAATCGTCACTACCACGCACTACATGGACATTCAAGAGG
AGGATTGGAGTTCCCCAGCAGGTAGATAATGGAGATTGTGAGATCTCCGCTGCAAAATTCCTCGAGTATGATGTAAATGGGGTGGACATGGGTTCTCTTAGTGAGGATAG
GATGAATTTTTGTAGACGAAAGTATGTTGTGCAACTATGGTGTAACACGCCATTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACGGAGGTTCTATTAGGGATGGAGGCCGAGTTACACGACATGCACAAGATAAAGGTGTAGACCCGACTAAGTATATTGGACCTGACGATGATGTCGATTATGGGGG
TCCACCCACCAAAAAACATGACGACGCCACCCGGGAGCGTTACGCCGAGGACAACATACCTAGTAGTGGGAAGGGGCTTATTGAGTTTGAGGACAGGGAGGACAAAGTGG
ACGTAACGGGACTGGACGAACCCTTCGTGCGTCGTGGAAAGTGTGTAATTGACTCTCTTTTCATGTTCATCTGTAAGAAGATGGACAAATCGTCCCGACTTATGTCGTTG
GAGGTTCGTTATTGTGACATAGTTGTTACGGAGTTCTTAAGGCGATGGGACGTATATGAACAACTTGTAGGAGGCAACCCTGAGTCCTTCGACTGGAGTAGGTTCAAGTC
CGTCCTCAAATATGTTCGGGGCGAGCACACGAATTATAATGTTCCATGGAGTACGGTAGGTGAGGTGGCCGTCACAGATTCACTAGTGGCTCTTACGTCCGACATCGAGT
TGGAGAAGGAAATGAATATAGTATGCACCATCCTTCCATGGCTTCTAGAAGCAGGTGGTGTCATGGAGGTGAAATCGTCACTACCACGCACTACATGGACATTCAAGAGG
AGGATTGGAGTTCCCCAGCAGGTAGATAATGGAGATTGTGAGATCTCCGCTGCAAAATTCCTCGAGTATGATGTAAATGGGGTGGACATGGGTTCTCTTAGTGAGGATAG
GATGAATTTTTGTAGACGAAAGTATGTTGTGCAACTATGGTGTAACACGCCATTTTTTTAG
Protein sequenceShow/hide protein sequence
MYGGSIRDGGRVTRHAQDKGVDPTKYIGPDDDVDYGGPPTKKHDDATRERYAEDNIPSSGKGLIEFEDREDKVDVTGLDEPFVRRGKCVIDSLFMFICKKMDKSSRLMSL
EVRYCDIVVTEFLRRWDVYEQLVGGNPESFDWSRFKSVLKYVRGEHTNYNVPWSTVGEVAVTDSLVALTSDIELEKEMNIVCTILPWLLEAGGVMEVKSSLPRTTWTFKR
RIGVPQQVDNGDCEISAAKFLEYDVNGVDMGSLSEDRMNFCRRKYVVQLWCNTPFF