; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039865 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039865
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionOTU domain-containing protein
Genome locationchr13:503163..505434
RNA-Seq ExpressionLag0039865
SyntenyLag0039865
Gene Ontology termsGO:0016579 - protein deubiquitination (biological process)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
InterPro domainsIPR003323 - OTU domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK15712.1 OTU domain-containing protein [Cucumis melo var. makuwa]1.1e-6492.65Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        I GDGRCLFRSVVHGA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK S 
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

XP_004135307.1 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 4 isoform X2 [Cucumis sativus]3.1e-6491.91Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        I GDGRCLFRSVV+GA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK S 
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

XP_016900199.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis melo]1.6e-6592.03Show/hide
Query:  SGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKN
        +GI GDGRCLFRSVVHGA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK 
Subjt:  SGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKN

Query:  SANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        S NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  SANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

XP_022151797.1 OTU domain-containing protein At3g57810-like [Momordica charantia]2.5e-6692.65Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        IAGDGRCLFRSVVHGA LRSGKPAPSEVLQK+LAD LRENVANELMKRRLDTERFIEGDF QYVR+MRQPHVWGGEPELLMSSHVLQMPISVYMWDKN+A
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLKLIAEYGQEYGKEN IRVLFHSYGHY+SLKA C+
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

XP_022967003.1 OTU domain-containing protein At3g57810-like isoform X2 [Cucurbita maxima]6.2e-6587.94Show/hide
Query:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMW
        ++  GIAGDGRCLFRSVVHGA LRSGKPAP+E L+KELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVL+MPISVY+W
Subjt:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMW

Query:  DKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        D  SANLKLIAEYGQEY KENPIRVLFHSYGHY+ LKA CN
Subjt:  DKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

TrEMBL top hitse value%identityAlignment
A0A0A0KS16 OTU domain-containing protein1.5e-6491.91Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        I GDGRCLFRSVV+GA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK S 
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

A0A1S4DW37 OTU domain-containing protein At3g57810-like7.9e-6692.03Show/hide
Query:  SGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKN
        +GI GDGRCLFRSVVHGA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK 
Subjt:  SGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKN

Query:  SANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        S NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  SANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

A0A5D3CW13 OTU domain-containing protein5.1e-6592.65Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        I GDGRCLFRSVVHGA LRSGK APSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVLQMPISVYM DK S 
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLK+IAEYGQEYGKENPIRVLFHSYGHY+SLKA CN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

A0A6J1DEG4 OTU domain-containing protein At3g57810-like1.2e-6692.65Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA
        IAGDGRCLFRSVVHGA LRSGKPAPSEVLQK+LAD LRENVANELMKRRLDTERFIEGDF QYVR+MRQPHVWGGEPELLMSSHVLQMPISVYMWDKN+A
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        NLKLIAEYGQEYGKEN IRVLFHSYGHY+SLKA C+
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

A0A6J1HTW1 OTU domain-containing protein At3g57810-like isoform X23.0e-6587.94Show/hide
Query:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMW
        ++  GIAGDGRCLFRSVVHGA LRSGKPAP+E L+KELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPHVWGGEPELLMSSHVL+MPISVY+W
Subjt:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMW

Query:  DKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN
        D  SANLKLIAEYGQEY KENPIRVLFHSYGHY+ LKA CN
Subjt:  DKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN

SwissProt top hitse value%identityAlignment
P38747 OTU domain-containing protein 26.8e-0630.07Show/hide
Query:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRR-------LDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVY
        I  DG CLF S++   +LR     P ++ Q     +LR    N + + R        D E     D  +Y ++M     WGGE E+L  SHV   PIS+ 
Subjt:  IAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRR-------LDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVY

Query:  MWDKNSANLKLIAEYGQEYGKENPIRVLF--HSYG---HYESL
        M  +         +   E GK   +++++  HSY    HY SL
Subjt:  MWDKNSANLKLIAEYGQEYGKENPIRVLF--HSYG---HYESL

Q7ZV00 Deubiquitinase OTUD6B9.8e-0527.78Show/hide
Query:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDT---ERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISV
        L +  I+ DG C++R+V H    R G     + L+ + A  +R + A++ M    +    + +   +F +Y   +     WGG+ EL   S VLQ+PI V
Subjt:  LLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDT---ERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISV

Query:  YMWDKNSANLKLIAEYGQEYGKEN-PIRVLFHSYG---HYESLK
           D     +      G+EY K    +  + H+YG   HY S++
Subjt:  YMWDKNSANLKLIAEYGQEYGKEN-PIRVLFHSYG---HYESLK

Q8GYW0 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 34.2e-0827.85Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMP
        V  + GDGRCLFR++V G     G     +  +++ ADELR  V   +     + E++        ++    ++ +++ +   WGGE ELL+ S + + P
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMP

Query:  ISVYMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYESLK
        I VY+ +               I EYG E+      GK  +N +R+L+    HY+ L+
Subjt:  ISVYMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYESLK

Q8LBZ4 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 41.4e-4867.91Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK
        + GI GDGRCLFRSV HG  LRSGK AP E +Q+ELADELR  VA+E ++RR +TE F+EGDF  YVRQ+R PHVWGGEPEL M+SHVLQMPI+VYM D 
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK

Query:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL
         +  L  IAEYGQEYGK++PIRVL+H +GHY++L
Subjt:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL

Q8N6M0 Deubiquitinase OTUD6B2.2e-0426.8Show/hide
Query:  LSAFLLLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEV-LQKELADELRENVANEL--MKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQ
        L+A  L +  I  DG C+++++    +L+    A + V L+ + A+ ++ +V + L  +      + +   +F +Y   +     WGG+ EL   SH+LQ
Subjt:  LSAFLLLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEV-LQKELADELRENVANEL--MKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQ

Query:  MPISVYMWDKNSANLKLIAEYGQEYGKENPIRV-LFHSYG---HYESLKAACN
         PI +   D     +      G+EY K+  I V + H+YG   HY S+    N
Subjt:  MPISVYMWDKNSANLKLIAEYGQEYGKENPIRV-LFHSYG---HYESLKAACN

Arabidopsis top hitse value%identityAlignment
AT2G38025.1 Cysteine proteinases superfamily protein3.0e-0927.85Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMP
        V  + GDGRCLFR++V G     G     +  +++ ADELR  V   +     + E++        ++    ++ +++ +   WGGE ELL+ S + + P
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMP

Query:  ISVYMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYESLK
        I VY+ +               I EYG E+      GK  +N +R+L+    HY+ L+
Subjt:  ISVYMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYESLK

AT3G57810.1 Cysteine proteinases superfamily protein1.0e-4967.91Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK
        + GI GDGRCLFRSV HG  LRSGK AP E +Q+ELADELR  VA+E ++RR +TE F+EGDF  YVRQ+R PHVWGGEPEL M+SHVLQMPI+VYM D 
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK

Query:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL
         +  L  IAEYGQEYGK++PIRVL+H +GHY++L
Subjt:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL

AT3G57810.2 Cysteine proteinases superfamily protein1.0e-4967.91Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK
        + GI GDGRCLFRSV HG  LRSGK AP E +Q+ELADELR  VA+E ++RR +TE F+EGDF  YVRQ+R PHVWGGEPEL M+SHVLQMPI+VYM D 
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK

Query:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL
         +  L  IAEYGQEYGK++PIRVL+H +GHY++L
Subjt:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL

AT3G57810.3 Cysteine proteinases superfamily protein1.0e-4967.91Show/hide
Query:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK
        + GI GDGRCLFRSV HG  LRSGK AP E +Q+ELADELR  VA+E ++RR +TE F+EGDF  YVRQ+R PHVWGGEPEL M+SHVLQMPI+VYM D 
Subjt:  VSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDK

Query:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL
         +  L  IAEYGQEYGK++PIRVL+H +GHY++L
Subjt:  NSANLKLIAEYGQEYGKENPIRVLFHSYGHYESL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCTATCTGCTTTTCTTCTTCTCGTTTCCGGCATAGCTGGTGATGGAAGATGTCTGTTTCGATCAGTGGTTCATGGTGCTCGTCTCAGATCGGGCAAGCCAGCTCC
AAGCGAGGTTCTCCAGAAAGAGCTTGCAGACGAGCTCAGAGAGAACGTTGCAAATGAGTTGATGAAAAGGCGTTTGGACACTGAACGGTTTATTGAAGGTGACTTCGGCC
AGTATGTTAGACAGATGCGCCAACCACATGTGTGGGGAGGAGAACCTGAGTTACTTATGTCTTCACATGTCCTACAGATGCCAATCTCAGTCTACATGTGGGACAAGAAC
TCTGCAAATCTTAAACTCATAGCCGAGTATGGGCAGGAGTACGGAAAAGAAAATCCTATCCGCGTCCTTTTTCACAGCTACGGACATTATGAATCATTGAAGGCTGCATG
CAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCTATCTGCTTTTCTTCTTCTCGTTTCCGGCATAGCTGGTGATGGAAGATGTCTGTTTCGATCAGTGGTTCATGGTGCTCGTCTCAGATCGGGCAAGCCAGCTCC
AAGCGAGGTTCTCCAGAAAGAGCTTGCAGACGAGCTCAGAGAGAACGTTGCAAATGAGTTGATGAAAAGGCGTTTGGACACTGAACGGTTTATTGAAGGTGACTTCGGCC
AGTATGTTAGACAGATGCGCCAACCACATGTGTGGGGAGGAGAACCTGAGTTACTTATGTCTTCACATGTCCTACAGATGCCAATCTCAGTCTACATGTGGGACAAGAAC
TCTGCAAATCTTAAACTCATAGCCGAGTATGGGCAGGAGTACGGAAAAGAAAATCCTATCCGCGTCCTTTTTCACAGCTACGGACATTATGAATCATTGAAGGCTGCATG
CAATTGA
Protein sequenceShow/hide protein sequence
MHLSAFLLLVSGIAGDGRCLFRSVVHGARLRSGKPAPSEVLQKELADELRENVANELMKRRLDTERFIEGDFGQYVRQMRQPHVWGGEPELLMSSHVLQMPISVYMWDKN
SANLKLIAEYGQEYGKENPIRVLFHSYGHYESLKAACN