; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004728 (gene) of Snake gourd v1 genome

Gene IDTan0004728
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionOTU domain-containing protein
Genome locationLG04:587554..591881
RNA-Seq ExpressionTan0004728
SyntenyTan0004728
Gene Ontology termsGO:0016579 - protein deubiquitination (biological process)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
InterPro domainsIPR003323 - OTU domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016900199.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis melo]1.2e-6987.33Show/hide
Query:  TINRALLLFPHSAGIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVL
        T+    L F HSAGI GDGRCLFRSVVHGACLRSGK APSE+L+KELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPH WGGEPELLMSSHVL
Subjt:  TINRALLLFPHSAGIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVL

Query:  QMPISVYMWDKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        QMPISVYM DK S NLK+IAEYGQEYGKENPIRVLFHSYGHYD LK PCN
Subjt:  QMPISVYMWDKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

XP_022945713.1 OTU domain-containing protein At3g57810-like isoform X2 [Cucurbita moschata]2.4e-6792.59Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GIAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL++PISVY+WD  S
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP
        ANLKLIAEYGQEY KENPIRVLFHSYGHYDLLK P
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP

XP_022967002.1 OTU domain-containing protein At3g57810-like isoform X1 [Cucurbita maxima]5.8e-6993.38Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA
        IAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL+MPISVY+WD  SA
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        NLKLIAEYGQEY KENPIRVLFHSYGHYDLLK PCN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

XP_022967003.1 OTU domain-containing protein At3g57810-like isoform X2 [Cucurbita maxima]1.2e-6993.43Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GIAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL+MPISVY+WD  S
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        ANLKLIAEYGQEY KENPIRVLFHSYGHYDLLK PCN
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

XP_023541543.1 OTU domain-containing protein At3g57810-like isoform X2 [Cucurbita pepo subsp. pepo]8.3e-6893.33Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GIAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL+MPISVY+WD  S
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP
        ANLKLIAEYGQEY KENPIRVLFHSYGHYDLLK P
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP

TrEMBL top hitse value%identityAlignment
A0A1S4DW37 OTU domain-containing protein At3g57810-like5.6e-7087.33Show/hide
Query:  TINRALLLFPHSAGIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVL
        T+    L F HSAGI GDGRCLFRSVVHGACLRSGK APSE+L+KELADELRENVANELMKRRLDTERFIEGDFGQYVR MRQPH WGGEPELLMSSHVL
Subjt:  TINRALLLFPHSAGIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVL

Query:  QMPISVYMWDKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        QMPISVYM DK S NLK+IAEYGQEYGKENPIRVLFHSYGHYD LK PCN
Subjt:  QMPISVYMWDKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

A0A6J1DEG4 OTU domain-containing protein At3g57810-like2.0e-6791.18Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA
        IAGDGRCLFRSVVHGACLRSGKPAPSE+L+K+LAD LRENVANELMKRRLDTERFIEGDF QYVR MRQPH WGGEPELLMSSHVLQMPISVYMWDKN+A
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        NLKLIAEYGQEYGKEN IRVLFHSYGHYD LK PC+
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

A0A6J1G1P0 OTU domain-containing protein At3g57810-like isoform X21.2e-6792.59Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GIAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL++PISVY+WD  S
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP
        ANLKLIAEYGQEY KENPIRVLFHSYGHYDLLK P
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVP

A0A6J1HPI9 OTU domain-containing protein At3g57810-like isoform X12.8e-6993.38Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA
        IAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL+MPISVY+WD  SA
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNSA

Query:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        NLKLIAEYGQEY KENPIRVLFHSYGHYDLLK PCN
Subjt:  NLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

A0A6J1HTW1 OTU domain-containing protein At3g57810-like isoform X25.6e-7093.43Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GIAGDGRCLFRSVVHGACLRSGKPAP+E LEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPH WGGEPELLMSSHVL+MPISVY+WD  S
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN
        ANLKLIAEYGQEY KENPIRVLFHSYGHYDLLK PCN
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN

SwissProt top hitse value%identityAlignment
P38747 OTU domain-containing protein 21.8e-0428.67Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRR-------LDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVY
        I  DG CLF S++    LR     P ++ +     +LR    N + + R        D E     D  +Y + M     WGGE E+L  SHV   PIS+ 
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRR-------LDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVY

Query:  MWDKNSANLKLIAEYGQEYGKENPIRVLF--HSYG---HYDLL
        M  +         +   E GK   +++++  HSY    HY+ L
Subjt:  MWDKNSANLKLIAEYGQEYGKENPIRVLF--HSYG---HYDLL

Q7ZV00 Deubiquitinase OTUD6B1.8e-0428.08Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELA---DELRENVANELMKRRLDTERFIEG----------DFGQYVRCMRQPHAWGGEPELLMSSHVLQ
        I+ DG C++R+V H            ++ E+ LA    ELR+  A  +     D   F+            +F +Y   +    AWGG+ EL   S VLQ
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELA---DELRENVANELMKRRLDTERFIEG----------DFGQYVRCMRQPHAWGGEPELLMSSHVLQ

Query:  MPISVYMWDKNSANLKLIAEYGQEYGKEN-PIRVLFHSYG---HYD
        +PI V   D     +      G+EY K    +  + H+YG   HY+
Subjt:  MPISVYMWDKNSANLKLIAEYGQEYGKEN-PIRVLFHSYG---HYD

Q8GYW0 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 31.8e-0929.68Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISV
        + GDGRCLFR++V G     G     +  E++ ADELR  V   +     + E++        ++    ++ + + +   WGGE ELL+ S + + PI V
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISV

Query:  YMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYDLLK
        Y+ +               I EYG E+      GK  +N +R+L+    HYDLL+
Subjt:  YMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYDLLK

Q8LBZ4 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 41.2e-4868.18Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GI GDGRCLFRSV HG CLRSGK AP E +++ELADELR  VA+E ++RR +TE F+EGDF  YVR +R PH WGGEPEL M+SHVLQMPI+VYM D  +
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL
          L  IAEYGQEYGK++PIRVL+H +GHYD L
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL

Arabidopsis top hitse value%identityAlignment
AT2G38025.1 Cysteine proteinases superfamily protein1.3e-1029.68Show/hide
Query:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISV
        + GDGRCLFR++V G     G     +  E++ ADELR  V   +     + E++        ++    ++ + + +   WGGE ELL+ S + + PI V
Subjt:  IAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERF--------IEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISV

Query:  YMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYDLLK
        Y+ +               I EYG E+      GK  +N +R+L+    HYDLL+
Subjt:  YMWDKN-------SANLKLIAEYGQEY------GK--ENPIRVLFHSYGHYDLLK

AT3G57810.1 Cysteine proteinases superfamily protein8.3e-5068.18Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GI GDGRCLFRSV HG CLRSGK AP E +++ELADELR  VA+E ++RR +TE F+EGDF  YVR +R PH WGGEPEL M+SHVLQMPI+VYM D  +
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL
          L  IAEYGQEYGK++PIRVL+H +GHYD L
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL

AT3G57810.2 Cysteine proteinases superfamily protein8.3e-5068.18Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GI GDGRCLFRSV HG CLRSGK AP E +++ELADELR  VA+E ++RR +TE F+EGDF  YVR +R PH WGGEPEL M+SHVLQMPI+VYM D  +
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL
          L  IAEYGQEYGK++PIRVL+H +GHYD L
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL

AT3G57810.3 Cysteine proteinases superfamily protein8.3e-5068.18Show/hide
Query:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS
        GI GDGRCLFRSV HG CLRSGK AP E +++ELADELR  VA+E ++RR +TE F+EGDF  YVR +R PH WGGEPEL M+SHVLQMPI+VYM D  +
Subjt:  GIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQMPISVYMWDKNS

Query:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL
          L  IAEYGQEYGK++PIRVL+H +GHYD L
Subjt:  ANLKLIAEYGQEYGKENPIRVLFHSYGHYDLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAAATGTAAAGGCGATAACGATAAATCGAGCTCTTTTATTGTTCCCACATTCTGCAGGCATAGCTGGTGATGGAAGATGTTTGTTTCGATCAGTGGTTCATGG
TGCTTGTCTCAGATCGGGCAAGCCAGCTCCAAGCGAGATTCTCGAAAAAGAGCTCGCAGATGAGCTCAGAGAGAACGTGGCAAATGAGTTGATGAAGAGGCGTTTGGACA
CCGAACGGTTTATTGAAGGTGACTTTGGCCAGTATGTTAGATGCATGCGCCAACCACACGCGTGGGGAGGAGAACCCGAGTTACTTATGTCTTCACATGTCCTACAGATG
CCAATCTCGGTCTACATGTGGGACAAGAACTCGGCAAATCTTAAGCTCATAGCTGAGTATGGGCAGGAATACGGTAAAGAAAATCCTATTCGCGTCCTCTTTCACAGCTA
CGGACATTATGATTTATTGAAGGTTCCATGCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAAATGTAAAGGCGATAACGATAAATCGAGCTCTTTTATTGTTCCCACATTCTGCAGGCATAGCTGGTGATGGAAGATGTTTGTTTCGATCAGTGGTTCATGG
TGCTTGTCTCAGATCGGGCAAGCCAGCTCCAAGCGAGATTCTCGAAAAAGAGCTCGCAGATGAGCTCAGAGAGAACGTGGCAAATGAGTTGATGAAGAGGCGTTTGGACA
CCGAACGGTTTATTGAAGGTGACTTTGGCCAGTATGTTAGATGCATGCGCCAACCACACGCGTGGGGAGGAGAACCCGAGTTACTTATGTCTTCACATGTCCTACAGATG
CCAATCTCGGTCTACATGTGGGACAAGAACTCGGCAAATCTTAAGCTCATAGCTGAGTATGGGCAGGAATACGGTAAAGAAAATCCTATTCGCGTCCTCTTTCACAGCTA
CGGACATTATGATTTATTGAAGGTTCCATGCAATTAA
Protein sequenceShow/hide protein sequence
MEGNVKAITINRALLLFPHSAGIAGDGRCLFRSVVHGACLRSGKPAPSEILEKELADELRENVANELMKRRLDTERFIEGDFGQYVRCMRQPHAWGGEPELLMSSHVLQM
PISVYMWDKNSANLKLIAEYGQEYGKENPIRVLFHSYGHYDLLKVPCN