; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008296 (gene) of Snake gourd v1 genome

Gene IDTan0008296
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG07:9278850..9280325
RNA-Seq ExpressionTan0008296
SyntenyTan0008296
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]1.2e-4939.88Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY
        M+ SD Q PT+HG+PLG +N+RV VD+I+ +D  LPIP++GE+++L+Q++ NFVAWPR LVI  + K                    + T   +  Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY

Query:  AFEC--FIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
          +    I+I L                               +  Y   LW  CD EI  +F+LVDQ  IS+ +KSQE R  NL +RLEM N  LDQ V
Subjt:  AFEC--FIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LIVI  R+N  YV++ LR+KI   FQG IN SL+ WQ +HS  QYRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDEIRVQWADFVGRFV
        NT   Y+QEEID +RV+WA+FVGRFV
Subjt:  NTKTAYKQEEIDEIRVQWADFVGRFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]3.2e-5540.92Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI
        ++ ++ Q PTVHGVPLGV+NVRV+VD+++ + A +PIP+RGE+++L+Q++  FVAWPR LVI ++ KN ++S                        Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI

Query:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  ++I                                  L    YLW V +YEI  KFL+VD   IS +VKSQE R  NLANRLEMVN  L+Q V
Subjt:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN
         IPY +G HWMLI+I+ R+N  YVL+SLR KI+E +Q  INTSL++WQAKHS+ +YR++  WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDEIRVQWADFVGRFV
        TK AY+QEEIDE+R++WADFVG  V
Subjt:  TKTAYKQEEIDEIRVQWADFVGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]3.2e-5540.92Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI
        ++ ++ Q PTVHGVPLGV+NVRV+VD+++ + A +PIP+RGE+++L+Q++  FVAWPR LVI ++ KN ++S                        Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI

Query:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  ++I                                  L    YLW V +YEI  KFL+VD   IS +VKSQE R  NLANRLEMVN  L+Q V
Subjt:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN
         IPY +G HWMLI+I+ R+N  YVL+SLR KI+E +Q  INTSL++WQAKHS+ +YR++  WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDEIRVQWADFVGRFV
        TK AY+QEEIDE+R++WADFVG  V
Subjt:  TKTAYKQEEIDEIRVQWADFVGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]1.4e-5040.18Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY
        M+ SDAQ P+++ +PLG +NVR +VD+++G+D  LPIP + ++K+L Q++ NFVAWPR LVI  K K                    + T   +  Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY

Query:  AFEC--FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
        + +    I+I L                 + +Y                 LW  CD EI  KF++VDQ  IS+ VK QE R  NL NRLEMV+  LDQ V
Subjt:  AFEC--FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LI+I+ ++N  YV++SLRSKI E FQG INTSL+ WQAKHSL QYR+ I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDEIRVQWADFVGRFV
        NT+ AY+Q+EID +R++WA+FV RFV
Subjt:  NTKTAYKQEEIDEIRVQWADFVGRFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]5.7e-5240.31Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY
        M+ SDAQ P+++ +PLG +NVR +VD+++G+D  LPIP + ++K+L Q++ NFVAWPR LVI  K K                    + T   +  Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK--------------------NYTTSFIYNYIDY

Query:  AFEC--FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
        + +    I+I L                 + +Y                 LW  CD EI  KF++VDQ  IS+ VK QE R  NL NRLEMV+  LDQ V
Subjt:  AFEC--FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN
         IPYNTG HW+LI+I+ ++N  YV++SLRSKI E FQG INTSL+ WQAKHSL QYR+ I WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDEIRVQWADFVGRFV
        T+ AY+Q+EID +R++WA+FV RFV
Subjt:  TKTAYKQEEIDEIRVQWADFVGRFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.7e-4940.8Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK-------NYTTSFIYNYID----------YAF-
        M+ SD Q PT+HG+PLG EN+RV VD+ + +D  LPIP++G++++L+Q++ NFVAWPR LVI  K K       + +T+    Y D          YA  
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK-------NYTTSFIYNYID----------YAF-

Query:  ----ECFIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  I+I+L                 + +Y                 LW VC+ EI  +F+LVDQ  IS+ +KSQE R  NL NRLEM N  LDQ V
Subjt:  ----ECFIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LI+I  ++N  YV++ LRSKI   FQG IN SL+ WQ +HS   YRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDEIRVQWADFVGRFV
        NT  AY QEEID +RV+WA+FV RFV
Subjt:  NTKTAYKQEEIDEIRVQWADFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.7e-4940.8Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK-------NYTTSFIYNYID----------YAF-
        M+ SD Q PT+HG+PLG EN+RV VD+ + +D  LPIP++G++++L+Q++ NFVAWPR LVI  K K       + +T+    Y D          YA  
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGK-------NYTTSFIYNYID----------YAF-

Query:  ----ECFIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  I+I+L                 + +Y                 LW VC+ EI  +F+LVDQ  IS+ +KSQE R  NL NRLEM N  LDQ V
Subjt:  ----ECFIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LI+I  ++N  YV++ LRSKI   FQG IN SL+ WQ +HS   YRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDEIRVQWADFVGRFV
        NT  AY QEEID +RV+WA+FV RFV
Subjt:  NTKTAYKQEEIDEIRVQWADFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X11.6e-5540.92Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI
        ++ ++ Q PTVHGVPLGV+NVRV+VD+++ + A +PIP+RGE+++L+Q++  FVAWPR LVI ++ KN ++S                        Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI

Query:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  ++I                                  L    YLW V +YEI  KFL+VD   IS +VKSQE R  NLANRLEMVN  L+Q V
Subjt:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN
         IPY +G HWMLI+I+ R+N  YVL+SLR KI+E +Q  INTSL++WQAKHS+ +YR++  WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDEIRVQWADFVGRFV
        TK AY+QEEIDE+R++WADFVG  V
Subjt:  TKTAYKQEEIDEIRVQWADFVGRFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X47.7e-4742.48Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI
        ++ ++ Q PTVHGVPLGV+NVRV+VD+++ + A +PIP+RGE+++L+Q++  FVAWPR LVI ++ KN ++S                        Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI

Query:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  ++I                                  L    YLW V +YEI  KFL+VD   IS +VKSQE R  NLANRLEMVN  L+Q V
Subjt:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK
         IPY +G HWMLI+I+ R+N  YVL+SLR KI+E +Q  INTSL++WQAKHS+ +YR++  WK +K
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.6e-5540.92Show/hide
Query:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI
        ++ ++ Q PTVHGVPLGV+NVRV+VD+++ + A +PIP+RGE+++L+Q++  FVAWPR LVI ++ KN ++S                        Y  +
Subjt:  MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFI----------------------YNYI

Query:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV
            E  ++I                                  L    YLW V +YEI  KFL+VD   IS +VKSQE R  NLANRLEMVN  L+Q V
Subjt:  DYAFECFIKIT---------------------------------LLLFRYLWTVCDYEIIAKFLLVDQIIISNFVKSQETRCINLANRLEMVNLDLDQQV

Query:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN
         IPY +G HWMLI+I+ R+N  YVL+SLR KI+E +Q  INTSL++WQAKHS+ +YR++  WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDEIRVQWADFVGRFV
        TK AY+QEEIDE+R++WADFVG  V
Subjt:  TKTAYKQEEIDEIRVQWADFVGRFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTAAAGATGCTCCATTACCAAT
TCCTATACGGGGAGAAGTAAAGTCCCTGAGTCAATCTATGAGAAATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAATTATACAACATCTTTTA
TTTACAACTATATTGACTACGCTTTTGAATGTTTTATTAAAATAACTTTACTTTTATTTAGGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTA
GTTGATCAAATAATCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTTAT
CCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGCAAAAACACCGCTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAA
CAATAAATACATCCTTGAGAATGTGGCAAGCCAAACACTCACTTCCACAATATCGTTCTTCCATCACTTGGAAACTTGTAAAGTTTAACACAAAAACTGCATATAAACAA
GAAGAAATCGACGAGATTCGAGTACAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTAAAGATGCTCCATTACCAAT
TCCTATACGGGGAGAAGTAAAGTCCCTGAGTCAATCTATGAGAAATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAATTATACAACATCTTTTA
TTTACAACTATATTGACTACGCTTTTGAATGTTTTATTAAAATAACTTTACTTTTATTTAGGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTA
GTTGATCAAATAATCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTTAT
CCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGCAAAAACACCGCTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAA
CAATAAATACATCCTTGAGAATGTGGCAAGCCAAACACTCACTTCCACAATATCGTTCTTCCATCACTTGGAAACTTGTAAAGTTTAACACAAAAACTGCATATAAACAA
GAAGAAATCGACGAGATTCGAGTACAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MYTSDAQFPTVHGVPLGVENVRVVVDMIVGKDAPLPIPIRGEVKSLSQSMRNFVAWPRDLVIFNKGKNYTTSFIYNYIDYAFECFIKITLLLFRYLWTVCDYEIIAKFLL
VDQIIISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTGYHWMLIVIHPRKNTAYVLNSLRSKIEESFQGTINTSLRMWQAKHSLPQYRSSITWKLVKFNTKTAYKQ
EEIDEIRVQWADFVGRFV