; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011720 (gene) of Snake gourd v1 genome

Gene IDTan0011720
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG08:67135645..67137112
RNA-Seq ExpressionTan0011720
SyntenyTan0011720
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]7.4e-5242.02Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAF-
        M+ SD Q PT+HG+PLG +N+RV VD+I+ ED  LPIP++GE+E L+Q++GNFV WPR LVI  + KK        +T+    Y D          YA  
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAF-

Query:  ----KCFIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV
            K  I+I L                               +  Y   LW  CD EI  +F+LVDQ TIS+ +KSQ+ R  NL++RLE+ N  LDQ V
Subjt:  ----KCFIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV

Query:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LIVI  R N VYV++ LR+KI   FQG IN SL+ WQ +HS  QYRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDDIRVQWADFVGRFV
        NT   Y+QEEID +RV+WA+FVGRFV
Subjt:  NTKTAYKQEEIDDIRVQWADFVGRFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.3e-5339.82Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------
        ++ ++ Q PTVHGVPLG +NVRV+VD+++ E A +PIP+RGE+E L+Q++G FV WPR LVI ++ K  ++S                            
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------

Query:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL
                              IY            I+  + C     L    YLW V +YEI  KFL+VD  TIS +VKSQ+ R  NL NRLE+VN  L
Subjt:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL

Query:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------
        +Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INTSL++WQ KHS+ +YR++  WK +K                              
Subjt:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------

Query:  --FNTKTAYKQEEIDDIRVQWADFVGRFV
          FNTK AY+QEEID++R++WADFVG  V
Subjt:  --FNTKTAYKQEEIDDIRVQWADFVGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.3e-5339.82Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------
        ++ ++ Q PTVHGVPLG +NVRV+VD+++ E A +PIP+RGE+E L+Q++G FV WPR LVI ++ K  ++S                            
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------

Query:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL
                              IY            I+  + C     L    YLW V +YEI  KFL+VD  TIS +VKSQ+ R  NL NRLE+VN  L
Subjt:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL

Query:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------
        +Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INTSL++WQ KHS+ +YR++  WK +K                              
Subjt:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------

Query:  --FNTKTAYKQEEIDDIRVQWADFVGRFV
          FNTK AY+QEEID++R++WADFVG  V
Subjt:  --FNTKTAYKQEEIDDIRVQWADFVGRFV

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus]7.4e-5242.02Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAF-
        M+ SD Q PT+HG+PLG +N+RV VD+I+ ED  LPIP++GE+E L+Q++GNFV WPR LVI  + KK        +T+    Y D          YA  
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAF-

Query:  ----KCFIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV
            K  I+I L                               +  Y   LW  CD EI  +F+LVDQ TIS+ +KSQ+ R  NL++RLE+ N  LDQ V
Subjt:  ----KCFIKITL------------------------------LLFRY---LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV

Query:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LIVI  R N VYV++ LR+KI   FQG IN SL+ WQ +HS  QYRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDDIRVQWADFVGRFV
        NT   Y+QEEID +RV+WA+FVGRFV
Subjt:  NTKTAYKQEEIDDIRVQWADFVGRFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.1e-5240.62Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTSFIYNYIDYAFK-CFIKITL-LLFRY---
        M+ SDAQ P+++ +PLG +NVR +VD+++GED  LPIP + +++ L Q++GNFV WPR LVI  K KK  +      I  + K   + +T+ LL RY   
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTSFIYNYIDYAFK-CFIKITL-LLFRY---

Query:  --------------------------------------------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV
                                                          LW  CD EI  KF++VDQ TIS+ VK Q+ R  NL+NRLE+V+  LDQ V
Subjt:  --------------------------------------------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV

Query:  FIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------FN
         IPYNTG HW+LI+I+ + N VYV++SLRSKI E FQG INTSL+ WQ KHSL QYR+ I WK +K                                FN
Subjt:  FIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------FN

Query:  TKTAYKQEEIDDIRVQWADFVGRFV
        T+ AY+Q+EID +R++WA+FV RFV
Subjt:  TKTAYKQEEIDDIRVQWADFVGRFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X14.0e-5141.41Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAFK
        M+ SD Q PT+HG+PLG EN+RV VD+ + ED  LPIP++G++E L+Q++GNFV WPR LVI  K KK        +T+    Y D          YA +
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAFK

Query:  C-----FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV
               I+I+L                 + +Y                 LW VC+ EI  +F+LVDQ TIS+ +KSQ+ R  NL+NRLE+ N  LDQ V
Subjt:  C-----FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV

Query:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LI+I  + N VYV++ LRSKI   FQG IN SL+ WQ +HS   YRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDDIRVQWADFVGRFV
        NT  AY QEEID +RV+WA+FV RFV
Subjt:  NTKTAYKQEEIDDIRVQWADFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein4.0e-5141.41Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAFK
        M+ SD Q PT+HG+PLG EN+RV VD+ + ED  LPIP++G++E L+Q++GNFV WPR LVI  K KK        +T+    Y D          YA +
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKK-------YTTSFIYNYID----------YAFK

Query:  C-----FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV
               I+I+L                 + +Y                 LW VC+ EI  +F+LVDQ TIS+ +KSQ+ R  NL+NRLE+ N  LDQ V
Subjt:  C-----FIKITLL----------------LFRY-----------------LWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKV

Query:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F
         IPYNTG  HW+LI+I  + N VYV++ LRSKI   FQG IN SL+ WQ +HS   YRS I WK +K                                F
Subjt:  FIPYNTG-YHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK--------------------------------F

Query:  NTKTAYKQEEIDDIRVQWADFVGRFV
        NT  AY QEEID +RV+WA+FV RFV
Subjt:  NTKTAYKQEEIDDIRVQWADFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X16.5e-5439.82Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------
        ++ ++ Q PTVHGVPLG +NVRV+VD+++ E A +PIP+RGE+E L+Q++G FV WPR LVI ++ K  ++S                            
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------

Query:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL
                              IY            I+  + C     L    YLW V +YEI  KFL+VD  TIS +VKSQ+ R  NL NRLE+VN  L
Subjt:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL

Query:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------
        +Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INTSL++WQ KHS+ +YR++  WK +K                              
Subjt:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------

Query:  --FNTKTAYKQEEIDDIRVQWADFVGRFV
          FNTK AY+QEEID++R++WADFVG  V
Subjt:  --FNTKTAYKQEEIDDIRVQWADFVGRFV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.9e-4541.48Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------
        ++ ++ Q PTVHGVPLG +NVRV+VD+++ E A +PIP+RGE+E L+Q++G FV WPR LVI ++ K  ++S                            
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------

Query:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL
                              IY            I+  + C     L    YLW V +YEI  KFL+VD  TIS +VKSQ+ R  NL NRLE+VN  L
Subjt:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL

Query:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK
        +Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INTSL++WQ KHS+ +YR++  WK +K
Subjt:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X26.5e-5439.82Show/hide
Query:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------
        ++ ++ Q PTVHGVPLG +NVRV+VD+++ E A +PIP+RGE+E L+Q++G FV WPR LVI ++ K  ++S                            
Subjt:  MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTS----------------------------

Query:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL
                              IY            I+  + C     L    YLW V +YEI  KFL+VD  TIS +VKSQ+ R  NL NRLE+VN  L
Subjt:  ---------------------FIY----------NYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLLVDQITISNFVKSQKTRCINLVNRLEIVNLDL

Query:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------
        +Q V IPY +G HWMLI+I+ R N VYVL+SLR KI+E +Q  INTSL++WQ KHS+ +YR++  WK +K                              
Subjt:  DQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVK------------------------------

Query:  --FNTKTAYKQEEIDDIRVQWADFVGRFV
          FNTK AY+QEEID++R++WADFVG  V
Subjt:  --FNTKTAYKQEEIDDIRVQWADFVGRFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGATGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCACTACCAAT
TCCTATACGGGGAGAAGTAGAGTTCCTGAGTCAATCTATGGGAAATTTTGTGCCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGTATACAACATCTTTCA
TTTACAACTATATTGACTACGCTTTTAAATGTTTTATTAAAATAACTTTACTTTTATTTAGGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTACTA
GTTGACCAAATAACCATTTCTAATTTTGTTAAAAGTCAAAAAACACGTTGTATAAATCTGGTTAACAGGTTAGAAATAGTTAATTTGGACTTGGATCAAAAAGTTTTCAT
CCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCAAACACCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAA
CAATAAATACATCCTTGAGAATGTGGCAAGTCAAGCACTCACTTCCACAATATCGTTCATCCATCACTTGGAAACTTGTAAAGTTTAACACAAAAACTGCATATAAACAA
GAAGAAATCGACGACATTCGAGTACAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGATGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCACTACCAAT
TCCTATACGGGGAGAAGTAGAGTTCCTGAGTCAATCTATGGGAAATTTTGTGCCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGTATACAACATCTTTCA
TTTACAACTATATTGACTACGCTTTTAAATGTTTTATTAAAATAACTTTACTTTTATTTAGGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTACTA
GTTGACCAAATAACCATTTCTAATTTTGTTAAAAGTCAAAAAACACGTTGTATAAATCTGGTTAACAGGTTAGAAATAGTTAATTTGGACTTGGATCAAAAAGTTTTCAT
CCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCAAACACCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAA
CAATAAATACATCCTTGAGAATGTGGCAAGTCAAGCACTCACTTCCACAATATCGTTCATCCATCACTTGGAAACTTGTAAAGTTTAACACAAAAACTGCATATAAACAA
GAAGAAATCGACGACATTCGAGTACAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MYTSDAQFPTVHGVPLGDENVRVVVDMIVGEDAPLPIPIRGEVEFLSQSMGNFVPWPRDLVIFNKGKKYTTSFIYNYIDYAFKCFIKITLLLFRYLWTVCDYEIIAKFLL
VDQITISNFVKSQKTRCINLVNRLEIVNLDLDQKVFIPYNTGYHWMLIVIHPRANTVYVLNSLRSKIEESFQGTINTSLRMWQVKHSLPQYRSSITWKLVKFNTKTAYKQ
EEIDDIRVQWADFVGRFV