; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005925 (gene) of Snake gourd v1 genome

Gene IDTan0005925
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncathepsin O
Genome locationLG05:72636490..72638280
RNA-Seq ExpressionTan0005925
SyntenyTan0005925
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3971565.1 hypothetical protein CMV_004850 [Castanea mollissima]8.9e-0837.11Show/hide
Query:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEEDFPFE
        DW D+  L PVR    C CCWAI    A++ +  I HP  +  +QLS Q L++C V       +   KE C   S   A+ WI   G+  EE +PF+
Subjt:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEEDFPFE

OAD54413.1 hypothetical protein WN48_07777 [Eufriesea mexicana]2.0e-0736.11Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP
        FDW DY V+TPV+    C  CWA   TG I+  + I    H NLL LS Q L++C              E CN   +  AY  I T  GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

XP_017761376.1 PREDICTED: uncharacterized protein LOC108551655 [Eufriesea mexicana]2.0e-0736.11Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP
        FDW DY V+TPV+    C  CWA   TG I+  + I    H NLL LS Q L++C              E CN   +  AY  I T  GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

XP_023521676.1 low-temperature-induced cysteine proteinase-like [Cucurbita pepo subsp. pepo]1.1e-1339.09Show/hide
Query:  EADAFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNI-FHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEED
        + + FDW  +++LTPVR    C CCWAI   GAI++++NI +   +NN+LQ +PQ LI+C +N N  +I     E C   ++ +A+ WI++ KG+ KE+D
Subjt:  EADAFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNI-FHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEED

Query:  FPFEPELRLC
        + F      C
Subjt:  FPFEPELRLC

XP_030939752.1 ervatamin-B-like [Quercus lobata]2.8e-0937.86Show/hide
Query:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEEDFPFEPEL
        DWS  H+L PVR    C CCWAI  T AI+ ++ +     N +  L+PQ LI+C     V N     K+ C   + ++A+ WIM  G+  EED+PF+ + 
Subjt:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEEDFPFEPEL

Query:  RLC
          C
Subjt:  RLC

TrEMBL top hitse value%identityAlignment
A0A310SH02 Uncharacterized protein9.6e-0836.11Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP
        FDW DY V+TPV+    C  CWA   TG I+  + I    H NLL LS Q L++C              E CN   +  AY  I T  GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

A0A5P1EW68 Pept_C1 domain-containing protein4.8e-0737.25Show/hide
Query:  AFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMT----KETCNPLSISQAYTWIMTKGLVKEEDF
        + DW +  VL  VR    C CCW I      + M+ I H    +L  LSPQ +INC      F   G+T    K  C   SI++A+ +IMT+G++ E+D 
Subjt:  AFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMT----KETCNPLSISQAYTWIMTKGLVKEEDF

Query:  PF
        PF
Subjt:  PF

A0A6J3L3K9 uncharacterized protein LOC1172387748.1e-0733.33Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP
        FDW D+ V+TPV+    C  CWA   TG ++  + I    HN LL LS Q L++C              E CN   +  AY  I    GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

A0A6P3TXK7 LOW QUALITY PROTEIN: putative cysteine proteinase CG121638.1e-0733.33Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP
        FDW D+ V+TPV+    C  CWA   TG ++  + I    HN LL LS Q L++C              E CN   +  AY  I    GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

A0A6P8LMZ8 putative cysteine proteinase CG121638.1e-0733.33Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP
        FDW D+ V+TPV+    C  CWA   TG ++  + I    HN LL LS Q L++C              E CN   +  AY  I    GL  E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI-MTKGLVKEEDFPFEP

Query:  ELRLCNHI
        +   C+ +
Subjt:  ELRLCNHI

SwissProt top hitse value%identityAlignment
O45734 Cathepsin L-like1.3e-0629.63Show/hide
Query:  DAFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIM-TKGLVKEEDFPF
        D  DW D H++T V+   MC  CWA   TGA++  H         L+ LS Q L++C                CN   + QA+ +I    G+  EE +P+
Subjt:  DAFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIM-TKGLVKEEDFPF

Query:  EPELRLCN
        +     C+
Subjt:  EPELRLCN

P43234 Cathepsin O7.1e-0833.33Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI--MTKGLVKEEDFPFE
        FDW D  V+T VR   MC  CWA    GA+++ + I       L  LS Q +I+C  N             CN  S   A  W+  M   LVK+ ++PF+
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWI--MTKGLVKEEDFPFE

Query:  PELRLCNH
         +  LC++
Subjt:  PELRLCNH

P83443 Macrodontain-11.1e-0527.1Show/hide
Query:  AFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFE
        + DW DY  +  V+    C  CWA     AI T+  I+     NL+ LS Q +++C V+             C    +++AY +I++  G+  +E++P+ 
Subjt:  AFDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMT-KGLVKEEDFPFE

Query:  PELRLCN
             CN
Subjt:  PELRLCN

Q80LP4 Viral cathepsin1.5e-0528.57Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEP
        FDW   + +T V+    C  CWA    G ++T++ I    HN L+ LS Q LI+C                C+   +  A+  +M   GL++E D+P++ 
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEP

Query:  ELRLC
           +C
Subjt:  ELRLC

Q9YMP9 Viral cathepsin1.5e-0529.52Show/hide
Query:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEE-DFPFEP
        FDW + + +T ++    C  CWA     ++++    F   HN L+ LS Q LI+C                CN   +  A+  IM  G V+ E D+PF  
Subjt:  FDWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEE-DFPFEP

Query:  ELRLC
          R C
Subjt:  ELRLC

Arabidopsis top hitse value%identityAlignment
AT4G11310.1 Papain family cysteine protease8.9e-0627.62Show/hide
Query:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEPE
        DW +   +T V+    C+ CWA    GA++ ++ I       L+ LS Q LINC                C    +  AY +IM   GL  + D+P++  
Subjt:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEPE

Query:  LRLCN
          +C+
Subjt:  LRLCN

AT4G11320.1 Papain family cysteine protease3.1e-0627.88Show/hide
Query:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEPE
        DW +   +T V+   +C+ CWA    GA++ ++ I       L+ LS Q LINC                C    +  AY +IM   GL  + D+P++  
Subjt:  DWSDYHVLTPVRQLNMCKCCWAIVFTGAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTK-GLVKEEDFPFEPE

Query:  LRLC
          +C
Subjt:  LRLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAAAAGACTCTTTATCCTTTTTCAACTGAACTAGCATACACATGGATAATGATGATGCTCAAAGAAGGGGACGTTTCATTTAAAGCAGAACTAGTACCTTGCAA
GCGGATAAATCCACGAATCGGTGGATATATACTTTTACAAGATGCTCATGAATTAAATTCACAAGTTCAACGCCAGCAGATTGTTGCAATAATGAAATATACTCAAGAAT
TGACAAATTTGAAAAAGGAAGCTGATGCCTTTGACTGGAGCGACTACCACGTCCTCACTCCTGTTAGGCAACTAAACATGTGTAAATGTTGTTGGGCCATTGTCTTCACT
GGGGCCATAAAAACCATGCATAACATCTTCCATCCCTCCCATAACAACCTATTGCAATTGTCACCTCAATGCCTAATAAACTGTGTGGTCAATCTTAATGTTTTTAATAT
TATAGGTATGACAAAAGAAACTTGTAATCCCTTATCAATTTCTCAAGCATACACATGGATAATGACTAAAGGGTTGGTCAAAGAAGAAGATTTTCCATTTGAACCTGAAC
TAAGACTGTGCAACCACATATAA
mRNA sequenceShow/hide mRNA sequence
GAAGAGACAACATTTTGTCACATGTACACTTCATTCAACTAATAAAAAGTGTAACATTCTTTTAAATAATTTAAAACTTACTATATTTGTAAACTTACAATATTAATTAT
AAAAGACCTCATGTTGTTCACTTGTAGTTTTACTTTTTACATACCAATTAGCTAGTATTTAACTATAAATTTTCTTTCTTACCAGAATGTTATTGGACCATCACAGTGGC
CAAATCAATAATTGAAACATCATATAACATCATCCACTCTTTAAATATAATAACCCACAATGGTTATCACCTCAATCTATGTTGTACACCCAACATATTCAAAATAATTG
ATATGAGCAAAAAAAACTCAAATAATTGATATGAGCAAAAAGACTCTTTATCCTTTTTCAACTGAACTAGCATACACATGGATAATGATGATGCTCAAAGAAGGGGACGT
TTCATTTAAAGCAGAACTAGTACCTTGCAAGCGGATAAATCCACGAATCGGTGGATATATACTTTTACAAGATGCTCATGAATTAAATTCACAAGTTCAACGCCAGCAGA
TTGTTGCAATAATGAAATATACTCAAGAATTGACAAATTTGAAAAAGGAAGCTGATGCCTTTGACTGGAGCGACTACCACGTCCTCACTCCTGTTAGGCAACTAAACATG
TGTAAATGTTGTTGGGCCATTGTCTTCACTGGGGCCATAAAAACCATGCATAACATCTTCCATCCCTCCCATAACAACCTATTGCAATTGTCACCTCAATGCCTAATAAA
CTGTGTGGTCAATCTTAATGTTTTTAATATTATAGGTATGACAAAAGAAACTTGTAATCCCTTATCAATTTCTCAAGCATACACATGGATAATGACTAAAGGGTTGGTCA
AAGAAGAAGATTTTCCATTTGAACCTGAACTAAGACTGTGCAACCACATATAAAA
Protein sequenceShow/hide protein sequence
MSKKTLYPFSTELAYTWIMMMLKEGDVSFKAELVPCKRINPRIGGYILLQDAHELNSQVQRQQIVAIMKYTQELTNLKKEADAFDWSDYHVLTPVRQLNMCKCCWAIVFT
GAIKTMHNIFHPSHNNLLQLSPQCLINCVVNLNVFNIIGMTKETCNPLSISQAYTWIMTKGLVKEEDFPFEPELRLCNHI