; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G03375 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G03375
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUlp1-like peptidase
Genome locationClcChr04:10597558..10599234
RNA-Seq ExpressionClc04G03375
SyntenyClc04G03375
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038874902.1 uncharacterized protein LOC120067405 [Benincasa hispida]2.9e-1139.18Show/hide
Query:  KVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDL
        + A  WE ED Y D+V G   +    W++VDFVY++ N   HW+++A+D+  GHI ++DSL SY     LV +   L  T+ SL  +C+++  K DL
Subjt:  KVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDL

XP_038875042.1 uncharacterized protein LOC120067568 [Benincasa hispida]2.9e-1144.05Show/hide
Query:  DFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDL
        D+V+G   +    WS+VDFVY++ N  QHW+L+A ++N   + ++DSL S  S + L  F  PL YTLPSL  +C+L   KPD+
Subjt:  DFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDL

XP_038881126.1 uncharacterized protein LOC120072727 [Benincasa hispida]2.2e-1144.83Show/hide
Query:  AKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCN
        A VWE E+ Y+D+V+G   +    WS+VDFVY++ N  QHW+L+A D+N G + ++DSL S  S ++L     PL YTL SL  +C+
Subjt:  AKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCN

XP_038899753.1 uncharacterized protein LOC120086987 [Benincasa hispida]3.3e-1541.84Show/hide
Query:  KVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDLS
        + A  WE ED Y ++V+G   +    W++VDF+Y++ N  +HWI++A+D+N GHI ++DSL SY     LV +  PL  T+PSL  +C+++  K DLS
Subjt:  KVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDLS

XP_038902498.1 uncharacterized protein LOC120089158 [Benincasa hispida]2.6e-1227.3Show/hide
Query:  ISLPPLPIGPPPHRAYPSSIHNPTTLLPSQPAKKVILKVKGEHTVKGEPTVKPVKIEPKNKKGEERKWTSRKRKSSQPYTPPIEATKGATKLQRYIQLGE
        ++ PP P  PPP    P    +PT  LP      +       H          V +E + K        +RKRK+   YTPPIE  K  TK ++ +++ +
Subjt:  ISLPPLPIGPPPHRAYPSSIHNPTTLLPSQPAKKVILKVKGEHTVKGEPTVKPVKIEPKNKKGEERKWTSRKRKSSQPYTPPIEATKGATKLQRYIQLGE

Query:  RPTDRPLV----------------------------MTEFMIWLTSENMGPHKPGTGILFRRNHMTKELSIPYS---CSFDGSSFPDRKYVHTSTP-RCP
         P DRP +                            + E + W+  EN         I        +EL+ P S   C      F   K      P  C 
Subjt:  RPTDRPLV----------------------------MTEFMIWLTSENMGPHKPGTGILFRRNHMTKELSIPYS---CSFDGSSFPDRKYVHTSTP-RCP

Query:  MTYWYSP---KSTWGTQLVPCHTLQNKEKV-AKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLV
          +   P    S   ++      ++NK  + A V   ++   D+V+G   +    W +VDF+Y++ N  QHW+LVA D+N G + ++DSL S  S + L 
Subjt:  MTYWYSP---KSTWGTQLVPCHTLQNKEKV-AKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLV

Query:  QFFSPLRYTLPSLCEFCNLNNLKPDL
         F   L YTLPSL  +C+L   KPD+
Subjt:  QFFSPLRYTLPSLCEFCNLNNLKPDL

TrEMBL top hitse value%identityAlignment
A0A5A7T796 Ulp1-like peptidase2.3e-0631.87Show/hide
Query:  QNKEKVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFN-YKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCE
        Q KE     W++E   +D+V+G + + +  W+ VD++YS FN +  HW+L+ +D+    + ++DSLLS  S+ ++     P+R  +P+L +
Subjt:  QNKEKVAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFN-YKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCE

A0A6J1CPP7 uncharacterized protein LOC1110134392.7e-0737.35Show/hide
Query:  VAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSL
        +   W++    M+ VLG  ++  P W  VDFVY   + + HW+LVAI++N   IL+YDSL S+      ++   PL + +PSL
Subjt:  VAKVWEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSL

A0A6J1D492 uncharacterized protein LOC1110168903.3e-0530.93Show/hide
Query:  WEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNY-KQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDLSPAP
        W +E+    +V G +S+H   WS+ D VY+  N    HW+++ ID+  G I ++DSL +     +L +   P+   LP+L     + +++PDL   P
Subjt:  WEKEDCYMDFVLGLESEHQPGWSEVDFVYSSFNY-KQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDLSPAP

A0A6J1DN69 uncharacterized protein LOC1110221403.2e-0843.59Show/hide
Query:  MDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNL
        MD VLG   +  P W +VD VYS    + HW+LVAID+    I +YDSL  ++S+  L+    PL +T+PSL   C L
Subjt:  MDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNL

A0A6J1DRI2 uncharacterized protein LOC1110225152.7e-0741.03Show/hide
Query:  MDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNL
        MD VLG   +  P W +VD VYS    + HW+LV ID+    I +YD L  ++S+  L+    PL +T+PSL   C L
Subjt:  MDFVLGLESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNL

SwissProt top hitse value%identityAlignment
O46598 Hepatitis A virus cellular receptor 16.1e-0432.54Show/hide
Query:  ITIPRPTESTTPLPTEVTTLLPTKPTI--ALPFESTTPLLFETTTPLPTEAVFTESITPPTHEAIFPTETTLPTKPCLPTKTSLP--IIPTISLP-----
        +T   PT +T P+ T + T LPT  T+   LP  +T P    TTT LPT      + T PT   + PT TTLPT   LPT T+LP   +PT++LP     
Subjt:  ITIPRPTESTTPLPTEVTTLLPTKPTI--ALPFESTTPLLFETTTPLPTEAVFTESITPPTHEAIFPTETTLPTKPCLPTKTSLP--IIPTISLP-----

Query:  ------PLPIGPPPHRAYPSSIHNPTTLLPSQPAKKVILKVKGEHTVKGEP-TVKPVKIEPKNKKGEER
              P+    P     P++   PTT + S       L ++    V   P + +P +  P    G  R
Subjt:  ------PLPIGPPPHRAYPSSIHNPTTLLPSQPAKKVILKVKGEHTVKGEP-TVKPVKIEPKNKKGEER

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCAGACCTGCGACATATAAGGTTTGACATGACTGAGATGATGGCTGGCATACAGACTATGATCAGCTTACTCAGGTCTTTCTGTCAGATCACCATCCCACGCCC
TACCGAGTCCACCACCCCACTCCCCACCGAGGTCACCACCCTACTCCCCACTAAGCCCACCATTGCACTCCCGTTCGAGTCCACTACCCCACTCCTGTTCGAGACCACCA
CCCCACTCCCCACTGAGGCAGTTTTTACCGAGTCCATAACCCCTCCGACCCACGAGGCCATCTTTCCCACTGAGACCACCCTTCCCACCAAGCCATGTCTCCCCACTAAG
ACATCCCTTCCCATCATTCCCACCATTTCACTTCCTCCACTACCCATCGGCCCACCCCCTCATCGAGCATATCCCTCTAGCATTCATAATCCCACCACTTTGCTCCCCTC
ACAACCGGCTAAGAAGGTCATACTAAAGGTAAAAGGGGAGCACACGGTGAAAGGAGAGCCTACGGTGAAGCCGGTTAAGATTGAGCCAAAGAACAAGAAGGGGGAGGAAC
GAAAGTGGACTAGTAGGAAGAGAAAGTCGTCCCAACCGTACACCCCTCCAATCGAGGCAACAAAGGGGGCGACTAAGCTGCAACGATACATTCAACTAGGGGAACGCCCC
ACTGATAGGCCGCTTGTCATGACGGAGTTCATGATATGGCTGACGAGCGAGAATATGGGTCCTCATAAGCCGGGCACAGGCATACTCTTTCGAAGAAACCACATGACGAA
GGAACTCTCGATTCCATATTCATGTTCCTTCGACGGAAGTTCATTTCCAGATAGGAAATATGTGCACACTAGTACACCACGTTGCCCCATGACATATTGGTATTCACCTA
AATCTACATGGGGAACACAGCTTGTTCCATGCCATACATTGCAAAACAAGGAAAAAGTTGCAAAAGTATGGGAAAAAGAGGATTGCTATATGGATTTCGTGCTGGGGTTG
GAATCGGAACACCAGCCGGGATGGTCAGAGGTTGATTTCGTCTATAGCTCCTTCAACTACAAGCAACATTGGATACTGGTGGCGATAGACGTCAACCTCGGCCACATCCT
CCTATACGACTCCCTTCTATCATACGTCTCGTCGAGGGACCTAGTACAGTTTTTTTCGCCACTGCGCTACACTCTTCCTTCCCTATGTGAATTCTGCAATCTGAATAATT
TGAAGCCTGATCTTTCGCCCGCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTCAGACCTGCGACATATAAGGTTTGACATGACTGAGATGATGGCTGGCATACAGACTATGATCAGCTTACTCAGGTCTTTCTGTCAGATCACCATCCCACGCCC
TACCGAGTCCACCACCCCACTCCCCACCGAGGTCACCACCCTACTCCCCACTAAGCCCACCATTGCACTCCCGTTCGAGTCCACTACCCCACTCCTGTTCGAGACCACCA
CCCCACTCCCCACTGAGGCAGTTTTTACCGAGTCCATAACCCCTCCGACCCACGAGGCCATCTTTCCCACTGAGACCACCCTTCCCACCAAGCCATGTCTCCCCACTAAG
ACATCCCTTCCCATCATTCCCACCATTTCACTTCCTCCACTACCCATCGGCCCACCCCCTCATCGAGCATATCCCTCTAGCATTCATAATCCCACCACTTTGCTCCCCTC
ACAACCGGCTAAGAAGGTCATACTAAAGGTAAAAGGGGAGCACACGGTGAAAGGAGAGCCTACGGTGAAGCCGGTTAAGATTGAGCCAAAGAACAAGAAGGGGGAGGAAC
GAAAGTGGACTAGTAGGAAGAGAAAGTCGTCCCAACCGTACACCCCTCCAATCGAGGCAACAAAGGGGGCGACTAAGCTGCAACGATACATTCAACTAGGGGAACGCCCC
ACTGATAGGCCGCTTGTCATGACGGAGTTCATGATATGGCTGACGAGCGAGAATATGGGTCCTCATAAGCCGGGCACAGGCATACTCTTTCGAAGAAACCACATGACGAA
GGAACTCTCGATTCCATATTCATGTTCCTTCGACGGAAGTTCATTTCCAGATAGGAAATATGTGCACACTAGTACACCACGTTGCCCCATGACATATTGGTATTCACCTA
AATCTACATGGGGAACACAGCTTGTTCCATGCCATACATTGCAAAACAAGGAAAAAGTTGCAAAAGTATGGGAAAAAGAGGATTGCTATATGGATTTCGTGCTGGGGTTG
GAATCGGAACACCAGCCGGGATGGTCAGAGGTTGATTTCGTCTATAGCTCCTTCAACTACAAGCAACATTGGATACTGGTGGCGATAGACGTCAACCTCGGCCACATCCT
CCTATACGACTCCCTTCTATCATACGTCTCGTCGAGGGACCTAGTACAGTTTTTTTCGCCACTGCGCTACACTCTTCCTTCCCTATGTGAATTCTGCAATCTGAATAATT
TGAAGCCTGATCTTTCGCCCGCTCCTTGA
Protein sequenceShow/hide protein sequence
MQSDLRHIRFDMTEMMAGIQTMISLLRSFCQITIPRPTESTTPLPTEVTTLLPTKPTIALPFESTTPLLFETTTPLPTEAVFTESITPPTHEAIFPTETTLPTKPCLPTK
TSLPIIPTISLPPLPIGPPPHRAYPSSIHNPTTLLPSQPAKKVILKVKGEHTVKGEPTVKPVKIEPKNKKGEERKWTSRKRKSSQPYTPPIEATKGATKLQRYIQLGERP
TDRPLVMTEFMIWLTSENMGPHKPGTGILFRRNHMTKELSIPYSCSFDGSSFPDRKYVHTSTPRCPMTYWYSPKSTWGTQLVPCHTLQNKEKVAKVWEKEDCYMDFVLGL
ESEHQPGWSEVDFVYSSFNYKQHWILVAIDVNLGHILLYDSLLSYVSSRDLVQFFSPLRYTLPSLCEFCNLNNLKPDLSPAP