CuGenDBv2

ClCG05G015060 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G015060
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ulp1-like peptidase
Genome location: CG_Chr05:26423696..26425925
RNA-Seq Expression: ClCG05G015060
Synteny: ClCG05G015060
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR003653 - Ulp1 protease family, C-terminal catalytic domain
                  IPR038765 - Papain-like cysteine peptidase superfamily
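
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, the Python snippet below splits such a string and derives the length of the gene span. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length_bp)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span length is end - start + 1.
    return seqid, start, end, end - start + 1


seqid, start, end, length_bp = parse_location("CG_Chr05:26423696..26425925")
print(seqid, start, end, length_bp)
```

Under that assumption the gene spans 2230 bp on CG_Chr05.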


Homology
GenBank top hits (e value, % identity, alignment)
XP_038875042.1 uncharacterized protein LOC120067568 [Benincasa hispida] (e value: 2.9e-08; % identity: 32.17)
Query:  WANVDYVYSAINIREHWILVAIDVNKD----------------ISLILLPLTCPLPSLAHFYKIDMKKHDLNSSPWPIFRAECGNIQASSTLDCSTVCLN
        W++VD+VY+  NI +HW+L+A ++N+D                +   L PLT  LPSL H+  +   K D+ +S W I +    N Q    LDC    + 
Subjt:  WANVDYVYSAINIREHWILVAIDVNKD----------------ISLILLPLTCPLPSLAHFYKIDMKKHDLNSSPWPIFRAECGNIQASSTLDCSTVCLN

Query:  LFEHMMTGFPMKNVS
        L EH++TG  +  ++
Subjt:  LFEHMMTGFPMKNVS

XP_038899753.1 uncharacterized protein LOC120086987 [Benincasa hispida] (e value: 1.9e-12; % identity: 28.74)
Query:  RDLFHELTMDNEWG-----FEELVAPRKKFLDQPEICVHKFTTFPPGIL---------------KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDY
        +  F EL   + W          +  ++K   QP++C+HKFT    G+                KT  E A  W  E+   N ++G        WA+VD+
Subjt:  RDLFHELTMDNEWG-----FEELVAPRKKFLDQPEICVHKFTTFPPGIL---------------KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDY

Query:  VYSAINIREHWILVAIDVNKDISLI----------------LLPLTCPLPSLAHFYKIDMKKHDLNSSPWPIFR
        +Y+A+NI EHWI++A+D+N+    +                L PLT  +PSL H+  +D  K DL++  W   R
Subjt:  VYSAINIREHWILVAIDVNKDISLI----------------LLPLTCPLPSLAHFYKIDMKKHDLNSSPWPIFR

XP_038902498.1 uncharacterized protein LOC120089158 [Benincasa hispida] (e value: 1.4e-10; % identity: 25.86)
Query:  IITIPLLALPPQRSMIVKKEKKVEPLSSPPLKRLITIDHPELHHVPPPL--KRLKVKKDKKIDEVKHELSEWVKKSKSSTLGKTVGNEVPQSKCFPPGK-
        I+T P    PP RS      K       PP++  IT     LH     +   R K +K K +D+    +    K +K   +   +  + P  K   PG  
Subjt:  IITIPLLALPPQRSMIVKKEKKVEPLSSPPLKRLITIDHPELHHVPPPL--KRLKVKKDKKIDEVKHELSEWVKKSKSSTLGKTVGNEVPQSKCFPPGK-

Query:  ----EDRVRYDLTHPVNVVE-PYIPRWGALFRTNKTTRD---------LFHELTMDNEWG--------FEELVAPRKKFLDQPEICVHKFTTFPPGIL--
            +  ++Y L H V V     +  W A   T    ++          F ELT  + W         FE +   ++KF  QP++C+ +FT  P GI   
Subjt:  ----EDRVRYDLTHPVNVVE-PYIPRWGALFRTNKTTRD---------LFHELTMDNEWG--------FEELVAPRKKFLDQPEICVHKFTTFPPGIL--

Query:  -------------KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVN----------------KDISLILLPLTCPLPS
                     K+  E A  W  +E + + ++G        W +VD++Y+  NI +HW+LVA D+N                K +   L  LT  LPS
Subjt:  -------------KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVN----------------KDISLILLPLTCPLPS

Query:  LAHFYKIDMKKHDLNSSPWPI
        L H+  +   K D+ +S W I
Subjt:  LAHFYKIDMKKHDLNSSPWPI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SSX7 Ulp1-like peptidase (e value: 3.4e-07; % identity: 30.08)
Query:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS
        TIW ++  E   N  +G + NH  GW +V+YV + INI+EHW+ +A D+ K    +                  +P  C +PSLA    +++        
Subjt:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS

Query:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMT
        PWPI R++   +Q   +LDC   C    E ++T
Subjt:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMT

A0A5A7TFJ6 Ulp1-like peptidase (e value: 3.4e-07; % identity: 30.08)
Query:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS
        TIW ++  E   N  +G + NH  GW +V+YV + INI+EHW+ +A D+ K    +                  +P  C +PSLA    +++        
Subjt:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS

Query:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMT
        PWPI R++   +Q   +LDC   C    E ++T
Subjt:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMT

A0A5A7TW86 MuDRA-like transposase (e value: 6.9e-08; % identity: 30.6)
Query:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS
        TIW ++  E   N  +G + +H  GW +V+YV S INI+EHW+++A D+ K    +                  +P  C +PSLA    +++        
Subjt:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLIL-----------------LPLTCPLPSLAHFYKIDMKKHDLNSS

Query:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMTG
        PWPI R++   +Q   +LDC   C    E ++TG
Subjt:  PWPIFRAECGNIQASSTLDCSTVCLNLFEHMMTG

A0A5A7UYV8 Ulp1-like peptidase (e value: 1.5e-07; % identity: 31.06)
Query:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNK-------------DISLILLPLTCP---LPSLAHFYKIDMKKHDLNSSP
        TIW ++  E   N ++G + NH  GW +V+YV   INI+EHW+ +A D+ K             +  L+   L  P   +PSLA    +++  +     P
Subjt:  TIWAKE--ENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNK-------------DISLILLPLTCP---LPSLAHFYKIDMKKHDLNSSP

Query:  WPIFRAECGNIQASSTLDCSTVCLNLFEHMMT
        WPI R++   +Q   +LDC   C    E ++T
Subjt:  WPIFRAECGNIQASSTLDCSTVCLNLFEHMMT

A0A6J1CPP7 uncharacterized protein LOC111013439 (e value: 1.8e-08; % identity: 29.46)
Query:  KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLI----------------LLPLTCPLPSLAHFYKI-DMKKH
        K  +++ + W + +  +  +LG   +    W +VD+VY  ++IR HW+LVAI++N+   L+                L PL+  +PSL + + + +   +
Subjt:  KTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAIDVNKDISLI----------------LLPLTCPLPSLAHFYKI-DMKKH

Query:  DLNSSPWPIFRAECGNIQASSTLDCSTVC
         L  +PWPIFR    N Q S  +DC  +C
Subjt:  DLNSSPWPIFRAECGNIQASSTLDCSTVC

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTTCAACTCTCCTGTGGTATACAAATATGTTATCCCAACGTCTCTTTGTGGTGATCATCCCCACTCTTACTATGTCTAATGCGGAGAAACAATATAAGGACACTCC
AGTCGACCAACAGGTCATACACGTGGAAACCCAAGAAGAAGAAATGATACTGTCACATGAGGTGCCCCTTAAGGAGGCATTTTATACACATGACGAAGAAGGGGGTCATG
TAGAGAATGATTCATGTCCTCATGTTTTTGAGCTTGTTGTCACCACACAAAGACGAAGGTCCAAGACTACTTTGGGGGAGGTGAAGTCTGATTTGGGTGATGTAAAGATG
CTACTATGGACGATCACTAAGTTATTGCAGACGTTGTGTTGGAATATCACAACCACTACAACCACGATCATGATGGCGATGATAATGATGACGATGACAATGATCATATT
CATCACAAAGAGGAACCCACCACCGAGCCTACTTACCACGCCCACCACCATGGAGCCTACTACTACCACCATCGAGCCCACCACCATCAAGCCTACTACCACCAGCAGCA
CCACCAGCAGCAAGTCTACCACCACTATCCCCAAGACCACCATCACTACCATTGAGCCTACCACCACTGAGATCACTACTATCCTTGAGACCACCATCATTACCATTCCT
CTACTAGCGCTACCACCTCAGAGGTCGATGATAGTTAAAAAAGAGAAAAAGGTTGAGCCACTCTCATCTCCACCACTAAAGAGGCTGATAACTATTGATCATCCCGAGCT
ACATCATGTCCCACCTCCACTTAAAAGATTAAAGGTAAAGAAGGATAAAAAGATCGATGAAGTAAAACATGAATTGAGTGAATGGGTGAAGAAGTCAAAGTCCTCAACTC
TGGGAAAGACAGTTGGCAATGAAGTACCCCAATCTAAGTGTTTTCCACCTGGAAAGGAAGATCGGGTGAGGTACGATCTCACCCATCCAGTCAATGTTGTGGAGCCTTAC
ATTCCTAGATGGGGGGCATTATTTAGGACGAACAAGACCACTCGAGATTTGTTCCATGAGTTGACCATGGACAATGAATGGGGTTTTGAAGAGCTTGTCGCTCCTAGGAA
AAAGTTTCTTGATCAGCCTGAGATATGTGTCCATAAGTTCACAACATTCCCACCTGGAATATTGAAGACTAAGAAAGAAGTTGCGACGATATGGGCTAAAGAAGAAAACA
TCTTAAACATTATGCTAGGTTTAGAGCCAAATCATCAATCGGGGTGGGCAAACGTAGACTACGTGTACAGTGCTATCAACATCCGTGAACATTGGATCCTAGTCGCAATT
GATGTGAACAAAGATATTTCCTTGATATTGTTGCCTTTGACATGCCCATTACCGTCATTGGCTCATTTCTACAAGATTGATATGAAGAAACACGATCTCAATTCTAGCCC
TTGGCCAATTTTTCGTGCAGAGTGTGGAAACATACAAGCGAGCAGTACATTAGATTGCAGTACCGTCTGTTTAAATCTTTTTGAACATATGATGACAGGATTCCCGATGA
AGAATGTATCAATTTAG
mRNA sequence
ATGTCTTCAACTCTCCTGTGGTATACAAATATGTTATCCCAACGTCTCTTTGTGGTGATCATCCCCACTCTTACTATGTCTAATGCGGAGAAACAATATAAGGACACTCC
AGTCGACCAACAGGTCATACACGTGGAAACCCAAGAAGAAGAAATGATACTGTCACATGAGGTGCCCCTTAAGGAGGCATTTTATACACATGACGAAGAAGGGGGTCATG
TAGAGAATGATTCATGTCCTCATGTTTTTGAGCTTGTTGTCACCACACAAAGACGAAGGTCCAAGACTACTTTGGGGGAGGTGAAGTCTGATTTGGGTGATGTAAAGATG
CTACTATGGACGATCACTAAGTTATTGCAGACGTTGTGTTGGAATATCACAACCACTACAACCACGATCATGATGGCGATGATAATGATGACGATGACAATGATCATATT
CATCACAAAGAGGAACCCACCACCGAGCCTACTTACCACGCCCACCACCATGGAGCCTACTACTACCACCATCGAGCCCACCACCATCAAGCCTACTACCACCAGCAGCA
CCACCAGCAGCAAGTCTACCACCACTATCCCCAAGACCACCATCACTACCATTGAGCCTACCACCACTGAGATCACTACTATCCTTGAGACCACCATCATTACCATTCCT
CTACTAGCGCTACCACCTCAGAGGTCGATGATAGTTAAAAAAGAGAAAAAGGTTGAGCCACTCTCATCTCCACCACTAAAGAGGCTGATAACTATTGATCATCCCGAGCT
ACATCATGTCCCACCTCCACTTAAAAGATTAAAGGTAAAGAAGGATAAAAAGATCGATGAAGTAAAACATGAATTGAGTGAATGGGTGAAGAAGTCAAAGTCCTCAACTC
TGGGAAAGACAGTTGGCAATGAAGTACCCCAATCTAAGTGTTTTCCACCTGGAAAGGAAGATCGGGTGAGGTACGATCTCACCCATCCAGTCAATGTTGTGGAGCCTTAC
ATTCCTAGATGGGGGGCATTATTTAGGACGAACAAGACCACTCGAGATTTGTTCCATGAGTTGACCATGGACAATGAATGGGGTTTTGAAGAGCTTGTCGCTCCTAGGAA
AAAGTTTCTTGATCAGCCTGAGATATGTGTCCATAAGTTCACAACATTCCCACCTGGAATATTGAAGACTAAGAAAGAAGTTGCGACGATATGGGCTAAAGAAGAAAACA
TCTTAAACATTATGCTAGGTTTAGAGCCAAATCATCAATCGGGGTGGGCAAACGTAGACTACGTGTACAGTGCTATCAACATCCGTGAACATTGGATCCTAGTCGCAATT
GATGTGAACAAAGATATTTCCTTGATATTGTTGCCTTTGACATGCCCATTACCGTCATTGGCTCATTTCTACAAGATTGATATGAAGAAACACGATCTCAATTCTAGCCC
TTGGCCAATTTTTCGTGCAGAGTGTGGAAACATACAAGCGAGCAGTACATTAGATTGCAGTACCGTCTGTTTAAATCTTTTTGAACATATGATGACAGGATTCCCGATGA
AGAATGTATCAATTTAG
Protein sequence
MSSTLLWYTNMLSQRLFVVIIPTLTMSNAEKQYKDTPVDQQVIHVETQEEEMILSHEVPLKEAFYTHDEEGGHVENDSCPHVFELVVTTQRRRSKTTLGEVKSDLGDVKM
LLWTITKLLQTLCWNITTTTTTIMMAMIMMTMTMIIFITKRNPPPSLLTTPTTMEPTTTTIEPTTIKPTTTSSTTSSKSTTTIPKTTITTIEPTTTEITTILETTIITIP
LLALPPQRSMIVKKEKKVEPLSSPPLKRLITIDHPELHHVPPPLKRLKVKKDKKIDEVKHELSEWVKKSKSSTLGKTVGNEVPQSKCFPPGKEDRVRYDLTHPVNVVEPY
IPRWGALFRTNKTTRDLFHELTMDNEWGFEELVAPRKKFLDQPEICVHKFTTFPPGILKTKKEVATIWAKEENILNIMLGLEPNHQSGWANVDYVYSAINIREHWILVAI
DVNKDISLILLPLTCPLPSLAHFYKIDMKKHDLNSSPWPIFRAECGNIQASSTLDCSTVCLNLFEHMMTGFPMKNVSI
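
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that in plain Python, assuming the standard genetic code applies to this nuclear gene. The `translate` helper is hypothetical and not part of any CuGenDB tooling. The two demonstration strings are the first 33 and last 21 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the CDS: should match the start of the protein (MSSTLLWYTNM...).
print(translate("ATGTCTTCAACTCTCCTGTGGTATACAAATATG"))
# Last 21 nt of the CDS, ending in the TAG stop: should match the C-terminus (...MKNVSI).
print(translate("ATGAAGAATGTATCAATTTAG"))
```

Passing the full CDS, with its line breaks joined, to `translate` should reproduce the 518-residue protein sequence listed above if the record is internally consistent.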