; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G014960 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G014960
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionInhibitor I9 domain-containing protein
Genome locationCG_Chr07:31373748..31375157
RNA-Seq ExpressionClCG07G014960
SyntenyClCG07G014960
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR010259 - Peptidase S8 propeptide/proteinase inhibitor I9


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044911.1 subtilisin-like protease SBT2.5 [Cucumis melo var. makuwa]8.3e-4682.3Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRYNG H   G G GG GGECYFVFM YDP+YERLRADR GEGA+E+D YLSRKHDE+L RWLEPGSYRKISSFLIVDGFS EI E+QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPLI
        VRVVEKN+DQPLI
Subjt:  VRVVEKNEDQPLI

XP_008451935.1 PREDICTED: uncharacterized protein LOC103493088 [Cucumis melo]1.9e-4581.42Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRYNG H   G G GG GGECYFVFM YDP+YERLRADR GEGA+E+D YLSRKHDE+L RWLEPGSYRKISSFLIVDGFS EI E+QA VLRSA+G
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPLI
        VRVVEKN+DQPLI
Subjt:  VRVVEKNEDQPLI

XP_011653213.1 uncharacterized protein LOC105435187 [Cucumis sativus]2.5e-4279.65Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRYNG H  GG G      ECYFVFM YDP+YERLRADR GEGA+ELD YLSRKHDE+L RWLEPGSYRKISSFLIVDGFS EI E+QA VLR AEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPLI
        VRVVEKN+DQPLI
Subjt:  VRVVEKNEDQPLI

XP_022136905.1 uncharacterized protein LOC111008483 [Momordica charantia]4.3e-4277.24Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        M ENR  G  +C       GGGECYFVFMNYDP+YERLRADRSG+GA+ELD+YLSRKHDE+LGRWLEP SYRKISSFLIVDGFSAEITE+QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPL-INEKKRGEEG
        VRVVEKN+DQP  IN  +R EEG
Subjt:  VRVVEKNEDQPL-INEKKRGEEG

XP_038877291.1 uncharacterized protein LOC120069570 [Benincasa hispida]6.6e-5980.95Show/hide
Query:  MVLVHALKSSIFSQPLPESQRKTRMQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISS
        MVLVHAL SSIFS+ LPE+++K  MQENRY+GGH  GG G GG GGECYFVFMNYDP+YERLRADRSGEG +ELD+YLSRKHDEILGRWLEPGSYRKISS
Subjt:  MVLVHALKSSIFSQPLPESQRKTRMQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISS

Query:  FLIVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLINEKKRGEEGN
        FLIVDGFS EITE+QA VLR AEGVRVVEKN+DQPLIN   R EEG+
Subjt:  FLIVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLINEKKRGEEGN

TrEMBL top hitse value%identityAlignment
A0A1S3BSM3 uncharacterized protein LOC1034930889.0e-4681.42Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRYNG H   G G GG GGECYFVFM YDP+YERLRADR GEGA+E+D YLSRKHDE+L RWLEPGSYRKISSFLIVDGFS EI E+QA VLRSA+G
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPLI
        VRVVEKN+DQPLI
Subjt:  VRVVEKNEDQPLI

A0A5D3D232 Subtilisin-like protease SBT2.54.0e-4682.3Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRYNG H   G G GG GGECYFVFM YDP+YERLRADR GEGA+E+D YLSRKHDE+L RWLEPGSYRKISSFLIVDGFS EI E+QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPLI
        VRVVEKN+DQPLI
Subjt:  VRVVEKNEDQPLI

A0A6J1C587 uncharacterized protein LOC1110084832.1e-4277.24Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        M ENR  G  +C       GGGECYFVFMNYDP+YERLRADRSG+GA+ELD+YLSRKHDE+LGRWLEP SYRKISSFLIVDGFSAEITE+QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQPL-INEKKRGEEG
        VRVVEKN+DQP  IN  +R EEG
Subjt:  VRVVEKNEDQPL-INEKKRGEEG

A0A6J1ETN3 subtilisin-like protease SBT2.55.1e-4178.45Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRY       G G GGGGG CYFVFMNYDP+YERLRADRSGEGA+ELD YLSRKHD +LGR LEPG+YRK+SSFLIVDGFS EITE QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQ--PLIN
        VRVVEKN+DQ  P IN
Subjt:  VRVVEKNEDQ--PLIN

A0A6J1J855 subtilisin-like protease SBT2.53.3e-4077.59Show/hide
Query:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG
        MQENRY+G         GGGGG CYFVFMNYDP+YERLRADRSGEGA+ELD YLSRKHD +LGR LEPG+YRK+SSFLIVDGFS EITE QA VLRSAEG
Subjt:  MQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVDGFSAEITENQAKVLRSAEG

Query:  VRVVEKNEDQ--PLIN
        VRVVEKN+DQ  P IN
Subjt:  VRVVEKNEDQ--PLIN

SwissProt top hitse value%identityAlignment
F4HYR6 Subtilisin-like protease SBT2.44.8e-0442.11Show/hide
Query:  HDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLI
        HDEILG  LE GSY K+ SF  +++  +   T +QAK L   +GV+ VE+++   L+
Subjt:  HDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLI

O64481 Subtilisin-like protease SBT2.51.2e-0742.86Show/hide
Query:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN
        + D S E       +L RKHD ILG   E GSY+K+ S+  +++GF+A ++  QA+ LR A GVR V+K+
Subjt:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN

Q9SZV5 Subtilisin-like protease SBT2.65.1e-0635.71Show/hide
Query:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN
        + D + E       +L RKHD +LG     GSY+K+ S+  +++GF+A ++ +QA++LR A GV+ V+++
Subjt:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN

Arabidopsis top hitse value%identityAlignment
AT1G62340.1 PA-domain containing subtilase family protein3.4e-0542.11Show/hide
Query:  HDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLI
        HDEILG  LE GSY K+ SF  +++  +   T +QAK L   +GV+ VE+++   L+
Subjt:  HDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKNEDQPLI

AT2G19170.1 subtilisin-like serine protease 38.7e-0942.86Show/hide
Query:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN
        + D S E       +L RKHD ILG   E GSY+K+ S+  +++GF+A ++  QA+ LR A GVR V+K+
Subjt:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN

AT4G30020.1 PA-domain containing subtilase family protein3.7e-0735.71Show/hide
Query:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN
        + D + E       +L RKHD +LG     GSY+K+ S+  +++GF+A ++ +QA++LR A GV+ V+++
Subjt:  RADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFL-IVDGFSAEITENQAKVLRSAEGVRVVEKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTAGTTCATGCATTAAAAAGCTCAATATTTTCCCAACCTTTGCCGGAAAGCCAGAGGAAAACCAGAATGCAAGAGAACAGATACAACGGTGGACACAAT
TGTGGTGGCAGCGGCAGCGGTGGCGGTGGCGGCGAATGTTACTTCGTTTTCATGAACTATGATCCTCAATATGAACGTCTTCGGGCTGATCGATCTGGGGAAGGG
GCAAATGAGCTTGATGTGTATCTGAGTAGAAAGCACGATGAGATTCTTGGGAGGTGGTTGGAGCCCGGAAGTTACAGAAAAATCTCGTCTTTTCTAATCGTTGAC
GGATTTTCGGCCGAGATTACTGAAAATCAGGCAAAAGTGCTTAGATCTGCAGAGGGAGTGAGGGTGGTGGAGAAGAATGAAGACCAGCCTCTCATTAATGAGAAA
AAAAGGGGTGAAGAAGGCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTAGTTCATGCATTAAAAAGCTCAATATTTTCCCAACCTTTGCCGGAAAGCCAGAGGAAAACCAGAATGCAAGAGAACAGATACAACGGTGGACACAAT
TGTGGTGGCAGCGGCAGCGGTGGCGGTGGCGGCGAATGTTACTTCGTTTTCATGAACTATGATCCTCAATATGAACGTCTTCGGGCTGATCGATCTGGGGAAGGG
GCAAATGAGCTTGATGTGTATCTGAGTAGAAAGCACGATGAGATTCTTGGGAGGTGGTTGGAGCCCGGAAGTTACAGAAAAATCTCGTCTTTTCTAATCGTTGAC
GGATTTTCGGCCGAGATTACTGAAAATCAGGCAAAAGTGCTTAGATCTGCAGAGGGAGTGAGGGTGGTGGAGAAGAATGAAGACCAGCCTCTCATTAATGAGAAA
AAAAGGGGTGAAGAAGGCAATTAG
Protein sequenceShow/hide protein sequence
MVLVHALKSSIFSQPLPESQRKTRMQENRYNGGHNCGGSGSGGGGGECYFVFMNYDPQYERLRADRSGEGANELDVYLSRKHDEILGRWLEPGSYRKISSFLIVD
GFSAEITENQAKVLRSAEGVRVVEKNEDQPLINEKKRGEEGN