; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G003400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G003400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTrypsin inhibitor 1-like
Genome locationCmo_Chr16:1572271..1574451
RNA-Seq ExpressionCmoCh16G003400
SyntenyCmoCh16G003400
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0010951 - negative regulation of endopeptidase activity (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004650 - polygalacturonase activity (molecular function)
GO:0004867 - serine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000737 - Proteinase inhibitor I7, squash
IPR011052 - Proteinase/amylase inhibitor domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576919.1 hypothetical protein SDJN03_24493, partial [Cucurbita argyrosperma subsp. sororia]4.4e-7198.58Show/hide
Query:  MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWHLNRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPR
        MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWH NRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINH VPR
Subjt:  MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWHLNRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPR

Query:  KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG
        KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG
Subjt:  KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG

KAG6576920.1 hypothetical protein SDJN03_24494, partial [Cucurbita argyrosperma subsp. sororia]1.5e-3998.85Show/hide
Query:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN
        MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQ PRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN
Subjt:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN

KAG7014944.1 hypothetical protein SDJN02_22575, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-7097.87Show/hide
Query:  MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWHLNRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPR
        MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKW+ NRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINH VPR
Subjt:  MWQNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWHLNRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPR

Query:  KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG
        KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG
Subjt:  KIMKWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG

KAG7014945.1 NADH-ubiquinone oxidoreductase chain 4L, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-3998.85Show/hide
Query:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN
        MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQ PRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN
Subjt:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNN

XP_022922566.1 probable polygalacturonase At3g15720 [Cucurbita moschata]3.9e-3598.75Show/hide
Query:  MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAECI
        MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAEC+
Subjt:  MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAECI

TrEMBL top hitse value%identityAlignment
A0A0A0KT06 Uncharacterized protein3.8e-2868.81Show/hide
Query:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLL--TNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGS-------------SYAGQVS
        MEGLIPYLFHAMKKQKPRH YRSQSVGSSRSYHLL   NDESSHRRTRSD+Q P FE+LDQRS  EL HSRSV+K AFGS             SY GQVS
Subjt:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLL--TNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGS-------------SYAGQVS

Query:  NNVLFKSGS
        N   +  G+
Subjt:  NNVLFKSGS

A0A5A7TM95 Putative beta-D-xylosidase 5 isoform X21.1e-2767.86Show/hide
Query:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLL--TNDESSHRRTRSDFQTPRFEYLD-QRSGHELMHSRSVSKGAFGS-------------SYAGQV
        MEGLIPYLFHAMKKQKPRH YRSQSVGSSRSYHLL   NDESSHRRTRSD+Q P FE+LD QR   EL HSRSV+K AFGS             SYAGQV
Subjt:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLL--TNDESSHRRTRSDFQTPRFEYLD-QRSGHELMHSRSVSKGAFGS-------------SYAGQV

Query:  SNNVLFKSGSQP
        SN   +  G+ P
Subjt:  SNNVLFKSGSQP

A0A5D3E148 Trypsin inhibitor 1-like2.1e-1857.61Show/hide
Query:  MGSMKIVAVALIA--ILLVATLSSAAYAIDD-DSMDLVFDGRDINHRVPRKIM-KWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG
        M S KIV VA++   I+L A  SSA++ +DD DS +LV DGRD+N+  PRKIM K GV   E+R+CPKILM+CK+DSDCL +C+CL+ G+CG
Subjt:  MGSMKIVAVALIA--ILLVATLSSAAYAIDD-DSMDLVFDGRDINHRVPRKIM-KWGVLNHEERVCPKILMECKKDSDCLAECICLEHGYCG

A0A5N6QPG1 Uncharacterized protein1.3e-1555.1Show/hide
Query:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDE----SSHRRTRSDFQTPRFEYLDQRSG-HELMHSRSVSK------GAFGSSYAGQVSNN
        MEGLIPYL HA+KKQKP+H YRS S GS+RSYHLL N +    SSHRRTRSDFQ P  E+++QRSG  +L+ S S +K       +  +SY+ ++ NN
Subjt:  MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDE----SSHRRTRSDFQTPRFEYLDQRSG-HELMHSRSVSK------GAFGSSYAGQVSNN

A0A6J1E4H0 probable polygalacturonase At3g157201.9e-3598.75Show/hide
Query:  MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAECI
        MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAEC+
Subjt:  MGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEERVCPKILMECKKDSDCLAECI

SwissProt top hitse value%identityAlignment
P01074 Trypsin inhibitor 19.0e-1193.1Show/hide
Query:  RVCPKILMECKKDSDCLAECICLEHGYCG
        RVCP+ILMECKKDSDCLAEC+CLEHGYCG
Subjt:  RVCPKILMECKKDSDCLAECICLEHGYCG

P07853 Trypsin inhibitor 42.1e-1290.62Show/hide
Query:  HEERVCPKILMECKKDSDCLAECICLEHGYCG
        HEERVCP+ILM+CKKDSDCLAEC+CLEHGYCG
Subjt:  HEERVCPKILMECKKDSDCLAECICLEHGYCG

P10293 Trypsin inhibitor 32.5e-13100Show/hide
Query:  HEERVCPKILMECKKDSDCLAECICLEHGYCG
        HEERVCPKILMECKKDSDCLAECICLEHGYCG
Subjt:  HEERVCPKILMECKKDSDCLAECICLEHGYCG

P83397 Trypsin inhibitor 64.9e-0982.76Show/hide
Query:  RVCPKILMECKKDSDCLAECICLEHGYCG
        R+CP+ILM+CKKDSDCLAECIC EHG+CG
Subjt:  RVCPKILMECKKDSDCLAECICLEHGYCG

P83398 Trypsin inhibitor 73.8e-0982.76Show/hide
Query:  RVCPKILMECKKDSDCLAECICLEHGYCG
        R+CP+ILM+CKKDSDCLAECIC EHG+CG
Subjt:  RVCPKILMECKKDSDCLAECICLEHGYCG

Arabidopsis top hitse value%identityAlignment
AT3G19615.1 unknown protein1.9e-0850.57Show/hide
Query:  MEGLIPYLFHAMKK-QKPRHD-YRSQSVGSSRSYHLLTNDE--------SSHRRTRSDFQT-PRFEYLDQRS---GHELMHSRSVSK
        MEGLIPYL HA+KK  KP H  YRS SVGSSRSY  L   +        SSHRRTRSD+      +  DQRS   G E ++  S S+
Subjt:  MEGLIPYLFHAMKK-QKPRHD-YRSQSVGSSRSYHLLTNDE--------SSHRRTRSDFQT-PRFEYLDQRS---GHELMHSRSVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGCTTAATTCCTTATCTATTCCATGCAATGAAGAAGCAAAAGCCACGCCACGACTATAGAAGCCAATCCGTAGGTTCGAGCCGGAGCTACCATCTCTTGACTAA
CGACGAGTCATCTCACCGACGAACACGATCGGATTTTCAGACACCGAGGTTCGAGTACTTGGACCAACGGTCGGGGCACGAGTTAATGCATTCCCGTAGCGTTAGTAAGG
GTGCGTTTGGATCATCTTATGCTGGCCAGGTTTCAAATAATGTCTTGTTCAAGAGTGGAAGCCAACCAACCACTTGGAAAAATGCTTCCATGCACACAATACTCATGTGG
CAAAACGAGATGACACAAATGGGCATTGCCAGCCAGATGCAATACACATTCTTATCATTCAAGTTTCAACGTCAAATTGGGAGCAAAATCTCCTCTTTCTCTTCCATAAA
ATGGCACCTAAACCGCCAAGAGGTACTAACAAAAAGACCTGAAATGGGTAGCATGAAGATTGTCGCGGTGGCTCTAATCGCAATTTTGCTGGTAGCAACGTTGTCGTCGG
CTGCCTATGCCATCGACGACGACTCGATGGACCTTGTCTTCGACGGTCGAGACATCAATCATAGAGTCCCAAGAAAGATCATGAAATGGGGAGTGCTTAATCATGAAGAA
AGGGTATGTCCCAAAATCCTCATGGAATGCAAAAAGGACTCTGATTGCTTAGCTGAGTGTATCTGTCTCGAGCATGGATATTGTGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGCTTAATTCCTTATCTATTCCATGCAATGAAGAAGCAAAAGCCACGCCACGACTATAGAAGCCAATCCGTAGGTTCGAGCCGGAGCTACCATCTCTTGACTAA
CGACGAGTCATCTCACCGACGAACACGATCGGATTTTCAGACACCGAGGTTCGAGTACTTGGACCAACGGTCGGGGCACGAGTTAATGCATTCCCGTAGCGTTAGTAAGG
GTGCGTTTGGATCATCTTATGCTGGCCAGGTTTCAAATAATGTCTTGTTCAAGAGTGGAAGCCAACCAACCACTTGGAAAAATGCTTCCATGCACACAATACTCATGTGG
CAAAACGAGATGACACAAATGGGCATTGCCAGCCAGATGCAATACACATTCTTATCATTCAAGTTTCAACGTCAAATTGGGAGCAAAATCTCCTCTTTCTCTTCCATAAA
ATGGCACCTAAACCGCCAAGAGGTACTAACAAAAAGACCTGAAATGGGTAGCATGAAGATTGTCGCGGTGGCTCTAATCGCAATTTTGCTGGTAGCAACGTTGTCGTCGG
CTGCCTATGCCATCGACGACGACTCGATGGACCTTGTCTTCGACGGTCGAGACATCAATCATAGAGTCCCAAGAAAGATCATGAAATGGGGAGTGCTTAATCATGAAGAA
AGGGTATGTCCCAAAATCCTCATGGAATGCAAAAAGGACTCTGATTGCTTAGCTGAGTGTATCTGTCTCGAGCATGGATATTGTGGCTAA
Protein sequenceShow/hide protein sequence
MEGLIPYLFHAMKKQKPRHDYRSQSVGSSRSYHLLTNDESSHRRTRSDFQTPRFEYLDQRSGHELMHSRSVSKGAFGSSYAGQVSNNVLFKSGSQPTTWKNASMHTILMW
QNEMTQMGIASQMQYTFLSFKFQRQIGSKISSFSSIKWHLNRQEVLTKRPEMGSMKIVAVALIAILLVATLSSAAYAIDDDSMDLVFDGRDINHRVPRKIMKWGVLNHEE
RVCPKILMECKKDSDCLAECICLEHGYCG