CuGenDBv2

Moc03g20030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g20030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: cysteine proteinase inhibitor 1-like
Genome location: chr3:13488677..13491833
RNA-Seq Expression: Moc03g20030
Synteny: Moc03g20030
Gene Ontology terms: GO:0010951 - negative regulation of endopeptidase activity (biological process)
                     GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains: IPR000010 - Cystatin domain
                  IPR018073 - Proteinase inhibitor I25, cystatin, conserved site
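
The genome location field can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state); the function names parse_location and span_length are illustrative only.

```python
import re


def parse_location(location):
    """Split a 'name:start..end' string into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("chr3:13488677..13491833")
print(name, start, end, span_length(start, end))
```

Under that assumption the gene span covers 3157 bp on chr3.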


Homology
GenBank top hits (e value, % identity, alignment)
MQM02766.1 hypothetical protein [Colocasia esculenta] | e value: 7.7e-40 | % identity: 47.67
Query:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSF-----NPFVFLG
        VG Y PI+D++ PY+QEI ++   E+N++ G      V+ +VVSGE+QVVSGTNYKL++        + YE +VYDKPWE++R LTSF          +G
Subjt:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSF-----NPFVFLG

Query:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRSEPE
        GY+PI N+ DP++ EI  +A  EYN++      LVF  VVSGE+QVVAGTNYKL++  +   +   YE +VYDK WE  RELTS   F   PE
Subjt:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRSEPE

MQM15186.1 hypothetical protein [Colocasia esculenta] | e value: 2.9e-39 | % identity: 47.83
Query:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSF-----NPFVFLG
        V  Y+PI D++ PY+QEI ++A  E+N++ G      V+ +VVSGE+QVV+GTNYKL++        + YE +VYDKPWE++R LTSF          +G
Subjt:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSF-----NPFVFLG

Query:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS
        GYQPI N+ DP++ EI ++A  EYN++      LVF  VVSGE+QVVAGTNYKL++  +   +   YE ++YDK WE  RELTS
Subjt:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS

MQM16017.1 hypothetical protein [Colocasia esculenta] | e value: 1.8e-41 | % identity: 48.45
Query:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSFNPF---------
        VG Y+PI ++  P++QEIA +A  +YN+++G +    V+  VVSGE+QV++GTNYKL +       I  YE +VYDK WE  + LTSF+P          
Subjt:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSFNPF---------

Query:  --VFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS-TLV
          + +GGYQPI+N+ DP+I EIG++A  E+N++      LVF +VVSGE+QVVAGTNYKL++  K   +   YE +VYDKPWE  RELTS TLV
Subjt:  --VFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS-TLV

XP_020244461.1 cysteine proteinase inhibitor 1-like [Asparagus officinalis] | e value: 1.0e-36 | % identity: 46.88
Query:  AAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNV-TINKQIHNYEVIVYDKPWESYRNLTSFNP
        A   P P   G Y PI +++ P++ EI ++A  E+N+    ++T  V+ KV+ GE QVV+G NYKL++     N ++  YEVIVY++ WE Y  LTSFNP
Subjt:  AAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNV-TINKQIHNYEVIVYDKPWESYRNLTSFNP

Query:  FVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTK---RGSLNLNYEVIVYDKPWESHRELTS
            GGY PIKNLSDP++ EIG++A  E+N++    T LVF KV+ GE QVVAG NYKL++  K   R +    YEVIV++K WE   +LTS
Subjt:  FVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTK---RGSLNLNYEVIVYDKPWESHRELTS

XP_022147176.1 cysteine proteinase inhibitor 1-like [Momordica charantia] | e value: 2.5e-107 | % identity: 96.97
Query:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
        MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
Subjt:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS

Query:  FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS
        FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS  ++ +
Subjt:  FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS

TrEMBL top hits (e value, % identity, alignment)
A0A444FRQ3 Uncharacterized protein | e value: 2.4e-31 | % identity: 41.12
Query:  AAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIH-NYEVIVYDKPWESYRNLTSFNP
        A  PP     G +  I+D+  P+++EIA +A  E+N+ +    T    +KV  GE QVV+G NY+L+L V         YE +V++K WE +R LTSF  
Subjt:  AAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIH-NYEVIVYDKPWESYRNLTSFNP

Query:  FVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTSTLVFRS
         V +GG+ PI+N+ DP ++EI  +A  E+N +    T L   KVV GE QVVAGTNYKL+L  K G   +   YE +V++K WE  R+LTS ++  +
Subjt:  FVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTSTLVFRS

A0A4S8KBT4 Uncharacterized protein | e value: 2.9e-32 | % identity: 43.09
Query:  GTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIH-NYEVIVYDKPWESYRNLTSFNPFVFLGGYQPI
        G +  I+D+  P+++EIA +A  E+N+ +    T    +KV  GE QVV+GTNY+L+L V         YE +V++KPWE+++ L SF+  V +GG+ PI
Subjt:  GTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIH-NYEVIVYDKPWESYRNLTSFNPFVFLGGYQPI

Query:  KNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTS
        KN+SDP + EI  +A  E+N +    + L   KVV GE QVVAG NYKL+L  K G + +   YE +V++K WE  R LTS
Subjt:  KNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTS

A0A6J1D1J8 cysteine proteinase inhibitor 1-like | e value: 1.2e-107 | % identity: 96.97
Query:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
        MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
Subjt:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS

Query:  FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS
        FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS  ++ +
Subjt:  FNPFVFLGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS

A0A6J1DV34 cysteine proteinase inhibitor 5-like | e value: 2.4e-31 | % identity: 74.73
Query:  GGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS
        G Y+PIKNLSDPYISEIGRYACIEYNRK P+PTPLVF+KVVSGE+QVV G NYKLI+Y K G L   YEVIVYD PW+SHRELTS  ++ +
Subjt:  GGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRS

V4T622 Uncharacterized protein | e value: 6.2e-35 | % identity: 42.46
Query:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSFNPFVFLGGYQPI
        +G ++PI+D    ++ EI ++A  EYN+R   S +   ++ V  GE QVVSGTNY+LIL V        +E +V +KPWE +++LTSF P   LGG++PI
Subjt:  VGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSFNPFVFLGGYQPI

Query:  KNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS
        ++  + ++ EIG++A  EYN+++     L F+ V  GE QVV+GTNY+LIL  K G     +E +V++KPWE  + LTS
Subjt:  KNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTS

SwissProt top hits (e value, % identity, alignment)
P86472 Cysteine proteinase inhibitor 1 | e value: 9.9e-14 | % identity: 38.61
Query:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
        ++AA        A G + PI+++N+  +Q++A++A  E+N++   ++    YQ VV G  QVV+GTNY+L++       + NYE +V+DKPW  +RNLTS
Subjt:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS

Query:  F
        F
Subjt:  F

Q10Q46 Cysteine proteinase inhibitor 6 | e value: 3.2e-12 | % identity: 40.7
Query:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTS
        G+ PIKN+ DP+I E+GR+A  E NR +P+   L F +V  GE+QVV+G NY+L +    G  ++  +Y  +V+++ W + R+L S
Subjt:  GYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL--NYEVIVYDKPWESHRELTS

Q41916 Cysteine proteinase inhibitor 5 | e value: 2.4e-15 | % identity: 48.28
Query:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRG-SLNLNYEVIVYDKPWESHRELTS
        +GG+ PI N++DP + EIG +A  EYN++  + + L F+ VVSGE QVV+GTNY+L +    G  ++ NY  IV+DKPW   R LTS
Subjt:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRG-SLNLNYEVIVYDKPWESHRELTS

Q6TPK4 Cysteine proteinase inhibitor 1 | e value: 1.7e-13 | % identity: 38.61
Query:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS
        ++AA        A G + PI+ +N+  +Q++A++A  E+N++   ++    YQ VV G  QVV+GTNY+L++       + NYE +V+DKPW  +RNLTS
Subjt:  MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTS

Query:  F
        F
Subjt:  F

Q84WT8 Cysteine proteinase inhibitor 4 | e value: 1.3e-10 | % identity: 42.53
Query:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL-NYEVIVYDKPWESHRELTS
        LG  +PIKN+SDP +  + +YA  E+N+++     LVF KVV G  QVV+GT Y L +  K G   + NYE +V +K W   + L S
Subjt:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL-NYEVIVYDKPWESHRELTS

Arabidopsis top hits (e value, % identity, alignment)
AT2G40880.1 cystatin A | e value: 3.4e-09 | % identity: 37.93
Query:  VFLGGYQPIK-NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL
        + LGG   ++ N +   I  + R+A  E+N++      L FKK+V   EQVVAGT Y L L  K G    N+E  V+ KPW + ++L
Subjt:  VFLGGYQPIK-NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL

AT3G12490.1 cystatin B | e value: 3.9e-05 | % identity: 35.53
Query:  NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL
        N +   +  + R+A  E+N+K      L F +VV  +EQVVAGT + L L          YE  V+ KPW + +EL
Subjt:  NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL

AT3G12490.2 cystatin B | e value: 3.9e-05 | % identity: 35.53
Query:  NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL
        N +   +  + R+A  E+N+K      L F +VV  +EQVVAGT + L L          YE  V+ KPW + +EL
Subjt:  NLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHREL

AT4G16500.1 Cystatin/monellin superfamily protein | e value: 9.5e-12 | % identity: 42.53
Query:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL-NYEVIVYDKPWESHRELTS
        LG  +PIKN+SDP +  + +YA  E+N+++     LVF KVV G  QVV+GT Y L +  K G   + NYE +V +K W   + L S
Subjt:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNL-NYEVIVYDKPWESHRELTS

AT5G47550.1 Cystatin/monellin superfamily protein | e value: 1.7e-16 | % identity: 48.28
Query:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRG-SLNLNYEVIVYDKPWESHRELTS
        +GG+ PI N++DP + EIG +A  EYN++  + + L F+ VVSGE QVV+GTNY+L +    G  ++ NY  IV+DKPW   R LTS
Subjt:  LGGYQPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRG-SLNLNYEVIVYDKPWESHRELTS


Sequences
CDS sequence
ATGGCAGCAGCTGCAGCTGCGCCACCGTCCCCTCATGCAGTCGGCACCTATGAGCCGATCGACGATGTGAATGCCCCATACATTCAAGAAATCGCAAGGTATGCATGTAT
TGAGTACAACAGGAGACAAGGGCATTCCTCTACGCCCTATGTATACCAGAAAGTGGTGAGTGGGGAGCGGCAGGTGGTGAGTGGAACTAACTACAAGCTCATATTAAATG
TCACCATCAACAAACAAATTCATAACTATGAGGTCATCGTGTATGACAAGCCATGGGAGAGCTACAGAAACCTTACGTCTTTCAATCCCTTCGTATTTTTAGGCGGCTAT
CAGCCGATCAAAAATTTGAGTGACCCATACATTTCTGAAATCGGAAGGTATGCATGCATTGAGTACAACAGGAAAAACCCGAATCCTACACCACTTGTATTCAAGAAAGT
GGTGAGTGGGGAGGAACAGGTGGTGGCTGGAACTAACTACAAGCTCATATTGTACACGAAACGCGGCAGTCTAAATCTAAACTATGAGGTCATCGTGTATGACAAGCCAT
GGGAGAGCCACAGAGAACTTACGTCCACTCTAGTGTTCAGGTCGGAACCGGAGACTGGGAGTGAGCTCGATTCGTGCAGAACCGTTGTGCAAATTCCTGCATAA
mRNA sequence
ATGGCAGCAGCTGCAGCTGCGCCACCGTCCCCTCATGCAGTCGGCACCTATGAGCCGATCGACGATGTGAATGCCCCATACATTCAAGAAATCGCAAGGTATGCATGTAT
TGAGTACAACAGGAGACAAGGGCATTCCTCTACGCCCTATGTATACCAGAAAGTGGTGAGTGGGGAGCGGCAGGTGGTGAGTGGAACTAACTACAAGCTCATATTAAATG
TCACCATCAACAAACAAATTCATAACTATGAGGTCATCGTGTATGACAAGCCATGGGAGAGCTACAGAAACCTTACGTCTTTCAATCCCTTCGTATTTTTAGGCGGCTAT
CAGCCGATCAAAAATTTGAGTGACCCATACATTTCTGAAATCGGAAGGTATGCATGCATTGAGTACAACAGGAAAAACCCGAATCCTACACCACTTGTATTCAAGAAAGT
GGTGAGTGGGGAGGAACAGGTGGTGGCTGGAACTAACTACAAGCTCATATTGTACACGAAACGCGGCAGTCTAAATCTAAACTATGAGGTCATCGTGTATGACAAGCCAT
GGGAGAGCCACAGAGAACTTACGTCCACTCTAGTGTTCAGGTCGGAACCGGAGACTGGGAGTGAGCTCGATTCGTGCAGAACCGTTGTGCAAATTCCTGCATAA
Protein sequence
MAAAAAAPPSPHAVGTYEPIDDVNAPYIQEIARYACIEYNRRQGHSSTPYVYQKVVSGERQVVSGTNYKLILNVTINKQIHNYEVIVYDKPWESYRNLTSFNPFVFLGGY
QPIKNLSDPYISEIGRYACIEYNRKNPNPTPLVFKKVVSGEEQVVAGTNYKLILYTKRGSLNLNYEVIVYDKPWESHRELTSTLVFRSEPETGSELDSCRTVVQIPA
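
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The function name translate is illustrative, and only the first and last stretches of the CDS above are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 and last 15 nucleotides of the CDS sequence above.
print(translate("ATGGCAGCAGCTGCAGCTGCGCCACCGTCCCCTCATGCAGTCGGCACC"))
print(translate("CAAATTCCTGCATAA"))
```

Passing the full CDS, with its line breaks joined, to translate gives a sequence that can be compared directly with the protein sequence above.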