; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g33400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g33400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCystatin domain-containing protein
Genome locationchr3:23681570..23683123
RNA-Seq ExpressionMoc03g33400
SyntenyMoc03g33400
Gene Ontology termsGO:0010951 - negative regulation of endopeptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]1.9e-61100Show/hide
Query:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
        RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
Subjt:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV

Query:  LAGISPDDFDVQLCRPEPTN
        LAGISPDDFDVQLCRPEPTN
Subjt:  LAGISPDDFDVQLCRPEPTN

XP_022137489.1 uncharacterized protein LOC111008920 isoform X2 [Momordica charantia]7.2e-4886.67Show/hide
Query:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
        RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEE                GASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
Subjt:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV

Query:  LAGISPDDFDVQLCRPEPTN
        LAGISPDDFDVQLCRPEPTN
Subjt:  LAGISPDDFDVQLCRPEPTN

XP_022137526.1 uncharacterized protein LOC111008952 [Momordica charantia]5.7e-2954.17Show/hide
Query:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECA--GQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR
        YY A++++EGFD+PSFP +YA   I  +    L SEE++ECA   +A+ ++N +NG SFEFVKM+KA ++A +G +++LTF+VKQ G+PP+SPT TLQAR
Subjt:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECA--GQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR

Query:  VLAGISPDDFDVQLCRPEPT
        VL G+    FDV+LCRPEP+
Subjt:  VLAGISPDDFDVQLCRPEPT

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]6.1e-2347.11Show/hide
Query:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR
        YY A++ES+GFDVP F   YAF++I+P+ L  +  + +EV+  A +AIKHYNNENG +FE V ++KAN     G ++++TF VK  G P + P+ T QA+
Subjt:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR

Query:  VLAGIS-PDDFDVQLCRPEPT
        V   I   D  DV+LCRP+P+
Subjt:  VLAGIS-PDDFDVQLCRPEPT

XP_023525925.1 uncharacterized protein LOC111789396 [Cucurbita pepo subsp. pepo]2.8e-2343.24Show/hide
Query:  LDDQGTGYLQNRATLNRIARSICLIRVARFCRYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQ
        LDD    Y      +NR   S          RYY+ +++++GFDVP+FP TYA  +I P+    L    +R+CA +AI HYN  NG +FEFVK++KAN Q
Subjt:  LDDQGTGYLQNRATLNRIARSICLIRVARFCRYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQ

Query:  AALGELFFLTFQVKQTGAPPDSPTTTLQARVLAGISPDDFDVQLCRPE
           G  +++TF VKQ G   + PTTT +A+VL GI  D  +V LCRP+
Subjt:  AALGELFFLTFQVKQTGAPPDSPTTTLQARVLAGISPDDFDVQLCRPE

TrEMBL top hitse value%identityAlignment
A0A6J1C7D4 uncharacterized protein LOC111008920 isoform X23.5e-4886.67Show/hide
Query:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
        RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEE                GASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
Subjt:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV

Query:  LAGISPDDFDVQLCRPEPTN
        LAGISPDDFDVQLCRPEPTN
Subjt:  LAGISPDDFDVQLCRPEPTN

A0A6J1C8H7 uncharacterized protein LOC1110089522.8e-2954.17Show/hide
Query:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECA--GQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR
        YY A++++EGFD+PSFP +YA   I  +    L SEE++ECA   +A+ ++N +NG SFEFVKM+KA ++A +G +++LTF+VKQ G+PP+SPT TLQAR
Subjt:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECA--GQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR

Query:  VLAGISPDDFDVQLCRPEPT
        VL G+    FDV+LCRPEP+
Subjt:  VLAGISPDDFDVQLCRPEPT

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X19.4e-62100Show/hide
Query:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
        RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV
Subjt:  RYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARV

Query:  LAGISPDDFDVQLCRPEPTN
        LAGISPDDFDVQLCRPEPTN
Subjt:  LAGISPDDFDVQLCRPEPTN

A0A6J1FN74 uncharacterized protein LOC1114473303.0e-2347.11Show/hide
Query:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR
        YY A++ES+GFDVP F   YAF++I+P+ L  +  + +EV+  A +AIKHYNNENG +FE V ++KAN     G ++++TF VK  G P + P+ T QA+
Subjt:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR

Query:  VLAGIS-PDDFDVQLCRPEPT
        V   I   D  DV+LCRP+P+
Subjt:  VLAGIS-PDDFDVQLCRPEPT

A0A6J1IJT3 uncharacterized protein LOC1114753202.1e-2144.26Show/hide
Query:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR
        Y+ A+ ES+GFDVP F   YAF +I+P+ L     + +EV+  A +AIKHYN+ENG +FE V ++KAN +   G ++++TF VK  G   + P+ T QA+
Subjt:  YYKAIQESEGFDVPSFPGTYAFAIISPLPLINL--ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQAR

Query:  VLAGIS-PDDFDVQLCRPEPTN
        V   I   D  +V+LCRP+P+N
Subjt:  VLAGIS-PDDFDVQLCRPEPTN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G23960.1 Protein of unknown function (DUF626)6.8e-0427.93Show/hide
Query:  YYKAIQESEGFDVPSF---PGTYA---------FAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPP
        Y++ + ES+GFD+ S    P  Y            I +PLP   L     +  A   +  YN   G  FEF +++K N        F++T +    G P 
Subjt:  YYKAIQESEGFDVPSF---PGTYA---------FAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPP

Query:  DSPTTTLQARV
         S   T Q R+
Subjt:  DSPTTTLQARV

AT1G23960.2 Protein of unknown function (DUF626)6.8e-0427.93Show/hide
Query:  YYKAIQESEGFDVPSF---PGTYA---------FAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPP
        Y++ + ES+GFD+ S    P  Y            I +PLP   L     +  A   +  YN   G  FEF +++K N        F++T +    G P 
Subjt:  YYKAIQESEGFDVPSF---PGTYA---------FAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPP

Query:  DSPTTTLQARV
         S   T Q R+
Subjt:  DSPTTTLQARV

AT1G63200.1 Cystatin/monellin superfamily protein9.5e-0635Show/hide
Query:  VRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARVLAGISPDDFDVQLCRPEP
        ++  A +A+  +N  +G  +EFVK++KAN   A   +F +TFQVK    P D      Q RV  G      +   CRP+P
Subjt:  VRECAGQAIKHYNNENGASFEFVKMLKANSQAALGELFFLTFQVKQTGAPPDSPTTTLQARVLAGISPDDFDVQLCRPEP

AT1G63206.1 Cystatin/monellin superfamily protein3.3e-0628.12Show/hide
Query:  KAIQESEGFDVPSFPGTYAFAIISPLPLINL-----------ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAAL--GELFFLTFQVKQTGAPPD
        + +++S+GFD+  F    +     P+ L +L            +E ++  +  ++K+YN+E    +EF+K++KAN+      G ++F+TF+V+    P D
Subjt:  KAIQESEGFDVPSFPGTYAFAIISPLPLINL-----------ISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAAL--GELFFLTFQVKQTGAPPD

Query:  SPTTTLQARVLAGISPDDFDVQLCRPEP
        +     Q RV       D D  LCRP+P
Subjt:  SPTTTLQARVLAGISPDDFDVQLCRPEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCACCAAACTCGATGATCAAGGGACCGGATACCTGCAAAACCGAGCAACCCTAAACCGGATAGCTAGAAGCATTTGCTTAATTAGGGTTGCTCGGTTTTGCAGATA
CTATAAGGCCATACAAGAAAGCGAGGGTTTTGATGTACCTTCCTTCCCTGGAACATATGCTTTTGCTATTATTTCGCCCTTGCCCTTGATTAACCTGATCTCCGAAGAGG
TTCGAGAATGTGCAGGACAAGCTATTAAACATTACAACAATGAAAATGGTGCGAGTTTTGAATTTGTGAAGATGTTGAAAGCAAATAGTCAAGCTGCATTGGGGGAATTG
TTTTTCCTCACCTTTCAAGTCAAGCAAACGGGAGCACCACCGGATTCTCCCACCACAACGCTCCAAGCACGAGTGCTGGCTGGTATTTCTCCCGATGATTTTGACGTACA
ACTGTGCAGGCCAGAACCCACTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCACCAAACTCGATGATCAAGGGACCGGATACCTGCAAAACCGAGCAACCCTAAACCGGATAGCTAGAAGCATTTGCTTAATTAGGGTTGCTCGGTTTTGCAGATA
CTATAAGGCCATACAAGAAAGCGAGGGTTTTGATGTACCTTCCTTCCCTGGAACATATGCTTTTGCTATTATTTCGCCCTTGCCCTTGATTAACCTGATCTCCGAAGAGG
TTCGAGAATGTGCAGGACAAGCTATTAAACATTACAACAATGAAAATGGTGCGAGTTTTGAATTTGTGAAGATGTTGAAAGCAAATAGTCAAGCTGCATTGGGGGAATTG
TTTTTCCTCACCTTTCAAGTCAAGCAAACGGGAGCACCACCGGATTCTCCCACCACAACGCTCCAAGCACGAGTGCTGGCTGGTATTTCTCCCGATGATTTTGACGTACA
ACTGTGCAGGCCAGAACCCACTAATTGA
Protein sequenceShow/hide protein sequence
MRTKLDDQGTGYLQNRATLNRIARSICLIRVARFCRYYKAIQESEGFDVPSFPGTYAFAIISPLPLINLISEEVRECAGQAIKHYNNENGASFEFVKMLKANSQAALGEL
FFLTFQVKQTGAPPDSPTTTLQARVLAGISPDDFDVQLCRPEPTN