; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCystatin domain-containing protein
Genome locationchr3:9805505..9806077
RNA-Seq ExpressionMoc03g14560
SyntenyMoc03g14560
Gene Ontology termsGO:0010951 - negative regulation of endopeptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]2.5e-1947.32Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH
        GFD+PSF G YA   I  +    L SEE++E   +AI +YN +NG SF+FVKM+K  ++  +  ++++TF+VKQ G+PP+S T TL+ARVLA I   +  
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH

Query:  GFVVELCRSEPS
         F V+LCR EP+
Subjt:  GFVVELCRSEPS

XP_022137489.1 uncharacterized protein LOC111008920 isoform X2 [Momordica charantia]2.9e-1241.96Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH
        GFD+PSF G YA   I  +    L SEE                G SF+FVKM+K  ++  +  ++++TF+VKQ G+PP+S T TL+ARVLA I   +  
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH

Query:  GFVVELCRSEPS
         F V+LCR EP+
Subjt:  GFVVELCRSEPS

XP_022137526.1 uncharacterized protein LOC111008952 [Momordica charantia]3.1e-3873.68Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFV--DKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGN
        GFDMPSF   YACG I+ +TGSRL+SEELQE    DKA+DY+NQQNGTSF+FVKMVK TNK VV IMYY+TFEVKQMGSPPNS TKTL+ARVL  +PIG+
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFV--DKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGN

Query:  DHGFVVELCRSEPS
           F VELCR EPS
Subjt:  DHGFVVELCRSEPS

XP_022948409.1 uncharacterized protein LOC111452100 [Cucurbita moschata]3.2e-1140.52Show/hide
Query:  LTKPGFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI
        L   GFD+P+       G I  V+   L    L+E   KAI  YN +NGT+++FVK+ K T++V   ++YYITF+VKQ+G      T T EA+VL  I  
Subjt:  LTKPGFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI

Query:  GNDHGFVVELCRSEPS
          D    V LCR + S
Subjt:  GNDHGFVVELCRSEPS

XP_023525925.1 uncharacterized protein LOC111789396 [Cucurbita pepo subsp. pepo]1.7e-1243.75Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH
        GFD+P+F   YA G I  +    L    L++  +KAI +YN +NGT+F+FVK+VK   +VV    YYITF+VKQ+G+     T T EA+VL     G D 
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH

Query:  GFVVELCRSEPS
           V LCR + S
Subjt:  GFVVELCRSEPS

TrEMBL top hitse value%identityAlignment
A0A6J1C7D4 uncharacterized protein LOC111008920 isoform X21.4e-1241.96Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH
        GFD+PSF G YA   I  +    L SEE                G SF+FVKM+K  ++  +  ++++TF+VKQ G+PP+S T TL+ARVLA I   +  
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH

Query:  GFVVELCRSEPS
         F V+LCR EP+
Subjt:  GFVVELCRSEPS

A0A6J1C8H7 uncharacterized protein LOC1110089521.5e-3873.68Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFV--DKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGN
        GFDMPSF   YACG I+ +TGSRL+SEELQE    DKA+DY+NQQNGTSF+FVKMVK TNK VV IMYY+TFEVKQMGSPPNS TKTL+ARVL  +PIG+
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFV--DKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGN

Query:  DHGFVVELCRSEPS
           F VELCR EPS
Subjt:  DHGFVVELCRSEPS

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X11.2e-1947.32Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH
        GFD+PSF G YA   I  +    L SEE++E   +AI +YN +NG SF+FVKM+K  ++  +  ++++TF+VKQ G+PP+S T TL+ARVLA I   +  
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGNDH

Query:  GFVVELCRSEPS
         F V+LCR EP+
Subjt:  GFVVELCRSEPS

A0A6J1FN74 uncharacterized protein LOC1114473302.2e-1038.46Show/hide
Query:  GFDMPSFLGHYACGHIDVVTGSRLYS-----EELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIP
        GFD+P F   YA     ++T  +L+      +E+Q   ++AI +YN +NGT+F+ V +VK  +      MYYITF VK +G+P    + T +A+V   IP
Subjt:  GFDMPSFLGHYACGHIDVVTGSRLYS-----EELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIP

Query:  IGNDHGFVVELCRSEPS
        I +     VELCR +PS
Subjt:  IGNDHGFVVELCRSEPS

A0A6J1G9T9 uncharacterized protein LOC1114521001.6e-1140.52Show/hide
Query:  LTKPGFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI
        L   GFD+P+       G I  V+   L    L+E   KAI  YN +NGT+++FVK+ K T++V   ++YYITF+VKQ+G      T T EA+VL  I  
Subjt:  LTKPGFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI

Query:  GNDHGFVVELCRSEPS
          D    V LCR + S
Subjt:  GNDHGFVVELCRSEPS

SwissProt top hitse value%identityAlignment
P37842 Multicystatin2.1e-0535.9Show/hide
Query:  GHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVL
        G +DV   +++  ++L  F   A+  YNQ+N +S +F K++ V  ++V  IMYYITFE  + G+      K  EA++L
Subjt:  GHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVL

Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein6.7e-0730.95Show/hide
Query:  ILTKPGFDMPSFLGHYAC---GHIDVVTGSRLYSEE------LQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTL
        IL   GFD+     H+ C    H+  +       E       L+    KA+D YNQ++ T F+FVK+VK       +IM+ ITFEV     P ++  K  
Subjt:  ILTKPGFDMPSFLGHYAC---GHIDVVTGSRLYSEE------LQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTL

Query:  EARVLADIPIGNDHGFVVELCRSEPS
        + RV     I  ++ F    CR +P+
Subjt:  EARVLADIPIGNDHGFVVELCRSEPS

AT1G63200.1 Cystatin/monellin superfamily protein1.3e-0530.17Show/hide
Query:  DMPSFL----GHYACGHIDVVTGSRLYSEE--LQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI
        D  SFL     H A  H D        + E  L+    +A+D +N ++GT ++FVK+VK       ++M+ ITF+VK    P +   K  + RV     I
Subjt:  DMPSFL----GHYACGHIDVVTGSRLYSEE--LQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPI

Query:  GNDHGFVVELCRSEPS
          ++ F    CR +P+
Subjt:  GNDHGFVVELCRSEPS

AT2G37435.1 Cystatin/monellin superfamily protein1.8e-0432.26Show/hide
Query:  LYSEELQEFVD----KAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNS
        L  E  +EF+D    K+++++N+ + T ++FV+ +K  + V   +MY+ITFE K + +  +S
Subjt:  LYSEELQEFVD----KAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCTGACAAAGCCCGGGTTTGACATGCCCTCTTTTCTCGGTCATTACGCTTGTGGTCACATTGATGTTGTAACTGGGAGTCGATTATATTCGGAAGAGCTT
CAAGAATTCGTAGACAAAGCTATTGACTATTACAATCAGCAAAATGGTACAAGTTTTGATTTTGTGAAGATGGTGAAGGTAACTAATAAAGTTGTGGTTAGTATA
ATGTATTACATCACTTTTGAAGTCAAGCAAATGGGATCGCCTCCAAACTCTGCCACCAAAACACTCGAAGCTCGAGTGTTGGCTGATATTCCTATTGGCAATGAT
CATGGTTTTGTGGTAGAACTATGCAGGTCAGAACCCTCTATTTATAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCCTGACAAAGCCCGGGTTTGACATGCCCTCTTTTCTCGGTCATTACGCTTGTGGTCACATTGATGTTGTAACTGGGAGTCGATTATATTCGGAAGAGCTT
CAAGAATTCGTAGACAAAGCTATTGACTATTACAATCAGCAAAATGGTACAAGTTTTGATTTTGTGAAGATGGTGAAGGTAACTAATAAAGTTGTGGTTAGTATA
ATGTATTACATCACTTTTGAAGTCAAGCAAATGGGATCGCCTCCAAACTCTGCCACCAAAACACTCGAAGCTCGAGTGTTGGCTGATATTCCTATTGGCAATGAT
CATGGTTTTGTGGTAGAACTATGCAGGTCAGAACCCTCTATTTATAATTGA
Protein sequenceShow/hide protein sequence
MILTKPGFDMPSFLGHYACGHIDVVTGSRLYSEELQEFVDKAIDYYNQQNGTSFDFVKMVKVTNKVVVSIMYYITFEVKQMGSPPNSATKTLEARVLADIPIGND
HGFVVELCRSEPSIYN