CuGenDBv2

Sgr015832 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015832
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Cystatin domain-containing protein
Genome location: tig00006144:199348..200030
RNA-Seq Expression: Sgr015832
Synteny: Sgr015832
Gene Ontology terms: GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains: IPR000010 - Cystatin domain
                  IPR006525 - Cystatin-related, plant
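
The genome location field packs the contig name and the coordinates into one string. A minimal sketch of how it could be split and the gene span computed follows; the function name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

contig, start, end = parse_location("tig00006144:199348..200030")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, start, end, span)  # tig00006144 199348 200030 683
```

Under that assumption the gene spans 683 bp on the contig.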


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia] (e value: 1.5e-16; % identity: 39.47)
Query:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI
        +V  +   P+   Y YD     DDG  R MNEEE+  Y+  ++ SEGFDV + P +     I P+  I    +EE++  A +AIK YN +N  +FE V++
Subjt:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI

Query:  EKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYA---KKFSVELCR
         KA +  + G +F++TF+VK +  P D PTTT QA+VL       F V+LCR
Subjt:  EKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYA---KKFSVELCR

XP_022149809.1 uncharacterized protein LOC111018153 [Momordica charantia] (e value: 3.8e-28; % identity: 60.66)
Query:  MNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDTPFDHP
        M +EEL AYF E+              +L  AI PI DI R FTEEIQHGA+EAIKDYNQKN+TNFEVVEIEKA  GGS G   YITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDTPFDHP

Query:  TTTFQAKVL-YAKKFSVELCRI
        TTTFQA+VL     + +E+CRI
Subjt:  TTTFQAKVL-YAKKFSVELCRI

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata] (e value: 2.0e-16; % identity: 44.7)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  Y+  V+ S+GFDV           I P  + ++     +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR
          TP + P+ TFQAKV YA        VELCR
Subjt:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima] (e value: 2.0e-16; % identity: 46.21)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P  +      F +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR
          T  + P+ TFQAKV YA        VELCR
Subjt:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR

XP_023521681.1 UPF0725 protein At4g29550-like [Cucurbita pepo subsp. pepo] (e value: 2.0e-16; % identity: 46.56)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPIS-DIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPS
        DG + I  +EE   YF  V+ SEGFDV           I PI      +  EE++  A +AIK YN++N TNFEVVEI KA + G  G M+YITF VKP 
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPIS-DIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPS

Query:  DTPFDHPTTTFQAKVLYA----KKFSVELCR
         T  + P  TFQAKV YA        VELCR
Subjt:  DTPFDHPTTTFQAKVLYA----KKFSVELCR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X1 (e value: 7.3e-17; % identity: 39.47)
Query:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI
        +V  +   P+   Y YD     DDG  R MNEEE+  Y+  ++ SEGFDV + P +     I P+  I    +EE++  A +AIK YN +N  +FE V++
Subjt:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI

Query:  EKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYA---KKFSVELCR
         KA +  + G +F++TF+VK +  P D PTTT QA+VL       F V+LCR
Subjt:  EKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYA---KKFSVELCR

A0A6J1D850 uncharacterized protein LOC111018153 (e value: 1.8e-28; % identity: 60.66)
Query:  MNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDTPFDHP
        M +EEL AYF E+              +L  AI PI DI R FTEEIQHGA+EAIKDYNQKN+TNFEVVEIEKA  GGS G   YITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDTPFDHP

Query:  TTTFQAKVL-YAKKFSVELCRI
        TTTFQA+VL     + +E+CRI
Subjt:  TTTFQAKVL-YAKKFSVELCRI

A0A6J1FN74 uncharacterized protein LOC111447330 (e value: 9.5e-17; % identity: 44.7)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  Y+  V+ S+GFDV           I P  + ++     +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR
          TP + P+ TFQAKV YA        VELCR
Subjt:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR

A0A6J1IJT3 uncharacterized protein LOC111475320 (e value: 9.5e-17; % identity: 46.21)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P  +      F +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRP--ISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR
          T  + P+ TFQAKV YA        VELCR
Subjt:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR

A0A6J1IL21 uncharacterized protein LOC111475178 (e value: 2.1e-16; % identity: 46.21)
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P+  +  F    +E+Q  A EAIK YN KN TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR
          T  +  + TFQAKV YA        VELCR
Subjt:  SDTPFDHPTTTFQAKVLYA----KKFSVELCR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G63190.1 Cystatin/monellin superfamily protein (e value: 6.7e-07; % identity: 28.29)
Query:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE------IQHGAEEAIKDYNQKNDTNFEVVE
        DP ++ P+  Y  +   +D + +   E+ELA    ++ AS+GFD+       +        D   F  E       ++  + +A+ DYNQ++ T FE V+
Subjt:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE------IQHGAEEAIKDYNQKNDTNFEVVE

Query:  IEKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYAKKFSVE--LCR
        + KA        MF ITF+V     P+D+    FQ +V +A+    E   CR
Subjt:  IEKAMNGGSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYAKKFSVE--LCR

AT1G63200.1 Cystatin/monellin superfamily protein (e value: 1.8e-07; % identity: 30.71)
Query:  EEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE-------IQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDT
        EEELA    +V AS+GFD+       +      I    +F  +E       ++  A+EA+ D+N ++ T +E V++ KA    +   MF ITF+VK    
Subjt:  EEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE-------IQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSDT

Query:  PFDHPTTTFQAKVLYAKKFSVE--LCR
        P+D     FQ +V + K  +     CR
Subjt:  PFDHPTTTFQAKVLYAKKFSVE--LCR

AT1G63205.1 Cystatin/monellin superfamily protein (e value: 6.7e-07; % identity: 31.78)
Query:  EEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE--------IQHGAEEAIKDYNQKNDTN-FEVVEIEKAMNGGSYGFMFYITFKVKPS
        +EE+A    ++  SEGFD+       L        D   F  +E        ++  ++E++K YN +  TN +E  E+ KA   GS G+MF ITF+V   
Subjt:  EEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEE--------IQHGAEEAIKDYNQKNDTN-FEVVEIEKAMNGGSYGFMFYITFKVKPS

Query:  DTPFDHPTTTFQAKVLYAKKFSVE--LCR
          P D    TFQA++ Y   +  E   CR
Subjt:  DTPFDHPTTTFQAKVLYAKKFSVE--LCR


Sequences
CDS sequence
ATGGACCGCCCATCATCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCGCCTCCATATTATGGATATGATGACATTGATGATGGTGATAATCGTATCATGAACGAAGA
AGAGTTGGCTGCATACTTTGCTGAAGTTAAAGCCAGCGAGGGCTTTGATGTTCAAACTATTCCTTGCTCTCTTTTATGCGGTGCCATTAGGCCCATTTCTGATATTAAAA
GATTCTTTACTGAAGAGATTCAACATGGTGCTGAGGAAGCCATCAAAGACTACAACCAGAAGAATGATACTAATTTTGAGGTTGTGGAGATTGAGAAGGCCATGAATGGA
GGAAGTTATGGTTTCATGTTTTACATCACCTTTAAAGTGAAGCCATCTGACACGCCTTTTGACCATCCAACTACTACATTTCAGGCTAAAGTACTATATGCTAAAAAATT
TTCCGTAGAACTTTGCAGGATCAACCTTCAAATTAACTGA
mRNA sequence
ATGGACCGCCCATCATCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCGCCTCCATATTATGGATATGATGACATTGATGATGGTGATAATCGTATCATGAACGAAGA
AGAGTTGGCTGCATACTTTGCTGAAGTTAAAGCCAGCGAGGGCTTTGATGTTCAAACTATTCCTTGCTCTCTTTTATGCGGTGCCATTAGGCCCATTTCTGATATTAAAA
GATTCTTTACTGAAGAGATTCAACATGGTGCTGAGGAAGCCATCAAAGACTACAACCAGAAGAATGATACTAATTTTGAGGTTGTGGAGATTGAGAAGGCCATGAATGGA
GGAAGTTATGGTTTCATGTTTTACATCACCTTTAAAGTGAAGCCATCTGACACGCCTTTTGACCATCCAACTACTACATTTCAGGCTAAAGTACTATATGCTAAAAAATT
TTCCGTAGAACTTTGCAGGATCAACCTTCAAATTAACTGA
Protein sequence
MDRPSSYKVDPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVQTIPCSLLCGAIRPISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNG
GSYGFMFYITFKVKPSDTPFDHPTTTFQAKVLYAKKFSVELCRINLQIN
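
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper, and only the first 30 and last 21 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGACCGCCCATCATCTTACAAAGTCGAT"  # first 30 nt of the CDS above
cds_end = "ATCAACCTTCAAATTAACTGA"             # last 21 nt, in frame, ending in TGA
print(translate(cds_start))  # MDRPSSYKVD
print(translate(cds_end))    # INLQIN
```

Both fragments agree with the start (MDRPSSYKVD) and end (INLQIN) of the protein sequence listed above.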