; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015834 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015834
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCystatin domain-containing protein
Genome locationtig00006144:204945..205616
RNA-Seq ExpressionSgr015834
SyntenySgr015834
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]1.1e-1638.22Show/hide
Query:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI
        +V  +   P+   Y YD     DDG  R MNEEE+  Y+  ++ +EGFDV + P +     I  +  I    +EE++  A +AIK YN +N  +FE V++
Subjt:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI

Query:  EKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYA---KKFSVELCRIKPSN
         KA +  + G +F++TF+VK +  P D PTTT QA+VL       F V+LCR +P+N
Subjt:  EKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYA---KKFSVELCRIKPSN

XP_022149809.1 uncharacterized protein LOC111018153 [Momordica charantia]6.9e-3061.11Show/hide
Query:  MNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSNTPFDHP
        M +EEL AYF E+              +L  AI  I DI R FTEEIQHGA+EAIKDYNQKN+TNFEVVEIEKA  GGS G   YITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSNTPFDHP

Query:  TTTFQAKVL-YAKKFSVELCRIKPSN
        TTTFQA+VL     + +E+CRIKPSN
Subjt:  TTTFQAKVL-YAKKFSVELCRIKPSN

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]6.7e-1745.65Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF----FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKV
        DG + I  +EE+  Y+  V+ ++GFDV   P      A   I+ +K        +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF V
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF----FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKV

Query:  KPSNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPS
        KP  TP + P+ TFQAKV YA        VELCR KPS
Subjt:  KPSNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPS

XP_022975599.1 uncharacterized protein LOC111475178 [Cucurbita maxima]5.1e-1745.99Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  ++GFDV         G I  +  +  F    +E+Q  A EAIK YN KN TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN
          T  +  + TFQAKV YA        VELCR KPSN
Subjt:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]1.8e-1745.99Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAI--RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  ++GFDV         G I    +      F +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAI--RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN
          T  + P+ TFQAKV YA        VELCR KPSN
Subjt:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN

TrEMBL top hitse value%identityAlignment
A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X15.5e-1738.22Show/hide
Query:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI
        +V  +   P+   Y YD     DDG  R MNEEE+  Y+  ++ +EGFDV + P +     I  +  I    +EE++  A +AIK YN +N  +FE V++
Subjt:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEI

Query:  EKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYA---KKFSVELCRIKPSN
         KA +  + G +F++TF+VK +  P D PTTT QA+VL       F V+LCR +P+N
Subjt:  EKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYA---KKFSVELCRIKPSN

A0A6J1D850 uncharacterized protein LOC1110181533.3e-3061.11Show/hide
Query:  MNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSNTPFDHP
        M +EEL AYF E+              +L  AI  I DI R FTEEIQHGA+EAIKDYNQKN+TNFEVVEIEKA  GGS G   YITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPSNTPFDHP

Query:  TTTFQAKVL-YAKKFSVELCRIKPSN
        TTTFQA+VL     + +E+CRIKPSN
Subjt:  TTTFQAKVL-YAKKFSVELCRIKPSN

A0A6J1FN74 uncharacterized protein LOC1114473303.2e-1745.65Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF----FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKV
        DG + I  +EE+  Y+  V+ ++GFDV   P      A   I+ +K        +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF V
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF----FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKV

Query:  KPSNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPS
        KP  TP + P+ TFQAKV YA        VELCR KPS
Subjt:  KPSNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPS

A0A6J1IJT3 uncharacterized protein LOC1114753208.5e-1845.99Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAI--RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  ++GFDV         G I    +      F +E+Q  A EAIK YN +N TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAI--RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN
          T  + P+ TFQAKV YA        VELCR KPSN
Subjt:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN

A0A6J1IL21 uncharacterized protein LOC1114751782.5e-1745.99Show/hide
Query:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP
        DG + I  +EE+  YF  V  ++GFDV         G I  +  +  F    +E+Q  A EAIK YN KN TNFEVV+I KA + G  G M+YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF--FTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKP

Query:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN
          T  +  + TFQAKV YA        VELCR KPSN
Subjt:  SNTPFDHPTTTFQAKVLYA----KKFSVELCRIKPSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein1.3e-0728.21Show/hide
Query:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEE------IQHGAEEAIKDYNQKNDTNFEVVE
        DP ++ P+  Y  +   +D + +   E+ELA    ++ A++GFD+       +        D   F  E       ++  + +A+ DYNQ++ T FE V+
Subjt:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEE------IQHGAEEAIKDYNQKNDTNFEVVE

Query:  IEKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS
        + KA        MF ITF+V     P+D+    FQ +V +A+    E   CR KP+
Subjt:  IEKAMNGGSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS

AT1G63200.1 Cystatin/monellin superfamily protein2.5e-0934.59Show/hide
Query:  EEELAAYFAEVKANEGFDVQTIPCSLLC-----GAIRH---ISDIKRFFTEEIQHG-AEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPS
        EEELA    +V A++GFD+     S LC      AI H    +D +   TE++    A+EA+ D+N ++ T +E V++ KA    +   MF ITF+VK  
Subjt:  EEELAAYFAEVKANEGFDVQTIPCSLLC-----GAIRH---ISDIKRFFTEEIQHG-AEEAIKDYNQKNDTNFEVVEIEKAMNGGSYGFMFYITFKVKPS

Query:  NTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS
          P+D     FQ +V + K  +     CR KP+
Subjt:  NTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS

AT1G63205.1 Cystatin/monellin superfamily protein1.0e-0731.85Show/hide
Query:  EEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF-FTEE---------IQHGAEEAIKDYNQKNDTN-FEVVEIEKAMNGGSYGFMFYITFKVK
        +EE+A    ++  +EGFD+        C    H+ D   F F ++         ++  ++E++K YN +  TN +E  E+ KA   GS G+MF ITF+V 
Subjt:  EEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRF-FTEE---------IQHGAEEAIKDYNQKNDTN-FEVVEIEKAMNGGSYGFMFYITFKVK

Query:  PSNTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS
            P D    TFQA++ Y   +  E   CR KP+
Subjt:  PSNTPFDHPTTTFQAKVLYAKKFSVE--LCRIKPS

AT5G05040.1 Cystatin/monellin superfamily protein1.4e-0427.15Show/hide
Query:  KSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKANEGFDV--QTIPCSLLCGAI-----RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEK
        + PS     + + D  D   ++ ++L     E   + GFDV    +     CGA+       +S+      + +   +  AI  YN K D++ E+V++ +
Subjt:  KSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKANEGFDV--QTIPCSLLCGAI-----RHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEK

Query:  AMNGGSYGFMFYITFKVKPSNTPFD-HPTTTFQAKVLYAKKFSVELCRIKP
        A    S     YITF+   +N P D + T  +QA VLY   F +++C  KP
Subjt:  AMNGGSYGFMFYITFKVKPSNTPFD-HPTTTFQAKVLYAKKFSVELCRIKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCGCCCATCATCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCACCTCCATATTACGGGTATGATGACATTGATGATGGTGATAATCGTATCATGAACGAGGA
AGAGTTGGCTGCATACTTCGCTGAAGTTAAAGCCAACGAGGGATTTGATGTTCAAACTATTCCTTGCTCTCTTTTATGCGGTGCCATTAGGCACATTTCTGATATTAAAA
GATTCTTTACTGAAGAGATTCAACATGGTGCTGAGGAAGCCATCAAAGACTACAACCAGAAGAATGATACTAATTTTGAGGTTGTGGAGATTGAGAAGGCCATGAACGGA
GGAAGTTATGGTTTCATGTTTTACATCACATTTAAAGTGAAGCCATCTAACACGCCTTTTGACCACCCAACTACTACATTTCAGGCTAAAGTCTTATATGCTAAAAAATT
TTCCGTAGAACTTTGCAGGATCAAGCCCTCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCGCCCATCATCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCACCTCCATATTACGGGTATGATGACATTGATGATGGTGATAATCGTATCATGAACGAGGA
AGAGTTGGCTGCATACTTCGCTGAAGTTAAAGCCAACGAGGGATTTGATGTTCAAACTATTCCTTGCTCTCTTTTATGCGGTGCCATTAGGCACATTTCTGATATTAAAA
GATTCTTTACTGAAGAGATTCAACATGGTGCTGAGGAAGCCATCAAAGACTACAACCAGAAGAATGATACTAATTTTGAGGTTGTGGAGATTGAGAAGGCCATGAACGGA
GGAAGTTATGGTTTCATGTTTTACATCACATTTAAAGTGAAGCCATCTAACACGCCTTTTGACCACCCAACTACTACATTTCAGGCTAAAGTCTTATATGCTAAAAAATT
TTCCGTAGAACTTTGCAGGATCAAGCCCTCAAATTAA
Protein sequenceShow/hide protein sequence
MDRPSSYKVDPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKANEGFDVQTIPCSLLCGAIRHISDIKRFFTEEIQHGAEEAIKDYNQKNDTNFEVVEIEKAMNG
GSYGFMFYITFKVKPSNTPFDHPTTTFQAKVLYAKKFSVELCRIKPSN