; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015835 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015835
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCystatin domain-containing protein
Genome locationtig00006144:209662..210337
RNA-Seq ExpressionSgr015835
SyntenySgr015835
Gene Ontology termsGO:0010466 - negative regulation of peptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149809.1 uncharacterized protein LOC111018153 [Momordica charantia]5.3e-3059.52Show/hide
Query:  MNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHP
        M +EEL AYF E+              +L  AI PI DI R +T++IQHGA+EAIK+YNQKN+TNFE+VEIEKA  GGS GI LYITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHP

Query:  TTTFQAKVL-YARKFSVELCRIKPSN
        TTTFQA+VL     + +E+CRIKPSN
Subjt:  TTTFQAKVL-YARKFSVELCRIKPSN

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]4.6e-1845.93Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPI-LDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPS
        DG + I  +EE+  Y+  V+ S+GFDV           I P+ L       K++Q  A EAIK YN +N TNFE+V+I KA + G  G M YITF VKP 
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPI-LDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPS

Query:  DTPFDHPTTTFQAKVLYARK----FSVELCRIKPS
         TP + P+ TFQAKV YA        VELCR KPS
Subjt:  DTPFDHPTTTFQAKVLYARK----FSVELCRIKPS

XP_022975599.1 uncharacterized protein LOC111475178 [Cucurbita maxima]3.5e-1846.72Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P + +  F    K++Q  A EAIK YN KN TNFE+V+I KA + G  G M YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN
          T  +  + TFQAKV YA        VELCR KPSN
Subjt:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]2.1e-1846.72Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P + +  F    K++Q  A EAIK YN +N TNFE+V+I KA + G  G M YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN
          T  + P+ TFQAKV YA        VELCR KPSN
Subjt:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN

XP_023548353.1 UPF0725 protein At1g02770-like [Cucurbita pepo subsp. pepo]3.0e-1744.53Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP
        DG + I  +EE+  Y+  V+ S+GFDV           I P + +  F    K++Q    EAIK YN +N TNFE+V+I KA + G  G M YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN
          T  + P  TFQAKV YA        VELCR KPSN
Subjt:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN

TrEMBL top hitse value%identityAlignment
A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X15.5e-1737.58Show/hide
Query:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEI
        +V  +   P+   Y YD     DDG  R MNEEE+  Y+  ++ SEGFDV + P +     I P L +    +++++  A +AIK YN +N  +FE V++
Subjt:  KVDPNFKSPSPPYYGYDD---IDDGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEI

Query:  EKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYA---RKFSVELCRIKPSN
         KA +  + G + ++TF+VK +  P D PTTT QA+VL       F V+LCR +P+N
Subjt:  EKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYA---RKFSVELCRIKPSN

A0A6J1D850 uncharacterized protein LOC1110181532.6e-3059.52Show/hide
Query:  MNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHP
        M +EEL AYF E+              +L  AI PI DI R +T++IQHGA+EAIK+YNQKN+TNFE+VEIEKA  GGS GI LYITFKVKPS TP ++P
Subjt:  MNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHP

Query:  TTTFQAKVL-YARKFSVELCRIKPSN
        TTTFQA+VL     + +E+CRIKPSN
Subjt:  TTTFQAKVL-YARKFSVELCRIKPSN

A0A6J1FN74 uncharacterized protein LOC1114473302.2e-1845.93Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPI-LDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPS
        DG + I  +EE+  Y+  V+ S+GFDV           I P+ L       K++Q  A EAIK YN +N TNFE+V+I KA + G  G M YITF VKP 
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPI-LDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKPS

Query:  DTPFDHPTTTFQAKVLYARK----FSVELCRIKPS
         TP + P+ TFQAKV YA        VELCR KPS
Subjt:  DTPFDHPTTTFQAKVLYARK----FSVELCRIKPS

A0A6J1IJT3 uncharacterized protein LOC1114753201.0e-1846.72Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P + +  F    K++Q  A EAIK YN +N TNFE+V+I KA + G  G M YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN
          T  + P+ TFQAKV YA        VELCR KPSN
Subjt:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN

A0A6J1IL21 uncharacterized protein LOC1114751781.7e-1846.72Show/hide
Query:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP
        DG + I  +EE+  YF  V  S+GFDV         G I P + +  F    K++Q  A EAIK YN KN TNFE+V+I KA + G  G M YITF VKP
Subjt:  DGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRF--YTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNGGSYGIMLYITFKVKP

Query:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN
          T  +  + TFQAKV YA        VELCR KPSN
Subjt:  SDTPFDHPTTTFQAKVLYARK----FSVELCRIKPSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63190.1 Cystatin/monellin superfamily protein6.1e-0828.21Show/hide
Query:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQ------IQHGAEEAIKEYNQKNDTNFEIVE
        DP ++ P+  Y  +   +D + +   E+ELA    ++ AS+GFD++      +       LD   F  +       ++  + +A+ +YNQ++ T FE V+
Subjt:  DPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQ------IQHGAEEAIKEYNQKNDTNFEIVE

Query:  IEKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYARKFSVE--LCRIKPS
        + KA       IM  ITF+V     P+D+    FQ +V +A     E   CR KP+
Subjt:  IEKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYARKFSVE--LCRIKPS

AT5G05040.1 Cystatin/monellin superfamily protein7.4e-0629.01Show/hide
Query:  PSPYKVDPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVE--TIPCSLLCGAIRPILDIKRFYTKQ-------IQHGAEEAIKEYNQK
        PSP K+    + PS     + + D  D   ++ ++L     E   S GFDV+   +     CGA+   LD     ++        +   +  AI  YN K
Subjt:  PSPYKVDPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVE--TIPCSLLCGAIRPILDIKRFYTKQ-------IQHGAEEAIKEYNQK

Query:  NDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYARKFSVELCRIKP
         D++ E+V++ +A    S  I LYITF+   +D    + T  +QA VLY   F +++C  KP
Subjt:  NDTNFEIVEIEKAMNGGSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYARKFSVELCRIKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCACCCATCACCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCGCCTCCATATTATGGATATGATGACATTGATGATGGTGATAATCGTATCATGAACGAAGA
AGAGTTGGCTGCATACTTTGCTGAAGTTAAAGCCAGCGAGGGCTTCGATGTTGAGACTATTCCTTGCTCTCTTTTATGCGGCGCAATTAGGCCCATCTTGGATATTAAAA
GGTTCTATACCAAACAGATTCAACATGGTGCTGAGGAAGCCATCAAAGAATACAACCAAAAGAATGATACTAATTTTGAAATTGTGGAGATTGAGAAGGCCATGAACGGA
GGAAGTTATGGTATCATGCTATACATCACTTTTAAAGTGAAGCCATCTGACACGCCTTTTGACCACCCAACTACTACATTTCAGGCTAAAGTACTCTATGCTAGAAAATT
TTCCGTAGAACTTTGCAGGATCAAGCCTTCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCACCCATCACCTTACAAAGTCGATCCCAATTTTAAAAGTCCATCGCCTCCATATTATGGATATGATGACATTGATGATGGTGATAATCGTATCATGAACGAAGA
AGAGTTGGCTGCATACTTTGCTGAAGTTAAAGCCAGCGAGGGCTTCGATGTTGAGACTATTCCTTGCTCTCTTTTATGCGGCGCAATTAGGCCCATCTTGGATATTAAAA
GGTTCTATACCAAACAGATTCAACATGGTGCTGAGGAAGCCATCAAAGAATACAACCAAAAGAATGATACTAATTTTGAAATTGTGGAGATTGAGAAGGCCATGAACGGA
GGAAGTTATGGTATCATGCTATACATCACTTTTAAAGTGAAGCCATCTGACACGCCTTTTGACCACCCAACTACTACATTTCAGGCTAAAGTACTCTATGCTAGAAAATT
TTCCGTAGAACTTTGCAGGATCAAGCCTTCAAATTAA
Protein sequenceShow/hide protein sequence
MDHPSPYKVDPNFKSPSPPYYGYDDIDDGDNRIMNEEELAAYFAEVKASEGFDVETIPCSLLCGAIRPILDIKRFYTKQIQHGAEEAIKEYNQKNDTNFEIVEIEKAMNG
GSYGIMLYITFKVKPSDTPFDHPTTTFQAKVLYARKFSVELCRIKPSN