CuGenDBv2

Sgr027544 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027544
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Cystatin domain-containing protein
Genome location: tig00153054:2404499..2405153
RNA-Seq Expression: Sgr027544
Synteny: Sgr027544
Gene Ontology terms:
GO:0010466 - negative regulation of peptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domains:
IPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant
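
The genome location in this record uses the form contig:start..end. As a minimal sketch of reading it programmatically, the Python snippet below splits that string and reports the length of the genomic span. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDBv2, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'contig:start..end'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(contig, start, end)


loc = parse_location("tig00153054:2404499..2405153")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the reported span is the full genomic extent of the gene model, which can be longer than the CDS listed under Sequences.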


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600694.1 hypothetical protein SDJN03_05927, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.0e-35; identity 53.16%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EYY A+ ESQGFDVP F  V+AF +I P+    +   + E+++   EAIKHYN ENGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN
        V ANH    G +Y ITF  K +GT  EFP   FQA+V   IP  D I+VELCRPKPSN
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]; e value 1.2e-36; identity 54.14%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EYY A+ ESQGFDVP F  V+AF +I P+    +   + E+++   EAIKHYN ENGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPS
        V ANH    G +YYITF  K +GTP EFP+  FQA+V   IP  D I+VELCRPKPS
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPS

XP_022975633.1 uncharacterized protein LOC111475320 [Cucurbita maxima]; e value 4.1e-37; identity 54.43%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EY+ A+ ESQGFDVP F  V+AFG+I P+          E+++   EAIKHYN ENGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN
        V ANH+   G +YYITF  K +GT  EFP+  FQA+V   IP  D IEVELCRPKPSN
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN

XP_023521681.1 UPF0725 protein At4g29550-like [Cucurbita pepo subsp. pepo]; e value 6.0e-36; identity 56.83%
Query:  SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYIT
        SD +FYQDGY  IT++EE EY+RA+ ES+GFDVP F KVF++ +I P+     S  + E+R+   +AIKHYN+ENGTNFEVV+IV ANH    G +YYIT
Subjt:  SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYIT

Query:  FEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPK
        F  K +GT  EFP   FQA+V   IP  D I+VELCRPK
Subjt:  FEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPK

XP_023548353.1 UPF0725 protein At1g02770-like [Cucurbita pepo subsp. pepo]; e value 6.0e-36; identity 53.16%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EYY A++ESQGFDVP F  V+AF +I P+        + E+++   EAIKHYN ENGTNFE+V I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN
        V ANH    G +YYITF  K +GT  EFP   FQA+V   IP  D I+VELCRPKPSN
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X1; e value 5.5e-35; identity 54.05%
Query:  ASSPYVSDSD-FYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAAR
        A++ Y+ DSD +Y DG R++ EEE   YY+A++ES+GFDVP+FP  +AF II PL   L++    E+R    +AIKHYN ENG +FE VK++ AN QAA 
Subjt:  ASSPYVSDSD-FYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAAR

Query:  GLLYYITFEGKQVGTPPEFPTTIFQARVLAGI-PDTIEVELCRPKPSN
        G L+++TF+ KQ G PP+ PTT  QARVLAGI PD  +V+LCRP+P+N
Subjt:  GLLYYITFEGKQVGTPPEFPTTIFQARVLAGI-PDTIEVELCRPKPSN

A0A6J1FN74 uncharacterized protein LOC111447330; e value 5.8e-37; identity 54.14%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EYY A+ ESQGFDVP F  V+AF +I P+    +   + E+++   EAIKHYN ENGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPS
        V ANH    G +YYITF  K +GTP EFP+  FQA+V   IP  D I+VELCRPKPS
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPS

A0A6J1FVU0 uncharacterized protein LOC111447329; e value 3.2e-35; identity 56.83%
Query:  SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYIT
        SD +FYQDGY  IT++EE EY+RA+ ES+GFDVP F KVF+   IIP+     S  + E+R+   +AIKHYN+ENGTNFEVV+IV ANH    G +YYIT
Subjt:  SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYIT

Query:  FEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPK
        F  K +GT  EFP   FQA+V   IP  D I+V+LCRPK
Subjt:  FEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPK

A0A6J1IJT3 uncharacterized protein LOC111475320; e value 2.0e-37; identity 54.43%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EY+ A+ ESQGFDVP F  V+AFG+I P+          E+++   EAIKHYN ENGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN
        V ANH+   G +YYITF  K +GT  EFP+  FQA+V   IP  D IEVELCRPKPSN
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN

A0A6J1IL21 uncharacterized protein LOC111475178; e value 1.9e-35; identity 53.16%
Query:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI
        MASS ++         SD +F+QDGY  IT++EE EY+ A+ ESQGFDVP F  V+AFG+I P+          E+++   EAIKHYN +NGTNFEVV I
Subjt:  MASSPYV---------SDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN
        V ANH    G +YYITF  K +GT  EF +  FQA+V   IP  D IEVELCRPKPSN
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIP--DTIEVELCRPKPSN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G63190.1 Cystatin/monellin superfamily protein; e value 1.0e-09; identity 32.9%
Query:  DSDFYQDGYRDITEEEEEEYYRALRE----------SQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE-----LRVYTEEAIKHYNQENGTNFEVVKI
        D  + +  YRD T+EE+E  Y   +E          S GFD+    F  VF + +    S E +    PE     L   + +A+  YNQE+ T FE VK+
Subjt:  DSDFYQDGYRDITEEEEEEYYRALRE----------SQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE-----LRVYTEEAIKHYNQENGTNFEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIPDTIEVELCRPKPS
        V AN      +++ ITFE   V  P +    +FQ RV        E   CRPKP+
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIPDTIEVELCRPKPS

AT1G63200.1 Cystatin/monellin superfamily protein; e value 1.5e-08; identity 32.06%
Query:  EEEEEEYYRALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE----LRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYITFEGKQVGT
        EEE     + +  S GFD+   +F  VF +   I  S++    +       L+   +EA+  +N  +GT +E VK+V AN   A  +++ ITF   QV  
Subjt:  EEEEEEYYRALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE----LRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYITFEGKQVGT

Query:  PPEFPTTIFQARVLAGIPDTIEVELCRPKPS
        P +    +FQ RV  G   T     CRPKP+
Subjt:  PPEFPTTIFQARVLAGIPDTIEVELCRPKPS

AT1G63205.1 Cystatin/monellin superfamily protein; e value 2.5e-08; identity 28.39%
Query:  DSDFYQDGYRDITEEEEEEY---------YRALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE-----LRVYTEEAIKHYNQENGTN-FEVVKI
        D +  +  Y   TE++E +Y            + +S+GFD+    F  +F + ++ P   + L  +  E     ++ +++E++K YN E GTN +E  ++
Subjt:  DSDFYQDGYRDITEEEEEEY---------YRALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPE-----LRVYTEEAIKHYNQENGTN-FEVVKI

Query:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIPDTIEVELCRPKPS
        V AN   + G ++ ITF   QV  P +     FQAR+        E   CRPKP+
Subjt:  VTANHQAARGLLYYITFEGKQVGTPPEFPTTIFQARVLAGIPDTIEVELCRPKPS

AT1G63206.1 Cystatin/monellin superfamily protein; e value 2.8e-07; identity 32.56%
Query:  RALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPELRVYT---------EEAIKHYNQENGTNFEVVKIVTANHQ--AARGLLYYITFEGKQVGTP
        R +++S+GFD+    F  VF +    P+  + L +K   L   T         + ++K+YN E  T +E +K+V AN       G +Y+ITFE   V  P
Subjt:  RALRESQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPELRVYT---------EEAIKHYNQENGTNFEVVKIVTANHQ--AARGLLYYITFEGKQVGTP

Query:  PEFPTTIFQARVLAGIPDTIEVELCRPKP
         +    +FQ RV      T +  LCRPKP
Subjt:  PEFPTTIFQARVLAGIPDTIEVELCRPKP

AT2G37435.1 Cystatin/monellin superfamily protein; e value 1.8e-09; identity 31.5%
Query:  EEEYYRALRE---SQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPELRVYTE----EAIKHYNQENGTNFEVVKIVTANHQAARGLLYYITFEGKQVGT
        EEEYY  ++E   S+GFD+    F  VF +   + L +  L+ +    R + +    ++++H+N+ + T +E V+ + ANH  + G++Y+ITFEGK +  
Subjt:  EEEYYRALRE---SQGFDV--PAFPKVFAFGIIIPLSEELLSHKIPELRVYTE----EAIKHYNQENGTNFEVVKIVTANHQAARGLLYYITFEGKQVGT

Query:  PPEFPTTIFQARV--LAGIPDTIEVEL
          +  +  FQA++    G P+ I  EL
Subjt:  PPEFPTTIFQARV--LAGIPDTIEVEL


Sequences
CDS sequence
ATGGCATCTTCACCCTACGTTTCAGACTCGGATTTTTATCAAGACGGTTACCGTGACATTACTGAAGAAGAGGAGGAGGAATATTACCGTGCCCTAAGAGAAAGCCAGGG
TTTTGATGTACCGGCTTTCCCTAAAGTCTTTGCATTTGGTATTATTATTCCATTATCTGAGGAGCTACTTTCACATAAAATTCCAGAACTTCGAGTCTACACCGAAGAAG
CTATTAAGCATTACAACCAGGAAAATGGTACAAATTTTGAAGTTGTGAAGATTGTGACGGCAAATCATCAAGCTGCCCGTGGTTTATTGTATTACATCACCTTCGAGGGG
AAGCAAGTTGGAACACCTCCAGAATTTCCAACCACAATCTTCCAAGCTCGAGTTCTGGCTGGTATTCCTGATACTATAGAGGTAGAACTTTGCAGGCCAAAGCCTTCTAA
TTGA
mRNA sequence
ATGGCATCTTCACCCTACGTTTCAGACTCGGATTTTTATCAAGACGGTTACCGTGACATTACTGAAGAAGAGGAGGAGGAATATTACCGTGCCCTAAGAGAAAGCCAGGG
TTTTGATGTACCGGCTTTCCCTAAAGTCTTTGCATTTGGTATTATTATTCCATTATCTGAGGAGCTACTTTCACATAAAATTCCAGAACTTCGAGTCTACACCGAAGAAG
CTATTAAGCATTACAACCAGGAAAATGGTACAAATTTTGAAGTTGTGAAGATTGTGACGGCAAATCATCAAGCTGCCCGTGGTTTATTGTATTACATCACCTTCGAGGGG
AAGCAAGTTGGAACACCTCCAGAATTTCCAACCACAATCTTCCAAGCTCGAGTTCTGGCTGGTATTCCTGATACTATAGAGGTAGAACTTTGCAGGCCAAAGCCTTCTAA
TTGA
Protein sequence
MASSPYVSDSDFYQDGYRDITEEEEEEYYRALRESQGFDVPAFPKVFAFGIIIPLSEELLSHKIPELRVYTEEAIKHYNQENGTNFEVVKIVTANHQAARGLLYYITFEG
KQVGTPPEFPTTIFQARVLAGIPDTIEVELCRPKPSN