; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004541 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004541
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionExopolysaccharide production negative regulator
Genome locationtig00003038:1279..14206
RNA-Seq ExpressionSgr004541
SyntenySgr004541
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596694.1 hypothetical protein SDJN03_09874, partial [Cucurbita argyrosperma subsp. sororia]8.4e-7890.36Show/hide
Query:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP
        SF L   S+  LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKD FSGKAR+ALAEAP
Subjt:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP

Query:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        NEALPHKCRPNFGAAWLAKDKFKVNETY CWYSS IS V+LDYDGFSSCQAQEP+KVEMIKRYYFL
Subjt:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

XP_004146124.1 uncharacterized protein LOC101211843 [Cucumis sativus]7.6e-7992.99Show/hide
Query:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR
        + LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKAR+ALAEAPNEALPHKCR
Subjt:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR

Query:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        PNFGAAWLAK KFKVNETYDCWYSSGISKV+LDYDGFS CQAQEP+ +EMIKRYYFL
Subjt:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

XP_022145339.1 uncharacterized protein LOC111014820 [Momordica charantia]2.6e-7994.27Show/hide
Query:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR
        + LVAVLVANSSI N ISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARI LAEAP+EALPHKCR
Subjt:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR

Query:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        PNFGAAWLAKDKFKVNETYDCWYSSGISKV+LDYDGFSSCQ+QEP+KVEMIKRYYFL
Subjt:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

XP_022946874.1 uncharacterized protein LOC111450810 [Cucurbita moschata]8.4e-7890.36Show/hide
Query:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP
        SF L   S+  LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKD FSGKAR+ALAEAP
Subjt:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP

Query:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        NEALPHKCRPNFGAAWLAKDKFKVNETY CWYSS IS V+LDYDGFSSCQAQEP+KVEMIKRYYFL
Subjt:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

XP_038876581.1 uncharacterized protein LOC120069002 [Benincasa hispida]3.4e-7984.75Show/hide
Query:  CAMVCRAYASSSFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFS
        CA +     S  F+      + LVA+ VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFS
Subjt:  CAMVCRAYASSSFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFS

Query:  GKARIALAEAPNEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        GK+R+ALAEAPNEALPHKCRPNFGAAWLAK KFKVNETYDCWYSSGISKV+LDYDGFSSCQAQEP+K+EMIKRYYFL
Subjt:  GKARIALAEAPNEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

TrEMBL top hitse value%identityAlignment
A0A0A0L6A9 Uncharacterized protein3.7e-7992.99Show/hide
Query:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR
        + LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKAR+ALAEAPNEALPHKCR
Subjt:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR

Query:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        PNFGAAWLAK KFKVNETYDCWYSSGISKV+LDYDGFS CQAQEP+ +EMIKRYYFL
Subjt:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

A0A1S3BKP1 uncharacterized protein LOC1034907225.3e-7884.18Show/hide
Query:  CAMVCRAYASSSFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFS
        CA +     S  F+      + LVAV VANS IT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYP+ERNKFRCRYDYYWASVFKVEMKDHFS
Subjt:  CAMVCRAYASSSFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFS

Query:  GKARIALAEAPNEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        GKAR+ALAEAPNEALPHKCRPNFGAAWLAK KFKVNETYDCWYSSGISKV+LDYDGFSSCQAQEP+ +EMIKRYYFL
Subjt:  GKARIALAEAPNEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

A0A6J1CUY3 uncharacterized protein LOC1110148201.3e-7994.27Show/hide
Query:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR
        + LVAVLVANSSI N ISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARI LAEAP+EALPHKCR
Subjt:  VELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCR

Query:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        PNFGAAWLAKDKFKVNETYDCWYSSGISKV+LDYDGFSSCQ+QEP+KVEMIKRYYFL
Subjt:  PNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

A0A6J1G561 uncharacterized protein LOC1114508104.1e-7890.36Show/hide
Query:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP
        SF L   S+  LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKD FSGKAR+ALAEAP
Subjt:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP

Query:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        NEALPHKCRPNFGAAWLAKDKFKVNETY CWYSS IS V+LDYDGFSSCQAQEP+KVEMIKRYYFL
Subjt:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

A0A6J1KU51 uncharacterized protein LOC1114982744.1e-7890.36Show/hide
Query:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP
        SF L   S+  LVAV VANSSIT+PISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKD FSGKAR+ALAEAP
Subjt:  SFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAP

Query:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL
        NEALPHKCRPNFGAAWLAKDKFKVNETY CWYSS IS V+LDYDGFSSCQAQEP+KVEMIKRYYFL
Subjt:  NEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19140.1 unknown protein2.9e-5263.87Show/hide
Query:  IVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKC
        +V  +  L   SS +   SL S+CKIVSSSVDLRSSKVC +GLLN KA++VFYP+ER+KFRCRYDYYWASVFKVE KD+  G+ R+A +EAPNEALP +C
Subjt:  IVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKC

Query:  RPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRY
        RPNFGAA L KD FKVNETYDCWY+ GI K+ L  D F  CQA + S  ++ K+Y
Subjt:  RPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGGCACCACCAGTAGAACAGAAGCTCTATTTGATACATCCCTCCAACGGGATGAGCAGGACCACTCATGATACTTGGACAGTTAATCGAGCTGTGACCAGAGCTCCCACA
GCTGTTGCAGTACGTGTGGTTATGAAGTCCAGGATTATGTATAATGCTTTCCGCCTTGGGGGTAATAGAACTAGTGTTCTCTGCCTAAGCTTTTCCATCAAATCTAGAAG
AAACCTGGATTCCGTTGAAAAATTCACAGTTTCTGCTTTCACCACGGTTGACCTTACTCGCTGTTCATCACTAAAAGTTTCTTCCTTTATTTTCAACTTCATAATAAATT
TTGTGAAAAGAACCTTACGGATGATTTCTGCAATTTCTCATCATCTTGCTCTTCGTATTTCAAATAATACAATCTTTTTGCAGTTACCCAAGTTAAGCCAGTATGGTCCT
GTATTTGCAGCTGGAAAACTTCCCTGGATATGGAGAAAGAAGGAGTACTCCTCCCTTCCTTCTCAAACCAGTCTCTCAGCCTACGAGCCTCAGGACCATCTGGCTCTACA
AAAAGCTGACTTGTTGATATGGTTTCAGGCTTCCTTTAGAAATGAAGTAGACTTTACCAGATTCAATCTGGTTGTAAAACTGATCAGCTACCATATTGAAGCACGCCCCA
CAGAGGATTGGGGATTTCCACTGAATGCACCTGTTACTGATCTTGTTGCCGGACGAGGAGGTCTCAACCTCGGAGAGGACGACGAAGGGGGGTTATCTAGCTTTAATTGT
GCGATGGTGTGCCGTGCTTATGCTTCCAGTAGTTTCATTCTTTTTTGTGACTCTATCGTTGAGCTTGTTGCTGTGCTTGTCGCCAATTCGTCAATTACTAATCCCATTTC
TTTGCGCTCTCAGTGCAAGATTGTTTCCAGCAGTGTGGACCTTAGGTCATCTAAGGTTTGTGAACTTGGACTACTGAATTATAAAGCCAAGAATGTTTTCTACCCCTATG
AAAGAAATAAGTTTAGATGCCGTTATGATTACTACTGGGCGTCAGTATTCAAGGTAGAAATGAAGGATCATTTTTCTGGAAAGGCTCGGATTGCTTTGGCAGAGGCTCCA
AATGAGGCCCTTCCTCATAAATGCAGGCCTAATTTTGGTGCTGCGTGGTTGGCTAAAGATAAATTCAAGGTAAATGAAACATATGACTGCTGGTACTCATCTGGCATTTC
CAAAGTGAACTTAGACTATGATGGGTTTTCCAGTTGTCAAGCTCAAGAACCTTCAAAAGTTGAGATGATTAAAAGATACTACTTCCTCCGGAAGATTTAG
mRNA sequenceShow/hide mRNA sequence
AGGCACCACCAGTAGAACAGAAGCTCTATTTGATACATCCCTCCAACGGGATGAGCAGGACCACTCATGATACTTGGACAGTTAATCGAGCTGTGACCAGAGCTCCCACA
GCTGTTGCAGTACGTGTGGTTATGAAGTCCAGGATTATGTATAATGCTTTCCGCCTTGGGGGTAATAGAACTAGTGTTCTCTGCCTAAGCTTTTCCATCAAATCTAGAAG
AAACCTGGATTCCGTTGAAAAATTCACAGTTTCTGCTTTCACCACGGTTGACCTTACTCGCTGTTCATCACTAAAAGTTTCTTCCTTTATTTTCAACTTCATAATAAATT
TTGTGAAAAGAACCTTACGGATGATTTCTGCAATTTCTCATCATCTTGCTCTTCGTATTTCAAATAATACAATCTTTTTGCAGTTACCCAAGTTAAGCCAGTATGGTCCT
GTATTTGCAGCTGGAAAACTTCCCTGGATATGGAGAAAGAAGGAGTACTCCTCCCTTCCTTCTCAAACCAGTCTCTCAGCCTACGAGCCTCAGGACCATCTGGCTCTACA
AAAAGCTGACTTGTTGATATGGTTTCAGGCTTCCTTTAGAAATGAAGTAGACTTTACCAGATTCAATCTGGTTGTAAAACTGATCAGCTACCATATTGAAGCACGCCCCA
CAGAGGATTGGGGATTTCCACTGAATGCACCTGTTACTGATCTTGTTGCCGGACGAGGAGGTCTCAACCTCGGAGAGGACGACGAAGGGGGGTTATCTAGCTTTAATTGT
GCGATGGTGTGCCGTGCTTATGCTTCCAGTAGTTTCATTCTTTTTTGTGACTCTATCGTTGAGCTTGTTGCTGTGCTTGTCGCCAATTCGTCAATTACTAATCCCATTTC
TTTGCGCTCTCAGTGCAAGATTGTTTCCAGCAGTGTGGACCTTAGGTCATCTAAGGTTTGTGAACTTGGACTACTGAATTATAAAGCCAAGAATGTTTTCTACCCCTATG
AAAGAAATAAGTTTAGATGCCGTTATGATTACTACTGGGCGTCAGTATTCAAGGTAGAAATGAAGGATCATTTTTCTGGAAAGGCTCGGATTGCTTTGGCAGAGGCTCCA
AATGAGGCCCTTCCTCATAAATGCAGGCCTAATTTTGGTGCTGCGTGGTTGGCTAAAGATAAATTCAAGGTAAATGAAACATATGACTGCTGGTACTCATCTGGCATTTC
CAAAGTGAACTTAGACTATGATGGGTTTTCCAGTTGTCAAGCTCAAGAACCTTCAAAAGTTGAGATGATTAAAAGATACTACTTCCTCCGGAAGATTTAG
Protein sequenceShow/hide protein sequence
APPVEQKLYLIHPSNGMSRTTHDTWTVNRAVTRAPTAVAVRVVMKSRIMYNAFRLGGNRTSVLCLSFSIKSRRNLDSVEKFTVSAFTTVDLTRCSSLKVSSFIFNFIINF
VKRTLRMISAISHHLALRISNNTIFLQLPKLSQYGPVFAAGKLPWIWRKKEYSSLPSQTSLSAYEPQDHLALQKADLLIWFQASFRNEVDFTRFNLVVKLISYHIEARPT
EDWGFPLNAPVTDLVAGRGGLNLGEDDEGGLSSFNCAMVCRAYASSSFILFCDSIVELVAVLVANSSITNPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYE
RNKFRCRYDYYWASVFKVEMKDHFSGKARIALAEAPNEALPHKCRPNFGAAWLAKDKFKVNETYDCWYSSGISKVNLDYDGFSSCQAQEPSKVEMIKRYYFLRKI