; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017167 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017167
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF1990 domain-containing protein
Genome locationtig00153031:735522..736702
RNA-Seq ExpressionSgr017167
SyntenySgr017167
Gene Ontology termsNA
InterPro domainsIPR018960 - Domain of unknown function DUF1990


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603877.1 UPF0548 protein, partial [Cucurbita argyrosperma subsp. sororia]6.2e-7392.36Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

KAG7034054.1 UPF0548 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-7492.52Show/hide
Query:  MMFINRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMP
        MMFI RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMP
Subjt:  MMFINRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMP

Query:  LQVVYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        LQ+VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  LQVVYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

XP_022950650.1 UPF0548 protein At2g17695 [Cucurbita moschata]6.2e-7392.36Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

XP_022979037.1 UPF0548 protein At2g17695 [Cucurbita maxima]6.2e-7392.36Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

XP_038883222.1 UPF0548 protein At2g17695 isoform X2 [Benincasa hispida]5.2e-7289.58Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I R+ SFNYSSKFRGATANPS+CLQED G+SQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+V+PLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+RDTK AGTCFSFGSGTL GHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

TrEMBL top hitse value%identityAlignment
A0A0A0KKV8 DUF1990 domain-containing protein2.6e-6987.59Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQ
        I R+ SFNY+SKFRGATANPSSCLQEDK G+SQEGFLLNHARILVGSGV TYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+V+PLQ
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQ

Query:  VVYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        +VYVNE+RDT    TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VVYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

A0A1S3B2T0 UPF0548 protein At2g176952.8e-7188.89Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I R+ SFNYSSKFRGATANPSSCLQEDKG+SQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+V+PLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R+T    TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

A0A6J1CKD3 UPF0548 protein At2g176955.6e-7290.97Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSS L+EDKGL QEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPV+PGVKFCVC+KEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+RD KMA TCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

A0A6J1GGC4 UPF0548 protein At2g176953.0e-7392.36Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

A0A6J1IMP9 UPF0548 protein At2g176953.0e-7392.36Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        I+RS SFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEF+PW+VMPLQ+
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYVNE+R T MA TCFSFGSGTLQGHLLAGEERFSIEMD N+QV
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

SwissProt top hitse value%identityAlignment
Q86JL6 UPF0548 protein7.0e-1128.68Show/hide
Query:  RSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWA-FVDSSTPVHPGVKFCVCAKEFVPWLVMPLQVV
        R   F YS+ +   T + ++  + +       F ++  +I +G+GVE ++K   AL+ W+HF L+W  F   +TP+  G    + +K+   W++   ++ 
Subjt:  RSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWA-FVDSSTPVHPGVKFCVCAKEFVPWLVMPLQVV

Query:  YVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIE
        Y+ +  D +     F +  GTL+ H+  GEERF IE
Subjt:  YVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIE

Q8GXB1 UPF0548 protein At2g176952.4e-5160.42Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        IN++ +FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE +PW+++PLQV
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYV+ESR ++     F +GSGTLQGHLLAGEE+FSIE+D N +V
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

Q9RST8 UPF0548 protein DR_20358.0e-0731.9Show/hide
Query:  RILVGSGVETYEKGKKALQNWRHFGLNWAF---VDSSTPV-HPGVKFCVCAKEFVPW--------LVMPLQVVYVNESRDTKMAGTCFSFGSGTLQGHLL
        R+ VG G   +E+ K AL+  + F  +W      ++STP+   G    +  + F PW         +M  +V+Y+ +  D       + FG GTL GHL+
Subjt:  RILVGSGVETYEKGKKALQNWRHFGLNWAF---VDSSTPV-HPGVKFCVCAKEFVPW--------LVMPLQVVYVNESRDTKMAGTCFSFGSGTLQGHLL

Query:  AGEERFSIEMDKNNQV
         GEERF +E D    V
Subjt:  AGEERFSIEMDKNNQV

Arabidopsis top hitse value%identityAlignment
AT2G17695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960); Has 259 Blast hits to 259 proteins in 120 species: Archae - 0; Bacteria - 197; Metazoa - 0; Fungi - 0; Plants - 57; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink).1.7e-5260.42Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        IN++ +FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE +PW+++PLQV
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYV+ESR ++     F +GSGTLQGHLLAGEE+FSIE+D N +V
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

AT2G17695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.7e-5260.42Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        IN++ +FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE +PW+++PLQV
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYV+ESR ++     F +GSGTLQGHLLAGEE+FSIE+D N +V
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV

AT2G17695.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960).1.7e-5260.42Show/hide
Query:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV
        IN++ +FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE +PW+++PLQV
Subjt:  INRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQV

Query:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV
        VYV+ESR ++     F +GSGTLQGHLLAGEE+FSIE+D N +V
Subjt:  VYVNESRDTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTCATCAATAGGTCTGTTTCCTTCAACTACAGCAGCAAGTTTAGAGGAGCCACTGCTAATCCCAGCTCTTGCCTTCAAGAAGATAAGGGGCTTTCACAAGAAGG
TTTTCTTCTCAACCATGCTCGTATTTTGGTGGGTTCTGGTGTTGAGACTTATGAAAAGGGGAAGAAAGCTCTTCAGAACTGGAGGCATTTTGGATTGAATTGGGCATTTG
TTGATTCCTCAACACCAGTTCATCCGGGAGTGAAGTTTTGTGTCTGTGCCAAGGAGTTCGTTCCATGGCTGGTGATGCCTCTTCAGGTTGTATATGTAAATGAGAGCAGG
GACACCAAGATGGCTGGGACGTGTTTCAGTTTTGGCAGCGGTACCCTTCAAGGCCATCTTCTGGCCGGTGAAGAACGCTTTTCAATTGAGATGGACAAGAACAACCAAGT
GTG
mRNA sequenceShow/hide mRNA sequence
ATGATGTTCATCAATAGGTCTGTTTCCTTCAACTACAGCAGCAAGTTTAGAGGAGCCACTGCTAATCCCAGCTCTTGCCTTCAAGAAGATAAGGGGCTTTCACAAGAAGG
TTTTCTTCTCAACCATGCTCGTATTTTGGTGGGTTCTGGTGTTGAGACTTATGAAAAGGGGAAGAAAGCTCTTCAGAACTGGAGGCATTTTGGATTGAATTGGGCATTTG
TTGATTCCTCAACACCAGTTCATCCGGGAGTGAAGTTTTGTGTCTGTGCCAAGGAGTTCGTTCCATGGCTGGTGATGCCTCTTCAGGTTGTATATGTAAATGAGAGCAGG
GACACCAAGATGGCTGGGACGTGTTTCAGTTTTGGCAGCGGTACCCTTCAAGGCCATCTTCTGGCCGGTGAAGAACGCTTTTCAATTGAGATGGACAAGAACAACCAAGT
GTG
Protein sequenceShow/hide protein sequence
MMFINRSVSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFVPWLVMPLQVVYVNESR
DTKMAGTCFSFGSGTLQGHLLAGEERFSIEMDKNNQVX