; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023457 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023457
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionThylakoid soluble phosphoprotein TSP
Genome locationtig00000892:3452945..3453470
RNA-Seq ExpressionSgr023457
SyntenySgr023457
Gene Ontology termsNA
InterPro domainsIPR021584 - Thylakoid soluble phosphoprotein TSP9
IPR037244 - Thylakoid soluble phosphoprotein TSP9 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143212.1 uncharacterized protein LOC101205268 [Cucumis sativus]1.0e-3281.44Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKN-GGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAAG VFAATAAKGAGE+KKEKGLLDWI+G L KD LLETDP+LQKVEGK+   G+  G+ R GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKN-GGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

XP_008464052.1 PREDICTED: uncharacterized protein LOC103502033 [Cucumis melo]4.1e-3482.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAAG VFAATAAKGAGE+KKEKGLLDWI+G L KD LLETDP+LQKVEGK+G  G+  GT RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD

XP_022947653.1 uncharacterized protein LOC111451452 [Cucurbita moschata]1.7e-3282.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASL   FSPAAG VFAATAAKGAGE KKEKG+LDWILGG+EKD LLETDP+LQKVEGKNGG +N GT R G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

XP_023532336.1 uncharacterized protein LOC111794525 [Cucurbita pepo subsp. pepo]7.8e-3382.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASL   FSPAAG VFAATAAKGAGE+KKEKG+LDWILGG+EKD LLETDP+LQKVEGKNGG +N GT R G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

XP_038901860.1 uncharacterized protein LOC120088545 [Benincasa hispida]7.8e-3381.25Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAA  VFAATAAKGAGE+KKE+GLLDWI+G L KD LLETDP+LQKVEGK   G+  GT R GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

TrEMBL top hitse value%identityAlignment
A0A0A0KC59 Uncharacterized protein4.9e-3381.44Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKN-GGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAAG VFAATAAKGAGE+KKEKGLLDWI+G L KD LLETDP+LQKVEGK+   G+  G+ R GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKN-GGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

A0A1S3CKK5 uncharacterized protein LOC1035020332.0e-3482.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAAG VFAATAAKGAGE+KKEKGLLDWI+G L KD LLETDP+LQKVEGK+G  G+  GT RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD

A0A5A7V5S9 Uncharacterized protein2.0e-3482.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD
        MASLPI FSPAAG VFAATAAKGAGE+KKEKGLLDWI+G L KD LLETDP+LQKVEGK+G  G+  GT RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNG-GGSNDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1G777 uncharacterized protein LOC1114514528.4e-3382.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASL   FSPAAG VFAATAAKGAGE KKEKG+LDWILGG+EKD LLETDP+LQKVEGKNGG +N GT R G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1L2X7 uncharacterized protein LOC1114999278.4e-3382.29Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD
        MASL   FSPAAG VFAATAAKGAGE KKEKG+LDWILGG+EKD LLETDP+LQKVEGKNGG +N GT R G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGSNDGTTR-GRKNSVQVPPKKNGGGFGGLFAKKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47070.1 LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast, chloroplast envelope; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Thylakoid soluble phosphoprotein TSP9 (InterPro:IPR021584); Has 37 Blast hits to 37 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.6e-1245.16Show/hide
Query:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEK-DHLLETDPILQKVEGKNGGGSND-GTTRGRKNSVQVP-PKKNGGGFGGL
        ++SL ++F+PA   V+A +   G+G  K+EK  +D++LG + K D   ET+P+L+KV+ K G  +   GT RG KNS   P PKK+ GGFGGL
Subjt:  MASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEK-DHLLETDPILQKVEGKNGGGSND-GTTRGRKNSVQVP-PKKNGGGFGGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACGGCCCATCGGCCCATCTACGGTTAGATCGACGTGTCGAATCCGCAGCCATGTCTGCTATCGCCTCTTATTTACTCAGCCCCACTTCTATTCCCTTCCATTTCT
ATACACCTTTCATCTTCCTCCTCTGGCCATGGCTTCTCTACCCATCACATTCTCTCCGGCCGCCGGACTGGTGTTCGCAGCCACGGCGGCGAAAGGCGCTGGCGAAACCA
AGAAAGAGAAGGGTCTTCTCGACTGGATTCTTGGAGGTCTGGAGAAGGACCACCTCCTGGAAACCGACCCGATTCTTCAAAAGGTGGAGGGGAAGAACGGCGGCGGCAGC
AATGACGGCACCACCCGCGGCCGGAAGAACTCCGTCCAAGTCCCGCCGAAGAAGAACGGCGGCGGATTCGGCGGTCTCTTTGCCAAGAAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCACGGCCCATCGGCCCATCTACGGTTAGATCGACGTGTCGAATCCGCAGCCATGTCTGCTATCGCCTCTTATTTACTCAGCCCCACTTCTATTCCCTTCCATTTCT
ATACACCTTTCATCTTCCTCCTCTGGCCATGGCTTCTCTACCCATCACATTCTCTCCGGCCGCCGGACTGGTGTTCGCAGCCACGGCGGCGAAAGGCGCTGGCGAAACCA
AGAAAGAGAAGGGTCTTCTCGACTGGATTCTTGGAGGTCTGGAGAAGGACCACCTCCTGGAAACCGACCCGATTCTTCAAAAGGTGGAGGGGAAGAACGGCGGCGGCAGC
AATGACGGCACCACCCGCGGCCGGAAGAACTCCGTCCAAGTCCCGCCGAAGAAGAACGGCGGCGGATTCGGCGGTCTCTTTGCCAAGAAAGACTGA
Protein sequenceShow/hide protein sequence
MSRPIGPSTVRSTCRIRSHVCYRLLFTQPHFYSLPFLYTFHLPPLAMASLPITFSPAAGLVFAATAAKGAGETKKEKGLLDWILGGLEKDHLLETDPILQKVEGKNGGGS
NDGTTRGRKNSVQVPPKKNGGGFGGLFAKKD