; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023669 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023669
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00000892:5437051..5437521
RNA-Seq ExpressionSgr023669
SyntenySgr023669
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570883.1 hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia]6.6e-3363.58Show/hide
Query:  MGN-CLKSNKVMAQDEP--SPSPLPPTETDKV-DKPAGGSALARQKTEEARS-AARGKKVVRFKLQ-ENENSGDGKVIVGRSGDGSGARGGVLRIKVVVS
        MGN C KSNKVMAQDE   + S  PP E  KV +KP  GSA+A+ KT E RS AA GKKVVRFKLQ E+ENSG      G  GDG   R GVLRIKVV+S
Subjt:  MGN-CLKSNKVMAQDEP--SPSPLPPTETDKV-DKPAGGSALARQKTEEARS-AARGKKVVRFKLQ-ENENSGDGKVIVGRSGDGSGARGGVLRIKVVVS

Query:  QKELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH
        Q+ELKQILK+ +N+S +LEE++AE K+KGR T+SDA T  DE EDENGS RP+LE IPE LH
Subjt:  QKELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH

KGN63254.1 hypothetical protein Csa_022493 [Cucumis sativus]1.1e-2455.06Show/hide
Query:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQ
        MGN C KSNKVMAQD+ S    PP    E  KV  +P  GSA+A+ K       A GKKVVRF LQE E   +G+     SGD      GVLRIKVV+SQ
Subjt:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQ

Query:  KELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE
        KELKQILK R+NNSC+LEE++ ELK+KGRAT   A        DE GSW+P+LE IPE
Subjt:  KELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE

TYK24218.1 hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa]3.3e-2452.15Show/hide
Query:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK
        MGN C ++NKVMAQD+ S   LPP    E +KV++       A  K +     A GKKVVRF LQE E   + +     SGD SGA  GVLRIKVV+SQK
Subjt:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK

Query:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH
        ELK+ILK+R+NNSC+LEE++ ELK+KGRAT            DE GSW+P+LE IPE   DLH
Subjt:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH

XP_022140639.1 uncharacterized protein LOC111011249 [Momordica charantia]2.8e-3968.12Show/hide
Query:  MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK
        MGNCL++N+VMAQDE   PSP S L  T     DKPA GSALAR KTEEAR AAR KKVVRF+ +E+E SG G              GGVLRIKVVVSQK
Subjt:  MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK

Query:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH
        ELKQILKDR++NS TLEE+LAELKMKGR TISDA+   D +EDENGSWRP+LESIPEDLH
Subjt:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH

XP_038902397.1 uncharacterized protein LOC120089037 [Benincasa hispida]1.6e-3462.34Show/hide
Query:  NCLKSNKVMAQDEPSPSPLPPTETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENE--NSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELK
        NC KSNKVMAQDEP    LPP E  KV +KP  GSA+A+ KT EAR+    KKVVRFKLQE E  NSGD               GGVLRIKVV+SQKELK
Subjt:  NCLKSNKVMAQDEPSPSPLPPTETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENE--NSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELK

Query:  QILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE
        Q+LKDR+NNSCTLEE++ ELK+KGR TISD +   D  EDENG W+P LE IPE
Subjt:  QILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE

TrEMBL top hitse value%identityAlignment
A0A0A0LQE9 Uncharacterized protein5.4e-2555.06Show/hide
Query:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQ
        MGN C KSNKVMAQD+ S    PP    E  KV  +P  GSA+A+ K       A GKKVVRF LQE E   +G+     SGD      GVLRIKVV+SQ
Subjt:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKV-DKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQ

Query:  KELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE
        KELKQILK R+NNSC+LEE++ ELK+KGRAT   A        DE GSW+P+LE IPE
Subjt:  KELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE

A0A5D3DKZ8 Uncharacterized protein1.6e-2452.15Show/hide
Query:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK
        MGN C ++NKVMAQD+ S   LPP    E +KV++       A  K +     A GKKVVRF LQE E   + +     SGD SGA  GVLRIKVV+SQK
Subjt:  MGN-CLKSNKVMAQDEPSPSPLPP---TETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK

Query:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH
        ELK+ILK+R+NNSC+LEE++ ELK+KGRAT            DE GSW+P+LE IPE   DLH
Subjt:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE---DLH

A0A6J1B1M6 uncharacterized protein LOC1104232931.4e-1239.1Show/hide
Query:  MGNCLKSNKVMAQ-DEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELK
        MGNCL SNK++AQ D+P P        ++  K         +   +     + KK+VRFKL E EN  DG    GR G+   ++ GV+RI++VV+QKELK
Subjt:  MGNCLKSNKVMAQ-DEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELK

Query:  QILKDR-DNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPED
        QIL  R D    +LE ++  +K++G       +T  ++D+  +G WRP+LESIPE+
Subjt:  QILKDR-DNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPED

A0A6J1CG85 uncharacterized protein LOC1110112491.3e-3968.12Show/hide
Query:  MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK
        MGNCL++N+VMAQDE   PSP S L  T     DKPA GSALAR KTEEAR AAR KKVVRF+ +E+E SG G              GGVLRIKVVVSQK
Subjt:  MGNCLKSNKVMAQDE---PSP-SPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQK

Query:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH
        ELKQILKDR++NS TLEE+LAELKMKGR TISDA+   D +EDENGSWRP+LESIPEDLH
Subjt:  ELKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH

A0A7N2MA05 Uncharacterized protein4.3e-1444.52Show/hide
Query:  MGNCLKSNKVMAQDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQ
        MGNCL SNK +AQ+E  P      E  +  KP+  S L   K  +     + KKVVRFKL+E++ +      VG S +G  +R GV+RI+VVV+QKELKQ
Subjt:  MGNCLKSNKVMAQDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQ

Query:  ILKDRDN-NSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPED
        IL  ++     ++E+++  L ++GR  IS+ +T  DEDE  N +WRP+LESIPED
Subjt:  ILKDRDN-NSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G21680.1 unknown protein5.1e-0732.69Show/hide
Query:  MGNCLKSNKVMA---QDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKE
        MGNCL+ +  +A   +D+  P PL                   +  EE +++ RG+       +E+E S +                 V+RIKVVV++KE
Subjt:  MGNCLKSNKVMA---QDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKE

Query:  LKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE
        L+QIL  + N   ++++++  LK  GR  IS A  +EDE E+ + +WRP+LESIPE
Subjt:  LKQILKDRDNNSCTLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACTGCTTGAAGAGCAACAAAGTGATGGCCCAAGATGAGCCTTCGCCTTCGCCTTTGCCTCCGACAGAAACTGATAAAGTAGATAAACCAGCGGGCGGATCGGC
GCTGGCGCGGCAGAAGACGGAGGAGGCGAGAAGCGCTGCGCGAGGTAAGAAGGTGGTGAGGTTTAAGCTACAAGAAAATGAAAATTCCGGCGACGGAAAAGTGATCGTCG
GCAGAAGTGGCGACGGCTCCGGAGCCAGAGGCGGAGTATTGAGGATTAAAGTGGTGGTGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGACAACAATTCCTGC
ACCTTGGAGGAAATGTTAGCTGAATTGAAGATGAAAGGCAGGGCGACAATTTCAGATGCTAAAACCGATGAAGATGAAGACGAGGATGAAAATGGAAGCTGGAGGCCGTC
TTTGGAAAGTATTCCTGAGGATCTCCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACTGCTTGAAGAGCAACAAAGTGATGGCCCAAGATGAGCCTTCGCCTTCGCCTTTGCCTCCGACAGAAACTGATAAAGTAGATAAACCAGCGGGCGGATCGGC
GCTGGCGCGGCAGAAGACGGAGGAGGCGAGAAGCGCTGCGCGAGGTAAGAAGGTGGTGAGGTTTAAGCTACAAGAAAATGAAAATTCCGGCGACGGAAAAGTGATCGTCG
GCAGAAGTGGCGACGGCTCCGGAGCCAGAGGCGGAGTATTGAGGATTAAAGTGGTGGTGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGACAACAATTCCTGC
ACCTTGGAGGAAATGTTAGCTGAATTGAAGATGAAAGGCAGGGCGACAATTTCAGATGCTAAAACCGATGAAGATGAAGACGAGGATGAAAATGGAAGCTGGAGGCCGTC
TTTGGAAAGTATTCCTGAGGATCTCCATTAG
Protein sequenceShow/hide protein sequence
MGNCLKSNKVMAQDEPSPSPLPPTETDKVDKPAGGSALARQKTEEARSAARGKKVVRFKLQENENSGDGKVIVGRSGDGSGARGGVLRIKVVVSQKELKQILKDRDNNSC
TLEEMLAELKMKGRATISDAKTDEDEDEDENGSWRPSLESIPEDLH