CuGenDBv2

Sgr021408 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021408
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Synechocystis YCF37
Genome location: tig00153666:827189..830727
RNA-Seq Expression: Sgr021408
Synteny: Sgr021408
Gene Ontology terms: NA
InterPro domains: NA
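
The genome location field follows a `contig:start..end` pattern. A minimal sketch of parsing it is shown below; the helper names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start/end coordinates taken from the record."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "contig:start..end".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a 'contig:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153666:827189..830727")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, `loc.span` gives the length of the locus on the contig in base pairs.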


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6571416.1 hypothetical protein SDJN03_30331, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.9e-60; identity: 84.57%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA
        MAS  S ILQILQPRCRL+ TVRAA+ K PPE+LS  RTRRQ LL LTA T A+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLETA
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA

Query:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE
        ERGIVTAE+GI  AERGIETAEK+IE+A+NFG LSQAGAVAGAEVVGVLVATSIVNGILGPE
Subjt:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE

XP_004148045.1 uncharacterized protein LOC101208092 [Cucumis sativus] (e-value: 1.0e-60; identity: 84.66%)
Query:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA
        ++A S  LQIL+PRCRLS TVRAAA K P E+LS  RTRRQ LLFLTAT TAA+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLETA
Subjt:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA

Query:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        ERGIVTAE+GI AAER IETAEK+IETA+NFG LSQAGAVAGAEVVGVL+ATSIVNGILGPEG
Subjt:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

XP_022158696.1 uncharacterized protein LOC111025160 [Momordica charantia] (e-value: 2.4e-62; identity: 86.5%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA
        MAS SS ILQIL+PRCR++TTVRAA  KL PE+LS G TRRQALLF TA TAAI  RE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKG+ETA
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA

Query:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        ERGIVTAERGI+AAER IETAEK+IETALNFG LSQAGAVAGAEVVGVLVATSIVNGILGPEG
Subjt:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

XP_022928047.1 uncharacterized protein LOC111434949 [Cucurbita moschata] (e-value: 2.6e-61; identity: 85.28%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET
        MAS  S ILQILQPRCRL+ TVRAA+ K PPE+LS  RTRRQ LL LTAT TAA+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLET
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET

Query:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE
        AERGIVTAE+GI  AERGIETAEK+IE+A+NFG LSQAGAVAGAEVVGVLVATSIVNGILGPE
Subjt:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE

XP_022971729.1 uncharacterized protein LOC111470397 [Cucurbita maxima] (e-value: 2.0e-61; identity: 84.76%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET
        MAS  S ILQILQPRCRL+ TVRAA+ K PPE+LS  RTRRQ LL LTAT TAA+VGRE PSMAE+IPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLET
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET

Query:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        AERGIVTAE+GI  AERGIETAEK+IE+A+NFG LSQAGAVAGAEVVGVLVATSIVNGILGPEG
Subjt:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LJU7 Uncharacterized protein (e-value: 4.8e-61; identity: 84.66%)
Query:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA
        ++A S  LQIL+PRCRLS TVRAAA K P E+LS  RTRRQ LLFLTAT TAA+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLETA
Subjt:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA

Query:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        ERGIVTAE+GI AAER IETAEK+IETA+NFG LSQAGAVAGAEVVGVL+ATSIVNGILGPEG
Subjt:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

A0A5D3CRC5 Synechocystis YCF37 (e-value: 7.0e-60; identity: 83.95%)
Query:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETAE
        ++A S  LQIL+PRCRLS TVRAAA K P E+LS  RTRRQ LLFLTA TAA+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLETAE
Subjt:  ASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETAE

Query:  RGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        RGIVTAE+GI AAER IETAEK+IETA++FG LSQAGAVAGAEVVGVL+ATSIVNGILGPEG
Subjt:  RGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

A0A6J1E1P9 uncharacterized protein LOC111025160 (e-value: 1.2e-62; identity: 86.5%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA
        MAS SS ILQIL+PRCR++TTVRAA  KL PE+LS G TRRQALLF TA TAAI  RE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKG+ETA
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETA

Query:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        ERGIVTAERGI+AAER IETAEK+IETALNFG LSQAGAVAGAEVVGVLVATSIVNGILGPEG
Subjt:  ERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

A0A6J1EIT8 uncharacterized protein LOC111434949 (e-value: 1.3e-61; identity: 85.28%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET
        MAS  S ILQILQPRCRL+ TVRAA+ K PPE+LS  RTRRQ LL LTAT TAA+VGRE PSMAEDIPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLET
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET

Query:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE
        AERGIVTAE+GI  AERGIETAEK+IE+A+NFG LSQAGAVAGAEVVGVLVATSIVNGILGPE
Subjt:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE

A0A6J1I9D9 uncharacterized protein LOC111470397 (e-value: 9.7e-62; identity: 84.76%)
Query:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET
        MAS  S ILQILQPRCRL+ TVRAA+ K PPE+LS  RTRRQ LL LTAT TAA+VGRE PSMAE+IPLFGLRKKLKKVEEEAEE+VREGFEAAEKGLET
Subjt:  MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTAT-TAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLET

Query:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG
        AERGIVTAE+GI  AERGIETAEK+IE+A+NFG LSQAGAVAGAEVVGVLVATSIVNGILGPEG
Subjt:  AERGIVTAERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEG

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G23670.1 homolog of Synechocystis YCF37 (e-value: 1.0e-34; identity: 58.06%)
Query:  LQPRCRLSTTVRAAAAKLPPEALS----TGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETAERGIVTA
        LQ RC  +  V   AA     ALS    TG +RR  L  LTA T  + G +  SMAE+IPLFG+RKKLKK EEEA E+V+EGFE AEKG++        A
Subjt:  LQPRCRLSTTVRAAAAKLPPEALS----TGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETAERGIVTA

Query:  ERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE
        E+G++AAERG+ETAEK+I TA++F  L+QAGAV  AE VGVLVATS+VNGILGPE
Subjt:  ERGIKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPE
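
The % identity column can be related to the alignment blocks above. In BLAST-style protein alignments, the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identical positions on that basis. The function name `percent_identity` is hypothetical, and the code assumes the segments have already been stripped of their `Query:` labels and leading indentation and joined across wrapped lines. It is an illustration, not the database's own calculation.

```python
def percent_identity(query_segment: str, midline: str) -> float:
    """Percent of alignment columns where the midline shows an identical residue.

    Assumption: query_segment covers the full alignment length (gaps included)
    and midline starts at the same column. Trailing blanks lost from the
    midline are restored by padding it to the query length.
    """
    length = len(query_segment)
    if length == 0:
        raise ValueError("empty alignment segment")
    padded = midline.ljust(length)[:length]
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / length


# Toy input, not taken from the record: two identities over four columns.
print(percent_identity("ABCD", "AB+"))
```

The toy call counts `A` and `B` as identities and treats `+` and the padded blank as non-identical.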


Sequences
CDS sequence
ATGGCTTCCGCAAGCTCATCAATCCTCCAAATTCTCCAACCCCGATGCCGCTTGAGTACCACCGTCCGTGCCGCCGCTGCAAAACTGCCGCCGGAAGCTCTCTCCACCGG
CAGAACGCGACGACAAGCGCTGTTGTTTCTCACAGCGACGACGGCAGCGATCGTCGGGAGAGAGAAACCATCAATGGCGGAGGACATCCCTCTGTTCGGGCTGAGGAAGA
AACTGAAGAAGGTGGAGGAGGAAGCGGAGGAGCTTGTGAGGGAAGGATTCGAGGCGGCGGAGAAAGGATTAGAGACGGCGGAGCGAGGGATCGTTACGGCGGAGCGGGGA
ATTAAGGCGGCAGAGAGAGGGATCGAAACGGCGGAGAAAGATATCGAAACGGCACTGAATTTTGGGCCGTTGTCGCAAGCAGGGGCGGTAGCCGGAGCGGAGGTCGTCGG
AGTTCTCGTTGCTACGTCGATTGTTAACGGTATTTTGGGTCCCGAAGGTCTCGCCCCATTAAGACTAACCTATCTAAAGGGAGTTGAGGTAGAAGAAGCCGAGGTTAGAG
GAGTTGAGAGGCTTGAAGTTGGAGGGGTTCCTACTAAGTTAACCCTACATGGGAGCTATCTTAGTGAAGGTCATTCTTAG
mRNA sequence
ATGGCTTCCGCAAGCTCATCAATCCTCCAAATTCTCCAACCCCGATGCCGCTTGAGTACCACCGTCCGTGCCGCCGCTGCAAAACTGCCGCCGGAAGCTCTCTCCACCGG
CAGAACGCGACGACAAGCGCTGTTGTTTCTCACAGCGACGACGGCAGCGATCGTCGGGAGAGAGAAACCATCAATGGCGGAGGACATCCCTCTGTTCGGGCTGAGGAAGA
AACTGAAGAAGGTGGAGGAGGAAGCGGAGGAGCTTGTGAGGGAAGGATTCGAGGCGGCGGAGAAAGGATTAGAGACGGCGGAGCGAGGGATCGTTACGGCGGAGCGGGGA
ATTAAGGCGGCAGAGAGAGGGATCGAAACGGCGGAGAAAGATATCGAAACGGCACTGAATTTTGGGCCGTTGTCGCAAGCAGGGGCGGTAGCCGGAGCGGAGGTCGTCGG
AGTTCTCGTTGCTACGTCGATTGTTAACGGTATTTTGGGTCCCGAAGGTCTCGCCCCATTAAGACTAACCTATCTAAAGGGAGTTGAGGTAGAAGAAGCCGAGGTTAGAG
GAGTTGAGAGGCTTGAAGTTGGAGGGGTTCCTACTAAGTTAACCCTACATGGGAGCTATCTTAGTGAAGGTCATTCTTAG
Protein sequence
MASASSSILQILQPRCRLSTTVRAAAAKLPPEALSTGRTRRQALLFLTATTAAIVGREKPSMAEDIPLFGLRKKLKKVEEEAEELVREGFEAAEKGLETAERGIVTAERG
IKAAERGIETAEKDIETALNFGPLSQAGAVAGAEVVGVLVATSIVNGILGPEGLAPLRLTYLKGVEVEEAEVRGVERLEVGGVPTKLTLHGSYLSEGHS
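
One way to check that the CDS and protein sequences agree is to translate the CDS with the standard genetic code. The sketch below builds the codon table by hand and uses only the first 27 and last 12 nucleotides of the CDS shown above. The helper name `translate` is hypothetical, and the use of the standard table is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and normalise case
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing characters other than T, C, A, G become 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


# First 27 nt and last 12 nt of the CDS listed in this record.
cds_start = "ATGGCTTCCGCAAGCTCATCAATCCTC"
cds_end = "GGTCATTCTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and removing the trailing `*` should give a string that can be compared directly with the protein sequence above.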