CuGenDBv2

Sgr029837 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029837
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DUF4228 domain-containing protein
Genome location: tig00153533:1784524..1794288
RNA-Seq Expression: Sgr029837
Synteny: Sgr029837
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
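
The genome location above uses a `contig:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the span assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'contig:start..end' into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The page itself does not confirm this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("tig00153533:1784524..1794288")
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 9765 bases on the contig.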


Homology
GenBank top hits | e value | % identity | Alignment
KAG7029841.1 hypothetical protein SDJN02_08184, partial [Cucurbita argyrosperma subsp. argyrosperma] | 8.4e-34 | 54.14
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS
        MGGC+S RSSS+ +++  D +++VH+NG+VQHFH P++ R VT   PP  E+F+ T AQLVS   SPALNPDA+L PG +YF+LP STLHPDVSP DL+S
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS

Query:  IARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEKSI
        IARKLTAAAKSA +P     G     + P+        +  SR W+P LDTI+EK++
Subjt:  IARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEKSI

XP_008460258.1 PREDICTED: uncharacterized protein LOC103499134 [Cucumis melo] | 3.8e-34 | 55.88
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS
        MGGC+S RSSS +++   D V+VVH+NG+VQHFH P++ R V    PP  E+F+CT AQLVS  +SPAL+PDA+L PG +YF+LP STLHPDVS  DLAS
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS

Query:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER
        IAR+LTAAAKSA K  S PP               G      SR WRPLLDTIKEK       IE D ER
Subjt:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER

XP_011650123.1 uncharacterized protein LOC105434722 [Cucumis sativus] | 3.8e-34 | 55.29
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP---EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLA
        MGGC+S RSSS++++++ D V+VVH+NG+VQHFH P++ R V AG+PP   E+F+CT AQLVS  +SPALNPD +L PG +YF+LPLSTLHPDVS  DLA
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP---EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLA

Query:  SIARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER
        SIAR+LTAAAKSA K  S P              +    +  SR WRPLLDTI+EK       I+ D ER
Subjt:  SIARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER

XP_023547215.1 uncharacterized protein LOC111806092 [Cucurbita pepo subsp. pepo] | 3.2e-33 | 53.5
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAG--KPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS
        MGGC+S RSSS+ +++  D +++VH+NG+VQHFH P++ R VT    +P E+F+ T AQLVS   SPALNPDA+L PG +YF+LP STLHPDVSP DL+S
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAG--KPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS

Query:  IARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEKSI
        IARKLTAAAKSA +P     G     + P+           SR W+P LDTI+EK++
Subjt:  IARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEKSI

XP_024925481.1 uncharacterized protein LOC112490182 [Ziziphus jujuba] | 9.3e-33 | 51.55
Query:  MGGCLSCRSSSSSSSS------SFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPM
        MG C SCRSSSSSSSS      S   VRVVH+NGYV+ F  PVSV  +T GKP +HF+CTPAQL+S+ S P L  DALL PG IY +LP S L  DVSP+
Subjt:  MGGCLSCRSSSSSSSS------SFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPM

Query:  DLASIARKLTAAAKSA------------GKPASPPHGPQ------TSPRRPS----------LMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE
        DLASI RKLT+AAKS+              P   PHG +       SP R S          LM  G QRS  +RSW+P+LDTI+E+S    SE
Subjt:  DLASIARKLTAAAKSA------------GKPASPPHGPQ------TSPRRPS----------LMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LHC0 Uncharacterized protein | 1.8e-34 | 55.29
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP---EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLA
        MGGC+S RSSS++++++ D V+VVH+NG+VQHFH P++ R V AG+PP   E+F+CT AQLVS  +SPALNPD +L PG +YF+LPLSTLHPDVS  DLA
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP---EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLA

Query:  SIARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER
        SIAR+LTAAAKSA K  S P              +    +  SR WRPLLDTI+EK       I+ D ER
Subjt:  SIARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER

A0A1S3CC70 uncharacterized protein LOC103499134 | 1.8e-34 | 55.88
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS
        MGGC+S RSSS +++   D V+VVH+NG+VQHFH P++ R V    PP  E+F+CT AQLVS  +SPAL+PDA+L PG +YF+LP STLHPDVS  DLAS
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS

Query:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER
        IAR+LTAAAKSA K  S PP               G      SR WRPLLDTIKEK       IE D ER
Subjt:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER

A0A5D3CAJ8 DUF4228 domain-containing protein | 1.8e-34 | 55.88
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS
        MGGC+S RSSS +++   D V+VVH+NG+VQHFH P++ R V    PP  E+F+CT AQLVS  +SPAL+PDA+L PG +YF+LP STLHPDVS  DLAS
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPP--EHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLAS

Query:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER
        IAR+LTAAAKSA K  S PP               G      SR WRPLLDTIKEK       IE D ER
Subjt:  IARKLTAAAKSAGKPAS-PPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEK------SIEPDSER

A0A5N6R7R3 Uncharacterized protein | 1.0e-32 | 53.22
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLASIA
        MG C SCR  SS        +R+VH+NGYV+ F HPVSV  VT GKP +HF+CTPAQL+ A S P L PD  L  G IYF+LP S L  DVSP+DL SI 
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLASIA

Query:  RKLTAAAKSAGK------PASPPHGPQTSPRR-----PSLMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE
        +KLTA AK++G+       +S    P TSP R       +M YG+QRS  +RSWRP+LDTIKEKS    SE
Subjt:  RKLTAAAKSAGK------PASPPHGPQTSPRR-----PSLMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE

A0A6P6FV23 uncharacterized protein LOC112490182 | 4.5e-33 | 51.55
Query:  MGGCLSCRSSSSSSSS------SFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPM
        MG C SCRSSSSSSSS      S   VRVVH+NGYV+ F  PVSV  +T GKP +HF+CTPAQL+S+ S P L  DALL PG IY +LP S L  DVSP+
Subjt:  MGGCLSCRSSSSSSSS------SFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPM

Query:  DLASIARKLTAAAKSA------------GKPASPPHGPQ------TSPRRPS----------LMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE
        DLASI RKLT+AAKS+              P   PHG +       SP R S          LM  G QRS  +RSW+P+LDTI+E+S    SE
Subjt:  DLASIARKLTAAAKSA------------GKPASPPHGPQ------TSPRRPS----------LMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G06980.1 unknown protein | 5.7e-04 | 27.33
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGK----PPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDL
        MG  L C  +      + D +R+VH+NGYV+        R +TAG+     P H L  P           L+P++ L  G+IYF++P S+L P+      
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGK----PPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDL

Query:  ASIARKLTAAAKSAGKPASPPHGPQ----------------TSPRRPSLMMYGSQRSPWSRSWRPLLDTIKE
         +  RK      SA        G +                 S  +         RS    +WRPLLD+I E
Subjt:  ASIARKLTAAAKSAGKPASPPHGPQ----------------TSPRRPSLMMYGSQRSPWSRSWRPLLDTIKE

AT1G76600.1 unknown protein | 1.1e-04 | 29.85
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHV---------TAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDV
        MG C+S   +   SSS+    ++V ING ++ +  PV    V         ++     +FLC    L      PA+  D +L    IYF+LP+S     +
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHV---------TAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDV

Query:  SPMDLASIARKLTAA-AKSAGKPASPPHGPQTSP
        S  D+A++A K + A  K+AGK        + SP
Subjt:  SPMDLASIARKLTAA-AKSAGKPASPPHGPQTSP

AT2G30230.1 unknown protein | 9.7e-04 | 28.92
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLP
        MG  L C  +      + D +R+VH+NG+V     P++   +     P H L  P           L+P++ L  G+IYF++P
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLP

AT5G66580.1 unknown protein | 2.6e-04 | 27.93
Query:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLASIA
        MG C S  S  S      D+ +++ ++G +Q F  PV V  +   K P  F+C   ++    +  A+  +  L  G +YF+LPL+ L+  +   ++A++A
Subjt:  MGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLASIA

Query:  RKLTAAAKSAG
         K ++A   +G
Subjt:  RKLTAAAKSAG


Sequences
CDS sequence
ATGCTAATCAATGGGCAAAGGGAGTACTCTCTAATCTGGTCGGAGGCTGATTGCATAACAGAAACGGTCTCGATTACTTCGTTTCAAGGAGTTTCCTCTCTGTTAATGTG
CGAAAAGAGTTGCATTACTCCGTGCAGGTTGCCTGTTCTTCTTCTTCGGAGTAGATCGAGCTCTCGAGGCTCCACCTACCAGTGTGCTCGGAAGGTCCTGTTTGAGTTTT
CAATGAAGCGGAAAAGGCCCAACCAAGCCCTTTTGAAAGTGAAAACAACTGCTGCACTGCACCTGGATGGAACTCAACTTCCCAGAACCAATTCTCCAGTTCCTGATTTC
ACAGACACAGGAGAGAGAGAGAATTCCAGAGAGAGAGAGATGGGTGGGTGTCTCTCCTGCCGATCATCTTCTTCTTCTTCTTCCAGCTCCTTCGACAACGTCCGAGTGGT
TCACATCAACGGCTACGTCCAACACTTCCACCACCCAGTTAGCGTCCGCCACGTCACTGCCGGAAAGCCGCCGGAGCACTTCCTCTGCACTCCTGCTCAGCTGGTCTCCG
CCCCTTCCAGTCCGGCCTTGAACCCCGACGCCCTTCTCCATCCCGGCAACATCTACTTCATGCTTCCTCTCTCCACTCTCCACCCCGACGTATCTCCCATGGACTTGGCT
TCCATAGCCAGAAAGCTCACCGCTGCCGCCAAATCGGCCGGAAAGCCCGCCTCACCGCCTCATGGGCCCCAGACGAGTCCCCGCAGGCCGAGTTTGATGATGTACGGATC
GCAGAGGTCACCGTGGTCGCGATCATGGAGGCCATTGTTAGACACCATTAAGGAGAAAAGCATTGAACCAGATTCGGAGCGAGTCGAATTTGCAAGATAG
mRNA sequence
ATGCTAATCAATGGGCAAAGGGAGTACTCTCTAATCTGGTCGGAGGCTGATTGCATAACAGAAACGGTCTCGATTACTTCGTTTCAAGGAGTTTCCTCTCTGTTAATGTG
CGAAAAGAGTTGCATTACTCCGTGCAGGTTGCCTGTTCTTCTTCTTCGGAGTAGATCGAGCTCTCGAGGCTCCACCTACCAGTGTGCTCGGAAGGTCCTGTTTGAGTTTT
CAATGAAGCGGAAAAGGCCCAACCAAGCCCTTTTGAAAGTGAAAACAACTGCTGCACTGCACCTGGATGGAACTCAACTTCCCAGAACCAATTCTCCAGTTCCTGATTTC
ACAGACACAGGAGAGAGAGAGAATTCCAGAGAGAGAGAGATGGGTGGGTGTCTCTCCTGCCGATCATCTTCTTCTTCTTCTTCCAGCTCCTTCGACAACGTCCGAGTGGT
TCACATCAACGGCTACGTCCAACACTTCCACCACCCAGTTAGCGTCCGCCACGTCACTGCCGGAAAGCCGCCGGAGCACTTCCTCTGCACTCCTGCTCAGCTGGTCTCCG
CCCCTTCCAGTCCGGCCTTGAACCCCGACGCCCTTCTCCATCCCGGCAACATCTACTTCATGCTTCCTCTCTCCACTCTCCACCCCGACGTATCTCCCATGGACTTGGCT
TCCATAGCCAGAAAGCTCACCGCTGCCGCCAAATCGGCCGGAAAGCCCGCCTCACCGCCTCATGGGCCCCAGACGAGTCCCCGCAGGCCGAGTTTGATGATGTACGGATC
GCAGAGGTCACCGTGGTCGCGATCATGGAGGCCATTGTTAGACACCATTAAGGAGAAAAGCATTGAACCAGATTCGGAGCGAGTCGAATTTGCAAGATAG
Protein sequence
MLINGQREYSLIWSEADCITETVSITSFQGVSSLLMCEKSCITPCRLPVLLLRSRSSSRGSTYQCARKVLFEFSMKRKRPNQALLKVKTTAALHLDGTQLPRTNSPVPDF
TDTGERENSREREMGGCLSCRSSSSSSSSSFDNVRVVHINGYVQHFHHPVSVRHVTAGKPPEHFLCTPAQLVSAPSSPALNPDALLHPGNIYFMLPLSTLHPDVSPMDLA
SIARKLTAAAKSAGKPASPPHGPQTSPRRPSLMMYGSQRSPWSRSWRPLLDTIKEKSIEPDSERVEFAR
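
The protein sequence above corresponds to a translation of the CDS. As a rough sketch of that relationship, the snippet below translates a DNA string with the standard genetic code, stopping at the first stop codon. The function name `translate` and the choice of the standard code table are assumptions for illustration; the page does not say which tool or table the database used.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First five codons of the CDS listed on this page.
print(translate("ATGCTAATCAATGGG"))  # MLING
```

Passing the full CDS (with its line breaks) to `translate` should give a sequence that can be compared against the listed protein.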