CuGenDBv2

Sgr013039 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr013039
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: glycine-rich cell wall structural protein 1.8-like
Genome location: tig00153652:6451..7248
RNA-Seq Expression: Sgr013039
Synteny: Sgr013039
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6596726.1 hypothetical protein SDJN03_09906, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.9e-34; % identity: 75.66)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGYG GAGS+LGGSGYGSGGG GGGSGY  
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG
        VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG  GG   G
Subjt:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG

XP_022145473.1 glycine-rich cell wall structural protein 2-like [Momordica charantia] (e value: 1.3e-33; % identity: 67.26)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MA  R  SLGFLLLVG GLASAAR LL+YDPP  H  VGYDYDRPV NP+VGYDP+HHD PYG YGGGAGGGYG G GSALGGSGYGSGGG GGGSGYG 
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRS
          DHG GYGS                       GGG+G GYGDV G GKGYGSG GGG GSG G G S
Subjt:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRS

XP_022924241.1 glycine-rich cell wall structural protein 1.8-like [Cucurbita moschata] (e value: 2.5e-32; % identity: 75)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGYG GAGS+LGGSGYGSGG  GGGSGY  
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG
        VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG  GG   G
Subjt:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG

XP_023005512.1 glycine-rich cell wall structural protein 1.8-like [Cucurbita maxima] (e value: 6.6e-33; % identity: 77.4)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGY
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGY  G GAGSALGGSGYGSGGG GGGSGY
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGY

Query:  GSVGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG
          VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG
Subjt:  GSVGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG

XP_023539508.1 glycine-rich cell wall structural protein 1.8-like [Cucurbita pepo subsp. pepo] (e value: 2.7e-34; % identity: 76.32)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGYG GAGSALGGSGYGSGGG GGGSGY  
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG
        VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG  GG   G
Subjt:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BJM7 glycine-rich cell wall structural protein 2-like (e value: 6.0e-24; % identity: 57.45)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYD-RPVHNPRVGYDPDHHDGPY-------GGYGGGAGGGY--GGGAGSALGGSGYGSGG
        MAIS+  S GFLLLV  GLASAAR+LL YD P        +YD  PV NP+VGY+ DHHDG Y         YGGGAGGGY  GGGAGS+LGGSGYGSGG
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYD-RPVHNPRVGYDPDHHDGPY-------GGYGGGAGGGY--GGGAGSALGGSGYGSGG

Query:  GEGGGSGYGSVGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGG--------HGSGYGGGRSMG
          GGGSGYG VG+HGVGYGS                       GGG+G GYGD+GGHGKGYGSGGGGG        HG GYG G   G
Subjt:  GEGGGSGYGSVGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGG--------HGSGYGGGRSMG

A0A6J1CW11 glycine-rich cell wall structural protein 2-like (e value: 6.4e-34; % identity: 67.26)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MA  R  SLGFLLLVG GLASAAR LL+YDPP  H  VGYDYDRPV NP+VGYDP+HHD PYG YGGGAGGGYG G GSALGGSGYGSGGG GGGSGYG 
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRS
          DHG GYGS                       GGG+G GYGDV G GKGYGSG GGG GSG G G S
Subjt:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRS

A0A6J1CWP3 glycine-rich cell wall structural protein 1.8-like isoform X1 (e value: 4.4e-27; % identity: 63.53)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MAIS+AFSL FLLL+GFGLASAARTLL ++P A +P     YDR     RVGYD DHHD   G YGGG+GGGYG GAGS+LGGSGYGSGGG GGGSGYG 
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG
         GDHGVGYGS                       GGG+G GYGD GG GKGYGSGGGG  GSGYGGG   G
Subjt:  VGDHGVGYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG

A0A6J1E911 glycine-rich cell wall structural protein 1.8-like (e value: 1.2e-32; % identity: 75)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGYG GAGS+LGGSGYGSGG  GGGSGY  
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGS

Query:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG
        VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG  GG   G
Subjt:  VGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG--GGRSMG

A0A6J1KTC0 glycine-rich cell wall structural protein 1.8-like (e value: 3.2e-33; % identity: 77.4)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGY
        MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDGPYG YGGG+GGGY  G GAGSALGGSGYGSGGG GGGSGY
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGY

Query:  GSVGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG
          VGD  G GYGSGGG GS  GYGD+GGHGKGYGSGGGGG G GYG
Subjt:  GSVGD-HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYG

SwissProt top hits (columns: e value, % identity, alignment)
A3C5A7 Glycine-rich cell wall structural protein 2 (e value: 3.9e-04; % identity: 43.95)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALG-------GSGYGSGGGEG
        MA ++  +L  L+L+  G+ ++ARTLL Y P       G            GY      G  GG GG AGGGYG G G   G       GSGYGSG G G
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALG-------GSGYGSGGGEG

Query:  GGSGYGSVGDHGVGYGSGGGAG---SGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG
         G+G G  G +G G G GGG G    GYG   G+G GYGSG GG HG GYG G   G
Subjt:  GGSGYGSVGDHGVGYGSGGGAG---SGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG

P0C5C7 Glycine-rich cell wall structural protein 2 (e value: 3.9e-04; % identity: 43.95)
Query:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALG-------GSGYGSGGGEG
        MA ++  +L  L+L+  G+ ++ARTLL Y P       G            GY      G  GG GG AGGGYG G G   G       GSGYGSG G G
Subjt:  MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALG-------GSGYGSGGGEG

Query:  GGSGYGSVGDHGVGYGSGGGAG---SGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG
         G+G G  G +G G G GGG G    GYG   G+G GYGSG GG HG GYG G   G
Subjt:  GGSGYGSVGDHGVGYGSGGGAG---SGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMG

Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G46730.2 glycine-rich protein (e value: 8.9e-04; % identity: 63.75)
Query:  GPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSGGGAGSGYGDVGGHGKGYGSGGGGGHGSG
        G YGG GG  GGG GG AG A GGSGYGSGGGEGGG G G+ G +G G G G G G  YG   G G G G GGGGGHG G
Subjt:  GPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSGGGAGSGYGDVGGHGKGYGSGGGGGHGSG


Sequences
CDS sequence
ATGCATTGTAAGGCATTCCTTCCCATGGCTATCTCTAGAGCTTTCTCCCTTGGTTTTCTTCTGCTGGTGGGTTTTGGCTTAGCTTCTGCTGCCAGAACCCTTCTCAGTTA
TGATCCTCCTGCGCGTCACCCAGTGGTAGGATATGATTACGATCGTCCTGTGCATAACCCGAGAGTAGGGTATGATCCTGACCATCATGATGGACCTTATGGTGGATATG
GTGGTGGTGCTGGTGGAGGATATGGGGGCGGAGCTGGCTCTGCTCTTGGAGGATCGGGATATGGAAGTGGTGGCGGGGAAGGAGGTGGTTCTGGATATGGAAGTGTAGGA
GATCACGGGGTTGGTTATGGTAGTGGTGGAGGGGCTGGTTCTGGATATGGAGATGTAGGAGGGCATGGAAAAGGCTATGGTAGCGGTGGTGGTGGAGGACACGGGAGTGG
ATACGGAGGCGGGCGGAGCATGGGGTTGGTTATGGTAGTGGTGGAGGTGGAGGATATGGAAGTGGAGGCGGCACTGGATATGGCCCAGGAGGAGAGCATGGGGTTGGCTA
TGGAAGTGGAGGAGGAGCTGGGTCTGGTAGTGGTTACGGTGGTGGAGCTAAAGGATATGGAGGAGGAAGCGGTGGTGGAAAAGGTGGTGGTGGTGGAGCAGGTTACGGTC
CTGGAGGAGGACATGGAAGCGGATATGGTGGTGGTGAAGGAGCAGGAAGTGGCTATGGCGGCGGTGATGGAGGATATGATGGTGGATATGCACCTTAAAAAATATCATCT
CAAAGCTTATCTCTTGTACGAAAAATGA
mRNA sequence
ATGCATTGTAAGGCATTCCTTCCCATGGCTATCTCTAGAGCTTTCTCCCTTGGTTTTCTTCTGCTGGTGGGTTTTGGCTTAGCTTCTGCTGCCAGAACCCTTCTCAGTTA
TGATCCTCCTGCGCGTCACCCAGTGGTAGGATATGATTACGATCGTCCTGTGCATAACCCGAGAGTAGGGTATGATCCTGACCATCATGATGGACCTTATGGTGGATATG
GTGGTGGTGCTGGTGGAGGATATGGGGGCGGAGCTGGCTCTGCTCTTGGAGGATCGGGATATGGAAGTGGTGGCGGGGAAGGAGGTGGTTCTGGATATGGAAGTGTAGGA
GATCACGGGGTTGGTTATGGTAGTGGTGGAGGGGCTGGTTCTGGATATGGAGATGTAGGAGGGCATGGAAAAGGCTATGGTAGCGGTGGTGGTGGAGGACACGGGAGTGG
ATACGGAGGCGGGCGGAGCATGGGGTTGGTTATGGTAGTGGTGGAGGTGGAGGATATGGAAGTGGAGGCGGCACTGGATATGGCCCAGGAGGAGAGCATGGGGTTGGCTA
TGGAAGTGGAGGAGGAGCTGGGTCTGGTAGTGGTTACGGTGGTGGAGCTAAAGGATATGGAGGAGGAAGCGGTGGTGGAAAAGGTGGTGGTGGTGGAGCAGGTTACGGTC
CTGGAGGAGGACATGGAAGCGGATATGGTGGTGGTGAAGGAGCAGGAAGTGGCTATGGCGGCGGTGATGGAGGATATGATGGTGGATATGCACCTTAAAAAATATCATCT
CAAAGCTTATCTCTTGTACGAAAAATGA
Protein sequence
MHCKAFLPMAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVG
DHGVGYGSGGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGGRSMGLVMVVVEVEDMEVEAALDMAQEESMGLAMEVEEELGLVVVTVVELKDMEEEAVVEKVVVVEQVTV
LEEDMEADMVVVKEQEVAMAAVMEDMMVDMHLKKYHLKAYLLYEK
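
The record's coordinates and sequences can be cross-checked with a few lines of Python. The sketch below is not part of CuGenDBv2. The helper names `span_length` and `translate` are hypothetical, and it assumes the standard genetic code and that the genome location uses 1-based, inclusive coordinates. It computes the span implied by the genome location and translates the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location):
    """Length in bp of a 'contig:start..end' location (assumed 1-based, inclusive)."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(span_length("tig00153652:6451..7248"))         # 798
print(translate("ATGCATTGTAAGGCATTCCTTCCCATGGCT"))   # MHCKAFLPMA
```

A span of 798 bp corresponds to 266 codons. Passing the full CDS string to `translate` in place of the ten-codon prefix gives a translation that can be compared against the protein sequence above.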