; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015761 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015761
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF538
Genome locationtig00005754:4318..4782
RNA-Seq ExpressionSgr015761
SyntenySgr015761
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155180.1 uncharacterized protein At5g01610-like [Momordica charantia]2.4e-6785.81Show/hide
Query:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LL AF+SFSNPLVSAQKALTAYD+LQQYGFPVGILPVG TGY+LDR TGEFSLYLNQKCRFSI+SY LEYKPT+KGVISQG+IR LKGVTVKV
Subjt:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        LLLWLNIVEV NDG DL FSVGIASANFPI+ FYESPQCGCGFDC K G LV+AS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_022953728.1 uncharacterized protein LOC111456173 [Cucurbita moschata]6.4e-6581.17Show/hide
Query:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_022991260.1 uncharacterized protein At5g01610-like [Cucurbita maxima]5.4e-6479.22Show/hide
Query:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGY+LDR TG FSL+L+QKC+F+IDSYELEYKPT+ GVISQG++ KLKGV+VK+L
Subjt:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_023549402.1 uncharacterized protein LOC111807766 [Cucurbita pepo subsp. pepo]1.9e-6480.52Show/hide
Query:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LW +IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_038898643.1 uncharacterized protein LOC120086187 [Benincasa hispida]3.8e-6581.29Show/hide
Query:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LL+AF+SFS+PLVSAQK LTAYD+LQQYGFPVGILPVG  GY+L+RATGEFSL+LNQKC+F I+SYELEYK TV+GVISQG+IRKLKGV+VK+
Subjt:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LLWL+IVEV NDGDDL FSVGIASANFP++SFYESP+CGCGFDCDK G+LVSAS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

TrEMBL top hitse value%identityAlignment
A0A6J1DLP9 uncharacterized protein At5g01610-like1.1e-6785.81Show/hide
Query:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LL AF+SFSNPLVSAQKALTAYD+LQQYGFPVGILPVG TGY+LDR TGEFSLYLNQKCRFSI+SY LEYKPT+KGVISQG+IR LKGVTVKV
Subjt:  MASL-LFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        LLLWLNIVEV NDG DL FSVGIASANFPI+ FYESPQCGCGFDC K G LV+AS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1GP52 uncharacterized protein LOC1114561733.1e-6581.17Show/hide
Query:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1H965 uncharacterized protein At5g01610-like1.6e-6178Show/hide
Query:  FLLIAFISFSN-PLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL
        FLL+AFIS S+ PL  AQK+L+AYD+LQQYGFPVGILPVG TGY+ ++ATGEFSL+LN+KCRF IDSYELEYKPTVKGVISQG I+ LKGV+VK+LL+W 
Subjt:  FLLIAFISFSN-PLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL

Query:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        NIVEV +D D+L FS+G+ASANFPINSFYESPQCGCGFDCDK+G+LVSAS
Subjt:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1JSE0 uncharacterized protein At5g01610-like2.6e-6479.22Show/hide
Query:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGY+LDR TG FSL+L+QKC+F+IDSYELEYKPT+ GVISQG++ KLKGV+VK+L
Subjt:  MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1KXK7 uncharacterized protein At5g01610-like7.9e-6177.33Show/hide
Query:  FLLIAFISF-SNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL
        FLL+AFIS  S+PL  AQK+L+AYD+LQQYGFPVGILPVG TGY+ ++ATGEFSL+LN+KCRF IDSYELEYKPTVKGVISQG I+ LKGV+VK+LL+W 
Subjt:  FLLIAFISF-SNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL

Query:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        NIVEV +D D+L FS+G+ASANFPINSFYESP CGCGFDCDK+G+LVSAS
Subjt:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02813.1 Protein of unknown function, DUF5381.6e-3449.65Show/hide
Query:  LLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLW
        ++FLL+   S S   VS QK  + Y VL+ Y  P GILP G   Y L+R TG F +  N  C+FSIDSY+++YKP + G+I++G++ +L GV+VKVL  W
Subjt:  LLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLW

Query:  LNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC
        +NI EV  DGDD+ F VG AS  F    F +SP+CGCGF+C
Subjt:  LNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT1G02816.1 Protein of unknown function, DUF5384.0e-4160.15Show/hide
Query:  NPLVSA---QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSID-SYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWLNIVEVDN
        +PL++A       TAY +LQ Y FPVGILP G   Y LD++TG+F  Y N+ C F++  SY+L+YK T+ G IS+ KI KL GV VKVL LWLNIVEV  
Subjt:  NPLVSA---QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSID-SYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWLNIVEVDN

Query:  DGDDLVFSVGIASANFPINSFYESPQCGCGFDC
        +GD+L FSVGI SANF I+ FYESPQCGCGFDC
Subjt:  DGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT4G02360.1 Protein of unknown function, DUF5381.7e-3955.94Show/hide
Query:  SLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLL
        S++F L   +SF+   VS QK  TAYD ++ Y  P GILP G   Y+L+  TG F +Y N  C F+I SY+L+YK T+ GVIS G ++ LKGV+VKVL  
Subjt:  SLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLL

Query:  WLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCD
        W+NI EV  DG DL FSVGIASA+FP  +F ESPQCGCGFDC+
Subjt:  WLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCD

AT4G02370.1 Protein of unknown function, DUF5385.5e-3854.36Show/hide
Query:  SLLFLLIAFISFSNPLVSA------QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFS-IDSYELEYKPTVKGVISQGKIRKLKGV
        S L +LIA   F + L +A          TAY +LQ Y FPVGILP G   Y LD  TG+F  Y N  C F+ + SY+L YK T+ G IS+ K++KL GV
Subjt:  SLLFLLIAFISFSNPLVSA------QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFS-IDSYELEYKPTVKGVISQGKIRKLKGV

Query:  TVKVLLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC
         VKVL LWLNIVEV  +GD++ FSVGI SANF I  F ESPQCGCGF+C
Subjt:  TVKVLLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT5G19590.1 Protein of unknown function, DUF5381.5e-1936.5Show/hide
Query:  LLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSI--DSYELEYKPTVKGVISQGKIRKLKGVTVKVLL
        L  L++  IS  +P    +K   A+  L  +GFP+G+LP+    Y L++ +G+FSL+LN  C+ ++  D+Y   Y   V G ISQGKI +L+G+ V+   
Subjt:  LLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSI--DSYELEYKPTVKGVISQGKIRKLKGVTVKVLL

Query:  LWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQC
           +I  + + GD+LVF V   +A +P  +F ES  C
Subjt:  LWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTCCTTTTTCTTCTTATCGCTTTCATCTCCTTCTCAAATCCCCTCGTCTCTGCTCAAAAAGCTCTCACCGCCTACGATGTTCTTCAACAGTACGGCTTCCC
CGTCGGTATTCTTCCCGTCGGAGCAACTGGCTACAAATTGGACAGAGCTACCGGTGAATTCAGTCTCTATTTGAATCAGAAGTGCAGATTCTCGATCGATTCGTACGAAT
TGGAGTACAAGCCTACCGTGAAGGGCGTGATTTCCCAGGGAAAAATCAGGAAATTGAAAGGCGTTACTGTTAAGGTGTTGTTGCTGTGGCTCAACATTGTGGAAGTCGAT
AACGACGGCGATGACCTCGTATTCTCTGTGGGAATCGCCTCCGCGAACTTTCCGATTAACAGTTTTTATGAGTCGCCGCAGTGTGGGTGTGGATTTGACTGCGACAAGCT
TGGGAACTTGGTGTCGGCTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTCCTTTTTCTTCTTATCGCTTTCATCTCCTTCTCAAATCCCCTCGTCTCTGCTCAAAAAGCTCTCACCGCCTACGATGTTCTTCAACAGTACGGCTTCCC
CGTCGGTATTCTTCCCGTCGGAGCAACTGGCTACAAATTGGACAGAGCTACCGGTGAATTCAGTCTCTATTTGAATCAGAAGTGCAGATTCTCGATCGATTCGTACGAAT
TGGAGTACAAGCCTACCGTGAAGGGCGTGATTTCCCAGGGAAAAATCAGGAAATTGAAAGGCGTTACTGTTAAGGTGTTGTTGCTGTGGCTCAACATTGTGGAAGTCGAT
AACGACGGCGATGACCTCGTATTCTCTGTGGGAATCGCCTCCGCGAACTTTCCGATTAACAGTTTTTATGAGTCGCCGCAGTGTGGGTGTGGATTTGACTGCGACAAGCT
TGGGAACTTGGTGTCGGCTTCTTAG
Protein sequenceShow/hide protein sequence
MASLLFLLIAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWLNIVEVD
NDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS