; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029499 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029499
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF538
Genome locationtig00153403:1403110..1403574
RNA-Seq ExpressionSgr029499
SyntenySgr029499
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155180.1 uncharacterized protein At5g01610-like [Momordica charantia]4.8e-6886.45Show/hide
Query:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LLFAF+SFSNPLVSAQKALTAYD+LQQYGFPVGILPVG TGY+LDR TGEFSLYLNQKCRFSI+SY LEYKPT+KGVISQG+IR LKGVTVKV
Subjt:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        LLLWLNIVEV NDG DL FSVGIASANFPI+ FYESPQCGCGFDC K G LV+AS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_022953728.1 uncharacterized protein LOC111456173 [Cucurbita moschata]6.4e-6581.17Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_022991260.1 uncharacterized protein At5g01610-like [Cucurbita maxima]5.4e-6479.22Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGY+LDR TG FSL+L+QKC+F+IDSYELEYKPT+ GVISQG++ KLKGV+VK+L
Subjt:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_023549402.1 uncharacterized protein LOC111807766 [Cucurbita pepo subsp. pepo]1.9e-6480.52Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LW +IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

XP_038898643.1 uncharacterized protein LOC120086187 [Benincasa hispida]6.4e-6581.29Show/hide
Query:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LL AF+SFS+PLVSAQK LTAYD+LQQYGFPVGILPVG  GY+L+RATGEFSL+LNQKC+F I+SYELEYK TV+GVISQG+IRKLKGV+VK+
Subjt:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LLWL+IVEV NDGDDL FSVGIASANFP++SFYESP+CGCGFDCDK G+LVSAS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

TrEMBL top hitse value%identityAlignment
A0A6J1DLP9 uncharacterized protein At5g01610-like2.3e-6886.45Show/hide
Query:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV
        MASL L LLFAF+SFSNPLVSAQKALTAYD+LQQYGFPVGILPVG TGY+LDR TGEFSLYLNQKCRFSI+SY LEYKPT+KGVISQG+IR LKGVTVKV
Subjt:  MASL-LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKV

Query:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        LLLWLNIVEV NDG DL FSVGIASANFPI+ FYESPQCGCGFDC K G LV+AS
Subjt:  LLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1GP52 uncharacterized protein LOC1114561733.1e-6581.17Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGYKLDR TG FSL+L+QKC+F+IDSYELEYKPTV GVISQGK+ KLKGV+VK+L
Subjt:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1H965 uncharacterized protein At5g01610-like2.7e-6178Show/hide
Query:  FLLFAFISFSN-PLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL
        FLL AFIS S+ PL  AQK+L+AYD+LQQYGFPVGILPVG TGY+ ++ATGEFSL+LN+KCRF IDSYELEYKPTVKGVISQG I+ LKGV+VK+LL+W 
Subjt:  FLLFAFISFSN-PLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL

Query:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        NIVEV +D D+L FS+G+ASANFPINSFYESPQCGCGFDCDK+G+LVSAS
Subjt:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1JSE0 uncharacterized protein At5g01610-like2.6e-6479.22Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL
        MASLLFL  AF+SFS+PLVSAQKALTAYD++QQYGFPVGILP+G TGY+LDR TG FSL+L+QKC+F+IDSYELEYKPT+ GVISQG++ KLKGV+VK+L
Subjt:  MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        +LWL+IVEV +DGD+L FSVGIASANFPI+SFYESPQCGCGFDC+K+G+LVSAS
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

A0A6J1KXK7 uncharacterized protein At5g01610-like1.4e-6077.33Show/hide
Query:  FLLFAFISF-SNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL
        FLL AFIS  S+PL  AQK+L+AYD+LQQYGFPVGILPVG TGY+ ++ATGEFSL+LN+KCRF IDSYELEYKPTVKGVISQG I+ LKGV+VK+LL+W 
Subjt:  FLLFAFISF-SNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL

Query:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS
        NIVEV +D D+L FS+G+ASANFPINSFYESP CGCGFDCDK+G+LVSAS
Subjt:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02813.1 Protein of unknown function, DUF5384.3e-3547.86Show/hide
Query:  LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL
        +F++F  +  ++  VS QK  + Y VL+ Y  P GILP G   Y L+R TG F +  N  C+FSIDSY+++YKP + G+I++G++ +L GV+VKVL  W+
Subjt:  LFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWL

Query:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC
        NI EV  DGDD+ F VG AS  F    F +SP+CGCGF+C
Subjt:  NIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT1G02816.1 Protein of unknown function, DUF5385.3e-4157.64Show/hide
Query:  LFLLFAFISFSNPLVSA---QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSID-SYELEYKPTVKGVISQGKIRKLKGVTVKVL
        +F  F F    +PL++A       TAY +LQ Y FPVGILP G   Y LD++TG+F  Y N+ C F++  SY+L+YK T+ G IS+ KI KL GV VKVL
Subjt:  LFLLFAFISFSNPLVSA---QKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSID-SYELEYKPTVKGVISQGKIRKLKGVTVKVL

Query:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC
         LWLNIVEV  +GD+L FSVGI SANF I+ FYESPQCGCGFDC
Subjt:  LLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT4G02360.1 Protein of unknown function, DUF5384.5e-4056.64Show/hide
Query:  SLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLL
        S++F LF  +SF+   VS QK  TAYD ++ Y  P GILP G   Y+L+  TG F +Y N  C F+I SY+L+YK T+ GVIS G ++ LKGV+VKVL  
Subjt:  SLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLL

Query:  WLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCD
        W+NI EV  DG DL FSVGIASA+FP  +F ESPQCGCGFDC+
Subjt:  WLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDCD

AT4G02370.1 Protein of unknown function, DUF5383.2e-3855.1Show/hide
Query:  MASLLFLLFAFISFSNPLVSAQKA--LTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFS-IDSYELEYKPTVKGVISQGKIRKLKGVTV
        +AS LFL     S +  +V+A ++   TAY +LQ Y FPVGILP G   Y LD  TG+F  Y N  C F+ + SY+L YK T+ G IS+ K++KL GV V
Subjt:  MASLLFLLFAFISFSNPLVSAQKA--LTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFS-IDSYELEYKPTVKGVISQGKIRKLKGVTV

Query:  KVLLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC
        KVL LWLNIVEV  +GD++ FSVGI SANF I  F ESPQCGCGF+C
Subjt:  KVLLLWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQCGCGFDC

AT5G19590.1 Protein of unknown function, DUF5382.6e-1936.5Show/hide
Query:  LLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSI--DSYELEYKPTVKGVISQGKIRKLKGVTVKVLL
        L  L+   IS  +P    +K   A+  L  +GFP+G+LP+    Y L++ +G+FSL+LN  C+ ++  D+Y   Y   V G ISQGKI +L+G+ V+   
Subjt:  LLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSI--DSYELEYKPTVKGVISQGKIRKLKGVTVKVLL

Query:  LWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQC
           +I  + + GD+LVF V   +A +P  +F ES  C
Subjt:  LWLNIVEVDNDGDDLVFSVGIASANFPINSFYESPQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTCCTTTTTCTTCTTTTCGCTTTCATCTCCTTCTCAAATCCCCTCGTCTCTGCTCAAAAAGCTCTCACCGCCTACGATGTTCTTCAACAGTACGGCTTCCC
CGTCGGTATTCTTCCCGTCGGAGCAACTGGCTACAAATTGGACAGAGCTACCGGTGAATTCAGTCTCTATTTGAATCAGAAGTGCAGATTCTCGATCGATTCGTACGAAT
TGGAGTACAAGCCTACCGTGAAGGGCGTGATTTCCCAGGGAAAAATCAGGAAATTGAAAGGCGTTACTGTTAAGGTGTTGTTGCTGTGGCTCAACATTGTGGAAGTCGAT
AACGACGGCGATGACCTCGTATTCTCTGTGGGAATCGCCTCCGCGAACTTTCCGATTAACAGTTTTTATGAGTCGCCGCAGTGTGGGTGTGGATTTGACTGCGACAAGCT
TGGGAACTTGGTGTCGGCTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTCCTTTTTCTTCTTTTCGCTTTCATCTCCTTCTCAAATCCCCTCGTCTCTGCTCAAAAAGCTCTCACCGCCTACGATGTTCTTCAACAGTACGGCTTCCC
CGTCGGTATTCTTCCCGTCGGAGCAACTGGCTACAAATTGGACAGAGCTACCGGTGAATTCAGTCTCTATTTGAATCAGAAGTGCAGATTCTCGATCGATTCGTACGAAT
TGGAGTACAAGCCTACCGTGAAGGGCGTGATTTCCCAGGGAAAAATCAGGAAATTGAAAGGCGTTACTGTTAAGGTGTTGTTGCTGTGGCTCAACATTGTGGAAGTCGAT
AACGACGGCGATGACCTCGTATTCTCTGTGGGAATCGCCTCCGCGAACTTTCCGATTAACAGTTTTTATGAGTCGCCGCAGTGTGGGTGTGGATTTGACTGCGACAAGCT
TGGGAACTTGGTGTCGGCTTCTTAG
Protein sequenceShow/hide protein sequence
MASLLFLLFAFISFSNPLVSAQKALTAYDVLQQYGFPVGILPVGATGYKLDRATGEFSLYLNQKCRFSIDSYELEYKPTVKGVISQGKIRKLKGVTVKVLLLWLNIVEVD
NDGDDLVFSVGIASANFPINSFYESPQCGCGFDCDKLGNLVSAS