; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016653 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016653
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF538
Genome locationtig00152985:316217..316738
RNA-Seq ExpressionSgr016653
SyntenySgr016653
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588240.1 hypothetical protein SDJN03_16805, partial [Cucurbita argyrosperma subsp. sororia]3.7e-6174.85Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I+ Q H++ S+RD+          L STDIHELLPLYG PKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+V+YDK +KGKL Y
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGIKAN  S TIDFYVG LSETLPAQQFQKI  C R  C+GERTEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

KAG7022156.1 hypothetical protein SDJN02_15885, partial [Cucurbita argyrosperma subsp. argyrosperma]5.7e-6276.05Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I+ Q HL+ S+RD+          L STDIHELLPLYG PKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+V+YDK +KGKL Y
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGIKAN  S TIDFYVG LSETLPAQQFQKI AC R  C+GERTEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

XP_022153277.1 uncharacterized protein LOC111020803 [Momordica charantia]5.3e-6881.03Show/hide
Query:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKST-DIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKK
        MASFS TI  FS FL+IL I TQ HLS S  D KIE N PSFPLKST D+HELLP YG PKGLLP+NVKSYTLSDDGSFEIELESECYVKF L+V+YDKK
Subjt:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKST-DIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKK

Query:  IKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        IKGKLSYGSV D SGIQAKKLFLWVSVTGIKANP   TIDF+VGFLSETL AQQFQKI  CKRN CLGERTEA+
Subjt:  IKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

XP_022933750.1 uncharacterized protein LOC111441073 [Cucurbita moschata]2.8e-6176.05Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I  Q HL+ S RD+          L STDIHELLPLYG PKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+V+YDK +KGKL Y
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGIKAN  S TIDFYVG LSETLPAQQFQKI AC R  C+GERTEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

XP_023006181.1 uncharacterized protein LOC111498989 [Cucurbita maxima]2.2e-6175.45Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I+ Q H++ S RD+          L STDIHELLPLYG PKGLLPNNVKSYTLSDDGSFEIELESECYVKFDL+V+Y+K +KGKLSY
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGI+AN  S TIDFYVG LSETLPAQQFQKI AC R AC GE TEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

TrEMBL top hitse value%identityAlignment
A0A0A0M0C4 Uncharacterized protein6.2e-5463.58Show/hide
Query:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKI
        MASFST +SPFSLFL+IL  STQ HLSFS+RD         FPL+S+DIH+LLPLYG P GLLP+NV SYTLSDDG+FEI+L+S CYV F  +V+Y K I
Subjt:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKI

Query:  KGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        KGKLS  S++DVSGI+ KKLF W+ +TGIK  PDS++I+F VGFLSE LP   F+ I  C+R ACL  +TEAM
Subjt:  KGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

A0A6J1DIG2 uncharacterized protein LOC1110208032.6e-6881.03Show/hide
Query:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKST-DIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKK
        MASFS TI  FS FL+IL I TQ HLS S  D KIE N PSFPLKST D+HELLP YG PKGLLP+NVKSYTLSDDGSFEIELESECYVKF L+V+YDKK
Subjt:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKST-DIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKK

Query:  IKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        IKGKLSYGSV D SGIQAKKLFLWVSVTGIKANP   TIDF+VGFLSETL AQQFQKI  CKRN CLGERTEA+
Subjt:  IKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

A0A6J1F5P9 uncharacterized protein LOC1114410731.4e-6176.05Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I  Q HL+ S RD+          L STDIHELLPLYG PKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+V+YDK +KGKL Y
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGIKAN  S TIDFYVG LSETLPAQQFQKI AC R  C+GERTEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

A0A6J1H9I3 uncharacterized protein LOC1114613581.9e-5567.63Show/hide
Query:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKI
        MASFS  +SPFSLFL+IL ISTQ HLSFSS D          PL  TDIHELL  YG P GLLPNNVKSYTLSDDG+FEIEL+++CYV F  +V+Y+KKI
Subjt:  MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKI

Query:  KGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
         GKLSYGSVTDVSGIQ KKLFLW+SV+G K+N  S TI F+VG LSET PA+ F+ I  C+R  CLG RTEAM
Subjt:  KGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

A0A6J1KX25 uncharacterized protein LOC1114989891.1e-6175.45Show/hide
Query:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY
        ++SPFSL L+IL I+ Q H++ S RD+          L STDIHELLPLYG PKGLLPNNVKSYTLSDDGSFEIELESECYVKFDL+V+Y+K +KGKLSY
Subjt:  TISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSY

Query:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM
        GSV DVSGIQAKKLFLWVSVTGI+AN  S TIDFYVG LSETLPAQQFQKI AC R AC GE TEAM
Subjt:  GSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55265.1 Protein of unknown function, DUF5382.9e-4052.1Show/hide
Query:  STTISPFS-LFLIILPISTQIHL------SFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKF-DLMVFY
        S+ ++ FS LFL +L +S+ ++L        +  DL    N     L + DIH+LLP YG PKGLLPNNVKSYT+SDDG F ++L S CYVKF D +VFY
Subjt:  STTISPFS-LFLIILPISTQIHL------SFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKF-DLMVFY

Query:  DKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRN
         K I GKLSYGSV DV GIQAK+ FLW+ +T ++++P S T+ F VGF+S+TLPA  F+ + +C RN
Subjt:  DKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRN

AT1G61667.1 Protein of unknown function, DUF5386.4e-1935.11Show/hide
Query:  PLKSTDIHELLPLYGLPKGLLPNNVKSYTLSD-DGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKAN-PDSQTIDF
        P   + I  LL   GLP GL P+NV+SY+L D  G  E++L++ C+ +F+  V++D+ IK  LSYG +  + G+  ++LFLW+ V GI  N P S  + F
Subjt:  PLKSTDIHELLPLYGLPKGLLPNNVKSYTLSD-DGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKAN-PDSQTIDF

Query:  YVGFLSETLPAQQFQKIHACKRNACLGERTE
         +G   + +    F+    C     + E+ E
Subjt:  YVGFLSETLPAQQFQKIHACKRNACLGERTE

AT3G07460.1 Protein of unknown function, DUF5385.1e-1635.59Show/hide
Query:  KSTDIHELLPLYGLPKGLLPNNVKSYTLS-DDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKAN-PDSQTIDFYV
        ++  I E+L   GLP GL P  VK +T++ + G F + L   C  K++  + YD+ + G + Y  + D+SGI A++LFLW+ V GI+ + P S  I F V
Subjt:  KSTDIHELLPLYGLPKGLLPNNVKSYTLS-DDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSGIQAKKLFLWVSVTGIKAN-PDSQTIDFYV

Query:  GFLSETLPAQQFQKIHAC
        G L +      F+    C
Subjt:  GFLSETLPAQQFQKIHAC

AT5G19860.1 Protein of unknown function, DUF5385.6e-3143.31Show/hide
Query:  SLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTD
        S+F+  L + T    S +      EP+P S     + ++ELLP YGLP GLLP+ V  +TLSDDG F + L + C ++FD +V YDK I G++ YGS+T+
Subjt:  SLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTD

Query:  VSGIQAKKLFLWVSVTGIKAN-PDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLG
        + GIQ KK F+W+ V  IK + P S +I F VGF+++ L   QF+ IH+C  N   G
Subjt:  VSGIQAKKLFLWVSVTGIKAN-PDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLG

AT5G54530.1 Protein of unknown function, DUF5381.2e-2036.54Show/hide
Query:  LIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSG
        +I+L  + ++ LS SS         PS+P     +H++L   GLP GLLP  V SY L +DG  E+ L + CY KF+  V ++  ++G LSYGS+  V G
Subjt:  LIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVTDVSG

Query:  IQAKKLFLWVSVTGIKA-NPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGER
        +  K+LFLW+ V  I   NP+S  I F +G   + L    F+    CK +  L ++
Subjt:  IQAKKLFLWVSVTGIKA-NPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTTCTCCACAACCATCTCCCCGTTCTCACTATTCCTTATAATTCTCCCCATTTCTACTCAAATTCATCTCTCCTTTTCATCAAGGGATCTTAAAATCGAACC
GAACCCCCCATCTTTCCCCCTCAAGTCAACGGATATCCATGAGCTTCTGCCCCTATACGGTCTCCCAAAGGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAG
ATGATGGGAGCTTCGAAATCGAACTGGAAAGCGAGTGTTATGTGAAGTTCGACTTGATGGTCTTTTACGATAAGAAAATCAAGGGGAAGTTGAGCTACGGGTCTGTGACG
GATGTTTCTGGGATTCAAGCCAAGAAGCTCTTCTTGTGGGTCTCTGTTACTGGAATCAAGGCTAACCCAGATTCGCAAACCATCGATTTTTATGTTGGATTTTTGTCTGA
GACGTTGCCGGCGCAACAGTTCCAGAAAATTCATGCGTGTAAAAGGAACGCTTGCCTAGGAGAAAGGACAGAGGCCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTTCTCCACAACCATCTCCCCGTTCTCACTATTCCTTATAATTCTCCCCATTTCTACTCAAATTCATCTCTCCTTTTCATCAAGGGATCTTAAAATCGAACC
GAACCCCCCATCTTTCCCCCTCAAGTCAACGGATATCCATGAGCTTCTGCCCCTATACGGTCTCCCAAAGGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAG
ATGATGGGAGCTTCGAAATCGAACTGGAAAGCGAGTGTTATGTGAAGTTCGACTTGATGGTCTTTTACGATAAGAAAATCAAGGGGAAGTTGAGCTACGGGTCTGTGACG
GATGTTTCTGGGATTCAAGCCAAGAAGCTCTTCTTGTGGGTCTCTGTTACTGGAATCAAGGCTAACCCAGATTCGCAAACCATCGATTTTTATGTTGGATTTTTGTCTGA
GACGTTGCCGGCGCAACAGTTCCAGAAAATTCATGCGTGTAAAAGGAACGCTTGCCTAGGAGAAAGGACAGAGGCCATGTGA
Protein sequenceShow/hide protein sequence
MASFSTTISPFSLFLIILPISTQIHLSFSSRDLKIEPNPPSFPLKSTDIHELLPLYGLPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVFYDKKIKGKLSYGSVT
DVSGIQAKKLFLWVSVTGIKANPDSQTIDFYVGFLSETLPAQQFQKIHACKRNACLGERTEAM