; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029498 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029498
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF538
Genome locationtig00153403:1400263..1401628
RNA-Seq ExpressionSgr029498
SyntenySgr029498
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646343.1 hypothetical protein Csa_016699 [Cucumis sativus]3.4e-5380.3Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  ASPS ++G+A SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG
        EVVR+GDDLEFS+G+ATASFPV+NFSECP  G
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG

XP_004136167.1 uncharacterized protein LOC101222381 [Cucumis sativus]3.4e-5380.3Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  ASPS ++G+A SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG
        EVVR+GDDLEFS+G+ATASFPV+NFSECP  G
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG

XP_008451557.1 PREDICTED: uncharacterized protein LOC103492802 [Cucumis melo]1.5e-5380Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  A PS S+GEA SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL
        EVVR+GDDLEFS+G+ATASFPV+NFSECP   CG+
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL

XP_022151830.1 uncharacterized protein LOC111019713 [Momordica charantia]2.1e-5886.26Show/hide
Query:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE
        MSPSAA PSA +GE+ SVYDILREFNFPIGL+PEGA+GC LDRATGK EAYL G+C F PDE+Y+LKYKSTISGQISRNRLTDLKGVSVK MFFWVNIVE
Subjt:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE

Query:  VVRDGDDLEFSVGIATASFPVENFSECPQCG
        VVR GDDLEFSVG+ATASFPVENFSECPQCG
Subjt:  VVRDGDDLEFSVGIATASFPVENFSECPQCG

XP_038896189.1 uncharacterized protein LOC120084473 [Benincasa hispida]3.1e-5482.58Show/hide
Query:  MSPSA-ASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  ASPS SD EALSVYDILREFNFPIGL+PEG +G  LDR TGK EAYL GSCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPSA-ASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG
        EVVR+GDDL+FSVG+ATASFPV+NFSECP  G
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG

TrEMBL top hitse value%identityAlignment
A0A0A0K643 Uncharacterized protein1.7e-5380.3Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  ASPS ++G+A SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG
        EVVR+GDDLEFS+G+ATASFPV+NFSECP  G
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECPQCG

A0A1S3BR55 uncharacterized protein LOC1034928027.5e-5480Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  A PS S+GEA SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL
        EVVR+GDDLEFS+G+ATASFPV+NFSECP   CG+
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL

A0A5A7VQJ7 Uncharacterized protein7.5e-5480Show/hide
Query:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV
        MSP+  A PS S+GEA SVYDILREFNFPIGL+PEG +GC LDR TGK EAYL  SCHFSPDE YELKYKSTISG ISRNRLT+LKGVSVK MFFWVNIV
Subjt:  MSPS-AASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIV

Query:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL
        EVVR+GDDLEFS+G+ATASFPV+NFSECP   CG+
Subjt:  EVVRDGDDLEFSVGIATASFPVENFSECP--QCGL

A0A6J1DCA2 uncharacterized protein LOC1110197131.0e-5886.26Show/hide
Query:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE
        MSPSAA PSA +GE+ SVYDILREFNFPIGL+PEGA+GC LDRATGK EAYL G+C F PDE+Y+LKYKSTISGQISRNRLTDLKGVSVK MFFWVNIVE
Subjt:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE

Query:  VVRDGDDLEFSVGIATASFPVENFSECPQCG
        VVR GDDLEFSVG+ATASFPVENFSECPQCG
Subjt:  VVRDGDDLEFSVGIATASFPVENFSECPQCG

A0A6J1KX02 uncharacterized protein LOC1114979011.2e-5177.95Show/hide
Query:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE
        MS +AASP+ ++GEA SVYDILREFNFPIGL+PEG +GC LDR TGK EAYLN +C FSP++ YEL+YK+TISGQIS+NRLTDLKGV+VK MFFWVNIVE
Subjt:  MSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVE

Query:  VVRDGDDLEFSVGIATASFPVENFSEC
        VVR+GDDL FSVG+ATASFPV+NFSEC
Subjt:  VVRDGDDLEFSVGIATASFPVENFSEC

SwissProt top hitse value%identityAlignment
Q9M015 Uncharacterized protein At5g016102.9e-1028.42Show/hide
Query:  DILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVEVVRDGDDLEFSVGI
        ++L+E++ PIG+ P  A   + D  T K    +   C     +S  LK+ +T++G + + +LTD++G+  K+M  WV +  +  D   + F+ G+
Subjt:  DILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFWVNIVEVVRDGDDLEFSVGI

Arabidopsis top hitse value%identityAlignment
AT1G02813.1 Protein of unknown function, DUF5384.4e-3047.45Show/hide
Query:  AFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFF
        + F+IF+   + S S S  +  SVY +L  +  P G++PEG    DL+R TG F+   N +C FS D SY++KYK  ISG I+R R+  L GVSVK++FF
Subjt:  AFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFF

Query:  WVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG
        W+NI EV RDGDD+EF VG A+  F  + F + P+CG
Subjt:  WVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG

AT1G02816.1 Protein of unknown function, DUF5382.7e-4052.21Show/hide
Query:  FFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFW
        FF +F  PS    +A+D +  + Y +L+ +NFP+G++P+G +  DLD++TG+F AY N SC F+   SY+L YKSTISG IS N++T L GV VK++F W
Subjt:  FFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFFW

Query:  VNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG
        +NIVEV+R+GD+LEFSVGI +A+F ++ F E PQCG
Subjt:  VNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG

AT4G02360.1 Protein of unknown function, DUF5381.4e-3350.34Show/hide
Query:  MRLIAIATLAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDL
        M L A      FFL F    A S     G+  + YD ++ +N P G++P+G +  +L+  TG F+ Y N +C F+  +SY+LKYKSTISG IS   + +L
Subjt:  MRLIAIATLAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDL

Query:  KGVSVKIMFFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG
        KGVSVK++FFWVNI EV  DG DL+FSVGIA+ASFP  NF E PQCG
Subjt:  KGVSVKIMFFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG

AT4G02370.1 Protein of unknown function, DUF5387.4e-3851.08Show/hide
Query:  LAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIM
        L A  L   S +AA  +A++ +  + Y +L+ +NFP+G++P+G +  DLD  TGKF AY N SC F+   SY+L YKSTISG IS N+L  L GV VK++
Subjt:  LAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIM

Query:  FFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG
        F W+NIVEV+R+GD++EFSVGI +A+F ++ F E PQCG
Subjt:  FFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCG

AT5G19590.1 Protein of unknown function, DUF5383.4e-1430.34Show/hide
Query:  LIAIATLAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFS-PDESYELKYKSTISGQISRNRLTDLK
        L++I  L    L   +P    P+ +  E       L    FPIGL+P       L++ +G F  +LNG+C  + P ++Y   Y + ++G+IS+ ++ +L+
Subjt:  LIAIATLAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFS-PDESYELKYKSTISGQISRNRLTDLK

Query:  GVSVKIMFFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQC
        G+ V+  F   +I  +   GD+L F V   TA +P +NF E   C
Subjt:  GVSVKIMFFWVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCTCATAGCCATAGCCACCCTCGCCGCCTTCTTCCTCATCTTCATGTCGCCCTCCGCCGCATCGCCCTCCGCCAGCGACGGCGAAGCGCTGTCGGTGTACGACAT
CCTCCGGGAATTCAACTTCCCGATCGGCCTCATCCCAGAGGGTGCGTTGGGTTGCGATCTGGATCGAGCCACCGGAAAGTTCGAGGCTTATTTGAACGGATCTTGCCATT
TCTCGCCGGATGAATCTTACGAACTGAAATATAAATCCACCATTAGCGGGCAGATCTCGAGGAATCGGCTGACGGATCTGAAGGGGGTGAGCGTGAAGATCATGTTCTTC
TGGGTGAACATCGTGGAGGTGGTGAGAGACGGCGACGATCTGGAGTTCTCGGTGGGGATAGCTACGGCGTCGTTTCCGGTGGAAAATTTCTCCGAGTGCCCGCAATGTGG
ATTGAGTAGTGTACGCTTGCTGCCCTCAACTTGTCGTTGCTTGCAACCTGTTATCGTCATGTCGCCTAATCCCGCTTGTTGTTGTAAAGTCGCCTGCTACCGCTTGCTGT
CGTTGAACGTCGCTGTCTTTTGTCACCAAATGCCATTGCTGCTGTCTATCGTCGACGTCGAGCTTCACTGCATCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTCTCATAGCCATAGCCACCCTCGCCGCCTTCTTCCTCATCTTCATGTCGCCCTCCGCCGCATCGCCCTCCGCCAGCGACGGCGAAGCGCTGTCGGTGTACGACAT
CCTCCGGGAATTCAACTTCCCGATCGGCCTCATCCCAGAGGGTGCGTTGGGTTGCGATCTGGATCGAGCCACCGGAAAGTTCGAGGCTTATTTGAACGGATCTTGCCATT
TCTCGCCGGATGAATCTTACGAACTGAAATATAAATCCACCATTAGCGGGCAGATCTCGAGGAATCGGCTGACGGATCTGAAGGGGGTGAGCGTGAAGATCATGTTCTTC
TGGGTGAACATCGTGGAGGTGGTGAGAGACGGCGACGATCTGGAGTTCTCGGTGGGGATAGCTACGGCGTCGTTTCCGGTGGAAAATTTCTCCGAGTGCCCGCAATGTGG
ATTGAGTAGTGTACGCTTGCTGCCCTCAACTTGTCGTTGCTTGCAACCTGTTATCGTCATGTCGCCTAATCCCGCTTGTTGTTGTAAAGTCGCCTGCTACCGCTTGCTGT
CGTTGAACGTCGCTGTCTTTTGTCACCAAATGCCATTGCTGCTGTCTATCGTCGACGTCGAGCTTCACTGCATCTTTTGA
Protein sequenceShow/hide protein sequence
MRLIAIATLAAFFLIFMSPSAASPSASDGEALSVYDILREFNFPIGLIPEGALGCDLDRATGKFEAYLNGSCHFSPDESYELKYKSTISGQISRNRLTDLKGVSVKIMFF
WVNIVEVVRDGDDLEFSVGIATASFPVENFSECPQCGLSSVRLLPSTCRCLQPVIVMSPNPACCCKVACYRLLSLNVAVFCHQMPLLLSIVDVELHCIF