; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030408 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030408
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPlant protein of unknown function (DUF946)
Genome locationtig00153640:3344080..3344481
RNA-Seq ExpressionSgr030408
SyntenySgr030408
Gene Ontology termsNA
InterPro domainsIPR009291 - Vacuolar protein sorting-associated protein 62


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036561.1 hypothetical protein SDJN02_00180, partial [Cucurbita argyrosperma subsp. argyrosperma]7.4e-1754.55Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   +  LF  Y       P+ K L   V WFFSGGALL+DKS+ES  VPI+ DGSNLPQGG ND  FWL+LPAD+EA+ K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

XP_008461748.1 PREDICTED: uncharacterized protein LOC103500280 [Cucumis melo]1.1e-1755.68Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   + SL+  Y       P+ K L   V+WFFSGGALLYDKS+ES  VPI+ DGSNLPQGGSND  FWLNLP D+E + K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

XP_022948317.1 uncharacterized protein LOC111452030 [Cucurbita moschata]4.3e-1755.68Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   +  LF  Y       P+ K L   V WFFSGGALL+DKSDES  VPI+ DGSNLPQGG ND  FWL+LPAD+EA  K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

XP_023523507.1 uncharacterized protein LOC111787708 [Cucurbita pepo subsp. pepo]1.1e-1756.82Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   +  LF  Y       P+ K L   V WFFSGGALL+DKSDESK VPI+ DGSNLPQGG ND  FWL+LPAD+EA+ K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

XP_038903158.1 uncharacterized protein LOC120089826 [Benincasa hispida]1.8e-1857.95Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   + SLF  Y       P+ K L   VNWFFSGGALLYDKS+ES  VPI+ DGSNLPQGGSND  FWLNLP D+E + K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

TrEMBL top hitse value%identityAlignment
A0A0A0LEZ5 Uncharacterized protein1.8e-1653.41Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   + SL+  Y       P+ K L   V+WFFS GALLYDKS+ES  VPI+ DG NLPQGGSND  FWLNLP D+E + K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

A0A1S3CFF1 uncharacterized protein LOC1035002805.5e-1855.68Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   + SL+  Y       P+ K L   V+WFFSGGALLYDKS+ES  VPI+ DGSNLPQGGSND  FWLNLP D+E + K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

A0A5A7UCT0 DUF946 domain-containing protein5.5e-1855.68Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   + SL+  Y       P+ K L   V+WFFSGGALLYDKS+ES  VPI+ DGSNLPQGGSND  FWLNLP D+E + K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

A0A6J1DIT8 uncharacterized protein LOC1110209247.9e-1753.41Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S++   PD   V+SLF  Y       P+ K L   VNWFFSGGALLYDKSDE   + I+ DG+NLPQGG ND  FWLNLPA +E + +
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

A0A6J1G8U9 uncharacterized protein LOC1114520302.1e-1755.68Show/hide
Query:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        S +   PD   +  LF  Y       P+ K L   V WFFSGGALL+DKSDES  VPI+ DGSNLPQGG ND  FWL+LPAD+EA  K
Subjt:  SHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44230.1 Plant protein of unknown function (DUF946)3.0e-1645.65Show/hide
Query:  FDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        FDF     C P    + +LF  Y          K L   VNWFFS GALLY K DES  VP++ +G NLPQG  ND L+WL+LP   +AR++
Subjt:  FDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK

AT2G44260.1 Plant protein of unknown function (DUF946)5.1e-1633.86Show/hide
Query:  PTTEGSRRR-ESPSEHSWHCHRPLLRRVALFEELEFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSD
        PTT G++         +W         ++  +  + DF +     P+   +  LF  +       P  + L   V W+F+ GALLY K +ESK +PI+S+
Subjt:  PTTEGSRRR-ESPSEHSWHCHRPLLRRVALFEELEFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSD

Query:  GSNLPQGGSNDRLFWLNLPADDEARRK
        GSNLPQGGSND  +WL+LP D   + +
Subjt:  GSNLPQGGSNDRLFWLNLPADDEARRK

AT2G44260.2 Plant protein of unknown function (DUF946)5.1e-1633.86Show/hide
Query:  PTTEGSRRR-ESPSEHSWHCHRPLLRRVALFEELEFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSD
        PTT G++         +W         ++  +  + DF +     P+   +  LF  +       P  + L   V W+F+ GALLY K +ESK +PI+S+
Subjt:  PTTEGSRRR-ESPSEHSWHCHRPLLRRVALFEELEFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSD

Query:  GSNLPQGGSNDRLFWLNLPADDEARRK
        GSNLPQGGSND  +WL+LP D   + +
Subjt:  GSNLPQGGSNDRLFWLNLPADDEARRK

AT3G01870.1 Plant protein of unknown function (DUF946)8.7e-1646.15Show/hide
Query:  EFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEAR
        +FD  S     P     + LF  Y     L P    ++  V+WFFS GALL+ K +ES  VP+  DGSNLPQGGS+D LFWL+ PAD  A+
Subjt:  EFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEAR

AT3G01880.1 Plant protein of unknown function (DUF946)2.4e-1345.16Show/hide
Query:  EFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK
        +FD  S     P       LF  Y     L P+   L   VNW F+ GALL+ K +ES  VPI  +GSNLPQGG ND LFWL+   D +AR K
Subjt:  EFDFCSHAGCFPDSFSVSSLFSDY----ILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNLPQGGSNDRLFWLNLPADDEARRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAACGGCTTTAACGTGTACAATTCCAGACCCAACAACAGAGGGATCACGGCGACGGGAGTCGCCGTCGGAGCATTCGTGGCACTGCCATCGGCCACTTCTCCGCCG
CGTTGCTCTGTTTGAAGAACTCGAGTTCGATTTCTGCAGCCATGCCGGATGTTTCCCAGATAGCTTCTCTGTTTCAAGCTTATTCTCCGATTATATACTTTCACCCCAAA
GAAAAGTACTTACCGTCGTCGTGAACTGGTTTTTCTCCGGCGGGGCTCTGCTGTACGACAAATCCGACGAATCGAAGCTGGTTCCGATTGACTCCGACGGCTCGAACCTT
CCTCAAGGAGGCTCAAACGACCGTCTCTTCTGGCTAAATCTTCCTGCCGATGACGAAGCCAGGAGAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATAACGGCTTTAACGTGTACAATTCCAGACCCAACAACAGAGGGATCACGGCGACGGGAGTCGCCGTCGGAGCATTCGTGGCACTGCCATCGGCCACTTCTCCGCCG
CGTTGCTCTGTTTGAAGAACTCGAGTTCGATTTCTGCAGCCATGCCGGATGTTTCCCAGATAGCTTCTCTGTTTCAAGCTTATTCTCCGATTATATACTTTCACCCCAAA
GAAAAGTACTTACCGTCGTCGTGAACTGGTTTTTCTCCGGCGGGGCTCTGCTGTACGACAAATCCGACGAATCGAAGCTGGTTCCGATTGACTCCGACGGCTCGAACCTT
CCTCAAGGAGGCTCAAACGACCGTCTCTTCTGGCTAAATCTTCCTGCCGATGACGAAGCCAGGAGAAAGTGA
Protein sequenceShow/hide protein sequence
MITALTCTIPDPTTEGSRRRESPSEHSWHCHRPLLRRVALFEELEFDFCSHAGCFPDSFSVSSLFSDYILSPQRKVLTVVVNWFFSGGALLYDKSDESKLVPIDSDGSNL
PQGGSNDRLFWLNLPADDEARRK