; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr006789 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr006789
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00004961:51360..51931
RNA-Seq ExpressionSgr006789
SyntenySgr006789
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600851.1 hypothetical protein SDJN03_06084, partial [Cucurbita argyrosperma subsp. sororia]4.3e-4866.28Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSN-CAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALI
        M+NHL+ S ILHS SLP  L PFSG+ NT+ TKI  + +       N R R  R   N CAARRRVR D     DE++GHN+QIALLESYTQAA GEALI
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSN-CAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALI

Query:  VHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNFDP
        VHAA+DG+ VE     GFSSCLSYATS DPSRSVLPARA I+SIDRIKGPFDPSNIEY+Q+ I WESFNF P
Subjt:  VHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNFDP

XP_008452414.1 PREDICTED: uncharacterized protein LOC103493452 [Cucumis melo]1.9e-4867.86Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH
        M+ +L ++ ILHS SL N L PF  NPNTSLT IP +  I K PN +T +   G   CAARRRVR D     DED+GHNDQIALLESYTQAATGEALIVH
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH

Query:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        A VDGE VE     GFSSCLSY TSPDPSRSV+P RA IKSIDRIKGPFDPSNI+Y++K I W SFNF
Subjt:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

XP_022146835.1 uncharacterized protein LOC111015943 [Momordica charantia]2.5e-4870.24Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNP--NTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVR----SDDEDHGHNDQIALLESYTQAATGEALIVH
        M+N LR+SLI  S SLP    PFSGNP  NTSL  I L+SA+ +  NLR + LR   +C ARRRVR     +DED+GHN+Q+A LESYTQAA GEALIVH
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNP--NTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVR----SDDEDHGHNDQIALLESYTQAATGEALIVH

Query:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        AAV GE VE     GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYL+K I WESFNF
Subjt:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

XP_022985163.1 uncharacterized protein LOC111483247 [Cucurbita maxima]7.4e-4865.09Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIV
        M+NHL+ S ILHS SLP  L PFSG+ NT+ TKI  + +       N R R  +  ++CAARRRVR D     DE++GHN+QI+LLESYTQAA GEALIV
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIV

Query:  HAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        HAA+DG+ VE     GFSSCLSYATS DPSRSVLPARA I+SIDRIKGPFDPSNIEY+QK + WESFNF
Subjt:  HAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

XP_038892698.1 uncharacterized protein LOC120081685 [Benincasa hispida]4.6e-5070.06Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVHA
        M+ HL++  ILH       L PF  NPNTSLTKI L+  I K  N RT L RG + CAARRRVR D     DED+GHNDQIALLESYTQA  GEALIVHA
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVHA

Query:  AVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
         VDG+ VE     GFSSCLSY TSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYL+K I WESFNF
Subjt:  AVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

TrEMBL top hitse value%identityAlignment
A0A1S3BTQ8 uncharacterized protein LOC1034934529.4e-4967.86Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH
        M+ +L ++ ILHS SL N L PF  NPNTSLT IP +  I K PN +T +   G   CAARRRVR D     DED+GHNDQIALLESYTQAATGEALIVH
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH

Query:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        A VDGE VE     GFSSCLSY TSPDPSRSV+P RA IKSIDRIKGPFDPSNI+Y++K I W SFNF
Subjt:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

A0A5A7UTE4 Uncharacterized protein9.4e-4967.86Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH
        M+ +L ++ ILHS SL N L PF  NPNTSLT IP +  I K PN +T +   G   CAARRRVR D     DED+GHNDQIALLESYTQAATGEALIVH
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAI-KFPNLRTRLL-RGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIVH

Query:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        A VDGE VE     GFSSCLSY TSPDPSRSV+P RA IKSIDRIKGPFDPSNI+Y++K I W SFNF
Subjt:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

A0A6J1CYG3 uncharacterized protein LOC1110159431.2e-4870.24Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNP--NTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVR----SDDEDHGHNDQIALLESYTQAATGEALIVH
        M+N LR+SLI  S SLP    PFSGNP  NTSL  I L+SA+ +  NLR + LR   +C ARRRVR     +DED+GHN+Q+A LESYTQAA GEALIVH
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNP--NTSLTKIPLASAI-KFPNLRTRLLRGGSNCAARRRVR----SDDEDHGHNDQIALLESYTQAATGEALIVH

Query:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        AAV GE VE     GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYL+K I WESFNF
Subjt:  AAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

A0A6J1FWQ3 uncharacterized protein LOC1114476342.0e-4665.29Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSN-CAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALI
        M+NHL+ S ILHS SLP  L PFSG+ NT+ TKI  + +       N R R      N CAARRRVR D     DE++GHN+QIALLESYTQAA GEALI
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSN-CAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALI

Query:  VHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        VHAA+DG+ VE     GFSSCLS+ATS DPSRSVLPARA I+SIDRIKGPFDPSNIEY+Q+ I WESFNF
Subjt:  VHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

A0A6J1JCS8 uncharacterized protein LOC1114832473.6e-4865.09Show/hide
Query:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIV
        M+NHL+ S ILHS SLP  L PFSG+ NT+ TKI  + +       N R R  +  ++CAARRRVR D     DE++GHN+QI+LLESYTQAA GEALIV
Subjt:  MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFP---NLRTRLLRGGSNCAARRRVRSD-----DEDHGHNDQIALLESYTQAATGEALIV

Query:  HAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF
        HAA+DG+ VE     GFSSCLSYATS DPSRSVLPARA I+SIDRIKGPFDPSNIEY+QK + WESFNF
Subjt:  HAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01755.1 unknown protein1.1e-2554.55Show/hide
Query:  CAARRRVR------SDDEDHGHNDQIALLESYTQAATGEALIVHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEY
        C ARRRVR       +DE +G+N+++A+LE Y+Q+   EALIV A VD E+VE     G SSCLS  T+ DP+RSVLP RAVI  IDR++GPFDPS I Y
Subjt:  CAARRRVR------SDDEDHGHNDQIALLESYTQAATGEALIVHAAVDGEQVE-----GFSSCLSYATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEY

Query:  LQKDIAWESF
        +Q+DI++++F
Subjt:  LQKDIAWESF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACCACCTCCGAAGATCCCTAATCCTTCACTCCGGTTCCCTCCCTAATTTTCTCTCGCCATTTTCCGGCAATCCAAACACATCTCTCACCAAAATCCCTCTCGC
CTCCGCAATCAAATTTCCGAATCTCAGAACTCGACTCCTCCGAGGCGGTAGCAATTGCGCCGCGAGGCGGAGAGTACGATCCGACGACGAAGATCACGGCCACAACGACC
AGATCGCGCTGCTGGAATCGTACACTCAGGCTGCCACCGGCGAGGCGCTCATCGTTCACGCGGCGGTCGACGGCGAGCAAGTTGAAGGATTCTCGTCTTGCTTGAGCTAT
GCGACTTCGCCTGATCCGTCGAGAAGCGTTCTTCCGGCTAGAGCGGTGATAAAATCCATTGATAGAATCAAAGGGCCTTTCGATCCATCGAATATTGAATATCTTCAGAA
GGACATCGCTTGGGAATCGTTTAACTTCGACCCCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACCACCTCCGAAGATCCCTAATCCTTCACTCCGGTTCCCTCCCTAATTTTCTCTCGCCATTTTCCGGCAATCCAAACACATCTCTCACCAAAATCCCTCTCGC
CTCCGCAATCAAATTTCCGAATCTCAGAACTCGACTCCTCCGAGGCGGTAGCAATTGCGCCGCGAGGCGGAGAGTACGATCCGACGACGAAGATCACGGCCACAACGACC
AGATCGCGCTGCTGGAATCGTACACTCAGGCTGCCACCGGCGAGGCGCTCATCGTTCACGCGGCGGTCGACGGCGAGCAAGTTGAAGGATTCTCGTCTTGCTTGAGCTAT
GCGACTTCGCCTGATCCGTCGAGAAGCGTTCTTCCGGCTAGAGCGGTGATAAAATCCATTGATAGAATCAAAGGGCCTTTCGATCCATCGAATATTGAATATCTTCAGAA
GGACATCGCTTGGGAATCGTTTAACTTCGACCCCAAATAA
Protein sequenceShow/hide protein sequence
MSNHLRRSLILHSGSLPNFLSPFSGNPNTSLTKIPLASAIKFPNLRTRLLRGGSNCAARRRVRSDDEDHGHNDQIALLESYTQAATGEALIVHAAVDGEQVEGFSSCLSY
ATSPDPSRSVLPARAVIKSIDRIKGPFDPSNIEYLQKDIAWESFNFDPK