; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018643 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018643
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153206:1360451..1361139
RNA-Seq ExpressionSgr018643
SyntenySgr018643
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599936.1 hypothetical protein SDJN03_05169, partial [Cucurbita argyrosperma subsp. sororia]1.6e-3373.33Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P S   PPW       RR A A RCVS GGWG SVAELEREL+      EGEEWLKLGRL++KCG GGKG+VELLE LEREAIM EDEGRDPTDY+RRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAAS
        IFSTSSRVFQALKQHSD  S
Subjt:  IFSTSSRVFQALKQHSDAAS

KAG7030615.1 hypothetical protein SDJN02_04652, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-3473.33Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P S   PPW       RR A A RCVS GGWG SV ELEREL+      EGEEWLKLGRL++KCG GGKG+VELLE LEREAIMGEDEGRDPTDY+RRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAAS
        IFSTSSRVFQALKQHSD  S
Subjt:  IFSTSSRVFQALKQHSDAAS

XP_022146811.1 uncharacterized protein LOC111015926 [Momordica charantia]5.5e-3763.7Show/hide
Query:  LSLHKIFYIRTPCIPPPRNSLQT-----PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGGGKGVVEL
        + +  +F +R P  PPP +   +     PTSA  P W       RR   A+RCVSQGGWGS AELERE+AA     EGEEWLKLGRLK+KCGGGKGVVEL
Subjt:  LSLHKIFYIRTPCIPPPRNSLQT-----PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGGGKGVVEL

Query:  LECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS
        LECLE EAIMGEDEGRDP DY+RRAKIFSTSS+VFQALKQ +   S
Subjt:  LECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS

XP_022941969.1 uncharacterized protein LOC111447176 [Cucurbita moschata]3.0e-3575Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P S   PPW       RR A A RCVSQGGWG SVAELEREL+      EGEEWLKLGRL++KCG GGKG+VELLE LEREAIMGEDEGRDPTDY+RRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAAS
        IFSTSSRVFQALKQHSD  S
Subjt:  IFSTSSRVFQALKQHSDAAS

XP_023541783.1 uncharacterized protein LOC111801831 [Cucurbita pepo subsp. pepo]2.2e-3371.67Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P S   PPW       RR   A RCVSQGGWG SVAELERE +      EGEEWLKLGRL++KCG GGKG+VELLE LEREAIMGEDEGRDPT+Y+RRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAAS
        IFSTSSRVFQALK+HSD  S
Subjt:  IFSTSSRVFQALKQHSDAAS

TrEMBL top hitse value%identityAlignment
A0A0A0KQQ4 Uncharacterized protein5.4e-3054.88Show/hide
Query:  LQLTRQAVAPPLSLSLHKIFYIRTPCIPPPRNSLQTPTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKC-
        +QL+   ++PP    LH+     T   P  R     PT +   PW  +      A    RCVSQGGWGS   +      EV +   EEWLKLGRL++KC 
Subjt:  LQLTRQAVAPPLSLSLHKIFYIRTPCIPPPRNSLQTPTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKC-

Query:  GGGKGVVELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS------HHS
        GGGKG+VELLECLE+EAIMGEDEGRDPTDYNRRAKIFSTSS VFQALKQHSDA +      HHS
Subjt:  GGGKGVVELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS------HHS

A0A6J1D0H7 uncharacterized protein LOC1110159262.7e-3763.7Show/hide
Query:  LSLHKIFYIRTPCIPPPRNSLQT-----PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGGGKGVVEL
        + +  +F +R P  PPP +   +     PTSA  P W       RR   A+RCVSQGGWGS AELERE+AA     EGEEWLKLGRLK+KCGGGKGVVEL
Subjt:  LSLHKIFYIRTPCIPPPRNSLQT-----PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGGGKGVVEL

Query:  LECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS
        LECLE EAIMGEDEGRDP DY+RRAKIFSTSS+VFQALKQ +   S
Subjt:  LECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAAS

A0A6J1FPZ7 uncharacterized protein LOC1114471761.5e-3575Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P S   PPW       RR A A RCVSQGGWG SVAELEREL+      EGEEWLKLGRL++KCG GGKG+VELLE LEREAIMGEDEGRDPTDY+RRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGWG-SVAELERELAAEVSLEEGEEWLKLGRLKQKCG-GGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAAS
        IFSTSSRVFQALKQHSD  S
Subjt:  IFSTSSRVFQALKQHSDAAS

A0A6J1FW01 uncharacterized protein LOC1114478172.0e-3273.11Show/hide
Query:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGW-GSVAELERELAAEVSLEEGEEWLKLGRLKQKC-GGGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK
        P SA  P W      RR AA A RCVSQGGW GSVAE            E EEWLKLGRL++KC GGGKGVVELLECLEREAIMGEDEGR+PTDYNRRAK
Subjt:  PTSAPCPPWSTNGAFRRRAAVATRCVSQGGW-GSVAELERELAAEVSLEEGEEWLKLGRLKQKC-GGGKGVVELLECLEREAIMGEDEGRDPTDYNRRAK

Query:  IFSTSSRVFQALKQHSDAA
        IFSTSS VFQALKQHSDAA
Subjt:  IFSTSSRVFQALKQHSDAA

A0A6J1JM89 uncharacterized protein LOC1114859336.8e-3364.19Show/hide
Query:  VAPPLSLSLHKIFYIRTPCIPPPRNSLQTPTSAPCPPWSTNGAFRRRAAVATRCVSQGGW-GSVAELERELAAEVSLEEGEEWLKLGRLKQKC-GGGKGV
        + PP S S    F+ R+  +  P+     P SA  PPW      RR AA A RCVSQGGW GSVAE           +E EEWLKLGRL +KC GGGKGV
Subjt:  VAPPLSLSLHKIFYIRTPCIPPPRNSLQTPTSAPCPPWSTNGAFRRRAAVATRCVSQGGW-GSVAELERELAAEVSLEEGEEWLKLGRLKQKC-GGGKGV

Query:  VELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAA
        VELLECLEREAIMGEDEGR+PTDYNRRAKIFSTSS VFQALKQHSDAA
Subjt:  VELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G05220.1 unknown protein9.7e-1646.15Show/hide
Query:  RRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGG--GKGVVELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQH
        + R   A RCV+ G          + AA +     EE  +L + +  CGG   +GV ELLECLE+EAIMG D+GRDP DYNRRAKIF  SS++F+ L + 
Subjt:  RRRAAVATRCVSQGGWGSVAELERELAAEVSLEEGEEWLKLGRLKQKCGG--GKGVVELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQH

Query:  SDAA
         D A
Subjt:  SDAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTCGGAAATTTGGATACTTGACTCGAGGACTTGTAATCCTTTCCCTACGCCTCGATTGCGACAACGTGCCATTACTTGACTGCCAAGCTGCTTCGCGAACTCT
CCAGCTAACACGTCAGGCGGTGGCTCCGCCTCTCTCTCTCTCTCTCCACAAGATTTTCTATATACGAACTCCCTGCATCCCACCCCCCAGAAACTCTCTTCAAACACCAA
CTTCAGCCCCCTGCCCGCCGTGGAGCACCAATGGGGCGTTCCGCCGCCGAGCTGCGGTGGCGACGAGATGCGTCAGTCAGGGCGGTTGGGGTTCTGTTGCGGAGCTAGAG
AGAGAGTTGGCGGCGGAGGTGAGCCTGGAGGAGGGGGAAGAGTGGCTGAAGCTGGGGAGGCTGAAGCAGAAGTGCGGCGGAGGAAAGGGAGTGGTGGAGCTACTGGAATG
TTTGGAAAGAGAAGCCATTATGGGGGAAGATGAAGGCAGAGACCCGACGGATTATAATCGGAGGGCCAAGATTTTCAGCACCAGTTCCAGGGTTTTCCAAGCTCTCAAGC
AACATTCTGATGCAGCTTCCCACCATTCTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATTCGGAAATTTGGATACTTGACTCGAGGACTTGTAATCCTTTCCCTACGCCTCGATTGCGACAACGTGCCATTACTTGACTGCCAAGCTGCTTCGCGAACTCT
CCAGCTAACACGTCAGGCGGTGGCTCCGCCTCTCTCTCTCTCTCTCCACAAGATTTTCTATATACGAACTCCCTGCATCCCACCCCCCAGAAACTCTCTTCAAACACCAA
CTTCAGCCCCCTGCCCGCCGTGGAGCACCAATGGGGCGTTCCGCCGCCGAGCTGCGGTGGCGACGAGATGCGTCAGTCAGGGCGGTTGGGGTTCTGTTGCGGAGCTAGAG
AGAGAGTTGGCGGCGGAGGTGAGCCTGGAGGAGGGGGAAGAGTGGCTGAAGCTGGGGAGGCTGAAGCAGAAGTGCGGCGGAGGAAAGGGAGTGGTGGAGCTACTGGAATG
TTTGGAAAGAGAAGCCATTATGGGGGAAGATGAAGGCAGAGACCCGACGGATTATAATCGGAGGGCCAAGATTTTCAGCACCAGTTCCAGGGTTTTCCAAGCTCTCAAGC
AACATTCTGATGCAGCTTCCCACCATTCTCGATGA
Protein sequenceShow/hide protein sequence
MKIRKFGYLTRGLVILSLRLDCDNVPLLDCQAASRTLQLTRQAVAPPLSLSLHKIFYIRTPCIPPPRNSLQTPTSAPCPPWSTNGAFRRRAAVATRCVSQGGWGSVAELE
RELAAEVSLEEGEEWLKLGRLKQKCGGGKGVVELLECLEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDAASHHSR