; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016905 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016905
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF3049)
Genome locationtig00153016:392080..392700
RNA-Seq ExpressionSgr016905
SyntenySgr016905
Gene Ontology termsNA
InterPro domainsIPR021410 - The fantastic four family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016540.1 hypothetical protein SDJN02_21649 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-4356Show/hide
Query:  SPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEK
        SP + + DDYIG+ESCVDLKD+D TA D+   A+F       G + KRD K  W   + ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTEEK
Subjt:  SPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEK

Query:  VRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRTV
        V+HHE+FRAHRSDGRLMLQLV +DD                  +  EDGD+   +E + +  E GG  E GK  KY NVR RD AFMFGV V AGSLR+V
Subjt:  VRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRTV

XP_022141391.1 uncharacterized protein LOC111011804, partial [Momordica charantia]2.1e-5061.68Show/hide
Query:  QSEKLSP---REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMW-GMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADG
        QS+++ P   RE DIDDYIGMESCVDLK ++ TA D + P         +   DKR+  MW G GRKEREYPPPIPLLVRTENL SHMPWVLKRHYT DG
Subjt:  QSEKLSP---REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMW-GMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADG

Query:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNV-RGRDPAFMFGVA
        RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDD            G  ES QDQED D+        D  +GG    + KC K G V R RDPA MFGVA
Subjt:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNV-RGRDPAFMFGVA

Query:  V----AGSLRTVES
        V    AG+LRTV S
Subjt:  V----AGSLRTVES

XP_022939970.1 uncharacterized protein LOC111445672 [Cucurbita moschata]8.4e-4457.43Show/hide
Query:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE
        L P + + DDYIG+ESCVDLKD+D TA D+   A+F       G + KRD K  W MGRK ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTE
Subjt:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE

Query:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR
        EKV+HHE+FRAHRSDGRLMLQLV +DD                  +  EDGD+   +E + +  E GG  E GK  KY NVR RD AFMFGV V AGSLR
Subjt:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR

Query:  TV
        +V
Subjt:  TV

XP_022992861.1 uncharacterized protein LOC111489066 [Cucurbita maxima]1.7e-4456.22Show/hide
Query:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEE
        L P + + DDYIG+ESCVDLK   HTA D+   A+F       G + KRD     MGRK ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTEE
Subjt:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEE

Query:  KVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRT
        KV+HHE+FRAHRSDGRLMLQLV +DD        E+   G E  +++E+           +E    GGSE GK  KY NVR RD AFMFGV V AGSLR+
Subjt:  KVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRT

Query:  V
        V
Subjt:  V

XP_023551237.1 uncharacterized protein LOC111809117 [Cucurbita pepo subsp. pepo]3.2e-4357.92Show/hide
Query:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE
        L P + + DDYIG+ESCVDLKD D TA D    A+F       G + KRD K  W MGRK ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTE
Subjt:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE

Query:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR
        EKV+HHE+FRAHRSDGRLMLQLV +DD                  QD ++ DD    E   +E+   GGSE GK  KY NVR RD AFMFGV V AGSLR
Subjt:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR

Query:  TV
        +V
Subjt:  TV

TrEMBL top hitse value%identityAlignment
A0A6J1CJ12 uncharacterized protein LOC1110118041.0e-5061.68Show/hide
Query:  QSEKLSP---REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMW-GMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADG
        QS+++ P   RE DIDDYIGMESCVDLK ++ TA D + P         +   DKR+  MW G GRKEREYPPPIPLLVRTENL SHMPWVLKRHYT DG
Subjt:  QSEKLSP---REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMW-GMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADG

Query:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNV-RGRDPAFMFGVA
        RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDD            G  ES QDQED D+        D  +GG    + KC K G V R RDPA MFGVA
Subjt:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNV-RGRDPAFMFGVA

Query:  V----AGSLRTVES
        V    AG+LRTV S
Subjt:  V----AGSLRTVES

A0A6J1FP95 uncharacterized protein LOC1114456724.1e-4457.43Show/hide
Query:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE
        L P + + DDYIG+ESCVDLKD+D TA D+   A+F       G + KRD K  W MGRK ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTE
Subjt:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRD-KIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTE

Query:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR
        EKV+HHE+FRAHRSDGRLMLQLV +DD                  +  EDGD+   +E + +  E GG  E GK  KY NVR RD AFMFGV V AGSLR
Subjt:  EKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLR

Query:  TV
        +V
Subjt:  TV

A0A6J1GVY4 uncharacterized protein LOC1114580312.3e-3958.24Show/hide
Query:  MQSEKLSP--REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE-REYPPPIPLLVRTENLPSHMPWVLKRHYTADG
        MQS+ LSP  R+FD DDYIG+ESCVDL  ++HT+ D+             G + +RD  MWGM RK+  EYPPPI LLVRTENL S MPWVLKRHYT DG
Subjt:  MQSEKLSP--REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE-REYPPPIPLLVRTENLPSHMPWVLKRHYTADG

Query:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDC-MGKEGRGDE
        RLILTEE++R++E+FRAHRS+GRLMLQLVA DD   +ESNVE+G       +D  DG++C    EGR  E
Subjt:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDC-MGKEGRGDE

A0A6J1JTI2 uncharacterized protein LOC1114873722.3e-3960.62Show/hide
Query:  MQSEKLSP--REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE-REYPPPIPLLVRTENLPSHMPWVLKRHYTADG
        MQS+ LSP  R+FDIDDYIGMESCVDL+ ++HTA D+             G + KRD  M GM RK+  EYPPPI LLVRTENL S MPWVLKR YT DG
Subjt:  MQSEKLSP--REFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE-REYPPPIPLLVRTENLPSHMPWVLKRHYTADG

Query:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDC
        RLILT+E++RH+E+FRAHRSDGRLMLQLVA DD   +ESNVE+G+      +D   G++C
Subjt:  RLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDC

A0A6J1JUP6 uncharacterized protein LOC1114890668.2e-4556.22Show/hide
Query:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEE
        L P + + DDYIG+ESCVDLK   HTA D+   A+F       G + KRD     MGRK ++ YPPPIPLLVRTENL SH+PWV+KR YT DGRLILTEE
Subjt:  LSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRK-EREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEE

Query:  KVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRT
        KV+HHE+FRAHRSDGRLMLQLV +DD        E+   G E  +++E+           +E    GGSE GK  KY NVR RD AFMFGV V AGSLR+
Subjt:  KVRHHEYFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAV-AGSLRT

Query:  V
        V
Subjt:  V

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22110.1 structural constituent of ribosome7.6e-2741.44Show/hide
Query:  MQSEKLSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE---------REYPPPIPLLVRTENLPSHMPWVLKR
        M S   S     + DYIG ESC D           +L A+  ++  ++ ++  + +  +G  R+E         RE+PPPIPLL +T NL  HMPWVLKR
Subjt:  MQSEKLSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKE---------REYPPPIPLLVRTENLPSHMPWVLKR

Query:  HYTADGRLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHF----SEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENE
          T+DGRLIL EEKVRHHEYFRA+RS+GRL L LV LDD       E S+ +        S D+ED D+C  ++   D+++
Subjt:  HYTADGRLILTEEKVRHHEYFRAHRSDGRLMLQLVALDDHF----SEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENE

AT1G77932.1 Protein of unknown function (DUF3049)6.4e-1844.35Show/hide
Query:  YIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEKVRHHEYFRAH
        YIG ESC +   D     D+ LP     E+K +  +     IM           PP+        LPSH+P VLKR YT+DGRL+L EEKV  +EYFRAH
Subjt:  YIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEKVRHHEYFRAH

Query:  RSDGRLMLQLVALDDHFSEESNVE
        RS+GRLM+QLV+LD+    +S+V+
Subjt:  RSDGRLMLQLVALDDHFSEESNVE

AT5G22390.1 Protein of unknown function (DUF3049)8.7e-0733.33Show/hide
Query:  IGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKER-EYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEKVRHHEYFRAH
        +G ES   L+D+               + K+ G++D+ +   W   RKER EYPP              M  +  + Y  +GRL+L E ++   E+ RA 
Subjt:  IGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKER-EYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEKVRHHEYFRAH

Query:  RSDGRLMLQLV-ALDDHFSEESN
        R DGRL L+LV   DDH  EE N
Subjt:  RSDGRLMLQLV-ALDDHFSEESN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGCGAGAAGCTTTCCCCTCGAGAGTTTGATATTGATGATTACATTGGCATGGAGAGTTGTGTTGATTTGAAAGACGACGACCACACGGCCCCTGACATGCTGCT
GCCGGCGAACTTTTCCTCCGAGAGAAAAAATATTGGTGCAGATGATAAGAGGGACAAAATAATGTGGGGAATGGGGAGGAAAGAGAGGGAGTATCCTCCGCCAATACCAT
TGCTGGTTCGCACCGAGAATCTGCCTTCCCACATGCCTTGGGTGTTGAAACGACACTACACCGCAGACGGACGGCTGATACTGACGGAGGAAAAGGTGAGGCACCACGAG
TACTTCCGGGCTCACAGATCCGACGGCCGTCTGATGCTGCAGCTGGTGGCCCTCGACGACCACTTCTCCGAGGAGTCGAATGTGGAAGTGGGCAACGGTGGCACCGAGTC
CAGCCAAGATCAGGAAGATGGGGATGATTGTATGGGGAAGGAGGGCCGTGGCGATGAGAATGAGGGCGGCGGCGGGAGTGAAGAGGGGAAATGCTTCAAGTATGGAAATG
TGAGGGGCAGAGATCCAGCGTTTATGTTCGGAGTGGCAGTGGCAGGGAGCCTGAGGACCGTTGAAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGCGAGAAGCTTTCCCCTCGAGAGTTTGATATTGATGATTACATTGGCATGGAGAGTTGTGTTGATTTGAAAGACGACGACCACACGGCCCCTGACATGCTGCT
GCCGGCGAACTTTTCCTCCGAGAGAAAAAATATTGGTGCAGATGATAAGAGGGACAAAATAATGTGGGGAATGGGGAGGAAAGAGAGGGAGTATCCTCCGCCAATACCAT
TGCTGGTTCGCACCGAGAATCTGCCTTCCCACATGCCTTGGGTGTTGAAACGACACTACACCGCAGACGGACGGCTGATACTGACGGAGGAAAAGGTGAGGCACCACGAG
TACTTCCGGGCTCACAGATCCGACGGCCGTCTGATGCTGCAGCTGGTGGCCCTCGACGACCACTTCTCCGAGGAGTCGAATGTGGAAGTGGGCAACGGTGGCACCGAGTC
CAGCCAAGATCAGGAAGATGGGGATGATTGTATGGGGAAGGAGGGCCGTGGCGATGAGAATGAGGGCGGCGGCGGGAGTGAAGAGGGGAAATGCTTCAAGTATGGAAATG
TGAGGGGCAGAGATCCAGCGTTTATGTTCGGAGTGGCAGTGGCAGGGAGCCTGAGGACCGTTGAAAGCTAA
Protein sequenceShow/hide protein sequence
MQSEKLSPREFDIDDYIGMESCVDLKDDDHTAPDMLLPANFSSERKNIGADDKRDKIMWGMGRKEREYPPPIPLLVRTENLPSHMPWVLKRHYTADGRLILTEEKVRHHE
YFRAHRSDGRLMLQLVALDDHFSEESNVEVGNGGTESSQDQEDGDDCMGKEGRGDENEGGGGSEEGKCFKYGNVRGRDPAFMFGVAVAGSLRTVES