; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027383 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027383
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationtig00153054:962739..969580
RNA-Seq ExpressionSgr027383
SyntenySgr027383
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99070.1 uncharacterized protein E5676_scaffold248G002740 [Cucumis melo var. makuwa]4.8e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

XP_008436988.1 PREDICTED: uncharacterized protein LOC103482551 isoform X2 [Cucumis melo]4.8e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

XP_022137189.1 uncharacterized protein LOC111008718 [Momordica charantia]3.2e-7580.12Show/hide
Query:  EVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVG
        EV     R  + D+I QLV GDGRL+GK+YGDPT LNSI LNDLHA SWYSVAWYPIYRIPDGN RAAFLTYHSLGHFVCRTSQ +S D DSCLVCPVVG
Subjt:  EVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVG

Query:  LQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        LQSYNAQ+ECWFEPRN  S F+  + PPGIL+ERLRTLEETASLMARA+VKKGNLNS+NTHPDYEFFLSRR
Subjt:  LQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

XP_022137189.1 uncharacterized protein LOC111008718 [Momordica charantia]4.5e-1684.91Show/hide
Query:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDR
        MQC LERR S+ QK+PD GKELLEVRFQEDNCSRRIKDSEVSSLAWRNF D R
Subjt:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDR

XP_022137189.1 uncharacterized protein LOC111008718 [Momordica charantia]6.1e-7484.28Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG  +GK+YGDPT LNSI LNDLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RT QS+SPD +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPRN    F+ GL PP IL+ERLRTLEETASLMARAVVKKGNLNS+NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

XP_038894653.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida]2.7e-0563.27Show/hide
Query:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNF
        MQC     SS+FQKV D  KE LE+R +E+ CSR IKDS+VSS AWRNF
Subjt:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNF

XP_038894653.1 uncharacterized protein LOC120083142 isoform X1 [Benincasa hispida]4.8e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

TrEMBL top hitse value%identityAlignment
A0A1S3AT34 uncharacterized protein LOC103482551 isoform X22.3e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

A0A1S4DW44 uncharacterized protein LOC103482551 isoform X12.3e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

A0A6J1C5T5 uncharacterized protein LOC1110087181.6e-7580.12Show/hide
Query:  EVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVG
        EV     R  + D+I QLV GDGRL+GK+YGDPT LNSI LNDLHA SWYSVAWYPIYRIPDGN RAAFLTYHSLGHFVCRTSQ +S D DSCLVCPVVG
Subjt:  EVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVG

Query:  LQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        LQSYNAQ+ECWFEPRN  S F+  + PPGIL+ERLRTLEETASLMARA+VKKGNLNS+NTHPDYEFFLSRR
Subjt:  LQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

A0A6J1C5T5 uncharacterized protein LOC1110087182.2e-1684.91Show/hide
Query:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDR
        MQC LERR S+ QK+PD GKELLEVRFQEDNCSRRIKDSEVSSLAWRNF D R
Subjt:  MQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDR

A0A6J1C5T5 uncharacterized protein LOC1110087182.3e-7183.65Show/hide
Query:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF
        D+IHQLV GDG L+GK+YGDPT LNSI L+DLHAGSWYSVAWYPIYRIPDGN RAAFLTYHSLGHFV RTSQ    D +SCLVCPVVGLQSYNAQ+ECWF
Subjt:  DRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWF

Query:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
        EPR S S F+S L PP +LQERLRTLEETASLMARAVVKKGNLNS NTHPDYEFFLSRR
Subjt:  EPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

A0A6J1GS60 uncharacterized protein LOC1114570061.5e-7066.35Show/hide
Query:  ERRSSNFQKVPDN---GKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDG
        E+R+ + Q V  N     EL+   F+E+   +            R  + D+I QLV GDG L GK+YGDPT L SI LNDLHAGSWYSVAWYPIYRIPDG
Subjt:  ERRSSNFQKVPDN---GKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDG

Query:  NFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPD
        N RAAFLTYHSLGHFVCRTSQS S + DSC+VCPVVGLQS+NAQ+ECWF+PRNS S F+    PPG++ ERLRTLEETASLMARAVVKKGNLN++N HPD
Subjt:  NFRAAFLTYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPD

Query:  YEFFLSRR
        YEFFLSRR
Subjt:  YEFFLSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)9.6e-0932.37Show/hide
Query:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSL-----GHFVCRTSQSDSPDADSC--LVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGIL
        DL   SW+SVAWYPIY+IP G    +  A FLTYHSL     G  V   S       +S   +  PV GL SY  +   W     S    ++ L+     
Subjt:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSL-----GHFVCRTSQSDSPDADSC--LVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGIL

Query:  QERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSR
          RLR +                      HPD+ FF  R
Subjt:  QERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSR

AT2G01260.1 Protein of unknown function (DUF789)5.6e-0942.31Show/hide
Query:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSL-----GHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECW
        DL   SW+SVAWYPIYRIP G    +  A FLTYHSL     G    ++     P     +  PV GL SY  +   W
Subjt:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSL-----GHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECW

AT4G16100.1 Protein of unknown function (DUF789)1.3e-0836.03Show/hide
Query:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSLGHFVCRTS----QSDSPD-ADSCLVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQE
        DL   SW SVAWYPIYRIP G    N  A FLT+HSL      TS    QS S   A + L  P  GL SY  +   W  P + +              +
Subjt:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSLGHFVCRTS----QSDSPD-ADSCLVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQE

Query:  RLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLS
        R+ TL  TA    R +        K   PD+  F+S
Subjt:  RLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLS

AT5G23380.1 Protein of unknown function (DUF789)1.6e-0831.37Show/hide
Query:  TTLNSIPLNDLHAGSWYSVAWYPIYRIP-----DGNFRAAFLTYHSLGHFVCRT----------SQSDSPDADSCLVCPVVGLQSYNAQSECWFEPRNSM
        T L+S+  +DL   SW S+AWYPIY IP     DG   AAFLTYH L      T           +S +P+    ++ P  G  +Y A    W  P  S 
Subjt:  TTLNSIPLNDLHAGSWYSVAWYPIYRIP-----DGNFRAAFLTYHSLGHFVCRT----------SQSDSPDADSCLVCPVVGLQSYNAQSECWFEPRNSM

Query:  SAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR
                      +     EE+A    R   K+G      +H D+ FF+SR+
Subjt:  SAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRR

AT5G49220.1 Protein of unknown function (DUF789)1.5e-0934.09Show/hide
Query:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSLGHFVCRTSQSDSPDADSC-LVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRT
        DL   SW SV+WYPIYRIP G    N  A FLT+HSL     +++   S    S  L  P  GL SY  +   W                    Q R++ 
Subjt:  DLHAGSWYSVAWYPIYRIPDG----NFRAAFLTYHSLGHFVCRTSQSDSPDADSC-LVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRT

Query:  LEETASLMARAVVKKGNLNSKNTHPDYEFFLS
         ++  SL+  A   K     +  HPDY FF S
Subjt:  LEETASLMARAVVKKGNLNSKNTHPDYEFFLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAGTGTGTTCTTGAAAGAAGAAGTAGCAATTTTCAGAAAGTACCAGATAACGGAAAGGAGTTATTAGAAGTGAGATTTCAGGAAGACAATTGTTCCAGAAGAAT
TAAGGATTCTGAAGTTTCTTCTCTTGCATGGAGGAACTTTATCGATGACAGGATACATCAGCTGGTCTGGGGAGATGGACGTCTGGAAGGAAAATTATATGGGGATCCGA
CCACACTTAATTCAATACCTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATATATAGGATACCAGATGGCAATTTTCGAGCTGCGTTTTTG
ACTTACCATTCACTAGGACATTTTGTTTGTAGAACTTCCCAATCCGACTCTCCAGATGCAGATTCTTGTTTAGTATGTCCAGTTGTGGGCCTTCAAAGTTATAATGCACA
GAGTGAATGCTGGTTTGAGCCAAGAAACAGTATGTCAGCGTTTTCCTCTGGCTTATATCCTCCCGGAATCCTTCAGGAGCGCTTGAGGACGCTGGAAGAGACTGCATCTC
TCATGGCCAGAGCTGTTGTTAAGAAAGGAAATCTCAACTCCAAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACACTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGCAGTGTGTTCTTGAAAGAAGAAGTAGCAATTTTCAGAAAGTACCAGATAACGGAAAGGAGTTATTAGAAGTGAGATTTCAGGAAGACAATTGTTCCAGAAGAAT
TAAGGATTCTGAAGTTTCTTCTCTTGCATGGAGGAACTTTATCGATGACAGGATACATCAGCTGGTCTGGGGAGATGGACGTCTGGAAGGAAAATTATATGGGGATCCGA
CCACACTTAATTCAATACCTTTGAATGATCTGCATGCTGGATCATGGTACTCAGTGGCATGGTATCCCATATATAGGATACCAGATGGCAATTTTCGAGCTGCGTTTTTG
ACTTACCATTCACTAGGACATTTTGTTTGTAGAACTTCCCAATCCGACTCTCCAGATGCAGATTCTTGTTTAGTATGTCCAGTTGTGGGCCTTCAAAGTTATAATGCACA
GAGTGAATGCTGGTTTGAGCCAAGAAACAGTATGTCAGCGTTTTCCTCTGGCTTATATCCTCCCGGAATCCTTCAGGAGCGCTTGAGGACGCTGGAAGAGACTGCATCTC
TCATGGCCAGAGCTGTTGTTAAGAAAGGAAATCTCAACTCCAAAAACACGCATCCAGATTACGAGTTCTTCCTCTCACGGCGACACTAG
Protein sequenceShow/hide protein sequence
MMQCVLERRSSNFQKVPDNGKELLEVRFQEDNCSRRIKDSEVSSLAWRNFIDDRIHQLVWGDGRLEGKLYGDPTTLNSIPLNDLHAGSWYSVAWYPIYRIPDGNFRAAFL
TYHSLGHFVCRTSQSDSPDADSCLVCPVVGLQSYNAQSECWFEPRNSMSAFSSGLYPPGILQERLRTLEETASLMARAVVKKGNLNSKNTHPDYEFFLSRRH