CuGenDBv2

Sgr021849 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021849
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF1685)
Genome location: tig00153840:737599..738860
RNA-Seq Expression: Sgr021849
Synteny: Sgr021849
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
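
The genome location above follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the span length computed as follows. `parse_location` is a hypothetical helper for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153840:737599..738860")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that assumption the gene model spans 1,262 bp on contig tig00153840.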


Homology
GenBank top hits | e-value | % identity | Alignment
KAG6575353.1 hypothetical protein SDJN03_25992, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.3e-74 | % identity: 62.93
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA
        M   S + ESQ+SPE  IGGD EA+I   QS F+S DDR+E    TDW      KKTKKKNQILLEGFVE +DEENLTRTKSLTDDDLEELKGCVDLGFA
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACTKITWIFSWSLCFEIPFKKP
        F YDEIPELCNTLPALELCYSMSQKF+DEHQK+P++S P+  DS    SSPIPNWKISSPGDHPEDVKARLK+WAQAVA                     
Subjt:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACTKITWIFSWSLCFEIPFKKP

Query:  CPYNFHFLARDGFSGTVYIWSDF-DDGRRKKKEKN-KKKNVSVVRAPSKLHQQAFATPL
                           WSDF D+GR+KKK+K   KKNVSVVRAPS+LHQ  FA+ L
Subjt:  CPYNFHFLARDGFSGTVYIWSDF-DDGRRKKKEKN-KKKNVSVVRAPSKLHQQAFATPL

KAG6592872.1 hypothetical protein SDJN03_12348, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.3e-73 | % identity: 77.01
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS
        M   S  +ESQ+SPE  IGG+PE +IF  +  FSSEDD DE    TDW    KK KKKNQILLEGFVEA+DEENLTRTKSLTDDDLEELKGCVDLGFAF 
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS

Query:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACTKITWIFSW
        YDEIPELCNTLPALELCYSMSQKFMD+HQK+P++SPPESVDS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT +   F W
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACTKITWIFSW

XP_022150436.1 uncharacterized protein LOC111018593 [Momordica charantia] | e-value: 2.7e-76 | % identity: 85.06
Query:  SCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKK---TKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIP
        S ES+SQ+SP  PIGGDPEA+IF+ QS FSSEDDRDE EK T+WK    TKKKNQILLEGFVEAADEENL RTKSLTDDDLEELKGCVDLGFAF YDEIP
Subjt:  SCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKK---TKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIP

Query:  ELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        ELCNTLPALELCYSMSQKFMD+HQK+P+NSPPES DS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  ELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

XP_023514829.1 uncharacterized protein LOC111779030 [Cucurbita pepo subsp. pepo] | e-value: 1.4e-72 | % identity: 79.89
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS
        M   S ++ESQ+SPE  IGGDPE +IF  +  FSSEDD DE    TDW    KK KKKNQILLEGFVEA+DEENLTRTKSLTDDDLEELKGCVDLGFAF 
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS

Query:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        YDEIPELCNTLPALELCYSMSQKFMD+HQK+P++SPPESVDS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

XP_038885613.1 uncharacterized protein LOC120075933 [Benincasa hispida] | e-value: 1.3e-73 | % identity: 80.66
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKKTKK------KNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA
        M   S ESESQ+SPE  + GDPEA+I   QS FSSEDD DE    TDWK TKK      KNQILLEGFVEA+DE+NLTRTKSLTDDDLEELKGCVDLGFA
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKKTKK------KNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        F YDEIPELCNTLPALELCYSMSQKFMDEHQK+P+NSPPESVDS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
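
In the alignments above, the middle line annotates each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. This is the usual BLAST convention; the page itself does not describe it. A minimal sketch of counting identities from such a midline follows, using a short excerpt from the Momordica charantia hit. `midline_identity` is a hypothetical helper, and the figure for an excerpt will not match the whole-alignment % identity reported for each hit.

```python
def midline_identity(midline):
    """Return (identical, positives, length) for a BLAST-style midline."""
    identical = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    similar = midline.count("+")                     # '+' marks conservative substitutions
    return identical, identical + similar, len(midline)

# Midline excerpt aligned to query residues QKFMDEHQKIPDNSPPESVDS.
midline = "QKFMD+HQK+P+NSPPES DS"
identical, positives, length = midline_identity(midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
print(f"{positives}/{length} positives")
```

Wrapped alignment blocks would need their midlines concatenated before counting, with leading whitespace preserved so the columns stay in register.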

TrEMBL top hits | e-value | % identity | Alignment
A0A6J1DBH7 uncharacterized protein LOC111018593 | e-value: 1.3e-76 | % identity: 85.06
Query:  SCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKK---TKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIP
        S ES+SQ+SP  PIGGDPEA+IF+ QS FSSEDDRDE EK T+WK    TKKKNQILLEGFVEAADEENL RTKSLTDDDLEELKGCVDLGFAF YDEIP
Subjt:  SCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKK---TKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIP

Query:  ELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        ELCNTLPALELCYSMSQKFMD+HQK+P+NSPPES DS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  ELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

A0A6J1EVL6 uncharacterized protein LOC111436373 | e-value: 8.5e-68 | % identity: 75.14
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA
        M   S + ESQ+SPE  IGGD EA+I   QS F+S DDR+E    TDW      KKTKKKNQILLEGFVE +DEENLTRTKSLTD+DLEELKGCVDLGFA
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        F YDEIPELCNTLPALELCYSMSQKF+DEHQK+P++S P+  DS    SSPIPNWKISSPGDHPEDVKARLK+WAQAVACT
Subjt:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

A0A6J1H4T7 uncharacterized protein LOC111460587 | e-value: 1.3e-71 | % identity: 78.77
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS
        M   S  +ESQ+SPE  IGGDPE +IF  +  FSSEDD DE    TDW    KK KKKNQILLEGFVE +DEENLTRTKSLTDDDLEELKGCVDLGFAF 
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS

Query:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        YDEIPELCNTLPALELCYSMSQKFMD+HQ +P++SPPESVDS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

A0A6J1JTH9 uncharacterized protein LOC111488729 | e-value: 2.9e-68 | % identity: 75.69
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA
        M   S + ESQ+SPE  IGGD EA+I   QS F+S D RDE    TDW      KKTKKKNQILLEGFVE +DEENLTRTKSLTDDDLEELKGCVDLGFA
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW------KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        F YDEIPELCNTLPALELCYSMSQKF+DEHQK+P++SPP+  DS    SSPIPNWKISSPGDHPE+VKARLK+WAQAVACT
Subjt:  FSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

A0A6J1L190 uncharacterized protein LOC111498175 | e-value: 7.4e-72 | % identity: 78.77
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS
        M   S ++ESQ+SPE  IGGDPE +IF  +  FSSEDD D     TDW    KK KKKNQILLEGFVEA+D+ENLTRTKSLTDDDLEELKGCVDLGFAF 
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDW----KKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFS

Query:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        YDEIPELCNTLPALELCYSMSQKFMD+HQK+P++SPPESVDS    SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

SwissProt top hits | e-value | % identity | Alignment
No hits found

Arabidopsis top hits | e-value | % identity | Alignment
AT1G05870.1 Protein of unknown function (DUF1685) | e-value: 9.0e-46 | % identity: 61.25
Query:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS
        EI   +S  + E  R +LE        +KK+Q+LLEG+VE A        +++LTR+KSLTDDDLE+L+GC+DLGF FSYDEIPELCNTLPALELCYSMS
Subjt:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS

Query:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        QKF+D+ Q K P+ S  E   S  L  ++PI NWKISSPGD+P+DVKARLKYWAQAVACT
Subjt:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

AT1G05870.2 Protein of unknown function (DUF1685) | e-value: 9.0e-46 | % identity: 61.25
Query:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS
        EI   +S  + E  R +LE        +KK+Q+LLEG+VE A        +++LTR+KSLTDDDLE+L+GC+DLGF FSYDEIPELCNTLPALELCYSMS
Subjt:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS

Query:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        QKF+D+ Q K P+ S  E   S  L  ++PI NWKISSPGD+P+DVKARLKYWAQAVACT
Subjt:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

AT1G05870.3 Protein of unknown function (DUF1685) | e-value: 9.0e-46 | % identity: 61.25
Query:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS
        EI   +S  + E  R +LE        +KK+Q+LLEG+VE A        +++LTR+KSLTDDDLE+L+GC+DLGF FSYDEIPELCNTLPALELCYSMS
Subjt:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS

Query:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        QKF+D+ Q K P+ S  E   S  L  ++PI NWKISSPGD+P+DVKARLKYWAQAVACT
Subjt:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

AT1G05870.4 Protein of unknown function (DUF1685) | e-value: 9.0e-46 | % identity: 61.25
Query:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS
        EI   +S  + E  R +LE        +KK+Q+LLEG+VE A        +++LTR+KSLTDDDLE+L+GC+DLGF FSYDEIPELCNTLPALELCYSMS
Subjt:  EIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAA-------DEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPALELCYSMS

Query:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        QKF+D+ Q K P+ S  E   S  L  ++PI NWKISSPGD+P+DVKARLKYWAQAVACT
Subjt:  QKFMDEHQ-KIPDNSPPESVDS-SLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

AT2G43340.1 Protein of unknown function (DUF1685) | e-value: 1.0e-44 | % identity: 54.59
Query:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDEL--------EKTTDWKKTKKKNQILLEGF-VEAADEENLTRTKSLTDDDLEELKGCVDL
        + RCS       S E  +  D E       S   SE + +E+         K    K  KKK+ +LLEG+ V++A  ++L RTKSLTDDDLEELKGCVDL
Subjt:  MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDEL--------EKTTDWKKTKKKNQILLEGF-VEAADEENLTRTKSLTDDDLEELKGCVDL

Query:  GFAFSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSL-AASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        GF F+Y+EIPELCNTLPALELCYSMSQKF+D+      +S PE   S L +  SPI +WKISSPGD+P+DVKARLK+WAQAVACT
Subjt:  GFAFSYDEIPELCNTLPALELCYSMSQKFMDEHQKIPDNSPPESVDSSL-AASSPIPNWKISSPGDHPEDVKARLKYWAQAVACT


Sequences
CDS sequence
ATGGGTCGCTGTTCATGTGAATCCGAATCTCAGGTTTCTCCTGAAATCCCCATTGGAGGGGACCCAGAAGCTGAAATCTTCGATATCCAATCTGGGTTTAGCTCGGAGGA
CGATCGTGACGAGTTAGAGAAGACGACGGACTGGAAGAAGACGAAGAAGAAGAACCAGATCTTGCTGGAGGGGTTCGTGGAGGCTGCAGATGAGGAGAATTTGACGAGGA
CGAAGAGCTTGACGGATGACGATCTTGAGGAGCTTAAAGGCTGTGTGGATCTGGGTTTCGCGTTTAGCTACGACGAGATTCCCGAGCTCTGCAACACCTTGCCGGCGCTC
GAGCTCTGCTATTCGATGAGCCAGAAGTTTATGGATGAGCACCAGAAGATTCCTGATAATTCTCCGCCGGAGTCGGTGGATTCGAGTCTGGCGGCTTCGAGTCCAATTCC
GAACTGGAAGATCTCTAGCCCTGGTGATCATCCAGAAGATGTTAAAGCAAGGCTCAAATATTGGGCACAAGCGGTGGCTTGTACTAAGATAACATGGATCTTCTCCTGGA
GCTTGTGCTTTGAAATCCCTTTCAAAAAACCATGTCCATACAATTTCCATTTTCTAGCCAGAGATGGGTTTTCTGGGACAGTCTATATTTGGTCTGATTTTGATGATGGA
AGAAGAAAAAAGAAGGAGAAGAACAAGAAGAAGAATGTCTCAGTGGTGAGAGCCCCATCAAAGCTTCACCAACAGGCTTTTGCTACACCTCTGCACATGCTGAG
mRNA sequence
ATGGGTCGCTGTTCATGTGAATCCGAATCTCAGGTTTCTCCTGAAATCCCCATTGGAGGGGACCCAGAAGCTGAAATCTTCGATATCCAATCTGGGTTTAGCTCGGAGGA
CGATCGTGACGAGTTAGAGAAGACGACGGACTGGAAGAAGACGAAGAAGAAGAACCAGATCTTGCTGGAGGGGTTCGTGGAGGCTGCAGATGAGGAGAATTTGACGAGGA
CGAAGAGCTTGACGGATGACGATCTTGAGGAGCTTAAAGGCTGTGTGGATCTGGGTTTCGCGTTTAGCTACGACGAGATTCCCGAGCTCTGCAACACCTTGCCGGCGCTC
GAGCTCTGCTATTCGATGAGCCAGAAGTTTATGGATGAGCACCAGAAGATTCCTGATAATTCTCCGCCGGAGTCGGTGGATTCGAGTCTGGCGGCTTCGAGTCCAATTCC
GAACTGGAAGATCTCTAGCCCTGGTGATCATCCAGAAGATGTTAAAGCAAGGCTCAAATATTGGGCACAAGCGGTGGCTTGTACTAAGATAACATGGATCTTCTCCTGGA
GCTTGTGCTTTGAAATCCCTTTCAAAAAACCATGTCCATACAATTTCCATTTTCTAGCCAGAGATGGGTTTTCTGGGACAGTCTATATTTGGTCTGATTTTGATGATGGA
AGAAGAAAAAAGAAGGAGAAGAACAAGAAGAAGAATGTCTCAGTGGTGAGAGCCCCATCAAAGCTTCACCAACAGGCTTTTGCTACACCTCTGCACATGCTGAG
Protein sequence
MGRCSCESESQVSPEIPIGGDPEAEIFDIQSGFSSEDDRDELEKTTDWKKTKKKNQILLEGFVEAADEENLTRTKSLTDDDLEELKGCVDLGFAFSYDEIPELCNTLPAL
ELCYSMSQKFMDEHQKIPDNSPPESVDSSLAASSPIPNWKISSPGDHPEDVKARLKYWAQAVACTKITWIFSWSLCFEIPFKKPCPYNFHFLARDGFSGTVYIWSDFDDG
RRKKKEKNKKKNVSVVRAPSKLHQQAFATPLHMLX
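
The protein sequence above corresponds to the CDS read three nucleotides at a time. A minimal sketch of that translation with the standard genetic code follows, shown on the first 30 nt and the last 11 nt of the CDS. `translate` is a hypothetical helper for illustration. Rendering a trailing incomplete codon as X is an assumption about how the final X in the record arose, since the listed CDS ends with two nucleotides left over after the last full codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string codon by codon from position 0."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        # Assumption: incomplete or ambiguous codons are reported as 'X'.
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

print(translate("ATGGGTCGCTGTTCATGTGAATCCGAATCT"))  # first 30 nt of the CDS
print(translate("CACATGCTGAG"))                     # last 11 nt of the CDS
```

The two calls give MGRCSCESES and HMLX, matching the start and end of the listed protein sequence.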