; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026084 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026084
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1195)
Genome locationtig00153031:1599608..1600299
RNA-Seq ExpressionSgr026084
SyntenySgr026084
Gene Ontology termsGO:0008643 - carbohydrate transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010608 - Protein of unknown function DUF1195


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036238.1 Aldehyde dehydrogenase [Cucumis melo var. makuwa]4.1e-8291.01Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEKG
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS V YLLSDAY L SP   DDLMF+A+E G
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEKG

KAG7034156.1 hypothetical protein SDJN02_03883 [Cucurbita argyrosperma subsp. argyrosperma]7.3e-7988.14Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSS+FGKGRYKFWAL AILLLAFWSMFTGTVSLRWS GNLNGLSDDIDFN+ +DLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEK
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMS+HSI+LDTPSEQS V YLLSD Y L +P   DDLMF+AHEK
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEK

XP_008440698.1 PREDICTED: uncharacterized protein LOC103485038 [Cucumis melo]1.0e-7295.97Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

XP_011658007.1 uncharacterized protein LOC101216891 [Cucumis sativus]3.9e-7295.3Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDF+IHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

XP_038883913.1 uncharacterized protein LOC120074752 [Benincasa hispida]3.5e-7397.32Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSI+LDTPSEQS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

TrEMBL top hitse value%identityAlignment
A0A0A0KGF8 Uncharacterized protein1.9e-7295.3Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDF+IHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

A0A1S3B2C0 uncharacterized protein LOC1034850384.9e-7395.97Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

A0A5A7T076 Aldehyde dehydrogenase2.0e-8291.01Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEKG
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHS++LDTPS+QS V YLLSDAY L SP   DDLMF+A+E G
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEKG

A0A6J1BS47 uncharacterized protein LOC1110052576.7e-7092.62Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSD +DFNI+DDLD  EMEEREK+VKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSI++D  SEQS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

A0A6J1GEZ2 uncharacterized protein LOC1114533256.7e-7092.62Show/hide
Query:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTT ASAGKKESSVSS+FGKGRYKFWAL AILLLAFWSMFTGTVSLRWS GNLNGLSDDIDFN+ +DLD  EMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMS+HSI+LDTPSEQS
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19380.1 Protein of unknown function (DUF1195)1.8e-2251.85Show/hide
Query:  YKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAI
        YK W L A+LLLAF SM TG+VSL+   G  +       F+  DDLD  E+EEREK+V+ MWDVY  +  +++PRFW+EAFEAAYE L S+    R AA+
Subjt:  YKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAI

Query:  SEIARMSV
        S+IA++S+
Subjt:  SEIARMSV

AT4G36660.1 Protein of unknown function (DUF1195)1.9e-5368.99Show/hide
Query:  MKDDDVLPTTTASA---------GKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMW
        MK+DD LPTTT +           KKESS S LFG+GRYKFWA AAILLLAFWSMFTGTV+LR S GNLN LS+D+    +D+LD  EMEEREK+VKHMW
Subjt:  MKDDDVLPTTTASA---------GKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMW

Query:  DVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS
        DVYTN+RRI+LPRFWQEAF AAYE+LTS+VPG REAAI EIA+MS  SI LD P  +S
Subjt:  DVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQS

AT5G65650.1 Protein of unknown function (DUF1195)2.7e-5572.19Show/hide
Query:  MKDDDVLPTTTAS------AGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVY
        MKD D LP +T+S       GKKE+  S+LF KGRYKFWALAAILLLAFWSM TGTV+LRWSAGN+N  +DD+ F IH+DLD  EMEEREK+VKHMWDVY
Subjt:  MKDDDVLPTTTAS------AGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLD--EMEEREKIVKHMWDVY

Query:  TNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTP
         N RRIRLPRFWQEAFEAAYE+LTS+VP   EAAISEIARMS+ SI++D P
Subjt:  TNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVHSILLDTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGACGACGATGTCCTTCCAACAACGACAGCGAGTGCCGGGAAGAAAGAGAGCTCGGTTTCGAGTTTATTCGGCAAAGGCCGCTACAAGTTCTGGGCATTGGCTGC
CATTTTGTTGCTTGCATTTTGGTCCATGTTCACCGGTACCGTCTCTCTTCGATGGTCTGCCGGTAATCTCAACGGGCTATCTGATGATATCGATTTTAATATCCACGACG
ATCTCGATGAGATGGAGGAAAGGGAGAAGATAGTGAAGCACATGTGGGACGTTTACACAAATAACCGCCGGATCAGGTTACCGCGTTTCTGGCAAGAGGCATTTGAGGCT
GCGTACGAGGACCTGACAAGTGAAGTGCCGGGTGATAGAGAGGCTGCTATCTCCGAGATCGCCCGGATGTCCGTGCACTCCATTCTTCTTGATACGCCTTCGGAGCAATC
AAAGGTTAGCTATTTACTTTCTGATGCTTACTTTTTAGCATCTCCTGCCTCTGCTGATGATCTTATGTTCTTAGCACACGAAAAGGGTACCAAGGGCATTCTTTGGGCTA
TAATCCCAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGACGACGATGTCCTTCCAACAACGACAGCGAGTGCCGGGAAGAAAGAGAGCTCGGTTTCGAGTTTATTCGGCAAAGGCCGCTACAAGTTCTGGGCATTGGCTGC
CATTTTGTTGCTTGCATTTTGGTCCATGTTCACCGGTACCGTCTCTCTTCGATGGTCTGCCGGTAATCTCAACGGGCTATCTGATGATATCGATTTTAATATCCACGACG
ATCTCGATGAGATGGAGGAAAGGGAGAAGATAGTGAAGCACATGTGGGACGTTTACACAAATAACCGCCGGATCAGGTTACCGCGTTTCTGGCAAGAGGCATTTGAGGCT
GCGTACGAGGACCTGACAAGTGAAGTGCCGGGTGATAGAGAGGCTGCTATCTCCGAGATCGCCCGGATGTCCGTGCACTCCATTCTTCTTGATACGCCTTCGGAGCAATC
AAAGGTTAGCTATTTACTTTCTGATGCTTACTTTTTAGCATCTCCTGCCTCTGCTGATGATCTTATGTTCTTAGCACACGAAAAGGGTACCAAGGGCATTCTTTGGGCTA
TAATCCCAAACTAA
Protein sequenceShow/hide protein sequence
MKDDDVLPTTTASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDIDFNIHDDLDEMEEREKIVKHMWDVYTNNRRIRLPRFWQEAFEA
AYEDLTSEVPGDREAAISEIARMSVHSILLDTPSEQSKVSYLLSDAYFLASPASADDLMFLAHEKGTKGILWAIIPN