CuGenDBv2

Sgr016854 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016854
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: UPF0481 protein At3g47200-like
Genome location: tig00153010:2033747..2034178
RNA-Seq Expression: Sgr016854
Synteny: Sgr016854
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649012.1 hypothetical protein Csa_008980 [Cucumis sativus] | e value: 1.2e-52 | identity: 78.47%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR
        MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE HS SEEEVV +FGGLRN+LELHQ+S Y S  I+ MAEIQI KDINCYYESCWKVKA+R
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR

Query:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
         +KRYVNPVLKLIF++VVILL++VVVVR FCGWFG SRILHVVS
Subjt:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

KAG7025194.1 putative UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.3e-48 | identity: 75.18%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +AFE AT+LNPPCF+ Y  LMS LITTTKDVKILK+A II+ HSGSEE+VV LF GLRN+LELHQ S NS RI  MAEIQIVKDIN YYESCW VKAKRF
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHV
        IK+Y+NP++K+I L+VVILLI VVV+R+FCGWFG SRIL+V
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHV

XP_008459652.1 PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo] | e value: 2.9e-51 | identity: 77.78%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR
        MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE  S SEEE+V +F GLRN+LELHQQS Y S  I+ MAEIQI KDINCYYESCWKVKA+R
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR

Query:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
         +KRYVNPVLKLIF++VVILLI+VVVVR FCGWFG SRILHVVS
Subjt:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

XP_011656112.2 putative UPF0481 protein At3g02645 [Cucumis sativus] | e value: 1.2e-52 | identity: 78.47%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR
        MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE HS SEEEVV +FGGLRN+LELHQ+S Y S  I+ MAEIQI KDINCYYESCWKVKA+R
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR

Query:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
         +KRYVNPVLKLIF++VVILL++VVVVR FCGWFG SRILHVVS
Subjt:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

XP_038889519.1 putative UPF0481 protein At3g02645 [Benincasa hispida] | e value: 3.8e-51 | identity: 77.62%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +AFEAAT LNPP F YY  LMS LITT  DVKILKQ  IIE HSGSEEEVV LF GLRN+LELHQQ Y S RI+ MAE+QIVKDIN YYESCWKVKAKR 
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
        +KRYVNPV+K++F+IVVILLI+VVVV  FCGWFG SRILHVVS
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSY3 Uncharacterized protein | e value: 5.7e-53 | identity: 78.47%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR
        MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE HS SEEEVV +FGGLRN+LELHQ+S Y S  I+ MAEIQI KDINCYYESCWKVKA+R
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR

Query:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
         +KRYVNPVLKLIF++VVILL++VVVVR FCGWFG SRILHVVS
Subjt:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

A0A1S3CB64 putative UPF0481 protein At3g02645 | e value: 1.4e-51 | identity: 77.78%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR
        MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE  S SEEE+V +F GLRN+LELHQQS Y S  I+ MAEIQI KDINCYYESCWKVKA+R
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR

Query:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
         +KRYVNPVLKLIF++VVILLI+VVVVR FCGWFG SRILHVVS
Subjt:  FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

A0A6J1DUS9 putative UPF0481 protein At3g02645 isoform X2 | e value: 1.9e-48 | identity: 75.17%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELH--QQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAK
        +AFEAAT+ N   F +Y  LMS LITT KDVKILKQA IIE H GSEEEVV LF GLRNLLELH  Q SY+SGRI++MAEIQIVKDIN YYESCWKVKAK
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELH--QQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAK

Query:  RFIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
        RFIKR+VNP+LK+IFLI+VILLI+VVV+R  CGWF  SR+LHV+S
Subjt:  RFIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

A0A6J1EBZ5 putative UPF0481 protein At3g02645 | e value: 9.1e-43 | identity: 69.93%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +AFE AT+LNPPCF+ Y  LMS LITT KDVKILK+A IIE H G+EEEVV LF GLR++LEL            MAEIQ+VKDIN YYESCWKVKAKRF
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
        I++Y+NP+LK+I L+VVILLI VVVVR FCGWFG  RILHVVS
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

A0A6J1IEU4 putative UPF0481 protein At3g02645 | e value: 4.8e-44 | identity: 70.63%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +AFE AT+LNPPCF+ Y  L+S LITTTKDVKILK+A IIE H+GSEEEVV LF GLR++LEL            MAE+Q+VKDIN YYESCWKVKAKRF
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
        I++Y+NP+LK+I L+VVILLI VVVVR FCGWFG SRILHVVS
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 | e value: 4.5e-07 | identity: 25.58%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +A+EA     P  FT Y  L++ +I + +DV++L++ G++     S++E   ++ G+   + L +  +           + ++D+N YY   WKVK  R 
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLF
        ++ YV    +++  +  +LL+++V ++LF
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLF

Arabidopsis top hits (e value, % identity, alignment)
AT3G02645.1 Plant protein of unknown function (DUF247) | e value: 3.2e-08 | identity: 25.58%
Query:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF
        +A+EA     P  FT Y  L++ +I + +DV++L++ G++     S++E   ++ G+   + L +  +           + ++D+N YY   WKVK  R 
Subjt:  MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRF

Query:  IKRYVNPVLKLIFLIVVILLIIVVVVRLF
        ++ YV    +++  +  +LL+++V ++LF
Subjt:  IKRYVNPVLKLIFLIVVILLIIVVVVRLF
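
The % identity values above can be cross-checked against the alignment blocks. The sketch below assumes the alignments follow the usual BLAST pairwise layout, in which the middle row repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise; the page itself does not state this. The helper name `percent_identity` is hypothetical, and the rows are copied from the first GenBank hit (KAE8649012.1).

```python
def percent_identity(query_rows, midline_rows):
    """Return (identities, aligned_length, percent) for one alignment.

    Assumes a BLAST-style midline: a letter marks an identical column,
    '+' a conservative substitution, and a space any other column.
    """
    # Alignment length counts every column, including gap characters ('-').
    aligned_length = sum(len(row) for row in query_rows)
    # Only letters in the midline count as identities.
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return identities, aligned_length, 100.0 * identities / aligned_length


# Query and midline rows of the KAE8649012.1 alignment, as listed above.
QUERY_ROWS = [
    "MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQS-YNSGRIIKMAEIQIVKDINCYYESCWKVKAKR",
    "FIKRYVNPVLKLIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS",
]
MIDLINE_ROWS = [
    "MAFEAAT+LNPPCF+YY  LMS LITT  DVKILKQ  IIE HS SEEEVV +FGGLRN+LELHQ+S Y S  I+ MAEIQI KDINCYYESCWKVKA+R",
    " +KRYVNPVLKLIF++VVILL++VVVVR FCGWFG SRILHVVS",
]

identities, aligned_length, pct = percent_identity(QUERY_ROWS, MIDLINE_ROWS)
print(f"{identities}/{aligned_length} identical columns = {pct:.2f}%")
```

Taking the alignment length from the query rows rather than the midline keeps the calculation independent of how leading or trailing spaces in the midline were preserved.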


Sequences
CDS sequence
ATGGCGTTTGAAGCAGCAACTCAACTAAATCCGCCTTGCTTCACCTATTATGCGACATTGATGAGCAGGCTGATAACCACAACCAAAGATGTCAAGATACTGAAGCAAGC
AGGGATCATCGAGGGCCATTCAGGGAGTGAAGAAGAGGTGGTGATACTGTTTGGTGGGCTAAGAAATCTCTTGGAACTTCACCAGCAGAGTTATAACTCTGGTAGGATCA
TAAAGATGGCTGAGATTCAAATCGTTAAGGACATCAATTGCTATTATGAAAGTTGCTGGAAGGTGAAAGCCAAAAGATTCATTAAGAGATATGTGAATCCAGTATTGAAG
TTGATCTTTCTTATTGTTGTGATTTTGCTTATTATTGTGGTAGTTGTACGCTTGTTTTGTGGTTGGTTTGGTGGCTCTAGAATTTTGCATGTGGTTTCTTAG
mRNA sequence
ATGGCGTTTGAAGCAGCAACTCAACTAAATCCGCCTTGCTTCACCTATTATGCGACATTGATGAGCAGGCTGATAACCACAACCAAAGATGTCAAGATACTGAAGCAAGC
AGGGATCATCGAGGGCCATTCAGGGAGTGAAGAAGAGGTGGTGATACTGTTTGGTGGGCTAAGAAATCTCTTGGAACTTCACCAGCAGAGTTATAACTCTGGTAGGATCA
TAAAGATGGCTGAGATTCAAATCGTTAAGGACATCAATTGCTATTATGAAAGTTGCTGGAAGGTGAAAGCCAAAAGATTCATTAAGAGATATGTGAATCCAGTATTGAAG
TTGATCTTTCTTATTGTTGTGATTTTGCTTATTATTGTGGTAGTTGTACGCTTGTTTTGTGGTTGGTTTGGTGGCTCTAGAATTTTGCATGTGGTTTCTTAG
Protein sequence
MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRFIKRYVNPVLK
LIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS
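
The three sequences and the genome location can be checked against each other. The sketch below is an illustration rather than part of the database record: it translates the CDS with the standard genetic code (NCBI translation table 1, assumed here because the page does not name a table) and compares the result with the listed protein. It also assumes the genome coordinates are 1-based and inclusive. The names `translate`, `CDS`, `PROTEIN`, `START`, and `END` are hypothetical.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# CDS and protein exactly as listed in the record above.
CDS = (
    "ATGGCGTTTGAAGCAGCAACTCAACTAAATCCGCCTTGCTTCACCTATTATGCGACATTGATGAGCAGGCTGATAACCACAACCAAAGATGTCAAGATACTGAAGCAAGC"
    "AGGGATCATCGAGGGCCATTCAGGGAGTGAAGAAGAGGTGGTGATACTGTTTGGTGGGCTAAGAAATCTCTTGGAACTTCACCAGCAGAGTTATAACTCTGGTAGGATCA"
    "TAAAGATGGCTGAGATTCAAATCGTTAAGGACATCAATTGCTATTATGAAAGTTGCTGGAAGGTGAAAGCCAAAAGATTCATTAAGAGATATGTGAATCCAGTATTGAAG"
    "TTGATCTTTCTTATTGTTGTGATTTTGCTTATTATTGTGGTAGTTGTACGCTTGTTTTGTGGTTGGTTTGGTGGCTCTAGAATTTTGCATGTGGTTTCTTAG"
)
PROTEIN = (
    "MAFEAATQLNPPCFTYYATLMSRLITTTKDVKILKQAGIIEGHSGSEEEVVILFGGLRNLLELHQQSYNSGRIIKMAEIQIVKDINCYYESCWKVKAKRFIKRYVNPVLK"
    "LIFLIVVILLIIVVVVRLFCGWFGGSRILHVVS"
)

# Genome location tig00153010:2033747..2034178, assumed 1-based inclusive.
START, END = 2033747, 2034178
span_length = END - START + 1

translated = translate(CDS)
print("CDS length:", len(CDS), "| genomic span:", span_length)
print("translation matches protein:", translated.rstrip("*") == PROTEIN)
```

If the genomic span equals the CDS length, the annotated locus has no introns or untranslated regions, which would also explain why the CDS and mRNA sequences listed above are identical.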