; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012232 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012232
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153284:95077..101524
RNA-Seq ExpressionSgr012232
SyntenySgr012232
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593003.1 hypothetical protein SDJN03_12479, partial [Cucurbita argyrosperma subsp. sororia]1.0e-6258.33Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +E   +  G + R IKLFCPSLST+APF+ S DQ +D+GSIAT FGL+PSTVKLNG FLSRG DLVSSVTW SLLSFF +KRLPTG SD DA+VVDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG
        KIGVKRAH  QEI NGDCCEADEED NL+G R KPES+L+K                  LKC                           GF ELSDA   
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        +   AN  P T YSCSYNSKNMKRMR +E LV AFCKRT+
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

XP_022145086.1 uncharacterized protein LOC111014592 [Momordica charantia]1.4e-6760.5Show/hide
Query:  LHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKI
        + ME T HKPRIIKL CPSLS +APFL SD   +D+G+IAT FGLQPSTVKLNG FLSRGPDL+SSVTWKSLLSFF +KRLP GNSD+D +VVDGKL KI
Subjt:  LHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKI

Query:  GVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLI
        G+KRA G QEIV+G CCEADEED NL+   Q    +L+  KKLK RD                                        G  ELSD + GL 
Subjt:  GVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLI

Query:  DAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        DAANVVP   YSCSYNSKNMKRMR +ETLVSAFCKRTR
Subjt:  DAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

XP_022959564.1 uncharacterized protein LOC111460597 isoform X1 [Cucurbita moschata]1.0e-6258.33Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +E   +  G + R IKLFCPSLST+APF+ S DQ +D+GSIAT FGL+PSTVKLNG FLSRG DLVSSVTW SLLSFF +KRLPTG SD DA+VVDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG
        KIGVKRAH  QEI NGDCCEADEED NL+G R KPES+L+K                  LKC                           GF ELSDA   
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        +   AN  P T YSCSYNSKNMKRMR +E LV AFCKRT+
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

XP_023513937.1 uncharacterized protein LOC111778382 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-6358.75Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +E   +  G + R IKLFCPSLST+APF+ S DQ +D+GSIAT FGL+PSTVKLNG FLSRG DLVSSVTW SLLSFF +KRLPTG SD DA+VVDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG
        KIGVKRAH  QEI NGDCCEADEEDGNL+G R KPES+L+K                  LKC                           GF ELSDA   
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        + + AN  P T YSCSYNSKNMKRMR +E LV AFCKRT+
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

XP_038897803.1 uncharacterized protein LOC120085717 isoform X2 [Benincasa hispida]9.9e-6659.84Show/hide
Query:  KGETLPETVGSKIRSSVWSPNQNDQSHSHFTFLLKEEHYSHSWVELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGR
        + ETLPET+  +      SP+   Q++         E+     +E+ ME T  K R I LFCPSLST+APFL SDD  +D+GSIA  FGL PS++KLNG 
Subjt:  KGETLPETVGSKIRSSVWSPNQNDQSHSHFTFLLKEEHYSHSWVELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGR

Query:  FLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDLGFGELSD
        FLSRG DLVS VTW SLLSFF +KRLP G+SD DA++VDGKL K+GVKRAHG QEIV+GDCC+ADEED N++  R KPES+L+  KK+K  DLGF ELSD
Subjt:  FLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDLGFGELSD

Query:  AIGGLIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
          GG+ DAANV     YSCS+NS NMKRMR  ETLVSA CKR+R
Subjt:  AIGGLIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

TrEMBL top hitse value%identityAlignment
A0A1S3C6U5 uncharacterized protein LOC1034975641.8e-5252.92Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +++ ME T  K   I LFCPSLST APFL S D  +D+GSIA  FGL PS++KLNGRFLSRG DL+SSVTW SLLSFF +KRLP G+S  DA++VDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGG
        KIG KR HG QE V+GD  EADEE  +++  R KPES+L+  KK+K  D                                        GF ELSD  GG
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        + D ANV   T YSCS NS NMKRMR  ETLVSA CKR+R
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

A0A5D3E3Q7 Uncharacterized protein3.9e-5253.81Show/hide
Query:  METTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGV
        ME T  K   I LFCPSLST APFL S D  +D+GSIA  FGL PS++KLNGRFLSRG DL+SSVTW SLLSFF +KRLP G+S  DA++VDGKL KIG 
Subjt:  METTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGV

Query:  KRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLIDA
        KR HG QE V+GD  EADEE  +++  R KPES+L+  KK+K  D                                        GF ELSD  GG+ D 
Subjt:  KRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLIDA

Query:  ANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        ANV   T YSCS NS NMKRMR  ETLVSA CKR+R
Subjt:  ANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

A0A6J1CVB3 uncharacterized protein LOC1110145926.7e-6860.5Show/hide
Query:  LHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKI
        + ME T HKPRIIKL CPSLS +APFL SD   +D+G+IAT FGLQPSTVKLNG FLSRGPDL+SSVTWKSLLSFF +KRLP GNSD+D +VVDGKL KI
Subjt:  LHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKI

Query:  GVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLI
        G+KRA G QEIV+G CCEADEED NL+   Q    +L+  KKLK RD                                        G  ELSD + GL 
Subjt:  GVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI--KKLKCRDL---------------------------------------GFGELSDAIGGLI

Query:  DAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        DAANVVP   YSCSYNSKNMKRMR +ETLVSAFCKRTR
Subjt:  DAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

A0A6J1H8F0 uncharacterized protein LOC111460597 isoform X15.0e-6358.33Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +E   +  G + R IKLFCPSLST+APF+ S DQ +D+GSIAT FGL+PSTVKLNG FLSRG DLVSSVTW SLLSFF +KRLPTG SD DA+VVDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG
        KIGVKRAH  QEI NGDCCEADEED NL+G R KPES+L+K                  LKC                           GF ELSDA   
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKK-----------------LKC------------------------RDLGFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        +   AN  P T YSCSYNSKNMKRMR +E LV AFCKRT+
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

A0A6J1KVM1 uncharacterized protein LOC111498000 isoform X11.4e-6257.92Show/hide
Query:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC
        +E   +  G + R IKLFC SLST+APF+ S+DQ +D+GSIAT FGL+PSTVKLNG FLSRG DLVSSVTW SLLSFF +KRLPTG SD DA+VVDGKL 
Subjt:  VELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLC

Query:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI---------------------------------------KKLKCRDL--GFGELSDAIGG
        KIGVKRAH  QEI NGDCCEADEEDGNL+G R KPES+L+                                       KKLK  +   GF ELSDA   
Subjt:  KIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLI---------------------------------------KKLKCRDL--GFGELSDAIGG

Query:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR
        + + AN  P T  SCSYNSKNMKRMR +E LV AFCKRT+
Subjt:  LIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G07150.1 unknown protein1.5e-2739.22Show/hide
Query:  RIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSS-VTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGVKRAHG--
        R IKLFCPS+S +  ++  +D+ LD  +IA  FGL+PSTVKLNG F+SRG DLV++ VTW+SLL+FF ++ L TG  + DA++V GKL K+G KRA    
Subjt:  RIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGLQPSTVKLNGRFLSRGPDLVSS-VTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGVKRAHG--

Query:  LQEIVNGDC-----------CEADEE--DGNLSGRRQKPESSLIKKLKCRDLGFGELSDAIGGLIDAANVVPCTTYSCSYNSKN-MKRMRVNETLVSAFC
        L++    D            C   E    G    +    +S  +KKLK        + D+ GG          T   CS+ S N +KR R ++ + SA C
Subjt:  LQEIVNGDC-----------CEADEE--DGNLSGRRQKPESSLIKKLKCRDLGFGELSDAIGGLIDAANVVPCTTYSCSYNSKN-MKRMRVNETLVSAFC

Query:  KRTR
        K+ R
Subjt:  KRTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTTGGGCTTGGAGCAACCTCTGCGTTCCGCTCACCTCAATTGCCTTCCTTCAGCAAGGGCGAGACGCTTCCAGAGACCGTTGGAAGCAAAATCCGCAGCTCTGT
CTGGTCTCCAAACCAAAATGACCAGAGCCACAGCCATTTCACTTTCCTTCTCAAAGAAGAGCATTATTCTCACAGTTGGGTCGAGCTTCATATGGAGACGACAGGTCACA
AACCCAGAATCATCAAGCTATTTTGCCCCTCACTCTCCACCGTTGCCCCATTCCTCACATCCGACGACCAACCCCTCGATATGGGCTCCATAGCCACCACCTTCGGCCTC
CAACCCTCCACGGTGAAGCTCAATGGCCGCTTCCTCAGCCGAGGGCCCGATCTCGTCTCCTCCGTCACTTGGAAGTCCCTTCTCTCTTTCTTCTTTTCTAAACGACTGCC
TACTGGGAACTCCGACAAGGATGCGATAGTTGTTGATGGAAAGCTCTGTAAAATTGGCGTCAAGAGAGCTCATGGCCTTCAGGAAATTGTAAATGGTGATTGTTGCGAAG
CTGATGAAGAAGATGGAAATCTGAGTGGTAGAAGGCAAAAACCAGAAAGCAGCCTGATCAAGAAGTTGAAATGCAGGGACTTAGGTTTCGGTGAATTATCAGATGCAATT
GGAGGATTAATCGACGCAGCCAATGTCGTTCCATGCACGACATATTCGTGTAGCTACAATAGTAAGAATATGAAAAGGATGAGAGTAAATGAGACTCTTGTTTCAGCTTT
CTGCAAGAGAACTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCTTGGGCTTGGAGCAACCTCTGCGTTCCGCTCACCTCAATTGCCTTCCTTCAGCAAGGGCGAGACGCTTCCAGAGACCGTTGGAAGCAAAATCCGCAGCTCTGT
CTGGTCTCCAAACCAAAATGACCAGAGCCACAGCCATTTCACTTTCCTTCTCAAAGAAGAGCATTATTCTCACAGTTGGGTCGAGCTTCATATGGAGACGACAGGTCACA
AACCCAGAATCATCAAGCTATTTTGCCCCTCACTCTCCACCGTTGCCCCATTCCTCACATCCGACGACCAACCCCTCGATATGGGCTCCATAGCCACCACCTTCGGCCTC
CAACCCTCCACGGTGAAGCTCAATGGCCGCTTCCTCAGCCGAGGGCCCGATCTCGTCTCCTCCGTCACTTGGAAGTCCCTTCTCTCTTTCTTCTTTTCTAAACGACTGCC
TACTGGGAACTCCGACAAGGATGCGATAGTTGTTGATGGAAAGCTCTGTAAAATTGGCGTCAAGAGAGCTCATGGCCTTCAGGAAATTGTAAATGGTGATTGTTGCGAAG
CTGATGAAGAAGATGGAAATCTGAGTGGTAGAAGGCAAAAACCAGAAAGCAGCCTGATCAAGAAGTTGAAATGCAGGGACTTAGGTTTCGGTGAATTATCAGATGCAATT
GGAGGATTAATCGACGCAGCCAATGTCGTTCCATGCACGACATATTCGTGTAGCTACAATAGTAAGAATATGAAAAGGATGAGAGTAAATGAGACTCTTGTTTCAGCTTT
CTGCAAGAGAACTAGATAA
Protein sequenceShow/hide protein sequence
MSLGLGATSAFRSPQLPSFSKGETLPETVGSKIRSSVWSPNQNDQSHSHFTFLLKEEHYSHSWVELHMETTGHKPRIIKLFCPSLSTVAPFLTSDDQPLDMGSIATTFGL
QPSTVKLNGRFLSRGPDLVSSVTWKSLLSFFFSKRLPTGNSDKDAIVVDGKLCKIGVKRAHGLQEIVNGDCCEADEEDGNLSGRRQKPESSLIKKLKCRDLGFGELSDAI
GGLIDAANVVPCTTYSCSYNSKNMKRMRVNETLVSAFCKRTR