; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020162 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020162
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGUB_WAK_bind domain-containing protein
Genome locationtig00153449:482531..485129
RNA-Seq ExpressionSgr020162
SyntenySgr020162
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0030247 - polysaccharide binding (molecular function)
InterPro domainsIPR025287 - Wall-associated receptor kinase, galacturonan-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147259.2 uncharacterized protein LOC101212248 [Cucumis sativus]1.1e-7560.22Show/hide
Query:  ISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGV
        IS+LS  F  LI+P S   Q   + +  C+HG P+IQFPF                F+LSC  N TRIHF++Y+SL++KSISYDQKRLDL DLN CVH  
Subjt:  ISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGV

Query:  FLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLMAA--PRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTKTG
        FL L+LSLTPFRYFYVVKDYLYLNCT RL SSSST +PCLS+ G  YVYVV+PPLM +  PRFC+ VK V IPFEYS YLDDGSFGLALTWG  DQTKT 
Subjt:  FLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLMAA--PRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTKTG

Query:  CQTGCFLKATNHQVLSISLLAAMVA-------ISMVIM--KIYHSKKQDYPKEEADKKLFEH-SYEALKNEPHD
         Q  CF KAT+ QV+ ISLL AMVA       ++MV+M  K Y SK ++Y KEE +KK+FEH SYE LK   +D
Subjt:  CQTGCFLKATNHQVLSISLLAAMVA-------ISMVIM--KIYHSKKQDYPKEEADKKLFEH-SYEALKNEPHD

XP_022158470.1 uncharacterized protein LOC111024954 [Momordica charantia]3.7e-7662.68Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQ-ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYES---LAVKSISYDQKRLDLADLNGCV
        M ISV S  FLLL + TSIK + + S   SHG  ++ FPF+L               +LSC  N T IHF+S+ES   LAVKSISYDQKRLDLAD +GCV
Subjt:  MGISVLSIVFLLLIAPTSIKVQ-ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYES---LAVKSISYDQKRLDLADLNGCV

Query:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP---GAYVYVVR-PPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGLD--
        HGVFL L+L+ TPFRYFY +KDY+Y+NCT +LP   ST VPCLS+    G YVYVVR  PL+AAPR CR VK VGIPFEYS YLDDGSFGL+L+WG D  
Subjt:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP---GAYVYVVR-PPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGLD--

Query:  QTKTGCQTGCFLKATNHQVLSISLLAAMVAISMVIMKIYHSKKQDYPKEEADKKLFEHSYEALKNEPHDSANHQLV
        Q KT C+TGCFLKATN QV+SI L+AAMVAI+MV+ KI HSKKQ+YPKEE DK    +SYEALKN   D  NHQLV
Subjt:  QTKTGCQTGCFLKATNHQVLSISLLAAMVAISMVIMKIYHSKKQDYPKEEADKKLFEHSYEALKNEPHDSANHQLV

XP_022938488.1 uncharacterized protein LOC111444707 isoform X1 [Cucurbita moschata]1.4e-7259.69Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF
        M IS L  +   L++P SIK Q  +  CSHG P I  PF                F+LSC  N TRIHF+SY+SL++KSISYDQKRLDL DLNGCVHG F
Subjt:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF

Query:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ
        LKLNL+LTPFRYFYVV+DY YLNCT++L SS S  +PCLS+P   YVYVVR  +   PRFC+ VK V IPFEYS YLDDGSFGL+L+WG   D+ +T  +
Subjt:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ

Query:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
         GC  KA N++VL + LLAAMV I SMVI+KI HSKK    KEE  KK+FEH YEALK
Subjt:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

XP_023005514.1 uncharacterized protein LOC111498479 isoform X1 [Cucurbita maxima]7.2e-7259.77Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVH
        M IS L      L++P SIK Q     +  CSHG P I  PF                F+LSC  N TRIHF+SY+SL++KSISYD+KRLDL DLNGCVH
Subjt:  MGISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVH

Query:  GVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKT
        G FLKLNLSLTPFRYFYVV+DY YLNCT++L  + S  +PCLS+P   YVYVVR  +   PRFC+ VK V IPFEYS YLDDGSFGL+LTWG   D+ +T
Subjt:  GVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKT

Query:  GCQTGCFLKATNHQVLSISLLAAMVAIS-MVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
          Q GC  KA N++VL +SLL AMV IS MVI+KI HSKKQ + KEEA KK+FEHSYEA+K
Subjt:  GCQTGCFLKATNHQVLSISLLAAMVAIS-MVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

XP_038905185.1 putative RING-H2 finger protein ATL21A [Benincasa hispida]7.2e-8064.12Show/hide
Query:  MGISVLSI--VFLLLIAPTSIKVQARSAQ--CSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCV
        M IS+ SI   F  LI+P SIK Q  S++  C+HG P+IQFPF                F++SC  N TRIHF++Y+SL++KSISYDQKRLDL DLN CV
Subjt:  MGISVLSI--VFLLLIAPTSIKVQARSAQ--CSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCV

Query:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLM-AAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTK
        H  FLKLNL LTPFRYFYVVKDY YLNCT RL S+SST +PCLS+ G  YVY VRPPLM + PR C+ +K V IPFEYS YLDDGSFGL+LTWG  D TK
Subjt:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLM-AAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTK

Query:  TGCQTGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
        T  Q  CF KATN QV+ ISLL AMVAI SMV+MKIYHSK + Y KEEA+KK+FEHSYE LK
Subjt:  TGCQTGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

TrEMBL top hitse value%identityAlignment
A0A0A0L6T1 Uncharacterized protein2.3e-7660.89Show/hide
Query:  ISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGV
        IS+LS  F  LI+P S   Q   + +  C+HG P+IQFPF                F+LSC  N TRIHF++Y+SL++KSISYDQKRLDL DLN CVH  
Subjt:  ISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGV

Query:  FLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLMAA--PRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTKTG
        FL L+LSLTPFRYFYVVKDYLYLNCT RL SSSST +PCLS+ G  YVYVV+PPLM +  PRFC+ VK V IPFEYS YLDDGSFGLALTWG  DQTKT 
Subjt:  FLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGA-YVYVVRPPLMAA--PRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL-DQTKTG

Query:  CQTGCFLKATNHQVLSISLLAAMVA----ISMVIM--KIYHSKKQDYPKEEADKKLFEH-SYEALKNEPHD
         Q  CF KAT+ QV+ ISLL AMVA    ++MV+M  K Y SK ++Y KEE +KK+FEH SYE LK   +D
Subjt:  CQTGCFLKATNHQVLSISLLAAMVA----ISMVIM--KIYHSKKQDYPKEEADKKLFEH-SYEALKNEPHD

A0A6J1DVX6 uncharacterized protein LOC1110249541.8e-7662.68Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQ-ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYES---LAVKSISYDQKRLDLADLNGCV
        M ISV S  FLLL + TSIK + + S   SHG  ++ FPF+L               +LSC  N T IHF+S+ES   LAVKSISYDQKRLDLAD +GCV
Subjt:  MGISVLSIVFLLLIAPTSIKVQ-ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYES---LAVKSISYDQKRLDLADLNGCV

Query:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP---GAYVYVVR-PPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGLD--
        HGVFL L+L+ TPFRYFY +KDY+Y+NCT +LP   ST VPCLS+    G YVYVVR  PL+AAPR CR VK VGIPFEYS YLDDGSFGL+L+WG D  
Subjt:  HGVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP---GAYVYVVR-PPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGLD--

Query:  QTKTGCQTGCFLKATNHQVLSISLLAAMVAISMVIMKIYHSKKQDYPKEEADKKLFEHSYEALKNEPHDSANHQLV
        Q KT C+TGCFLKATN QV+SI L+AAMVAI+MV+ KI HSKKQ+YPKEE DK    +SYEALKN   D  NHQLV
Subjt:  QTKTGCQTGCFLKATNHQVLSISLLAAMVAISMVIMKIYHSKKQDYPKEEADKKLFEHSYEALKNEPHDSANHQLV

A0A6J1FDB4 uncharacterized protein LOC111444707 isoform X21.5e-7058.91Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF
        M IS L  +   L++P SIK Q  +  CSHG P I  PF                F+LSC  N TRIHF+SY+SL++KSISYDQKRLDL DLNGCVHG F
Subjt:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF

Query:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ
        LKLNL+LTPFRYFYVV+DY YLNCT++L SS S  +PCLS+P   YVYVVR  +   PRFC+ VK V IPFEYS YLDDGSFGL+L+WG   D+ +T  +
Subjt:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ

Query:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
         GC  KA N++   + LLAAMV I SMVI+KI HSKK    KEE  KK+FEH YEALK
Subjt:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

A0A6J1FJ12 uncharacterized protein LOC111444707 isoform X17.0e-7359.69Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF
        M IS L  +   L++P SIK Q  +  CSHG P I  PF                F+LSC  N TRIHF+SY+SL++KSISYDQKRLDL DLNGCVHG F
Subjt:  MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVF

Query:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ
        LKLNL+LTPFRYFYVV+DY YLNCT++L SS S  +PCLS+P   YVYVVR  +   PRFC+ VK V IPFEYS YLDDGSFGL+L+WG   D+ +T  +
Subjt:  LKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKTGCQ

Query:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
         GC  KA N++VL + LLAAMV I SMVI+KI HSKK    KEE  KK+FEH YEALK
Subjt:  TGCFLKATNHQVLSISLLAAMVAI-SMVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

A0A6J1KZE3 uncharacterized protein LOC111498479 isoform X13.5e-7259.77Show/hide
Query:  MGISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVH
        M IS L      L++P SIK Q     +  CSHG P I  PF                F+LSC  N TRIHF+SY+SL++KSISYD+KRLDL DLNGCVH
Subjt:  MGISVLSIVFLLLIAPTSIKVQ---ARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVH

Query:  GVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKT
        G FLKLNLSLTPFRYFYVV+DY YLNCT++L  + S  +PCLS+P   YVYVVR  +   PRFC+ VK V IPFEYS YLDDGSFGL+LTWG   D+ +T
Subjt:  GVFLKLNLSLTPFRYFYVVKDYLYLNCTARLPSSSSTEVPCLSQP-GAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGL--DQTKT

Query:  GCQTGCFLKATNHQVLSISLLAAMVAIS-MVIMKIYHSKKQDYPKEEADKKLFEHSYEALK
          Q GC  KA N++VL +SLL AMV IS MVI+KI HSKKQ + KEEA KK+FEHSYEA+K
Subjt:  GCQTGCFLKATNHQVLSISLLAAMVAIS-MVIMKIYHSKKQDYPKEEADKKLFEHSYEALK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCTCAGTCCTCTCCATCGTCTTCCTCCTCCTCATCGCTCCCACCTCCATTAAAGTCCAAGCTCGGTCTGCACAGTGCAGCCATGGCGGTCCAAGAATCCAGTT
CCCTTTCCAACTTCAACAGGACCATGGGCAGGTTTCTCAAAGTGGGCTTCCTGGTTTCCACCTTTCTTGCGAAGGAAACGCCACCAGAATTCACTTCCAAAGCTATGAAT
CTTTGGCAGTGAAGTCCATTTCCTACGACCAAAAAAGACTCGATCTTGCAGACCTCAATGGCTGCGTCCATGGCGTCTTTCTCAAGCTCAACCTCTCCCTCACCCCCTTC
CGCTACTTCTACGTCGTCAAAGATTATCTGTACCTAAACTGCACGGCGAGGCTGCCGTCGTCGTCTTCGACGGAGGTGCCGTGCCTGAGCCAACCTGGGGCTTATGTCTA
TGTTGTGAGGCCGCCATTAATGGCGGCGCCAAGGTTTTGCAGGGCAGTGAAGACAGTGGGGATCCCATTTGAGTACAGTGCTTATCTTGATGATGGTTCTTTTGGACTTG
CCTTAACTTGGGGTTTAGATCAGACTAAAACGGGTTGCCAAACAGGGTGTTTCTTAAAAGCAACAAATCATCAAGTGCTTAGCATTAGCTTGCTTGCAGCCATGGTGGCA
ATATCAATGGTGATCATGAAGATATATCACTCAAAAAAACAAGATTATCCAAAGGAAGAAGCTGACAAGAAGCTGTTTGAACATTCATATGAAGCCCTAAAAAATGAACC
ACATGACTCTGCTAACCACCAATTAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATCTCAGTCCTCTCCATCGTCTTCCTCCTCCTCATCGCTCCCACCTCCATTAAAGTCCAAGCTCGGTCTGCACAGTGCAGCCATGGCGGTCCAAGAATCCAGTT
CCCTTTCCAACTTCAACAGGACCATGGGCAGGTTTCTCAAAGTGGGCTTCCTGGTTTCCACCTTTCTTGCGAAGGAAACGCCACCAGAATTCACTTCCAAAGCTATGAAT
CTTTGGCAGTGAAGTCCATTTCCTACGACCAAAAAAGACTCGATCTTGCAGACCTCAATGGCTGCGTCCATGGCGTCTTTCTCAAGCTCAACCTCTCCCTCACCCCCTTC
CGCTACTTCTACGTCGTCAAAGATTATCTGTACCTAAACTGCACGGCGAGGCTGCCGTCGTCGTCTTCGACGGAGGTGCCGTGCCTGAGCCAACCTGGGGCTTATGTCTA
TGTTGTGAGGCCGCCATTAATGGCGGCGCCAAGGTTTTGCAGGGCAGTGAAGACAGTGGGGATCCCATTTGAGTACAGTGCTTATCTTGATGATGGTTCTTTTGGACTTG
CCTTAACTTGGGGTTTAGATCAGACTAAAACGGGTTGCCAAACAGGGTGTTTCTTAAAAGCAACAAATCATCAAGTGCTTAGCATTAGCTTGCTTGCAGCCATGGTGGCA
ATATCAATGGTGATCATGAAGATATATCACTCAAAAAAACAAGATTATCCAAAGGAAGAAGCTGACAAGAAGCTGTTTGAACATTCATATGAAGCCCTAAAAAATGAACC
ACATGACTCTGCTAACCACCAATTAGTTTGA
Protein sequenceShow/hide protein sequence
MGISVLSIVFLLLIAPTSIKVQARSAQCSHGGPRIQFPFQLQQDHGQVSQSGLPGFHLSCEGNATRIHFQSYESLAVKSISYDQKRLDLADLNGCVHGVFLKLNLSLTPF
RYFYVVKDYLYLNCTARLPSSSSTEVPCLSQPGAYVYVVRPPLMAAPRFCRAVKTVGIPFEYSAYLDDGSFGLALTWGLDQTKTGCQTGCFLKATNHQVLSISLLAAMVA
ISMVIMKIYHSKKQDYPKEEADKKLFEHSYEALKNEPHDSANHQLV