; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014373 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014373
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSASA domain-containing protein
Genome locationtig00000289:887203..888091
RNA-Seq ExpressionSgr014373
SyntenySgr014373
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016422.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-5247.73Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        A +SP NIFIL GQSNMAGRGGV  D  T    WDGYIP E               WE A EPLHWDID  K NGVGPGM FANELLAKA  SIG IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H+REW KGT  YT LV+R+  SE++GGK                                              VKI ++D  I SP   + E
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++  A+EAV QKLP + +VD     R+AV N   GLNED GH +V SEV +GKM AH+Y    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

XP_022134349.1 probable carbohydrate esterase At4g34215 [Momordica charantia]9.8e-7761.57Show/hide
Query:  FWGATSSPDNIFILGGQSNMAGRGGVFWDTITKHY-KWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGV
        F  ATS PDNIFILGGQSNMAGRGGV  D  T HY KWDGYIP+E               WELAHEPLHWDID  K NGVGPGMAFANELLAKANKSIGV
Subjt:  FWGATSSPDNIFILGGQSNMAGRGGVFWDTITKHY-KWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGV

Query:  IGLVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPIL
        IGLVPCAIGGT++REW KGTINYT LVDRINASE YGGK                                              VKI +YD LI SPI+
Subjt:  IGLVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPIL

Query:  NYTEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        NYT+ IRRA+EAVK KLPKIS VD+ +AI+  V++ KPGLNED GH SVYSEVEVGKMLAHAY Q  A
Subjt:  NYTEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

XP_022141681.1 probable carbohydrate esterase At4g34215 [Momordica charantia]5.6e-5649.81Show/hide
Query:  GATS-SPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIG
        G+TS SP NIFIL GQSNMAGRGGV  D IT+   WDGYIP               L WE A EPLHWDID+ K NG+GPGMAFANEL  +  KSIGVIG
Subjt:  GATS-SPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIG

Query:  LVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNY
        LVPCAIGGTH+REW KGT  YT L+DRI ASEK+GGK                                              VKI ++D +  SPI+N+
Subjt:  LVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNY

Query:  TEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAY
         ED+ +A+E V +KL  + +VD  E    AV N + GLNED GH +V SEV++GKMLAHA+
Subjt:  TEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAY

XP_022939276.1 probable carbohydrate esterase At4g34215 [Cucurbita moschata]2.9e-5248.11Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        A +SP NIFIL GQSNMAGRGGV  D  T    WDGYIP E               WE A EPLHWDID  K NGVGPGM FANELLAKA  SIG IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H+REW KGT  YT LV+R+  SE++GGK                                              VKI ++D  I SP   + E
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++  A+EAV QKLP I +VD     R+AV N   GLNED GH +V SEV +GKM AH+Y    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

XP_023550941.1 probable carbohydrate esterase At4g34215 [Cucurbita pepo subsp. pepo]4.4e-5348.11Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        A +SP NIFIL GQSNMAGRGGV  D  T    WDGYIP E               WE AHEPLHWDID  K NGVGPGM FANELLAKA  SIG IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H+REW KGT  YT LV+R+  SE++GGK                                              VKI ++D  I SP   + E
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++  A+EAV QKLP + +VD     R+AV N   GLNED GH +V SEV +GKM AH+Y    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

TrEMBL top hitse value%identityAlignment
A0A5D3CGI9 Putative carbohydrate esterase5.3e-5246.21Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        AT+SP+NIFIL GQSNMAGRGGV  D  T+   WDGYIPLE               WE AHEPLHWDID  K NG+GPGM FANELLA   K  G IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H++EW KGT  Y NLV+RI ASEK GGK                                              VKI ++D  + SP +++ E
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++ +A EAV   LP +++VD      +AV N   GLNED GH +V SEV++GKM AH++    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

A0A6J1BYJ2 probable carbohydrate esterase At4g342154.7e-7761.57Show/hide
Query:  FWGATSSPDNIFILGGQSNMAGRGGVFWDTITKHY-KWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGV
        F  ATS PDNIFILGGQSNMAGRGGV  D  T HY KWDGYIP+E               WELAHEPLHWDID  K NGVGPGMAFANELLAKANKSIGV
Subjt:  FWGATSSPDNIFILGGQSNMAGRGGVFWDTITKHY-KWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGV

Query:  IGLVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPIL
        IGLVPCAIGGT++REW KGTINYT LVDRINASE YGGK                                              VKI +YD LI SPI+
Subjt:  IGLVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPIL

Query:  NYTEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        NYT+ IRRA+EAVK KLPKIS VD+ +AI+  V++ KPGLNED GH SVYSEVEVGKMLAHAY Q  A
Subjt:  NYTEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

A0A6J1CJZ1 probable carbohydrate esterase At4g342152.7e-5649.81Show/hide
Query:  GATS-SPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIG
        G+TS SP NIFIL GQSNMAGRGGV  D IT+   WDGYIP               L WE A EPLHWDID+ K NG+GPGMAFANEL  +  KSIGVIG
Subjt:  GATS-SPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIG

Query:  LVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNY
        LVPCAIGGTH+REW KGT  YT L+DRI ASEK+GGK                                              VKI ++D +  SPI+N+
Subjt:  LVPCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNY

Query:  TEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAY
         ED+ +A+E V +KL  + +VD  E    AV N + GLNED GH +V SEV++GKMLAHA+
Subjt:  TEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAY

A0A6J1FFF9 probable carbohydrate esterase At4g342151.4e-5248.11Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        A +SP NIFIL GQSNMAGRGGV  D  T    WDGYIP E               WE A EPLHWDID  K NGVGPGM FANELLAKA  SIG IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H+REW KGT  YT LV+R+  SE++GGK                                              VKI ++D  I SP   + E
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++  A+EAV QKLP I +VD     R+AV N   GLNED GH +V SEV +GKM AH+Y    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

A0A6J1K1G7 probable carbohydrate esterase At4g342154.0e-5247.35Show/hide
Query:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV
        A +SP NIFIL GQSNMAGRGGV  D  T    WDGYIP E               WE A EPLHWDID  K NGVGPGM FANELLAKA  SIG IGLV
Subjt:  ATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLE---------------WELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLV

Query:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE
        PCAIGG+H+REW KGT  YT LV+R+  SE++GGK                                              VKI ++D  I SP   + +
Subjt:  PCAIGGTHIREWAKGTINYTNLVDRINASEKYGGK----------------------------------------------VKIASYDKLIRSPILNYTE

Query:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA
        ++  A+EAV QKLP + +VD     R+AV N   GLNED GH +V SEV +GKM AH+Y    A
Subjt:  DIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNED-GHFSVYSEVEVGKMLAHAYSQTLA

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342156.5e-3151.88Show/hide
Query:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI
        P+ IFIL GQSNMAGRGGVF D     + WD  +P               L WE AHEPLH DID GKV GVGPGMAFAN +  +      VIGLVPCA 
Subjt:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI

Query:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK
        GGT I+EW +G+  Y  +V R   S K GG++K
Subjt:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)6.9e-2850Show/hide
Query:  NIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAIGG
        +IFIL GQSNMAGRGGV+ DT T    WDG IP               LEW+ A EPLH DID  K NGVGPGM FAN ++ +     G +GLVPC+IGG
Subjt:  NIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAIGG

Query:  THIREWAKGTINYTNLVDRINASEKYGG
        T + +W KG   Y   V R  A+   GG
Subjt:  THIREWAKGTINYTNLVDRINASEKYGG

AT4G34215.1 Domain of unknown function (DUF303)4.6e-3251.88Show/hide
Query:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI
        P+ IFIL GQSNMAGRGGVF D     + WD  +P               L WE AHEPLH DID GKV GVGPGMAFAN +  +      VIGLVPCA 
Subjt:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI

Query:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK
        GGT I+EW +G+  Y  +V R   S K GG++K
Subjt:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK

AT4G34215.2 Domain of unknown function (DUF303)4.6e-3251.88Show/hide
Query:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI
        P+ IFIL GQSNMAGRGGVF D     + WD  +P               L WE AHEPLH DID GKV GVGPGMAFAN +  +      VIGLVPCA 
Subjt:  PDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIP---------------LEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAI

Query:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK
        GGT I+EW +G+  Y  +V R   S K GG++K
Subjt:  GGTHIREWAKGTINYTNLVDRINASEKYGGKVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTGGGGGGCTACTTCTTCTCCTGACAATATTTTTATTCTTGGTGGCCAAAGCAACATGGCTGGTCGAGGTGGCGTCTTTTGGGACACAATCACGAAACATTATAA
ATGGGATGGATATATCCCACTCGAATGGGAATTAGCGCATGAACCACTTCATTGGGACATTGATCATGGCAAGGTCAATGGGGTTGGCCCTGGTATGGCTTTTGCAAATG
AGCTTTTGGCCAAGGCTAACAAGAGCATTGGAGTTATTGGTCTTGTCCCGTGTGCTATTGGAGGAACCCACATTAGAGAGTGGGCTAAAGGGACAATTAATTACACCAAT
CTAGTTGATCGGATTAATGCTTCAGAAAAATACGGGGGAAAAGTGAAGATAGCATCCTACGATAAACTCATAAGAAGCCCAATCTTGAACTATACAGAAGATATTAGGAG
GGCTGAGGAGGCAGTTAAGCAGAAGCTGCCCAAAATAAGTATCGTAGATTCCATGGAAGCTATTAGACTCGCTGTTAATAACCACAAACCAGGCCTTAATGAAGATGGTC
ATTTTAGTGTCTATTCAGAGGTAGAAGTAGGCAAGATGTTAGCTCATGCTTATTCCCAGACATTAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTGGGGGGCTACTTCTTCTCCTGACAATATTTTTATTCTTGGTGGCCAAAGCAACATGGCTGGTCGAGGTGGCGTCTTTTGGGACACAATCACGAAACATTATAA
ATGGGATGGATATATCCCACTCGAATGGGAATTAGCGCATGAACCACTTCATTGGGACATTGATCATGGCAAGGTCAATGGGGTTGGCCCTGGTATGGCTTTTGCAAATG
AGCTTTTGGCCAAGGCTAACAAGAGCATTGGAGTTATTGGTCTTGTCCCGTGTGCTATTGGAGGAACCCACATTAGAGAGTGGGCTAAAGGGACAATTAATTACACCAAT
CTAGTTGATCGGATTAATGCTTCAGAAAAATACGGGGGAAAAGTGAAGATAGCATCCTACGATAAACTCATAAGAAGCCCAATCTTGAACTATACAGAAGATATTAGGAG
GGCTGAGGAGGCAGTTAAGCAGAAGCTGCCCAAAATAAGTATCGTAGATTCCATGGAAGCTATTAGACTCGCTGTTAATAACCACAAACCAGGCCTTAATGAAGATGGTC
ATTTTAGTGTCTATTCAGAGGTAGAAGTAGGCAAGATGTTAGCTCATGCTTATTCCCAGACATTAGCCTAA
Protein sequenceShow/hide protein sequence
MFWGATSSPDNIFILGGQSNMAGRGGVFWDTITKHYKWDGYIPLEWELAHEPLHWDIDHGKVNGVGPGMAFANELLAKANKSIGVIGLVPCAIGGTHIREWAKGTINYTN
LVDRINASEKYGGKVKIASYDKLIRSPILNYTEDIRRAEEAVKQKLPKISIVDSMEAIRLAVNNHKPGLNEDGHFSVYSEVEVGKMLAHAYSQTLA