; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022152 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022152
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSASA domain-containing protein
Genome locationtig00153894:709234..714537
RNA-Seq ExpressionSgr022152
SyntenySgr022152
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578894.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. sororia]8.3e-4062.5Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +KE+VW AQEAVT KLP VRMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

KAG7016422.1 putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrosperma]8.3e-4062.5Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +KE+VW AQEAVT KLP VRMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

XP_022141681.1 probable carbohydrate esterase At4g34215 [Momordica charantia]5.5e-4468.75Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        L  ++G+SI VIGLV CAIGG  LREWIK T  YT L+++IKAS+KHG    GF WYQGE DASVE ESK Y+  LTKFFTDLR D N+L+LPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HDIFTSPIIN+KEDVWKAQE VT KL  VRMVD  EAV N E+
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

XP_022939276.1 probable carbohydrate esterase At4g34215 [Cucurbita moschata]1.1e-3961.81Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +KE+VW AQEAVT KLP +RMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

XP_023550941.1 probable carbohydrate esterase At4g34215 [Cucurbita pepo subsp. pepo]8.3e-4062.5Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +KE+VW AQEAVT KLP VRMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

TrEMBL top hitse value%identityAlignment
A0A6J1BYJ2 probable carbohydrate esterase At4g342154.6e-3658.11Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK  +SI VIGLV CAIGG  LREW+K T+NYT LV++I AS+ +GG    FFW+QGE DASV V+++FYK+NL KF TDLRKDLN   LPIIL  I 
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQKVIS
         +D   SPI+NY + + +A EAV HKLPK+  VDA +A+    Q+V+S
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQKVIS

A0A6J1CJZ1 probable carbohydrate esterase At4g342152.7e-4468.75Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        L  ++G+SI VIGLV CAIGG  LREWIK T  YT L+++IKAS+KHG    GF WYQGE DASVE ESK Y+  LTKFFTDLR D N+L+LPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HDIFTSPIIN+KEDVWKAQE VT KL  VRMVD  EAV N E+
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

A0A6J1CKF9 probable carbohydrate esterase At4g342151.5e-3962.24Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        +LAK G    VIGLV CAIGG  LREW+K T NYT LVN+IKAS+  G    G  WYQGE DA+VE ESKFY+ NLTKF+TDLR D NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLE
         HD F SP+IN+ +DVWKAQE +T  L  VR+VD  +AV N +
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLE

A0A6J1FFF9 probable carbohydrate esterase At4g342155.2e-4061.81Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +KE+VW AQEAVT KLP +RMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

A0A6J1K1G7 probable carbohydrate esterase At4g342151.2e-3961.81Show/hide
Query:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV
        LLAK G SI  IGLV CAIGG+ LREW+K T  YT LV ++K S++HG    GFFWYQGE DA+VE E+K Y+  L+KFFTDLR D+NH DLPIIL  IV
Subjt:  LLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHG----GFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIIL--IV

Query:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ
         HD F SP   +K++VW AQEAVT KLP VRMVD   AV N ++
Subjt:  PHDIFTSPIINYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQ

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342151.3e-0839.08Show/hide
Query:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI
        VIGLV CA GG  ++EW + +  Y  +V + + S+K GG      WYQGE D     +++ Y  N+ +   +LR DLN   LPII +
Subjt:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)8.3e-1444.32Show/hide
Query:  IGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG------FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI
        +GLV C+IGG +L +W K    Y   V + KA+   GG        WYQGE D    V++  YK+ L KFF+DLR DL H +LPII +
Subjt:  IGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG------FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI

AT4G34215.1 Domain of unknown function (DUF303)9.5e-1039.08Show/hide
Query:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI
        VIGLV CA GG  ++EW + +  Y  +V + + S+K GG      WYQGE D     +++ Y  N+ +   +LR DLN   LPII +
Subjt:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI

AT4G34215.2 Domain of unknown function (DUF303)9.5e-1039.08Show/hide
Query:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI
        VIGLV CA GG  ++EW + +  Y  +V + + S+K GG      WYQGE D     +++ Y  N+ +   +LR DLN   LPII +
Subjt:  VIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGG----FFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTTGGACCCAGAATGGCTTTTGGCCAAACTTGGCGAGAGCATCGATGTCATTGGTCTCGTTTCGTGTGCCATTGGAGGAAATCAATTGAGGGAATGGATTAAATG
TACTGTTAATTACACCATATTGGTCAACCAAATTAAAGCTTCCAAAAAACATGGAGGATTTTTCTGGTATCAAGGAGAGTTTGATGCTTCAGTGGAAGTAGAATCTAAGT
TCTACAAAGAAAACCTTACCAAATTCTTCACTGACCTGCGCAAAGACCTGAACCACCTAGATCTACCCATCATCCTGATAGTACCTCATGATATTTTCACAAGTCCAATT
ATAAACTACAAGGAAGATGTATGGAAGGCTCAGGAGGCAGTCACACACAAGCTACCGAAAGTAAGAATGGTGGACGCCATGGAAGCGGTCGACAACCTTGAGCAAAAGGT
CATCTCAATGTCAAATCTAAGGTTTATCATAGGCCTGCCGACTCCCCGACTGAAAAGAAATACTGAGATTGCCGGAACATACACTACACCACATCAGTGTGGAGAGAACT
TGGGGATCCACGAGATTCAAAGAGTTGCGGATCCAGGATTTGAAGGTTTGGCGAGTGATATCGAGGTCCGATGGCTTCCGATGGCTCTAAAACTGCAAATGTGCTTTAGA
TTATACGGCAGTTTATCAACTTACTTACTCAAGAAGGCGGAGTTGGTTGGTTTACCTTCCCATCCAGAAGCAACGACCCTGCCGAGCGAAGGTCTTGATCGCAGTTCGAA
CGACGGAAGAGGCACTGTGATTCTGACTCGCAAAGAACCGAATGGGGTCGCCGCTCCCCGTCAGATTGACGCTGATCAGCCTCCATTGTTCTTCCGCTGCCTTCTTCTTC
ATCCTGATCTCCTCCTTCTGCTTTCGCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTTGGACCCAGAATGGCTTTTGGCCAAACTTGGCGAGAGCATCGATGTCATTGGTCTCGTTTCGTGTGCCATTGGAGGAAATCAATTGAGGGAATGGATTAAATG
TACTGTTAATTACACCATATTGGTCAACCAAATTAAAGCTTCCAAAAAACATGGAGGATTTTTCTGGTATCAAGGAGAGTTTGATGCTTCAGTGGAAGTAGAATCTAAGT
TCTACAAAGAAAACCTTACCAAATTCTTCACTGACCTGCGCAAAGACCTGAACCACCTAGATCTACCCATCATCCTGATAGTACCTCATGATATTTTCACAAGTCCAATT
ATAAACTACAAGGAAGATGTATGGAAGGCTCAGGAGGCAGTCACACACAAGCTACCGAAAGTAAGAATGGTGGACGCCATGGAAGCGGTCGACAACCTTGAGCAAAAGGT
CATCTCAATGTCAAATCTAAGGTTTATCATAGGCCTGCCGACTCCCCGACTGAAAAGAAATACTGAGATTGCCGGAACATACACTACACCACATCAGTGTGGAGAGAACT
TGGGGATCCACGAGATTCAAAGAGTTGCGGATCCAGGATTTGAAGGTTTGGCGAGTGATATCGAGGTCCGATGGCTTCCGATGGCTCTAAAACTGCAAATGTGCTTTAGA
TTATACGGCAGTTTATCAACTTACTTACTCAAGAAGGCGGAGTTGGTTGGTTTACCTTCCCATCCAGAAGCAACGACCCTGCCGAGCGAAGGTCTTGATCGCAGTTCGAA
CGACGGAAGAGGCACTGTGATTCTGACTCGCAAAGAACCGAATGGGGTCGCCGCTCCCCGTCAGATTGACGCTGATCAGCCTCCATTGTTCTTCCGCTGCCTTCTTCTTC
ATCCTGATCTCCTCCTTCTGCTTTCGCTTTGA
Protein sequenceShow/hide protein sequence
MGLDPEWLLAKLGESIDVIGLVSCAIGGNQLREWIKCTVNYTILVNQIKASKKHGGFFWYQGEFDASVEVESKFYKENLTKFFTDLRKDLNHLDLPIILIVPHDIFTSPI
INYKEDVWKAQEAVTHKLPKVRMVDAMEAVDNLEQKVISMSNLRFIIGLPTPRLKRNTEIAGTYTTPHQCGENLGIHEIQRVADPGFEGLASDIEVRWLPMALKLQMCFR
LYGSLSTYLLKKAELVGLPSHPEATTLPSEGLDRSSNDGRGTVILTRKEPNGVAAPRQIDADQPPLFFRCLLLHPDLLLLLSL