; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012341 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012341
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSASA domain-containing protein
Genome locationtig00153343:117955..119008
RNA-Seq ExpressionSgr012341
SyntenySgr012341
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131651.1 probable carbohydrate esterase At4g34215 [Momordica charantia]5.8e-6352.96Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRLSI+LCMMLFG S   ATSPKNIFIL  QS+MAGR                                      AREPVH GID  KTVGVG AIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KG                                                          G+S+AASS+TA RYK+N KKFFTDI NDIKPR 
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD +MKHDTHDLPA+RA E+A QREL +VVTID+L+LVNTTT EGFN D GHFN+KTEIALGKWLADTYLS+Y HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

XP_022157447.1 probable carbohydrate esterase At4g34215 [Momordica charantia]2.9e-6252.96Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        M +LRLSILLCMML G S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVG AIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FA QLQ+KG                                                          G+S+AAS +TA RYK+N KKFFTDI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD +MKHDTHDLPA+RA E+A QREL +VVTIDSL+LVNTTT EGFN D GHFN+KTEIALGKWLADTYLS Y HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

XP_022158365.1 probable carbohydrate esterase At4g34215 [Momordica charantia]2.4e-6152.26Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRL ILLCMML G+S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVG AI+
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKGGQ----------------------------------------------------------SNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KGG                                                           S+AAS +TA RYK+N KKFFTDI ND+KPRF
Subjt:  FAHQLQSKGGQ----------------------------------------------------------SNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPIIVV+  VYD +MKHDTHDLPA+RA ++A QREL +VVTIDSL+LVNTTT EGFN D GHFN KTEIALGKWLADTYLSHY +LL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

XP_022158585.1 probable carbohydrate esterase At4g34215 [Momordica charantia]6.4e-6251.92Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRLSILLCM+L G S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVGSAIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KG                                                          G+S+AAS +TA RYK+N KKFFTDI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPIIV+KI  YD +++HDTHDLP +RA E+A QRELL++VTID+L+LVNT TGEGFN D GH+N+KTEIALGKWLADTYLSHY  LL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

XP_023002178.1 probable carbohydrate esterase At4g34215 [Cucurbita maxima]9.3e-6151.57Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MA LRL+I +CMMLF  S L ATSPKNIFIL  QS+MAGR                                      A EPVH+GID NKTVGVGSAI 
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FA QLQ+K                                                           G+S+AA+S+TA RYKDN KKFF DI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD YMKHDTHDLPA+RA E+A Q+EL  +VTIDSL LVNT T EGFNQDHGHFN KT+IALGKWLADTYLSHY HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

TrEMBL top hitse value%identityAlignment
A0A6J1BQ38 probable carbohydrate esterase At4g342152.8e-6352.96Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRLSI+LCMMLFG S   ATSPKNIFIL  QS+MAGR                                      AREPVH GID  KTVGVG AIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KG                                                          G+S+AASS+TA RYK+N KKFFTDI NDIKPR 
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD +MKHDTHDLPA+RA E+A QREL +VVTID+L+LVNTTT EGFN D GHFN+KTEIALGKWLADTYLS+Y HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

A0A6J1DUH7 probable carbohydrate esterase At4g342151.4e-6252.96Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        M +LRLSILLCMML G S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVG AIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FA QLQ+KG                                                          G+S+AAS +TA RYK+N KKFFTDI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD +MKHDTHDLPA+RA E+A QREL +VVTIDSL+LVNTTT EGFN D GHFN+KTEIALGKWLADTYLS Y HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

A0A6J1DVM5 probable carbohydrate esterase At4g342151.2e-6152.26Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRL ILLCMML G+S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVG AI+
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKGGQ----------------------------------------------------------SNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KGG                                                           S+AAS +TA RYK+N KKFFTDI ND+KPRF
Subjt:  FAHQLQSKGGQ----------------------------------------------------------SNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPIIVV+  VYD +MKHDTHDLPA+RA ++A QREL +VVTIDSL+LVNTTT EGFN D GHFN KTEIALGKWLADTYLSHY +LL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

A0A6J1DW87 probable carbohydrate esterase At4g342153.1e-6251.92Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MALLRLSILLCM+L G S   ATSPKNIFIL  QS+MAGR                                      AREPVHEGID NKTVGVGSAIA
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FAHQLQ+KG                                                          G+S+AAS +TA RYK+N KKFFTDI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPIIV+KI  YD +++HDTHDLP +RA E+A QRELL++VTID+L+LVNT TGEGFN D GH+N+KTEIALGKWLADTYLSHY  LL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

A0A6J1KST3 probable carbohydrate esterase At4g342154.5e-6151.57Show/hide
Query:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA
        MA LRL+I +CMMLF  S L ATSPKNIFIL  QS+MAGR                                      A EPVH+GID NKTVGVGSAI 
Subjt:  MALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGR--------------------------------------AREPVHEGIDNNKTVGVGSAIA

Query:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF
        FA QLQ+K                                                           G+S+AA+S+TA RYKDN KKFF DI NDIKPRF
Subjt:  FAHQLQSKG----------------------------------------------------------GQSNAASSNTARRYKDNFKKFFTDIHNDIKPRF

Query:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL
        LPII+VKI VYD YMKHDTHDLPA+RA E+A Q+EL  +VTIDSL LVNT T EGFNQDHGHFN KT+IALGKWLADTYLSHY HLL
Subjt:  LPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)5.0e-0423.5Show/hide
Query:  EATSPKNIFILTSQSSMAGRAREPVHEGIDNNKTVGVGSAIAFAHQLQSKGGQ-----------------------------------------------
        E  S  +I  LTS+      A+EP+H  ID NKT GVG  + FA+++ ++ GQ                                               
Subjt:  EATSPKNIFILTSQSSMAGRAREPVHEGIDNNKTVGVGSAIAFAHQLQSKGGQ-----------------------------------------------

Query:  ----SNAASSNTARRYKDNFKKFFTDIHNDIKPRFLPIIVVKITVYDFYMKHDTHDLPAMRATENA-FQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFN
            S+      A  YK    KFF+D+ ND++   LPII V +          T   P + A   A  + +L +V  +D+        G     D  H  
Subjt:  ----SNAASSNTARRYKDNFKKFFTDIHNDIKPRFLPIIVVKITVYDFYMKHDTHDLPAMRATENA-FQRELLHVVTIDSLQLVNTTTGEGFNQDHGHFN

Query:  LKTEIALGKWLADTYLS
          +++ LG  +A+++L+
Subjt:  LKTEIALGKWLADTYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAGGCAGAGGCAAGGATCAGCCATCAACAAAAAGAAGCAGTGATAAGGGTTTTGGAGGGAGAGTAGCTGTGTCTGTGCAAATAGACATAGAGGAAAAGTTGCT
TCTTTTAGACACAAAAGAGTCATTTCTATCCATTTACACTTTTCTTCTTTGTCTATTTGTACAGACACAAAGGGTCAAGATGGCTTTGTTGAGATTATCAATCTTGCTCT
GTATGATGCTATTTGGCTCTTCCCGTTTAGAGGCTACTTCTCCTAAGAACATATTCATTCTCACCAGTCAGAGCAGCATGGCTGGTCGAGCACGAGAGCCCGTCCATGAG
GGCATTGACAACAACAAGACGGTTGGAGTTGGTTCAGCAATTGCATTTGCTCACCAGCTGCAGTCCAAAGGCGGTCAAAGCAATGCAGCTAGTAGCAACACTGCTCGTAG
ATACAAAGACAACTTCAAGAAGTTCTTCACTGACATTCACAATGATATCAAGCCTAGATTTTTACCCATCATTGTTGTGAAAATAACTGTTTATGACTTCTATATGAAGC
ATGATACTCATGATTTGCCAGCAATGAGGGCAACAGAAAATGCATTCCAGCGGGAGCTGTTGCATGTGGTGACCATCGACTCTTTGCAATTGGTGAACACTACCACCGGG
GAAGGCTTTAACCAAGATCATGGTCATTTTAATCTCAAAACTGAGATTGCATTGGGCAAATGGTTGGCTGACACCTACCTCTCCCATTATGTCCACTTACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAGGCAGAGGCAAGGATCAGCCATCAACAAAAAGAAGCAGTGATAAGGGTTTTGGAGGGAGAGTAGCTGTGTCTGTGCAAATAGACATAGAGGAAAAGTTGCT
TCTTTTAGACACAAAAGAGTCATTTCTATCCATTTACACTTTTCTTCTTTGTCTATTTGTACAGACACAAAGGGTCAAGATGGCTTTGTTGAGATTATCAATCTTGCTCT
GTATGATGCTATTTGGCTCTTCCCGTTTAGAGGCTACTTCTCCTAAGAACATATTCATTCTCACCAGTCAGAGCAGCATGGCTGGTCGAGCACGAGAGCCCGTCCATGAG
GGCATTGACAACAACAAGACGGTTGGAGTTGGTTCAGCAATTGCATTTGCTCACCAGCTGCAGTCCAAAGGCGGTCAAAGCAATGCAGCTAGTAGCAACACTGCTCGTAG
ATACAAAGACAACTTCAAGAAGTTCTTCACTGACATTCACAATGATATCAAGCCTAGATTTTTACCCATCATTGTTGTGAAAATAACTGTTTATGACTTCTATATGAAGC
ATGATACTCATGATTTGCCAGCAATGAGGGCAACAGAAAATGCATTCCAGCGGGAGCTGTTGCATGTGGTGACCATCGACTCTTTGCAATTGGTGAACACTACCACCGGG
GAAGGCTTTAACCAAGATCATGGTCATTTTAATCTCAAAACTGAGATTGCATTGGGCAAATGGTTGGCTGACACCTACCTCTCCCATTATGTCCACTTACTTTGA
Protein sequenceShow/hide protein sequence
MEKGRGKDQPSTKRSSDKGFGGRVAVSVQIDIEEKLLLLDTKESFLSIYTFLLCLFVQTQRVKMALLRLSILLCMMLFGSSRLEATSPKNIFILTSQSSMAGRAREPVHE
GIDNNKTVGVGSAIAFAHQLQSKGGQSNAASSNTARRYKDNFKKFFTDIHNDIKPRFLPIIVVKITVYDFYMKHDTHDLPAMRATENAFQRELLHVVTIDSLQLVNTTTG
EGFNQDHGHFNLKTEIALGKWLADTYLSHYVHLL