CuGenDBv2

Sgr023609 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023609
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: tig00000892:4985135..4986004
RNA-Seq Expression: Sgr023609
Synteny: Sgr023609
Gene Ontology terms:
  GO:0006474 - N-terminal protein amino acid acetylation (biological process)
  GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains:
  IPR000182 - GNAT domain
  IPR016181 - Acyl-CoA N-acyltransferase


Homology
GenBank top hits (e value, % identity, alignment)
KAG7034917.1 hypothetical protein SDJN02_01710 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.0e-70 | % identity: 88.31
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M++GVIVELQRNSTN  KVVEDIVKLEKKIFPKHESLARFFDE+L+KKNSGLLF++LDGEVVGYVMYSWPSSL ATIAKLAVKEN RRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDPLRTAA+ LYKKLGFQVDSLI+GYYSADR+AYRMYLDF
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

XP_004139939.1 uncharacterized protein LOC101203471 [Cucumis sativus] | e value: 6.8e-70 | % identity: 85.71
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        MNSG IVEL+RNSTNW KVVED+VKLEKK+FPKHESLARFFD++LRK+NSGLLF++L GEVVGYVMYSWPSSL ATIAKLAVKE CRRQG+GE LLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDP RTAAMNLYKKLGFQVDSLI+GYYSADR+AYRMYL+F
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

XP_022140583.1 uncharacterized protein LOC111011202 [Momordica charantia] | e value: 2.2e-76 | % identity: 93.51
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        MN G IVELQRNSTN+ KVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLF+ELDGEV+GYVMYSWPSSLCATIAKLAVKENCRRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        DKCRTRNVQRICLHVDPLRTAAMNLY KLGFQVDSLIEGYYSADRNAYRM+L+F
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

XP_022947642.1 uncharacterized protein LOC111451442 [Cucurbita moschata] | e value: 1.5e-69 | % identity: 87.66
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M++GVIVELQRNSTN  KVVEDIVKLEKKIFPKHESLARFFDE+L+KKNSGLLF++LDGEVVGYVMYSWPSSL ATIAKLAVKEN RRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDPLRTAA+ LY KLGFQVDSLI+GYYSADR+AYRMYLDF
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

XP_038902888.1 putative [ribosomal protein S18]-alanine N-acetyltransferase [Benincasa hispida] | e value: 2.3e-70 | % identity: 87.66
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        MNSG IVEL+RNSTNW KVVEDIVKLEKKIFPKHESLARFFDE+LRK+NSGLLF++L G+VV YVMY WPSSL A IAKLAVKENCRRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADR+AYRMYL+F
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KG64 N-acetyltransferase domain-containing protein | e value: 3.3e-70 | % identity: 85.71
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        MNSG IVEL+RNSTNW KVVED+VKLEKK+FPKHESLARFFD++LRK+NSGLLF++L GEVVGYVMYSWPSSL ATIAKLAVKE CRRQG+GE LLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDP RTAAMNLYKKLGFQVDSLI+GYYSADR+AYRMYL+F
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

A0A5N6RAA0 N-acetyltransferase domain-containing protein | e value: 2.1e-69 | % identity: 82.69
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M+SGVIVEL RN+TNW KVV+DIVKLE+KIFPKHESLAR FDE+LRK NSGLL+V++DGEVVGY+MYSWPSSLCA++ KLAVKENCRRQG+GE LLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDFDA
         KCRTRNV RI LHVDPLRT A+ LYKK GFQVD+LIEGYYS+DRNAYRMYLDFDA
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDFDA

A0A6J1CIB3 uncharacterized protein LOC111011202 | e value: 1.1e-76 | % identity: 93.51
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        MN G IVELQRNSTN+ KVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLF+ELDGEV+GYVMYSWPSSLCATIAKLAVKENCRRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        DKCRTRNVQRICLHVDPLRTAAMNLY KLGFQVDSLIEGYYSADRNAYRM+L+F
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

A0A6J1G710 uncharacterized protein LOC111451442 | e value: 7.3e-70 | % identity: 87.66
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M++GVIVELQRNSTN  KVVEDIVKLEKKIFPKHESLARFFDE+L+KKNSGLLF++LDGEVVGYVMYSWPSSL ATIAKLAVKEN RRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDPLRTAA+ LY KLGFQVDSLI+GYYSADR+AYRMYLDF
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

A0A6J1I3P5 uncharacterized protein LOC111469625 | e value: 1.6e-69 | % identity: 87.66
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M++GVIVELQRNSTN  KVVEDIVKLEKKIFPKHESLARFFDE+L+KKNSGLLF++ D EVVGYVMYSWPSSL ATIAKLAVKEN RRQG+GEALLKAAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF
        +KCRTRN+QRI LHVDPLRTAA+ LYKKLGFQVDSLIEGYYSADR+AYRMYLDF
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDF

SwissProt top hits (e value, % identity, alignment)
B5X4Z4 Probable amino-acid acetyltransferase NAGS2, chloroplastic | e value: 3.6e-05 | % identity: 28.33
Query:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD
        VED+  +   I P  ES  L R  DE+L +     + VE +G+++    ++ +    C  +A +AV  +CR QG G+ LL     K  +  ++++ L   
Subjt:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD

Query:  PLRTAAMNLYKKLGFQVDSL
         L T   + + + GFQ  S+
Subjt:  PLRTAAMNLYKKLGFQVDSL

O05517 Putative [ribosomal protein S18]-alanine N-acetyltransferase | e value: 1.8e-09 | % identity: 28.46
Query:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVDPLR
        ++ + ++E   F    +   F+ E L    +  L +E DG + GY    W     A I  +A+K   R Q  GE L ++A++ C+ ++ +R+ L V    
Subjt:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVDPLR

Query:  TAAMNLYKKLGFQVDSLIEGYYS
          A  LYKK G Q   + + YY+
Subjt:  TAAMNLYKKLGFQVDSLIEGYYS

Q4JBG0 N-alpha-acetyltransferase | e value: 9.0e-09 | % identity: 30.07
Query:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVM--YSW--------PSSL-CATIAKLAVKENCRRQGYGEALLKAAIDKCR-TRN
        ++ I+++ +   P++     FF E L++        +L+GEVVGYVM    W        PS +    I  +AV E  R+ G G +LL+ ++   + T N
Subjt:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVM--YSW--------PSSL-CATIAKLAVKENCRRQGYGEALLKAAIDKCR-TRN

Query:  VQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRM
         + + L V      A++LYKK  F+   L++ YY+   +AY M
Subjt:  VQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRM

Q84JF4 Probable amino-acid acetyltransferase NAGS1, chloroplastic | e value: 1.8e-04 | % identity: 28.45
Query:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD
        VED+  + + I P  ES  L R  DE+L +     + VE +G ++    ++ +    C  +A +AV  +CR QG G+ LL     K     ++ + L   
Subjt:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD

Query:  PLRTAAMNLYKKLGFQ
         L T   + + + GFQ
Subjt:  PLRTAAMNLYKKLGFQ

Q976C3 N-alpha-acetyltransferase | e value: 5.1e-12 | % identity: 32.87
Query:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVM--YSWPSSLC---------ATIAKLAVKENCRRQGYGEALLKAAIDKCR-TRN
        V+ I+K+ +   P++     FF E L++  +     E+DGEVVGY+M    W  S             +  +AV E  RR G G ALL+A++   +   N
Subjt:  VEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVM--YSWPSSLC---------ATIAKLAVKENCRRQGYGEALLKAAIDKCR-TRN

Query:  VQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRM
         + + L V    + A+NLYKKLGF+   ++  YY+   +AY M
Subjt:  VQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRM

Arabidopsis top hits (e value, % identity, alignment)
AT1G03650.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein | e value: 5.7e-67 | % identity: 76.13
Query:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI
        M+ GV+VEL R ST+W KVVEDIVKLEKK FPKHESLA+ FD +LRKKN+GLL+V+ +G+ VGY MYSWPSSL A+I KLAVKENCRRQG+GEALL+AAI
Subjt:  MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAI

Query:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDFD
        DKCR+R VQR+ LHVDP RT+A+NLYKKLGFQVD L++ YYSADR+AYRMYLDFD
Subjt:  DKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDFD

AT2G22910.1 N-acetyl-l-glutamate synthase 1 | e value: 1.3e-05 | % identity: 28.45
Query:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD
        VED+  + + I P  ES  L R  DE+L +     + VE +G ++    ++ +    C  +A +AV  +CR QG G+ LL     K     ++ + L   
Subjt:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD

Query:  PLRTAAMNLYKKLGFQ
         L T   + + + GFQ
Subjt:  PLRTAAMNLYKKLGFQ

AT4G37670.1 N-acetyl-l-glutamate synthase 2 | e value: 1.1e-04 | % identity: 32.5
Query:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALL
        VED+  +   I P  ES  L R  DE+L +     + VE +G+++    ++ +    C  +A +AV  +CR QG G+ LL
Subjt:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALL

AT4G37670.2 N-acetyl-l-glutamate synthase 2 | e value: 2.5e-06 | % identity: 28.33
Query:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD
        VED+  +   I P  ES  L R  DE+L +     + VE +G+++    ++ +    C  +A +AV  +CR QG G+ LL     K  +  ++++ L   
Subjt:  VEDIVKLEKKIFPKHES--LARFFDEQLRKKNSGLLFVELDGEVVG-YVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVD

Query:  PLRTAAMNLYKKLGFQVDSL
         L T   + + + GFQ  S+
Subjt:  PLRTAAMNLYKKLGFQVDSL

AT5G11340.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein | e value: 3.6e-05 | % identity: 36.92
Query:  IAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYY
        I  L V    R  G G  LL   +D C  +N+  I LHV      A+  YKK GF++   I+ YY
Subjt:  IAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQRICLHVDPLRTAAMNLYKKLGFQVDSLIEGYY


Sequences
CDS sequence
ATGAACAGTGGGGTCATCGTGGAACTGCAAAGAAACTCCACCAACTGGCCTAAAGTTGTGGAAGACATCGTGAAGCTCGAGAAGAAGATTTTCCCAAAACACGAGTCTCT
TGCTAGGTTCTTCGACGAACAACTCAGAAAAAAGAATTCTGGATTGCTTTTCGTGGAATTGGATGGCGAAGTCGTAGGCTATGTCATGTATTCTTGGCCCTCTTCCCTCT
GCGCTACGATCGCCAAGCTCGCAGTGAAGGAGAACTGTAGAAGACAAGGATATGGAGAGGCACTCCTGAAGGCGGCCATTGACAAGTGCAGAACCAGAAACGTTCAACGC
ATATGTCTTCATGTTGATCCGCTGAGGACTGCGGCTATGAATCTCTACAAGAAACTTGGGTTCCAAGTTGATAGCTTGATAGAAGGATACTACTCTGCTGATCGAAATGC
CTACAGAATGTACTTGGATTTTGATGCAGCTTAG
mRNA sequence
ATGAACAGTGGGGTCATCGTGGAACTGCAAAGAAACTCCACCAACTGGCCTAAAGTTGTGGAAGACATCGTGAAGCTCGAGAAGAAGATTTTCCCAAAACACGAGTCTCT
TGCTAGGTTCTTCGACGAACAACTCAGAAAAAAGAATTCTGGATTGCTTTTCGTGGAATTGGATGGCGAAGTCGTAGGCTATGTCATGTATTCTTGGCCCTCTTCCCTCT
GCGCTACGATCGCCAAGCTCGCAGTGAAGGAGAACTGTAGAAGACAAGGATATGGAGAGGCACTCCTGAAGGCGGCCATTGACAAGTGCAGAACCAGAAACGTTCAACGC
ATATGTCTTCATGTTGATCCGCTGAGGACTGCGGCTATGAATCTCTACAAGAAACTTGGGTTCCAAGTTGATAGCTTGATAGAAGGATACTACTCTGCTGATCGAAATGC
CTACAGAATGTACTTGGATTTTGATGCAGCTTAG
Protein sequence
MNSGVIVELQRNSTNWPKVVEDIVKLEKKIFPKHESLARFFDEQLRKKNSGLLFVELDGEVVGYVMYSWPSSLCATIAKLAVKENCRRQGYGEALLKAAIDKCRTRNVQR
ICLHVDPLRTAAMNLYKKLGFQVDSLIEGYYSADRNAYRMYLDFDAA
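
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the helper name `translate` is hypothetical. It expects an in-frame sequence containing only A, C, G and T.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
# Assumption: this table is appropriate for the gene in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines from the record can be pasted in directly.
    Raises KeyError on ambiguous bases such as N.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAACAGTGGGGTCATCGTGGAACTGCAA"))  # MNSGVIVELQ
# Last four codons of the CDS listed above, including the TAG stop.
print(translate("GATGCAGCTTAG"))  # DAA
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick way to catch copy or wrapping errors in either block.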