CuGenDBv2

Sgr024731 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024731
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: tig00002486:2302621..2303151
RNA-Seq Expression: Sgr024731
Synteny: Sgr024731
Gene Ontology terms: GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains: IPR000182 - GNAT domain
                  IPR016181 - Acyl-CoA N-acyltransferase
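The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database: the function names `parse_location` and `span_length` are hypothetical, and it assumes the `contig:start..end` coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00002486:2302621..2303151")
print(contig, span_length(start, end))  # tig00002486 531
```

Under that assumption the span is 531 bp, which is what a 176-residue protein plus a stop codon (177 codons) would occupy if the gene has no introns.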


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138097.1 uncharacterized protein LOC111009349 [Momordica charantia] (e value: 1.7e-74; identity: 82.14%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        MAVTLRPLDLTDIDDFMVWASDEKAAR CSWEPYT+KSDA+ FIK++VLPHPW+RAICVDGRPVGAISV+ANS+ RDRC GELGYVLGS FWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IF E+P LER+EALV V+NLASQRV+EKAGF REGVLRK+GV KG+T+DFV+FSLLS+DL+
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

XP_022138172.1 uncharacterized protein LOC111009408 [Momordica charantia] (e value: 3.2e-76; identity: 82.46%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        MAVTLRPLDLTDIDDFMVWASDEK ARFCSWEPYT+KSDA+ FIK++VLPHPW+RAICVDGRPVGAI VS NS+ RDRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLEVET
        VKLVAE IF E+PGLERLEALVDV+NLASQRV+EKAGF REGVLRK+GV KG+T+DFV+FSLLS+ L  +T
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLEVET

XP_022955368.1 uncharacterized protein LOC111457417 [Cucurbita moschata] (e value: 1.0e-74; identity: 82.74%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTDIDDFMVWASDEKAAR CSWEPY +KSDAI +I ++VL HPW RAICVDGRPVGAISV+AN + RDRCRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IFVE+P LERLEALV V+NLASQRVLEKAGFQREGVLRK+GV KG+T+D+V+FSLLS+DLE
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

XP_022980780.1 uncharacterized protein LOC111480066 [Cucurbita maxima] (e value: 1.3e-74; identity: 81.55%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY +K DAI +I ++VL HPW+RAICVDGRPVGAISV+AN + RDRCRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IFVE+P LERLEALVDV+NLASQRV+EKAGFQREGVLRK+GV KG+T+D+V+FSLLS+DLE
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

XP_038898770.1 uncharacterized N-acetyltransferase p20 [Benincasa hispida] (e value: 4.6e-75; identity: 82.14%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTD+DDFM WA+DEKAARFCSWEPY +KS+AI FI ++VL HPW+RAICVDGRPVGAISV AN++VRDRCRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IFVE P LERLEALVDV+NLASQRVLEKAGFQREGVLRK+GV+KG+T+DFV+FS LS+DL+
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

TrEMBL top hits (e value, % identity, alignment)
A0A5D3D131 Putative N-acetyltransferase p20-like (e value: 6.3e-70; identity: 77.71%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTDIDDFM WA+DEKAAR+CSWEPY +KS+AI FI ++VL HP++RAICVDGRPVGAISV +N++ RD+CRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
        VKLVAE IFVE P LERLEALVDV+N ASQRVLEKAGFQREGVLRK+GV KG  +D+V+FS L +D
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD

A0A6J1CA30 uncharacterized protein LOC111009349 (e value: 8.5e-75; identity: 82.14%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        MAVTLRPLDLTDIDDFMVWASDEKAAR CSWEPYT+KSDA+ FIK++VLPHPW+RAICVDGRPVGAISV+ANS+ RDRC GELGYVLGS FWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IF E+P LER+EALV V+NLASQRV+EKAGF REGVLRK+GV KG+T+DFV+FSLLS+DL+
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

A0A6J1CAC0 uncharacterized protein LOC111009408 (e value: 1.5e-76; identity: 82.46%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        MAVTLRPLDLTDIDDFMVWASDEK ARFCSWEPYT+KSDA+ FIK++VLPHPW+RAICVDGRPVGAI VS NS+ RDRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLEVET
        VKLVAE IF E+PGLERLEALVDV+NLASQRV+EKAGF REGVLRK+GV KG+T+DFV+FSLLS+ L  +T
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLEVET

A0A6J1GTR6 uncharacterized protein LOC111457417 (e value: 5.0e-75; identity: 82.74%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTDIDDFMVWASDEKAAR CSWEPY +KSDAI +I ++VL HPW RAICVDGRPVGAISV+AN + RDRCRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IFVE+P LERLEALV V+NLASQRVLEKAGFQREGVLRK+GV KG+T+D+V+FSLLS+DLE
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

A0A6J1IXH7 uncharacterized protein LOC111480066 (e value: 6.5e-75; identity: 81.55%)
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY +K DAI +I ++VL HPW+RAICVDGRPVGAISV+AN + RDRCRGELGYVLGSKFWGKGIGT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        VKLVAE IFVE+P LERLEALVDV+NLASQRV+EKAGFQREGVLRK+GV KG+T+D+V+FSLLS+DLE
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

SwissProt top hits (e value, % identity, alignment)
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase (e value: 5.9e-09; identity: 45.35%)
Query:  LGYVLGSKFWGKGIGTTAVKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE
        +GY L     GKGI T AV+LV +  F E   L R+EA V  +NL S RVLEKAGF +EG+ RK     G  +D  + ++L+ D E
Subjt:  LGYVLGSKFWGKGIGTTAVKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLE

O34569 Uncharacterized N-acetyltransferase YoaA (e value: 1.9e-07; identity: 25.77%)
Query:  LRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAI--CVDGRPVGAI--SVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        LR +   D +      S+++  R+   E   +   AI+ I+     +   R I   ++ R    +  ++  ++  +   R E+GY +  + W  G  +  
Subjt:  LRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAI--CVDGRPVGAI--SVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLL
        +  V    F    GL R+ A+V   N AS R+L K GFQ+EGVLR++    G   D  ++S++
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLL

P05332 Uncharacterized N-acetyltransferase p20 (e value: 1.8e-13; identity: 31.55%)
Query:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKS---DAINFIKERVLPHPWFR-AICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGT
        +TLR ++L D D    + SD +  ++ +  P+T+ S   D I  I +  L     R +I V        +   N   ++  R E+GY LG   WGKG  +
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKS---DAINFIKERVLPHPWFR-AICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGT

Query:  TAVKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
         AV+ + +  F     L R+EA V+ +N  S ++L    FQ+EG+LR +  +KG   D  +FSLL  +
Subjt:  TAVKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD

Arabidopsis top hits (e value, % identity, alignment)
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e value: 4.9e-51; identity: 57.58%)
Query:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICV-DGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTAV
        ++LRP+ L+D+DD+MVWA+D K ARFC+WEP T++ +AI +I +RVL HPW RAIC+ D RP+G I + A     D  R E+GYVL  K+WGKG  T AV
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICV-DGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTAV

Query:  KLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
        +LV   +F E P +ERLEALVDV N+ SQRVLEK GF REGV+RKF   KG  +D V+FS LS+D
Subjt:  KLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e value: 1.1e-53; identity: 58.79%)
Query:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDG-RPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTAV
        + LRP+ L+D+DDFMVWA+D    RFC+WEPYT++  AI ++ + +LPHPW RAIC+D  RP+G+ISV+      D  RGE+GYVLGSK+WGKGI T AV
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDG-RPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTAV

Query:  KLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
        +LVA  IF E+P ++RLEALVDV N+ SQ+VLEK GF +EGV+RKF   KG  +D V+FS L SD
Subjt:  KLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e value: 7.6e-36; identity: 40.96%)
Query:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICV--DGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA
        + LRP +L+D +D   WA D+   R+  W+   +  +A   I  + +PHPW R+I +  DG  +G +SV  +S    RCR +L Y +  +FWG+GI T A
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICV--DGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTA

Query:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
        V++  E    + P + RL+A+V+V+N ASQRVLEKAGF++EG+L K+G SKG  +D  ++S +  D
Subjt:  VKLVAETIFVEQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSD
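The %identity values listed with each hit can be checked against the alignment midlines (the unlabelled line between Query and Subjt). The sketch below is illustrative only: `percent_identity` is a hypothetical helper, and it assumes the usual BLAST-style convention that a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The two strings are the midline blocks of the first GenBank hit (XP_022138097.1), copied without their leading padding.

```python
def percent_identity(midline_blocks: list[str]) -> tuple[int, int, float]:
    """Return (identities, alignment length, percent identity) for a hit.

    Each block is one midline row of the alignment; letters count as
    identical positions, '+' and spaces do not.
    """
    midline = "".join(midline_blocks)
    identities = sum(1 for ch in midline if ch.isalpha())
    length = len(midline)
    return identities, length, 100.0 * identities / length


# Midline rows of the first GenBank hit, XP_022138097.1
blocks = [
    "MAVTLRPLDLTDIDDFMVWASDEKAAR CSWEPYT+KSDA+ FIK++VLPHPW+RAICVDGRPVGAISV+ANS+ RDRC GELGYVLGS FWGKGI T A",
    "VKLVAE IF E+P LER+EALV V+NLASQRV+EKAGF REGVLRK+GV KG+T+DFV+FSLLS+DL+",
]
identities, length, pct = percent_identity(blocks)
print(f"{identities}/{length} = {pct:.2f}%")
```

For this hit the count comes to 138 identical positions over 168 aligned columns, which agrees with the 82.14 reported on the page.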


Sequences
CDS sequence
ATGGCAGTGACTCTCCGGCCGCTGGATCTCACCGACATCGACGATTTCATGGTGTGGGCTTCCGATGAGAAGGCGGCTCGATTCTGCTCGTGGGAGCCCTACACGAACAA
ATCAGACGCCATAAACTTCATTAAGGAGAGAGTCCTACCGCACCCATGGTTCCGGGCGATATGCGTCGACGGCCGGCCGGTTGGGGCTATTTCAGTGTCTGCAAATTCGT
CGGTCAGGGATAGGTGCAGAGGCGAGCTCGGGTACGTATTGGGATCCAAATTCTGGGGGAAAGGGATCGGAACGACGGCGGTGAAATTGGTGGCGGAGACGATATTCGTC
GAGCAGCCAGGGCTGGAGAGGCTCGAAGCTCTGGTGGATGTGAAGAATCTGGCGTCTCAGAGAGTGCTGGAGAAGGCTGGTTTCCAGAGGGAAGGTGTTCTGAGAAAGTT
TGGAGTCTCGAAAGGGGAAACCAAAGATTTCGTCATCTTCAGTCTTCTTTCTTCTGATCTTGAAGTTGAAACCAAAATTGATGGACTGTAA
mRNA sequence
ATGGCAGTGACTCTCCGGCCGCTGGATCTCACCGACATCGACGATTTCATGGTGTGGGCTTCCGATGAGAAGGCGGCTCGATTCTGCTCGTGGGAGCCCTACACGAACAA
ATCAGACGCCATAAACTTCATTAAGGAGAGAGTCCTACCGCACCCATGGTTCCGGGCGATATGCGTCGACGGCCGGCCGGTTGGGGCTATTTCAGTGTCTGCAAATTCGT
CGGTCAGGGATAGGTGCAGAGGCGAGCTCGGGTACGTATTGGGATCCAAATTCTGGGGGAAAGGGATCGGAACGACGGCGGTGAAATTGGTGGCGGAGACGATATTCGTC
GAGCAGCCAGGGCTGGAGAGGCTCGAAGCTCTGGTGGATGTGAAGAATCTGGCGTCTCAGAGAGTGCTGGAGAAGGCTGGTTTCCAGAGGGAAGGTGTTCTGAGAAAGTT
TGGAGTCTCGAAAGGGGAAACCAAAGATTTCGTCATCTTCAGTCTTCTTTCTTCTGATCTTGAAGTTGAAACCAAAATTGATGGACTGTAA
Protein sequence
MAVTLRPLDLTDIDDFMVWASDEKAARFCSWEPYTNKSDAINFIKERVLPHPWFRAICVDGRPVGAISVSANSSVRDRCRGELGYVLGSKFWGKGIGTTAVKLVAETIFV
EQPGLERLEALVDVKNLASQRVLEKAGFQREGVLRKFGVSKGETKDFVIFSLLSSDLEVETKIDGL
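The relationship between the CDS and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch that assumes the standard genetic code (the page does not say which table was used); `translate`, `CDS_START` and `CDS_END` are hypothetical names, and the two fragments are simply the first 30 and last 18 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


CDS_START = "ATGGCAGTGACTCTCCGGCCGCTGGATCTC"  # first 10 codons of the CDS
CDS_END = "AAAATTGATGGACTGTAA"               # last 6 codons, including the stop

print(translate(CDS_START))  # MAVTLRPLDL
print(translate(CDS_END))    # KIDGL
```

Both fragments match the corresponding ends of the protein sequence shown above (`MAVTLRPLDL...` and `...KIDGL`). Passing the full CDS, with its line breaks joined, to `translate` should give the complete protein under the same assumption.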