; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011789 (gene) of Snake gourd v1 genome

Gene IDTan0011789
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationLG10:6885475..6886317
RNA-Seq ExpressionTan0011789
SyntenyTan0011789
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049351.1 putative N-acetyltransferase YoaA [Cucumis melo var. makuwa]2.3e-7986.39Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SSRISIRPFNLSDADDFLRWA D+RVTRYLRWNTI SKEEALTY+E +AIPH WR SICLDGRSVGYVS KPESEEKCRAHISYAV AE+WG+GIAT 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL+EFPEVVRVQA+VEVENEGSQ+VLEK+GFCREG+LRKYGFCKGEIRDL+VFSFLRT+QL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

KAE8650353.1 hypothetical protein Csa_010832 [Cucumis sativus]8.6e-7986.39Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SSRISIRPFNLSDADDFLRWA D+RVTRYLRWNTI SKEEALTYLE VAIPH WR SICLDGRSVGYVS KPESEEKCRAHISYAV AE+WG+GIAT 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL++FPE+VRVQA+VEVENEGSQ+VLEK+GFCREG+LRKYGFCKGEIRDL+VFS LRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

XP_022138208.1 uncharacterized protein LOC111009437 [Momordica charantia]5.1e-7984.71Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        MESSRIS+R F LSDA+DFLRWAGDDRVTRYLRW+ I SKEEA+TYLE VAIPHPWR SICLDGRSVGYVS++PE+EE+CRAHISYAV AE+WGRGIATA
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK
        AL AA+  ALKEFPEVVRVQALVEVEN GSQRVLEKVGFCREGLLRKYGFCKGEIRD  +FSFLRTDQ++
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK

XP_022955367.1 uncharacterized protein LOC111457416 [Cucurbita moschata]2.3e-7987.57Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SS IS+RPFNLSDADDFLRWAGDDRVTRYLRWNTI SKEEALTYLE VAIPHPWR SICL+G SVGYVS+KPESEEKCR +ISYAV AE+WGRGIATA
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREGLLRKYGFCKGEIR+ IVFSFLRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

XP_023527193.1 uncharacterized protein LOC111790505 [Cucurbita pepo subsp. pepo]6.0e-8088.17Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SS IS+RPFNLSDADDFLRWAGDDRVTRYLRWNTI SKEEALTYLE VAIPHPWR SICL+GRSVGYVS+KPESEEKCRA+ISYAV AE+WGRGIATA
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL  AI AAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREGLLRKYGFCKGEIR+ IVFSFLRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

TrEMBL top hitse value%identityAlignment
A0A0A0LA76 N-acetyltransferase domain-containing protein4.2e-7986.39Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SSRISIRPFNLSDADDFLRWA D+RVTRYLRWNTI SKEEALTYLE VAIPH WR SICLDGRSVGYVS KPESEEKCRAHISYAV AE+WG+GIAT 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL++FPE+VRVQA+VEVENEGSQ+VLEK+GFCREG+LRKYGFCKGEIRDL+VFS LRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

A0A5A7U710 Putative N-acetyltransferase YoaA1.1e-7986.39Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SSRISIRPFNLSDADDFLRWA D+RVTRYLRWNTI SKEEALTY+E +AIPH WR SICLDGRSVGYVS KPESEEKCRAHISYAV AE+WG+GIAT 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL+EFPEVVRVQA+VEVENEGSQ+VLEK+GFCREG+LRKYGFCKGEIRDL+VFSFLRT+QL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

A0A6J1CCF0 uncharacterized protein LOC1110094372.5e-7984.71Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        MESSRIS+R F LSDA+DFLRWAGDDRVTRYLRW+ I SKEEA+TYLE VAIPHPWR SICLDGRSVGYVS++PE+EE+CRAHISYAV AE+WGRGIATA
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK
        AL AA+  ALKEFPEVVRVQALVEVEN GSQRVLEKVGFCREGLLRKYGFCKGEIRD  +FSFLRTDQ++
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK

A0A6J1GTD3 uncharacterized protein LOC1114574161.1e-7987.57Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M SS IS+RPFNLSDADDFLRWAGDDRVTRYLRWNTI SKEEALTYLE VAIPHPWR SICL+G SVGYVS+KPESEEKCR +ISYAV AE+WGRGIATA
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL AAI AAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREGLLRKYGFCKGEIR+ IVFSFLRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

A0A6J1J069 uncharacterized protein LOC111480065 isoform X27.9e-7885.8Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA
        M  S IS+RPFNLSDADDFLRWAGDDRVTRYLRWNTI SKEEAL YLE VAIPHPWR SICL+GRSVGYVS+KPESEEKCRA+ISYAV AE+WGRGIAT 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATA

Query:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
        AL  AI AAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREGLLRKYGFCKGEIR+ IVFS LRTDQL
Subjt:  ALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase2.4e-0728.09Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDR-------VTRYLRWNTINSKEEALT-YLENVAIPHPWRWSI--CLDGRSVGYVSIKPESEEKCR-AHISYAVG
        ++   I +RP  ++DA++ L    ++R       + R   + T+  + + +T Y E +     + + I    D R +G VS+        + A I Y + 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDR-------VTRYLRWNTINSKEEALT-YLENVAIPHPWRWSI--CLDGRSVGYVSIKPESEEKCR-AHISYAVG

Query:  AENWGRGIATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD
          + G+GI T A+   +  A  E  ++ R++A V   N GS RVLEK GF +EG+ RK     G   D  V + L  D
Subjt:  AENWGRGIATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD

O34569 Uncharacterized N-acetyltransferase YoaA5.5e-1227.49Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHP----WRWSI-CLDGRS-VGYVSIKPESEEKCRAHISYAVGAENWG
        +E+ R+ +R     DA+       +D VTRY     + S E+A++ ++  A  +      RW I   D +  +G +     +++  RA I Y +  E+W 
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHP----WRWSI-CLDGRS-VGYVSIKPESEEKCRAHISYAVGAENWG

Query:  RGIATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLR
         G A+  ++  ++        + R+ A+V  +NE S R+L K+GF +EG+LR+Y +  G   D  V+S ++
Subjt:  RGIATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLR

P05332 Uncharacterized N-acetyltransferase p208.6e-1327.49Show/hide
Query:  SSRISIRPFNLSDADDFLRWAGDDRVTRYLR---WNTINSKEEALTYLENVAIP-HPWRWSICL--DGRSVGYVSIKPESEEKCRAHISYAVGAENWGRG
        + R+++R   L DAD   ++  D  VT+Y+    +  ++   + +  + ++++     R+SI +      +G        +E  RA I Y +G  +WG+G
Subjt:  SSRISIRPFNLSDADDFLRWAGDDRVTRYLR---WNTINSKEEALTYLENVAIP-HPWRWSICL--DGRSVGYVSIKPESEEKCRAHISYAVGAENWGRG

Query:  IATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD
         A+ A+   I         + R++A VE EN  S ++L  + F +EGLLR Y   KG + D+ +FS L+ +
Subjt:  IATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-4147.31Show/hide
Query:  RISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICL-DGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATAALT
        RIS+RP  LSD DD++ WA D +V R+  W    S++EA+ Y+ +  + HPW  +ICL D R +GY+ I   + +  R  I Y +  + WG+G AT A+ 
Subjt:  RISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICL-DGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATAALT

Query:  AAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK
           A   +EFPE+ R++ALV+V+N GSQRVLEKVGF REG++RK+   KG +RD ++FSFL TD LK
Subjt:  AAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.1e-4347.59Show/hide
Query:  RISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDG-RSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATAALT
        +I +RP  LSD DDF+ WA D  VTR+  W    S+E A+ YL +  +PHPW  +ICLD  R +G +S+ P  E   R  I Y +G++ WG+GIAT A+ 
Subjt:  RISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDG-RSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATAALT

Query:  AAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL
               KE PE+ R++ALV+V+N GSQ+VLEKVGF +EG++RK+ + KG +RD+++FSFL +D L
Subjt:  AAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQL

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.1e-5863.53Show/hide
Query:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICL--DGRSVGYVSIKPES-EEKCRAHISYAVGAENWGRGI
        MES RI +RPFNLSDA+D  +WAGDD VTRYLRW+++NS EEA  ++ N AIPHPWR SI L  DG S+GYVS+KP+S + +CRA ++YAV  E WGRGI
Subjt:  MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICL--DGRSVGYVSIKPES-EEKCRAHISYAVGAENWGRGI

Query:  ATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD
        ATAA+  A+  AL++FPEVVR+QA+VEVEN+ SQRVLEK GF +EGLL KYGF KG IRD+ ++S+++ D
Subjt:  ATAALTAAIAAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCATCAAGAATTTCCATCCGGCCGTTCAATCTCTCCGACGCCGACGACTTTCTCCGATGGGCCGGCGACGACAGAGTAACCCGATATCTCCGATGGAACACAAT
CAACTCCAAAGAAGAAGCATTAACTTACCTAGAGAATGTCGCAATTCCCCATCCATGGCGGTGGTCCATCTGCTTGGACGGCCGTTCCGTCGGATACGTTTCGATCAAGC
CGGAATCGGAGGAGAAATGCAGAGCGCATATCAGTTATGCTGTGGGTGCAGAGAATTGGGGCCGAGGAATAGCCACGGCGGCGCTGACGGCGGCGATTGCAGCGGCGTTG
AAAGAGTTCCCGGAGGTGGTTAGGGTTCAGGCGCTGGTGGAGGTCGAGAATGAGGGATCCCAGAGGGTCCTTGAAAAAGTAGGGTTTTGCAGAGAGGGGCTGTTGAGGAA
ATATGGATTTTGTAAGGGCGAAATTCGGGATTTGATTGTTTTCAGTTTTTTAAGAACCGATCAACTCAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATCCAAACCATTTTTTAACCCCAAAGCCCTCAATAACAACCGGCCCTCTTTCCGATCGATGGAATCATCAAGAATTTCCATCCGGCCGTTCAATCTCTCCGACGCCGACG
ACTTTCTCCGATGGGCCGGCGACGACAGAGTAACCCGATATCTCCGATGGAACACAATCAACTCCAAAGAAGAAGCATTAACTTACCTAGAGAATGTCGCAATTCCCCAT
CCATGGCGGTGGTCCATCTGCTTGGACGGCCGTTCCGTCGGATACGTTTCGATCAAGCCGGAATCGGAGGAGAAATGCAGAGCGCATATCAGTTATGCTGTGGGTGCAGA
GAATTGGGGCCGAGGAATAGCCACGGCGGCGCTGACGGCGGCGATTGCAGCGGCGTTGAAAGAGTTCCCGGAGGTGGTTAGGGTTCAGGCGCTGGTGGAGGTCGAGAATG
AGGGATCCCAGAGGGTCCTTGAAAAAGTAGGGTTTTGCAGAGAGGGGCTGTTGAGGAAATATGGATTTTGTAAGGGCGAAATTCGGGATTTGATTGTTTTCAGTTTTTTA
AGAACCGATCAACTCAAGTGAATCAATTTGGGATGCAAAATGGTACTAAAAAAAATGGTATTCCGATCTCTTTCTCCAATCAACAATGCTTGCAAAACCCAAGTATTATC
CTTTTCGAATATCATTTATTTAGAATTGCTGCAGCTTTTAATTATTAGGAAAATCAGCTGTTCAGATTATGGCATGGTAAATTTTAATTTTAAACCTAAAAAAAGGGTTA
CAATATTTGATAAATTCAAACAAAATTATTATAGTTTTTTTAAGACAACAATATTAAAGTAAAGAATAGAGTG
Protein sequenceShow/hide protein sequence
MESSRISIRPFNLSDADDFLRWAGDDRVTRYLRWNTINSKEEALTYLENVAIPHPWRWSICLDGRSVGYVSIKPESEEKCRAHISYAVGAENWGRGIATAALTAAIAAAL
KEFPEVVRVQALVEVENEGSQRVLEKVGFCREGLLRKYGFCKGEIRDLIVFSFLRTDQLK