; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004772 (gene) of Snake gourd v1 genome

Gene IDTan0004772
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionH4/H2A histone acetyltransferase complex
Genome locationLG04:9029266..9031180
RNA-Seq ExpressionTan0004772
SyntenyTan0004772
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
InterPro domainsIPR012423 - Chromatin modification-related protein Eaf7/MRGBP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135149.1 uncharacterized protein LOC101212766 isoform X2 [Cucumis sativus]1.0e-6097.62Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDG+SVHSPCKAPPSSASSLPKE PQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

XP_008446430.1 PREDICTED: uncharacterized protein LOC103489178 [Cucumis melo]2.7e-6198.41Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDG+SVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

XP_022945110.1 uncharacterized protein LOC111449449 [Cucurbita moschata]1.8e-6097.62Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGR+EEQDGMSVHSPCKAPPSSASSLPKEQ QIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

XP_022968533.1 uncharacterized protein LOC111467738 [Cucurbita maxima]3.0e-6096.83Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGR+EEQDGMSVHSPCKAPPSSASSLPKEQ QIELELK+LQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

XP_038893367.1 uncharacterized protein LOC120082183 [Benincasa hispida]9.3e-6299.21Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

TrEMBL top hitse value%identityAlignment
A0A0A0KQU2 Uncharacterized protein5.0e-6197.62Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDG+SVHSPCKAPPSSASSLPKE PQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

A0A1S3BFV8 uncharacterized protein LOC1034891781.3e-6198.41Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDG+SVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

A0A6J1FZZ7 uncharacterized protein LOC1114494498.5e-6197.62Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGR+EEQDGMSVHSPCKAPPSSASSLPKEQ QIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

A0A6J1HYA5 uncharacterized protein LOC1114677381.4e-6096.83Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGR+EEQDGMSVHSPCKAPPSSASSLPKEQ QIELELK+LQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILNHEEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

A0A6J1J1E0 uncharacterized protein LOC1114825265.5e-6096.83Show/hide
Query:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP
        MESGGKGREEEQDG+SVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVL+GLMEFLRRSFDR FSSDEVLQLLDRFYNLEMLKP
Subjt:  MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKP

Query:  DDEEMEILNHEEDFCLPQTFFVKEES
        DDEEMEILN EEDFCLPQTFFVKEES
Subjt:  DDEEMEILNHEEDFCLPQTFFVKEES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G26470.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: nucleus, H4/H2A histone acetyltransferase complex; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: CT20 (InterPro:IPR012423); Has 60 Blast hits to 60 proteins in 27 species: Archae - 0; Bacteria - 0; Metazoa - 26; Fungi - 2; Plants - 30; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).3.9e-5080.33Show/hide
Query:  GGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKPDDE
        GGK +EEEQDG+SVHSPCKA PSSASSL KEQ Q+ELEL+LL+ALEIYP VKL+GIHRHFVLYGLME+L RSFDR F++DEVLQLLDRFYN+EMLK DDE
Subjt:  GGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKPDDE

Query:  EMEILNHEEDFCLPQTFFVKEE
        +++ILNHEEDF LPQ+FF KEE
Subjt:  EMEILNHEEDFCLPQTFFVKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGCGGCGGCAAAGGGAGAGAGGAAGAACAAGACGGCATGTCTGTACACTCTCCATGCAAAGCTCCACCGTCCTCCGCTTCCTCTCTTCCCAAGGAGCAACCACA
GATTGAATTGGAACTCAAACTCTTGCAAGCTCTTGAAATTTATCCACTAGTTAAACTACAAGGCATTCATCGCCATTTTGTCCTCTATGGGTTGATGGAATTTCTGCGAA
GAAGCTTTGATCGTCAGTTCTCTTCTGATGAGGTTCTACAGTTGTTGGATCGGTTCTATAACCTGGAAATGCTGAAGCCAGATGATGAGGAAATGGAAATTCTGAATCAC
GAGGAAGATTTTTGCTTACCACAAACCTTCTTTGTCAAAGAAGAGTCTTAA
mRNA sequenceShow/hide mRNA sequence
CGCGGATCAAACAGCCAAAACCCTAACCAGAAATCACGGGGAAATTTTAATTACAAAAATCTGTTCAATCTGGAGGGAAAAAAAGGAAAAGAGGAAGAAATAGAAATTAA
AAACTCTGGCAGAGCATTGCTTTTCGACTCAGAGCATCTTCTTTCGCGATTGGCAGTTTATTCATGGAAAGCGGCGGCAAAGGGAGAGAGGAAGAACAAGACGGCATGTC
TGTACACTCTCCATGCAAAGCTCCACCGTCCTCCGCTTCCTCTCTTCCCAAGGAGCAACCACAGATTGAATTGGAACTCAAACTCTTGCAAGCTCTTGAAATTTATCCAC
TAGTTAAACTACAAGGCATTCATCGCCATTTTGTCCTCTATGGGTTGATGGAATTTCTGCGAAGAAGCTTTGATCGTCAGTTCTCTTCTGATGAGGTTCTACAGTTGTTG
GATCGGTTCTATAACCTGGAAATGCTGAAGCCAGATGATGAGGAAATGGAAATTCTGAATCACGAGGAAGATTTTTGCTTACCACAAACCTTCTTTGTCAAAGAAGAGTC
TTAAGGGAAATTGTTCATATGGATTAGTAACTTAGCACTTCCTGGTATATGATAGTTTTATTAATTTTATCTACCATCGTTCCCTTTTTGTTTTGTAAATGTCAATATAG
AATCTTTTGCTGGTTAGTATAATTCAAAATTTTCAAGGATGAATTCTTTAGTGCCTATACACGAGGGGATCTACTTAATTGAGCTGAG
Protein sequenceShow/hide protein sequence
MESGGKGREEEQDGMSVHSPCKAPPSSASSLPKEQPQIELELKLLQALEIYPLVKLQGIHRHFVLYGLMEFLRRSFDRQFSSDEVLQLLDRFYNLEMLKPDDEEMEILNH
EEDFCLPQTFFVKEES