; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010352 (gene) of Snake gourd v1 genome

Gene IDTan0010352
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionACT domain-containing protein
Genome locationLG03:63059217..63062174
RNA-Seq ExpressionTan0010352
SyntenyTan0010352
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595565.1 hypothetical protein SDJN03_12118, partial [Cucurbita argyrosperma subsp. sororia]3.4e-4189.42Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        MN  K  EF ASTSSSAVVNVESL RGFLINVY EKNSPGLLVRILEAFEKLGLEVLDA+VSCSDCFQLQAVGEENE TKI+KPQVVKKAV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQ
        SNGQ
Subjt:  SNGQ

XP_022950375.1 uncharacterized protein LOC111453491 [Cucurbita moschata]3.3e-4491.43Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        +N HKSKEFRASTSSSAVVNVE+LERGFLINV+SE+NSPGLLV+ILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIK QVVK AV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQA
        SNGQA
Subjt:  SNGQA

XP_022966518.1 uncharacterized protein LOC111466169 [Cucurbita maxima]7.5e-4188.46Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        MN  K  EF ASTSSSAVVNVESL RGFLINVY EKNSPGLLVRILEAFEKLGLEVLDA+VSCSDCFQLQAVGEENE TK++KPQVVKKAV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQ
        SNGQ
Subjt:  SNGQ

XP_022977370.1 uncharacterized protein LOC111477723 [Cucurbita maxima]1.5e-4492.38Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        +N HKSKEFRASTSSSAVVNVE LERGFLINV+SE+NSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIK QVVK AV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQA
        SNGQA
Subjt:  SNGQA

XP_038882922.1 uncharacterized protein LOC120074022 [Benincasa hispida]1.8e-4290.38Show/hide
Query:  NRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSES
        + HKSKEFRASTSSSA+VNVESLERGFLINV+ E+NSPGLLVRILEAFEKLGLEVLDA+VSCSDCFQLQAVGEENEGTKIIK QVVK AV QAIKEWSES
Subjt:  NRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSES

Query:  NGQA
        NGQA
Subjt:  NGQA

TrEMBL top hitse value%identityAlignment
A0A1S3CLI8 uncharacterized protein LOC1035021841.4e-4086.54Show/hide
Query:  NRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSES
        + H SKEFRASTSSSAVVNVES+ERGFLINV+ E+NSPGLLVRILEAFEKLGL VLDAD+SCSDCFQLQAVGEENEG KIIK QVVK AV QAIKEWSES
Subjt:  NRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSES

Query:  NGQA
        +GQA
Subjt:  NGQA

A0A6J1CQF9 uncharacterized protein LOC1110137972.4e-4089.42Show/hide
Query:  HKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEW--SES
        HKSKEFRASTSSSA VNVES+ERGF+INVY E+NSPGLLVRILE FEKLGLEVLDA VSCSD FQLQAVGEENEGTKIIKPQVVKKAV QAIKEW  SES
Subjt:  HKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEW--SES

Query:  NGQA
        NGQA
Subjt:  NGQA

A0A6J1GFJ5 uncharacterized protein LOC1114534911.6e-4491.43Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        +N HKSKEFRASTSSSAVVNVE+LERGFLINV+SE+NSPGLLV+ILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIK QVVK AV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQA
        SNGQA
Subjt:  SNGQA

A0A6J1HSD7 uncharacterized protein LOC1114661693.6e-4188.46Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        MN  K  EF ASTSSSAVVNVESL RGFLINVY EKNSPGLLVRILEAFEKLGLEVLDA+VSCSDCFQLQAVGEENE TK++KPQVVKKAV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQ
        SNGQ
Subjt:  SNGQ

A0A6J1IM47 uncharacterized protein LOC1114777237.1e-4592.38Show/hide
Query:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE
        +N HKSKEFRASTSSSAVVNVE LERGFLINV+SE+NSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIK QVVK AV QAIKEWSE
Subjt:  MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSE

Query:  SNGQA
        SNGQA
Subjt:  SNGQA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein4.6e-0440.74Show/hide
Query:  VNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAV
        V VE +   F + + S +     LV ILEAFE++GL V  A  SC D F ++A+
Subjt:  VNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAV

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)2.7e-2052.94Show/hide
Query:  VVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSESN
        +V VE+L++GF+INV+S KN PG+LV +LEAFE +GL VL+A  SC+D F L A+G ENE  + +  + VK+AV  AI+ W E N
Subjt:  VVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSESN

AT3G56220.1 transcription regulators8.6e-1947.42Show/hide
Query:  KEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVG-EENEGTKIIKPQVVKKAVNQAIKEWSESN
        + FR S+  + +V VE+LE+GF+I V S KN  G+LV +LE FE LGL+V++A VSC+D F L A+G   N+    I  + VK+AV +AI+ WS+S+
Subjt:  KEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVG-EENEGTKIIKPQVVKKAVNQAIKEWSESN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGTCACAAGTCCAAAGAATTTCGTGCATCCACAAGCTCTAGTGCTGTGGTGAATGTGGAGAGTTTAGAGAGGGGATTTCTAATTAACGTATATTCAGAAAAGAA
TTCACCAGGATTGTTGGTTCGGATTCTAGAAGCCTTTGAGAAATTGGGACTTGAAGTTCTCGATGCCGACGTGTCTTGTTCTGATTGTTTTCAACTCCAAGCTGTTGGAG
AAGAGAATGAAGGCACCAAAATCATAAAGCCTCAAGTGGTGAAAAAAGCAGTAAACCAAGCAATCAAGGAATGGAGTGAAAGTAATGGGCAGGCATGA
mRNA sequenceShow/hide mRNA sequence
GGAAAAAAGAAAATAAAAAATAAAAAAGGAAAATAGAAAATAGAAAATGCAAATATAAAAGAAGAGGAACTCGTGGTCTGGTTTGTTTTAAGAGCTCGGGAAGAGAAAGA
AAGCAAAAGCCTCTGGTGGCGCGTGGGCGGCGAGCGGTGGAGACACGGCGGACGGCGGAGTTTCGATTGATTTTTTATTTCCGGCGGCTGTTGGGTTCGAGAGAGCGAAA
GAAAGCAGCAGCTTCTTCAATGAATCGTCACAAGTCCAAAGAATTTCGTGCATCCACAAGCTCTAGTGCTGTGGTGAATGTGGAGAGTTTAGAGAGGGGATTTCTAATTA
ACGTATATTCAGAAAAGAATTCACCAGGATTGTTGGTTCGGATTCTAGAAGCCTTTGAGAAATTGGGACTTGAAGTTCTCGATGCCGACGTGTCTTGTTCTGATTGTTTT
CAACTCCAAGCTGTTGGAGAAGAGAATGAAGGCACCAAAATCATAAAGCCTCAAGTGGTGAAAAAAGCAGTAAACCAAGCAATCAAGGAATGGAGTGAAAGTAATGGGCA
GGCATGAAATATTATTACATTATTTATTTCCCTTCTTTTTCTCTTCTTTTTCTTTTCCCCAAACAGAATATCTCTTCTATGTATAGGGTAACAAATTTTCAACTGCCAAC
GAAAGCATAGCTCAATTGGCATCAAGTATAACTCATGACTAAAGAGGTCATGGGTTCGAATCTCTTTACTCCCAATTATTGATCTACTAAAAAAAAGAAACAAACTTTCA
ACTCTCAAGAGTGTAGAACTAAAAAAAAATCCTCCCAAAACCCTTAGAAGAAATTCAGAAATAAAAATTGTTTTTCCCTCTTCTTCTTTCTTTCAAAAAATCACTATAAT
TTTGGCAAGGGATGAGTTTATATAAAATATGCCTCGTTAAAACGAAAAAGAGTACATATTTTATATAATACATTTACTCCCCCTCATGACGACATCACATTTTATATAAT
ACATTTACTCCCCCTCATGACGACATCACTTGAGACCTCTGAGTCTTCGCATCCCAATACTGTGAACCAATTTCTCAAATGTTGTAGTTGGTAATGCCTTTGTAAATATG
TCTGGCAAGTTATCCTTCGAACAAATCTGTTGTACTGAGATGTCATCATTTTCTTCAAGGTCATGAGTGTAGAAAAGTTTTAGAGAGATATGCTTCGTTCTATCTCCTTT
AATAATATATCCTCCTTTGACTTGAGCTATACATGTTTTGTTGTCTTCGTATAATATTGTTGGAATGAGTTTGCTCGAAAAC
Protein sequenceShow/hide protein sequence
MNRHKSKEFRASTSSSAVVNVESLERGFLINVYSEKNSPGLLVRILEAFEKLGLEVLDADVSCSDCFQLQAVGEENEGTKIIKPQVVKKAVNQAIKEWSESNGQA