; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016272 (gene) of Snake gourd v1 genome

Gene IDTan0016272
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA transcription factor 16-like
Genome locationLG09:5202867..5205777
RNA-Seq ExpressionTan0016272
SyntenyTan0016272
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135612.1 GATA transcription factor 17-like isoform X1 [Momordica charantia]6.5e-2853.22Show/hide
Query:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---
        MGMMD +R+K + +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA   
Subjt:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---

Query:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                        G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

XP_022135613.1 GATA transcription factor 17-like isoform X2 [Momordica charantia]1.4e-2752.35Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----
        MGMMD+ ++   +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA    
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----

Query:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                       G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

XP_022135614.1 GATA transcription factor 16-like isoform X3 [Momordica charantia]6.5e-2853.22Show/hide
Query:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---
        MGMMD +R+K + +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA   
Subjt:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---

Query:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                        G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

XP_022135615.1 GATA transcription factor 16-like isoform X4 [Momordica charantia]1.4e-2752.35Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----
        MGMMD+ ++   +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA    
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----

Query:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                       G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

XP_022989484.1 GATA transcription factor 16-like [Cucurbita maxima]5.8e-2953.46Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLH---------------NHSSTAGGCGDG
        MGMMD+ QKG +  T+    TKKCCVDC TTKTPLWRGGPAGPKSLCNACGIRFRKRRIST R    KR+R H               N +++ GG GDG
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLH---------------NHSSTAGGCGDG

Query:  -------------------EEVLLVKK----QRWRKLPEEEEEQAAVSLMALSCGSVFA
                           EEV++V+     Q+  KL   EEEQAAV LMALSCGSVFA
Subjt:  -------------------EEVLLVKK----QRWRKLPEEEEEQAAVSLMALSCGSVFA

TrEMBL top hitse value%identityAlignment
A0A6J1C1I3 GATA transcription factor 16-like isoform X47.0e-2852.35Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----
        MGMMD+ ++   +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA    
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----

Query:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                       G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

A0A6J1C1Y1 GATA transcription factor 16-like isoform X33.1e-2853.22Show/hide
Query:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---
        MGMMD +R+K + +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA   
Subjt:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---

Query:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                        G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

A0A6J1C373 GATA transcription factor 17-like isoform X13.1e-2853.22Show/hide
Query:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---
        MGMMD +R+K + +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA   
Subjt:  MGMMD-MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA---

Query:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                        G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ----------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

A0A6J1C5A4 GATA transcription factor 17-like isoform X27.0e-2852.35Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----
        MGMMD+ ++   +  EDD  TKK CVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR+ST    R CD KR++ H+H              S+TA    
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTY---RRCDSKRQRLHNH--------------SSTA----

Query:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA
                       G CG          GEEV++   + KQR  RKL   EEEQAAVSLMALSCGSVFA
Subjt:  ---------------GGCGD---------GEEVLL---VKKQR-WRKLPEEEEEQAAVSLMALSCGSVFA

A0A6J1JPG4 GATA transcription factor 16-like2.8e-2953.46Show/hide
Query:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLH---------------NHSSTAGGCGDG
        MGMMD+ QKG +  T+    TKKCCVDC TTKTPLWRGGPAGPKSLCNACGIRFRKRRIST R    KR+R H               N +++ GG GDG
Subjt:  MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLH---------------NHSSTAGGCGDG

Query:  -------------------EEVLLVKK----QRWRKLPEEEEEQAAVSLMALSCGSVFA
                           EEV++V+     Q+  KL   EEEQAAV LMALSCGSVFA
Subjt:  -------------------EEVLLVKK----QRWRKLPEEEEEQAAVSLMALSCGSVFA

SwissProt top hitse value%identityAlignment
Q8LC59 GATA transcription factor 233.7e-1848.7Show/hide
Query:  EDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR---------ISTYRRCDSKRQRLHNHSSTAGGCGDGEEVLLVKKQRWRKLPEEEEEQ
        +++  T +CC +CKTTKTP+WRGGP GPKSLCNACGIR RK+R         I +++   SK+  L   SS+ GG       + VKK+R  K    EEEQ
Subjt:  EDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR---------ISTYRRCDSKRQRLHNHSSTAGGCGDGEEVLLVKKQRWRKLPEEEEEQ

Query:  AAVSLMALSCGSVFA
        AA+ L+ LSC SV A
Subjt:  AAVSLMALSCGSVFA

Q8LC79 GATA transcription factor 181.3e-1063.64Show/hide
Query:  DSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST
        DS+  + C +C TT TPLWR GP GPKSLCNACGIRF+K    T
Subjt:  DSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST

Q8LG10 GATA transcription factor 155.3e-1746.88Show/hide
Query:  MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR-KRRISTYRRCDSKRQRLHNHSSTAGGCGD------GEEVLL----VKKQ
        + +   +   E  S  KK C  C T+KTPLWRGGPAGPKSLCNACGIR R KRR     R + K+++ HN +   G          G EV++     + Q
Subjt:  MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR-KRRISTYRRCDSKRQRLHNHSSTAGGCGD------GEEVLL----VKKQ

Query:  RWRKLPEEEEEQAAVSLMALS-CGSVFA
        R  KL   EEEQAAV LMALS   SV+A
Subjt:  RWRKLPEEEEEQAAVSLMALS-CGSVFA

Q9FJ10 GATA transcription factor 164.1e-1752.68Show/hide
Query:  KKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLHNHSSTAGGCGDGEEV------------LLVKKQRWRKLPEEEEEQAAV
        KK C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    R      ++L   SS  G    GE +              V+KQR +KL   EEEQAAV
Subjt:  KKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLHNHSSTAGGCGDGEEV------------LLVKKQRWRKLPEEEEEQAAV

Query:  SLMALSCGSVFA
         LMALS GSV+A
Subjt:  SLMALSCGSVFA

Q9LIB5 GATA transcription factor 175.0e-1537.91Show/hide
Query:  TKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST--YRRCDSKRQR------------------------------------LHNHSSTAGGC
        TK+ CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R +    R  + K+ R                                     +N  S++   
Subjt:  TKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST--YRRCDSKRQR------------------------------------LHNHSSTAGGC

Query:  GDGEEVLL--------------VKKQRWRKLPEEEEEQAAVSLMALSCGSVFA
          G    L               KK+ WRKL   EEE+AAV LMALSC SV+A
Subjt:  GDGEEVLL--------------VKKQRWRKLPEEEEEQAAVSLMALSCGSVFA

Arabidopsis top hitse value%identityAlignment
AT3G06740.1 GATA transcription factor 153.8e-1846.88Show/hide
Query:  MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR-KRRISTYRRCDSKRQRLHNHSSTAGGCGD------GEEVLL----VKKQ
        + +   +   E  S  KK C  C T+KTPLWRGGPAGPKSLCNACGIR R KRR     R + K+++ HN +   G          G EV++     + Q
Subjt:  MRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR-KRRISTYRRCDSKRQRLHNHSSTAGGCGD------GEEVLL----VKKQ

Query:  RWRKLPEEEEEQAAVSLMALS-CGSVFA
        R  KL   EEEQAAV LMALS   SV+A
Subjt:  RWRKLPEEEEEQAAVSLMALS-CGSVFA

AT3G16870.1 GATA transcription factor 173.5e-1637.91Show/hide
Query:  TKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST--YRRCDSKRQR------------------------------------LHNHSSTAGGC
        TK+ CVDC T +TPLWRGGPAGPKSLCNACGI+ RK+R +    R  + K+ R                                     +N  S++   
Subjt:  TKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRIST--YRRCDSKRQR------------------------------------LHNHSSTAGGC

Query:  GDGEEVLL--------------VKKQRWRKLPEEEEEQAAVSLMALSCGSVFA
          G    L               KK+ WRKL   EEE+AAV LMALSC SV+A
Subjt:  GDGEEVLL--------------VKKQRWRKLPEEEEEQAAVSLMALSCGSVFA

AT4G16141.1 GATA type zinc finger transcription factor family protein5.7e-1465.31Show/hide
Query:  GQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR
        G    +     TKK CVDC T++TPLWRGGPAGPKSLCNACGI+ RK+R
Subjt:  GQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR

AT5G26930.1 GATA transcription factor 232.6e-1948.7Show/hide
Query:  EDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR---------ISTYRRCDSKRQRLHNHSSTAGGCGDGEEVLLVKKQRWRKLPEEEEEQ
        +++  T +CC +CKTTKTP+WRGGP GPKSLCNACGIR RK+R         I +++   SK+  L   SS+ GG       + VKK+R  K    EEEQ
Subjt:  EDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRR---------ISTYRRCDSKRQRLHNHSSTAGGCGDGEEVLLVKKQRWRKLPEEEEEQ

Query:  AAVSLMALSCGSVFA
        AA+ L+ LSC SV A
Subjt:  AAVSLMALSCGSVFA

AT5G49300.1 GATA transcription factor 162.9e-1852.68Show/hide
Query:  KKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLHNHSSTAGGCGDGEEV------------LLVKKQRWRKLPEEEEEQAAV
        KK C DC T+KTPLWRGGP GPKSLCNACGIR RK+R    R      ++L   SS  G    GE +              V+KQR +KL   EEEQAAV
Subjt:  KKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLHNHSSTAGGCGDGEEV------------LLVKKQRWRKLPEEEEEQAAV

Query:  SLMALSCGSVFA
         LMALS GSV+A
Subjt:  SLMALSCGSVFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATGATGGATATGAGACAAAAGGGTCAAGCAGATGAAACAGAGGATGATTCCATTACCAAAAAATGTTGTGTTGATTGTAAGACTACAAAGACTCCTTTGTGGCG
TGGAGGCCCTGCTGGACCTAAGTCACTGTGTAATGCATGTGGGATCAGGTTTAGGAAGAGAAGAATATCAACCTACAGAAGATGTGACAGCAAGAGACAGAGACTTCACA
ATCATAGTTCCACCGCCGGCGGTTGTGGAGATGGGGAGGAGGTGCTGCTGGTTAAGAAACAGCGGTGGAGGAAGCTCCCGGAGGAGGAGGAGGAGCAGGCGGCGGTGTCG
TTAATGGCGCTGTCGTGCGGCTCTGTGTTTGCTTGA
mRNA sequenceShow/hide mRNA sequence
TCATTTTATCCCTATCCCTAATAACATTTTTCAGAGTCTTTAATGATTTCAATTTTCTTCCCTAAAACCTGAAAAATGGGTATGATGGATATGAGACAAAAGGGTCAAGC
AGATGAAACAGAGGATGATTCCATTACCAAAAAATGTTGTGTTGATTGTAAGACTACAAAGACTCCTTTGTGGCGTGGAGGCCCTGCTGGACCTAAGTCACTGTGTAATG
CATGTGGGATCAGGTTTAGGAAGAGAAGAATATCAACCTACAGAAGATGTGACAGCAAGAGACAGAGACTTCACAATCATAGTTCCACCGCCGGCGGTTGTGGAGATGGG
GAGGAGGTGCTGCTGGTTAAGAAACAGCGGTGGAGGAAGCTCCCGGAGGAGGAGGAGGAGCAGGCGGCGGTGTCGTTAATGGCGCTGTCGTGCGGCTCTGTGTTTGCTTG
AAGAAGAAAGGTAGATCGACGAAGATGACTTTGGCAGTGCTATTCGACACAAAATCCAACCCACTTTTTATGCAAAATCCAACAAAATTCATCTTTGTTTTTTAAATGCT
GTGTCAATCAATGTGTCTATATATATTAATTTTAATGACAAATTAATTTTATGCAAGCTAGGTTTAACCAATTTTCTTTCAAATGAAATGATACAATGGAAAATTCTTAA
TATATAGACGGTAAGTTTGAGTATTTTTAAAAATAAAGGACCAAATTTTAAAAGAACCTTAAAGATTACGGATATCATAAATATACCAAAAATTTTAGGGGAGATTTAAG
GTTTTTTTTTTTTTTTTTTGAGTACAACATCACCATGTGGGAGCATCAACTCCGTTCCTGTGGGCTTTTGAGGAGTGAGAAAAATTATTGGAATTCTATGAAGAGTCTCG
GAAGCCAGGATGCATGCCAGTTTCATACAACCAGGTGGAGCTTTCACTATTCGATAGCATTATCATTGATTGCGCTAGCACTATTCCGATAGCATTATCATTAAAAGTAA
GATCTGCAGTGATGTAATCAGGATTTCATCTCCACAATTCTTTTTTTCTATGAAAGCAGGAAAGTTTAGTTCAAAATTCTACCTTATTCCATGAGATCTATAACTACTTA
TGCAACCTTCTCTCTCCTCTCTAGGGAAATTCGAACCCATGACTTTTGGTTTTAAGTCTTATTTGATGTCAATTGAGCTATGCTCTTGTTGGTTCAAGGTTGAACCATTA
AATTATGGGAGATTTAAGTTTATTAAGTTTAGGATCAAAATGAATCATATTATTATTATTGTTGTTGTTTTGATACATGAATAACATTTTCTAAAATAAACGCTATTTTG
ATAATTTAGAAGTCATTGTAGTCGTAAATAATTTCCACTCTTTTGTAAGTTTTTCCTAATGGTCATTTATAAACAAACATATCGCCTAATCATGTATATGTTTTCCTGCC
ACTTCTCTTGAGGGGTACAAGTTATAAAAAAAATTAATAAATTGAAATTTGATTGAAAATGTAGAAGGTTTAAATAATCTTAAAATATTTATAAATG
Protein sequenceShow/hide protein sequence
MGMMDMRQKGQADETEDDSITKKCCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRISTYRRCDSKRQRLHNHSSTAGGCGDGEEVLLVKKQRWRKLPEEEEEQAAVS
LMALSCGSVFA