; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013033 (gene) of Snake gourd v1 genome

Gene IDTan0013033
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA transcription factor 29-like
Genome locationLG07:10871690..10873005
RNA-Seq ExpressionTan0013033
SyntenyTan0013033
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010246.1 GATA transcription factor 29, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-5467.8Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS G RNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN+       ++PLQ   N NND YTLLNPA R TDD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
         +   +S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EAMAAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_022985849.1 GATA zinc finger domain-containing protein 8-like [Cucurbita maxima]6.1e-5670.62Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS GGRNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           +GSTAN+    ANL  PLQ   N NND YTLLNPA R  DD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
          EA+ASS +RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_023513295.1 putative GATA transcription factor 22 [Cucurbita pepo subsp. pepo]8.9e-5568.93Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN+    ANL  PLQ   N NND YTLLNPA R TDD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
         +   +S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_023522473.1 GATA transcription factor 21-like [Cucurbita pepo subsp. pepo]8.9e-5568.93Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN+    ANL  PLQ   N NND YTLLNPA R TDD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
         +   +S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_038902274.1 putative GATA transcription factor 22 [Benincasa hispida]5.5e-5771.91Show/hide
Query:  MVNNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQL-----------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRIT
        MV+NHG AS GGRNGE MMSE NYNPHQIVWSKELKTYVL SN++ QL            PI +T NN   + NL +P       NNDLYTLLNPASRIT
Subjt:  MVNNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQL-----------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRIT

Query:  DDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS
        DD  EA+ASS  RR+G+RRRRVS+ N+AE+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS
Subjt:  DDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS

TrEMBL top hitse value%identityAlignment
A0A0A0L0J0 GATA-type domain-containing protein5.1e-4863.74Show/hide
Query:  MVNNHGMASCGGRNGEMMMSEGNY-NPHQIVWSKELKTYVLNSN-------------NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPAS
        M NN G+     RN E MMSE NY NPHQIVWSK+LKTYVL SN             N  Q  PI ++  NN    NL +P      +NNDLYTLLNPAS
Subjt:  MVNNHGMASCGGRNGEMMMSEGNY-NPHQIVWSKELKTYVLNSN-------------NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPAS

Query:  RITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        R+ DD  EA+ASS+ RRKG+RRRRVS+ N+ E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKET+N+EAMAAENS+
Subjt:  RITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A5A7TDI3 GATA transcription factor 29-like6.4e-5166.1Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMPIG-----------STANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NN G+A    RN E MMSE NYNPHQIVWSK+LKTYVL SNNY QL P             +T+  N    NL +P      +NNDLYTLLNPASRI+DD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMPIG-----------STANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
          EA+ASS  RRKG+RRRR S+ N+ E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKET+N+EAMAAENS+
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A6J1DIM0 GATA transcription factor 291.6e-4663.07Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQL---------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQ
        NN G+ SCGG      MSE NYNP+QI+WSKELKTYVL SNN   L         + IG+ +  N +A   ++PLQ    N+NDLYTLLNPASR TDD +
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQL---------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQ

Query:  EADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
        + + SS +RRK +R RRVS  +E EKRCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +NREAM AENSH+
Subjt:  EADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

A0A6J1FTP0 GATA zinc finger domain-containing protein 8-like5.6e-5567.8Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN        ++PLQ   N NND YTLLNPA R TDD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
         +   +S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EAMAAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A6J1JEF0 GATA zinc finger domain-containing protein 8-like3.0e-5670.62Show/hide
Query:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD
        NNHG+AS GGRNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           +GSTAN+    ANL  PLQ   N NND YTLLNPA R  DD
Subjt:  NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDD

Query:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
          EA+ASS +RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  QQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

SwissProt top hitse value%identityAlignment
B8AX51 GATA transcription factor 154.5e-0968.42Show/hide
Query:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        ++RC   NC T  TP+WR GP GPKSLCNACGIRY+KE
Subjt:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q6L5E5 GATA transcription factor 154.5e-0968.42Show/hide
Query:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        ++RC   NC T  TP+WR GP GPKSLCNACGIRY+KE
Subjt:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q6QPM2 GATA transcription factor 195.3e-1060Show/hide
Query:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
        N   +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE   R A  A NS S
Subjt:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

Q8LC79 GATA transcription factor 184.5e-0970.27Show/hide
Query:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE
Subjt:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q9LT45 GATA transcription factor 291.3e-1143.18Show/hide
Query:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +D Y L++ PA R    +  +   +++ ++    +R+  C        E  K+CTN NCN   TPMWR+GPLGPKSLCNACGI++RKE
Subjt:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Arabidopsis top hitse value%identityAlignment
AT3G20750.1 GATA transcription factor 298.9e-1343.18Show/hide
Query:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +D Y L++ PA R    +  +   +++ ++    +R+  C        E  K+CTN NCN   TPMWR+GPLGPKSLCNACGI++RKE
Subjt:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT3G50870.1 GATA type zinc finger transcription factor family protein3.2e-1070.27Show/hide
Query:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE
Subjt:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT4G36620.1 GATA transcription factor 193.8e-1160Show/hide
Query:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
        N   +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE   R A  A NS S
Subjt:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

AT5G49300.1 GATA transcription factor 163.5e-0959.09Show/hide
Query:  SSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +S N+ +K C   +C T+ TP+WR GP+GPKSLCNACGIR RK+
Subjt:  SSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT5G56860.1 GATA type zinc finger transcription factor family protein5.4e-1042.15Show/hide
Query:  NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKS
        N +QL  I  T NNN   ++ H PL    N + D +  LN  + +T   ++  A+++  R  T      S N    R  + +CNTT TP+WR GP GPKS
Subjt:  NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKS

Query:  LCNACGIRYRKETLNREAMAA
        LCNACGIR RK    R AMAA
Subjt:  LCNACGIRYRKETLNREAMAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAACAATCATGGGATGGCGTCGTGCGGAGGCAGAAATGGTGAGATGATGATGAGTGAAGGAAATTACAACCCTCACCAAATTGTATGGTCAAAAGAGCTGAAAAC
CTATGTTCTTAATTCAAATAATTACGAGCAGCTGATGCCCATTGGCAGCACTGCAAATAACAACATGATAGCAGCCAATCTTCATATTCCTCTTCAAAAGAACAATAATA
ATAATAATGACCTCTACACACTCCTCAACCCTGCTTCTAGAATCACCGATGATCAACAAGAAGCCGATGCTTCCTCCAGTGCCCGCCGGAAAGGTACACGGCGGCGCCGA
GTTTCCTCCTGCAACGAGGCCGAGAAGAGGTGCACCAATTACAATTGCAACACCACTTTTACACCCATGTGGCGTAAAGGCCCTCTTGGTCCCAAGAGCCTTTGCAATGC
ATGCGGCATAAGGTACAGAAAGGAAACGTTGAATCGGGAAGCAATGGCGGCAGAAAACAGCCACAGCCACAGCCACAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAACAATCATGGGATGGCGTCGTGCGGAGGCAGAAATGGTGAGATGATGATGAGTGAAGGAAATTACAACCCTCACCAAATTGTATGGTCAAAAGAGCTGAAAAC
CTATGTTCTTAATTCAAATAATTACGAGCAGCTGATGCCCATTGGCAGCACTGCAAATAACAACATGATAGCAGCCAATCTTCATATTCCTCTTCAAAAGAACAATAATA
ATAATAATGACCTCTACACACTCCTCAACCCTGCTTCTAGAATCACCGATGATCAACAAGAAGCCGATGCTTCCTCCAGTGCCCGCCGGAAAGGTACACGGCGGCGCCGA
GTTTCCTCCTGCAACGAGGCCGAGAAGAGGTGCACCAATTACAATTGCAACACCACTTTTACACCCATGTGGCGTAAAGGCCCTCTTGGTCCCAAGAGCCTTTGCAATGC
ATGCGGCATAAGGTACAGAAAGGAAACGTTGAATCGGGAAGCAATGGCGGCAGAAAACAGCCACAGCCACAGCCACAGCTAG
Protein sequenceShow/hide protein sequence
MVNNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRR
VSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHSHSHS