; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009993 (gene) of Snake gourd v1 genome

Gene IDTan0009993
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA transcription factor 15-like
Genome locationLG06:5864376..5866635
RNA-Seq ExpressionTan0009993
SyntenyTan0009993
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046674.1 GATA transcription factor 15-like [Cucumis melo var. makuwa]6.5e-3257.14Show/hide
Query:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK---------NNINGGVIKSGKGKNR-G
        + F M+R+   + STFN       ++     +     R CVHCR T TPLWR GPAGPRSLCNACGIRYRK+K         NN N    K GKGK   G
Subjt:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK---------NNINGGVIKSGKGKNR-G

Query:  GGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        GGSLK+RVV  GREI++HR   TAM +NGVAE IGEEEQ AAMLLMALSSGY+S
Subjt:  GGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS

XP_008451515.1 PREDICTED: GATA transcription factor 15-like [Cucumis melo]1.9e-3155.7Show/hide
Query:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK-------------NNINGGVIKSGKGK
        + F M+R+   + STFN       ++     +     R CVHCR T TPLWR GPAGPRSLCNACGIRYRK+K             NN N    K GKGK
Subjt:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK-------------NNINGGVIKSGKGK

Query:  NR-GGGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS
           GGGSLK+RVV  GREI++HR   TAM +NGVAE IGEEEQ AAMLLMALSSGY+S
Subjt:  NR-GGGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS

XP_022959984.1 GATA transcription factor 15-like [Cucurbita moschata]8.2e-3565.75Show/hide
Query:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR
        + FF++     +GSTFN    S LM ISN           RTC HCRTT TPLWR GPAGPRSLCNACGIRYRKLKN+ NGGV KS  GK        +R
Subjt:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR

Query:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        VVG GREI+LHRAT AMAENG AE IGEEEQAAAMLLMALSSGY+S
Subjt:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS

XP_023004687.1 GATA transcription factor 15-like [Cucurbita maxima]5.7e-3666.44Show/hide
Query:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR
        + FF++     +GSTFN    S LM ISN           RTC HCRTT TPLWR GPAGPRSLCNACGIRYRKLKNN NGGV KSG GK        +R
Subjt:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR

Query:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        VVG GREI+LHR T AMAENG AE IGEEEQAAAMLLMALSSGY+S
Subjt:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS

XP_023515141.1 GATA transcription factor 15-like [Cucurbita pepo subsp. pepo]5.7e-3666.44Show/hide
Query:  LQFFMDRSDLSAGSTFNNQSSSDLMPISNH----------RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR
        + FF++     +GSTFN    S LM ISNH          RTC HCRTT TPLWR GPAGPRSLCNACGIRYRKLKN+ NGGV KSG GK        +R
Subjt:  LQFFMDRSDLSAGSTFNNQSSSDLMPISNH----------RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR

Query:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        VV  GREI+LHRAT AMAENG AE IGEEEQAAAMLLMALSSGY+S
Subjt:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS

TrEMBL top hitse value%identityAlignment
A0A0A0K6B1 GATA-type domain-containing protein3.9e-3059.09Show/hide
Query:  LQFFMDR-SDLSAGSTFNNQSSSDL--------MPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK--NNINGGV-----IKSGKGKNRG-
        + F M+R +DLSA STFN  S            +     R CVHCR T TPLWR GPAGPRSLCNACGIRYRK+K  +N NGGV      K GKGK  G 
Subjt:  LQFFMDR-SDLSAGSTFNNQSSSDL--------MPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK--NNINGGV-----IKSGKGKNRG-

Query:  -GGSLKMRVVGFGREIVLHRATAMAE--NGVAEVIGEEEQAAAMLLMALSSGYL
         GGSLK+RVV  GREI++HR T   E  N VAE IGEEEQ AA+LLMALSSGY+
Subjt:  -GGSLKMRVVGFGREIVLHRATAMAE--NGVAEVIGEEEQAAAMLLMALSSGYL

A0A1S3BSR7 GATA transcription factor 15-like9.2e-3255.7Show/hide
Query:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK-------------NNINGGVIKSGKGK
        + F M+R+   + STFN       ++     +     R CVHCR T TPLWR GPAGPRSLCNACGIRYRK+K             NN N    K GKGK
Subjt:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK-------------NNINGGVIKSGKGK

Query:  NR-GGGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS
           GGGSLK+RVV  GREI++HR   TAM +NGVAE IGEEEQ AAMLLMALSSGY+S
Subjt:  NR-GGGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS

A0A5A7TUD0 GATA transcription factor 15-like3.2e-3257.14Show/hide
Query:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK---------NNINGGVIKSGKGKNR-G
        + F M+R+   + STFN       ++     +     R CVHCR T TPLWR GPAGPRSLCNACGIRYRK+K         NN N    K GKGK   G
Subjt:  LQFFMDRSDLSAGSTFN-------NQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLK---------NNINGGVIKSGKGKNR-G

Query:  GGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        GGSLK+RVV  GREI++HR   TAM +NGVAE IGEEEQ AAMLLMALSSGY+S
Subjt:  GGSLKMRVVGFGREIVLHR--ATAMAENGVAEVIGEEEQAAAMLLMALSSGYLS

A0A6J1H9M3 GATA transcription factor 15-like4.0e-3565.75Show/hide
Query:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR
        + FF++     +GSTFN    S LM ISN           RTC HCRTT TPLWR GPAGPRSLCNACGIRYRKLKN+ NGGV KS  GK        +R
Subjt:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR

Query:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        VVG GREI+LHRAT AMAENG AE IGEEEQAAAMLLMALSSGY+S
Subjt:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS

A0A6J1KR45 GATA transcription factor 15-like2.8e-3666.44Show/hide
Query:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR
        + FF++     +GSTFN    S LM ISN           RTC HCRTT TPLWR GPAGPRSLCNACGIRYRKLKNN NGGV KSG GK        +R
Subjt:  LQFFMDRSDLSAGSTFNNQSSSDLMPISN----------HRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMR

Query:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS
        VVG GREI+LHR T AMAENG AE IGEEEQAAAMLLMALSSGY+S
Subjt:  VVGFGREIVLHRAT-AMAENGVAEVIGEEEQAAAMLLMALSSGYLS

SwissProt top hitse value%identityAlignment
B8AX51 GATA transcription factor 154.7e-0970.59Show/hide
Query:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK
        R C +C T +TPLWR GP GP+SLCNACGIRY+K
Subjt:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK

Q8LC79 GATA transcription factor 183.6e-0970.59Show/hide
Query:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK
        R C +C TT+TPLWR GP GP+SLCNACGIR++K
Subjt:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK

Q8LG10 GATA transcription factor 155.6e-1847.73Show/hide
Query:  MDRSDLSAGSTFNNQSSSDLMPISNH-RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNI--NGGVIKSGKGKNRG---GGSLKMRVVGFGREI
        M+    S  +   + SSS    ISN  ++C  C T+ TPLWR GPAGP+SLCNACGIR RK +  +  N    K  K  NR    G SLK R++  GRE+
Subjt:  MDRSDLSAGSTFNNQSSSDLMPISNH-RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNI--NGGVIKSGKGKNRG---GGSLKMRVVGFGREI

Query:  VLHRATAMAENGVAEVIGEEEQAAAMLLMALS
        ++ R+T  AEN     +GEEEQ AA+LLMALS
Subjt:  VLHRATAMAENGVAEVIGEEEQAAAMLLMALS

Q9FJ10 GATA transcription factor 161.2e-1243.8Show/hide
Query:  NNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK-----LKNNINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHRATAMAENGV
        NN S +D       +TC  C T+ TPLWR GP GP+SLCNACGIR RK      ++N       SG G  + G SLK  ++  G   +  R+T   +   
Subjt:  NNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK-----LKNNINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHRATAMAENGV

Query:  AEVIGEEEQAAAMLLMALSSG
         + +GEEEQ AA+LLMALS G
Subjt:  AEVIGEEEQAAAMLLMALSSG

Q9LIB5 GATA transcription factor 172.1e-0947.44Show/hide
Query:  MDRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNR
        +D  + S+  +    SS D       RTCV C T  TPLWR GPAGP+SLCNACGI+ RK K     G+    K KNR
Subjt:  MDRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNR

Arabidopsis top hitse value%identityAlignment
AT3G06740.1 GATA transcription factor 154.0e-1947.73Show/hide
Query:  MDRSDLSAGSTFNNQSSSDLMPISNH-RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNI--NGGVIKSGKGKNRG---GGSLKMRVVGFGREI
        M+    S  +   + SSS    ISN  ++C  C T+ TPLWR GPAGP+SLCNACGIR RK +  +  N    K  K  NR    G SLK R++  GRE+
Subjt:  MDRSDLSAGSTFNNQSSSDLMPISNH-RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNI--NGGVIKSGKGKNRG---GGSLKMRVVGFGREI

Query:  VLHRATAMAENGVAEVIGEEEQAAAMLLMALS
        ++ R+T  AEN     +GEEEQ AA+LLMALS
Subjt:  VLHRATAMAENGVAEVIGEEEQAAAMLLMALS

AT3G16870.1 GATA transcription factor 171.5e-1047.44Show/hide
Query:  MDRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNR
        +D  + S+  +    SS D       RTCV C T  TPLWR GPAGP+SLCNACGI+ RK K     G+    K KNR
Subjt:  MDRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNR

AT4G16141.1 GATA type zinc finger transcription factor family protein6.8e-1138.39Show/hide
Query:  DRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKN---NINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHR
        D SD+  G+  ++ S  D       +TCV C T+ TPLWR GPAGP+SLCNACGI+ RK +     I    IK  K K+     L+ R V  G+   ++ 
Subjt:  DRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKN---NINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHR

Query:  ATAMAENGVAEV
          A  E G+ ++
Subjt:  ATAMAENGVAEV

AT4G36620.1 GATA transcription factor 193.4e-1050Show/hide
Query:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGS
        R C +C TT+TPLWR GP GP+SLCNACGIR++K +   +     + +    GGGS
Subjt:  RTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGS

AT5G49300.1 GATA transcription factor 168.5e-1443.8Show/hide
Query:  NNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK-----LKNNINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHRATAMAENGV
        NN S +D       +TC  C T+ TPLWR GP GP+SLCNACGIR RK      ++N       SG G  + G SLK  ++  G   +  R+T   +   
Subjt:  NNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRK-----LKNNINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHRATAMAENGV

Query:  AEVIGEEEQAAAMLLMALSSG
         + +GEEEQ AA+LLMALS G
Subjt:  AEVIGEEEQAAAMLLMALSSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATTTCCAGTTGCAGTTCTTCATGGACCGCTCAGATCTCTCAGCTGGCTCGACCTTCAATAATCAATCATCTTCTGATCTAATGCCCATCTCCAACCACAGAAC
CTGCGTCCACTGTCGCACCACCACCACCCCCCTCTGGAGAACCGGTCCCGCCGGCCCAAGGTCGCTGTGCAATGCATGTGGGATTAGATACAGGAAACTGAAGAACAATA
TTAATGGCGGAGTGATTAAGAGCGGAAAGGGGAAGAATAGGGGAGGAGGATCGTTGAAGATGAGAGTGGTGGGTTTTGGGAGAGAGATAGTGCTGCACAGAGCTACGGCG
ATGGCGGAAAATGGAGTTGCAGAAGTGATCGGAGAAGAAGAACAGGCAGCGGCGATGCTTCTTATGGCTCTATCTTCCGGCTATTTATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATTTCCAGTTGCAGTTCTTCATGGACCGCTCAGATCTCTCAGCTGGCTCGACCTTCAATAATCAATCATCTTCTGATCTAATGCCCATCTCCAACCACAGAAC
CTGCGTCCACTGTCGCACCACCACCACCCCCCTCTGGAGAACCGGTCCCGCCGGCCCAAGGTCGCTGTGCAATGCATGTGGGATTAGATACAGGAAACTGAAGAACAATA
TTAATGGCGGAGTGATTAAGAGCGGAAAGGGGAAGAATAGGGGAGGAGGATCGTTGAAGATGAGAGTGGTGGGTTTTGGGAGAGAGATAGTGCTGCACAGAGCTACGGCG
ATGGCGGAAAATGGAGTTGCAGAAGTGATCGGAGAAGAAGAACAGGCAGCGGCGATGCTTCTTATGGCTCTATCTTCCGGCTATTTATCTTAA
Protein sequenceShow/hide protein sequence
MANFQLQFFMDRSDLSAGSTFNNQSSSDLMPISNHRTCVHCRTTTTPLWRTGPAGPRSLCNACGIRYRKLKNNINGGVIKSGKGKNRGGGSLKMRVVGFGREIVLHRATA
MAENGVAEVIGEEEQAAAMLLMALSSGYLS