CuGenDBv2

Tan0004146 (gene) of Snake gourd v1 genome

Gene ID: Tan0004146
Organism: Trichosanthes anguina (Snake gourd v1)
Description: GATA transcription factor 21-like
Genome location: LG01:112239132..112241902
RNA-Seq Expression: Tan0004146
Synteny: Tan0004146
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
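The Genome location field above uses a `seqid:start..end` layout. As a minimal sketch, the string can be split into its parts and the locus span computed. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG01:112239132..112241902")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene locus spans 2771 bp on LG01.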


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008452719.1 PREDICTED: GATA transcription factor 21-like [Cucumis melo] (e value: 3.0e-84; % identity: 55.69)
Query:  NSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTN-----ATQHDHQR---VNLHHHLLQESHHQLQHDYHQAEKSI
        N LYP    F+EKQ ++E E+ NLKLYIFSS        SSS+++SSQLAF+TCFSTN      TQH H +   ++LHHHLLQ+SHHQL   +HQ EK  
Subjt:  NSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTN-----ATQHDHQR---VNLHHHLLQESHHQLQHDYHQAEKSI

Query:  DGSRENSEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDK---KVI------GGA-NSDDHKAAT
         GSRE  E+EVILSS+S      +R   ++     D D HQ       + S + KYWMSSKMRLMQKMMIN       KVI      GGA NSD H+ AT
Subjt:  DGSRENSEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDK---KVI------GGA-NSDDHKAAT

Query:  RNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDTV-V
        RN  N+NN+G +GGK   + G++S +S   N++NI     NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ AAN   G   +T   
Subjt:  RNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDTV-V

Query:  VGSG-KEVHKEKKSRLSCDGD-----GSIVGDHVKKN-KYWNKNMTE-----NSSDSQSE--GPNDGVSFD-RHNFTLRLSKSGST---------FGRVF
          SG KE HKEKKSRLSC+GD      + VGDH K N KY NKN  +     N+  S+SE    N+ VSFD  HNFTLRLSK+ ST         FG+VF
Subjt:  VGSG-KEVHKEKKSRLSCDGD-----GSIVGDHVKKN-KYWNKNMTE-----NSSDSQSE--GPNDGVSFD-RHNFTLRLSKSGST---------FGRVF

Query:  PKDEEEAAILLMELSCGLVHTC
        P+DEEEAAILLMELSCGL+HTC
Subjt:  PKDEEEAAILLMELSCGLVHTC

XP_011654173.1 GATA transcription factor 21 [Cucumis sativus] (e value: 6.7e-84; % identity: 55.88)
Query:  FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNAT------QHDHQRVN--LHHHLLQESHHQLQHDYHQAEKSIDGSRENSEA
        F+EKQ ++E E+ NLKLYIFSSS       SSS++SSSQLAF+ CFSTN         H HQ VN  LHHHLLQ+SHHQL   +HQ EK I GSRE  E+
Subjt:  FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNAT------QHDHQRVN--LHHHLLQESHHQLQHDYHQAEKSIDGSRENSEA

Query:  EVILSSNSNLVKSSTREALERSRSRSDIDDHQE---QQLDNPNGSSAAKYWMSSKMRLMQKMMINKTD--KKVI------GGA-NSDDHKAATRNIINNN
        +VILSS+S          LE+     D +DH +   ++ D+ NGS+  KYWMSSKMRLMQKMMIN     KKV+      GGA NSD H+ ATRN  + N
Subjt:  EVILSSNSNLVKSSTREALERSRSRSDIDDHQE---QQLDNPNGSSAAKYWMSSKMRLMQKMMINKTD--KKVI------GGA-NSDDHKAATRNIINNN

Query:  NQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDT---VVVGSGK
        N+G +GGK   + G++S +S   N++NI     NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ AAN GGG   +T         K
Subjt:  NQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDT---VVVGSGK

Query:  EVHKEKKSRLSCDGD----GSIVGDHVKKN-KYWNKN-------MTENSSDSQSE--GPNDGVSF-DRHNFTLRLSKSGST---------FGRVFPKDEE
        E HKEKKSRLSC+GD     + VGD VK N KY N N          N+  S+SE    N+ VSF   HNFTLRLSK+ ST         FG+VFP+DEE
Subjt:  EVHKEKKSRLSCDGD----GSIVGDHVKKN-KYWNKN-------MTENSSDSQSE--GPNDGVSF-DRHNFTLRLSKSGST---------FGRVFPKDEE

Query:  EAAILLMELSCGLVHTC
        EAAILLMELSCGL+HTC
Subjt:  EAAILLMELSCGLVHTC

XP_022941123.1 putative GATA transcription factor 22 [Cucurbita moschata] (e value: 2.8e-82; % identity: 55.53)
Query:  MNSLYPFM---EKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSE
        MNSLYP M   E+ +E E +  +LKLYIF S+SQV AA       SS LAFATCF+++ATQHDHQR++LHHH             +Q EKSIDG      
Subjt:  MNSLYPFM---EKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSE

Query:  AEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDG
                                SR+DI DHQEQQLD       +  SAAKYWMSSKMRLMQKMM        +GG NSD   AA     NNN      
Subjt:  AEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDG

Query:  GKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTV-VVGSGKEVHKEKKSRLSCD
               GR+ C+S         NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGG    V    SGKE+HKEKKSRLS  
Subjt:  GKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTV-VVGSGKEVHKEKKSRLSCD

Query:  GDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC
        GD     + VKK K  N++M E   +SQSEG NDGV        LRLSK+GS FGRVFP+DEEEAAILLMELSCGLVHTC
Subjt:  GDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC

XP_022982396.1 putative GATA transcription factor 22 [Cucurbita maxima] (e value: 1.6e-82; % identity: 55.85)
Query:  MNSLYPFMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEV
        MNSLYP M K +EQ   + +LKLYIF S+SQV AA       SS LAFATCF+++ATQHDHQR++LHHH             +Q EK IDG         
Subjt:  MNSLYPFMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEV

Query:  ILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKI
                             SR+DI DHQEQQLD       +  SAAKYWMSSKMRLMQKMM        +GG NSD   AA     +NNN+       
Subjt:  ILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKI

Query:  WEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGS
            GR+ C+S         NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAAN GGG G   V   SGKE++KEKKSRLS  GD  
Subjt:  WEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGS

Query:  IVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC
          G+ VKK K  N+NM E   +SQSEG NDGV        LRLSK+GS FGRVFP+DEEEAAILLMELSCGLVHTC
Subjt:  IVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC

XP_038898726.1 GATA transcription factor 21-like [Benincasa hispida] (e value: 1.2e-101; % identity: 65.22)
Query:  MNSLYP---FMEKQ-DEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFST-NATQHDHQRVN--LHHHLLQESHHQLQHDYHQAEKS-IDGS
        MN LYP   F+EKQ +EQE E+ NLKLYIFSS+SQVVAAA+SS SSSSQLAF +CFST NATQHD+Q +N  LHHHLLQ+SHHQL H +H+ EKS I GS
Subjt:  MNSLYP---FMEKQ-DEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFST-NATQHDHQRVN--LHHHLLQESHHQLQHDYHQAEKS-IDGS

Query:  RENSEAEVILSSNSNLVKSSTREALERSR---SRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQG
        RE  E EVI SSN        REA ERSR    + + DDHQ  + ++ NGS  AKYWMSSKMRLMQKMMIN  +KKVI G  + DH+ ATRN  NNNN+G
Subjt:  RENSEAEVILSSNSNLVKSSTREALERSR---SRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQG

Query:  SDGGKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLS
         DGGK   + G++S +S   N     NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG   +T   G  KE+HKEKKSRL+
Subjt:  SDGGKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLS

Query:  CDGD--GSIVGDHVKKNKYWNK--NMTENSSDSQSEG--PNDGVSFD-RHNFTLRLSKSG--STFGRVFPKDEEEAAILLMELSCGLVHTC
        CDGD  G  VGD VK NK  N   + + N+  SQSE    N+ VSFD  HNFTLRLSK+   S FG+VFP+DEEEAAILLMELSCGL+H+C
Subjt:  CDGD--GSIVGDHVKKNKYWNK--NMTENSSDSQSEG--PNDGVSFD-RHNFTLRLSKSG--STFGRVFPKDEEEAAILLMELSCGLVHTC

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L3Z9 GATA-type domain-containing protein (e value: 3.2e-84; % identity: 55.88)
Query:  FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNAT------QHDHQRVN--LHHHLLQESHHQLQHDYHQAEKSIDGSRENSEA
        F+EKQ ++E E+ NLKLYIFSSS       SSS++SSSQLAF+ CFSTN         H HQ VN  LHHHLLQ+SHHQL   +HQ EK I GSRE  E+
Subjt:  FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNAT------QHDHQRVN--LHHHLLQESHHQLQHDYHQAEKSIDGSRENSEA

Query:  EVILSSNSNLVKSSTREALERSRSRSDIDDHQE---QQLDNPNGSSAAKYWMSSKMRLMQKMMINKTD--KKVI------GGA-NSDDHKAATRNIINNN
        +VILSS+S          LE+     D +DH +   ++ D+ NGS+  KYWMSSKMRLMQKMMIN     KKV+      GGA NSD H+ ATRN  + N
Subjt:  EVILSSNSNLVKSSTREALERSRSRSDIDDHQE---QQLDNPNGSSAAKYWMSSKMRLMQKMMINKTD--KKVI------GGA-NSDDHKAATRNIINNN

Query:  NQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDT---VVVGSGK
        N+G +GGK   + G++S +S   N++NI     NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ AAN GGG   +T         K
Subjt:  NQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDT---VVVGSGK

Query:  EVHKEKKSRLSCDGD----GSIVGDHVKKN-KYWNKN-------MTENSSDSQSE--GPNDGVSF-DRHNFTLRLSKSGST---------FGRVFPKDEE
        E HKEKKSRLSC+GD     + VGD VK N KY N N          N+  S+SE    N+ VSF   HNFTLRLSK+ ST         FG+VFP+DEE
Subjt:  EVHKEKKSRLSCDGD----GSIVGDHVKKN-KYWNKN-------MTENSSDSQSE--GPNDGVSF-DRHNFTLRLSKSGST---------FGRVFPKDEE

Query:  EAAILLMELSCGLVHTC
        EAAILLMELSCGL+HTC
Subjt:  EAAILLMELSCGLVHTC

A0A1S3BVA6 GATA transcription factor 21-like (e value: 1.5e-84; % identity: 55.69)
Query:  NSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTN-----ATQHDHQR---VNLHHHLLQESHHQLQHDYHQAEKSI
        N LYP    F+EKQ ++E E+ NLKLYIFSS        SSS+++SSQLAF+TCFSTN      TQH H +   ++LHHHLLQ+SHHQL   +HQ EK  
Subjt:  NSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTN-----ATQHDHQR---VNLHHHLLQESHHQLQHDYHQAEKSI

Query:  DGSRENSEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDK---KVI------GGA-NSDDHKAAT
         GSRE  E+EVILSS+S      +R   ++     D D HQ       + S + KYWMSSKMRLMQKMMIN       KVI      GGA NSD H+ AT
Subjt:  DGSRENSEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDK---KVI------GGA-NSDDHKAAT

Query:  RNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDTV-V
        RN  N+NN+G +GGK   + G++S +S   N++NI     NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ AAN   G   +T   
Subjt:  RNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNI----INGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE-AANGGGGGGGDTV-V

Query:  VGSG-KEVHKEKKSRLSCDGD-----GSIVGDHVKKN-KYWNKNMTE-----NSSDSQSE--GPNDGVSFD-RHNFTLRLSKSGST---------FGRVF
          SG KE HKEKKSRLSC+GD      + VGDH K N KY NKN  +     N+  S+SE    N+ VSFD  HNFTLRLSK+ ST         FG+VF
Subjt:  VGSG-KEVHKEKKSRLSCDGD-----GSIVGDHVKKN-KYWNKNMTE-----NSSDSQSE--GPNDGVSFD-RHNFTLRLSKSGST---------FGRVF

Query:  PKDEEEAAILLMELSCGLVHTC
        P+DEEEAAILLMELSCGL+HTC
Subjt:  PKDEEEAAILLMELSCGLVHTC

A0A6J1DKQ2 GATA transcription factor 21-like (e value: 3.4e-65; % identity: 50.39)
Query:  MNSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLA-FATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSREN
        MN LYP    FMEK +E+  E+ NLKLYIFSSS     AAS++ASSSSQL  FATCF  N T+      NLHH         L H + Q EKS  G    
Subjt:  MNSLYP----FMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLA-FATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSREN

Query:  SEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHK-----AATRNIINNNNQGS
         E EV                              E Q+        AKYWMSSKMR+MQKMM N  DKK  G   + DHK     + TRN   N+ +  
Subjt:  SEAEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHK-----AATRNIINNNNQGS

Query:  DGGKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSC
        + G  W I GR+S  SNGH      NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE A+GGGG         SGK++HKEKK R SC
Subjt:  DGGKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSC

Query:  DGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGST--FGRVFPKDEEEAAILLMELSCGLVHTC
         GD +     + KNK              S+  N  VSFD H+FTLRL KS  +  FGRVF +DEEEAAILLMELSCG +HTC
Subjt:  DGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGST--FGRVFPKDEEEAAILLMELSCGLVHTC

A0A6J1FR75 putative GATA transcription factor 22 (e value: 1.4e-82; % identity: 55.53)
Query:  MNSLYPFM---EKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSE
        MNSLYP M   E+ +E E +  +LKLYIF S+SQV AA       SS LAFATCF+++ATQHDHQR++LHHH             +Q EKSIDG      
Subjt:  MNSLYPFM---EKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSE

Query:  AEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDG
                                SR+DI DHQEQQLD       +  SAAKYWMSSKMRLMQKMM        +GG NSD   AA     NNN      
Subjt:  AEVILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDG

Query:  GKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTV-VVGSGKEVHKEKKSRLSCD
               GR+ C+S         NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGG    V    SGKE+HKEKKSRLS  
Subjt:  GKIWEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTV-VVGSGKEVHKEKKSRLSCD

Query:  GDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC
        GD     + VKK K  N++M E   +SQSEG NDGV        LRLSK+GS FGRVFP+DEEEAAILLMELSCGLVHTC
Subjt:  GDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC

A0A6J1IZ80 putative GATA transcription factor 22 (e value: 8.0e-83; % identity: 55.85)
Query:  MNSLYPFMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEV
        MNSLYP M K +EQ   + +LKLYIF S+SQV AA       SS LAFATCF+++ATQHDHQR++LHHH             +Q EK IDG         
Subjt:  MNSLYPFMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEV

Query:  ILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKI
                             SR+DI DHQEQQLD       +  SAAKYWMSSKMRLMQKMM        +GG NSD   AA     +NNN+       
Subjt:  ILSSNSNLVKSSTREALERSRSRSDIDDHQEQQLD-----NPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKI

Query:  WEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGS
            GR+ C+S         NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAAN GGG G   V   SGKE++KEKKSRLS  GD  
Subjt:  WEIRGRTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGS

Query:  IVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC
          G+ VKK K  N+NM E   +SQSEG NDGV        LRLSK+GS FGRVFP+DEEEAAILLMELSCGLVHTC
Subjt:  IVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC

SwissProt top hits (columns: e value, % identity, alignment)
Q5HZ36 GATA transcription factor 21 (e value: 1.1e-23; % identity: 31.04)
Query:  GEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEVILSSNSNLVKSSTRE
        G  ++L  +      QV + +SSS+SS S L+    F  N+ +  H   N  +H     H  L           +G           + +  + K  TR 
Subjt:  GEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEVILSSNSNLVKSSTRE

Query:  ALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINN-NNQGSDGGKIWEIRG--------------
         L   +   +   H   Q      S + K+ MS KMRL++K + N  +K++I   N+++HK +    +N+  N   D  +    +               
Subjt:  ALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINN-NNQGSDGGKIWEIRG--------------

Query:  RTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG------------------------GGGDTVVVGS
          + N NG++NNN +  +RVCSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA   AA   G                           G      S
Subjt:  RTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG------------------------GGGDTVVVGS

Query:  GKEVHKEKKSRLSCDGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH
           V K KK ++  + +  +  + V  +   +K+ T ++S   S        F   + T+ LSKS S + +VFP+DE+EAA+LLM LS G+VH
Subjt:  GKEVHKEKKSRLSCDGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH

Q6YW48 Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 (e value: 4.6e-19; % identity: 33.45)
Query:  RSRSDIDDHQEQQLDNPNGSSAAKYWMSS---KMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNN--NNII
        RS  D  D +     + NGS++   WMS+   KMR+++K            GA +D                 +GG + + R R   + +         +
Subjt:  RSRSDIDDHQEQQLDNPNGSSAAKYWMSS---KMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNN--NNII

Query:  NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG-GGGDTVVVGSGKEVHKEKKSRLSCDGDGSI-------VGDHVKKNKYW
          VRVCSDCNTT TPLWRSGP GPKSLCNACGIRQRKARRAMA AANGG       +V           KK + + D D S+       + DHV      
Subjt:  NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG-GGGDTVVVGSGKEVHKEKKSRLSCDGDGSI-------VGDHVKKNKYW

Query:  NKNMTENSSDSQSEGPNDGV-----------------SFDRHNFTLRLSKSGSTFGRVFPKDE-EEAAILLMELSCGLVHT
         K        + +    D V                    +   T   + +   F    P+DE  +AA+LLM LSCGLVH+
Subjt:  NKNMTENSSDSQSEGPNDGV-----------------SFDRHNFTLRLSKSGSTFGRVFPKDE-EEAAILLMELSCGLVHT

Q9FJ10 GATA transcription factor 16 (e value: 1.7e-10; % identity: 50.79)
Query:  NNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRA-------MAEAANGGG
        NN ++ +  + C+DC T+ TPLWR GP GPKSLCNACGIR RK RR        + ++++GGG
Subjt:  NNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRA-------MAEAANGGG

Q9LIB5 GATA transcription factor 17 (e value: 2.4e-12; % identity: 34.78)
Query:  SCNSNGHNNNNIINGV-RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGSIVGDHV
        +C+S+G    +      R C DC T  TPLWR GP GPKSLCNACGI+ RK R+A                 +G   E  K+K  + +C+ D ++  DH 
Subjt:  SCNSNGHNNNNIINGV-RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGSIVGDHV

Query:  KKNKYWNKNMTE------------NSSDSQSEGPNDGVS-FDRHNFTLRLSK-SGSTFGRVFPK--DEEEAAILLMELSCGLVH
           KY    + +            N+  S S   N GVS F    F + + K S     R++ K  +EE AA+LLM LSC  V+
Subjt:  KKNKYWNKNMTE------------NSSDSQSEGPNDGVS-FDRHNFTLRLSK-SGSTFGRVFPK--DEEEAAILLMELSCGLVH

Q9SZI6 Putative GATA transcription factor 22 (e value: 1.1e-25; % identity: 34.53)
Query:  NLHHHLLQESHHQLQHDYHQAEKSIDGSRENS-----------EAEVILSSNSNLV----------------------KSSTREALERSRSRSDI-----
        +LHHH LQ+   Q QH +HQA  +       S           + +V +  N+N                         SS+ + + +  +R  +     
Subjt:  NLHHHLLQESHHQLQHDYHQAEKSIDGSRENS-----------EAEVILSSNSNLV----------------------KSSTREALERSRSRSDI-----

Query:  DDHQEQ------QLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNIINGVRV
        D+HQ+Q       + +  G+++ K W+SSK+RLM+K       KK I    SD  K  T N  ++N   S+               NG+NN+ +I   R+
Subjt:  DDHQEQ------QLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNIINGVRV

Query:  CSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEK---KSRLSCDGDGSIVGDHVKKNKYWNKNMT-----
        CSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA    A         T V G    V K+K   K+++S +G   I+     K     + +T     
Subjt:  CSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEK---KSRLSCDGDGSIVGDHVKKNKYWNKNMT-----

Query:  -----ENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH
             E  S+S     +D + FD  +  L LSKS S + +VFP+DE+EAAILLM LS G+VH
Subjt:  -----ENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G16870.1 GATA transcription factor 17 (e value: 1.7e-13; % identity: 34.78)
Query:  SCNSNGHNNNNIINGV-RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGSIVGDHV
        +C+S+G    +      R C DC T  TPLWR GP GPKSLCNACGI+ RK R+A                 +G   E  K+K  + +C+ D ++  DH 
Subjt:  SCNSNGHNNNNIINGV-RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGSIVGDHV

Query:  KKNKYWNKNMTE------------NSSDSQSEGPNDGVS-FDRHNFTLRLSK-SGSTFGRVFPK--DEEEAAILLMELSCGLVH
           KY    + +            N+  S S   N GVS F    F + + K S     R++ K  +EE AA+LLM LSC  V+
Subjt:  KKNKYWNKNMTE------------NSSDSQSEGPNDGVS-FDRHNFTLRLSK-SGSTFGRVFPK--DEEEAAILLMELSCGLVH

AT4G26150.1 cytokinin-responsive gata factor 1 (e value: 8.0e-27; % identity: 34.53)
Query:  NLHHHLLQESHHQLQHDYHQAEKSIDGSRENS-----------EAEVILSSNSNLV----------------------KSSTREALERSRSRSDI-----
        +LHHH LQ+   Q QH +HQA  +       S           + +V +  N+N                         SS+ + + +  +R  +     
Subjt:  NLHHHLLQESHHQLQHDYHQAEKSIDGSRENS-----------EAEVILSSNSNLV----------------------KSSTREALERSRSRSDI-----

Query:  DDHQEQ------QLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNIINGVRV
        D+HQ+Q       + +  G+++ K W+SSK+RLM+K       KK I    SD  K  T N  ++N   S+               NG+NN+ +I   R+
Subjt:  DDHQEQ------QLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNIINGVRV

Query:  CSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEK---KSRLSCDGDGSIVGDHVKKNKYWNKNMT-----
        CSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA    A         T V G    V K+K   K+++S +G   I+     K     + +T     
Subjt:  CSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEK---KSRLSCDGDGSIVGDHVKKNKYWNKNMT-----

Query:  -----ENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH
             E  S+S     +D + FD  +  L LSKS S + +VFP+DE+EAAILLM LS G+VH
Subjt:  -----ENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH

AT4G36620.1 GATA transcription factor 19 (e value: 1.6e-11; % identity: 51.52)
Query:  SCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGG
        S    G   +N++   R C++C+TT+TPLWR+GP+GPKSLCNACGIR +K  R  + A N   GGG
Subjt:  SCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGG

AT5G49300.1 GATA transcription factor 16 (e value: 1.2e-11; % identity: 50.79)
Query:  NNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRA-------MAEAANGGG
        NN ++ +  + C+DC T+ TPLWR GP GPKSLCNACGIR RK RR        + ++++GGG
Subjt:  NNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRA-------MAEAANGGG

AT5G56860.1 GATA type zinc finger transcription factor family protein (e value: 7.5e-25; % identity: 31.04)
Query:  GEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEVILSSNSNLVKSSTRE
        G  ++L  +      QV + +SSS+SS S L+    F  N+ +  H   N  +H     H  L           +G           + +  + K  TR 
Subjt:  GEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEVILSSNSNLVKSSTRE

Query:  ALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINN-NNQGSDGGKIWEIRG--------------
         L   +   +   H   Q      S + K+ MS KMRL++K + N  +K++I   N+++HK +    +N+  N   D  +    +               
Subjt:  ALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINN-NNQGSDGGKIWEIRG--------------

Query:  RTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG------------------------GGGDTVVVGS
          + N NG++NNN +  +RVCSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA   AA   G                           G      S
Subjt:  RTSCNSNGHNNNNIINGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGG------------------------GGGDTVVVGS

Query:  GKEVHKEKKSRLSCDGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH
           V K KK ++  + +  +  + V  +   +K+ T ++S   S        F   + T+ LSKS S + +VFP+DE+EAA+LLM LS G+VH
Subjt:  GKEVHKEKKSRLSCDGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFDRHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVH


Sequences
CDS sequence
ATGAATTCTCTCTATCCATTCATGGAGAAACAAGATGAACAAGAAGGAGAAGAAACCAATCTTAAACTCTATATTTTCTCCTCTAGCTCTCAAGTTGTAGCTGCTGCTTC
TTCTTCTGCTTCTTCTTCTTCACAGCTTGCTTTTGCCACTTGCTTTAGTACTAATGCCACTCAACATGATCATCAAAGAGTGAATCTTCATCATCATCTTCTTCAAGAGT
CTCACCACCAACTTCAGCATGATTATCATCAGGCTGAAAAATCCATTGATGGATCAAGAGAAAACAGTGAAGCTGAAGTGATACTTTCATCTAACTCTAATTTGGTGAAA
TCTAGCACAAGGGAGGCCTTAGAAAGATCAAGATCAAGATCAGATATTGATGATCATCAAGAACAACAACTAGATAATCCTAATGGATCATCAGCTGCAAAATACTGGAT
GTCTTCAAAGATGAGACTGATGCAAAAGATGATGATAAACAAGACAGATAAGAAAGTCATTGGGGGAGCAAATTCAGATGATCACAAGGCTGCAACAAGAAACATTATTA
ATAATAACAATCAAGGATCAGATGGTGGAAAAATATGGGAAATTAGAGGAAGAACATCTTGTAATTCAAATGGACATAACAATAACAACATTATTAATGGTGTTAGGGTT
TGCAGTGATTGTAACACCACAACAACTCCTCTTTGGCGAAGTGGTCCTCAAGGCCCTAAGTCGCTATGCAATGCGTGTGGGATCCGACAGAGGAAGGCAAGGCGAGCCAT
GGCGGAAGCTGCAAATGGGGGCGGTGGAGGCGGAGGCGATACGGTCGTTGTTGGTTCGGGGAAGGAGGTGCACAAGGAGAAGAAATCCCGCTTGAGCTGCGATGGAGATG
GTTCCATTGTTGGTGATCATGTGAAGAAGAACAAGTACTGGAATAAGAATATGACAGAAAACTCGTCGGACTCTCAAAGTGAAGGGCCGAACGACGGCGTTTCTTTTGAT
CGCCATAACTTCACTTTACGTTTGAGCAAAAGTGGTTCAACTTTTGGGAGAGTGTTTCCAAAAGATGAGGAAGAAGCAGCCATTCTTTTGATGGAGCTTTCTTGTGGCCT
TGTTCACACCTGTTGA
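As a quick consistency check between the CDS and protein sequences listed in this record, the sketch below translates the first 30 and last 12 nucleotides of the CDS with the standard genetic code. The names `CODON_TABLE`, `translate`, `cds_start`, and `cds_end` are illustrative, and the two fragments were copied by hand from the CDS above; this is not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# In-frame fragments taken from the CDS above (first 30 nt, last 12 nt).
cds_start = "ATGAATTCTCTCTATCCATTCATGGAGAAA"
cds_end = "CACACCTGTTGA"

print(translate(cds_start))  # MNSLYPFMEK
print(translate(cds_end))    # HTC*
```

Both results agree with the Protein sequence in this record, which begins MNSLYPFMEK and ends HTC; the final TGA is the stop codon.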
mRNA sequence
TTTTTTTCCTCTCTCTTCTCTTCTCCCTTTATAAGCCCTTCCCATCAAGTACTTTTGTTTTATTTCATTACCCTATCACATATTGCCCACTTTTCTGGACACAAATAATC
AACCCATTCTCTTTCTATTTCTCTAGATCTCTCCTATGAATTCTCTCTATCCATTCATGGAGAAACAAGATGAACAAGAAGGAGAAGAAACCAATCTTAAACTCTATATT
TTCTCCTCTAGCTCTCAAGTTGTAGCTGCTGCTTCTTCTTCTGCTTCTTCTTCTTCACAGCTTGCTTTTGCCACTTGCTTTAGTACTAATGCCACTCAACATGATCATCA
AAGAGTGAATCTTCATCATCATCTTCTTCAAGAGTCTCACCACCAACTTCAGCATGATTATCATCAGGCTGAAAAATCCATTGATGGATCAAGAGAAAACAGTGAAGCTG
AAGTGATACTTTCATCTAACTCTAATTTGGTGAAATCTAGCACAAGGGAGGCCTTAGAAAGATCAAGATCAAGATCAGATATTGATGATCATCAAGAACAACAACTAGAT
AATCCTAATGGATCATCAGCTGCAAAATACTGGATGTCTTCAAAGATGAGACTGATGCAAAAGATGATGATAAACAAGACAGATAAGAAAGTCATTGGGGGAGCAAATTC
AGATGATCACAAGGCTGCAACAAGAAACATTATTAATAATAACAATCAAGGATCAGATGGTGGAAAAATATGGGAAATTAGAGGAAGAACATCTTGTAATTCAAATGGAC
ATAACAATAACAACATTATTAATGGTGTTAGGGTTTGCAGTGATTGTAACACCACAACAACTCCTCTTTGGCGAAGTGGTCCTCAAGGCCCTAAGTCGCTATGCAATGCG
TGTGGGATCCGACAGAGGAAGGCAAGGCGAGCCATGGCGGAAGCTGCAAATGGGGGCGGTGGAGGCGGAGGCGATACGGTCGTTGTTGGTTCGGGGAAGGAGGTGCACAA
GGAGAAGAAATCCCGCTTGAGCTGCGATGGAGATGGTTCCATTGTTGGTGATCATGTGAAGAAGAACAAGTACTGGAATAAGAATATGACAGAAAACTCGTCGGACTCTC
AAAGTGAAGGGCCGAACGACGGCGTTTCTTTTGATCGCCATAACTTCACTTTACGTTTGAGCAAAAGTGGTTCAACTTTTGGGAGAGTGTTTCCAAAAGATGAGGAAGAA
GCAGCCATTCTTTTGATGGAGCTTTCTTGTGGCCTTGTTCACACCTGTTGAGACACTTTTCAAAAATTTTCCCACTGAATCACCTCGTTTTTTCTTTTTCTTTCGTTTCT
CCTTGAGAAAATCAATGTGTTCTTTTTTGTTTTTTTTTTAGTGTGCATGTCGAAGATTTTGCAGCTTTGTGATTGAATATTTAAGCAATGATAATAAAGCAAAAGTTTTT
C
Protein sequence
MNSLYPFMEKQDEQEGEETNLKLYIFSSSSQVVAAASSSASSSSQLAFATCFSTNATQHDHQRVNLHHHLLQESHHQLQHDYHQAEKSIDGSRENSEAEVILSSNSNLVK
SSTREALERSRSRSDIDDHQEQQLDNPNGSSAAKYWMSSKMRLMQKMMINKTDKKVIGGANSDDHKAATRNIINNNNQGSDGGKIWEIRGRTSCNSNGHNNNNIINGVRV
CSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAANGGGGGGGDTVVVGSGKEVHKEKKSRLSCDGDGSIVGDHVKKNKYWNKNMTENSSDSQSEGPNDGVSFD
RHNFTLRLSKSGSTFGRVFPKDEEEAAILLMELSCGLVHTC
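The InterPro annotation (IPR000679, Zinc finger, GATA-type) can be illustrated by searching the protein for a four-cysteine C-X2-C-X18-C-X2-C pattern. The sketch below is only a rough regular-expression check, not the InterPro signature itself; the 18-residue loop length is an assumption based on the spacing commonly described for plant GATA factors, and `fragment` is a short stretch copied by hand from the Protein sequence above.

```python
import re

# Rough pattern for a GATA-type zinc finger: C-X2-C-X18-C-X2-C.
# Assumption: 18-residue loop; this is a simplification, not the InterPro model.
GATA_FINGER = re.compile(r"C.{2}C.{18}C.{2}C")

# Fragment of the Tan0004146 protein sequence around the conserved region.
fragment = "NGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAE"

match = GATA_FINGER.search(fragment)
if match is not None:
    print(match.start(), match.group())
```

The matched stretch, CSDCNTTTTPLWRSGPQGPKSLCNAC, lies within the region that is most conserved across the alignments in the Homology section.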