CuGenDBv2

Sgr027802 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027802
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: GATA transcription factor 21-like
Genome location: tig00153055:2712963..2715092
RNA-Seq Expression: Sgr027802
Synteny: Sgr027802
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0008270 - zinc ion binding (molecular function)
    GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains: IPR000679 - Zinc finger, GATA-type
    IPR013088 - Zinc finger, NHR/GATA-type
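The genome location above follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'contig:start..end' string."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00153055:2712963..2715092'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153055:2712963..2715092")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under that assumption the gene spans 2130 bp on the contig, which is longer than the 834-nt CDS listed under Sequences, consistent with the gene model containing introns.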


Homology
GenBank top hits
KAG6608378.1 putative GATA transcription factor 22, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.5e-56 | % identity: 61.57
Query:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMI----NTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTT
        +Q+L++    K DN S AKYWMSSKMRLMQKMM     N+D+ AAAA       A+   R+  N G          RS CS     +NGVRVCSDCNTTT
Subjt:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMI----NTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTT

Query:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAAL--GKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVS
        TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAA+GGGGG  VVVA+ AA   GK+LHKEKKSR S HG+D     VK K      M +   +SQSEG   
Subjt:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAAL--GKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVS

Query:  FDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
          ++  LRLSK  SAFGRVFPRDEEEAAILLMELSCGLVH+C
Subjt:  FDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC

KAG7037720.1 putative GATA transcription factor 22, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.2e-56 | % identity: 61.83
Query:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMI----NTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTT
        +Q+L++    K DN S AKYWMSSKMRLMQKMM     N+D+ AAAA       A+   R+  N G          RS CS     +NGVRVCSDCNTTT
Subjt:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMI----NTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTT

Query:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAAL-GKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSF
        TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAA+GGGGG  VVVA+ AA  GK+LHKEKKSR S HG+D     VK K      M +   +SQSEG    
Subjt:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAAL-GKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSF

Query:  DHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
         ++  LRLSK  SAFGRVFPRDEEEAAILLMELSCGLVH+C
Subjt:  DHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC

XP_022154815.1 GATA transcription factor 21-like [Momordica charantia] | e value: 7.8e-61 | % identity: 63.75
Query:  DQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKP--ASPNTRN---KQNFGNHRHGKWEI--TRSSCSNGGHNNNGVRVCSDCNTTT
        ++E+EDQ     ++AKYWMSSKMR+MQKMM N DKK+ A  +  HKP  +S  TRN         +  GKW I    S+CSNG   +NGVRVCSDCNTTT
Subjt:  DQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKP--ASPNTRN---KQNFGNHRHGKWEI--TRSSCSNGGHNNNGVRVCSDCNTTT

Query:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFD-HN
        TPLWRSGPQGPKSLCNACGIRQRKARRAMAE ASGGGG       SEAA GKQLHKEKK RTSCHG+ A +  VKNK        SD Q+  +VSFD H+
Subjt:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFD-HN

Query:  FTLRLSKRSS--AFGRVFPRDEEEAAILLMELSCGLVHSC
        FTLRL K S   AFGRVF RDEEEAAILLMELSCG +H+C
Subjt:  FTLRLSKRSS--AFGRVFPRDEEEAAILLMELSCGLVHSC

XP_022941123.1 putative GATA transcription factor 22 [Cucurbita moschata] | e value: 4.4e-56 | % identity: 62.29
Query:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW
        +Q+L++    K DN S AKYWMSSKMRLMQKMM     K    NSD    A+   R+  N G          RS CS     +NGVRVCSDCNTTTTPLW
Subjt:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW

Query:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSFDHNFT
        RSGPQGPKSLCNACGIRQRKARRAMAEAA+GGGGG  VVVA+ AA GK+LHKEKKSR S HG+D     VK K      M +   +SQSEG     ++  
Subjt:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSFDHNFT

Query:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
        LRLSK  SAFGRVFPRDEEEAAILLMELSCGLVH+C
Subjt:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC

XP_038898726.1 GATA transcription factor 21-like [Benincasa hispida] | e value: 7.5e-64 | % identity: 59.39
Query:  KAEKSIGGISNEGCQVIFSST---SLLKSSVGSCKPDNDQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKA---AAANSDHHKPASPNTRNKQNFG
        K EKSI G S E  +VIFSS    +  +S +   K ++D     + +  N SAKYWMSSKMRLMQKMMINT+ K      ANSDH + A+ N  N  N G
Subjt:  KAEKSIGGISNEGCQVIFSST---SLLKSSVGSCKPDNDQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKA---AAANSDHHKPASPNTRNKQNFG

Query:  NHRHGKWEI-----TRSSCS-NGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKS
        +   GKWE      + SSC+ N G  NNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAA+GGGG    VV +  +  K++HKEKKS
Subjt:  NHRHGKWEI-----TRSSCS-NGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKS

Query:  RTSCHG--NDAIVAHVK-NKCSKMAD----NSSDSQSE-----GNVSFD--HNFTLRLSKRS--SAFGRVFPRDEEEAAILLMELSCGLVHSC
        R +C G  N   V  VK NKC+   D    N+  SQSE       VSFD  HNFTLRLSK S  SAFG+VFPRDEEEAAILLMELSCGL+HSC
Subjt:  RTSCHG--NDAIVAHVK-NKCSKMAD----NSSDSQSE-----GNVSFD--HNFTLRLSKRS--SAFGRVFPRDEEEAAILLMELSCGLVHSC

TrEMBL top hits
A0A0A0L3Z9 GATA-type domain-containing protein | e value: 2.4e-55 | % identity: 50.91
Query:  EKSIGGISNEGCQVIFSSTS---------LLKSSVGSCKPDNDQELEDQSKLD--NRSAKYWMSSKMRLMQKMMINTD---KK--------AAAANSDHH
        EK IGG   E  +VI SS+S         LL+  V   + DND       K D  N S KYWMSSKMRLMQKMMINT+   KK          A NSDHH
Subjt:  EKSIGGISNEGCQVIFSSTS---------LLKSSVGSCKPDNDQELEDQSKLD--NRSAKYWMSSKMRLMQKMMINTD---KK--------AAAANSDHH

Query:  KPASPNTRNKQNFGNHRHGKWE----------ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSV
        + A+ N  +  N GN   GKWE          I+ +S + G   NNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ A+   GGG V
Subjt:  KPASPNTRNKQNFGNHRHGKWE----------ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSV

Query:  VVASEAAL---GKQLHKEKKSRTSCH-----GNDAIVAHVK--NKCSKMADNSSDSQSEGN--------------VSF--DHNFTLRLSKRS--------
           +EAA     K+ HKEKKSR SC      GN+ +   VK  +K     +N+ D  S  N              VSF   HNFTLRLSK +        
Subjt:  VVASEAAL---GKQLHKEKKSRTSCH-----GNDAIVAHVK--NKCSKMADNSSDSQSEGN--------------VSF--DHNFTLRLSKRS--------

Query:  -SAFGRVFPRDEEEAAILLMELSCGLVHSC
         SAFG+VFPRDEEEAAILLMELSCGL+H+C
Subjt:  -SAFGRVFPRDEEEAAILLMELSCGLVHSC

A0A1S3BVA6 GATA transcription factor 21-like | e value: 1.2e-54 | % identity: 50.46
Query:  EKSIGGISNEGCQVIFSSTS--------LLKSSVGSCKPDNDQELEDQSKLDNR--SAKYWMSSKMRLMQKMMINTDKK------------AAAANSDHH
        EK  GG   E  +VI SS+S        LL+  V     DND       K D+   S KYWMSSKMRLMQKMMINT+                A NSDHH
Subjt:  EKSIGGISNEGCQVIFSSTS--------LLKSSVGSCKPDNDQELEDQSKLDNR--SAKYWMSSKMRLMQKMMINTDKK------------AAAANSDHH

Query:  KPASPNTRNKQNFGNHRHGKWE----------ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSV
        + A+ N  N  N GN   GKWE          I+ +S + G   NNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMA+ A+    G   
Subjt:  KPASPNTRNKQNFGNHRHGKWE----------ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSV

Query:  VVASEAALG-KQLHKEKKSRTSCH------GNDAIVAHVK---NKCSKMAD------NSSDSQSE-----GNVSFD--HNFTLRLSKRS---------SA
             AA G K+ HKEKKSR SC       GN+ +  H K     C+K  D      N+  S+SE       VSFD  HNFTLRLSK +         SA
Subjt:  VVASEAALG-KQLHKEKKSRTSCH------GNDAIVAHVK---NKCSKMAD------NSSDSQSE-----GNVSFD--HNFTLRLSKRS---------SA

Query:  FGRVFPRDEEEAAILLMELSCGLVHSC
        FG+VFPRDEEEAAILLMELSCGL+H+C
Subjt:  FGRVFPRDEEEAAILLMELSCGLVHSC

A0A6J1DKQ2 GATA transcription factor 21-like | e value: 3.8e-61 | % identity: 63.75
Query:  DQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKP--ASPNTRN---KQNFGNHRHGKWEI--TRSSCSNGGHNNNGVRVCSDCNTTT
        ++E+EDQ     ++AKYWMSSKMR+MQKMM N DKK+ A  +  HKP  +S  TRN         +  GKW I    S+CSNG   +NGVRVCSDCNTTT
Subjt:  DQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKP--ASPNTRN---KQNFGNHRHGKWEI--TRSSCSNGGHNNNGVRVCSDCNTTT

Query:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFD-HN
        TPLWRSGPQGPKSLCNACGIRQRKARRAMAE ASGGGG       SEAA GKQLHKEKK RTSCHG+ A +  VKNK        SD Q+  +VSFD H+
Subjt:  TPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFD-HN

Query:  FTLRLSKRSS--AFGRVFPRDEEEAAILLMELSCGLVHSC
        FTLRL K S   AFGRVF RDEEEAAILLMELSCG +H+C
Subjt:  FTLRLSKRSS--AFGRVFPRDEEEAAILLMELSCGLVHSC

A0A6J1FR75 putative GATA transcription factor 22 | e value: 2.1e-56 | % identity: 62.29
Query:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW
        +Q+L++    K DN S AKYWMSSKMRLMQKMM     K    NSD    A+   R+  N G          RS CS     +NGVRVCSDCNTTTTPLW
Subjt:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW

Query:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSFDHNFT
        RSGPQGPKSLCNACGIRQRKARRAMAEAA+GGGGG  VVVA+ AA GK+LHKEKKSR S HG+D     VK K      M +   +SQSEG     ++  
Subjt:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSK---MADNSSDSQSEGNVSFDHNFT

Query:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
        LRLSK  SAFGRVFPRDEEEAAILLMELSCGLVH+C
Subjt:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC

A0A6J1IZ80 putative GATA transcription factor 22 | e value: 5.3e-55 | % identity: 61.44
Query:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW
        +Q+L++    K DN S AKYWMSSKMRLMQKMM     K    NSD    A+   R+  N G          RS CS     +NGVRVCSDCNTTTTPLW
Subjt:  DQELEDQ--SKLDNRS-AKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLW

Query:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCS---KMADNSSDSQSEGNVSFDHNFT
        RSGPQGPKSLCNACGIRQRKARRAMAEAA+ GGG G+VVVA  AA GK+L+KEKKSR S HG+D     VK K      M +   +SQSEG     ++  
Subjt:  RSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCS---KMADNSSDSQSEGNVSFDHNFT

Query:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
        LRLSK  SAFGRVFPRDEEEAAILLMELSCGLVH+C
Subjt:  LRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC

SwissProt top hits
Q5HZ36 GATA transcription factor 21 | e value: 8.7e-31 | % identity: 41.42
Query:  EDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRN-KQNFGNHRHG----KWEITRSSCS------------NGGHNNNGV-RVC
        ++ +K D+ S K+ MS KMRL++K + N  +     N+++HK +     N K NF    H     K  +TR + +            NG  NNNGV RVC
Subjt:  EDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRN-KQNFGNHRHG----KWEITRSSCS------------NGGHNNNGV-RVC

Query:  SDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVA---SEAALGKQLHKEKKSRT---SCHGNDAIVAHVK------------
        SDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA A AA+   G   V VA    +  L K+L  +KK        + +  +VA  K            
Subjt:  SDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVA---SEAALGKQLHKEKKSRT---SCHGNDAIVAHVK------------

Query:  --------NKCSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH
                ++ SK   +S+ S S     FD + T+ LSK SSA+ +VFP+DE+EAA+LLM LS G+VH
Subjt:  --------NKCSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH

Q6QPM2 GATA transcription factor 19 | e value: 7.7e-11 | % identity: 43.18
Query:  NTRNKQNFGNHRHGK--WEITRSSCSNGGHNNNGV--RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGS
        N  +++ F +H      W+    S   GG   + +  R C++C+TT+TPLWR+GP+GPKSLCNACGIR +K  R  + A +   GGGS
Subjt:  NTRNKQNFGNHRHGK--WEITRSSCSNGGHNNNGV--RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGS

Q6YW48 Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 | e value: 1.7e-18 | % identity: 35.18
Query:  NRSAKYWMSS---KMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLC
        N S   WMS+   KMR+++K    TD +  A      +  +    ++Q                          VRVCSDCNTT TPLWRSGP GPKSLC
Subjt:  NRSAKYWMSS---KMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLC

Query:  NACGIRQRKARRAMAEAASGGGG---GGSVVVA------------SEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFDHN-
        NACGIRQRKARRAMA AA+GG       SV  A              A + + L  +K+ +   H   A+ A       ++   +   Q    V    N 
Subjt:  NACGIRQRKARRAMAEAASGGGG---GGSVVVA------------SEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSKMADNSSDSQSEGNVSFDHN-

Query:  ---------------FTLRLSKRSSAFGRVFPRDE-EEAAILLMELSCGLVHS
                        T   +  S AF    PRDE  +AA+LLM LSCGLVHS
Subjt:  ---------------FTLRLSKRSSAFGRVFPRDE-EEAAILLMELSCGLVHS

Q9LIB5 GATA transcription factor 17 | e value: 1.8e-15 | % identity: 34.78
Query:  SSCSNGGHNNNG--VRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVK
        SS  +GG +++G   R C DC T  TPLWR GP GPKSLCNACGI+ RK R                    +AALG +  ++KK+R S   ND  + H  
Subjt:  SSCSNGGHNNNG--VRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVK

Query:  NK--------------------CSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCGLVHS
         K                    C+    +SS S    +   D  F + + KRS+   +   R   +EE AA+LLM LSC  V++
Subjt:  NK--------------------CSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCGLVHS

Q9SZI6 Putative GATA transcription factor 22 | e value: 4.0e-28 | % identity: 37.82
Query:  GISNEGCQVIFSSTSLLKSSVGSCKPDNDQELEDQSK-----LDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWE
        G S+   Q++    + LK ++   K DN Q+  D  +     +   ++  W+SSK+RLM+K      KKA    SD  K    +T N Q+          
Subjt:  GISNEGCQVIFSSTSLLKSSVGSCKPDNDQELEDQSK-----LDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWE

Query:  ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHV
        ++ S   NG +N+  +R+CSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA    A+     G     S   + K++  + K     +   + +   
Subjt:  ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHV

Query:  KNKCSKM--------------ADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH
         N C +M                NS+   S  N+ FD +  L LSK SSA+ +VFP+DE+EAAILLM LS G+VH
Subjt:  KNKCSKM--------------ADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH

Arabidopsis top hits
AT3G16870.1 GATA transcription factor 17 | e value: 1.3e-16 | % identity: 34.78
Query:  SSCSNGGHNNNG--VRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVK
        SS  +GG +++G   R C DC T  TPLWR GP GPKSLCNACGI+ RK R                    +AALG +  ++KK+R S   ND  + H  
Subjt:  SSCSNGGHNNNG--VRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVK

Query:  NK--------------------CSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCGLVHS
         K                    C+    +SS S    +   D  F + + KRS+   +   R   +EE AA+LLM LSC  V++
Subjt:  NK--------------------CSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCGLVHS

AT4G16141.1 GATA type zinc finger transcription factor family protein | e value: 9.0e-15 | % identity: 34.81
Query:  EITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSR--------TSCH
        ++   +CS+ G   +  + C DC T+ TPLWR GP GPKSLCNACGI+ RK R+A   A         +   S   LG +    K  +          C 
Subjt:  EITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSR--------TSCH

Query:  GNDAIVA-----HVKNKCSKMADNSSDS-QSEGNVS-----FDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCG
             +A     +VKNK  +  +NSS S  ++ NV       D  F +   KRS+   +   R   +EE AA+LLM LSCG
Subjt:  GNDAIVA-----HVKNKCSKMADNSSDS-QSEGNVS-----FDHNFTLRLSKRSSAFGRVFPR---DEEEAAILLMELSCG

AT4G26150.1 cytokinin-responsive gata factor 1 | e value: 2.9e-29 | % identity: 37.82
Query:  GISNEGCQVIFSSTSLLKSSVGSCKPDNDQELEDQSK-----LDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWE
        G S+   Q++    + LK ++   K DN Q+  D  +     +   ++  W+SSK+RLM+K      KKA    SD  K    +T N Q+          
Subjt:  GISNEGCQVIFSSTSLLKSSVGSCKPDNDQELEDQSK-----LDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRHGKWE

Query:  ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHV
        ++ S   NG +N+  +R+CSDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA    A+     G     S   + K++  + K     +   + +   
Subjt:  ITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHV

Query:  KNKCSKM--------------ADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH
         N C +M                NS+   S  N+ FD +  L LSK SSA+ +VFP+DE+EAAILLM LS G+VH
Subjt:  KNKCSKM--------------ADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH

AT4G36620.1 GATA transcription factor 19 | e value: 5.4e-12 | % identity: 43.18
Query:  NTRNKQNFGNHRHGK--WEITRSSCSNGGHNNNGV--RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGS
        N  +++ F +H      W+    S   GG   + +  R C++C+TT+TPLWR+GP+GPKSLCNACGIR +K  R  + A +   GGGS
Subjt:  NTRNKQNFGNHRHGK--WEITRSSCSNGGHNNNGV--RVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGS

AT5G56860.1 GATA type zinc finger transcription factor family protein | e value: 6.2e-32 | % identity: 41.42
Query:  EDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRN-KQNFGNHRHG----KWEITRSSCS------------NGGHNNNGV-RVC
        ++ +K D+ S K+ MS KMRL++K + N  +     N+++HK +     N K NF    H     K  +TR + +            NG  NNNGV RVC
Subjt:  EDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRN-KQNFGNHRHG----KWEITRSSCS------------NGGHNNNGV-RVC

Query:  SDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVA---SEAALGKQLHKEKKSRT---SCHGNDAIVAHVK------------
        SDCNTT TPLWRSGP+GPKSLCNACGIRQRKARRA A AA+   G   V VA    +  L K+L  +KK        + +  +VA  K            
Subjt:  SDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVA---SEAALGKQLHKEKKSRT---SCHGNDAIVAHVK------------

Query:  --------NKCSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH
                ++ SK   +S+ S S     FD + T+ LSK SSA+ +VFP+DE+EAA+LLM LS G+VH
Subjt:  --------NKCSKMADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVH


Sequences
CDS sequence
ATGGATGAAAAAGAAGTTATATTTGCGATTGGTGACAAGGCAGAGAAATCCATTGGTGGAATATCAAACGAAGGCTGTCAAGTGATATTTTCATCTACCTCTTTGCTGAA
GTCTAGCGTTGGTAGCTGCAAACCCGATAATGATCAAGAACTAGAGGATCAAAGCAAACTCGATAATAGGTCGGCGAAGTACTGGATGTCTTCGAAGATGAGATTGATGC
AGAAGATGATGATTAACACAGACAAGAAAGCTGCTGCTGCAAATTCAGATCATCACAAGCCAGCGTCGCCTAACACAAGAAACAAACAGAATTTTGGGAATCACCGACAT
GGGAAATGGGAAATTACCAGGTCGTCTTGTTCAAACGGCGGCCATAACAATAATGGTGTTAGGGTTTGCAGTGATTGTAACACCACGACGACTCCTCTATGGCGGAGTGG
TCCTCAAGGCCCCAAGTCGCTATGCAATGCCTGTGGGATTCGGCAAAGGAAAGCAAGGCGAGCCATGGCGGAAGCTGCAAGTGGCGGTGGCGGCGGAGGCTCAGTCGTGG
TTGCCTCGGAGGCGGCGTTGGGGAAGCAATTGCACAAGGAGAAGAAATCTCGCACGAGCTGCCATGGGAACGACGCCATTGTCGCCCATGTGAAGAACAAGTGCAGTAAG
ATGGCAGATAACTCTTCGGATTCGCAAAGTGAAGGGAACGTTTCTTTTGATCATAACTTCACTTTACGTTTAAGCAAAAGAAGTTCGGCTTTTGGGAGAGTGTTTCCCAG
AGATGAGGAAGAAGCAGCCATCCTTTTGATGGAGCTTTCTTGTGGCCTTGTTCACAGCTGTTGA
mRNA sequence
ATGGATGAAAAAGAAGTTATATTTGCGATTGGTGACAAGGCAGAGAAATCCATTGGTGGAATATCAAACGAAGGCTGTCAAGTGATATTTTCATCTACCTCTTTGCTGAA
GTCTAGCGTTGGTAGCTGCAAACCCGATAATGATCAAGAACTAGAGGATCAAAGCAAACTCGATAATAGGTCGGCGAAGTACTGGATGTCTTCGAAGATGAGATTGATGC
AGAAGATGATGATTAACACAGACAAGAAAGCTGCTGCTGCAAATTCAGATCATCACAAGCCAGCGTCGCCTAACACAAGAAACAAACAGAATTTTGGGAATCACCGACAT
GGGAAATGGGAAATTACCAGGTCGTCTTGTTCAAACGGCGGCCATAACAATAATGGTGTTAGGGTTTGCAGTGATTGTAACACCACGACGACTCCTCTATGGCGGAGTGG
TCCTCAAGGCCCCAAGTCGCTATGCAATGCCTGTGGGATTCGGCAAAGGAAAGCAAGGCGAGCCATGGCGGAAGCTGCAAGTGGCGGTGGCGGCGGAGGCTCAGTCGTGG
TTGCCTCGGAGGCGGCGTTGGGGAAGCAATTGCACAAGGAGAAGAAATCTCGCACGAGCTGCCATGGGAACGACGCCATTGTCGCCCATGTGAAGAACAAGTGCAGTAAG
ATGGCAGATAACTCTTCGGATTCGCAAAGTGAAGGGAACGTTTCTTTTGATCATAACTTCACTTTACGTTTAAGCAAAAGAAGTTCGGCTTTTGGGAGAGTGTTTCCCAG
AGATGAGGAAGAAGCAGCCATCCTTTTGATGGAGCTTTCTTGTGGCCTTGTTCACAGCTGTTGA
Protein sequence
MDEKEVIFAIGDKAEKSIGGISNEGCQVIFSSTSLLKSSVGSCKPDNDQELEDQSKLDNRSAKYWMSSKMRLMQKMMINTDKKAAAANSDHHKPASPNTRNKQNFGNHRH
GKWEITRSSCSNGGHNNNGVRVCSDCNTTTTPLWRSGPQGPKSLCNACGIRQRKARRAMAEAASGGGGGGSVVVASEAALGKQLHKEKKSRTSCHGNDAIVAHVKNKCSK
MADNSSDSQSEGNVSFDHNFTLRLSKRSSAFGRVFPRDEEEAAILLMELSCGLVHSC
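The protein sequence should be the conceptual translation of the CDS. The sketch below spot-checks this on the first 60 and last 12 nucleotides of the CDS listed above, assuming the standard genetic code (the record does not say which translation table was used). `translate`, `cds_start`, and `cds_end` are names introduced only for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nt and last 12 nt of the CDS shown above (both in frame).
cds_start = "ATGGATGAAAAAGAAGTTATATTTGCGATTGGTGACAAGGCAGAGAAATCCATTGGTGGA"
cds_end = "CACAGCTGTTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Running the same function on the full 834-nt CDS would be expected to give the 277-residue protein, since 834 nt is 277 codons plus one stop codon.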