CuGenDBv2

Lsi02G024490 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G024490
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: GATA transcription factor-like protein
Genome location: chr02:31040706..31045601
RNA-Seq Expression: Lsi02G024490
Synteny: Lsi02G024490
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia]; e-value 3.6e-99; identity 82.1%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV DDVSCIG  GGP     KNR  + KEQE+D R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSL+RATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

KAG7014335.1 hypothetical protein SDJN02_24512 [Cucurbita argyrosperma subsp. argyrosperma]; e-value 5.2e-98; identity 81.66%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQEED R+YYKHHKASPLAEIEF DTRKPIT ATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSL+RATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo]; e-value 4.0e-98; identity 81.66%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW+F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AK+NYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQEED R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSLRRA EIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

XP_038899333.1 uncharacterized protein LOC120086662 isoform X1 [Benincasa hispida]; e-value 6.8e-106; identity 86.9%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        MQS+LTAIAPKSNWA  LAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGEPE SQ+NLEPDN K+NYER+D   GDSNGPFG PKAQ+ASSPRLET 
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
        VVGQASKPITQQKR  STVTD+VSCIGVYGGPLE+  +NR TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

XP_038899334.1 uncharacterized protein LOC120086662 isoform X2 [Benincasa hispida]; e-value 2.4e-103; identity 86.03%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        MQS+LTAIAPKSNWA  LAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGEPE   +NLEPDN K+NYER+D   GDSNGPFG PKAQ+ASSPRLET 
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
        VVGQASKPITQQKR  STVTD+VSCIGVYGGPLE+  +NR TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0K9G7 Uncharacterized protein; e-value 3.3e-98; identity 82.13%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSK-GDSNGPFGPPKAQYASS
        MQS L AIAPKSNWAF++ QFQ LRRGG  LTT RTADPS+HAN   DDNDPAVLSGEPERSQ+NLEPDNAK+NY+R D  K GDS GPFG P AQ+ASS
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSK-GDSNGPFGPPKAQYASS

Query:  PRLETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW
        PRLETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLE+  +NR TEMKE+EEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG    VIGW
Subjt:  PRLETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW

Query:  LPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL
        LPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  LPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL

A0A1S3BR22 uncharacterized protein LOC103492778; e-value 2.1e-97; identity 79.25%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL
        MQS+L AIAP+SNWA ++ QFQ LRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQ+NLEPDNAK+NYE R+D  +GDSNGPFGP KAQ+ASSPRL
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL

Query:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE---------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG
        ETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLEE  ++R TEMK++E         EDNRDYYKHHKASPLAEIEF DTRKPITRATDGTA  G G
Subjt:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE---------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG

Query:  KDVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL
        K VIGWLPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  KDVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL

A0A5D3D3D5 Uncharacterized protein; e-value 5.3e-96; identity 78.75%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL
        MQS+L AIAP+SNWA ++ Q Q LRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQ+NLEPDNAK+NYE R+D  +G SNGPFGP KAQ+ASSPRL
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL

Query:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE--------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK
        ETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLEE  ++R TEMK++E        EDNRDYYKHHKASPLAEIEF DTRKPITRATDGTA  G GK
Subjt:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE--------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK

Query:  DVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL
         VIGWLPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  DVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRL

A0A6J1GPT4 uncharacterized protein LOC111456388; e-value 7.3e-98; identity 80.79%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQ++D R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKD+IGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSL+RATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

A0A6J1JL82 uncharacterized protein LOC111487953; e-value 3.3e-98; identity 82.1%
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NWAF LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD AK+NY  +DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP  E  KNR  + KEQEED R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVI WLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRL
        T +DSLRRATEIWK+NAMRGDPDAPQSR+
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRL

SwissProt top hits (hit; e-value; % identity; alignment)
Q5HZ36 GATA transcription factor 21; e-value 1.1e-05; identity 45.45%
Query:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK
        R C  C+TT+TPLWR+GP GP+                   SLCNACGIR RK +
Subjt:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK

Q6YW48 Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1; e-value 8.3e-06; identity 45.45%
Query:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK
        R C  C+TT+TPLWR+GP GP+                   SLCNACGIR RK +
Subjt:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK

Q8LC59 GATA transcription factor 23; e-value 4.9e-06; identity 30.16%
Query:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMKNNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTSTTAMAD
        R C  C TT+TP+WR GP GP+                   SLCNACGIR+RK +             + + LG    +       ++I L +S+     
Subjt:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMKNNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTSTTAMAD

Query:  NGIAEPIGEEEQAAAMLLMALSSGYI
              + EEEQAA  LL+   S  +
Subjt:  NGIAEPIGEEEQAAAMLLMALSSGYI

Q8LG10 GATA transcription factor 15; e-value 9.2e-13; identity 40.94%
Query:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTST
        +++C  C T++TPLWR GPAGP+                   SLCNACGIR RK +    +N +         +  K G +S K R++ LGRE+M+  ST
Subjt:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTST

Query:  TAMADNGIAEPIGEEEQAAAMLLMALS
           A+N     +GEEEQ AA+LLMALS
Subjt:  TAMADNGIAEPIGEEEQAAAMLLMALS

Q9FJ10 GATA transcription factor 16; e-value 1.8e-08; identity 34.78%
Query:  RQRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----------NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGRE
        +++ C  C T++TPLWR GP GP+                   SLCNACGIR RK +            +++GG N K G       ES K  ++ LG  
Subjt:  RQRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----------NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGRE

Query:  IMLHTSTTAMADNGIAEPIGEEEQAAAMLLMALSSGYI
          +   +T        + +GEEEQ AA+LLMALS G +
Subjt:  IMLHTSTTAMADNGIAEPIGEEEQAAAMLLMALSSGYI

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G02700.1 unknown protein; e-value 4.9e-46; identity 48.32%
Query:  IMQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLE
        +MQS+L A A  +         +RL  G  T+ RTADP +HA ND  DPA+   +PE   +   P  A      +         P  PPK+  A++ +LE
Subjt:  IMQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLE

Query:  TTVVGQASKPITQQKRAHSTVT----DDVSCIGVYGG--PLEEANKNRTEMKEQE-EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVI
        +T VG  S+P  QQKR +ST +    D VSC G+ G   P +E        +E E E ++++YKHHKASPL+EIEFADTRKPIT+ATDGTAY  AGKDVI
Subjt:  TTVVGQASKPITQQKRAHSTVT----DDVSCIGVYGG--PLEEANKNRTEMKEQE-EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKENAMRGDPDA-PQSRL
        GWLPEQLDTAE+SL +AT I+K NA RGDP+  P SR+
Subjt:  GWLPEQLDTAEDSLRRATEIWKENAMRGDPDA-PQSRL

AT3G06740.1 GATA transcription factor 15; e-value 6.5e-14; identity 40.94%
Query:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTST
        +++C  C T++TPLWR GPAGP+                   SLCNACGIR RK +    +N +         +  K G +S K R++ LGRE+M+  ST
Subjt:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTST

Query:  TAMADNGIAEPIGEEEQAAAMLLMALS
           A+N     +GEEEQ AA+LLMALS
Subjt:  TAMADNGIAEPIGEEEQAAAMLLMALS

AT4G16141.1 GATA type zinc finger transcription factor family protein; e-value 7.0e-08; identity 30.22%
Query:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYR----------------KMKNNNNNG--GVNNKIGKGKKL--------
        ++ CV C T+RTPLWR GPAGP+                   SLCNACGI+ R                K K+NNN G    N K GKG+ +        
Subjt:  QRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYR----------------KMKNNNNNG--GVNNKIGKGKKL--------

Query:  --------------------------GEESSKVRVVRLGREIMLHTSTTAMADNGIAE-----PIGEEEQAAAMLLMALSSG
                                     ++K  V R+GR +       AM  + + +      +GEEE+ AA+LLMALS G
Subjt:  --------------------------GEESSKVRVVRLGREIMLHTSTTAMADNGIAE-----PIGEEEQAAAMLLMALSSG

AT5G26930.1 GATA transcription factor 23; e-value 3.5e-07; identity 30.16%
Query:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMKNNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTSTTAMAD
        R C  C TT+TP+WR GP GP+                   SLCNACGIR+RK +             + + LG    +       ++I L +S+     
Subjt:  RACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMKNNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTSTTAMAD

Query:  NGIAEPIGEEEQAAAMLLMALSSGYI
              + EEEQAA  LL+   S  +
Subjt:  NGIAEPIGEEEQAAAMLLMALSSGYI

AT5G49300.1 GATA transcription factor 16; e-value 1.3e-09; identity 34.78%
Query:  RQRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----------NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGRE
        +++ C  C T++TPLWR GP GP+                   SLCNACGIR RK +            +++GG N K G       ES K  ++ LG  
Subjt:  RQRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCNACGIRYRKMK----------NNNNNGGVNNKIGKGKKLGEESSKVRVVRLGRE

Query:  IMLHTSTTAMADNGIAEPIGEEEQAAAMLLMALSSGYI
          +   +T        + +GEEEQ AA+LLMALS G +
Subjt:  IMLHTSTTAMADNGIAEPIGEEEQAAAMLLMALSSGYI


Sequences
CDS sequence
ATGATTAAATACTTTCTTCAAACACCTTCCGTTCTGATGCTCAAAGGTGAAGTAATTGGTGATTTTGGTTTGGATCCAGTGATGATGGCCTTCCTCATGAACCGCCCCGC
TGATCTCTCTGCTTCGACCTTCGACCCATCTCTAATGGCCGCCGTATCGAAACAGAGTCTGCTCAATATGGAGCCGCAGAGACAGAGAGCCTGCGTCCACTGTCACACCA
CCAGAACACCTCTCTGGAGAGCCGGTCCGGCGGGGCCAAGGGTTGAATGTTTTGAGTTAGTAATGTTGATTAATTTCTGGGTTTTTGTTGGTGGACAGTCGCTGTGCAAT
GCATGTGGGATTCGATACAGGAAGATGAAGAATAACAATAATAATGGGGGAGTGAATAATAAGATTGGAAAAGGGAAGAAACTGGGAGAAGAATCGTCGAAGGTAAGAGT
AGTGAGATTAGGGAGAGAGATAATGCTGCACACATCAACGACGGCGATGGCGGATAATGGAATTGCAGAGCCAATCGGAGAAGAAGAACAGGCGGCGGCGATGCTTCTTA
TGGCTTTATCTTCCGGCTATATAATTCCCCACCGTTCTAGGCGCATCCGCAACCACACGACAAGACATAATAACAATAGAATATCCGAGAACTTACCACTCAAAGTATAC
GTGGATGATGTGCTGTCCATGCCACGTAAACCGCTCGCCACAACCTCCCTCGTTTTGCTTTTAAGATCTCGCCGCAAACAACAATTTGGAAGTAAAAAGATCATGCAATC
GAAATTGACGGCGATCGCACCGAAATCGAATTGGGCCTTCTATCTGGCCCAATTCCAACGCCTCCGACGAGGTGGTCTGACGACATGTCGTACAGCTGACCCTTCCGTTC
ACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACCCGAGAGATCACAGGAAAATTTAGAGCCAGATAATGCGAAATCCAATTACGAAAGGGAGGACTCTAGT
AAGGGAGATTCAAATGGGCCGTTTGGGCCACCGAAGGCACAATACGCCTCCTCCCCTCGGTTAGAAACCACTGTAGTGGGCCAGGCCTCAAAGCCCATTACTCAACAAAA
AAGAGCCCACAGTACGGTGACCGACGACGTGAGTTGCATCGGCGTCTACGGCGGGCCTTTGGAGGAAGCGAATAAAAACAGAACTGAAATGAAAGAACAGGAGGAAGACA
ATAGAGATTATTACAAGCACCACAAGGCGTCGCCGTTGGCGGAGATCGAGTTTGCGGATACTCGCAAGCCGATAACCAGAGCGACGGACGGGACGGCGTACGATGGGGCC
GGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCGCTTCGGAGAGCGACGGAGATTTGGAAAGAAAATGCGATGCGTGGAGATCCCGATGC
TCCACAGTCGAGGCTGAAAATGCCGAGAAAAGAAAGACTTTTTGGGAATAGGTGGCACGGCGCCTTCTTGCATACACCTGATTCCGATTCTCGGCCGATGTGGGACAAGC
TGATTCCCCCACCGCCGCACCATATTGTTCCCACCTGGCCTAAAAAATAA
mRNA sequence
ATGATTAAATACTTTCTTCAAACACCTTCCGTTCTGATGCTCAAAGGTGAAGTAATTGGTGATTTTGGTTTGGATCCAGTGATGATGGCCTTCCTCATGAACCGCCCCGC
TGATCTCTCTGCTTCGACCTTCGACCCATCTCTAATGGCCGCCGTATCGAAACAGAGTCTGCTCAATATGGAGCCGCAGAGACAGAGAGCCTGCGTCCACTGTCACACCA
CCAGAACACCTCTCTGGAGAGCCGGTCCGGCGGGGCCAAGGGTTGAATGTTTTGAGTTAGTAATGTTGATTAATTTCTGGGTTTTTGTTGGTGGACAGTCGCTGTGCAAT
GCATGTGGGATTCGATACAGGAAGATGAAGAATAACAATAATAATGGGGGAGTGAATAATAAGATTGGAAAAGGGAAGAAACTGGGAGAAGAATCGTCGAAGGTAAGAGT
AGTGAGATTAGGGAGAGAGATAATGCTGCACACATCAACGACGGCGATGGCGGATAATGGAATTGCAGAGCCAATCGGAGAAGAAGAACAGGCGGCGGCGATGCTTCTTA
TGGCTTTATCTTCCGGCTATATAATTCCCCACCGTTCTAGGCGCATCCGCAACCACACGACAAGACATAATAACAATAGAATATCCGAGAACTTACCACTCAAAGTATAC
GTGGATGATGTGCTGTCCATGCCACGTAAACCGCTCGCCACAACCTCCCTCGTTTTGCTTTTAAGATCTCGCCGCAAACAACAATTTGGAAGTAAAAAGATCATGCAATC
GAAATTGACGGCGATCGCACCGAAATCGAATTGGGCCTTCTATCTGGCCCAATTCCAACGCCTCCGACGAGGTGGTCTGACGACATGTCGTACAGCTGACCCTTCCGTTC
ACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACCCGAGAGATCACAGGAAAATTTAGAGCCAGATAATGCGAAATCCAATTACGAAAGGGAGGACTCTAGT
AAGGGAGATTCAAATGGGCCGTTTGGGCCACCGAAGGCACAATACGCCTCCTCCCCTCGGTTAGAAACCACTGTAGTGGGCCAGGCCTCAAAGCCCATTACTCAACAAAA
AAGAGCCCACAGTACGGTGACCGACGACGTGAGTTGCATCGGCGTCTACGGCGGGCCTTTGGAGGAAGCGAATAAAAACAGAACTGAAATGAAAGAACAGGAGGAAGACA
ATAGAGATTATTACAAGCACCACAAGGCGTCGCCGTTGGCGGAGATCGAGTTTGCGGATACTCGCAAGCCGATAACCAGAGCGACGGACGGGACGGCGTACGATGGGGCC
GGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCGCTTCGGAGAGCGACGGAGATTTGGAAAGAAAATGCGATGCGTGGAGATCCCGATGC
TCCACAGTCGAGGCTGAAAATGCCGAGAAAAGAAAGACTTTTTGGGAATAGGTGGCACGGCGCCTTCTTGCATACACCTGATTCCGATTCTCGGCCGATGTGGGACAAGC
TGATTCCCCCACCGCCGCACCATATTGTTCCCACCTGGCCTAAAAAATAA
Protein sequence
MIKYFLQTPSVLMLKGEVIGDFGLDPVMMAFLMNRPADLSASTFDPSLMAAVSKQSLLNMEPQRQRACVHCHTTRTPLWRAGPAGPRVECFELVMLINFWVFVGGQSLCN
ACGIRYRKMKNNNNNGGVNNKIGKGKKLGEESSKVRVVRLGREIMLHTSTTAMADNGIAEPIGEEEQAAAMLLMALSSGYIIPHRSRRIRNHTTRHNNNRISENLPLKVY
VDDVLSMPRKPLATTSLVLLLRSRRKQQFGSKKIMQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSS
KGDSNGPFGPPKAQYASSPRLETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNRTEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGA
GKDVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRLKMPRKERLFGNRWHGAFLHTPDSDSRPMWDKLIPPPPHHIVPTWPKK
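
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal Python sketch of that check, not a CuGenDB tool: the function name `translate` and the choice of the standard codon table (NCBI translation table 1) are assumptions made for illustration, and only short fragments copied from the CDS above are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper for illustration. Codons that are not in the
    table (for example, containing N) are reported as 'X'. Trailing
    bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGATTAAATACTTTCTTCAAACACCTTCC"))  # MIKYFLQTPS
# Last 12 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("CCTAAAAAATAA"))  # PKK
```

Both fragments agree with the start (MIKYFLQTPS...) and end (...PKK) of the protein sequence shown above; passing the full CDS to `translate` would extend the same check to the whole record.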