; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022846 (gene) of Chayote v1 genome

Gene IDSed0022846
OrganismSechium edule (Chayote v1)
DescriptionGATA transcription factor
Genome locationLG04:32293720..32294854
RNA-Seq ExpressionSed0022846
SyntenySed0022846
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150081.1 GATA transcription factor 16 [Cucumis sativus]1.7e-3464.29Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR
        MMDPI + S       +S  ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + 
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR

Query:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        LKWR MA GR EL Q+R++    EEE+AAVLLMALSYGSV
Subjt:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

XP_008458384.1 PREDICTED: GATA transcription factor 16-like isoform X1 [Cucumis melo]8.5e-3463.57Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR
        MMDPI + S       +S  ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + 
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR

Query:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        LKWR MA GR EL Q+ ++    EEE+AAVLLMALSYGSV
Subjt:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

XP_022138518.1 GATA transcription factor 16-like [Momordica charantia]2.9e-3464.34Show/hide
Query:  MMDPIAKAS-----GKS--SSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMG
        MMDPI + S     G+S   +S AESEQ+RK CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+ ++NGGGN+ K+G
Subjt:  MMDPIAKAS-----GKS--SSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMG

Query:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
           LKWR MA GR EL Q+R++    EEE+AAVLLMALSYGSV
Subjt:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

XP_023548663.1 GATA transcription factor 16-like [Cucurbita pepo subsp. pepo]2.5e-3362.94Show/hide
Query:  MMDPIAKAS-------GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSSSN-NGGGNEGKMG
        MMDPI + S           SS AESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER+SKG S+ N NGGGN+ K+G
Subjt:  MMDPIAKAS-------GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSSSN-NGGGNEGKMG

Query:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
           LKWR  A GR +L Q+R++    EEE+AAVLLMALSYGSV
Subjt:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

XP_038875288.1 GATA transcription factor 16-like [Benincasa hispida]9.1e-3666.91Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMGEKRL
        MMDPI + S       SS AESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER+SKG S+ SNNGGGN+ K+G + L
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMGEKRL

Query:  KWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        KWR MA GR EL Q+R++    EEE+AAVLLMALSYGSV
Subjt:  KWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

TrEMBL top hitse value%identityAlignment
A0A0A0KG76 GATA-type domain-containing protein8.3e-3564.29Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR
        MMDPI + S       +S  ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + 
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR

Query:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        LKWR MA GR EL Q+R++    EEE+AAVLLMALSYGSV
Subjt:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

A0A1S3C8C2 GATA transcription factor 16-like isoform X14.1e-3463.57Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR
        MMDPI + S       +S  ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + 
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR

Query:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        LKWR MA GR EL Q+ ++    EEE+AAVLLMALSYGSV
Subjt:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

A0A1S3C902 GATA transcription factor 16-like isoform X24.6e-3368.33Show/hide
Query:  ESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKRLKWRFMAIGRGELKQKRKME
        ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + LKWR MA GR EL Q+ ++ 
Subjt:  ESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKRLKWRFMAIGRGELKQKRKME

Query:  GEEEEEEAAVLLMALSYGSV
           EEE+AAVLLMALSYGSV
Subjt:  GEEEEEEAAVLLMALSYGSV

A0A5D3BXI1 GATA transcription factor 16-like isoform X14.1e-3463.57Show/hide
Query:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR
        MMDPI + S       +S  ESEQ++K CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+  +NNGGGN+GK+G + 
Subjt:  MMDPIAKAS---GKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS--SNNGGGNEGKMGEKR

Query:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
        LKWR MA GR EL Q+ ++    EEE+AAVLLMALSYGSV
Subjt:  LKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

A0A6J1CD73 GATA transcription factor 16-like1.4e-3464.34Show/hide
Query:  MMDPIAKAS-----GKS--SSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMG
        MMDPI + S     G+S   +S AESEQ+RK CADCGTT+TPLWRGGPAGPKSLCNACGIRSRKKRRS++       VER++KG S+ ++NGGGN+ K+G
Subjt:  MMDPIAKAS-----GKS--SSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV-------VERRSKGGSS-SNNGGGNEGKMG

Query:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV
           LKWR MA GR EL Q+R++    EEE+AAVLLMALSYGSV
Subjt:  EKRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYGSV

SwissProt top hitse value%identityAlignment
Q8LC59 GATA transcription factor 237.3e-1237.4Show/hide
Query:  MDPIAKASGKSSSSAAESEQSR---KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGR
        MDP    S  SS  +   ++ +   + C++C TT+TP+WRGGP GPKSLCNACGIR RK+RRS ++         S         +  K++     + G 
Subjt:  MDPIAKASGKSSSSAAESEQSR---KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGR

Query:  GELKQKRKMEGEEEEEEAAVLLMALSYGSVV
          +K++R +   +EEE+AA+ L+ LS  SV+
Subjt:  GELKQKRKMEGEEEEEEAAVLLMALSYGSVV

Q8LG10 GATA transcription factor 153.2e-2351.11Show/hide
Query:  MDPIAKASGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGRGEL
        +D I + S  SS+ A  +E  +K+CA CGT++TPLWRGGPAGPKSLCNACGIR+RKKRR+++  R       S+N      K G+  LK R M +GR  +
Subjt:  MDPIAKASGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGRGEL

Query:  KQKRKMEGEE-----EEEEAAVLLMALSYGSVVYA
         Q+   E +      EEE+AAVLLMALSY S VYA
Subjt:  KQKRKMEGEE-----EEEEAAVLLMALSYGSVVYA

Q9FJ10 GATA transcription factor 164.6e-2259.63Show/hide
Query:  RKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIG---RGELKQKRKMEGEEEEEEAAVL
        +K CADCGT++TPLWRGGP GPKSLCNACGIR+RKKRR    + +    SSS  GGGN  K GE  LK   M +G   R  ++++R+  G  EEE+AAVL
Subjt:  RKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIG---RGELKQKRKMEGEEEEEEAAVL

Query:  LMALSYGSV
        LMALSYGSV
Subjt:  LMALSYGSV

Q9LIB5 GATA transcription factor 176.0e-1437.04Show/hide
Query:  SGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV---VERRSKGGSSSNNGGGNEGKMGEKRLK---------------
        S   S   + S  +++ C DCGT RTPLWRGGPAGPKSLCNACGI+SRKKR++ +    E + K   S+ N   N      K+ K               
Subjt:  SGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV---VERRSKGGSSSNNGGGNEGKMGEKRLK---------------

Query:  -----------------WRFMAIG-------RGELKQKRKMEGEEEEEEAAVLLMALSYGSV
                          +F+ +G       R  +++KR      EEE AAVLLMALS  SV
Subjt:  -----------------WRFMAIG-------RGELKQKRKMEGEEEEEEAAVLLMALSYGSV

Q9SZI6 Putative GATA transcription factor 224.7e-1170Show/hide
Query:  KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV
        + C+DC TT+TPLWR GP GPKSLCNACGIR RK RR+ +
Subjt:  KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV

Arabidopsis top hitse value%identityAlignment
AT3G06740.1 GATA transcription factor 152.2e-2451.11Show/hide
Query:  MDPIAKASGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGRGEL
        +D I + S  SS+ A  +E  +K+CA CGT++TPLWRGGPAGPKSLCNACGIR+RKKRR+++  R       S+N      K G+  LK R M +GR  +
Subjt:  MDPIAKASGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGRGEL

Query:  KQKRKMEGEE-----EEEEAAVLLMALSYGSVVYA
         Q+   E +      EEE+AAVLLMALSY S VYA
Subjt:  KQKRKMEGEE-----EEEEAAVLLMALSYGSVVYA

AT3G16870.1 GATA transcription factor 174.2e-1537.04Show/hide
Query:  SGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV---VERRSKGGSSSNNGGGNEGKMGEKRLK---------------
        S   S   + S  +++ C DCGT RTPLWRGGPAGPKSLCNACGI+SRKKR++ +    E + K   S+ N   N      K+ K               
Subjt:  SGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV---VERRSKGGSSSNNGGGNEGKMGEKRLK---------------

Query:  -----------------WRFMAIG-------RGELKQKRKMEGEEEEEEAAVLLMALSYGSV
                          +F+ +G       R  +++KR      EEE AAVLLMALS  SV
Subjt:  -----------------WRFMAIG-------RGELKQKRKMEGEEEEEEAAVLLMALSYGSV

AT4G16141.1 GATA type zinc finger transcription factor family protein5.5e-1534.5Show/hide
Query:  SSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV--------------------------------------------------
        SS+     ++K C DCGT+RTPLWRGGPAGPKSLCNACGI+SRKKR++ +                                                  
Subjt:  SSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVV--------------------------------------------------

Query:  -------VERRSKGGSSSNNGGGNEGKMGE-KRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYG
               ++R  +  SSSNN   N  ++G      ++  A+ R  +++KR      EEE AAVLLMALS G
Subjt:  -------VERRSKGGSSSNNGGGNEGKMGE-KRLKWRFMAIGRGELKQKRKMEGEEEEEEAAVLLMALSYG

AT5G26930.1 GATA transcription factor 235.2e-1337.4Show/hide
Query:  MDPIAKASGKSSSSAAESEQSR---KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGR
        MDP    S  SS  +   ++ +   + C++C TT+TP+WRGGP GPKSLCNACGIR RK+RRS ++         S         +  K++     + G 
Subjt:  MDPIAKASGKSSSSAAESEQSR---KACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGR

Query:  GELKQKRKMEGEEEEEEAAVLLMALSYGSVV
          +K++R +   +EEE+AA+ L+ LS  SV+
Subjt:  GELKQKRKMEGEEEEEEAAVLLMALSYGSVV

AT5G49300.1 GATA transcription factor 163.2e-2359.63Show/hide
Query:  RKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIG---RGELKQKRKMEGEEEEEEAAVL
        +K CADCGT++TPLWRGGP GPKSLCNACGIR+RKKRR    + +    SSS  GGGN  K GE  LK   M +G   R  ++++R+  G  EEE+AAVL
Subjt:  RKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIG---RGELKQKRKMEGEEEEEEAAVL

Query:  LMALSYGSV
        LMALSYGSV
Subjt:  LMALSYGSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGATCCGATTGCGAAGGCATCCGGAAAATCATCGTCGTCGGCGGCGGAGAGTGAGCAGAGCAGAAAGGCGTGCGCCGATTGCGGCACGACGAGGACTCCTCTCTG
GCGCGGAGGTCCGGCTGGCCCTAAGTCTCTTTGCAATGCGTGTGGGATCAGAAGCAGGAAGAAGAGAAGATCGGTTGTAGTAGAGAGGAGAAGCAAGGGAGGAAGTAGCA
GCAACAATGGCGGCGGAAACGAAGGGAAAATGGGAGAGAAGAGGTTGAAATGGCGATTCATGGCGATTGGCAGAGGAGAGTTGAAGCAGAAGAGAAAGATGGAAGGAGAA
GAAGAAGAAGAAGAAGCTGCAGTTTTATTAATGGCTCTTTCTTATGGATCTGTTGTATATGCTTGA
mRNA sequenceShow/hide mRNA sequence
CGAGCCTAAAAGCGGCTGCTAAACCGTAAACTTCTGTCGGTTCGATTCTTCACATCTTCATCTTCTTCCTCTCGTTCTTCGTGCCACTGATCGCTCTCCAACTCCCTTCA
TGTCGGTTTCGTCTTTTTCGTCTTCCAAATTTCCATAACCTCAATTTTCCACTGTTCTAGTCGCGTTTGTTCATCACATTTCTGTTCCAGTTTTGATCATCCAGTGTTTT
TGTTCATCCGATCTTCCGTTCGAGCCTGAGCGGTGCCGTTTCTCCGTCATTTTTCCTCGATTTTATGTCTTCGATTCGTCTCTTCCTCTCTCAATTTCCGTTTCCGCTTT
CCGATTCGAGTTTCGGTGCTGATTTCCTCGATTATTCGTCTCAGATCTCTGAGATTTCCTTTGGAAAATGATGGATCCGATTGCGAAGGCATCCGGAAAATCATCGTCGT
CGGCGGCGGAGAGTGAGCAGAGCAGAAAGGCGTGCGCCGATTGCGGCACGACGAGGACTCCTCTCTGGCGCGGAGGTCCGGCTGGCCCTAAGTCTCTTTGCAATGCGTGT
GGGATCAGAAGCAGGAAGAAGAGAAGATCGGTTGTAGTAGAGAGGAGAAGCAAGGGAGGAAGTAGCAGCAACAATGGCGGCGGAAACGAAGGGAAAATGGGAGAGAAGAG
GTTGAAATGGCGATTCATGGCGATTGGCAGAGGAGAGTTGAAGCAGAAGAGAAAGATGGAAGGAGAAGAAGAAGAAGAAGAAGCTGCAGTTTTATTAATGGCTCTTTCTT
ATGGATCTGTTGTATATGCTTGAAATGCTAACTCTAATGGAAAGCTGCATTTTCTTCTTCAATGGATTCACATCTATCTCTCTTCTTCTCTAATTTTTTTACAATGACTC
TTATTTATTTATTTTCTCTTTTACATAGTGGTTGTGGTACCATTAAATACATATGGAATCCATTCATTTTGTATGTATATTTGATC
Protein sequenceShow/hide protein sequence
MMDPIAKASGKSSSSAAESEQSRKACADCGTTRTPLWRGGPAGPKSLCNACGIRSRKKRRSVVVERRSKGGSSSNNGGGNEGKMGEKRLKWRFMAIGRGELKQKRKMEGE
EEEEEAAVLLMALSYGSVVYA