CuGenDBv2

HG10007039 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10007039
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: GATA transcription factor 16-like
Genome location: Chr10:640036..641206
RNA-Seq Expression: HG10007039
Synteny: HG10007039
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
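The genome location field follows a chromosome:start..end pattern. As a minimal sketch, the snippet below splits that string into its parts. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr10:640036..641206' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr10:640036..641206")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```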


Homology
GenBank top hits
TYK10277.1 GATA transcription factor 16-like [Cucumis melo var. makuwa] (e value: 2.0e-45; % identity: 64.52)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPK-------------------SLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTV
        MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPK                   SLCNACGIRFRKR I T  TNR G D+KRE V +N SST+  V
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPK-------------------SLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTV

Query:  SATTSSSE---TTATTTSG-DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
        SATT+SS    TT TTTSG DGDEN GECGS RM++MM  EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS GSL +
Subjt:  SATTSSSE---TTATTTSG-DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

XP_008450587.1 PREDICTED: GATA transcription factor 16-like [Cucumis melo] (e value: 6.8e-49; % identity: 71.86)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG
        MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR I T  TNR G D+KRE V +N SST+  VSATT+SS    TT TTTSG
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG

Query:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
         DGDEN GECGS RM++MM  EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS GSL +
Subjt:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

XP_011659732.1 GATA transcription factor 16 [Cucumis sativus] (e value: 1.8e-49; % identity: 71.86)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG
        MG MDL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR IST GTNR  D+KRE V++N SS + TVSATT+SS     TT T++SG
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG

Query:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
         DGDEN GECGSLRMRLMM+ EE+VMV       VKKQ+ + QRK+GEEEKQAA+SL+ALS  SL +
Subjt:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

XP_022135615.1 GATA transcription factor 16-like isoform X4 [Momordica charantia] (e value: 7.8e-45; % identity: 66.47)
Query:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG
        MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STIGTNRGCDRKRE  H++G ST   +SATTSSS T A   S 
Subjt:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG

Query:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
        +G       +E+LGECGSLRMRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Subjt:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

XP_038880207.1 GATA transcription factor 17-like [Benincasa hispida] (e value: 1.0e-68; % identity: 86.14)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTS----G
        MG+MDLRQKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR ISTIGTNRG DRKRE VHNNGS+  TTVSATTSS+ TT TTTS    G
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTS----G

Query:  DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
        DGDENLGECGSL MRLMMA EEEVMVVQNLPSSVKKQR +R+RKLGEEEKQAAVSLMALSCGS+ +
Subjt:  DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

TrEMBL top hits
A0A0A0M1G9 GATA-type domain-containing protein (e value: 8.7e-50; % identity: 71.86)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG
        MG MDL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR IST GTNR  D+KRE V++N SS + TVSATT+SS     TT T++SG
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG

Query:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
         DGDEN GECGSLRMRLMM+ EE+VMV       VKKQ+ + QRK+GEEEKQAA+SL+ALS  SL +
Subjt:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

A0A1S3BQ71 GATA transcription factor 16-like (e value: 3.3e-49; % identity: 71.86)
Query:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG
        MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR I T  TNR G D+KRE V +N SST+  VSATT+SS    TT TTTSG
Subjt:  MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG

Query:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
         DGDEN GECGS RM++MM  EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS GSL +
Subjt:  -DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

A0A6J1C1I3 GATA transcription factor 16-like isoform X4 (e value: 3.8e-45; % identity: 66.47)
Query:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG
        MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STIGTNRGCDRKRE  H++G ST   +SATTSSS T A   S 
Subjt:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG

Query:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
        +G       +E+LGECGSLRMRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Subjt:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

A0A6J1C373 GATA transcription factor 17-like isoform X1 (e value: 8.4e-45; % identity: 67.24)
Query:  MGLMD-LRQKG---LLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTS
        MG+MD LR+K     +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STIGTNRGCDRKRE  H++G ST   +SATTSSS T A   S
Subjt:  MGLMD-LRQKG---LLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTS

Query:  GDG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
         +G       +E+LGECGSLRMRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Subjt:  GDG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

A0A6J1C5A4 GATA transcription factor 17-like isoform X2 (e value: 3.8e-45; % identity: 66.47)
Query:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG
        MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STIGTNRGCDRKRE  H++G ST   +SATTSSS T A   S 
Subjt:  MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSG

Query:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
        +G       +E+LGECGSLRMRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Subjt:  DG-------DENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

SwissProt top hits
Q6YW48 Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 (e value: 6.5e-10; % identity: 79.41)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRK
        + C DC TTKTPLWR GP GPKSLCNACGIR RK
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRK

Q8LC59 GATA transcription factor 23 (e value: 1.1e-12; % identity: 82.86)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR
        +CC +CKTTKTP+WRGGPTGPKSLCNACGIR RK+
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR
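
The % identity values listed for the hits can be checked against the alignment middle line, where a letter marks an identical residue and `+` marks a conservative substitution. The sketch below assumes that % identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is hypothetical, and the strings are copied from the Q8LC59 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter (identity)."""
    # Pad the midline in case trailing spaces were stripped from it.
    midline = midline.ljust(len(query))
    identities = sum(1 for m in midline if m.isalpha())
    return 100.0 * identities / len(query)

# Query row and middle row of the Q8LC59 hit, without the "Query:  " prefix.
QUERY = "KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR"
MIDLINE = "+CC +CKTTKTP+WRGGPTGPKSLCNACGIR RK+"

print(round(percent_identity(QUERY, MIDLINE), 2))
```

For alignments that span several row blocks, the query and middle rows would need to be concatenated block by block before calling the function.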

Q8LG10 GATA transcription factor 15 (e value: 3.7e-13; % identity: 42.76)
Query:  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMM
        + K C  C T+KTPLWRGGP GPKSLCNACGIR RK+   T+ +NR  D+K++  HN                          GD       SL+ RLM 
Subjt:  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMM

Query:  ASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGS
           E +M      S+ + Q   R+ KLGEEE QAAV LMALS  S
Subjt:  ASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGS

Q9FJ10 GATA transcription factor 16 (e value: 1.5e-14; % identity: 43.84)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMAS
        K C DC T+KTPLWRGGP GPKSLCNACGIR RK              KR G               T  ++    ++SG G+   GE  SL+  LM   
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMAS

Query:  EEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
          +        S+V+KQR    +KLGEEE QAAV LMALS GS++A
Subjt:  EEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

Q9LIB5 GATA transcription factor 17 (e value: 6.2e-13; % identity: 38.89)
Query:  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLR
        DTK  CVDC T +TPLWRGGP GPKSLCNACGI+ RK+  + +G      R  E   N  S+    ++    +++        DG    D++   C + R
Subjt:  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLR

Query:  MRLMMASEE---------EVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
             +++          +V V++   S+V+K+R    RKLGEEE+ AAV LMALSC S++A
Subjt:  MRLMMASEE---------EVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

Arabidopsis top hits
AT3G06740.1 GATA transcription factor 15 (e value: 2.6e-14; % identity: 42.76)
Query:  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMM
        + K C  C T+KTPLWRGGP GPKSLCNACGIR RK+   T+ +NR  D+K++  HN                          GD       SL+ RLM 
Subjt:  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMM

Query:  ASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGS
           E +M      S+ + Q   R+ KLGEEE QAAV LMALS  S
Subjt:  ASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGS

AT3G16870.1 GATA transcription factor 17 (e value: 4.4e-14; % identity: 38.89)
Query:  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLR
        DTK  CVDC T +TPLWRGGP GPKSLCNACGI+ RK+  + +G      R  E   N  S+    ++    +++        DG    D++   C + R
Subjt:  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLR

Query:  MRLMMASEE---------EVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
             +++          +V V++   S+V+K+R    RKLGEEE+ AAV LMALSC S++A
Subjt:  MRLMMASEE---------EVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA

AT4G16141.1 GATA type zinc finger transcription factor family protein (e value: 1.1e-12; % identity: 34.97)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENL--GECGSLRMRLMM
        K CVDC T++TPLWRGGP GPKSLCNACGI+ RK+  + +G  +   + +   +NN       V                 G   +  GE G+++ ++  
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENL--GECGSLRMRLMM

Query:  ASEEEVMVVQNLPSSVKK-------------------QRCRRQRKLGEEEKQAAVSLMALSCG
           E      N   +VK+                   ++ R  RKLGEEE+ AAV LMALSCG
Subjt:  ASEEEVMVVQNLPSSVKK-------------------QRCRRQRKLGEEEKQAAVSLMALSCG

AT5G26930.1 GATA transcription factor 23 (e value: 7.6e-14; % identity: 82.86)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR
        +CC +CKTTKTP+WRGGPTGPKSLCNACGIR RK+
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR

AT5G49300.1 GATA transcription factor 16 (e value: 1.1e-15; % identity: 43.84)
Query:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMAS
        K C DC T+KTPLWRGGP GPKSLCNACGIR RK              KR G               T  ++    ++SG G+   GE  SL+  LM   
Subjt:  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMAS

Query:  EEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
          +        S+V+KQR    +KLGEEE QAAV LMALS GS++A
Subjt:  EEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA


Sequences
CDS sequence
ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACC
TAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCT
CCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATG
AGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCA
GGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA
mRNA sequence
ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACC
TAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCT
CCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATG
AGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCA
GGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA
Protein sequence
MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRM
RLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
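
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical, and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed, not stated in the record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

CDS = (
    "ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACC"
    "TAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCT"
    "CCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATG"
    "AGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCA"
    "GGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA"
)
PROTEIN = (
    "MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRM"
    "RLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA"
)

print(translate(CDS) == PROTEIN)
```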