; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G008690 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G008690
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionGATA-N domain-containing protein
Genome locationGy14Chr5:6678754..6680287
RNA-Seq ExpressionCsGy5G008690
SyntenyCsGy5G008690
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008013 - GATA-type transcription activator, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146444.1 uncharacterized protein LOC101211273 [Cucumis sativus]1.24e-120100Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
        MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK

Query:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
Subjt:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

XP_008456929.1 PREDICTED: uncharacterized protein LOC103496731 [Cucumis melo]1.28e-11395.93Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
        MVAI PSHGGFPSQLGQGWKSVRGRPDRRSSNLRV AEKGEERES GGENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATKAGGRM+REQYGALRRK
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK

Query:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSK FSK
Subjt:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

XP_022941869.1 uncharacterized protein LOC111447101 [Cucurbita moschata]5.83e-9584.09Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA
        MVAI PSH GF SQ GQGWK+ R     RSSNLRV A   EKGEE+E GG  +NK+SLFSSVTEALDFS VRS+RDAELLDDARQATKAGGRMSREQYGA
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSS  NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

XP_023532465.1 uncharacterized protein LOC111794620 [Cucurbita pepo subsp. pepo]4.78e-9483.52Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA
        MVAI PSH GF SQ GQGWK+ R     RSSNLRV A   EKGEE+E  G  +NK+SLFSSVTEALDFS VRS+RDAELLDDARQATKAGGRMSREQYGA
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSS  NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

XP_038891315.1 uncharacterized protein LOC120080762 [Benincasa hispida]8.80e-10691.53Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRV---RAEKGEERESGGG--ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYG
        MVAI PSHGGFPSQL QGWK+VRGRPDR SSNLRV    AEKGEERESGGG  ENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATKAGGRM+REQYG
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRV---RAEKGEERESGGG--ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYG

Query:  ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKS SS GNFFSK FSK
Subjt:  ALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

TrEMBL top hitse value%identityAlignment
A0A0A0KM41 GATA-N domain-containing protein6.00e-121100Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
        MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK

Query:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
Subjt:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

A0A1S3C3X8 uncharacterized protein LOC1034967316.21e-11495.93Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
        MVAI PSHGGFPSQLGQGWKSVRGRPDRRSSNLRV AEKGEERES GGENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATKAGGRM+REQYGALRRK
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK

Query:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSK FSK
Subjt:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

A0A5D3DFR3 GATA-N domain-containing protein6.21e-11495.93Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK
        MVAI PSHGGFPSQLGQGWKSVRGRPDRRSSNLRV AEKGEERES GGENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATKAGGRM+REQYGALRRK
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRK

Query:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSK FSK
Subjt:  IGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

A0A6J1FPP4 uncharacterized protein LOC1114471012.82e-9584.09Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA
        MVAI PSH GF SQ GQGWK+ R     RSSNLRV A   EKGEE+E GG  +NK+SLFSSVTEALDFS VRS+RDAELLDDARQATKAGGRMSREQYGA
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSS  NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

A0A6J1K2I3 uncharacterized protein LOC1114918081.28e-9181.25Show/hide
Query:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA
        MVAI PS+ GF SQ GQGWK+ R     RSSNLRV A   EKGEE+E G   +NK+SLFSSVTEALDFS VRS+RDAELL+DARQATK+GGRMS+EQYGA
Subjt:  MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRA---EKGEERESGGG-ENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSS  NFF+KFF K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56290.1 unknown protein1.2e-4967.39Show/hide
Query:  VRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKK
        V A++ ++ +    E + SLF+ +T+ALDFS VRS +DAELL +AR+ATK+G +M++EQYGALRRKIGGTYKDFFKSY+EVDGQYVEEGWVDKTCK+CKK
Subjt:  VRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKK

Query:  DTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK
        DT+GEARQVDKLGRY HV+CL+  N   GNFF++ FS+
Subjt:  DTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCTATTTGTCCGTCTCACGGGGGGTTCCCGTCGCAATTAGGGCAGGGATGGAAGTCGGTAAGGGGGAGGCCTGATAGGAGGAGCTCCAATTTGAGAGTAAGGGC
GGAGAAGGGCGAGGAGAGAGAAAGCGGGGGAGGAGAGAATAAGAAATCGCTGTTCAGCAGCGTGACTGAGGCGTTGGATTTCTCAGCAGTCCGATCGACTCGCGACGCCG
AGCTCCTCGACGATGCCCGCCAGGCCACCAAAGCCGGCGGCAGAATGAGCCGTGAACAGTATGGAGCTTTAAGAAGGAAAATAGGAGGGACCTACAAGGACTTCTTCAAA
TCTTACATAGAAGTGGATGGACAATATGTTGAAGAAGGGTGGGTGGACAAGACATGCAAGGTGTGCAAGAAGGACACGAGGGGAGAAGCAAGGCAAGTTGACAAACTTGG
AAGATATGTTCATGTTGCTTGTTTGGAGAAGTCCAACTCTTCCCCGGGGAATTTCTTTTCCAAGTTCTTCTCCAAATGA
mRNA sequenceShow/hide mRNA sequence
CTTCCACGCCATAGAATCACAAAGTCCTACATTTGTCCCCACTGCGACACAAACATGTGAATCTCAAGAGTCAATCTCCCATCTCTTCGGTAGCCATAAATTAACTATTT
TATTCCTAAGATTAATAAACCAAGCATTAAAAAAATCAAGTGGCCATAGAGAATGGAGAGATCAAGAGTATCAAACACAGCCACGTAGCTCCACTCCACGGCAACCACAA
CGACCAATCACTGCTCACCTCAAATTCTCACTTAAAGCACTGAGAAACTTCTCCTTCATGGCGGATAATCGTCCAGGCCGCCGTCGATCCTATAAATTTCAGGTTCAATT
GAACGAACTAACCGGCGGCAACAATGGTGGCTATTTGTCCGTCTCACGGGGGGTTCCCGTCGCAATTAGGGCAGGGATGGAAGTCGGTAAGGGGGAGGCCTGATAGGAGG
AGCTCCAATTTGAGAGTAAGGGCGGAGAAGGGCGAGGAGAGAGAAAGCGGGGGAGGAGAGAATAAGAAATCGCTGTTCAGCAGCGTGACTGAGGCGTTGGATTTCTCAGC
AGTCCGATCGACTCGCGACGCCGAGCTCCTCGACGATGCCCGCCAGGCCACCAAAGCCGGCGGCAGAATGAGCCGTGAACAGTATGGAGCTTTAAGAAGGAAAATAGGAG
GGACCTACAAGGACTTCTTCAAATCTTACATAGAAGTGGATGGACAATATGTTGAAGAAGGGTGGGTGGACAAGACATGCAAGGTGTGCAAGAAGGACACGAGGGGAGAA
GCAAGGCAAGTTGACAAACTTGGAAGATATGTTCATGTTGCTTGTTTGGAGAAGTCCAACTCTTCCCCGGGGAATTTCTTTTCCAAGTTCTTCTCCAAATGAAATTAAAA
AAAAAAAAAACAAACTTAGGTTTAAGTATATAAATTGGACTGTAGAAATTTGAATTTGTGTAACGTGTATTTAGTTCGTAAGTTTATGATTTATACCTCGAAGTTAACAT
TGTTGTAGTATATAAGTTTTGAGTTTCATAAACAATAAAGTTTTATGTATACACAAATTGTGAGAGGTAGATGAAATAGGTTTTGGGGTTTAGAG
Protein sequenceShow/hide protein sequence
MVAICPSHGGFPSQLGQGWKSVRGRPDRRSSNLRVRAEKGEERESGGGENKKSLFSSVTEALDFSAVRSTRDAELLDDARQATKAGGRMSREQYGALRRKIGGTYKDFFK
SYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSPGNFFSKFFSK