; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014723 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014723
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGATA-N domain-containing protein
Genome locationChr02:18657624..18660115
RNA-Seq ExpressionHG10014723
SyntenyHG10014723
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146444.1 uncharacterized protein LOC101211273 [Cucumis sativus]7.3e-8192.05Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAI PSHGGFPSQLGQGWK VRGRPD RS NLRV    AEKGEERESG GGENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSK FSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

XP_008456929.1 PREDICTED: uncharacterized protein LOC103496731 [Cucumis melo]1.0e-8294.32Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQLGQGWK VRGRPD RS NLRV+   AEKGEERES  GGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

XP_022941869.1 uncharacterized protein LOC111447101 [Cucurbita moschata]6.6e-7484.09Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ GQGWK  R   P RS NLRV AK+ EKGEE+E G   +NK+SLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+K F K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

XP_023532465.1 uncharacterized protein LOC111794620 [Cucurbita pepo subsp. pepo]1.5e-7384.09Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ GQGWK  R   P RS NLRV AKA EKGEE+E     +NK+SLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+K F K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

XP_038891315.1 uncharacterized protein LOC120080762 [Benincasa hispida]6.8e-8796.02Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPDRSFNLRVLAKAAEKGEERES-GDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQL QGWK VRGRPDRS NLRVLAKAAEKGEERES G GGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPDRSFNLRVLAKAAEKGEERES-GDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKS SSSGNFFSKLFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

TrEMBL top hitse value%identityAlignment
A0A0A0KM41 GATA-N domain-containing protein3.5e-8192.05Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAI PSHGGFPSQLGQGWK VRGRPD RS NLRV    AEKGEERESG GGENKKSLFSSVTEALDFSAVRS+RDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSS GNFFSK FSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

A0A1S3C3X8 uncharacterized protein LOC1034967314.9e-8394.32Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQLGQGWK VRGRPD RS NLRV+   AEKGEERES  GGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

A0A5D3DFR3 GATA-N domain-containing protein4.9e-8394.32Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSHGGFPSQLGQGWK VRGRPD RS NLRV+   AEKGEERES  GGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATK+GGRMTREQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRGRPD-RSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

A0A6J1FPP4 uncharacterized protein LOC1114471013.2e-7484.09Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPSH GF SQ GQGWK  R   P RS NLRV AK+ EKGEE+E G   +NK+SLFSSVTEALDFS VRSSRDAELLDDARQATK+GGRM+REQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+K F K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

A0A6J1K2I3 uncharacterized protein LOC1114918082.7e-7383.52Show/hide
Query:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA
        MVAISPS+ GF SQ GQGWK  R   P RS NLRV AKA EKGEE+E G   +NK+SLFSSVTEALDFS VRSSRDAELL+DARQATKSGGRM++EQYGA
Subjt:  MVAISPSHGGFPSQLGQGWKVVRG-RPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGA

Query:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRY HVACLEKSNSSS NFF+K F K
Subjt:  LRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56290.1 unknown protein1.3e-5169.72Show/hide
Query:  VLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCK
        V++   EK E+++     E + SLF+ +T+ALDFS VRS +DAELL +AR+ATKSG +MT+EQYGALRRKIGGTYKDFFKSY+EVDGQYVEEGWVDKTCK
Subjt:  VLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYKDFFKSYIEVDGQYVEEGWVDKTCK

Query:  VCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK
        +CKKDT+GEARQVDKLGRY HV+CL+  N  SGNFF++LFS+
Subjt:  VCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCGATTTCTCCGTCCCACGGCGGGTTCCCGTCACAATTAGGGCAGGGATGGAAGGTGGTAAGGGGGAGGCCTGATAGGAGCTTCAATTTGAGAGTATTGGCAAA
AGCGGCGGAGAAGGGCGAGGAGAGAGAAAGCGGCGACGGAGGAGAGAATAAGAAATCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCCGCGGTGCGATCGAGTC
GCGACGCCGAGCTCCTCGATGATGCTCGTCAGGCCACCAAATCCGGCGGCAGAATGACCCGGGAACAGTATGGAGCTTTAAGAAGGAAAATAGGAGGGACCTACAAGGAC
TTCTTCAAATCTTACATAGAAGTGGATGGGCAATATGTTGAAGAAGGGTGGGTGGACAAAACATGCAAGGTGTGCAAGAAGGACACAAGGGGAGAAGCAAGACAAGTTGA
CAAACTTGGAAGATATGTACATGTTGCTTGTTTGGAGAAATCCAACTCTTCCTCAGGGAATTTCTTCTCCAAGCTCTTCTCCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCGATTTCTCCGTCCCACGGCGGGTTCCCGTCACAATTAGGGCAGGGATGGAAGGTGGTAAGGGGGAGGCCTGATAGGAGCTTCAATTTGAGAGTATTGGCAAA
AGCGGCGGAGAAGGGCGAGGAGAGAGAAAGCGGCGACGGAGGAGAGAATAAGAAATCGCTGTTCAGCAGCGTGACGGAGGCGTTGGATTTCTCCGCGGTGCGATCGAGTC
GCGACGCCGAGCTCCTCGATGATGCTCGTCAGGCCACCAAATCCGGCGGCAGAATGACCCGGGAACAGTATGGAGCTTTAAGAAGGAAAATAGGAGGGACCTACAAGGAC
TTCTTCAAATCTTACATAGAAGTGGATGGGCAATATGTTGAAGAAGGGTGGGTGGACAAAACATGCAAGGTGTGCAAGAAGGACACAAGGGGAGAAGCAAGACAAGTTGA
CAAACTTGGAAGATATGTACATGTTGCTTGTTTGGAGAAATCCAACTCTTCCTCAGGGAATTTCTTCTCCAAGCTCTTCTCCAAATGA
Protein sequenceShow/hide protein sequence
MVAISPSHGGFPSQLGQGWKVVRGRPDRSFNLRVLAKAAEKGEERESGDGGENKKSLFSSVTEALDFSAVRSSRDAELLDDARQATKSGGRMTREQYGALRRKIGGTYKD
FFKSYIEVDGQYVEEGWVDKTCKVCKKDTRGEARQVDKLGRYVHVACLEKSNSSSGNFFSKLFSK