; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022538 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022538
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionG-box-binding factor 4-like isoform X1
Genome locationChr05:25238863..25240890
RNA-Seq ExpressionHG10022538
SyntenyHG10022538
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR043452 - Plant bZIP transcription factors


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK31528.1 G-box-binding factor 4-like isoform X2 [Cucumis melo var. makuwa]5.1e-4777.12Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN VEI ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ

XP_008461925.1 PREDICTED: G-box-binding factor 4-like isoform X1 [Cucumis melo]5.1e-4775.48Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVA
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ++
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVA

XP_008461926.1 PREDICTED: G-box-binding factor 4-like isoform X2 [Cucumis melo]8.7e-4775.97Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ+
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

XP_008461927.1 PREDICTED: G-box-binding factor 4-like isoform X3 [Cucumis melo]1.0e-4768.57Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ-----VAIFTQASFPEIITKGK
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ     + +  +   P++I  G+
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ-----VAIFTQASFPEIITKGK

XP_022981612.1 G-box-binding factor 4-like isoform X1 [Cucurbita maxima]2.4e-3669.28Show/hide
Query:  MDGKFDTSSSASKTVDDLWRELKEEAVEEMI-LEGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVEISAR-GKRRRVAMEPMDEAALQRQ
        MD K    S+AS+ VDD+WR   E+AVEEM+  E F+  KA  +DVRIL NPL+C   F+    +E IVGFGNG EIS R GKRRR  MEPMDEAALQRQ
Subjt:  MDGKFDTSSSASKTVDDLWRELKEEAVEEMI-LEGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVEISAR-GKRRRVAMEPMDEAALQRQ

Query:  RRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        RRMIKNRESAARSRERK AHQVELE IA+RLEEEN RLLK+KAER KERLKQ+
Subjt:  RRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

TrEMBL top hitse value%identityAlignment
A0A1S3CFQ6 G-box-binding factor 4-like isoform X12.5e-4775.48Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVA
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ++
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVA

A0A1S3CGA7 G-box-binding factor 4-like isoform X24.2e-4775.97Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ+
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

A0A1S3CH58 G-box-binding factor 4-like isoform X35.0e-4868.57Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN V+I ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ-----VAIFTQASFPEIITKGK
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ     + +  +   P++I  G+
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ-----VAIFTQASFPEIITKGK

A0A5D3E6S6 G-box-binding factor 4-like isoform X22.5e-4777.12Show/hide
Query:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR
        +DG+F TS  SS SKTVDDLWR+LKEE+VE++IL                 NPLSCLKDFD VYV D+E VGFGN VEI ARGKRRRVAMEPMD+AALQR
Subjt:  MDGKFDTS--SSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYV-DEEIVGFGNGVEISARGKRRRVAMEPMDEAALQR

Query:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ
        QRRMIKNRESAARSRERKQAHQ+ELESIASRLEEENERLLKEKAERSKERLKQ
Subjt:  QRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ

A0A6J1IUG5 G-box-binding factor 4-like isoform X11.2e-3669.28Show/hide
Query:  MDGKFDTSSSASKTVDDLWRELKEEAVEEMI-LEGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVEISAR-GKRRRVAMEPMDEAALQRQ
        MD K    S+AS+ VDD+WR   E+AVEEM+  E F+  KA  +DVRIL NPL+C   F+    +E IVGFGNG EIS R GKRRR  MEPMDEAALQRQ
Subjt:  MDGKFDTSSSASKTVDDLWRELKEEAVEEMI-LEGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVEISAR-GKRRRVAMEPMDEAALQRQ

Query:  RRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        RRMIKNRESAARSRERK AHQVELE IA+RLEEEN RLLK+KAER KERLKQ+
Subjt:  RRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

SwissProt top hitse value%identityAlignment
P42777 G-box-binding factor 47.7e-2250Show/hide
Query:  SSSASKTVDDLWRE----------LKEEAVEE-MILEGFLQAKAHDQ------DVRI----LNNPLSCLKDFDTV-YVDEEIV--GFGNGVEISARGKRR
        S +  K+VDD+W+E          +KEE  E+ M LE FL     D+      DV+I    LNN  S   DF    +   ++V    G GV    RGKR 
Subjt:  SSSASKTVDDLWRE----------LKEEAVEE-MILEGFLQAKAHDQ------DVRI----LNNPLSCLKDFDTV-YVDEEIV--GFGNGVEISARGKRR

Query:  RVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        RV ME MD+AA QRQ+RMIKNRESAARSRERKQA+QVELE++A++LEEENE+LLKE  E +KER K++
Subjt:  RVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

Q0JHF1 bZIP transcription factor 127.5e-1743.15Show/hide
Query:  EMILEGFLQAK-AHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVE----ISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELE
        EM LE FL  + A  +D  ++ +P        +    + ++GF NG E    ++    R+R  M+PMD AA+QRQ+RMIKNRESAARSRERKQA+  ELE
Subjt:  EMILEGFLQAK-AHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVE----ISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELE

Query:  SIASRLEEENERLLKEKAERSKERLKQVAIFTQASFPEIITKGKQR
        S+ ++LEEEN ++ KE+ E+ ++RLK++    +   P II K   R
Subjt:  SIASRLEEENERLLKEKAERSKERLKQVAIFTQASFPEIITKGKQR

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 31.8e-0733.54Show/hide
Query:  ASKTVDDLWRELKEE--------------------AVEEMIL------------EGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDE--EIVGFGNGVEI
        + KTVD++WR+++++                     +E+++L            E  +   ++ Q V   + P    + F T  V E  ++V  G   + 
Subjt:  ASKTVDDLWRELKEE--------------------AVEEMIL------------EGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDE--EIVGFGNGVEI

Query:  SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK
             R+RVA E +++   +RQ+RMIKNRESAARSR RKQA+  ELE   SRLEEENE+L + K
Subjt:  SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 21.1e-0760.66Show/hide
Query:  GKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK
        G++R  + E +++   +RQ+RMIKNRESAARSR RKQA+  ELE   SRLEEENERL K+K
Subjt:  GKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK

Q9SJN0 Protein ABSCISIC ACID-INSENSITIVE 52.2e-0843Show/hide
Query:  PLSCLKDFDTVYVDEEIVGFGNGVEISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ
        PLS +      +   + +G   GV++     R+RV   P+++   +RQRRMIKNRESAARSR RKQA+ VELE+  ++L+EEN +L    AE  ++R +Q
Subjt:  PLSCLKDFDTVYVDEEIVGFGNGVEISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ

Arabidopsis top hitse value%identityAlignment
AT1G03970.1 G-box binding factor 45.5e-2350Show/hide
Query:  SSSASKTVDDLWRE----------LKEEAVEE-MILEGFLQAKAHDQ------DVRI----LNNPLSCLKDFDTV-YVDEEIV--GFGNGVEISARGKRR
        S +  K+VDD+W+E          +KEE  E+ M LE FL     D+      DV+I    LNN  S   DF    +   ++V    G GV    RGKR 
Subjt:  SSSASKTVDDLWRE----------LKEEAVEE-MILEGFLQAKAHDQ------DVRI----LNNPLSCLKDFDTV-YVDEEIV--GFGNGVEISARGKRR

Query:  RVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV
        RV ME MD+AA QRQ+RMIKNRESAARSRERKQA+QVELE++A++LEEENE+LLKE  E +KER K++
Subjt:  RVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQV

AT2G36270.1 Basic-leucine zipper (bZIP) transcription factor family protein1.6e-0943Show/hide
Query:  PLSCLKDFDTVYVDEEIVGFGNGVEISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ
        PLS +      +   + +G   GV++     R+RV   P+++   +RQRRMIKNRESAARSR RKQA+ VELE+  ++L+EEN +L    AE  ++R +Q
Subjt:  PLSCLKDFDTVYVDEEIVGFGNGVEISARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQ

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein1.3e-0833.54Show/hide
Query:  ASKTVDDLWRELKEE--------------------AVEEMIL------------EGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDE--EIVGFGNGVEI
        + KTVD++WR+++++                     +E+++L            E  +   ++ Q V   + P    + F T  V E  ++V  G   + 
Subjt:  ASKTVDDLWRELKEE--------------------AVEEMIL------------EGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDE--EIVGFGNGVEI

Query:  SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK
             R+RVA E +++   +RQ+RMIKNRESAARSR RKQA+  ELE   SRLEEENE+L + K
Subjt:  SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK

AT3G56850.1 ABA-responsive element binding protein 37.7e-0960.66Show/hide
Query:  GKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK
        G++R  + E +++   +RQ+RMIKNRESAARSR RKQA+  ELE   SRLEEENERL K+K
Subjt:  GKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEK

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein1.2e-2546.15Show/hide
Query:  SSSASKTVDDLWRE--------LKEEAVEE-MILEGFL-----------QAKAHDQDVRI------------LNNPLSCLKDFDTVYVDEEIVGFGNGVE
        ++   K+VD++WRE        +KEE  EE M LE FL            A A D DV+I             +NP   +       V+  IV FGNG++
Subjt:  SSSASKTVDDLWRE--------LKEEAVEE-MILEGFL-----------QAKAHDQDVRI------------LNNPLSCLKDFDTVYVDEEIVGFGNGVE

Query:  I---SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVAIF
        +    ARGKR RV +EP+D+AA QRQRRMIKNRESAARSRERKQA+QVELE++A++LEEENE L KE  ++ KER +++  F
Subjt:  I---SARGKRRRVAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVAIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGTAAGTTCGATACCTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAGGGAGTTGAAGGAGGAGGCTGTTGAAGAGATGATATTAGAGGGTTTTCTTCA
AGCCAAAGCACATGATCAGGATGTGAGGATTTTGAATAATCCGTTGAGTTGTTTAAAGGATTTCGATACGGTTTATGTTGATGAGGAGATTGTTGGATTTGGTAATGGAG
TTGAAATTAGTGCGAGAGGGAAGAGAAGGCGTGTAGCCATGGAGCCGATGGATGAAGCTGCTCTGCAAAGACAACGGAGGATGATTAAGAACAGGGAATCTGCCGCTAGG
TCCAGAGAAAGGAAACAAGCACATCAAGTTGAGTTAGAGTCAATAGCTTCGAGACTTGAGGAAGAGAACGAGCGATTATTGAAAGAGAAGGCTGAGAGATCCAAGGAACG
ACTAAAGCAGGTAGCAATCTTTACTCAAGCTTCTTTTCCGGAAATAATTACTAAAGGAAAACAGAGAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGTAAGTTCGATACCTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAGGGAGTTGAAGGAGGAGGCTGTTGAAGAGATGATATTAGAGGGTTTTCTTCA
AGCCAAAGCACATGATCAGGATGTGAGGATTTTGAATAATCCGTTGAGTTGTTTAAAGGATTTCGATACGGTTTATGTTGATGAGGAGATTGTTGGATTTGGTAATGGAG
TTGAAATTAGTGCGAGAGGGAAGAGAAGGCGTGTAGCCATGGAGCCGATGGATGAAGCTGCTCTGCAAAGACAACGGAGGATGATTAAGAACAGGGAATCTGCCGCTAGG
TCCAGAGAAAGGAAACAAGCACATCAAGTTGAGTTAGAGTCAATAGCTTCGAGACTTGAGGAAGAGAACGAGCGATTATTGAAAGAGAAGGCTGAGAGATCCAAGGAACG
ACTAAAGCAGGTAGCAATCTTTACTCAAGCTTCTTTTCCGGAAATAATTACTAAAGGAAAACAGAGAATATAA
Protein sequenceShow/hide protein sequence
MDGKFDTSSSASKTVDDLWRELKEEAVEEMILEGFLQAKAHDQDVRILNNPLSCLKDFDTVYVDEEIVGFGNGVEISARGKRRRVAMEPMDEAALQRQRRMIKNRESAAR
SRERKQAHQVELESIASRLEEENERLLKEKAERSKERLKQVAIFTQASFPEIITKGKQRI