CuGenDBv2

HG10022998 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022998
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: GATA transcription factor-like protein
Genome location: Chr05:30336999..30338123
RNA-Seq Expression: HG10022998
Synteny: HG10022998
Gene Ontology terms: NA
InterPro domains: NA
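The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates; this is an assumption,
        # not something the record itself states.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr05:30336999..30338123'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("Chr05:30336999..30338123")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 1125 bp on Chr05.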


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia] | 7.5e-105 | 82.77
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV DDVSCIG  GGP     KNR  + KEQE+D R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSL+RATEIWK+NAMRGDPDAPQSRVLRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

KAG7014335.1 hypothetical protein SDJN02_24512 [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-103 | 82.35
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQEED R+YYKHHKASPLAEIEF DTRKPIT ATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSL+RATEIWK+NAMRGDPDAPQSRVLRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo] | 8.3e-104 | 82.35
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW+F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AK+NYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQEED R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSLRRA EIWK+NAMRGDPDAPQSRVLRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

XP_038899333.1 uncharacterized protein LOC120086662 isoform X1 [Benincasa hispida] | 2.4e-111 | 86.97
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        MQS+LTAIAPKSNWA  LAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGEPE SQ+NLEPDN K+NYER+D   GDSNGPFG PKAQ+ASSPRLET 
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
        VVGQASKPITQQKR  STVTD+VSCIGVYGGPLE+  +NR TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSLRRATEIWK+NAMRGDPDAPQSR+LRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

XP_038899334.1 uncharacterized protein LOC120086662 isoform X2 [Benincasa hispida] | 8.6e-109 | 86.13
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        MQS+LTAIAPKSNWA  LAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGEPE   +NLEPDN K+NYER+D   GDSNGPFG PKAQ+ASSPRLET 
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
        VVGQASKPITQQKR  STVTD+VSCIGVYGGPLE+  +NR TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSLRRATEIWK+NAMRGDPDAPQSR+LRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

TrEMBL top hits (hit | e value | % identity | alignment)
A0A0A0K9G7 Uncharacterized protein | 2.0e-103 | 82.38
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSK-GDSNGPFGPPKAQYASS
        MQS L AIAPKSNWAF++ QFQ LRRGG  LTT RTADPS+HAN   DDNDPAVLSGEPERSQ+NLEPDNAK+NY+R D  K GDS GPFG P AQ+ASS
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSK-GDSNGPFGPPKAQYASS

Query:  PRLETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW
        PRLETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLE+  +NR TEMKE+EEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG    VIGW
Subjt:  PRLETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW

Query:  LPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        LPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSRVLRAL GE+F
Subjt:  LPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

A0A1S3BR22 uncharacterized protein LOC103492778 | 1.7e-102 | 79.6
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL
        MQS+L AIAP+SNWA ++ QFQ LRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQ+NLEPDNAK+NYE R+D  +GDSNGPFGP KAQ+ASSPRL
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL

Query:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE---------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG
        ETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLEE  ++R TEMK++E         EDNRDYYKHHKASPLAEIEF DTRKPITRATDGTA  G G
Subjt:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE---------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG

Query:  KDVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        K VIGWLPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSRVLRAL GE F
Subjt:  KDVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

A0A5D3D3D5 Uncharacterized protein | 4.2e-101 | 79.12
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL
        MQS+L AIAP+SNWA ++ Q Q LRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQ+NLEPDNAK+NYE R+D  +G SNGPFGP KAQ+ASSPRL
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQENLEPDNAKSNYE-REDSSKGDSNGPFGPPKAQYASSPRL

Query:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE--------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK
        ETTVVGQASKPITQQKRAHS   DDVSCIGVYGGPLEE  ++R TEMK++E        EDNRDYYKHHKASPLAEIEF DTRKPITRATDGTA  G GK
Subjt:  ETTVVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQE--------EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK

Query:  DVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
         VIGWLPEQ+DT +DSLRRATEIWK+NAMRGDPDAPQSRVLRAL GE F
Subjt:  DVIGWLPEQLDTAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

A0A6J1GPT4 uncharacterized protein LOC111456388 | 1.5e-103 | 81.51
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NW F LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD+AKSNYER+DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP     KNR  + KEQ++D R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKD+IGWLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSL+RATEIWK+NAMRGDPDAPQSRVLRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

A0A6J1JL82 uncharacterized protein LOC111487953 | 6.8e-104 | 82.77
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT
        M S+LTAIA K NWAF LAQFQRLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQ+NLEPD AK+NY  +DS +GDSNGPF PPKAQYASSPRLETT
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETT

Query:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRAHSTV  DVSCIG  GGP  E  KNR  + KEQEED R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVI WLPEQ D
Subjt:  VVGQASKPITQQKRAHSTVTDDVSCIGVYGGPLEEANKNR-TEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF
        T +DSLRRATEIWK+NAMRGDPDAPQSRVLRAL GEQF
Subjt:  TAEDSLRRATEIWKENAMRGDPDAPQSRVLRALHGEQF

SwissProt top hits (hit | e value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity | alignment)
AT1G02700.1 unknown protein | 9.6e-50 | 48.78
Query:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLET
        MQS+L A A  +         +RL  G  T+ RTADP +HA ND  DPA+   +PE   +   P  A      +         P  PPK+  A++ +LE+
Subjt:  MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLET

Query:  TVVGQASKPITQQKRAHSTVT----DDVSCIGVYGG--PLEEANKNRTEMKEQE-EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIG
        T VG  S+P  QQKR +ST +    D VSC G+ G   P +E        +E E E ++++YKHHKASPL+EIEFADTRKPIT+ATDGTAY  AGKDVIG
Subjt:  TVVGQASKPITQQKRAHSTVT----DDVSCIGVYGG--PLEEANKNRTEMKEQE-EDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKENAMRGDPDA-PQSRVLRALHGEQF
        WLPEQLDTAE+SL +AT I+K NA RGDP+  P SR+LR + GE F
Subjt:  WLPEQLDTAEDSLRRATEIWKENAMRGDPDA-PQSRVLRALHGEQF


Sequences
CDS sequence
ATGCAATCGAAATTGACGGCGATCGCACCGAAATCGAATTGGGCCTTCTATCTGGCCCAATTCCAACGCCTCCGACGAGGTGGTCTGACGACATGTCGTACAGCTGACCC
TTCCGTTCACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACCCGAGAGATCACAGGAAAATTTAGAGCCAGATAATGCGAAATCCAATTACGAAAGGGAGG
ACTCTAGTAAGGGAGATTCAAATGGGCCGTTTGGGCCACCGAAGGCACAATACGCCTCCTCCCCTCGGTTAGAAACCACTGTAGTGGGCCAGGCCTCAAAGCCCATTACT
CAACAAAAAAGAGCCCACAGTACGGTGACCGACGACGTGAGTTGCATCGGCGTCTACGGCGGGCCTTTGGAGGAAGCGAATAAAAACAGAACTGAAATGAAAGAACAGGA
GGAAGACAATAGAGATTATTACAAGCACCACAAGGCGTCGCCGTTGGCGGAGATCGAGTTTGCGGATACTCGCAAGCCGATAACCAGAGCGACGGACGGGACGGCGTACG
ATGGGGCCGGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCGCTTCGGAGAGCGACGGAGATTTGGAAAGAAAATGCGATGCGTGGAGAT
CCCGATGCTCCACAGTCGAGGGTTCTTAGGGCTTTGCATGGTGAACAGTTTTAA
mRNA sequence
ATGCAATCGAAATTGACGGCGATCGCACCGAAATCGAATTGGGCCTTCTATCTGGCCCAATTCCAACGCCTCCGACGAGGTGGTCTGACGACATGTCGTACAGCTGACCC
TTCCGTTCACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACCCGAGAGATCACAGGAAAATTTAGAGCCAGATAATGCGAAATCCAATTACGAAAGGGAGG
ACTCTAGTAAGGGAGATTCAAATGGGCCGTTTGGGCCACCGAAGGCACAATACGCCTCCTCCCCTCGGTTAGAAACCACTGTAGTGGGCCAGGCCTCAAAGCCCATTACT
CAACAAAAAAGAGCCCACAGTACGGTGACCGACGACGTGAGTTGCATCGGCGTCTACGGCGGGCCTTTGGAGGAAGCGAATAAAAACAGAACTGAAATGAAAGAACAGGA
GGAAGACAATAGAGATTATTACAAGCACCACAAGGCGTCGCCGTTGGCGGAGATCGAGTTTGCGGATACTCGCAAGCCGATAACCAGAGCGACGGACGGGACGGCGTACG
ATGGGGCCGGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCGCTTCGGAGAGCGACGGAGATTTGGAAAGAAAATGCGATGCGTGGAGAT
CCCGATGCTCCACAGTCGAGGGTTCTTAGGGCTTTGCATGGTGAACAGTTTTAA
Protein sequence
MQSKLTAIAPKSNWAFYLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQENLEPDNAKSNYEREDSSKGDSNGPFGPPKAQYASSPRLETTVVGQASKPIT
QQKRAHSTVTDDVSCIGVYGGPLEEANKNRTEMKEQEEDNRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLDTAEDSLRRATEIWKENAMRGD
PDAPQSRVLRALHGEQF
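
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS above, followed by its stop codon.
print(translate("ATGCAATCGAAATTGTAA"))
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence (line breaks removed) is one way to confirm that the two fields agree.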