; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020104 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020104
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGATA zinc finger domain-containing protein isoform 1
Genome locationChr04:28810655..28816237
RNA-Seq ExpressionHG10020104
SyntenyHG10020104
Gene Ontology termsNA
InterPro domainsIPR023213 - Chloramphenicol acetyltransferase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149221.3 uncharacterized protein LOC101208906 [Cucumis sativus]1.5e-25091.42Show/hide
Query:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
        MSDQS PPP  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
Subjt:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR

Query:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE
        AIASHPDA+DPSVSDFHKIHEHEIN   WFDP HPSYSDTDVMFA VYTVS+SQWAVFL LHTATCDRAAAAALLRELLVLAA GGEIEGGGFE GDNGE
Subjt:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE
        +GLGIEDLIPNGKANK LWARG DMLGYSLNSFRLANLEFKD N+ERFSQMIRL+MNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD LPP+Q E
Subjt:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE T+WEVA RCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        I+  SGP QQ+LGLHDYIG ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSRDQMNRIFD+MKKIL N+++EV EG
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

XP_008442855.1 PREDICTED: uncharacterized protein LOC103486623 [Cucumis melo]1.7e-24991.84Show/hide
Query:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
        MSDQS PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDP RRDFSFLIP SP LHLQILDLAAT R
Subjt:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR

Query:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE
        AIASHPDA+DPSVSDFHKIHEHEIN   WFDP HPSYSDTDVMFA VYTVS+SQWAVFL LHTATCDRAAAAALLRELLVLAA GGEIEGG FEIGDNGE
Subjt:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE
        IGLGIEDLIPNGKANK LWARG DMLGYSLNSFRLANLEFKD NSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD LPP+Q E
Subjt:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE  LWEVA RCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        I+ TSGP  Q+LGL+DYIG ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSRDQMN+IFDEMKKIL N+A+EV EG
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

XP_023005679.1 uncharacterized protein LOC111498610 [Cucurbita maxima]7.4e-23786.53Show/hide
Query:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI
        S+ S PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLL+SKPPD+P+LQS+LH+LQNLHPIL SKIH+DP RRDFSFL PPSPPLHLQILDL ATARAI
Subjt:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI

Query:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG
        ASHPDA+DPSVSDFHKI EHEIN+ TW DPNHPSYSDTDVMFA+VYT+SD QW VFLRLHTA CDR AA ALLREL  LAAA GE EGG FEIGD+GEIG
Subjt:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG

Query:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA
        LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD T+KLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+  EKY 
Subjt:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA

Query:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT
        VVTLNDCRSLL+PPLT+HHLGFYHSAILNTHD+SAE TLW+VAKRCYF+FSNAKDNNKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPI + 
Subjt:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT

Query:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
         S PAQ+HLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYP PLFSRDQMN+I  +MKKIL   A+EV EG
Subjt:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

XP_023540823.1 uncharacterized protein LOC111801084 [Cucurbita pepo subsp. pepo]2.2e-23686.74Show/hide
Query:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI
        S+ S PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLL+SKPPD+P+LQS+LH+LQNLHPILRSKIH+DPSRRDFSFL PPSP LHLQILDL A ARAI
Subjt:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI

Query:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG
        ASHPDA+DPSVSDFHKI EHEIN+ TW DPN+PSYSDTDVMFA+VYT++D QWAVFLRLHTA CDR AA ALLREL  LAAA GE EGG FEIGD+GEIG
Subjt:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG

Query:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA
        LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD T+KLLAGCKLRGIK+CGALAAAGLIATRCSKDLP +  EKY 
Subjt:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA

Query:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT
        VVTLNDCRSLL+PPLT+HHLGFYHSAILNTHDISAE TLWEVAKRCYF+FSNAKDNNKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPI + 
Subjt:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT

Query:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
         S PAQ++LGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYP PLFSRDQMN+I DEMKKIL   A+EV EG
Subjt:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

XP_038905440.1 uncharacterized protein LOC120091472 [Benincasa hispida]2.5e-26194.54Show/hide
Query:  MSDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARA
        MSDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLI PSPPLHLQILDL ATARA
Subjt:  MSDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARA

Query:  IASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEI
        IASHPDANDPSVSDFHKIHE EIN ATWFDPNHPSYSDTDVMFA VYT+SDSQWA+FLRLHTATCDRAAAAALLRELLVL A GGEIEGGGFEIGDNGEI
Subjt:  IASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEI

Query:  GLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKY
        GLGIEDLIPNGKANK LWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNS ETQKLLAGCKLRG+KLCGALAAAGL+ATRCSKDLP HQ EKY
Subjt:  GLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKY

Query:  AVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIID
        AVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE TLWEVAKRCYFS+SNAKDNNKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPIID
Subjt:  AVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIID

Query:  TTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
         TSGP QQ+LGLHDY GCASAHGVGPSIA FDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKIL NNAMEV EG
Subjt:  TTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

TrEMBL top hitse value%identityAlignment
A0A0A0LGP2 Uncharacterized protein7.5e-25191.42Show/hide
Query:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
        MSDQS PPP  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
Subjt:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR

Query:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE
        AIASHPDA+DPSVSDFHKIHEHEIN   WFDP HPSYSDTDVMFA VYTVS+SQWAVFL LHTATCDRAAAAALLRELLVLAA GGEIEGGGFE GDNGE
Subjt:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE
        +GLGIEDLIPNGKANK LWARG DMLGYSLNSFRLANLEFKD N+ERFSQMIRL+MNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD LPP+Q E
Subjt:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE T+WEVA RCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        I+  SGP QQ+LGLHDYIG ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSRDQMNRIFD+MKKIL N+++EV EG
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

A0A1S3B7G8 uncharacterized protein LOC1034866238.3e-25091.84Show/hide
Query:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
        MSDQS PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDP RRDFSFLIP SP LHLQILDLAAT R
Subjt:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR

Query:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE
        AIASHPDA+DPSVSDFHKIHEHEIN   WFDP HPSYSDTDVMFA VYTVS+SQWAVFL LHTATCDRAAAAALLRELLVLAA GGEIEGG FEIGDNGE
Subjt:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE
        IGLGIEDLIPNGKANK LWARG DMLGYSLNSFRLANLEFKD NSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD LPP+Q E
Subjt:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE  LWEVA RCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        I+ TSGP  Q+LGL+DYIG ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSRDQMN+IFDEMKKIL N+A+EV EG
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

A0A5A7TQM7 GATA zinc finger domain-containing protein isoform 13.0e-23688.08Show/hide
Query:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR
        MSDQS PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDIPHLQSSLHTLQNLHPILRSKIHHDP RRDFSFLIP SP LHLQILDLAAT R
Subjt:  MSDQS-PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATAR

Query:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE
        AIASHPDA+DPSVSDFHKIHEHEIN   WFDP HPSYSDTDVMFA VYTVS+SQWAVFL LHTATCDRAAAAALLRELLVLAA GGEIEGG FEIGDNGE
Subjt:  AIASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGE

Query:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE
        IGLGIEDLIPNGKANK LWARG DMLGYSLNSFRLANLEFKD NSERFSQMI                  RGIKLCGALAAAGLIATRCSKD LPP+Q E
Subjt:  IGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKD-LPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAE  LWEVA RCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        I+ TSGP  Q+LGL+DYIG ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSRDQMN+IFDEMKKIL N+A+EV EG
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

A0A6J1J6C5 uncharacterized protein LOC1114816321.8e-23687.34Show/hide
Query:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDANDP
        E + RPVGGTEHSWCRAVPGGTGTTVLGLL+SKPPDI HLQ+SLH LQNLHPILRSKIHHDPSRRDFSFLIPPSP +HLQILDLAA ARAIASHPDA+DP
Subjt:  ESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDANDP

Query:  SVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIGLGIEDLIPN
        S+SDFHKI EHEIN A W +P+HPSYSDTDVMFA VY VSD QWAVFL LHTA CDR AAA+LLRELLVL AA G+IEGGGF+IGDNGEIG GIEDLIP+
Subjt:  SVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIGLGIEDLIPN

Query:  GKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYAVVTLNDCRS
        GKA+KPLWARGLDMLGYSLNSFR ANLEFKDA+SERFSQMIRLK+NSDETQKLLAGCK RGIKLCGAL AAGLIATRCSKDLPPHQTEKYAVVTL DCRS
Subjt:  GKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYAVVTLNDCRS

Query:  LLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDTTSGPAQQHL
        LLDPPLT+HHLGFYHSAILNTHDISAE TLWEV++RCYFSFSNAKDNNKHF+DMSDLNFLM KAIENP LTPSSSMRTALIS FEDPII  TS PAQQHL
Subjt:  LLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDTTSGPAQQHL

Query:  GLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
        G+ DYIGCASAHGVGPSIA FD+IRDGQLDCACVYPSPLFSRDQMN++FDEMKKIL ++AMEV EG
Subjt:  GLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

A0A6J1KZY5 uncharacterized protein LOC1114986103.6e-23786.53Show/hide
Query:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI
        S+ S PPAGESKSRPVGGTE+SWCRA PGGTGTTVLGLL+SKPPD+P+LQS+LH+LQNLHPIL SKIH+DP RRDFSFL PPSPPLHLQILDL ATARAI
Subjt:  SDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAI

Query:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG
        ASHPDA+DPSVSDFHKI EHEIN+ TW DPNHPSYSDTDVMFA+VYT+SD QW VFLRLHTA CDR AA ALLREL  LAAA GE EGG FEIGD+GEIG
Subjt:  ASHPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG

Query:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA
        LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSD T+KLLAGCKLRGIK+CGALAAAGLIATRCSKDLPP+  EKY 
Subjt:  LGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYA

Query:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT
        VVTLNDCRSLL+PPLT+HHLGFYHSAILNTHD+SAE TLW+VAKRCYF+FSNAKDNNKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPI + 
Subjt:  VVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDT

Query:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG
         S PAQ+HLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYP PLFSRDQMN+I  +MKKIL   A+EV EG
Subjt:  TSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52610.1 unknown protein7.1e-13754.06Show/hide
Query:  PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQI--LDLAATARAIAS
        P    +S +RPVGGTE+SWCRA+ GGTG  V+ LL+S+ P + +LQ++L  LQ  HP LRS I  D S   FSF++  +   H++I   D  +TA+ I  
Subjt:  PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQI--LDLAATARAIAS

Query:  HPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSD--SQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG
          D++DP       I EHE+N  TW +P+    S++ V   ++Y ++D   Q  +  RL+TA  DR AA  LLRE +   AA G    G         +G
Subjt:  HPDANDPSVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSD--SQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIG

Query:  LG--IEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDA-NSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTE
        LG  IE+LIP+GK +KP WARG+D+LGYSLN+FR +NL F DA NS R SQ++RLK++ D+T KL+AGCK RG+KL  ALA++ LIA   SK+LPP+Q E
Subjt:  LG--IEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDA-NSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTE

Query:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI
        KYAVVTL+DCRS+L+PPLTS+  GFYH+ IL+THD++ E  LW++AKRCY SF+++K++NK F+DMSDLNFLMCKAIENP+LTPSSS+RTA IS+FEDP+
Subjt:  KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPI

Query:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKIL
        ID +  P    LG+ DYIGCAS HGVGPS+A FD +RDG+LDCA VYPSPL SR+QM+ +   MK IL
Subjt:  IDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGATCAATCTCCTCCTCCCGCCGGCGAGTCCAAGTCCCGTCCCGTCGGGGGCACCGAGCACAGCTGGTGCCGCGCGGTCCCCGGCGGCACCGGCACCACCGTCCT
CGGCCTACTCGTCTCAAAACCTCCCGATATTCCTCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCATCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCC
GACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTCCACCTCCAGATTCTCGACCTCGCCGCAACTGCACGCGCTATCGCCTCTCATCCCGATGCCAACGATCCT
TCCGTCTCCGATTTCCACAAGATCCACGAGCACGAGATCAACCTCGCCACGTGGTTTGATCCAAACCATCCGTCGTACTCCGACACTGATGTGATGTTCGCTAACGTCTA
CACCGTAAGCGATAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCCGCGCTGTTGAGAGAACTGCTCGTGCTTGCGGCGGCCG
GAGGAGAAATTGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATTGGATTAGGGATTGAAGATCTAATCCCTAACGGTAAAGCGAATAAGCCTCTGTGGGCGCGT
GGATTAGACATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAGAGATTTTCTCAGATGATTAGGTTGAAGATGAACTC
CGATGAGACTCAAAAACTTCTCGCTGGTTGCAAATTGAGAGGCATTAAGCTGTGTGGAGCTCTGGCAGCTGCTGGATTGATTGCCACTCGTTGTTCTAAGGACCTTCCTC
CTCACCAAACGGAAAAATATGCTGTTGTTACCCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACGAGCCACCATTTAGGATTTTATCACTCTGCCATCCTCAAC
ACACATGACATATCAGCTGAACATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAACAACAAGCATTTCTCAGACATGTCTGACTT
AAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCATCTTCATCCATGAGAACGGCCCTCATCTCGGTCTTCGAAGACCCCATCATCGACACTACTTCCG
GTCCCGCGCAGCAGCACCTCGGCCTACACGACTACATTGGCTGTGCCTCTGCACACGGTGTGGGGCCATCAATTGCCTTCTTCGACATGATTCGTGACGGTCAGTTGGAT
TGTGCTTGTGTGTACCCGTCGCCTTTGTTTTCCCGAGATCAAATGAACCGAATTTTTGATGAGATGAAGAAAATTCTGGCGAATAATGCCATGGAAGTAGCTGAAGGCTA
A
mRNA sequenceShow/hide mRNA sequence
ATGTCGGATCAATCTCCTCCTCCCGCCGGCGAGTCCAAGTCCCGTCCCGTCGGGGGCACCGAGCACAGCTGGTGCCGCGCGGTCCCCGGCGGCACCGGCACCACCGTCCT
CGGCCTACTCGTCTCAAAACCTCCCGATATTCCTCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCATCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCC
GACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTCCACCTCCAGATTCTCGACCTCGCCGCAACTGCACGCGCTATCGCCTCTCATCCCGATGCCAACGATCCT
TCCGTCTCCGATTTCCACAAGATCCACGAGCACGAGATCAACCTCGCCACGTGGTTTGATCCAAACCATCCGTCGTACTCCGACACTGATGTGATGTTCGCTAACGTCTA
CACCGTAAGCGATAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCCGCGCTGTTGAGAGAACTGCTCGTGCTTGCGGCGGCCG
GAGGAGAAATTGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATTGGATTAGGGATTGAAGATCTAATCCCTAACGGTAAAGCGAATAAGCCTCTGTGGGCGCGT
GGATTAGACATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAGAGATTTTCTCAGATGATTAGGTTGAAGATGAACTC
CGATGAGACTCAAAAACTTCTCGCTGGTTGCAAATTGAGAGGCATTAAGCTGTGTGGAGCTCTGGCAGCTGCTGGATTGATTGCCACTCGTTGTTCTAAGGACCTTCCTC
CTCACCAAACGGAAAAATATGCTGTTGTTACCCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACGAGCCACCATTTAGGATTTTATCACTCTGCCATCCTCAAC
ACACATGACATATCAGCTGAACATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAACAACAAGCATTTCTCAGACATGTCTGACTT
AAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCATCTTCATCCATGAGAACGGCCCTCATCTCGGTCTTCGAAGACCCCATCATCGACACTACTTCCG
GTCCCGCGCAGCAGCACCTCGGCCTACACGACTACATTGGCTGTGCCTCTGCACACGGTGTGGGGCCATCAATTGCCTTCTTCGACATGATTCGTGACGGTCAGTTGGAT
TGTGCTTGTGTGTACCCGTCGCCTTTGTTTTCCCGAGATCAAATGAACCGAATTTTTGATGAGATGAAGAAAATTCTGGCGAATAATGCCATGGAAGTAGCTGAAGGCTA
A
Protein sequenceShow/hide protein sequence
MSDQSPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLVSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDANDP
SVSDFHKIHEHEINLATWFDPNHPSYSDTDVMFANVYTVSDSQWAVFLRLHTATCDRAAAAALLRELLVLAAAGGEIEGGGFEIGDNGEIGLGIEDLIPNGKANKPLWAR
GLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDLPPHQTEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILN
THDISAEHTLWEVAKRCYFSFSNAKDNNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIDTTSGPAQQHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLD
CACVYPSPLFSRDQMNRIFDEMKKILANNAMEVAEG