CuGenDBv2

HG10019903 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019903
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: 14 kDa zinc-binding protein
Genome location: Chr04:26660955..26665129
RNA-Seq Expression: HG10019903
Synteny: HG10019903
Gene Ontology terms:
GO:0006790 - sulfur compound metabolic process (biological process)
GO:0009150 - purine ribonucleotide metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domains:
IPR001310 - Histidine triad (HIT) protein
IPR011146 - HIT-like domain
IPR036265 - HIT-like superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0053944.1 14 kDa zinc-binding protein [Cucumis melo var. makuwa] | e-value 1.5e-52 | identity 99.08%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

XP_004136603.2 14 kDa zinc-binding protein [Cucumis sativus] | e-value 6.8e-53 | identity 100%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

XP_008443161.1 PREDICTED: 14 kDa zinc-binding protein [Cucumis melo] | e-value 1.5e-52 | identity 99.08%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

XP_022971276.1 14 kDa zinc-binding protein-like isoform X2 [Cucurbita maxima] | e-value 1.7e-51 | identity 96.33%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

XP_038905126.1 14 kDa zinc-binding protein [Benincasa hispida] | e-value 3.4e-52 | identity 99.08%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPS VVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC
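
The %identity values listed with each hit can be checked against the match line printed between the Query and Subjt rows. The sketch below assumes the usual BLAST pairwise text convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is computed over the full alignment length; the function name `percent_identity` is illustrative and is not part of CuGenDB. It uses the match line of the KAA0053944.1 hit, with its two rows joined.

```python
def percent_identity(match_line: str) -> float:
    """Percent identity from a BLAST-style match line.

    Assumption: letters are identical residues, '+' is a conservative
    substitution, and a space is a mismatch or gap column.
    """
    identical = sum(1 for ch in match_line if ch.isalpha())
    return 100.0 * identical / len(match_line)


# Match line of the KAA0053944.1 alignment (first row + second row).
match_line = (
    "MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV"
    "VINDGPSGC"
)

# 108 identical positions out of 109 aligned positions.
print(f"{percent_identity(match_line):.2f}")
```

With one `+` in 109 aligned positions this gives 108/109, which rounds to the 99.08 reported for that hit.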

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LC24 HIT domain-containing protein | e-value 3.3e-53 | identity 100%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

A0A1S3B859 14 kDa zinc-binding protein | e-value 7.4e-53 | identity 99.08%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

A0A5A7UFH9 14 kDa zinc-binding protein | e-value 7.4e-53 | identity 99.08%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

A0A6J1GGA0 14 kDa zinc-binding protein-like | e-value 4.0e-51 | identity 95.41%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH EILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

A0A6J1I2W3 14 kDa zinc-binding protein-like isoform X2 | e-value 8.1e-52 | identity 96.33%
Query:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV
        MASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV

Query:  VINDGPSGC
        VINDGPSGC
Subjt:  VINDGPSGC

SwissProt top hits (e-value, % identity, alignment)
P32084 Uncharacterized HIT-like protein Synpcc7942_1390 | e-value 9.0e-24 | identity 58.89%
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSG
        TIF KII +EIP+ +V+EDD  LAFRD+APQAP HIL+IP  K  ++ L +A   H  +LGHLL T K IA QEGL +G+R VIN GP+G
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSG

P42855 14 kDa zinc-binding protein (Fragment) | e-value 1.3e-38 | identity 81.32%
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGC
        TIF KII+KEIPSTVV+EDDKVLAFRDI PQ P HIL+IPKV+DGL+GL KAEERH +ILG LLYTAKL+AKQEGLD+GFR+VINDGP GC
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGC

P42856 14 kDa zinc-binding protein | e-value 4.6e-44 | identity 82.24%
Query:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI
        SSE+EAAL  +  DSPTIFDKII KEIPSTVV+ED+KVLAFRDI PQAPTHILIIPKVKDGL+GL+KAEERH EILG+LLY AK++AKQEGL+DG+RVVI
Subjt:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI

Query:  NDGPSGC
        NDGPSGC
Subjt:  NDGPSGC

Q8GUN2 Adenylylsulfatase HINT1 | e-value 1.7e-46 | identity 83.18%
Query:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI
        +SE+EAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYTAKL+AKQEGL +GFR+VI
Subjt:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI

Query:  NDGPSGC
        NDGP GC
Subjt:  NDGPSGC

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial | e-value 2.9e-22 | identity 50%
Query:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGP
        +A  AA    +PTIF +I+++ +P+ +++ED + LAFRD+APQAP H L+IP  K  +  +S+AEE   ++LGHLL  AK  AK EGL DG+R+VINDG 
Subjt:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGP

Query:  SG
         G
Subjt:  SG

Arabidopsis top hits (e-value, % identity, alignment)
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 2 | e-value 4.9e-33 | identity 59.46%
Query:  ASSEQEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGF
        A +E+ AA AA     + +PTIFDKII KEIPS +V+ED+ VLAFRDI PQAP H+L+IPK++DGL+ L KAE RH E+LG LL+ +K++A++EG+ DGF
Subjt:  ASSEQEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGF

Query:  RVVINDGPSGC
        RVVIN+G   C
Subjt:  RVVINDGPSGC

AT3G56490.1 HIS triad family protein 3 | e-value 1.2e-47 | identity 83.18%
Query:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI
        +SE+EAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYTAKL+AKQEGL +GFR+VI
Subjt:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI

Query:  NDGPSGC
        NDGP GC
Subjt:  NDGPSGC

AT4G16566.1 histidine triad nucleotide-binding 4 | e-value 4.8e-04 | identity 28.38%
Query:  IFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQE
        IF +I+     + ++  D+KV+AF+DI P A  H L+IPK     ++ L + +E ++ ++ H+L   + + +++
Subjt:  IFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQE


Sequences
CDS sequence
ATGGCATCTTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCTCCCACCATATTTGACAAAATCATTAATAAGGAAATTCCGTCTACGGTGGTCTTTGAGGA
TGACAAGGTCCTTGCTTTTAGGGACATAGCACCACAAGCTCCTACACATATTCTAATCATTCCAAAAGTGAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGC
ACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGGCTGGACGATGGCTTTAGGGTCGTAATTAACGATGGACCAAGTGGATGTTAG
mRNA sequence
ATGGCATCTTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCTCCCACCATATTTGACAAAATCATTAATAAGGAAATTCCGTCTACGGTGGTCTTTGAGGA
TGACAAGGTCCTTGCTTTTAGGGACATAGCACCACAAGCTCCTACACATATTCTAATCATTCCAAAAGTGAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGC
ACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGGCTGGACGATGGCTTTAGGGTCGTAATTAACGATGGACCAAGTGGATGTTAG
Protein sequence
MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGC
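
As a consistency check on the three sequences above, the CDS can be translated and compared with the listed protein. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and are not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# CDS and protein copied from the record above.
CDS = (
    "ATGGCATCTTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCTCCCACCATATTTGACAAAATCATTAATAAGGAAATTCCGTCTACGGTGGTCTTTGAGGA"
    "TGACAAGGTCCTTGCTTTTAGGGACATAGCACCACAAGCTCCTACACATATTCTAATCATTCCAAAAGTGAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGC"
    "ACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGGCTGGACGATGGCTTTAGGGTCGTAATTAACGATGGACCAAGTGGATGTTAG"
)
PROTEIN = (
    "MASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRV"
    "VINDGPSGC"
)

# Compare the translation (minus the terminal stop) with the listed protein.
translated = translate(CDS)
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 330 nt long, i.e. 110 codons: 109 residues plus the terminal TAG stop codon. The listed mRNA sequence is identical to the CDS, so the same check applies to it.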