CuGenDBv2

CmoCh14G004700 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G004700
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: 14 kDa zinc-binding protein-like
Genome location: Cmo_Chr14:2319274..2325254
RNA-Seq Expression: CmoCh14G004700
Synteny: CmoCh14G004700
Gene Ontology terms:
  GO:0006790 - sulfur compound metabolic process (biological process)
  GO:0009150 - purine ribonucleotide metabolic process (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domains:
  IPR001310 - Histidine triad (HIT) protein
  IPR011146 - HIT-like domain
  IPR019808 - Histidine triad, conserved site
  IPR036265 - HIT-like superfamily
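
The genome location above is given as a sequence ID followed by a start..end range. As a minimal sketch, the helper below splits such a string and reports the span length. The function name parse_location is hypothetical, and the length assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    return match["seqid"], start, end, end - start + 1

seqid, start, end, length = parse_location("Cmo_Chr14:2319274..2325254")
print(seqid, start, end, length)
```

Under that coordinate assumption, the gene spans 5981 bp on Cmo_Chr14, which is longer than the mRNA listed under Sequences.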


Homology
GenBank top hits | e value | % identity
KAG6580845.1 Adenylylsulfatase HINT1, partial [Cucurbita argyrosperma subsp. sororia] | 4.4e-67 | 100
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

KAG7017601.1 Adenylylsulfatase HINT1 [Cucurbita argyrosperma subsp. argyrosperma] | 4.4e-67 | 100
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_022934452.1 14 kDa zinc-binding protein-like [Cucurbita moschata] | 4.4e-67 | 100
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_022971276.1 14 kDa zinc-binding protein-like isoform X2 [Cucurbita maxima] | 2.4e-65 | 96.15
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILG LLY AKLIA+QEGLDDGFR+
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_023521241.1 14 kDa zinc-binding protein-like [Cucurbita pepo subsp. pepo] | 9.9e-67 | 99.23
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEK+AALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

TrEMBL top hits | e value | % identity
A0A0A0LC24 HIT domain-containing protein | 2.0e-65 | 96.15
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSE+EAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERHTEILG LLY AKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A5A7UFH9 14 kDa zinc-binding protein | 4.5e-65 | 95.38
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSE+EAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDI+PQAPTHILIIP+VKDGLSGLSKAEERHTEILG LLY AKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1F2T2 14 kDa zinc-binding protein-like | 2.2e-67 | 100
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1I2W3 14 kDa zinc-binding protein-like isoform X2 | 1.2e-65 | 96.15
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILG LLY AKLIA+QEGLDDGFR+
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1J6X6 14 kDa zinc-binding protein | 1.2e-65 | 97.69
Query:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV
        MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAE+RHTEILGQLLY AKLIAKQEGLDDGFRV
Subjt:  MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV

Query:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        VINDGPSGCQSVYHLHVHLLGGRQMNW PG
Subjt:  VINDGPSGCQSVYHLHVHLLGGRQMNWPPG

SwissProt top hits | e value | % identity
P32084 Uncharacterized HIT-like protein Synpcc7942_1390 | 3.9e-34 | 59.82
Query:  TIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        TIF KII +EIP+ IV+EDD  LAFRD++PQAP HIL+IP  K  ++ L +A   H  +LG LL   K IA QEGL +G+R VIN GP+G Q+VYHLH+H
Subjt:  TIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH

Query:  LLGGRQMNWPPG
        LLGGR + WPPG
Subjt:  LLGGRQMNWPPG

P42855 14 kDa zinc-binding protein (Fragment) | 3.2e-52 | 81.25
Query:  TIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        TIF KII+KEIPST+V+EDDKVLAFRDI+PQ P HIL+IPKV+DGL+GL KAEERH +ILG+LLY AKL+AKQEGLD+GFR+VINDGP GCQSVYH+HVH
Subjt:  TIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH

Query:  LLGGRQMNWPPG
        L+GGRQMNWPPG
Subjt:  LLGGRQMNWPPG

P42856 14 kDa zinc-binding protein | 7.9e-59 | 84.38
Query:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI
        SSEKEAAL  +  DSPTIFDKII KEIPST+V+ED+KVLAFRDI+PQAPTHILIIPKVKDGL+GL+KAEERH EILG LLY AK++AKQEGL+DG+RVVI
Subjt:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI

Query:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG
        NDGPSGCQSVYH+HVHLLGGRQMNWPPG
Subjt:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG

Q8GUN2 Adenylylsulfatase HINT1 | 1.1e-60 | 83.59
Query:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI
        +SEKEAALAA PSDSPTIFDKII+KEIPST+VFEDDKVLAFRDI+PQ P HIL+IPKV+DGL+GLSKAEERH +ILG+LLY AKL+AKQEGL +GFR+VI
Subjt:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI

Query:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG
        NDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial | 2.3e-34 | 54.03
Query:  EAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGP
        +A  AA    +PTIF +I+++ +P+ I++ED + LAFRD++PQAP H L+IP  K  +  +S+AEE   ++LG LL  AK  AK EGL DG+R+VINDG 
Subjt:  EAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGP

Query:  SGCQSVYHLHVHLLGGRQMNWPPG
         G QSVYHLH+H+LGGRQ+ WPPG
Subjt:  SGCQSVYHLHVHLLGGRQMNWPPG

Arabidopsis top hits | e value | % identity
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 2 | 6.4e-48 | 66.94
Query:  EAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGP
        +AA +   + +PTIFDKII KEIPS IV+ED+ VLAFRDI+PQAP H+L+IPK++DGL+ L KAE RH E+LGQLL+A+K++A++EG+ DGFRVVIN+G 
Subjt:  EAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGP

Query:  SGCQSVYHLHVHLLGGRQMNWPPG
          CQSVYHLH+H+LGGRQM WPPG
Subjt:  SGCQSVYHLHVHLLGGRQMNWPPG

AT3G56490.1 HIS triad family protein 3 | 7.8e-62 | 83.59
Query:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI
        +SEKEAALAA PSDSPTIFDKII+KEIPST+VFEDDKVLAFRDI+PQ P HIL+IPKV+DGL+GLSKAEERH +ILG+LLY AKL+AKQEGL +GFR+VI
Subjt:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVI

Query:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG
        NDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG

AT4G16566.1 histidine triad nucleotide-binding 4 | 1.4e-05 | 28
Query:  IFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        IF +I+     + ++  D+KV+AF+DI P A  H L+IPK     ++ L + +E ++ ++  +L   + + +++      R   +  P    SV HLH+H
Subjt:  IFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
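
The % identity values in the hit tables can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style convention, in which the middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. The function name percent_identity is hypothetical. The example strings are the query and midline of the Cucurbita pepo hit (XP_023521241.1), concatenated across its two alignment blocks.

```python
def percent_identity(query, midline):
    """Percent identity over the aligned length, read from a BLAST-style midline.

    A column counts as identical when the midline repeats the query residue;
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q.isalpha())
    return 100.0 * identical / len(query)

# Query and midline of the XP_023521241.1 alignment, both blocks joined.
QUERY = (
    "MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV"
    "VINDGPSGCQSVYHLHVHLLGGRQMNWPPG"
)
MIDLINE = (
    "MASSEK+AALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRV"
    "VINDGPSGCQSVYHLHVHLLGGRQMNWPPG"
)

print(f"{percent_identity(QUERY, MIDLINE):.2f}")
```

The midline, not the Subjt line, is used because the Subjt lines on this page repeat the query residues even in columns where the midline marks a difference.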


Sequences
CDS sequence
ATGGCATCTTCCGAGAAGGAAGCGGCTCTTGCAGCTGTTCCCTCCGATTCCCCCACCATATTTGACAAAATCATTAATAAGGAAATTCCATCTACGATTGTCTTTGAGGA
TGATAAGGTTCTTGCTTTTAGGGACATATCTCCACAAGCTCCTACACATATTCTAATCATTCCAAAAGTTAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGC
ACACAGAGATTCTCGGTCAACTGCTTTACGCTGCCAAGCTCATTGCCAAACAAGAAGGGCTGGACGATGGCTTTAGGGTCGTAATTAATGATGGACCGAGTGGATGCCAA
TCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAA
mRNA sequence
TCGACTCACAGTTCTAAGCTCCCACATTTCCCAATCTCTTTCCATTTCCATGGCATCTTCCGAGAAGGAAGCGGCTCTTGCAGCTGTTCCCTCCGATTCCCCCACCATAT
TTGACAAAATCATTAATAAGGAAATTCCATCTACGATTGTCTTTGAGGATGATAAGGTTCTTGCTTTTAGGGACATATCTCCACAAGCTCCTACACATATTCTAATCATT
CCAAAAGTTAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGCACACAGAGATTCTCGGTCAACTGCTTTACGCTGCCAAGCTCATTGCCAAACAAGAAGGGCT
GGACGATGGCTTTAGGGTCGTAATTAATGATGGACCGAGTGGATGCCAATCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTT
AAGATGGACACTTTTTCAACGTTGAAATACATCTAATAATGTTTTCATTACCTGTTCGAAAACTTTATCAACTTTATCAACTACTGCTCGTCCCTTGTGGTCTCTAACAT
CCCATTGAATATGTAAGCAGGGATGTAGACATGAACTGGTGGGAGTGGACGACATGGTTGAGCAGCAGGGCATTGCAAAAGCTGATGCCTGAGGAAATATGTCTGCTCTT
TCCAGGTACCTTGTTCTTGATCTTCTCATAGAATCCCCTTATTTCTCATGTTCCAAGATGCCCATTCATTGGATTTTAGCTTTTGAAAAATGATGGTATCCATGCCTAGT
TTATTGAAAGGAAGGAGGTGTTCTTTGGCCCATCATGAGGGAATTAGATAGAGTTTTGTTGCGAAAATATGACTGGTGAGGAGAATTAGGTAGCGTTTTGGTGTAAATAC
ATTAAGACAACAGCTAAATGTCAAGGAACTTAAGAAGGTACGATTAAAGATGTCACAATGAATCACAAAAGCTAAGGATGAGCTAAGGATGTCTAGAAAGATGTGGTGGT
GTAATGAGTGAAATTTTTTATGAAGAGTTGAGTCAAAGGACTATTAAGAAAACCATGTGCGTGGCTCCAG
Protein sequence
MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQ
SVYHLHVHLLGGRQMNWPPG
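
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and stops at the first stop codon. The names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein sequences copied from this record, line breaks removed.
CDS = (
    "ATGGCATCTTCCGAGAAGGAAGCGGCTCTTGCAGCTGTTCCCTCCGATTCCCCCACCATATTTGACAAAATCATTAATAAGGAAATTCCATCTACGATTGTCTTTGAGGA"
    "TGATAAGGTTCTTGCTTTTAGGGACATATCTCCACAAGCTCCTACACATATTCTAATCATTCCAAAAGTTAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGGC"
    "ACACAGAGATTCTCGGTCAACTGCTTTACGCTGCCAAGCTCATTGCCAAACAAGAAGGGCTGGACGATGGCTTTAGGGTCGTAATTAATGATGGACCGAGTGGATGCCAA"
    "TCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAA"
)
PROTEIN = (
    "MASSEKEAALAAVPSDSPTIFDKIINKEIPSTIVFEDDKVLAFRDISPQAPTHILIIPKVKDGLSGLSKAEERHTEILGQLLYAAKLIAKQEGLDDGFRVVINDGPSGCQ"
    "SVYHLHVHLLGGRQMNWPPG"
)

print(translate(CDS) == PROTEIN)
```

The same helper could be applied to the mRNA sequence once the position of the ATG start codon is located, since the mRNA carries untranslated sequence on both sides of the CDS.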