CuGenDBv2

Sgr028438 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028438
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 14 kDa zinc-binding protein-like
Genome location: tig00153145:2203890..2212508
RNA-Seq Expression: Sgr028438
Synteny: Sgr028438
Gene Ontology terms:
  GO:0006790 - sulfur compound metabolic process (biological process)
  GO:0009150 - purine ribonucleotide metabolic process (biological process)
  GO:0005777 - peroxisome (cellular component)
  GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domains:
  IPR001310 - Histidine triad (HIT) protein
  IPR011146 - HIT-like domain
  IPR036265 - HIT-like superfamily


Homology

GenBank top hits (e value, % identity, alignment)
XP_004136603.2 14 kDa zinc-binding protein [Cucumis sativus] (e value: 3.5e-56; % identity: 92.37)
Query:  KIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSK
        +IRA+MVN DIK RTRLSVLSSH + SLS SMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSK
Subjt:  KIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSK

Query:  AEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        AEERH EILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  AEERHMEILGHLLYTAKLIAKQEGLDDGFRV

XP_008443161.1 PREDICTED: 14 kDa zinc-binding protein [Cucumis melo] (e value: 1.1e-54; % identity: 93.65)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVN DIK RTRLSVLSSHI+ SLS SMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
         EILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

XP_022950942.1 14 kDa zinc-binding protein-like [Cucurbita moschata] (e value: 7.8e-56; % identity: 94.44)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNGDIK  TRLSVLSSHIT SLS SMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
        MEILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

XP_022971276.1 14 kDa zinc-binding protein-like isoform X2 [Cucurbita maxima] (e value: 1.4e-57; % identity: 91.04)
Query:  KANKIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG
        ++ KIRA+MVNGDIK  TRLSVLSSHIT SLS SMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSG
Subjt:  KANKIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG

Query:  LSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        LSKAEERH EILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  LSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV

XP_023539987.1 14 kDa zinc-binding protein-like [Cucurbita pepo subsp. pepo] (e value: 3.9e-55; % identity: 93.65)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNGDIK  TRLSVLSSHIT SLS SMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
         EILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC24 HIT domain-containing protein (e value: 7.1e-55; % identity: 93.65)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVN DIK RTRLSVLSSH + SLS SMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
         EILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

A0A1S3B859 14 kDa zinc-binding protein (e value: 5.5e-55; % identity: 93.65)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVN DIK RTRLSVLSSHI+ SLS SMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
         EILGHLLYTAKLIAKQEGLDDGFRV
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

A0A6J1DLM3 14 kDa zinc-binding protein (e value: 2.3e-53; % identity: 90.48)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNGDI HRTRLSVLSSHI    S SMASSEK+AALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHIL+IPKVKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
          ILGHLLYTAKL+AKQEGLDDGFRV
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

A0A6J1GGA0 14 kDa zinc-binding protein-like (e value: 3.8e-56; % identity: 94.44)
Query:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNGDIK  TRLSVLSSHIT SLS SMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  MEILGHLLYTAKLIAKQEGLDDGFRV
        MEILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  MEILGHLLYTAKLIAKQEGLDDGFRV

A0A6J1I2W3 14 kDa zinc-binding protein-like isoform X2 (e value: 6.9e-58; % identity: 91.04)
Query:  KANKIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG
        ++ KIRA+MVNGDIK  TRLSVLSSHIT SLS SMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSG
Subjt:  KANKIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG

Query:  LSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        LSKAEERH EILGHLLYTAKLIA+QEGLDDGFR+
Subjt:  LSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV

SwissProt top hits (e value, % identity, alignment)
P32084 Uncharacterized HIT-like protein Synpcc7942_1390 (e value: 9.4e-20; % identity: 58.02)
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFR
        TIF KII +EIP+ +V+EDD  LAFRD+APQAP HIL+IP  K  ++ L +A   H  +LGHLL T K IA QEGL +G+R
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFR

P42855 14 kDa zinc-binding protein (Fragment) (e value: 7.4e-33; % identity: 80.49)
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        TIF KII+KEIPSTVV+EDDKVLAFRDI PQ P HIL+IPKV+DGL+GL KAEERH++ILG LLYTAKL+AKQEGLD+GFR+
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV

P42856 14 kDa zinc-binding protein (e value: 2.6e-38; % identity: 81.63)
Query:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        SSEKEAAL  +  DSPTIFDKII KEIPSTVV+ED+KVLAFRDI PQAPTHILIIPKVKDGL+GL+KAEERH+EILG+LLY AK++AKQEGL+DG+RV
Subjt:  SSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV

Q8GUN2 Adenylylsulfatase HINT1 (e value: 2.1e-43; % identity: 76.72)
Query:  RLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYT
        R+S+LSSH   S ++++ +SEKEAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH++ILG LLYT
Subjt:  RLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYT

Query:  AKLIAKQEGLDDGFRV
        AKL+AKQEGL +GFR+
Subjt:  AKLIAKQEGLDDGFRV

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial (e value: 1.0e-18; % identity: 47.87)
Query:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV
        +A  AA    +PTIF +I+++ +P+ +++ED + LAFRD+APQAP H L+IP  K  +  +S+AEE   ++LGHLL  AK  AK EGL DG+R+
Subjt:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRV

Arabidopsis top hits (e value, % identity, alignment)
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 2 (e value: 4.6e-30; % identity: 58.72)
Query:  LSLSTSMASSEKEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQ
        L  ST  A +E+ AA AA     + +PTIFDKII KEIPS +V+ED+ VLAFRDI PQAP H+L+IPK++DGL+ L KAE RH+E+LG LL+ +K++A++
Subjt:  LSLSTSMASSEKEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQ

Query:  EGLDDGFRV
        EG+ DGFRV
Subjt:  EGLDDGFRV

AT3G56490.1 HIS triad family protein 3 (e value: 1.5e-44; % identity: 76.72)
Query:  RLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYT
        R+S+LSSH   S ++++ +SEKEAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH++ILG LLYT
Subjt:  RLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYT

Query:  AKLIAKQEGLDDGFRV
        AKL+AKQEGL +GFR+
Subjt:  AKLIAKQEGLDDGFRV
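
The % identity values listed for the hits above appear to be consistent with the usual BLAST convention: the number of identical positions (letters on the middle line of an alignment) divided by the alignment length, gaps included. The following is a minimal Python sketch of that calculation, not the database's own code. The function name `percent_identity` is hypothetical, and the example strings are copied from the P32084 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style alignment.

    Assumes the midline shows a letter at each identical position,
    '+' at conservative substitutions and a space elsewhere.
    For alignments printed over several rows, concatenate the rows first.
    """
    if not query:
        raise ValueError("empty alignment")
    # Only letters on the midline count as identical positions.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The alignment length is the length of the aligned query row, gaps included.
    return 100.0 * identical / len(query)


# Query and middle line of the P32084 hit, copied from the record above.
query = "TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFR"
midline = "TIF KII +EIP+ +V+EDD  LAFRD+APQAP HIL+IP  K  ++ L +A   H  +LGHLL T K IA QEGL +G+R"

print(round(percent_identity(query, midline), 2))
```

Because only the letters on the middle line are counted, the result does not depend on how leading or trailing whitespace in that line survived extraction.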


Sequences
CDS sequence
ATGGTAAATGGGGCTTTTAATCTGTCGTGCAGAGAGACGCGGAGTTGGAGGAAGAAGATCAAAGCCAACAAAATTAGGGCTCTGATGGTCAATGGAGATATAAAACACAG
GACTCGACTCTCAGTTCTAAGCTCCCACATCACCCTCTCCCTTTCCACTTCCATGGCGTCTTCTGAGAAGGAAGCGGCTCTTGCAGCCGTTCCCTCTGATTCCCCCACCA
TATTCGACAAAATCATTAATAAGGAAATTCCATCTACGGTGGTCTTTGAGGATGACAAGGTCCTTGCTTTTAGGGACATAGCACCACAAGCTCCTACGCATATTCTAATC
ATTCCAAAAGTTAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGACACATGGAGATTCTTGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAGCAAGAAGG
GCTGGACGATGGCTTTAGGGTCAATCGGTTTATCATCTTCACGTTCACCTTCTGGGGGGACGACAAATGA
mRNA sequence
ATGGTAAATGGGGCTTTTAATCTGTCGTGCAGAGAGACGCGGAGTTGGAGGAAGAAGATCAAAGCCAACAAAATTAGGGCTCTGATGGTCAATGGAGATATAAAACACAG
GACTCGACTCTCAGTTCTAAGCTCCCACATCACCCTCTCCCTTTCCACTTCCATGGCGTCTTCTGAGAAGGAAGCGGCTCTTGCAGCCGTTCCCTCTGATTCCCCCACCA
TATTCGACAAAATCATTAATAAGGAAATTCCATCTACGGTGGTCTTTGAGGATGACAAGGTCCTTGCTTTTAGGGACATAGCACCACAAGCTCCTACGCATATTCTAATC
ATTCCAAAAGTTAAGGATGGGTTATCTGGATTATCTAAGGCTGAGGAGAGACACATGGAGATTCTTGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAGCAAGAAGG
GCTGGACGATGGCTTTAGGGTCAATCGGTTTATCATCTTCACGTTCACCTTCTGGGGGGACGACAAATGA
Protein sequence
MVNGAFNLSCRETRSWRKKIKANKIRALMVNGDIKHRTRLSVLSSHITLSLSTSMASSEKEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILI
IPKVKDGLSGLSKAEERHMEILGHLLYTAKLIAKQEGLDDGFRVNRFIIFTFTFWGDDK
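
The CDS and protein sequences of this record can be cross-checked by translating the CDS codon by codon. The following is a minimal Python sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGTAAATGGGGCTTTTAATCTG"))
print(translate("GACGACAAATGA"))
```

Passing the full CDS (line breaks are ignored) and comparing the result with the listed protein sequence gives a quick consistency check for the record.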