; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028671 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028671
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0587 protein C1orf123 homolog
Genome locationtig00153204:3344138..3345966
RNA-Seq ExpressionSgr028671
SyntenySgr028671
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]8.8e-4270.15Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      D       GRGQPLTQETS+SGKSSPLMLFDCRGYEPV F+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLSGGE+AEYDE GECPVMISNL+ATF+ VK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

XP_022132352.1 UPF0587 protein C1orf123 [Momordica charantia]2.2e-4069.4Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNETV L  GKGTTNLVQK      D       GRG+PLTQETS+SGKSSPLMLFDCRGYEP+DF+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLS GEFAEYDE GECPVMIS LKATFDLVK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]8.8e-4270.15Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      D       GRGQPLTQETS+SGKSSPLMLFDCRGYEPV F+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLSGGE+AEYDE GECPVMISNL+ATF+ VK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]1.3e-4069.4Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      +       GRGQPLTQETS+ GKSSPLMLFDCRGYEP+DFVFGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLS GEFAEYDE GECPVMIS L+ATF+LVK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

XP_038884334.1 CXXC motif containing zinc binding protein isoform X2 [Benincasa hispida]1.3e-4069.4Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      +       GRGQPLTQETS+ GKSSPLMLFDCRGYEP+DFVFGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLS GEFAEYDE GECPVMIS L+ATF+LVK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

TrEMBL top hitse value%identityAlignment
A0A1S3BAF1 UPF0587 protein C1orf123 homolog1.9e-3764.18Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  L+ET+PLQAGKGTTNLVQK      +       GRG+PLTQE S+SG  SPLMLFDCRGYEP+ FVFGPGWK ES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDL+GGEFAEYDE GECPVMISNL A F+L+K
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

A0A5A7V299 UPF0587 protein C1orf123-like protein1.9e-3764.18Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  L+ET+PLQAGKGTTNLVQK      +       GRG+PLTQE S+SG  SPLMLFDCRGYEP+ FVFGPGWK ES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDL+GGEFAEYDE GECPVMISNL A F+L+K
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

A0A6J1BTL8 UPF0587 protein C1orf1231.1e-4069.4Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNETV L  GKGTTNLVQK      D       GRG+PLTQETS+SGKSSPLMLFDCRGYEP+DF+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLS GEFAEYDE GECPVMIS LKATFDLVK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog4.3e-4270.15Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      D       GRGQPLTQETS+SGKSSPLMLFDCRGYEPV F+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLSGGE+AEYDE GECPVMISNL+ATF+ VK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog4.3e-4270.15Show/hide
Query:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        ++  C  LNET+PLQAGKGTTNLVQK      D       GRGQPLTQETS+SGKSSPLMLFDCRGYEPV F+FGPGWKAES            IEGTKF
Subjt:  ERNVC-NLNETVPLQAGKGTTNLVQKGWNRYHDS------GRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        EDIDLSGGE+AEYDE GECPVMISNL+ATF+ VK
Subjt:  EDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

SwissProt top hitse value%identityAlignment
A1Z9A2 UPF0587 protein CG46462.1e-0632.53Show/hide
Query:  DSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        DSG    +++F+CRG EPV+F    GW+  S++            G +FE++DLS  ++ EYD+     V I    + F  +K
Subjt:  DSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

Q290L7 UPF0587 protein GA183267.8e-0938.55Show/hide
Query:  DSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGEFAEYDENGECPVMISNLKATFDLVK
        DSGK   +++FDCRG EPVDF    GWK  SS+            G  FED+DLS  ++ EYD+     V +    + F  +K
Subjt:  DSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGEFAEYDENGECPVMISNLKATFDLVK

Q32P66 CXXC motif containing zinc binding protein4.0e-0528.1Show/hide
Query:  LNETVPLQAGKGTTNLVQKGWNRYHDSG----RGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGE
        L ++V L+ G+G+ ++VQK      ++          +    D+ K   ++ F+CRG EPVDF    G+ AE  +            GT F DI+L   +
Subjt:  LNETVPLQAGKGTTNLVQKGWNRYHDSG----RGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGGE

Query:  FAEYDENGECPVMISNLKATF
        + +YDE  +  V I  +   F
Subjt:  FAEYDENGECPVMISNLKATF

Q3B8G0 CXXC motif containing zinc binding protein4.8e-0629.27Show/hide
Query:  LNETVPLQAGKGTTNLVQ--KGWNRYHD----SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSG
        L ++VPL+ G+G+ ++VQ  K  +R +     +    P   E S++ K+  ++ F+CRG EP+DF    G+ AE ++            GT F +I+L  
Subjt:  LNETVPLQAGKGTTNLVQ--KGWNRYHD----SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSG

Query:  GEFAEYDENGECPVMISNLKATF
         ++ +YDE  +  V I  ++  F
Subjt:  GEFAEYDENGECPVMISNLKATF

Q9NWV4 CXXC motif containing zinc binding protein3.1e-0529.27Show/hide
Query:  LNETVPLQAGKGTTNLVQKGWNRYHD------SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSG
        L ++V L+ G+G+ ++VQK      +      S   +P   E +++ K+  ++ F+CRG EPVDF    G+ AE  +            GT F DI+L  
Subjt:  LNETVPLQAGKGTTNLVQKGWNRYHD------SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSG

Query:  GEFAEYDENGECPVMISNLKATF
         ++ +YDE  +  V I  +   F
Subjt:  GEFAEYDENGECPVMISNLKATF

Arabidopsis top hitse value%identityAlignment
AT4G32930.1 unknown protein5.4e-2948.89Show/hide
Query:  PERNVCNLNETVPLQAGKGTTNLVQK------GWNRYHDSGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF
        P+     LNET     G+GT +LVQK        N     G+G+PLT E S++G+ +PLM+FDCRGYEP+DF FG  WKA++              GTKF
Subjt:  PERNVCNLNETVPLQAGKGTTNLVQK------GWNRYHDSGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKF

Query:  EDIDLSGG-EFAEYDENGECPVMISNLKATFDLVK
        ++IDLS G EF EYDE GECPVMISN +A+F + K
Subjt:  EDIDLSGG-EFAEYDENGECPVMISNLKATFDLVK

AT4G32930.2 unknown protein5.4e-2946.85Show/hide
Query:  PERNVCNLNETVPLQAGKGTTNLVQKGWNRYHD--------------SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSF
        P+     LNET     G+GT +LVQK  N   D               G+G+PLT E S++G+ +PLM+FDCRGYEP+DF FG  WKA++          
Subjt:  PERNVCNLNETVPLQAGKGTTNLVQKGWNRYHD--------------SGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSF

Query:  PQIEGTKFEDIDLSGG-EFAEYDENGECPVMISNLKATFDLVK
            GTKF++IDLS G EF EYDE GECPVMISN +A+F + K
Subjt:  PQIEGTKFEDIDLSGG-EFAEYDENGECPVMISNLKATFDLVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGGAGCTGTGGAGAGGTGAGCCAGAAAGAAACGTGTGTAACTTGAATGAAACTGTTCCTCTCCAAGCGGGAAAAGGAACTACTAATCTCGTTCAAAAGGGATGGAA
CCGTTACCATGATTCCGGGCGAGGTCAACCATTGACCCAGGAAACAAGTGACTCAGGGAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGTTATGAGCCTGTGGACT
TCGTATTTGGACCTGGATGGAAAGCAGAATCTAGTCAAGTTTTTATGCAGCTGCCTTCTTTTCCGCAGATTGAGGGGACTAAATTTGAGGATATTGACTTGTCTGGAGGT
GAGTTTGCAGAGTATGATGAGAATGGAGAATGCCCCGTCATGATTTCCAATCTAAAAGCCACATTTGACTTGGTAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGGAGCTGTGGAGAGGTGAGCCAGAAAGAAACGTGTGTAACTTGAATGAAACTGTTCCTCTCCAAGCGGGAAAAGGAACTACTAATCTCGTTCAAAAGGGATGGAA
CCGTTACCATGATTCCGGGCGAGGTCAACCATTGACCCAGGAAACAAGTGACTCAGGGAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGTTATGAGCCTGTGGACT
TCGTATTTGGACCTGGATGGAAAGCAGAATCTAGTCAAGTTTTTATGCAGCTGCCTTCTTTTCCGCAGATTGAGGGGACTAAATTTGAGGATATTGACTTGTCTGGAGGT
GAGTTTGCAGAGTATGATGAGAATGGAGAATGCCCCGTCATGATTTCCAATCTAAAAGCCACATTTGACTTGGTAAAGTAG
Protein sequenceShow/hide protein sequence
MRELWRGEPERNVCNLNETVPLQAGKGTTNLVQKGWNRYHDSGRGQPLTQETSDSGKSSPLMLFDCRGYEPVDFVFGPGWKAESSQVFMQLPSFPQIEGTKFEDIDLSGG
EFAEYDENGECPVMISNLKATFDLVK