; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016865 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016865
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCCHC-type domain-containing protein
Genome locationtig00153014:503652..505253
RNA-Seq ExpressionSgr016865
SyntenySgr016865
Gene Ontology termsNA
InterPro domainsIPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PPD83812.1 hypothetical protein GOBAR_DD19246 [Gossypium barbadense]1.3e-1430.05Show/hide
Query:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSH---SETAEEDLQYGVWLRGTA-NQRGNYG---------
        W + +R+++ I V+KPLRR  ++K +     +    + YE+L DFC+ C  +GH LK    +         +LQ+G W+R  A NQ  + G         
Subjt:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSH---SETAEEDLQYGVWLRGTA-NQRGNYG---------

Query:  ---------GRKGG-----RE------SNKNRGHIDTIIKDDNGS-WQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNR
                 G+ GG     RE         ++ HID++IK DNG   +FTG +GH + + R   W +++++    N  WI+ G FN I+++S+K  G  +
Subjt:  ---------GRKGG-----RE------SNKNRGHIDTIIKDDNGS-WQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNR

Query:  RPSQMKAFRDVID
          + M  F D++D
Subjt:  RPSQMKAFRDVID

PPD84469.1 hypothetical protein GOBAR_DD18598 [Gossypium barbadense]3.7e-1427.01Show/hide
Query:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR---GTANQRGNYG-------
        W + LR++I I+++ P+RR  ++K +G    +    + YE+L  FCY C  +GH +K  +S    +     +LQYG WLR     +NQ    G       
Subjt:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR---GTANQRGNYG-------

Query:  -----------GRK-------GGRESNKNRGHIDTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPS
                   G K       G  E  +N    +    D++ S++FTG +G++  ++R  +W ++ ++       WI+ G FN I+D+++K  G  +  +
Subjt:  -----------GRK-------GGRESNKNRGHIDTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPS

Query:  QMKAFRDVIDD
         +  FR V+D+
Subjt:  QMKAFRDVIDD

PPS08715.1 hypothetical protein GOBAR_AA11939 [Gossypium barbadense]6.3e-1424.05Show/hide
Query:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR--------------------
        W + +R+++ ID++KPLRR  ++K+      +T   I YE+LLDFCY C  +GH  K+ + + E A   + + QYG W+R                    
Subjt:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR--------------------

Query:  ---------------------GTANQRGNYGGRKGGR------------------ESNKNRG-------------------HIDTIIKDDNG-SWQFTGI
                             G + Q+GN  G K  R                  +  ++RG                   HID++I  +N    +F   
Subjt:  ---------------------GTANQRGNYGGRKGGR------------------ESNKNRG-------------------HIDTIIKDDNG-SWQFTGI

Query:  HGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDD
        +G++  ++R  +W ++ R+  + N  WI+ G FN ++D+++K  G  +    M+ F  +ID+
Subjt:  HGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDD

XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris]5.5e-1832.51Show/hide
Query:  DGVICWGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKD-WQSHSETAEEDLQYGVWLR-----GTANQRGNYGGR
        DGV  W ++ RVRI +D+ KPLRR   I +         + + YE+L  FCYAC  +GH+ +D   +  E   E  Q+G WLR     G +++R    GR
Subjt:  DGVICWGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKD-WQSHSETAEEDLQYGVWLR-----GTANQRGNYGGR

Query:  -----KGGRES------NKNRGHIDTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVI
              G +E+      + ++ HI   +      W+F G++G      +  TW+LI  L    +   +L G FNEI+   +K  G++R    M+ FR+VI
Subjt:  -----KGGRES------NKNRGHIDTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVI

Query:  DDC
        D C
Subjt:  DDC

XP_030923374.1 uncharacterized protein LOC115950293 [Quercus lobata]1.1e-1350Show/hide
Query:  HIDTII-KDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDDC
        HIDTII K    +W+FTGI+G     R+VETW+L++ L    NL WI  G FNEI+   +K+ G+ RR S+M +FRDV+D+C
Subjt:  HIDTII-KDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDDC

TrEMBL top hitse value%identityAlignment
A0A2N9E949 CCHC-type domain-containing protein1.0e-1728.1Show/hide
Query:  DDGVICWGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQ---SHSETAE-EDLQYGVWLRGTANQR-------
        +DG I WG+ +RV++ IDV+ PL R   +K+     E  W+ + YEKL  FCY C  LGH  ++ +    H ++ +    +YG WLR T  +R       
Subjt:  DDGVICWGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQ---SHSETAE-EDLQYGVWLRGTANQR-------

Query:  -GNYGGRKGGRESNKNRG----------------------------HIDTIIKDDNGS-----------------WQFTGIHGHSSGDRRVETWKLIERL
         G +  R G    N+  G                            H+  ++K++N S                 W+ TG +G      R+++WKL++ L
Subjt:  -GNYGGRKGGRESNKNRG----------------------------HIDTIIKDDNGS-----------------WQFTGIHGHSSGDRRVETWKLIERL

Query:  IHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVID
           + L W++ G FNEI+ +S+K+  + R  SQM +FR+ ++
Subjt:  IHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVID

A0A2P5XZD0 CCHC-type domain-containing protein3.1e-1424.05Show/hide
Query:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR--------------------
        W + +R+++ ID++KPLRR  ++K+      +T   I YE+LLDFCY C  +GH  K+ + + E A   + + QYG W+R                    
Subjt:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETA---EEDLQYGVWLR--------------------

Query:  ---------------------GTANQRGNYGGRKGGR------------------ESNKNRG-------------------HIDTIIKDDNG-SWQFTGI
                             G + Q+GN  G K  R                  +  ++RG                   HID++I  +N    +F   
Subjt:  ---------------------GTANQRGNYGGRKGGR------------------ESNKNRG-------------------HIDTIIKDDNG-SWQFTGI

Query:  HGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDD
        +G++  ++R  +W ++ R+  + N  WI+ G FN ++D+++K  G  +    M+ F  +ID+
Subjt:  HGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDD

A0A5C7GW09 CCHC-type domain-containing protein2.6e-1338.1Show/hide
Query:  LRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHS---ETAEEDLQYGVWLRGTANQRGNYGGRKGGRESNKNRGH
        LRVR+ ++++KPLRR + I V+G   E   + I YE+LLDFC+ C  LGH  KD        ET +EDL +G W+R     +G +GGR+   +S  N+  
Subjt:  LRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHS---ETAEEDLQYGVWLRGTANQRGNYGGRKGGRESNKNRGH

Query:  IDTIIKDDNGSWQFTGIHGHSSGDRR
            +  + GSW+     G S  D R
Subjt:  IDTIIKDDNGSWQFTGIHGHSSGDRR

A0A6J1DU55 uncharacterized protein LOC1110231353.4e-1345.26Show/hide
Query:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETAEED----LQYGVWLRGTANQRGNYGGRKG
        WG +LR+R+ ID+ KPLRR I I + G M    WIPI YE+L DFCY C  +GH   D  +    A++D     +YG WLR   ++ G   GRKG
Subjt:  WGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETAEED----LQYGVWLRGTANQRGNYGGRKG

A0A803LLP6 Uncharacterized protein2.0e-1325.53Show/hide
Query:  KTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETAEEDLQYGVWLRGTA-NQRGNYGGRKGGRESNKNRGH
        K++R+R+ +DV +PL + + +K+ G + E  +  + YEK   FCY C  +GH +KD   H E     + +G W++ +    R    G  GG     + G 
Subjt:  KTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETAEEDLQYGVWLRGTA-NQRGNYGGRKGGRESNKNRGH

Query:  I-------DTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDDCR
        +            +++  W+F  I+GH   + + +T +L+E L   +  SW++ G  N ++   +K  G           R  +D C+
Subjt:  I-------DTIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDDCR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATGGGGTAATATGCTGGGGGAAAACCTTGAGGGTTCGAATTACTATTGACGTGAACAAGCCGTTAAGGAGGGCCATCATGATCAAAGTTATTGGCTCTATGGC
TGAAGATACTTGGATCCCCATTACTTACGAGAAATTATTGGACTTTTGTTATGCATGTGACTGGCTGGGGCATGTGCTGAAGGATTGGCAATCACACTCGGAGACGGCAG
AAGAAGATCTTCAATATGGCGTGTGGCTAAGAGGAACGGCTAATCAAAGGGGAAATTATGGAGGAAGAAAAGGAGGAAGAGAGAGCAACAAAAATAGAGGACACATCGAC
ACGATTATAAAAGACGACAATGGGAGTTGGCAATTCACAGGTATTCATGGGCATTCTAGTGGAGATAGAAGGGTGGAAACTTGGAAACTTATTGAACGACTCATCCATGT
ATCTAATCTCTCATGGATCCTCTGGGGACATTTCAATGAGATCATAGACGATTCCAAAAAGGTTAGTGGATCAAACAGAAGGCCAAGCCAAATGAAGGCTTTTAGAGATG
TGATTGATGACTGTAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATGGGGTAATATGCTGGGGGAAAACCTTGAGGGTTCGAATTACTATTGACGTGAACAAGCCGTTAAGGAGGGCCATCATGATCAAAGTTATTGGCTCTATGGC
TGAAGATACTTGGATCCCCATTACTTACGAGAAATTATTGGACTTTTGTTATGCATGTGACTGGCTGGGGCATGTGCTGAAGGATTGGCAATCACACTCGGAGACGGCAG
AAGAAGATCTTCAATATGGCGTGTGGCTAAGAGGAACGGCTAATCAAAGGGGAAATTATGGAGGAAGAAAAGGAGGAAGAGAGAGCAACAAAAATAGAGGACACATCGAC
ACGATTATAAAAGACGACAATGGGAGTTGGCAATTCACAGGTATTCATGGGCATTCTAGTGGAGATAGAAGGGTGGAAACTTGGAAACTTATTGAACGACTCATCCATGT
ATCTAATCTCTCATGGATCCTCTGGGGACATTTCAATGAGATCATAGACGATTCCAAAAAGGTTAGTGGATCAAACAGAAGGCCAAGCCAAATGAAGGCTTTTAGAGATG
TGATTGATGACTGTAGATGA
Protein sequenceShow/hide protein sequence
MDDGVICWGKTLRVRITIDVNKPLRRAIMIKVIGSMAEDTWIPITYEKLLDFCYACDWLGHVLKDWQSHSETAEEDLQYGVWLRGTANQRGNYGGRKGGRESNKNRGHID
TIIKDDNGSWQFTGIHGHSSGDRRVETWKLIERLIHVSNLSWILWGHFNEIIDDSKKVSGSNRRPSQMKAFRDVIDDCR