CuGenDBv2

Sgr018318 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018318
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: CCHC-type domain-containing protein
Genome location: tig00153161:273781..283346
RNA-Seq Expression: Sgr018318
Synteny: Sgr018318
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
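
The genome location in this record follows a contig:start..end layout. As a minimal sketch, the string can be split into its parts and the covered span computed. The helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption; the record does not state which convention it uses.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into contig, start, and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00153161:273781..283346")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```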


Homology
GenBank top hits (e value, % identity, alignment)
MBA0550147.1 hypothetical protein [Gossypium lobatum] (e value: 8.2e-14; % identity: 28.97)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKS-RKNMWKQRPSPRDFSSENDDSVK
        K LRRGI ++  G    CW P KY +LP FC+ CG+VGH ++EC  +     +R  +   Y L L+ +  L   +S   N   +  +    +   +  +K
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKS-RKNMWKQRPSPRDFSSENDDSVK

Query:  DSSWSFGSGKSGDVDSEKVVLPAALEGAVDSMVATLVTSRDLLPGKLGPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVISKQVTALAKVETKVKGRK
                 K G+++ EK+    A E   D++      +   L  + G  M    A  + ++  L+  L D+ G  +     +K++            R+
Subjt:  DSSWSFGSGKSGDVDSEKVVLPAALEGAVDSMVATLVTSRDLLPGKLGPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVISKQVTALAKVETKVKGRK

Query:  DVKSWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETK
        D      KA  P AMK +CWNVRG+G+PR  R +  ++  NNPQ++FL +TK
Subjt:  DVKSWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETK

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 5.3e-13; % identity: 55.71)
Query:  MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG
        MK L RGIK+N+DGP+GGCW PI+Y RLPDF Y CG++ H + +C           SK  QYG WLRFQG
Subjt:  MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 1.0e-16; % identity: 52.63)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNL-KSRKNMWKQRPSPRDFSSEN
        K LRRGIKINIDGP+GGCW PI+Y RLPDFCY CG +GH  ++C+A Y A         +YG WLRF GS     K RK     R      SS N
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNL-KSRKNMWKQRPSPRDFSSEN

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 1.9e-10; % identity: 38.71)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKSRKNMWKQRPSPRDFSSENDDSVKD
        K LRRGIK+N+DGPIGG W PI+Y RLPDFCY CG                 S   K  QYG WLR+QG++K    +     ++P         ++S   
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKSRKNMWKQRPSPRDFSSENDDSVKD

Query:  SSWSFGSGKSGDVDSEKVVLPAAL
        S+   G+G  G V S     P A+
Subjt:  SSWSFGSGKSGDVDSEKVVLPAAL

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia] (e value: 3.2e-10; % identity: 38.02)
Query:  LVKLTIILQILAIREGLCLVERLDLSPILVETDSLQAVQLINGVEETSEEAGTWIMDSRRQMEARHLVPVVHVYRKANFVAHTIAMEAFRYPSSMLWVSD
        +VK  ++ +ILAIREGL L  RL +  ++VETDSL+A+ LI        EA +W+ D R        +   HV+R++N VA+ +  E        LW  D
Subjt:  LVKLTIILQILAIREGLCLVERLDLSPILVETDSLQAVQLINGVEETSEEAGTWIMDSRRQMEARHLVPVVHVYRKANFVAHTIAMEAFRYPSSMLWVSD

Query:  FPLWLTELVKGEEVDVVAHMA
        FP+WL  L +  + + VA MA
Subjt:  FPLWLTELVKGEEVDVVAHMA
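
For the XP_022132681.1 hit, the reported % identity is consistent with counting the letters in the alignment midline (identical positions; `+` marks a similar residue and a space a mismatch) and dividing by the aligned length. A minimal sketch, assuming that is how the value was derived (the page does not say) and using a hypothetical helper `percent_identity`:

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over the aligned length.

    The aligned length is taken from the query line, which would include
    any gap characters; this particular alignment has none.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline lines of the XP_022132681.1 alignment shown above.
query = "MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG"
midline = "MK L RGIK+N+DGP+GGCW PI+Y RLPDF Y CG++ H + +C           SK  QYG WLRFQG"

print(round(percent_identity(query, midline), 2))
```

With 39 identical positions over 70 aligned residues this gives 55.71, matching the value listed for that hit.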

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 2.6e-13; % identity: 55.71)
Query:  MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG
        MK L RGIK+N+DGP+GGCW PI+Y RLPDF Y CG++ H + +C           SK  QYG WLRFQG
Subjt:  MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 5.0e-17; % identity: 52.63)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNL-KSRKNMWKQRPSPRDFSSEN
        K LRRGIKINIDGP+GGCW PI+Y RLPDFCY CG +GH  ++C+A Y A         +YG WLRF GS     K RK     R      SS N
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNL-KSRKNMWKQRPSPRDFSSEN

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 9.2e-11; % identity: 38.71)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKSRKNMWKQRPSPRDFSSENDDSVKD
        K LRRGIK+N+DGPIGG W PI+Y RLPDFCY CG                 S   K  QYG WLR+QG++K    +     ++P         ++S   
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKSRKNMWKQRPSPRDFSSENDDSVKD

Query:  SSWSFGSGKSGDVDSEKVVLPAAL
        S+   G+G  G V S     P A+
Subjt:  SSWSFGSGKSGDVDSEKVVLPAAL

A0A7J8LCH9 CCHC-type domain-containing protein (e value: 4.0e-14; % identity: 28.97)
Query:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKS-RKNMWKQRPSPRDFSSENDDSVK
        K LRRGI ++  G    CW P KY +LP FC+ CG+VGH ++EC  +     +R  +   Y L L+ +  L   +S   N   +  +    +   +  +K
Subjt:  KLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKS-RKNMWKQRPSPRDFSSENDDSVK

Query:  DSSWSFGSGKSGDVDSEKVVLPAALEGAVDSMVATLVTSRDLLPGKLGPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVISKQVTALAKVETKVKGRK
                 K G+++ EK+    A E   D++      +   L  + G  M    A  + ++  L+  L D+ G  +     +K++            R+
Subjt:  DSSWSFGSGKSGDVDSEKVVLPAALEGAVDSMVATLVTSRDLLPGKLGPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVISKQVTALAKVETKVKGRK

Query:  DVKSWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETK
        D      KA  P AMK +CWNVRG+G+PR  R +  ++  NNPQ++FL +TK
Subjt:  DVKSWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETK

A0A803QQ69 Uncharacterized protein (e value: 4.1e-11; % identity: 24.81)
Query:  WNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQ--YGLWLRFQGSLKNLKSRKNMWKQRPSP--RDFSSENDDSVKDSSWSFGSGKSGDVD
        W P +Y RLP  C+ CG++GH    CE  +    S G+ C    YG W++ +     LK RK + ++R +   R+ +++      +   +F     G+V 
Subjt:  WNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQ--YGLWLRFQGSLKNLKSRKNMWKQRPSP--RDFSSENDDSVKDSSWSFGSGKSGDVD

Query:  SEKVVLPAALEGAVDSMVATLVTSRDLLPGKL----GPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVIS----KQVTALAKVETKVKG----RKDVK
            +       ++++ +          PGK     G   ++ S   +T+ G     +    G   G  V++    ++   +AK+  K+ G     +   
Subjt:  SEKVVLPAALEGAVDSMVATLVTSRDLLPGKL----GPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVIS----KQVTALAKVETKVKG----RKDVK

Query:  SWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETKEPTRLPVRARIVV
                P AM  L WNV+G+GNP   +A+C  +   +P++IFLSET+       R R+V+
Subjt:  SWKRKAPLPEAMKTLCWNVRGMGNPRAFRAICDVIHLNNPQIIFLSETKEPTRLPVRARIVV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAGCTGCTTAGGCGAGGTATTAAAATTAATATTGATGGGCCGATTGGGGGCTGTTGGAACCCAATAAAATATGGGCGGCTACCGGATTTTTGTTATCGTTGTGGAAA
GGTGGGACATCGTGTGAATGAGTGTGAAGCAGTTTATGGTGCTCCAAATTCGAGGGGATCAAAATGCTGTCAGTATGGTTTGTGGCTAAGGTTTCAAGGGAGCTTGAAGA
ATCTGAAAAGCAGAAAGAACATGTGGAAACAGAGGCCTTCTCCTAGGGATTTTTCTTCAGAGAATGATGATTCTGTGAAGGATTCTTCTTGGTCTTTTGGGTCGGGGAAG
TCAGGTGATGTTGATTCAGAGAAGGTGGTTCTGCCAGCAGCACTGGAAGGGGCCGTTGACAGTATGGTGGCGACATTGGTTACGAGTAGGGATTTATTGCCTGGAAAATT
AGGGCCTGATATGCTGATGATTTCTGCTCCAGTTGTTACTGAAAGAGGGGTGCTGAGTTGTCAACTTAATGATAATTGTGGTGATTCAAGAGGCCCTACTGTGATAAGTA
AGCAGGTGACTGCGTTGGCTAAAGTTGAGACAAAAGTTAAGGGAAGGAAAGATGTTAAAAGTTGGAAGAGGAAAGCACCCCTACCGGAAGCAATGAAAACTTTATGTTGG
AATGTTCGTGGGATGGGGAACCCAAGGGCATTCAGAGCAATTTGTGATGTCATTCATCTTAATAATCCCCAAATTATTTTCCTATCTGAAACGAAGGAGCCAACACGCCT
TCCAGTTCGAGCGAGGATAGTTGTGATCGTTCTTGTATGGCTCATGAAGGAATTCGCTAGAGTCCACCTCGTTCTAAAGAGTTTAAGATCAACACTGATGTCACTTGTAA
AACTCACAATCATTCTACAGATTTTGGCCATTCGAGAAGGGCTTTGTCTGGTAGAACGGTTGGATTTGTCTCCCATTTTGGTTGAAACCGACTCTCTTCAGGCTGTGCAG
CTGATTAATGGTGTGGAGGAAACGAGTGAGGAAGCTGGTACTTGGATTATGGACAGCAGAAGACAAATGGAAGCTCGTCATCTTGTTCCTGTTGTGCATGTCTATCGTAA
AGCAAACTTTGTAGCTCATACCATTGCTATGGAAGCTTTTAGATACCCCAGCTCAATGCTATGGGTTTCAGATTTCCCCCTTTGGCTCACTGAGTTAGTAAAGGGGGAAG
AAGTTGATGTTGTAGCCCACATGGCAGACTTGGATGCTAATGAATAA
mRNA sequence
ATGAAGCTGCTTAGGCGAGGTATTAAAATTAATATTGATGGGCCGATTGGGGGCTGTTGGAACCCAATAAAATATGGGCGGCTACCGGATTTTTGTTATCGTTGTGGAAA
GGTGGGACATCGTGTGAATGAGTGTGAAGCAGTTTATGGTGCTCCAAATTCGAGGGGATCAAAATGCTGTCAGTATGGTTTGTGGCTAAGGTTTCAAGGGAGCTTGAAGA
ATCTGAAAAGCAGAAAGAACATGTGGAAACAGAGGCCTTCTCCTAGGGATTTTTCTTCAGAGAATGATGATTCTGTGAAGGATTCTTCTTGGTCTTTTGGGTCGGGGAAG
TCAGGTGATGTTGATTCAGAGAAGGTGGTTCTGCCAGCAGCACTGGAAGGGGCCGTTGACAGTATGGTGGCGACATTGGTTACGAGTAGGGATTTATTGCCTGGAAAATT
AGGGCCTGATATGCTGATGATTTCTGCTCCAGTTGTTACTGAAAGAGGGGTGCTGAGTTGTCAACTTAATGATAATTGTGGTGATTCAAGAGGCCCTACTGTGATAAGTA
AGCAGGTGACTGCGTTGGCTAAAGTTGAGACAAAAGTTAAGGGAAGGAAAGATGTTAAAAGTTGGAAGAGGAAAGCACCCCTACCGGAAGCAATGAAAACTTTATGTTGG
AATGTTCGTGGGATGGGGAACCCAAGGGCATTCAGAGCAATTTGTGATGTCATTCATCTTAATAATCCCCAAATTATTTTCCTATCTGAAACGAAGGAGCCAACACGCCT
TCCAGTTCGAGCGAGGATAGTTGTGATCGTTCTTGTATGGCTCATGAAGGAATTCGCTAGAGTCCACCTCGTTCTAAAGAGTTTAAGATCAACACTGATGTCACTTGTAA
AACTCACAATCATTCTACAGATTTTGGCCATTCGAGAAGGGCTTTGTCTGGTAGAACGGTTGGATTTGTCTCCCATTTTGGTTGAAACCGACTCTCTTCAGGCTGTGCAG
CTGATTAATGGTGTGGAGGAAACGAGTGAGGAAGCTGGTACTTGGATTATGGACAGCAGAAGACAAATGGAAGCTCGTCATCTTGTTCCTGTTGTGCATGTCTATCGTAA
AGCAAACTTTGTAGCTCATACCATTGCTATGGAAGCTTTTAGATACCCCAGCTCAATGCTATGGGTTTCAGATTTCCCCCTTTGGCTCACTGAGTTAGTAAAGGGGGAAG
AAGTTGATGTTGTAGCCCACATGGCAGACTTGGATGCTAATGAATAA
Protein sequence
MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQGSLKNLKSRKNMWKQRPSPRDFSSENDDSVKDSSWSFGSGK
SGDVDSEKVVLPAALEGAVDSMVATLVTSRDLLPGKLGPDMLMISAPVVTERGVLSCQLNDNCGDSRGPTVISKQVTALAKVETKVKGRKDVKSWKRKAPLPEAMKTLCW
NVRGMGNPRAFRAICDVIHLNNPQIIFLSETKEPTRLPVRARIVVIVLVWLMKEFARVHLVLKSLRSTLMSLVKLTIILQILAIREGLCLVERLDLSPILVETDSLQAVQ
LINGVEETSEEAGTWIMDSRRQMEARHLVPVVHVYRKANFVAHTIAMEAFRYPSSMLWVSDFPLWLTELVKGEEVDVVAHMADLDANE
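
The InterPro entry IPR025836 listed for this gene names the zinc knuckle pattern CX2CX4HX4C. As a rough illustration, the pattern can be looked up in the protein sequence with a regular expression. This is a plain string match, not the signature-based method behind the InterPro annotation, and only the first 70 residues of the sequence are reproduced here.

```python
import re

# First 70 residues of the Sgr018318 protein sequence.
n_terminus = "MKLLRRGIKINIDGPIGGCWNPIKYGRLPDFCYRCGKVGHRVNECEAVYGAPNSRGSKCCQYGLWLRFQG"

# CX2CX4HX4C: Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
zinc_knuckle = re.compile(r"C.{2}C.{4}H.{4}C")

hit = zinc_knuckle.search(n_terminus)
if hit:
    # Report the 1-based start and end positions and the matched residues.
    print(hit.start() + 1, hit.end(), hit.group())
```

The match falls at residues 32 to 45 (CYRCGKVGHRVNEC), inside the region that aligns with the Momordica charantia and Gossypium hits.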