CuGenDBv2

Sgr012668 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr012668
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: RNase H domain-containing protein
Genome location: tig00153488:205601..210771
RNA-Seq Expression: Sgr012668
Synteny: Sgr012668
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
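
The genome location field above uses a `contig:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split 'contig:start..end' into (contig, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return contig, start, end, end - start + 1

contig, start, end, length = parse_location("tig00153488:205601..210771")
print(contig, start, end, length)  # tig00153488 205601 210771 5171
```

Under that assumption the gene spans 5171 bp on contig tig00153488.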


Homology
GenBank top hits | e value | %identity
TXG47194.1 hypothetical protein EZV62_026488 [Acer yangbiense] | 1.6e-07 | 27.96
Query:  GVEHGEQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATD---SCWLPSPSGWFKVNTNAAYDKS--------VLRS-------SMAKQVRA
        G + GE +   N    W   +  E  +         S AP+  +  +D     WL  P G FK+NT+AA +          V+R+       S  K  R 
Subjt:  GVEHGEQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATD---SCWLPSPSGWFKVNTNAAYDKS--------VLRS-------SMAKQVRA

Query:  TLNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAF
          +P++AE LA+ EGL  A   GF   V+E+D++   + +    ++ ++ G    D+         FS R+  R  N++AH LA LA S+   FV S   
Subjt:  TLNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAF

Query:  PSWLSALAEGE
        P  +  L  G+
Subjt:  PSWLSALAEGE

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 6.9e-11 | 29.82
Query:  HGEQIDDVNSKATWILTY-----QLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRAT-------LN
        HG+Q+  V  K  W+  +     Q +MS    +    S+  P++        W PS S   K+NT+AA   +      ++R S    V AT       L+
Subjt:  HGEQIDDVNSKATWILTY-----QLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRAT-------LN

Query:  PLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSS-AESFVSSRAFPS
        PLLAE+  + EGL  AA   F +L VE+DS+ A ++++ E     +   W  ++Q +T      S  ++SR CN+ AH LA    +S + ++     FP+
Subjt:  PLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSS-AESFVSSRAFPS

Query:  WLSALAEGEAAAQ--HVA
        WL  L + +  +   HVA
Subjt:  WLSALAEGEAAAQ--HVA

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia] | 2.9e-09 | 30.65
Query:  EQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRATLN------PLLAELLA
        ++I +   K  WIL Y  E+       +G     P + +      W P   G  K+NT+AA  +       +LR   A+ V A ++      PLLA++LA
Subjt:  EQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRATLN------PLLAELLA

Query:  MCEGLLSAAELGFHNLVVETDSIQATRILQGETAVW-NEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALAE
        + EGL  A  LG H +VVETDS++A  ++ G+ + W  EA +W  D++           ++  R  N +A+ L     S    F+    FP WL  LAE
Subjt:  MCEGLLSAAELGFHNLVVETDSIQATRILQGETAVW-NEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALAE

XP_042952220.1 uncharacterized protein LOC122289300 [Carya illinoinensis] | 9.4e-08 | 31.65
Query:  TDSCWLPSPSGWFKVNTNAAYDKS-------VLRSSMAKQVRATL--------NPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNE
        ++  W   PS WFKVN + A D++       V+      QV AT+        +PLLAE          A +LG H +V+E DS+Q T+ LQ E   W+ 
Subjt:  TDSCWLPSPSGWFKVNTNAAYDKS-------VLRSSMAKQVRATL--------NPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNE

Query:  AGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSAL
        A    S+ +        + + +  R  N+IAH LA  A +  E  V+    PSW+  L
Subjt:  AGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSAL

XP_042969074.1 uncharacterized protein LOC122301757 [Carya illinoinensis] | 1.2e-07 | 30.38
Query:  TDSCWLPSPSGWFKVNTNAAYDKS-------VLRSSMAKQVRATL--------NPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNE
        ++  W   PS WFK N + A D++       V+      QV AT+        +PLLAE          A +LG   +V+E DS+Q T+ LQ E   W+ 
Subjt:  TDSCWLPSPSGWFKVNTNAAYDKS-------VLRSSMAKQVRATL--------NPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNE

Query:  AGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSAL
        A    S+ +        + + +  R  N+IAH LA +A + +E  V+    PSW+  L
Subjt:  AGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSAL

TrEMBL top hits | e value | %identity
A0A5C7GQX3 RNase H domain-containing protein | 7.7e-08 | 27.96
Query:  GVEHGEQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATD---SCWLPSPSGWFKVNTNAAYDKS--------VLRS-------SMAKQVRA
        G + GE +   N    W   +  E  +         S AP+  +  +D     WL  P G FK+NT+AA +          V+R+       S  K  R 
Subjt:  GVEHGEQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATD---SCWLPSPSGWFKVNTNAAYDKS--------VLRS-------SMAKQVRA

Query:  TLNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAF
          +P++AE LA+ EGL  A   GF   V+E+D++   + +    ++ ++ G    D+         FS R+  R  N++AH LA LA S+   FV S   
Subjt:  TLNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAF

Query:  PSWLSALAEGE
        P  +  L  G+
Subjt:  PSWLSALAEGE

A0A5C7IFK3 RNase H domain-containing protein | 2.3e-07 | 29.03
Query:  WLPSPSGWFKVNTNAAYDKS--------VLRSSMAKQVRAT-------LNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTW
        W P   G  K+N +AA D          V+R S  + ++A         +  +AE  A+ EG+L A   G  +L +E+DS+   R+  GE +  N+    
Subjt:  WLPSPSGWFKVNTNAAYDKS--------VLRSSMAKQVRAT-------LNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTW

Query:  ASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALA
          D+Q + +R    S+ Y  R+CN +AH +A  A     S +    +P WL  +A
Subjt:  ASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALA

A0A5C7IST2 RNase H domain-containing protein | 1.7e-07 | 29.68
Query:  WLPSPSGWFKVNTNAAYDKS--------VLRSSMAKQVRAT-------LNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTW
        W+P   G  KVN +AA D          V+R S  + +++         +  +AE   + EGLL A   G   L +E+DS+   R+  GE +  N+    
Subjt:  WLPSPSGWFKVNTNAAYDKS--------VLRSSMAKQVRAT-------LNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTW

Query:  ASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALA
          D+Q + +R    S+ Y  R+CN++AH +A  A     S +    +P WL  LA
Subjt:  ASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALA

A0A6J1DX30 uncharacterized protein LOC111024874 | 3.4e-11 | 29.82
Query:  HGEQIDDVNSKATWILTY-----QLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRAT-------LN
        HG+Q+  V  K  W+  +     Q +MS    +    S+  P++        W PS S   K+NT+AA   +      ++R S    V AT       L+
Subjt:  HGEQIDDVNSKATWILTY-----QLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRAT-------LN

Query:  PLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSS-AESFVSSRAFPS
        PLLAE+  + EGL  AA   F +L VE+DS+ A ++++ E     +   W  ++Q +T      S  ++SR CN+ AH LA    +S + ++     FP+
Subjt:  PLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSS-AESFVSSRAFPS

Query:  WLSALAEGEAAAQ--HVA
        WL  L + +  +   HVA
Subjt:  WLSALAEGEAAAQ--HVA

A0A6J1DZK3 uncharacterized protein LOC111024968 | 1.4e-09 | 30.65
Query:  EQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRATLN------PLLAELLA
        ++I +   K  WIL Y  E+       +G     P + +      W P   G  K+NT+AA  +       +LR   A+ V A ++      PLLA++LA
Subjt:  EQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPSGWFKVNTNAAYDKS------VLRSSMAKQVRATLN------PLLAELLA

Query:  MCEGLLSAAELGFHNLVVETDSIQATRILQGETAVW-NEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALAE
        + EGL  A  LG H +VVETDS++A  ++ G+ + W  EA +W  D++           ++  R  N +A+ L     S    F+    FP WL  LAE
Subjt:  MCEGLLSAAELGFHNLVVETDSIQATRILQGETAVW-NEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLASLAFSSAESFVSSRAFPSWLSALAE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
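
In the alignment blocks above, the unlabelled middle row marks each column: a letter for an identical residue, `+` for a similar residue, and a space otherwise. As an illustrative sketch only, the snippet below counts those marks for one block. The helper name `midline_stats` is hypothetical, and the result covers a single block, so it is not expected to reproduce the %identity values listed for whole hits, which are computed by the search tool over the full alignment.

```python
def midline_stats(query_row, midline_row):
    """Count identical and positive columns in one alignment block.

    Both arguments are the row text with the 'Query:  ' prefix (or the
    equivalent leading padding) already removed. The midline is padded
    or trimmed to the query row's length so trailing spaces lost in
    copy-paste do not change the column count.
    """
    width = len(query_row)
    midline = midline_row.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, width

# Last block of the first GenBank hit (TXG47194.1).
ident, pos, width = midline_stats("PSWLSALAEGE", "P  +  L  G+")
print(ident, pos, width)  # 3 5 11
print(round(100 * ident / width, 2))
```

Summing `identical` and `width` over all blocks of a hit, then dividing, gives a per-hit figure that can be compared against the reported column.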

Sequences
CDS sequence
ATGGTTGCAGCAGTTCATCGCGCCACTTTAAGAGAAAGGAGTACTATTGATGGGTTAGTTATGTCGGGTTGTCACGAGTTCCAACTCCACCGTCGCTTGGGTGTCGTCGC
CATCGTGCAAGGAAGGGTGGCTAAGAGTGGCACGAGGGGAGAGATGCTCAAGGGGGTGGAACATGGGGAACAAATTGATGATGTGAACTCCAAGGCTACTTGGATTCTGA
CATATCAGCTGGAAATGTCCAGAATCGGTGCCCAGAAGCATGGGGCTTCGTCCGATGCTCCCCTTTTGGGTCAATCCGCTACTGATTCCTGCTGGTTGCCGTCCCCATCT
GGCTGGTTCAAAGTTAACACAAATGCCGCCTATGATAAAAGTGTTCTTCGCTCATCGATGGCAAAGCAGGTTCGGGCTACTTTAAATCCACTTCTCGCTGAACTCTTGGC
TATGTGTGAGGGGCTTTTATCTGCTGCGGAATTGGGATTTCATAACCTTGTAGTTGAAACGGACTCGATTCAGGCAACCCGCATCCTTCAAGGTGAGACTGCGGTGTGGA
ACGAGGCTGGCACTTGGGCCTCGGATGTCCAAGAGGTGACGGCCAGGCTTGGAGCTTTTTCTTTAAGGTATGCTAGTAGAACTTGCAACCAGATTGCACATTCTTTAGCC
TCTTTAGCTTTTTCTTCTGCTGAGTCTTTTGTTTCGTCGAGGGCTTTCCCTTCTTGGCTGTCTGCATTAGCTGAAGGGGAAGCTGCTGCTCAGCATGTAGCCCAAGTGGT
GCCCTTTTTGTCCTAA
mRNA sequence
ATGGTTGCAGCAGTTCATCGCGCCACTTTAAGAGAAAGGAGTACTATTGATGGGTTAGTTATGTCGGGTTGTCACGAGTTCCAACTCCACCGTCGCTTGGGTGTCGTCGC
CATCGTGCAAGGAAGGGTGGCTAAGAGTGGCACGAGGGGAGAGATGCTCAAGGGGGTGGAACATGGGGAACAAATTGATGATGTGAACTCCAAGGCTACTTGGATTCTGA
CATATCAGCTGGAAATGTCCAGAATCGGTGCCCAGAAGCATGGGGCTTCGTCCGATGCTCCCCTTTTGGGTCAATCCGCTACTGATTCCTGCTGGTTGCCGTCCCCATCT
GGCTGGTTCAAAGTTAACACAAATGCCGCCTATGATAAAAGTGTTCTTCGCTCATCGATGGCAAAGCAGGTTCGGGCTACTTTAAATCCACTTCTCGCTGAACTCTTGGC
TATGTGTGAGGGGCTTTTATCTGCTGCGGAATTGGGATTTCATAACCTTGTAGTTGAAACGGACTCGATTCAGGCAACCCGCATCCTTCAAGGTGAGACTGCGGTGTGGA
ACGAGGCTGGCACTTGGGCCTCGGATGTCCAAGAGGTGACGGCCAGGCTTGGAGCTTTTTCTTTAAGGTATGCTAGTAGAACTTGCAACCAGATTGCACATTCTTTAGCC
TCTTTAGCTTTTTCTTCTGCTGAGTCTTTTGTTTCGTCGAGGGCTTTCCCTTCTTGGCTGTCTGCATTAGCTGAAGGGGAAGCTGCTGCTCAGCATGTAGCCCAAGTGGT
GCCCTTTTTGTCCTAA
Protein sequence
MVAAVHRATLRERSTIDGLVMSGCHEFQLHRRLGVVAIVQGRVAKSGTRGEMLKGVEHGEQIDDVNSKATWILTYQLEMSRIGAQKHGASSDAPLLGQSATDSCWLPSPS
GWFKVNTNAAYDKSVLRSSMAKQVRATLNPLLAELLAMCEGLLSAAELGFHNLVVETDSIQATRILQGETAVWNEAGTWASDVQEVTARLGAFSLRYASRTCNQIAHSLA
SLAFSSAESFVSSRAFPSWLSALAEGEAAAQHVAQVVPFLS
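
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below does this with the standard genetic code (NCBI translation table 1), which is assumed here because the page does not name a table; the `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as the wrapped lines shown on the page. Codons containing
    characters other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 21 nt of the CDS listed above.
print(translate("ATGGTTGCAGCAGTTCATCGC"))  # MVAAVHR
print(translate("GTGGTGCCCTTTTTGTCCTAA"))  # VVPFLS
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (after joining its wrapped lines) gives a quick consistency check for the record.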