CuGenDBv2

Sgr016418 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016418
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: RNase H domain-containing protein
Genome location: tig00152909:779617..780228
RNA-Seq Expression: Sgr016418
Synteny: Sgr016418
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
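
The genome location above appears to follow the common `contig:start..end` convention with 1-based inclusive coordinates, in which case the span length is `end - start + 1`. The following is a minimal Python sketch under that assumption; the helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

contig, start, end = parse_location("tig00152909:779617..780228")
span = end - start + 1  # assumes 1-based inclusive coordinates
print(contig, span)
```

For this record the span comes to 612 bp, which is consistent with a 203-residue protein plus a stop codon (203 × 3 + 3 = 612).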


Homology
GenBank top hits (hit; e value; %identity; alignment)
XP_010682926.1 PREDICTED: uncharacterized protein LOC104897688 [Beta vulgaris subsp. vulgaris]; e value 2.3e-17; %identity 32.55
Query:  LCWILWNNRNNLCFQREGKRPWELWVWANQYDNGEMGFGEAGEC---------EVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTAST
        LCW +W  RN   F+   + P  +   A Q   G +  G   +           V RW  P   ++KLNSDAAM    N  G+G ++ +  G+V+  A  
Subjt:  LCWILWNNRNNLCFQREGKRPWELWVWANQYDNGEMGFGEAGEC---------EVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTAST

Query:  IRLN-------VRCVDFADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGV
         R+N          +   +GIKLALE G   + +EVD++R++K L D     S+  S+IRD++Q     S I+ S  +R GN  AH LA+L+ ++    V
Subjt:  IRLN-------VRCVDFADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGV

Query:  WLKEAPADVARI
        WL++ P+++A +
Subjt:  WLKEAPADVARI

XP_015382610.1 uncharacterized protein LOC107175578 [Citrus sinensis]; e value 2.3e-17; %identity 29.11
Query:  MEEMITLCWILWNNRNNLCFQREGKRPWELWVWA-------NQYDNGEMGFGEAGECEVIR-WHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI
        +E MI +CW++WN RN + F+ + + P  L   A        +    E    E+ + +  + W+PP     K+N DAA S  ++ AG+G II++  G VI
Subjt:  MEEMITLCWILWNNRNNLCFQREGKRPWELWVWA-------NQYDNGEMGFGEAGECEVIR-WHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI

Query:  RTASTIRLNVRCVDFAD------GIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAV
          A  I      V FA+      G+++A      A+ VE D+  V KL++++  G+SE+  +I +++  +R++  ++ ++T R+ N   H LA+L +E  
Subjt:  RTASTIRLNVRCVDFAD------GIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAV

Query:  LSGVWLKEAPADV
         + VW+   P+ +
Subjt:  LSGVWLKEAPADV

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]; e value 3.0e-17; %identity 30
Query:  EEMITLCWILWNNRNNLCFQREGKRPWELWVWANQY---------------DNGEMGFGEAGECEVIR----WHPPSGNIFKLNSDAAMSISQNSAGIG-
        EE++   W LWN RN   F +      +L  W + Y                +    F ++ +    +    W P    +FKL +DA+ S    +AG+G 
Subjt:  EEMITLCWILWNNRNNLCFQREGKRPWELWVWANQY---------------DNGEMGFGEAGECEVIR----WHPPSGNIFKLNSDAAMSISQNSAGIG-

Query:  IIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAA
        III++  G+V+ +A+    +V  VD A      +G+++A+E G+  I +E DS R++ L   + EG S+  S+I  +K  + +  +++ SFTKR GN  A
Subjt:  IIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAA

Query:  HWLARLAIEA
        H LAR A+++
Subjt:  HWLARLAIEA

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]; e value 1.1e-16; %identity 32.96
Query:  WANQY------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVW
        WAN+Y       N     G       + W PP   I+K+N+DA+   S   AG+GIII+N  G+V+ +A+    N++ VD A      +G++LA + GV 
Subjt:  WANQY------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVW

Query:  AIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGVWLKEAPADV
         + +E DS R+F L     E  SE   ++   K         + +F KR GNKAAH LAR A+      +W+++ P ++
Subjt:  AIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGVWLKEAPADV

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e value 5.0e-17; %identity 31.8
Query:  EEMITLCWILWNNRNNLCFQREGKRPW----ELWVWANQYDNGEMGFGEAGECEV---------IRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQG
        EE+  + W LWN RN   F    K  +    EL  WAN+Y    M F EA    +         I W PP   I+K+N+DA+   S   AG+GIII N  
Subjt:  EEMITLCWILWNNRNNLCFQREGKRPW----ELWVWANQYDNGEMGFGEAGECEV---------IRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQG

Query:  GEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLA
        G+V+  A+    N++ VD A      +G++LA E G                +H  +E  SE   ++   K         + +F KR GNKAAH LAR A
Subjt:  GEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLA

Query:  IEAVLSGVWLKEAPADV
        +      +W+++ P ++
Subjt:  IEAVLSGVWLKEAPADV

TrEMBL top hits (hit; e value; %identity; alignment)
A0A6J1CDQ4 uncharacterized protein LOC111010533; e value 1.4e-17; %identity 30
Query:  EEMITLCWILWNNRNNLCFQREGKRPWELWVWANQY---------------DNGEMGFGEAGECEVIR----WHPPSGNIFKLNSDAAMSISQNSAGIG-
        EE++   W LWN RN   F +      +L  W + Y                +    F ++ +    +    W P    +FKL +DA+ S    +AG+G 
Subjt:  EEMITLCWILWNNRNNLCFQREGKRPWELWVWANQY---------------DNGEMGFGEAGECEVIR----WHPPSGNIFKLNSDAAMSISQNSAGIG-

Query:  IIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAA
        III++  G+V+ +A+    +V  VD A      +G+++A+E G+  I +E DS R++ L   + EG S+  S+I  +K  + +  +++ SFTKR GN  A
Subjt:  IIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAA

Query:  HWLARLAIEA
        H LAR A+++
Subjt:  HWLARLAIEA

A0A6J1CIF1 uncharacterized protein LOC111011237; e value 5.4e-17; %identity 32.96
Query:  WANQY------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVW
        WAN+Y       N     G       + W PP   I+K+N+DA+   S   AG+GIII+N  G+V+ +A+    N++ VD A      +G++LA + GV 
Subjt:  WANQY------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTASTIRLNVRCVDFA------DGIKLALECGVW

Query:  AIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGVWLKEAPADV
         + +E DS R+F L     E  SE   ++   K         + +F KR GNKAAH LAR A+      +W+++ P ++
Subjt:  AIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGVWLKEAPADV

A0A6J1DAR4 uncharacterized protein LOC111018954; e value 2.4e-17; %identity 31.8
Query:  EEMITLCWILWNNRNNLCFQREGKRPW----ELWVWANQYDNGEMGFGEAGECEV---------IRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQG
        EE+  + W LWN RN   F    K  +    EL  WAN+Y    M F EA    +         I W PP   I+K+N+DA+   S   AG+GIII N  
Subjt:  EEMITLCWILWNNRNNLCFQREGKRPW----ELWVWANQYDNGEMGFGEAGECEV---------IRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQG

Query:  GEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLA
        G+V+  A+    N++ VD A      +G++LA E G                +H  +E  SE   ++   K         + +F KR GNKAAH LAR A
Subjt:  GEVIRTASTIRLNVRCVDFA------DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLA

Query:  IEAVLSGVWLKEAPADV
        +      +W+++ P ++
Subjt:  IEAVLSGVWLKEAPADV

A0A6J1DBJ7 uncharacterized protein LOC111018973; e value 4.6e-16; %identity 32.57
Query:  MITLCWILWNNRNNL--CFQREGKRPWELWVWANQY--------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIR
        ++ L W +WN RN     F   G    +L  W+  Y         +            V  W PP+  + K+N DAA       AG+G+II++  G V  
Subjt:  MITLCWILWNNRNNL--CFQREGKRPWELWVWANQY--------DNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIR

Query:  TASTIRLNVRCVD------FA--DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSE-ITCSFTKRAGNKAAHWLARLAIE
        TA  IRL  R  D      FA  +GI LA+E G    Q+E DS R+F LL  +   +SEV  +   +K  + S +E ++ SFT R GN  AH LA+LA+ 
Subjt:  TASTIRLNVRCVD------FA--DGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSE-ITCSFTKRAGNKAAHWLARLAIE

Query:  AVLSGVWLKEAPADVARI
        +    +W++E P +++ +
Subjt:  AVLSGVWLKEAPADVARI

A0A803NGI9 Uncharacterized protein; e value 2.7e-16; %identity 29.11
Query:  MEEMITLCWILWNNRNNLCFQREGKRPWELWV--------WANQYDNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI
        ME+++   W LWN+RNN    + G  P +LW         +  Q  + +      G      W  P  +  K+N DAA  IS+N  G+GIII+N  G+V+
Subjt:  MEEMITLCWILWNNRNNLCFQREGKRPWELWV--------WANQYDNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI

Query:  RTAS---TIRLNVRCVD---FADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAV
           S   T RL  + ++      GI  A  C +     E DS  +   ++      S    ++ D+K ++   S +  S  KR  N+AAH LA+ A+E  
Subjt:  RTAS---TIRLNVRCVD---FADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAV

Query:  LSGVWLKEAPADV
           +W +E P+ +
Subjt:  LSGVWLKEAPADV

SwissProt top hits (hit; e value; %identity; alignment)
No hits found

Arabidopsis top hits (hit; e value; %identity; alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 1.6e-08; %identity 24.1
Query:  LCWILWNNRNNLCFQ----------REGKRPWELWVWANQYDNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI----
        L W LW +RN L F+          R     +E W    + + G+    +      ++W  P     K N+DA   +     GIG I++N+ G V+    
Subjt:  LCWILWNNRNNLCFQ----------REGKRPWELWVWANQYDNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVI----

Query:  ----RTASTIRLNVRCVDFADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAI
            RT + +   +  + +A  +          I  E D+  +  LL+ +    +  P+ + D++Q +  + E+   FT R GNK A  +AR +I
Subjt:  ----RTASTIRLNVRCVDFADGIKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAI

AT4G29090.1 Ribonuclease H-like superfamily protein; e value 6.0e-08; %identity 26.02
Query:  LCWILWNNRNNLCFQ----------REGKRPWELWVWANQYDN-GEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTA
        L W LW NRN L F+          R  +   E W    + ++ G         C   RW PP     K N+DA  +      GIG +++N+ GEV    
Subjt:  LCWILWNNRNNLCFQ----------REGKRPWELWVWANQYDN-GEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTA

Query:  STIRLNVRCVDFADGIKLALECGVWAIQ-----------VEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLAR
        +     ++ V     ++  LE   WA+             E DS  + ++L+++    S  P+ I+DL++ +  ++E+   F  R GN  A  +AR
Subjt:  STIRLNVRCVDFADGIKLALECGVWAIQ-----------VEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLAR
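
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single segment. It is only an illustration (the function name `midline_stats` is hypothetical), and it will not necessarily reproduce the reported %identity, which is computed over the full alignment and can be affected by whitespace lost from the ends of midline rows.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one BLAST-style midline.

    Letters count as identities, '+' as conservative substitutions,
    and spaces as mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Final segment of the first GenBank alignment (query WLKEAPADVARI).
ident, pos, length = midline_stats("WL++ P+++A +")
print(ident, pos, length)
```

Summing identities and lengths across all segments of one hit and dividing gives a rough per-hit identity fraction to compare against the table.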


Sequences
CDS sequence
ATGGAGGAAATGATAACACTATGCTGGATTTTGTGGAACAATAGAAATAACCTATGCTTCCAAAGAGAGGGGAAACGACCATGGGAGCTGTGGGTTTGGGCAAATCAATA
CGACAATGGGGAGATGGGCTTTGGTGAGGCAGGGGAATGTGAAGTAATCCGGTGGCATCCCCCTAGTGGTAACATTTTCAAATTAAACTCAGATGCAGCTATGTCCATAT
CACAGAATTCAGCAGGCATTGGCATTATCATAAAAAATCAGGGGGGAGAAGTCATTCGTACCGCATCCACGATTCGTCTGAATGTACGCTGTGTTGATTTCGCCGATGGG
ATAAAGCTTGCATTAGAGTGTGGGGTTTGGGCGATTCAAGTGGAAGTTGATTCGTATAGAGTTTTTAAGCTTCTGCATGATGAAATTGAAGGCGAATCAGAGGTGCCAAG
TATGATCCGAGACCTGAAGCAGAAAGTGAGAAGTTGGAGTGAGATTACCTGTAGCTTCACAAAACGGGCTGGCAATAAAGCAGCTCATTGGCTTGCGCGGTTGGCGATTG
AAGCTGTGTTGAGTGGCGTTTGGTTGAAGGAAGCTCCGGCGGATGTTGCGAGGATTGCTTAG
mRNA sequence
ATGGAGGAAATGATAACACTATGCTGGATTTTGTGGAACAATAGAAATAACCTATGCTTCCAAAGAGAGGGGAAACGACCATGGGAGCTGTGGGTTTGGGCAAATCAATA
CGACAATGGGGAGATGGGCTTTGGTGAGGCAGGGGAATGTGAAGTAATCCGGTGGCATCCCCCTAGTGGTAACATTTTCAAATTAAACTCAGATGCAGCTATGTCCATAT
CACAGAATTCAGCAGGCATTGGCATTATCATAAAAAATCAGGGGGGAGAAGTCATTCGTACCGCATCCACGATTCGTCTGAATGTACGCTGTGTTGATTTCGCCGATGGG
ATAAAGCTTGCATTAGAGTGTGGGGTTTGGGCGATTCAAGTGGAAGTTGATTCGTATAGAGTTTTTAAGCTTCTGCATGATGAAATTGAAGGCGAATCAGAGGTGCCAAG
TATGATCCGAGACCTGAAGCAGAAAGTGAGAAGTTGGAGTGAGATTACCTGTAGCTTCACAAAACGGGCTGGCAATAAAGCAGCTCATTGGCTTGCGCGGTTGGCGATTG
AAGCTGTGTTGAGTGGCGTTTGGTTGAAGGAAGCTCCGGCGGATGTTGCGAGGATTGCTTAG
Protein sequence
MEEMITLCWILWNNRNNLCFQREGKRPWELWVWANQYDNGEMGFGEAGECEVIRWHPPSGNIFKLNSDAAMSISQNSAGIGIIIKNQGGEVIRTASTIRLNVRCVDFADG
IKLALECGVWAIQVEVDSYRVFKLLHDEIEGESEVPSMIRDLKQKVRSWSEITCSFTKRAGNKAAHWLARLAIEAVLSGVWLKEAPADVARIA
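
The protein sequence should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 bases of the CDS above and the first 9 residues of the protein.
print(translate("ATGGAGGAAATGATAACACTATGCTGG"))
```

Passing the complete CDS (with its line breaks) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.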