; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016117 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016117
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRNase H domain-containing protein
Genome locationtig00007400:104090..107300
RNA-Seq ExpressionSgr016117
SyntenySgr016117
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
DAD25548.1 TPA_asm: hypothetical protein HUJ06_027012 [Nelumbo nucifera]5.6e-1827.78Show/hide
Query:  LIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNLV--SSLVGHLI
        +IR+    K WAK VW+  VP K+ +FAWK    ALP DDR++  GI+LAS+C CC     E+L+HLFI S++A+ L+   ++IF +N+    S+   L+
Subjt:  LIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNLV--SSLVGHLI

Query:  R--------------------SIPFFNTW---------EGDITQFVDVYKCGKVSRYCS----------FADHLVLEALNINTREPQRSPLRVVKCSH--
        +                    S   +  W         E   T  + + K  +  R CS            D ++L  LN+  R  +   + + K S   
Subjt:  R--------------------SIPFFNTW---------EGDITQFVDVYKCGKVSRYCS----------FADHLVLEALNINTREPQRSPLRVVKCSH--

Query:  --------------------------------------CYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKA
                                               YGICS+++AE RA++DG+ L   + E+    I+ E+D KV+VD    KA
Subjt:  --------------------------------------CYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKA

KAF5192308.1 hypothetical protein FRX31_018104 [Thalictrum thalictroides]5.8e-1540.78Show/hide
Query:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN
        S+G F+++S ++L      S    K +W   +P KL+ F WK F+ A+P+D  V G  I + SKC CC RP  ET  HLF+HSDLA  ++   S+ FG+ 
Subjt:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN

Query:  LVS
          S
Subjt:  LVS

KAG2724482.1 hypothetical protein I3760_01G019600 [Carya illinoinensis]2.2e-1436.75Show/hide
Query:  GSKKSRSNLFGFLHKYEASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIH
        G+ K   +L  ++H      G FT +SA + +R       WAK VW + +P K+++  WK F+++L +DDR++ +GI + SKCNCC+R   E LNH+   
Subjt:  GSKKSRSNLFGFLHKYEASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIH

Query:  SDLASFLFHKVSSIFGL
         D+AS ++ + S I G+
Subjt:  SDLASFLFHKVSSIFGL

XP_018851891.1 uncharacterized protein LOC109014035 [Juglans regia]6.8e-1625.72Show/hide
Query:  EASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG
        +  SG+FT +SA + IR R+      K +W+  +P K ++F WK ++ AL +DD+++ +GI LASKC+CC + + E +NH+    ++A  ++ K  ++ G
Subjt:  EASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG

Query:  LNL----------------VSSLVGHLIRSIPFFNTWE---------------------GDITQFVDVYKCGKVSRYCSF--ADHLVLEALNINTREPQR
        L +                 SS +G ++  +P   +W                        I  +V V   G ++++  F   D  +L +LN+     Q+
Subjt:  LNL----------------VSSLVGHLIRSIPFFNTWE---------------------GDITQFVDVYKCGKVSRYCSF--ADHLVLEALNINTREPQR

Query:  SPLRVVKCS-------------HCYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKAKGL
         P++++  S              C G   S  AE RAL +GI   K   ++    I  + DSKV++  L     G+
Subjt:  SPLRVVKCS-------------HCYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKAKGL

XP_042973084.1 uncharacterized protein LOC122304886 [Carya illinoinensis]9.9e-1532.65Show/hide
Query:  WDLPISYRGRSSRCITHFLYADETLVFCN--GSKKSRSNLFGFLHKYEASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLD
        WD+ + +R          + A +T   C+  G+ K   +L  ++H      G FT +SA + +R      TW K VW + +P K+++  WK F+ +L +D
Subjt:  WDLPISYRGRSSRCITHFLYADETLVFCN--GSKKSRSNLFGFLHKYEASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLD

Query:  DRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG
        DR++ +GI + SKCNCC R   E LNH+    D+AS ++ + S + G
Subjt:  DRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG

TrEMBL top hitse value%identityAlignment
A0A2I4H6W8 uncharacterized protein LOC1090140353.3e-1625.72Show/hide
Query:  EASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG
        +  SG+FT +SA + IR R+      K +W+  +P K ++F WK ++ AL +DD+++ +GI LASKC+CC + + E +NH+    ++A  ++ K  ++ G
Subjt:  EASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFG

Query:  LNL----------------VSSLVGHLIRSIPFFNTWE---------------------GDITQFVDVYKCGKVSRYCSF--ADHLVLEALNINTREPQR
        L +                 SS +G ++  +P   +W                        I  +V V   G ++++  F   D  +L +LN+     Q+
Subjt:  LNL----------------VSSLVGHLIRSIPFFNTWE---------------------GDITQFVDVYKCGKVSRYCSF--ADHLVLEALNINTREPQR

Query:  SPLRVVKCS-------------HCYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKAKGL
         P++++  S              C G   S  AE RAL +GI   K   ++    I  + DSKV++  L     G+
Subjt:  SPLRVVKCS-------------HCYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVDLLTSKAKGL

A0A6P9EFB8 uncharacterized protein LOC1183445864.0e-1437Show/hide
Query:  SGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNL
        +G+F+ +SA + IR R  S  WA  +W  N+P K+++  WK  ++ L +DD++K +GI   SKCNCC R  ME LNH+  + D A  ++   ++  G+++
Subjt:  SGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNL

A0A6P9EK80 uncharacterized protein LOC1183470724.0e-1437Show/hide
Query:  SGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNL
        +G+F+ +SA + IR R  S  WA  +W  N+P K+++  WK  ++ L +DD++K +GI   SKCNCC R  ME LNH+  + D A  ++   ++  G+++
Subjt:  SGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNL

A0A7J6W695 zf-RVT domain-containing protein2.8e-1540.78Show/hide
Query:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN
        S+G F+++S ++L      S    K +W   +P KL+ F WK F+ A+P+D  V G  I + SKC CC RP  ET  HLF+HSDLA  ++   S+ FG+ 
Subjt:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN

Query:  LVS
          S
Subjt:  LVS

M1BXK5 RNase H family protein2.4e-1437.62Show/hide
Query:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN
        + G FTV+SA +L+R R E + W   +W K +P K++ F W+ +   +P DD +K + I++ SKC CCN  + ET+ HLF+ + +A  L++  +S  G+N
Subjt:  SSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLN

Query:  L
        +
Subjt:  L

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.5e-0834.57Show/hide
Query:  IRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVS
        +R R++  +WAK VW K   PK     W      LP   R+   G+ L + C  C+   +E  +HLF+  + A FL+H VS
Subjt:  IRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVS

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.2e-0634.83Show/hide
Query:  FLHKYEASSGSFTVRSALELIRNRT--ESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLF
        F+ K   S  S    SA   I      E   W KA+W K   PK    +W    H LP  D++   G+H+ S C  CN    ET  HLF
Subjt:  FLHKYEASSGSFTVRSALELIRNRT--ESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTATCATCCCTAGATCGATCAATGCAACAGCAATTGCCTTTATTCCGAAAGAGGGATCTCCAGCCACCTTTTCAGATTTCAGGCCAATCAGTCTTTGTAACTAA
AGTTACAAGGACTTCGTCAAGGTTTGGAACCCTAGACTCTTCATTATTGCGGAGGAAGCACTTAGCAGGGGTTTATCTTCTCTCTTCCAAAAAGCACTTTTGGGATTTGC
CCATAAGCTATAGGGGCAGATCATCGAGATGCATCACTCATTTCTTATATGCGGACGAGACTCTTGTTTTCTGCAATGGTTCAAAGAAATCGAGATCGAACCTATTTGGC
TTTCTTCACAAGTATGAAGCTTCATCAGGTAGCTTCACTGTTCGGTCCGCTTTGGAGTTAATTCGTAATAGAACAGAGTCAAAGACCTGGGCAAAAGCTGTTTGGAGCAA
GAATGTCCCACCAAAGCTTAACATGTTTGCTTGGAAGCAGTTCTACCATGCATTACCTTTGGATGACAGGGTCAAAGGACTTGGGATTCATCTTGCTAGCAAATGTAACT
GCTGTAATAGACCACAAATGGAAACTCTCAATCATTTGTTTATTCACAGCGACCTGGCATCTTTCCTTTTTCACAAAGTGAGCAGTATCTTTGGGCTGAATCTCGTGAGT
TCTCTTGTTGGCCATCTGATCCGCTCCATTCCCTTCTTTAACACATGGGAAGGTGATATAACACAGTTTGTGGATGTCTACAAGTGCGGAAAAGTCTCTCGCTATTGTTC
TTTTGCTGATCATCTCGTTCTTGAAGCTCTGAACATAAACACAAGAGAACCTCAGAGATCTCCTTTGAGAGTCGTCAAATGCTCTCATTGCTATGGTATATGTTCAAGTG
TCGTTGCTGAATTCCGAGCGCTATATGATGGCATCTTACTGTTCAAATCAATATCAGAAATCCGTACATGTCCTATTGTTCGAGAAACGGATTCTAAAGTGTTGGTTGAT
TTACTTACTTCAAAGGCTAAGGGTTTGCTTAGGCCCCTACCTCGCCCAACTGCCTCTCCTAATTGGAGAGAGAAAACCAATCAGACGGGGAGGAGTTCTTTCTCCGTCCC
ATCACGGGAAGTTGCGGTTCAGCCACCGATACCGGTGCGGGAGACAGGGAAAAAGATAAGGGTCGGTCTATTCAGTCTTATTAGTCCCCTTCAGTCACTGGACTGGGCTC
TGAGTCCTGATACCGAATGGGAGGCCCATGCTAGCTCCACGCGGGAAGAAGTCAGACAAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTATCATCCCTAGATCGATCAATGCAACAGCAATTGCCTTTATTCCGAAAGAGGGATCTCCAGCCACCTTTTCAGATTTCAGGCCAATCAGTCTTTGTAACTAA
AGTTACAAGGACTTCGTCAAGGTTTGGAACCCTAGACTCTTCATTATTGCGGAGGAAGCACTTAGCAGGGGTTTATCTTCTCTCTTCCAAAAAGCACTTTTGGGATTTGC
CCATAAGCTATAGGGGCAGATCATCGAGATGCATCACTCATTTCTTATATGCGGACGAGACTCTTGTTTTCTGCAATGGTTCAAAGAAATCGAGATCGAACCTATTTGGC
TTTCTTCACAAGTATGAAGCTTCATCAGGTAGCTTCACTGTTCGGTCCGCTTTGGAGTTAATTCGTAATAGAACAGAGTCAAAGACCTGGGCAAAAGCTGTTTGGAGCAA
GAATGTCCCACCAAAGCTTAACATGTTTGCTTGGAAGCAGTTCTACCATGCATTACCTTTGGATGACAGGGTCAAAGGACTTGGGATTCATCTTGCTAGCAAATGTAACT
GCTGTAATAGACCACAAATGGAAACTCTCAATCATTTGTTTATTCACAGCGACCTGGCATCTTTCCTTTTTCACAAAGTGAGCAGTATCTTTGGGCTGAATCTCGTGAGT
TCTCTTGTTGGCCATCTGATCCGCTCCATTCCCTTCTTTAACACATGGGAAGGTGATATAACACAGTTTGTGGATGTCTACAAGTGCGGAAAAGTCTCTCGCTATTGTTC
TTTTGCTGATCATCTCGTTCTTGAAGCTCTGAACATAAACACAAGAGAACCTCAGAGATCTCCTTTGAGAGTCGTCAAATGCTCTCATTGCTATGGTATATGTTCAAGTG
TCGTTGCTGAATTCCGAGCGCTATATGATGGCATCTTACTGTTCAAATCAATATCAGAAATCCGTACATGTCCTATTGTTCGAGAAACGGATTCTAAAGTGTTGGTTGAT
TTACTTACTTCAAAGGCTAAGGGTTTGCTTAGGCCCCTACCTCGCCCAACTGCCTCTCCTAATTGGAGAGAGAAAACCAATCAGACGGGGAGGAGTTCTTTCTCCGTCCC
ATCACGGGAAGTTGCGGTTCAGCCACCGATACCGGTGCGGGAGACAGGGAAAAAGATAAGGGTCGGTCTATTCAGTCTTATTAGTCCCCTTCAGTCACTGGACTGGGCTC
TGAGTCCTGATACCGAATGGGAGGCCCATGCTAGCTCCACGCGGGAAGAAGTCAGACAAGTCTGA
Protein sequenceShow/hide protein sequence
MVVSSLDRSMQQQLPLFRKRDLQPPFQISGQSVFVTKVTRTSSRFGTLDSSLLRRKHLAGVYLLSSKKHFWDLPISYRGRSSRCITHFLYADETLVFCNGSKKSRSNLFG
FLHKYEASSGSFTVRSALELIRNRTESKTWAKAVWSKNVPPKLNMFAWKQFYHALPLDDRVKGLGIHLASKCNCCNRPQMETLNHLFIHSDLASFLFHKVSSIFGLNLVS
SLVGHLIRSIPFFNTWEGDITQFVDVYKCGKVSRYCSFADHLVLEALNINTREPQRSPLRVVKCSHCYGICSSVVAEFRALYDGILLFKSISEIRTCPIVRETDSKVLVD
LLTSKAKGLLRPLPRPTASPNWREKTNQTGRSSFSVPSREVAVQPPIPVRETGKKIRVGLFSLISPLQSLDWALSPDTEWEAHASSTREEVRQV