; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022496 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022496
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationtig00154740:169354..170066
RNA-Seq ExpressionSgr022496
SyntenySgr022496
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000123 - RNA-directed DNA polymerase (reverse transcriptase), msDNA
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB8468469.1 hypothetical protein FH972_025303 [Carpinus fangiana]5.8e-5650.21Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKAD----------------------TIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE
        MYRSWIPKPNKP +LR ITQPN+ D                      + GFR+GRGPITFF E+ RWG +D+LIKSDIVKCFDNI+H   IS L S LG+
Subjt:  MYRSWIPKPNKPGELRAITQPNKAD----------------------TIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE

Query:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN
        EN AF DLI  FL+T I+D    +++N  KGIPQG SLSPVLMN +LH++D +++  M+ E  + Y RYADDMIF   +G +SEA Y R +   ++ LE 
Subjt:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN

Query:  LKQAETSIELIRGRPRKILVLGLVASISSLGIMSS
        LK  ++ IEL+R  P K  VLGL+  +   G + +
Subjt:  LKQAETSIELIRGRPRKILVLGLVASISSLGIMSS

KUM47405.1 cob intron3 ORF [Picea glauca]1.1e-3852.15Show/hide
Query:  GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQ
        GFRKGRGPIT F ++                          IS L   L + N         FL+T I+DK   +++N+TKGIPQG SLSPVLMNIFLHQ
Subjt:  GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQ

Query:  LDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGIMSS
        LD KI+ FM+ EE +GYVRYADDMIFAIK+G +SE VY R K+  +K L +LK AETSIELIRG PRK  VLGLV SI   G + +
Subjt:  LDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGIMSS

MBA0704412.1 hypothetical protein [Gossypium laxum]4.0e-4162.07Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKADTIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTA
        MYRSWIPKPNKPG+LRAITQPNK       KG            WG VD+LIKSDIV CFDNIDH L I+++QSYLG+EN+ F DLIL FL+T I+DK  
Subjt:  MYRSWIPKPNKPGELRAITQPNKADTIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTA

Query:  QNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYV
        ++  N  KGIPQG  LSPV+MNIFLHQLD++INSFM+ E+ V Y+
Subjt:  QNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYV

WP_165001732.1 reverse transcriptase/maturase family protein [Candidatus Frankia datiscae]4.4e-4046.56Show/hide
Query:  LRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLR
        +R +TQP   DTI                      GFR+GRGPITFF ++Q WG +DKL+K++I+KCFDN+DH + + +L+S+LG  N  F  LI  + +
Subjt:  LRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLR

Query:  TVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEG-VGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLK
        T+I+DK+ ++ +  TKGIPQGCSLSPVLMNIFLH+LD ++    +  EG +GYVRYADDMIF I+    SE VY   K   +K ++ +K
Subjt:  TVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEG-VGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLK

WP_206438078.1 reverse transcriptase/maturase family protein, partial [Candidatus Frankia datiscae]6.8e-4949.75Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE
        MYRSWIPKP KP E+R +TQP   DTI                      GFR+GRGPITFF ++Q WG +DKL+K+DI+KCFDN+DH + + +L+S+LG 
Subjt:  MYRSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE

Query:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEG-VGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLE
         N  F  LI  + +T+I+DK+ ++ +  TKGIPQGCSLSPVLMNIFLH+LD ++    +  EG +GYVRYADDMIF I+    SE VY   K   +K ++
Subjt:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEG-VGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLE

Query:  NLK
         +K
Subjt:  NLK

TrEMBL top hitse value%identityAlignment
A0A101LZF7 18S ribosomal RNA intron 1 ORF2.9e-2936.89Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE
        M R +IPKP+KPG+LR IT P+ +D I                      GFRK R   T F++V  WG +D+ + +DIVKCFDN++H   +  +QS++ +
Subjt:  MYRSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE

Query:  ENRAFLDLILVFLRTVIIDKTAQNFANY---TKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGG---------NSEAVYY
        +    ++LI  FL T I+D+   N+ NY   +KGI QGCSLSPVL+NI+LH  D+ ++ F      V Y RYADD++ A +  G         N + V+ 
Subjt:  ENRAFLDLILVFLRTVIIDKTAQNFANY---TKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGG---------NSEAVYY

Query:  RVKQLLKKT---LENLKQAETSIEL
         + +  K T   LE++K+ E  +++
Subjt:  RVKQLLKKT---LENLKQAETSIEL

A0A117NGV6 Cob intron3 ORF5.3e-3952.15Show/hide
Query:  GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQ
        GFRKGRGPIT F ++                          IS L   L + N         FL+T I+DK   +++N+TKGIPQG SLSPVLMNIFLHQ
Subjt:  GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQ

Query:  LDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGIMSS
        LD KI+ FM+ EE +GYVRYADDMIFAIK+G +SE VY R K+  +K L +LK AETSIELIRG PRK  VLGLV SI   G + +
Subjt:  LDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGIMSS

A0A1Y0AZM1 LtrA2.2e-2937.61Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKADTI---------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEE
        M+R WIPK       R ITQP+ +D +                     GFR  RG IT F  +  W  +  + +SDIV CFDNI H L +S LQS+LG +
Subjt:  MYRSWIPKPNKPGELRAITQPNKADTI---------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEE

Query:  NRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENL
        N   LDLI   L   I+D+   N+A+ +KGIPQG   SP+LMNI LH +D+++  F+   E + Y+RYADD++     G +++    R+    +K+L +L
Subjt:  NRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENL

Query:  KQAETSIELIR----GRPRKILVLGLVASISSLG
        K  E S ++ R    G P  I +LG + SI+  G
Subjt:  KQAETSIELIR----GRPRKILVLGLVASISSLG

A0A5N6L0M6 Reverse transcriptase domain-containing protein2.8e-5650.21Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKAD----------------------TIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE
        MYRSWIPKPNKP +LR ITQPN+ D                      + GFR+GRGPITFF E+ RWG +D+LIKSDIVKCFDNI+H   IS L S LG+
Subjt:  MYRSWIPKPNKPGELRAITQPNKAD----------------------TIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE

Query:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN
        EN AF DLI  FL+T I+D    +++N  KGIPQG SLSPVLMN +LH++D +++  M+ E  + Y RYADDMIF   +G +SEA Y R +   ++ LE 
Subjt:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN

Query:  LKQAETSIELIRGRPRKILVLGLVASISSLGIMSS
        LK  ++ IEL+R  P K  VLGL+  +   G + +
Subjt:  LKQAETSIELIRGRPRKILVLGLVASISSLGIMSS

A0A7J8YYK1 Reverse transcriptase domain-containing protein1.9e-4162.07Show/hide
Query:  MYRSWIPKPNKPGELRAITQPNKADTIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTA
        MYRSWIPKPNKPG+LRAITQPNK       KG            WG VD+LIKSDIV CFDNIDH L I+++QSYLG+EN+ F DLIL FL+T I+DK  
Subjt:  MYRSWIPKPNKPGELRAITQPNKADTIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTA

Query:  QNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYV
        ++  N  KGIPQG  LSPV+MNIFLHQLD++INSFM+ E+ V Y+
Subjt:  QNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYV

SwissProt top hitse value%identityAlignment
B1N1A3 Putative nicotine oxidoreductase5.9e-1136.84Show/hide
Query:  LRAITQPN-KADTIGFRKGRGPITFFLEV-QRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQ
        L AI +P    ++ GFR G+   T   +V + W  V  +I+ DI  CFDNI H   I  L+  + +E   F++LI   L     +  A  F + T G PQ
Subjt:  LRAITQPN-KADTIGFRKGRGPITFFLEV-QRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQ

Query:  GCSLSPVLMNIFLHQLDMKINSFM----QIEEG
        G  +SP+L N+FL QLD K+   +    Q EEG
Subjt:  GCSLSPVLMNIFLHQLDMKINSFM----QIEEG

P03876 Putative COX1/OXI3 intron 2 protein5.3e-1235.9Show/hide
Query:  GFRKGRGPITFFLEVQRW-GLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLH
        GFR     +T  ++ + +    +  IK D+ KCFD I H + I+VL   +  +++ F+DL+   LR   +DK   N+ N T GIPQG  +SP+L NIFL 
Subjt:  GFRKGRGPITFFLEVQRW-GLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLH

Query:  QLDMKINSFMQIEEGVG
        +LD  + +  + E   G
Subjt:  QLDMKINSFMQIEEGVG

P05511 Uncharacterized 91 kDa protein in cob intron3.3e-0629.66Show/hide
Query:  IKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDM
        I+ DI  CFD+I H   I++L S +  +++ F+ LI   L    +  T   +     G PQG  +SP+L NI+LHQLD  I +     +  G +      
Subjt:  IKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDM

Query:  IFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKI
          A KR   S  ++Y + +  ++  ++    + +IE+ R  P KI
Subjt:  IFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKI

P19593 Probable reverse transcriptase1.5e-0625.86Show/hide
Query:  LIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDM----KINSFMQIEEGVGYVR
        +++ DI  C DNI+H   IS +  ++ ++      ++  +L+   I++ +      T G+PQG  +SP++MN+ L  L+     KI       +G  Y R
Subjt:  LIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDM----KINSFMQIEEGVGYVR

Query:  YADDMIFAIKRGGNSEAVYYRVKQLL----------KKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGI
        YADDM+        +      VK+ L          K T++N+       E +  R RK+         S +GI
Subjt:  YADDMIFAIKRGGNSEAVYYRVKQLL----------KKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGI

P38478 Uncharacterized mitochondrial protein ymf405.0e-1028.04Show/hide
Query:  RSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE--
        R +IPK +  G+LR++  P+  D I                      GFR  R P T   +++RW     +I+ DI   FDNIDH L    L  ++ E  
Subjt:  RSWIPKPNKPGELRAITQPNKADTI----------------------GFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGE--

Query:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN
        +++  L L    +R   +++  +   +   G+PQG  LSP+L NI+LHQ D+    FM+  +    V+Y      +      +  +Y + +    K +++
Subjt:  ENRAFLDLILVFLRTVIIDKTAQNFANYTKGIPQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLEN

Query:  LKQAETSIELIRGR
        LK   +S E+IR R
Subjt:  LKQAETSIELIRGR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACCGCTCATGGATTCCCAAACCAAACAAACCTGGGGAGCTGCGAGCCATTACCCAGCCCAACAAGGCTGATACCATTGGGTTTAGGAAAGGTCGAGGTCCTATTAC
TTTCTTCCTTGAAGTCCAGCGCTGGGGTCTGGTTGATAAACTCATTAAGTCAGATATTGTAAAGTGTTTTGATAACATAGATCATGGGCTCAACATCTCAGTTCTTCAAT
CTTATTTAGGTGAGGAGAACCGTGCCTTCCTTGATCTTATTTTAGTGTTCTTACGAACAGTAATAATCGACAAAACTGCTCAAAATTTTGCAAACTACACGAAAGGCATA
CCGCAAGGCTGCTCCCTCTCCCCCGTGCTAATGAACATCTTTTTGCACCAACTCGATATGAAAATCAATTCCTTTATGCAAATCGAAGAGGGCGTAGGCTACGTTCGGTA
TGCTGATGACATGATTTTTGCTATCAAAAGGGGAGGAAATTCTGAGGCAGTATATTACAGGGTTAAACAGCTCCTCAAAAAGACCCTGGAAAACCTAAAACAGGCCGAAA
CCTCTATAGAACTAATCCGAGGAAGACCCCGAAAAATCCTAGTTTTAGGTCTGGTCGCATCAATCAGCTCGCTTGGGATCATGTCTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACCGCTCATGGATTCCCAAACCAAACAAACCTGGGGAGCTGCGAGCCATTACCCAGCCCAACAAGGCTGATACCATTGGGTTTAGGAAAGGTCGAGGTCCTATTAC
TTTCTTCCTTGAAGTCCAGCGCTGGGGTCTGGTTGATAAACTCATTAAGTCAGATATTGTAAAGTGTTTTGATAACATAGATCATGGGCTCAACATCTCAGTTCTTCAAT
CTTATTTAGGTGAGGAGAACCGTGCCTTCCTTGATCTTATTTTAGTGTTCTTACGAACAGTAATAATCGACAAAACTGCTCAAAATTTTGCAAACTACACGAAAGGCATA
CCGCAAGGCTGCTCCCTCTCCCCCGTGCTAATGAACATCTTTTTGCACCAACTCGATATGAAAATCAATTCCTTTATGCAAATCGAAGAGGGCGTAGGCTACGTTCGGTA
TGCTGATGACATGATTTTTGCTATCAAAAGGGGAGGAAATTCTGAGGCAGTATATTACAGGGTTAAACAGCTCCTCAAAAAGACCCTGGAAAACCTAAAACAGGCCGAAA
CCTCTATAGAACTAATCCGAGGAAGACCCCGAAAAATCCTAGTTTTAGGTCTGGTCGCATCAATCAGCTCGCTTGGGATCATGTCTTCTTAG
Protein sequenceShow/hide protein sequence
MYRSWIPKPNKPGELRAITQPNKADTIGFRKGRGPITFFLEVQRWGLVDKLIKSDIVKCFDNIDHGLNISVLQSYLGEENRAFLDLILVFLRTVIIDKTAQNFANYTKGI
PQGCSLSPVLMNIFLHQLDMKINSFMQIEEGVGYVRYADDMIFAIKRGGNSEAVYYRVKQLLKKTLENLKQAETSIELIRGRPRKILVLGLVASISSLGIMSS