; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003681 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003681
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationtig00002361:30112..30738
RNA-Seq ExpressionSgr003681
SyntenySgr003681
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU51479.1 hypothetical protein TSUD_413680 [Trifolium subterraneum]1.1e-1330.61Show/hide
Query:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------NTNQLENEK------VNGQS
        + R+ + +C+L D GY+G  +T TNRH    L+  RLDRFL   D +    + +  HL   KSDH PIL  +        N NQ   +K       +   
Subjt:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------NTNQLENEK------VNGQS

Query:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWL
          ++   W  +QG+       K+  ++  L+ W R+   G +   I +  +++  L++ +  N I   +K  E+EL+ LLE++E++W QRSR  WL
Subjt:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWL

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]2.8e-2537.7Show/hide
Query:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------------NTNQLENEKVNG
        M+  +D +D C L+DPG+ GD FT  + H     +WERLDRFL+N  +      + + HL F+ SDHRPILA W                ++ E +  + 
Subjt:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------------NTNQLENEKVNG

Query:  QS-KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLKAEQELEKLLEEDEIYWKQ
        Q  K ++ R W  +       FQ KI   +++L KWN  R+ GSL+GAI +KE EI+++ K+    W     +A+++LEKLLEE+E YW+Q
Subjt:  QS-KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLKAEQELEKLLEEDEIYWKQ

XP_023877776.1 uncharacterized protein LOC111990221 [Quercus suber]4.8e-1736.18Show/hide
Query:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLENEKVNGQ-------------
        M+A RDV+D+C  +D GY G DFT   +   G LVWERLDR + N+D L R     V+HL+   SDHRPIL   N N  E+ + N +             
Subjt:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLENEKVNGQ-------------

Query:  SKNVIARGWNYR-QGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNW-IQKWLKAEQELEKLLEEDEIYWKQRSRKDWLK
          N ++R W+++ +G   +    K++   + L  W+RQ   G++K  I K ++ + K E E  R    Q+  + + EL KL E++E  W QRSR  W+K
Subjt:  SKNVIARGWNYR-QGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNW-IQKWLKAEQELEKLLEEDEIYWKQRSRKDWLK

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]7.7e-1531.55Show/hide
Query:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW-NTNQLENEKVNGQ--------------
        A R+ +++CNL+D G RG  FT +NR F   L+ ERLDRFL + D      ++IV +L    SDH P++      N+    K N                
Subjt:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW-NTNQLENEKVNGQ--------------

Query:  SKNVIARGW----NYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQK-----WLKAEQELEKLLEEDEIYWKQRS
         KN++   W    ++ QG+    F++     +  L  W+R   +G       +K K+++K   E + N+ Q+         E+++EK+L ++E+YWKQRS
Subjt:  SKNVIARGW----NYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQK-----WLKAEQELEKLLEEDEIYWKQRS

Query:  RKDWLK
        R DWLK
Subjt:  RKDWLK

XP_030961642.1 uncharacterized protein LOC115983160 [Quercus lobata]2.5e-1330.93Show/hide
Query:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLENEK-------------VNGQ
        M+A RDV+D+C  +D G+ G +FT  +R + G+L+WERLDR + N+D L +    +V HL    SDHRPI   ++ N  E+++              +  
Subjt:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLENEK-------------VNGQ

Query:  SKNVIARGWNYRQ-GNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNW-IQKWLKAEQELEKLLEEDEIYWKQRSR
          + + R W  +Q GN   +  +K++   + L  W++    GS+K  +AK ++ + K E+E          +   +EL  LLE++   W+QR+R
Subjt:  SKNVIARGWNYRQ-GNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNW-IQKWLKAEQELEKLLEEDEIYWKQRSR

TrEMBL top hitse value%identityAlignment
A0A2N9FMJ0 Reverse transcriptase domain-containing protein7.0e-1431.98Show/hide
Query:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWN---TNQLENEKVNGQS--------K
        M+A RD ID+CNL+D GY G  FT  N     +  W RLDR L N D L +  H I+EH+    SDH+ +L  W    T+  + +    +         +
Subjt:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWN---TNQLENEKVNGQS--------K

Query:  NVIARGWNYRQGNTAN-EFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWLK
          I   W  R   TA  +   K+    + L  W+R R  G++   +A+K++ ++  E E  R+     +K  + E+  LL ++E  W+QRSR +WL+
Subjt:  NVIARGWNYRQGNTAN-EFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWLK

A0A2Z6MYR9 Reverse transcriptase domain-containing protein1.6e-1329.08Show/hide
Query:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPIL---AHWNTNQLENEK-----------VNGQS
        + R+ + +C+L D GY+G+ +T TNRH    L+  RLDRFL   + +    +   +HL   KSDH P+L   +H N N+  N +            N   
Subjt:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPIL---AHWNTNQLENEK-----------VNGQS

Query:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEK-EKERNWIQKWLKAEQELEKLLEEDEIYWKQRSRKDWL
         +++   W    G+     ++K+   +  L+ W R+   G +   I + ++++  L++ +  +N   +  + EQEL+ LLE++E++W QRSR  WL
Subjt:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEK-EKERNWIQKWLKAEQELEKLLEEDEIYWKQRSRKDWL

A0A2Z6PLH8 Uncharacterized protein2.7e-1329.74Show/hide
Query:  RDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLEN-----------EKV---NGQSKN
        R  +++C+L D GY+GD +T  N+     L+ ERLDRFL N + +    +    HL   KSDH PIL  +NT+  +N           E++   +    +
Subjt:  RDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLEN-----------EKV---NGQSKN

Query:  VIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKL-EKEKERNWIQKWLKAEQELEKLLEEDEIYWKQRSRKDWLK
        ++   W   +G+  +    KI  ++  L++W  +R  G +   I   +++++KL E+   ++ +++    E+EL+ +LE +E++WKQRSR  WL+
Subjt:  VIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKL-EKEKERNWIQKWLKAEQELEKLLEEDEIYWKQRSRKDWLK

A0A2Z6PUI4 Uncharacterized protein5.4e-1430.61Show/hide
Query:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------NTNQLENEK------VNGQS
        + R+ + +C+L D GY+G  +T TNRH    L+  RLDRFL   D +    + +  HL   KSDH PIL  +        N NQ   +K       +   
Subjt:  ALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------NTNQLENEK------VNGQS

Query:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWL
          ++   W  +QG+       K+  ++  L+ W R+   G +   I +  +++  L++ +  N I   +K  E+EL+ LLE++E++W QRSR  WL
Subjt:  KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLK-AEQELEKLLEEDEIYWKQRSRKDWL

A0A6J1DRA0 uncharacterized protein LOC1110224231.4e-2537.7Show/hide
Query:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------------NTNQLENEKVNG
        M+  +D +D C L+DPG+ GD FT  + H     +WERLDRFL+N  +      + + HL F+ SDHRPILA W                ++ E +  + 
Subjt:  MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHW--------------NTNQLENEKVNG

Query:  QS-KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLKAEQELEKLLEEDEIYWKQ
        Q  K ++ R W  +       FQ KI   +++L KWN  R+ GSL+GAI +KE EI+++ K+    W     +A+++LEKLLEE+E YW+Q
Subjt:  QS-KNVIARGWNYRQGNTANEFQRKIEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLKAEQELEKLLEEDEIYWKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCCTTAAGAGATGTGATAGATAATTGTAACCTCATGGACCCGGGTTATAGAGGAGATGATTTCACTAGGACCAACAGACACTTTACAGGTTACCTTGTTTGGGA
AAGACTTGATAGATTTTTAATGAATTTCGATATGTTGTGCAGGTGTGGTCATATTATCGTGGAGCACTTGAGGTTTATGAAGTCTGATCACAGACCGATACTGGCACATT
GGAATACAAACCAGCTGGAAAATGAGAAAGTTAACGGCCAGAGCAAAAACGTGATTGCAAGGGGTTGGAATTACAGGCAAGGAAACACAGCAAATGAATTTCAAAGGAAA
ATTGAAGGAAGTATCCAAGATCTCTACAAGTGGAATAGACAAAGGATTGAAGGCTCACTAAAAGGTGCAATAGCAAAGAAGGAAAAGGAGATTAGAAAATTGGAGAAGGA
AAAAGAAAGAAATTGGATTCAAAAGTGGCTAAAGGCTGAACAGGAACTGGAGAAGCTACTAGAAGAAGATGAGATCTATTGGAAGCAACGTTCACGAAAAGACTGGCTCA
AGTGGGCGATAGAAATACAAATTGGTTTCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCCTTAAGAGATGTGATAGATAATTGTAACCTCATGGACCCGGGTTATAGAGGAGATGATTTCACTAGGACCAACAGACACTTTACAGGTTACCTTGTTTGGGA
AAGACTTGATAGATTTTTAATGAATTTCGATATGTTGTGCAGGTGTGGTCATATTATCGTGGAGCACTTGAGGTTTATGAAGTCTGATCACAGACCGATACTGGCACATT
GGAATACAAACCAGCTGGAAAATGAGAAAGTTAACGGCCAGAGCAAAAACGTGATTGCAAGGGGTTGGAATTACAGGCAAGGAAACACAGCAAATGAATTTCAAAGGAAA
ATTGAAGGAAGTATCCAAGATCTCTACAAGTGGAATAGACAAAGGATTGAAGGCTCACTAAAAGGTGCAATAGCAAAGAAGGAAAAGGAGATTAGAAAATTGGAGAAGGA
AAAAGAAAGAAATTGGATTCAAAAGTGGCTAAAGGCTGAACAGGAACTGGAGAAGCTACTAGAAGAAGATGAGATCTATTGGAAGCAACGTTCACGAAAAGACTGGCTCA
AGTGGGCGATAGAAATACAAATTGGTTTCATATGA
Protein sequenceShow/hide protein sequence
MKALRDVIDNCNLMDPGYRGDDFTRTNRHFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLRFMKSDHRPILAHWNTNQLENEKVNGQSKNVIARGWNYRQGNTANEFQRK
IEGSIQDLYKWNRQRIEGSLKGAIAKKEKEIRKLEKEKERNWIQKWLKAEQELEKLLEEDEIYWKQRSRKDWLKWAIEIQIGFI