CuGenDBv2

Sgr016786 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016786
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: tig00153010:674500..675000
RNA-Seq Expression: Sgr016786
Synteny: Sgr016786
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
PON78325.1 Endonuclease/exonuclease/phosphatase [Parasponia andersonii]; e value: 7.8e-16; % identity: 36.55
Query:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSG-SKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVR--FEESWTR
        M  F+ ++D C  ID  FKGD +TW+    G S + ERLD  L +LN      + +VHHL    SDH P+L    N        ++KR  R  FEE W  
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSG-SKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVR--FEESWTR

Query:  FEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLG
          +C S+IE+ W R     +T   +NKK+ NC  +L  W++ + G
Subjt:  FEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLG

XP_018859783.1 uncharacterized protein LOC109021580 [Juglans regia]; e value: 9.2e-17; % identity: 35.19
Query:  MEDFRAMIDSCELIDPSF-KGDNFTWYRNLSGSK-IWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRF
        ME FRA +  C L D  F +G  FTW     G   IWERLD  L S + +++  +F V H     SDH PI   W N      Y  +K++ RFE  W   
Subjt:  MEDFRAMIDSCELIDPSF-KGDNFTWYRNLSGSK-IWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRF

Query:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQ
        E+C+++I++ W+  +G G+    + +++  C ++L NWNR R  G V   + K +  +Q LQ
Subjt:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQ

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]; e value: 9.8e-27; % identity: 42.6
Query:  MEDFRAMIDSCELIDPSFKGDNFTW-----YRNLSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDG-ATINYKNQKRLVRFEES
        M++F+  +D C L+DP F GD FTW     YR      IWERLD +L++  + Q   +  + HL+FLASDHRPILA W   G AT+  +  +R  RFEE 
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTW-----YRNLSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDG-ATINYKNQKRLVRFEES

Query:  WTRFEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSG
        W  F++CK ++   W   QG          KI++C+  L  WN  RLGGS+RGAI +KE +IQ + + G
Subjt:  WTRFEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSG

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]; e value: 2.6e-19; % identity: 38.35
Query:  LSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRFEDCKSLIEESWKRYQGSGSTSCRINKKISN
        + G  IWERLD +L++ +M+    +  V HL+ L+SDHRPILA WD +       +++R +RFEESW + + C+ +I  +W    G G  +     KI +
Subjt:  LSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRFEDCKSLIEESWKRYQGSGSTSCRINKKISN

Query:  CITRLHNWNRRRLGGSVRGAIGKKERDIQELQR
        C++RL+ WN+ RL  S++GAI  KE++++ L++
Subjt:  CITRLHNWNRRRLGGSVRGAIGKKERDIQELQR

XP_030487129.1 uncharacterized protein LOC115704047 [Cannabis sativa]; e value: 1.1e-14; % identity: 33.33
Query:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLS-GSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATI-NYKNQKRLVRFEESWTRF
        ++ FR  +D C L +  F+G+ FTW+ N S G+ + ERLD   ++     S  S ++ HLDF ASDHR +LA  D   + I   +  K   RFE+ W + 
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLS-GSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATI-NYKNQKRLVRFEESWTRF

Query:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSGHQD
        +DC  +I + W     S  T+  ++  IS+C + L +W+R +  G +   I      ++ LQ S H D
Subjt:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSGHQD

TrEMBL top hits (e value, % identity, alignment)
A0A2N9I0P4 Reverse transcriptase domain-containing protein; e value: 3.4e-17; % identity: 37.13
Query:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSGS-KIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGAT-INYKNQKRLVRFEESWTRF
        M+DFR  IDSC  ID  F G+ FTW  N  GS  IWERLD  L +   M    +  +HH+D   SDH P+   W N  AT   + +Q+R  RFEE W   
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSGS-KIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGAT-INYKNQKRLVRFEESWTRF

Query:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKE--RDIQELQRSG
          C+ L+ ++WK      S +  ++ KI NC   L +W++       R  I KK   ++++ L R G
Subjt:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKE--RDIQELQRSG

A0A2N9IMU2 Reverse transcriptase domain-containing protein; e value: 3.4e-17; % identity: 37.13
Query:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSGS-KIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGAT-INYKNQKRLVRFEESWTRF
        M+DFR  IDSC  ID  F G+ FTW  N  GS  IWERLD  L +   M    +  +HH+D   SDH P+   W N  AT   + +Q+R  RFEE W   
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSGS-KIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGAT-INYKNQKRLVRFEESWTRF

Query:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKE--RDIQELQRSG
          C+ L+ ++WK      S +  ++ KI NC   L +W++       R  I KK   ++++ L R G
Subjt:  EDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKE--RDIQELQRSG

A0A2P5DYE7 Endonuclease/exonuclease/phosphatase; e value: 3.8e-16; % identity: 36.55
Query:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSG-SKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVR--FEESWTR
        M  F+ ++D C  ID  FKGD +TW+    G S + ERLD  L +LN      + +VHHL    SDH P+L    N        ++KR  R  FEE W  
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTWYRNLSG-SKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVR--FEESWTR

Query:  FEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLG
          +C S+IE+ W R     +T   +NKK+ NC  +L  W++ + G
Subjt:  FEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLG

A0A6J1DRA0 uncharacterized protein LOC111022423; e value: 4.7e-27; % identity: 42.6
Query:  MEDFRAMIDSCELIDPSFKGDNFTW-----YRNLSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDG-ATINYKNQKRLVRFEES
        M++F+  +D C L+DP F GD FTW     YR      IWERLD +L++  + Q   +  + HL+FLASDHRPILA W   G AT+  +  +R  RFEE 
Subjt:  MEDFRAMIDSCELIDPSFKGDNFTW-----YRNLSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDG-ATINYKNQKRLVRFEES

Query:  WTRFEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSG
        W  F++CK ++   W   QG          KI++C+  L  WN  RLGGS+RGAI +KE +IQ + + G
Subjt:  WTRFEDCKSLIEESWKRYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSG

A0A6J1DUG8 uncharacterized protein LOC111024135; e value: 1.3e-19; % identity: 38.35
Query:  LSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRFEDCKSLIEESWKRYQGSGSTSCRINKKISN
        + G  IWERLD +L++ +M+    +  V HL+ L+SDHRPILA WD +       +++R +RFEESW + + C+ +I  +W    G G  +     KI +
Subjt:  LSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRFEDCKSLIEESWKRYQGSGSTSCRINKKISN

Query:  CITRLHNWNRRRLGGSVRGAIGKKERDIQELQR
        C++RL+ WN+ RL  S++GAI  KE++++ L++
Subjt:  CITRLHNWNRRRLGGSVRGAIGKKERDIQELQR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGAAGACTTCAGAGCTATGATAGATTCATGTGAATTGATTGATCCGAGTTTCAAAGGAGATAATTTTACTTGGTACAGGAATTTATCGGGCAGCAAGATATGGGAAAG
ACTGGATCTGTATCTATTGAGTTTAAACATGATGCAGAGTAGTCATAGCTTTATGGTACACCATCTAGATTTCTTAGCATCCGACCATAGACCTATCCTTGCAGTATGGG
ATAATGATGGTGCAACCATAAACTACAAGAACCAAAAAAGACTGGTAAGATTTGAGGAAAGTTGGACACGTTTTGAAGATTGCAAGTCATTGATAGAAGAAAGCTGGAAA
AGATATCAGGGTAGTGGGTCGACAAGTTGCAGGATCAATAAAAAGATATCCAACTGTATCACGAGGCTGCATAATTGGAACAGAAGGAGACTAGGGGGATCGGTACGTGG
TGCAATAGGGAAAAAAGAAAGAGACATACAAGAGCTGCAAAGAAGTGGACATCAGGATTAG
mRNA sequence
ATGGAAGACTTCAGAGCTATGATAGATTCATGTGAATTGATTGATCCGAGTTTCAAAGGAGATAATTTTACTTGGTACAGGAATTTATCGGGCAGCAAGATATGGGAAAG
ACTGGATCTGTATCTATTGAGTTTAAACATGATGCAGAGTAGTCATAGCTTTATGGTACACCATCTAGATTTCTTAGCATCCGACCATAGACCTATCCTTGCAGTATGGG
ATAATGATGGTGCAACCATAAACTACAAGAACCAAAAAAGACTGGTAAGATTTGAGGAAAGTTGGACACGTTTTGAAGATTGCAAGTCATTGATAGAAGAAAGCTGGAAA
AGATATCAGGGTAGTGGGTCGACAAGTTGCAGGATCAATAAAAAGATATCCAACTGTATCACGAGGCTGCATAATTGGAACAGAAGGAGACTAGGGGGATCGGTACGTGG
TGCAATAGGGAAAAAAGAAAGAGACATACAAGAGCTGCAAAGAAGTGGACATCAGGATTAG
Protein sequence
MEDFRAMIDSCELIDPSFKGDNFTWYRNLSGSKIWERLDLYLLSLNMMQSSHSFMVHHLDFLASDHRPILAVWDNDGATINYKNQKRLVRFEESWTRFEDCKSLIEESWK
RYQGSGSTSCRINKKISNCITRLHNWNRRRLGGSVRGAIGKKERDIQELQRSGHQD
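
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the helper `CODON_TABLE` are illustrative and not part of CuGenDBv2. It translates the first and last codons of the CDS listed above and compares them with the ends of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: this gene uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS in this record.
cds_start = "ATGGAAGACTTCAGAGCTATG"
cds_end = "CATCAGGATTAG"

print(translate(cds_start))  # start of the protein sequence
print(translate(cds_end))    # end of the protein sequence, before the stop

# The genome location 674500..675000 is inclusive, so it spans 501 bp,
# which equals 166 residues plus one stop codon, times three.
span = 675000 - 674500 + 1
print(span, 3 * (166 + 1))
```

Passing the full CDS (the five lines joined without line breaks) to `translate` should give the full protein sequence in the same way.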