CuGenDBv2

Sgr016852 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016852
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00153010:2014003..2015617
RNA-Seq Expression: Sgr016852
Synteny: Sgr016852
Gene Ontology terms:
GO:0016020 - membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits (e-value, % identity, alignment)
CAA2969676.1 Hypothetical predicted protein [Olea europaea subsp. europaea] (e-value: 3.4e-40, identity: 62.04%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G   + LLVYVDDI++TGPN+  I+ LK  L+S FKLKDLG+LKYFL +E+ARS  GI +SQRHYTLQ+LEDTG+L CK AN PMD    L     +LL+
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
        D   YRRL+GRLL+LTISR DIT+ +HKLSQF++ PR
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR

XP_022887164.1 uncharacterized protein LOC111403045 [Olea europaea var. sylvestris] (e-value: 3.3e-43, identity: 67.88%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G++ V LLVYVDDII+TGPN+ +I  LK  L+S FKLKDLG LKYFL LE+ARS  GIFLSQR+YTLQ+LED GFL CK AN+PM+   RL++  G+L+ 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
        D   YRRLVGRLL+LTISR DITFV+HKLSQFV+ PR
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR

XP_022888908.1 uncharacterized protein LOC111404314 [Olea europaea var. sylvestris] (e-value: 1.8e-41, identity: 64.96%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G++ V LLVYVDDII+TG N+  I+ LK++L+S FK KDLG LKYFL LE+A S KGI LSQRHYTLQ+LEDTGFL CK A +PMD  ++L +   +LL 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
        D   +RRL+GRLL+LTISR DITF +HKLSQFVA PR
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR

XP_024026696.1 uncharacterized protein LOC112093108 [Morus notabilis] (e-value: 7.3e-43, identity: 67.15%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G S + LLVYVDDII+TGPNL +++DLK  L+S FKLKDLG LK+FL +E+ARS  GI LSQRHY LQ+LED+GFL CK A VPMD+ V L A  G LLS
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
        D   YRRL+GRLL+LTI+R DITF +HKLSQF+A PR
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR

XP_024028503.1 uncharacterized protein LOC112093728 [Morus notabilis] (e-value: 3.4e-40, identity: 64.96%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        GS+ V LLVYVDDII+TGP+L ++  LK  L+S FKLKDLG LKYFL LE+ARS+ GI LSQR YTLQ+LEDTGFL C  A VPM+  V+L A++G++LS
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
        D   Y RL+GRLL+L +SR DITF +HKLSQF+A PR
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR

TrEMBL top hits (e-value, % identity, alignment)
A0A151TGV3 Copia protein (e-value: 4.0e-39, identity: 60%)
Query:  SSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSD
        S+ + +LVYVDDI++  PNL  I+ +K +L+ YFKLKDLGDLK+FL LEL++S   IF+ QRHYT  ILED G L CK + VPM+++++LHA +   LSD
Subjt:  SSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSD

Query:  PFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        P VYRRL+GRLL+LTISR DI++ IHKLSQFV+ P
Subjt:  PFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

A0A6J1DNK9 uncharacterized protein LOC111022877 isoform X1 (e-value: 2.4e-39, identity: 65.44%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G+S V LLV VDDIIVT  +  LI  LK  L + FKLKDLG L+YFL LEL RS+  IFLSQRHY LQ++EDTGFL  K   +PMD N++L AS+G+LL 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP VYRRL+GRLL+LTISR DITF +HKLSQF+A P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

A0A6J1DP23 uncharacterized protein LOC111022877 isoform X4 (e-value: 2.4e-39, identity: 65.44%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G+S V LLV VDDIIVT  +  LI  LK  L + FKLKDLG L+YFL LEL RS+  IFLSQRHY LQ++EDTGFL  K   +PMD N++L AS+G+LL 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP VYRRL+GRLL+LTISR DITF +HKLSQF+A P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

A0A6J1DQI6 uncharacterized protein LOC111022877 isoform X3 (e-value: 2.4e-39, identity: 65.44%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G+S V LLV VDDIIVT  +  LI  LK  L + FKLKDLG L+YFL LEL RS+  IFLSQRHY LQ++EDTGFL  K   +PMD N++L AS+G+LL 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP VYRRL+GRLL+LTISR DITF +HKLSQF+A P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

A0A6J1DSZ9 uncharacterized protein LOC111022877 isoform X2 (e-value: 2.4e-39, identity: 65.44%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G+S V LLV VDDIIVT  +  LI  LK  L + FKLKDLG L+YFL LEL RS+  IFLSQRHY LQ++EDTGFL  K   +PMD N++L AS+G+LL 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP VYRRL+GRLL+LTISR DITF +HKLSQF+A P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 2.7e-08, identity: 27.5%)
Query:  SKPPSYLQDFHCCGSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELA--RSAKGIFLSQRHYTLQILEDTGFLFCKQANVPM
        S P  Y + F    ++ + LL+YVDD+++ G +  LI+ LK  L   F +KDLG  +  L +++   R+++ ++LSQ  Y  ++LE       K  + P+
Subjt:  SKPPSYLQDFHCCGSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELA--RSAKGIFLSQRHYTLQILEDTGFLFCKQANVPM

Query:  DSNVRLH--------ASNGELLSDPFVYRRLVGRLLH-LTISRSDITFVIHKLSQFVATP
          +++L            G +   P  Y   VG L++ +  +R DI   +  +S+F+  P
Subjt:  DSNVRLH--------ASNGELLSDPFVYRRLVGRLLH-LTISRSDITFVIHKLSQFVATP

P25600 Putative transposon Ty5-1 protein YCL074W (e-value: 1.6e-08, identity: 28.67%)
Query:  FHCCGSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKG-IFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASN
        F       + + VYVDD++V  P+  +   +K  L   + +KDLG +  FL L + +S+ G I LS + Y  +   ++     K    P+ ++  L  + 
Subjt:  FHCCGSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKG-IFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASN

Query:  GELLSDPFVYRRLVGRLLH-LTISRSDITFVIHKLSQFVATPR
           L D   Y+ +VG+LL      R DI++ +  LS+F+  PR
Subjt:  GELLSDPFVYRRLVGRLLH-LTISRSDITFVIHKLSQFVATPR

P92519 Uncharacterized mitochondrial protein AtMg00810 (e-value: 7.1e-17, identity: 37.69%)
Query:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR
        LL+YVDDI++TG +  L++ L   L S F +KDLG + YFL +++     G+FLSQ  Y  QIL + G L CK  + P+   +    S  +   DP  +R
Subjt:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR

Query:  RLVGRLLHLTISRSDITFVIHKLSQFVATP
         +VG L +LT++R DI++ ++ + Q +  P
Subjt:  RLVGRLLHLTISRSDITFVIHKLSQFVATP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 1.1e-20, identity: 39.71%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G S+V +LVYVDDI++TG +  L+ +    L   F +KD  +L YFL +E  R   G+ LSQR Y L +L  T  +  K    PM  + +L   +G  L+
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP  YR +VG L +L  +R DI++ +++LSQF+  P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 2.8e-21, identity: 38.97%)
Query:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS
        G S++ +LVYVDDI++TG +  L+      L   F +K+  DL YFL +E  R  +G+ LSQR YTL +L  T  L  K    PM ++ +L   +G  L 
Subjt:  GSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLS

Query:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP
        DP  YR +VG L +L  +R D+++ +++LSQ++  P
Subjt:  DPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATP

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 3.6e-32, identity: 54.2%)
Query:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR
        +LVYVDDII+   N   + +LKS L S FKL+DLG LKYFL LE+ARSA GI + QR Y L +L++TG L CK ++VPMD +V   A +G    D   YR
Subjt:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR

Query:  RLVGRLLHLTISRSDITFVIHKLSQFVATPR
        RL+GRL++L I+R DI+F ++KLSQF   PR
Subjt:  RLVGRLLHLTISRSDITFVIHKLSQFVATPR

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value: 5.1e-18, identity: 37.69%)
Query:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR
        LL+YVDDI++TG +  L++ L   L S F +KDLG + YFL +++     G+FLSQ  Y  QIL + G L CK  + P+   +    S  +   DP  +R
Subjt:  LLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQANVPMDSNVRLHASNGELLSDPFVYR

Query:  RLVGRLLHLTISRSDITFVIHKLSQFVATP
         +VG L +LT++R DI++ ++ + Q +  P
Subjt:  RLVGRLLHLTISRSDITFVIHKLSQFVATP


Sequences
CDS sequence
ATGTTGACAACACAACGATCTAAATTTGATTCAAGAGCCATTGCTGCTGTTTTTATGGGGTATCCATCAGCTTCTTCTGATGAGATGACTAATTTATTTTCGAATTTGGT
TCTTCCAAAGGCACTTGATTCCCACTCACCAATTGACACTTCTGATGCTTCGGTGCCAATTTCTTCTAATCTTGTGCATACTGATCCAAGCAGTATATCAGTTGATGTTC
CTATTTCAACAATCTCAAACATTTTTGTGGTTCCTAGTTTGACTTTTATTTCAACAAAATCTAATATTCCCTCAGCCATTGTTGATCATGCTAGTGTTGATGGTACTTCT
GCAGAAATGGGTCTTCCTTCCATTGCACCTTGTCGATCAATTCGACAGTCCAAACCTCCATCTTACTTGCAAGATTTTCATTGTTGTGGCTCTTCTTTGGTGACACTCCT
TGTGTACGTTGATGACATAATTGTGACTGGCCCTAATCTTTTCTTGATTAGTGATTTGAAGTCTATTCTTTATAGTTATTTCAAGCTAAAGGATCTTGGTGACCTCAAAT
ATTTTTTAGAACTTGAATTGGCTCGTTCAGCTAAAGGCATATTTCTTTCACAGAGACATTATACTTTACAAATTTTAGAAGATACTGGGTTTCTGTTTTGTAAGCAAGCT
AATGTTCCAATGGACTCTAATGTTCGTCTACATGCTTCTAATGGTGAACTCTTGAGTGATCCCTTTGTTTATCGTAGGCTTGTGGGGCGTCTTTTACATTTGACTATCTC
TCGCTCGGATATCACCTTTGTGATTCACAAGCTTAGTCAGTTTGTTGCAACACCTCGTTAG
mRNA sequence
ATGTTGACAACACAACGATCTAAATTTGATTCAAGAGCCATTGCTGCTGTTTTTATGGGGTATCCATCAGCTTCTTCTGATGAGATGACTAATTTATTTTCGAATTTGGT
TCTTCCAAAGGCACTTGATTCCCACTCACCAATTGACACTTCTGATGCTTCGGTGCCAATTTCTTCTAATCTTGTGCATACTGATCCAAGCAGTATATCAGTTGATGTTC
CTATTTCAACAATCTCAAACATTTTTGTGGTTCCTAGTTTGACTTTTATTTCAACAAAATCTAATATTCCCTCAGCCATTGTTGATCATGCTAGTGTTGATGGTACTTCT
GCAGAAATGGGTCTTCCTTCCATTGCACCTTGTCGATCAATTCGACAGTCCAAACCTCCATCTTACTTGCAAGATTTTCATTGTTGTGGCTCTTCTTTGGTGACACTCCT
TGTGTACGTTGATGACATAATTGTGACTGGCCCTAATCTTTTCTTGATTAGTGATTTGAAGTCTATTCTTTATAGTTATTTCAAGCTAAAGGATCTTGGTGACCTCAAAT
ATTTTTTAGAACTTGAATTGGCTCGTTCAGCTAAAGGCATATTTCTTTCACAGAGACATTATACTTTACAAATTTTAGAAGATACTGGGTTTCTGTTTTGTAAGCAAGCT
AATGTTCCAATGGACTCTAATGTTCGTCTACATGCTTCTAATGGTGAACTCTTGAGTGATCCCTTTGTTTATCGTAGGCTTGTGGGGCGTCTTTTACATTTGACTATCTC
TCGCTCGGATATCACCTTTGTGATTCACAAGCTTAGTCAGTTTGTTGCAACACCTCGTTAG
Protein sequence
MLTTQRSKFDSRAIAAVFMGYPSASSDEMTNLFSNLVLPKALDSHSPIDTSDASVPISSNLVHTDPSSISVDVPISTISNIFVVPSLTFISTKSNIPSAIVDHASVDGTS
AEMGLPSIAPCRSIRQSKPPSYLQDFHCCGSSLVTLLVYVDDIIVTGPNLFLISDLKSILYSYFKLKDLGDLKYFLELELARSAKGIFLSQRHYTLQILEDTGFLFCKQA
NVPMDSNVRLHASNGELLSDPFVYRRLVGRLLHLTISRSDITFVIHKLSQFVATPR
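
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal Python sketch, not part of CuGenDB: the function name `translate` is hypothetical, and it assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene. Only the first 24 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper for illustration; raises KeyError on codons that
    contain characters other than A, C, G, T.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS listed in this record.
cds_start = "ATGTTGACAACACAACGATCTAAA"
cds_end = "GTTGCAACACCTCGTTAG"

print(translate(cds_start))  # expected to match the protein's first 8 residues
print(translate(cds_end))    # expected to match the protein's last 5 residues
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence above is a simple way to confirm that the two entries are consistent.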