CuGenDBv2

Sgr020066 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020066
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retroelement pol polyprotein-like
Genome location: tig00153446:1837818..1838372
RNA-Seq Expression: Sgr020066
Synteny: Sgr020066
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
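
The genome location in this record follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span length can be derived as shown here. The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153446:1837818..1838372")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span_length = end - start + 1
print(contig, span_length)
```

Under that assumption the span is 555 bp, which is consistent with a 184-residue protein plus a stop codon (185 codons of 3 nt each).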


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031937.1 retroelement pol polyprotein-like [Cucumis melo var. makuwa] (e value: 2.2e-59; % identity: 66.27)
Query:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF
        G+SYFLTIVDD TRYTWV+ML+ KSDV+SIIPQFFKL+ETQY K+IK   SDNAP+L F  FF  KGV+HQ+S +   ++NSVVERKHQHILN ARALYF
Subjt:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF

Query:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKF
        QS+V L+FWG+CILT +Y+INRTPS +L W++PF+ L  ++ DY+ L+VFG LC+AS+LP++ SKF
Subjt:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKF

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] (e value: 1.6e-62; % identity: 64.61)
Query:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ
        +SYFLTIVDD TRYTWV+ML+ KSDV+SIIP FFKL+ETQY K+IK   SDNA KL F  FF  KGV+HQ+S +   ++NSVVE+KHQHILN ARALYFQ
Subjt:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ

Query:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        S+V L+FWG+CI+T VY+I+RTPS +L W+ PF+ L   + DY+ L+VFG LC+AS+LPH+ SKF PRA P++F+GYP
Subjt:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

TYK16758.1 Copia protein [Cucumis melo var. makuwa] (e value: 1.4e-63; % identity: 64.8)
Query:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF
        G+SYFLTIVDD TRYTWV+ML+ KSDV+SIIPQFFKL+ETQY K+IK   SDNAP+L F  FF  KGV+HQ+S +   ++NSVVERKHQHILN ARALYF
Subjt:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF

Query:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        QS+V L+FWG+CILT +Y+INRTPS +L W++ F+ L  ++ DY+ L+VFG LC+AS+LP++ SKF  RA P++F+GYP
Subjt:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

TYK18103.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] (e value: 1.6e-62; % identity: 64.61)
Query:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ
        +SYFLTIVDD TRYTWV+ML+ KSDV+SIIP FFKL+ETQY K+IK   SDNA KL F  FF  KGV+HQ+S +   ++NSVVE+KHQHILN ARALYFQ
Subjt:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ

Query:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        S+V L+FWG+CI+T VY+I+RTPS +L W+ PF+ L   + DY+ L+VFG LC+AS+LPH+ SKF PRA P++F+GYP
Subjt:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida] (e value: 2.2e-67; % identity: 67.4)
Query:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARAL
        HVG  +FLTIVDD +R+TWV+ML+ KS VL IIPQFF  VETQY K IK+F SDNAP+LSF +FF  +GV+HQ+S +GR E+NSVVERKHQH+LNV+RAL
Subjt:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARAL

Query:  YFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        +FQSR  L FW EC+LT VY+INRT S +L W+TP++LL G++ADYSL+R F CLCFASTL H  SKFSPR  PA F+GYP
Subjt:  YFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

TrEMBL top hits (e value, % identity, alignment)
A0A438HDI8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.6e-55; % identity: 59.12)
Query:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARAL
        H GF YFLTIVDDCTR TWV++LR KSDV +I PQFF +V+T++  +IK   SDNAP+L+  + F    V+H FS +   ++NSVVERKHQHILNVARAL
Subjt:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARAL

Query:  YFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        YFQS + + +WG+C+LT VY+INR PS +L+ +TPFELL      YS L+ FGCLC++STLP    KFSPRA P +F+GYP
Subjt:  YFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

A0A5A7SRC2 Retroelement pol polyprotein-like (e value: 1.0e-59; % identity: 66.27)
Query:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF
        G+SYFLTIVDD TRYTWV+ML+ KSDV+SIIPQFFKL+ETQY K+IK   SDNAP+L F  FF  KGV+HQ+S +   ++NSVVERKHQHILN ARALYF
Subjt:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF

Query:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKF
        QS+V L+FWG+CILT +Y+INRTPS +L W++PF+ L  ++ DY+ L+VFG LC+AS+LP++ SKF
Subjt:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKF

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 7.8e-63; % identity: 64.61)
Query:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ
        +SYFLTIVDD TRYTWV+ML+ KSDV+SIIP FFKL+ETQY K+IK   SDNA KL F  FF  KGV+HQ+S +   ++NSVVE+KHQHILN ARALYFQ
Subjt:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ

Query:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        S+V L+FWG+CI+T VY+I+RTPS +L W+ PF+ L   + DY+ L+VFG LC+AS+LPH+ SKF PRA P++F+GYP
Subjt:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

A0A5D3CZP1 Copia protein (e value: 7.0e-64; % identity: 64.8)
Query:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF
        G+SYFLTIVDD TRYTWV+ML+ KSDV+SIIPQFFKL+ETQY K+IK   SDNAP+L F  FF  KGV+HQ+S +   ++NSVVERKHQHILN ARALYF
Subjt:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF

Query:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        QS+V L+FWG+CILT +Y+INRTPS +L W++ F+ L  ++ DY+ L+VFG LC+AS+LP++ SKF  RA P++F+GYP
Subjt:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 7.8e-63; % identity: 64.61)
Query:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ
        +SYFLTIVDD TRYTWV+ML+ KSDV+SIIP FFKL+ETQY K+IK   SDNA KL F  FF  KGV+HQ+S +   ++NSVVE+KHQHILN ARALYFQ
Subjt:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQ

Query:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP
        S+V L+FWG+CI+T VY+I+RTPS +L W+ PF+ L   + DY+ L+VFG LC+AS+LPH+ SKF PRA P++F+GYP
Subjt:  SRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.1e-16; % identity: 29.83)
Query:  SYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDD---FFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALY
        +YF+  VD  T Y   Y+++ KSDV S+   F    E  +   +   + DN  +   ++   F   KG+ +  +     + N V ER  + I   AR + 
Subjt:  SYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDD---FFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALY

Query:  FQSRVSLSFWGECILTVVYIINRTPSHVL--HWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGY
          +++  SFWGE +LT  Y+INR PS  L    +TP+E+          LRVFG   +   + +   KF  ++  ++FVGY
Subjt:  FQSRVSLSFWGECILTVVYIINRTPSHVL--HWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.9e-27; % identity: 33.15)
Query:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLS---FDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA
        G  YF+T +DD +R  WVY+L+ K  V  +  +F  LVE +  + +K   SDN  + +   F+++  + G+ H+ +  G  + N V ER ++ I+   R+
Subjt:  GFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLS---FDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA

Query:  LYFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGY
        +   +++  SFWGE + T  Y+INR+PS  L +  P  + T     YS L+VFGC  FA       +K   ++ P +F+GY
Subjt:  LYFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGY

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e value: 3.2e-05; % identity: 24.11)
Query:  SYFLTIVDDCTRYTWVYML--RRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKL---SFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA
        SYF++  D+ TR+ WVY L  RR+  +L++       ++ Q+   + +   D   +    +   FF  +G+   ++       + V ER ++ +LN  R 
Subjt:  SYFLTIVDDCTRYTWVYML--RRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKL---SFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA

Query:  LYFQSRVSLSFW
        L   S +    W
Subjt:  LYFQSRVSLSFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.0e-27; % identity: 33.88)
Query:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPK-LSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA
        H  + Y++  VD  TRYTW+Y L++KS V      F  L+E ++Q  I  F+SDN  + ++  ++F   G+ H  S     E N + ERKH+HI+     
Subjt:  HVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPK-LSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARA

Query:  LYFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYPL
        L   + +  ++W       VY+INR P+ +L   +PF+ L G+  +Y  LRVFGC C+    P++  K   ++   +F+GY L
Subjt:  LYFQSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYPL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.1e-25; % identity: 33.89)
Query:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPK-LSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF
        + Y++  VD  TRYTW+Y L++KS V      F  LVE ++Q  I   +SDN  + +   D+    G+ H  S     E N + ERKH+HI+ +   L  
Subjt:  FSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPK-LSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYF

Query:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYPL
         + V  ++W       VY+INR P+ +L  ++PF+ L G   +Y  L+VFGC C+    P++  K   ++    F+GY L
Subjt:  QSRVSLSFWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYPL

Arabidopsis top hits (e value, % identity, alignment)
No hits found
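
In BLAST-style alignments such as those in the Homology section, the line between `Query:` and `Subjt:` is a midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those symbols for one midline segment. The helper name `midline_stats` is hypothetical, and the per-segment percentage it gives need not match the % identity reported for a hit, which is presumably computed over the full alignment. Trailing spaces lost during text extraction would also change the column count.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST-style midline."""
    # Letters in the midline mark identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First 11 columns of the KAA0031937.1 midline from the GenBank hits.
identities, positives, columns = midline_stats("G+SYFLTIVDD")
print(identities, positives, columns)
print(round(100 * identities / columns, 2))
```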

Sequences
CDS sequence
ATGCATGTTGGTTTCTCTTATTTCCTTACCATTGTGGATGATTGCACGCGTTATACTTGGGTATATATGCTTAGGAGGAAGTCTGATGTTTTGTCTATTATACCTCAATT
TTTTAAGCTTGTGGAGACACAATATCAGAAATCTATTAAAATTTTTCATTCTGACAATGCTCCTAAACTCTCCTTTGATGATTTCTTTCGAACTAAAGGGGTTGTTCATC
AGTTTTCGTATATGGGGCGTCTTGAGCGAAATTCAGTTGTTGAGAGGAAGCACCAACATATCCTTAATGTTGCCCGTGCACTTTATTTTCAGTCTCGAGTCTCTTTGAGT
TTTTGGGGCGAATGTATTCTTACGGTCGTATATATTATTAACAGAACTCCTTCTCATGTTCTTCATTGGCGTACACCTTTTGAGTTGTTGACGGGCTCTATGGCTGATTA
TTCCCTTTTAAGGGTCTTTGGTTGTCTCTGTTTTGCCTCCACACTTCCTCATCATCTCTCTAAGTTTTCTCCTCGTGCAACCCCTGCTATGTTTGTTGGTTACCCCCTAG
CATGA
mRNA sequence
ATGCATGTTGGTTTCTCTTATTTCCTTACCATTGTGGATGATTGCACGCGTTATACTTGGGTATATATGCTTAGGAGGAAGTCTGATGTTTTGTCTATTATACCTCAATT
TTTTAAGCTTGTGGAGACACAATATCAGAAATCTATTAAAATTTTTCATTCTGACAATGCTCCTAAACTCTCCTTTGATGATTTCTTTCGAACTAAAGGGGTTGTTCATC
AGTTTTCGTATATGGGGCGTCTTGAGCGAAATTCAGTTGTTGAGAGGAAGCACCAACATATCCTTAATGTTGCCCGTGCACTTTATTTTCAGTCTCGAGTCTCTTTGAGT
TTTTGGGGCGAATGTATTCTTACGGTCGTATATATTATTAACAGAACTCCTTCTCATGTTCTTCATTGGCGTACACCTTTTGAGTTGTTGACGGGCTCTATGGCTGATTA
TTCCCTTTTAAGGGTCTTTGGTTGTCTCTGTTTTGCCTCCACACTTCCTCATCATCTCTCTAAGTTTTCTCCTCGTGCAACCCCTGCTATGTTTGTTGGTTACCCCCTAG
CATGA
Protein sequence
MHVGFSYFLTIVDDCTRYTWVYMLRRKSDVLSIIPQFFKLVETQYQKSIKIFHSDNAPKLSFDDFFRTKGVVHQFSYMGRLERNSVVERKHQHILNVARALYFQSRVSLS
FWGECILTVVYIINRTPSHVLHWRTPFELLTGSMADYSLLRVFGCLCFASTLPHHLSKFSPRATPAMFVGYPLA
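
The relationship between the CDS and protein sequences in this record can be spot-checked by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and an input that starts in frame; the function name `translate` is illustrative and is not taken from CuGenDB. It translates only the first 30 and last 18 nucleotides of the CDS, copied from the record.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 18 nt of the CDS sequence in this record.
print(translate("ATGCATGTTGGTTTCTCTTATTTCCTTACC"))  # MHVGFSYFLT
print(translate("GGTTACCCCCTAGCATGA"))              # GYPLA*
```

Both fragments agree with the start (`MHVGFSYFLT`) and end (`GYPLA`) of the protein sequence listed here, with the final `TGA` read as the stop codon.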