; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019994 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019994
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153446:477388..480320
RNA-Seq ExpressionSgr019994
SyntenySgr019994
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006952 - defense response (biological process)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PHU08115.1 hypothetical protein BC332_24604 [Capsicum chinense]7.2e-5253.81Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M++YLKAL LWE V +  D  PLG N T+  ++ +E+ K KK KAL+ +H+ LSD IF RI+ C+T KE W+KL+EEF+GS +VK++KLLTLKREFEML+
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MKE ++ ++Y  K++ I+N+IRL GE F + +V+EK+M+S+PS+FESKISAIE S DL TLS+ ELI K QAQEQ+ +I +E+  E AF  K KGK+ V 
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKG--GASTK
        KD+R+   D+  K K   G+S K
Subjt:  KDDRKATEDQGGKGKG--GASTK

RVX13462.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]7.2e-5251.83Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M+ YL++ GLW  V + ADP PLG N T+  ++ +EEEK+KK KA++ +H+ L+D IF +I++ +T K+ WDKL  EFEGS +VK V+LLTLKREFE++K
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MK++ S +DY  ++M +VNQ+RL GE F +Q+V+EKIMVSVP KFE+KISAIE S DL TL+IVEL  KL AQEQ+  +  +E  EGAF    KGK    
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKGGA
           +K  ++  GK +G +
Subjt:  KDDRKATEDQGGKGKGGA

XP_015165534.1 PREDICTED: uncharacterized protein LOC107061209 [Solanum tuberosum]2.5e-5247.47Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M++YLKAL LWE + +  DP PLG N T+  ++++E+ K +K KAL+ +H+TLSD IF RI+ C+T KE W+KL +EF+GS +VK +KLLTLKREFE+L+
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MKE +  ++Y AK++ IVN++RL GE F + +V+EK+M+S+P++FESKISAIE S DL TLS+ ELI KLQAQEQ+ +I +EE  E AF  + KG++ + 
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKGGASTKEEIQ-ELPSRRDNLVCRREKEGSTEHLQESEVDWVQ
        KD+R+   D+G +   G + + +    L +     + +++  GS E  + S VD+V+
Subjt:  KDDRKATEDQGGKGKGGASTKEEIQ-ELPSRRDNLVCRREKEGSTEHLQESEVDWVQ

XP_015167177.1 PREDICTED: uncharacterized protein LOC107061811 [Solanum tuberosum]4.5e-5456.59Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M++YLKAL LWE + T  DP PLG N T+  ++++E+ K +K KAL  +H+ LSD IF RI+ C+T KE W+KL EEF+GS +VK VKLLTLKREFEML+
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        M E ++ + Y AK++ IVN++RL GE FP+ RV+EK+M+S+P++FESKISAIE S DL TLS+ ELI KLQAQEQ  +I +EE  E AF  + KGK+ + 
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRK
        KD+R+
Subjt:  KDDRK

XP_022156661.1 uncharacterized protein LOC111023510 [Momordica charantia]3.2e-5271.18Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        MQSYLKALGLWE VST+ D +PLGENLTLN I LHEE+K+K  K LS IHA+LS+PIFA+IIDCKT KEA DKL EEFEGS             EFEMLK
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKL
        MK+S+S  DY  KVM IVNQIRL GE F +QRV+EKIMVSVPSKFESKIS IE SS+LTTLSI ELI KL
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKL

TrEMBL top hitse value%identityAlignment
A0A2G3BNR5 NB-ARC domain-containing protein3.5e-5253.81Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M++YLKAL LWE V +  D  PLG N T+  ++ +E+ K KK KAL+ +H+ LSD IF RI+ C+T KE W+KL+EEF+GS +VK++KLLTLKREFEML+
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MKE ++ ++Y  K++ I+N+IRL GE F + +V+EK+M+S+PS+FESKISAIE S DL TLS+ ELI K QAQEQ+ +I +E+  E AF  K KGK+ V 
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKG--GASTK
        KD+R+   D+  K K   G+S K
Subjt:  KDDRKATEDQGGKGKG--GASTK

A0A438F3A5 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-5251.83Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M+ YL++ GLW  V + ADP PLG N T+  ++ +EEEK+KK KA++ +H+ L+D IF +I++ +T K+ WDKL  EFEGS +VK V+LLTLKREFE++K
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MK+  S +DY  ++M +VNQ+RL GE F +Q+V+EKIMVSVP KFE+KISAIE S DL TL+IVEL  KL AQEQ+  +  +E  EGAF    KGK    
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKGGA
           +K  ++  GK +G +
Subjt:  KDDRKATEDQGGKGKGGA

A0A438JWX0 Retrovirus-related Pol polyprotein from transposon RE23.5e-5251.83Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M+ YL++ GLW  V + ADP PLG N T+  ++ +EEEK+KK KA++ +H+ L+D IF +I++ +T K+ WDKL  EFEGS +VK V+LLTLKREFE++K
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MK++ S +DY  ++M +VNQ+RL GE F +Q+V+EKIMVSVP KFE+KISAIE S DL TL+IVEL  KL AQEQ+  +  +E  EGAF    KGK    
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKGGA
           +K  ++  GK +G +
Subjt:  KDDRKATEDQGGKGKGGA

A0A6J1DVL9 uncharacterized protein LOC1110235101.6e-5271.18Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        MQSYLKALGLWE VST+ D +PLGENLTLN I LHEE+K+K  K LS IHA+LS+PIFA+IIDCKT KEA DKL EEFEGS             EFEMLK
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKL
        MK+S+S  DY  KVM IVNQIRL GE F +QRV+EKIMVSVPSKFESKIS IE SS+LTTLSI ELI KL
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKL

A5B9M8 Integrase catalytic domain-containing protein7.7e-5251.83Show/hide
Query:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK
        M+ YL++ GLW  V + ADP PLG N T+  ++ +EEEK+KK KA++ +H+ L+D IF +I++ +T K+ WDKL  EFEGS +VK V+LLTLKREFE++K
Subjt:  MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLK

Query:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG
        MK+  S +DY  ++M +VNQ+RL GE F +Q+V+EKIMVSVP KFE+KISAIE S DL TL+IVEL  KL AQEQ+  +  +E  EGAF    KGK    
Subjt:  MKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVG

Query:  KDDRKATEDQGGKGKGGA
           +K  ++  GK +G +
Subjt:  KDDRKATEDQGGKGKGGA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein1.7e-0623.81Show/hide
Query:  QSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEK------VKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSL--KVKAVKLLTLK
        +S L   GLW+ V  N  PQ   +N  L      EE        VK  KAL  + ++L+D +F + +   + K+ WD L +  E +   +++ V +  L+
Subjt:  QSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEK------VKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSL--KVKAVKLLTLK

Query:  REFEMLKMKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGA---FN
        ++ E LKM +  S   Y  K + I+ ++  A  +  +  + + +  ++   F+   S +E   D+  ++   L+     +  + +   EE + G      
Subjt:  REFEMLKMKESNSARDYRAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGA---FN

Query:  VKSKGKKLVG
        +KSK +K  G
Subjt:  VKSKGKKLVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCCTATCTCAAAGCACTTGGCCTTTGGGAATTTGTTTCAACTAATGCCGATCCTCAACCATTGGGAGAAAATCTGACGTTGAATCATATCAGACTACATGAAGA
GGAGAAGGTAAAGAAGCTCAAGGCCTTATCTGAAATTCATGCCACTTTATCTGATCCTATATTTGCTAGGATTATTGATTGTAAAACGACAAAAGAAGCTTGGGATAAAT
TACATGAGGAATTTGAAGGAAGCTTAAAGGTGAAAGCTGTCAAATTATTAACTTTGAAAAGAGAGTTTGAGATGCTGAAAATGAAGGAATCAAACTCTGCGAGGGACTAT
AGAGCTAAAGTGATGATCATTGTGAATCAGATCAGACTAGCTGGTGAAAAATTTCCCAATCAAAGAGTTATGGAAAAAATAATGGTTAGTGTTCCCAGTAAATTTGAGTC
AAAGATCTCAGCCATAGAGGGGTCTTCTGATTTGACTACTCTCTCTATAGTTGAGTTAATTGGCAAATTACAAGCCCAGGAACAAAAGGATACAATTTATAATGAAGAGC
ATGTTGAAGGTGCATTTAATGTCAAGTCTAAAGGTAAGAAACTTGTTGGAAAGGATGATAGAAAGGCAACTGAAGATCAAGGGGGCAAGGGAAAGGGAGGAGCATCAACA
AAAGAGGAAATTCAAGAACTGCCATCAAGAAGAGATAATTTAGTTTGCCGACGAGAGAAAGAAGGATCAACAGAACATCTCCAAGAATCTGAAGTTGATTGGGTGCAAGA
AAAATTTCAAATAGAAACCCATATGGCTGAATTTGAGAAAGATCCGAAAGAATCTCATGTGGCTAAAGTTGAAAAAGATTTGATTGTAGCTTATATGTTTGAAGTTGGAA
AACATGGAGAAATTTCAATTAATCAACCTCAAGAACCTGATTCTGATTTGTTACCTAAAAATACTGCGCTTGAATTGCAATTATTGAGTTTGGAACCTCAAAATCTGGCA
GGAGGAAAGAAACAAATTGGTAACGTTGTTGCTGGTAGTGGAGAGGTTGAAGAACCTAGTAGAGTCTTAATAAATCAACATCCTTCTCAGACTTGGCCGACAAGTGAAGA
AGATGATATTTTGAAAATTTTTGAAAGTGATTTCAGTGATTTCAGTGTGCGTATTCAGAGGGATTTTTTAGGAAGAACAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCCTATCTCAAAGCACTTGGCCTTTGGGAATTTGTTTCAACTAATGCCGATCCTCAACCATTGGGAGAAAATCTGACGTTGAATCATATCAGACTACATGAAGA
GGAGAAGGTAAAGAAGCTCAAGGCCTTATCTGAAATTCATGCCACTTTATCTGATCCTATATTTGCTAGGATTATTGATTGTAAAACGACAAAAGAAGCTTGGGATAAAT
TACATGAGGAATTTGAAGGAAGCTTAAAGGTGAAAGCTGTCAAATTATTAACTTTGAAAAGAGAGTTTGAGATGCTGAAAATGAAGGAATCAAACTCTGCGAGGGACTAT
AGAGCTAAAGTGATGATCATTGTGAATCAGATCAGACTAGCTGGTGAAAAATTTCCCAATCAAAGAGTTATGGAAAAAATAATGGTTAGTGTTCCCAGTAAATTTGAGTC
AAAGATCTCAGCCATAGAGGGGTCTTCTGATTTGACTACTCTCTCTATAGTTGAGTTAATTGGCAAATTACAAGCCCAGGAACAAAAGGATACAATTTATAATGAAGAGC
ATGTTGAAGGTGCATTTAATGTCAAGTCTAAAGGTAAGAAACTTGTTGGAAAGGATGATAGAAAGGCAACTGAAGATCAAGGGGGCAAGGGAAAGGGAGGAGCATCAACA
AAAGAGGAAATTCAAGAACTGCCATCAAGAAGAGATAATTTAGTTTGCCGACGAGAGAAAGAAGGATCAACAGAACATCTCCAAGAATCTGAAGTTGATTGGGTGCAAGA
AAAATTTCAAATAGAAACCCATATGGCTGAATTTGAGAAAGATCCGAAAGAATCTCATGTGGCTAAAGTTGAAAAAGATTTGATTGTAGCTTATATGTTTGAAGTTGGAA
AACATGGAGAAATTTCAATTAATCAACCTCAAGAACCTGATTCTGATTTGTTACCTAAAAATACTGCGCTTGAATTGCAATTATTGAGTTTGGAACCTCAAAATCTGGCA
GGAGGAAAGAAACAAATTGGTAACGTTGTTGCTGGTAGTGGAGAGGTTGAAGAACCTAGTAGAGTCTTAATAAATCAACATCCTTCTCAGACTTGGCCGACAAGTGAAGA
AGATGATATTTTGAAAATTTTTGAAAGTGATTTCAGTGATTTCAGTGTGCGTATTCAGAGGGATTTTTTAGGAAGAACAAGTTGA
Protein sequenceShow/hide protein sequence
MQSYLKALGLWEFVSTNADPQPLGENLTLNHIRLHEEEKVKKLKALSEIHATLSDPIFARIIDCKTTKEAWDKLHEEFEGSLKVKAVKLLTLKREFEMLKMKESNSARDY
RAKVMIIVNQIRLAGEKFPNQRVMEKIMVSVPSKFESKISAIEGSSDLTTLSIVELIGKLQAQEQKDTIYNEEHVEGAFNVKSKGKKLVGKDDRKATEDQGGKGKGGAST
KEEIQELPSRRDNLVCRREKEGSTEHLQESEVDWVQEKFQIETHMAEFEKDPKESHVAKVEKDLIVAYMFEVGKHGEISINQPQEPDSDLLPKNTALELQLLSLEPQNLA
GGKKQIGNVVAGSGEVEEPSRVLINQHPSQTWPTSEEDDILKIFESDFSDFSVRIQRDFLGRTS