; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019849 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019849
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153419:905694..907135
RNA-Seq ExpressionSgr019849
SyntenySgr019849
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.8e-2738.38Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  +   V+DK TG+V+L   +K+ LYQL                           +  N  AF+            WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        D V+ +  +    +DN  FC+ACQ+GK   L F  S+SHA  P ELV++++ GP+P+ +  GFK+Y+HF+DDFSR  WIYPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]7.0e-2738.92Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  D   V+DK TG+V+L   +K+ LYQL +         C                        V+ ++K      WHR+LGHPS  +L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        D V+    + T  +D   FC+ACQ GKS  L F  S+SHA    EL+++++ GP+P++SI GFK+Y+HF+DD SR  WIYPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.2e-2637.84Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +F EF  +   V+DK TGQ +L  R+K+ LYQL      +    C                        V+ ++K      WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        + V+    +    +D   FC+ACQFGK   L F  S+SH   P  L++S++ GP+P+ S  GFK+Y+HF+DDFSR  WI+PLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]1.4e-2736.76Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  D   V+DK TG+ +L  ++K  LYQ+ +    +   +C+ ++                                WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        D V+    + T  +D   FC+ACQFGK   L F  S SHA  P +L+++++ GP+P+ S  GFK+Y+HF+DDFSR  WIYPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]2.6e-2636.76Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN    EF  +   V+DK TG+ +L  ++++ LYQL S         C+ ++         +K N                   WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        + V+    +    ND   FC+ACQFGK   L F  S+SHA  P +L++S++ GP+P+ S   FK+Y+HF+DDFSR  WI+PLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

TrEMBL top hitse value%identityAlignment
A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment)6.8e-2836.76Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  D   V+DK TG+ +L  ++K  LYQ+ +    +   +C+ ++                                WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        D V+    + T  +D   FC+ACQFGK   L F  S SHA  P +L+++++ GP+P+ S  GFK+Y+HF+DDFSR  WIYPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

A0A2Z6MBG6 Integrase catalytic domain-containing protein8.9e-2838.38Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  +   V+DK TG+V+L   +K+ LYQL                           +  N  AF+            WHR+LGHP+ K+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        D V+ +  +    +DN  FC+ACQ+GK   L F  S+SHA  P ELV++++ GP+P+ +  GFK+Y+HF+DDFSR  WIYPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

A0A803P5A9 Uncharacterized protein2.7e-3247.03Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL
        L  DN +  EF  D   V+DK T +V+L   +K+ LYQL S      P+S   V    QSA P       S +    SN     + VWHR+LGHPS KIL
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKIL

Query:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
         LV+N+  +   FN+N  FC ACQ+GKS AL F  SNS AT   EL++++L GP+P++S   FKFYIHFLDD+SR  W+YPLK+K
Subjt:  DLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

A0A803PM38 Uncharacterized protein2.8e-2941.97Show/hide
Query:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNM-KYVSNCV-------WHRQL
        L  DN V  EF  D   V+DK+TGQVVL  ++K+ LYQ  +      P S +S++++   + P   S +  +   V SN+ K ++N +       WHR+L
Subjt:  LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNM-KYVSNCV-------WHRQL

Query:  GHPSPKILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        GHPS ++LD V++    +   N +  FC ACQ GKS +L F  +   ATAP ELV++++ GPSP+ S   F++YIHF+DDFSR+ WIYPLK K
Subjt:  GHPSPKILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

A0A803QD60 Uncharacterized protein6.6e-3144.68Show/hide
Query:  DNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAA------SLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSP
        DN V  EF+ D  VV+DK T +V+L   +++ LYQLQ+     T  S S + +      S    V   KS+ N    L     K   + VW R+LGHPSP
Subjt:  DNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAA------SLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSP

Query:  KILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        ++L  V+N+  +    N+   FC ACQFGKS AL F  SNS AT   +LV+S+L GPSPV S  GF+ YIHF+DD +R+ WIYPLK K
Subjt:  KILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-1333.98Show/hide
Query:  VSNCVWHRQLGHPSPKILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPL
        +S  +WH+++GH S K L ++     +          C  C FGK   +SF  S+       +LVYS++ GP  ++S+ G K+++ F+DD SR +W+Y L
Subjt:  VSNCVWHRQLGHPSPKILDLVINTYCLLTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPL

Query:  KRK
        K K
Subjt:  KRK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1732.79Show/hide
Query:  NAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKILDLVI
        N V  EF   SF V+D +TG  +L  + K+ LY+         P++ S   +   S  P  K+  +S                WH +LGHP+P IL+ VI
Subjt:  NAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKILDLVI

Query:  NTYCLLTKFNDNCPF--CKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        + Y  L+  N +  F  C  C   KS+ + F +S  ++T P E +YS++   SP+ S   +++Y+ F+D F+R+ W+YPLK+K
Subjt:  NTYCLLTKFNDNCPF--CKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-1531.32Show/hide
Query:  NAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKILDLVI
        N V  EF   SF V+D +TG  +L  + K+ LY+         P++ S   +   S  P  K+  +S                WH +LGHPS  IL+ VI
Subjt:  NAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKILDLVI

Query:  NTYCL-LTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK
        + + L +   +     C  C   KS  + F  S   ++ P E +YS++   SP+ SI  +++Y+ F+D F+R+ W+YPLK+K
Subjt:  NTYCL-LTKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTAAGATGTGATAATGCTGTTTTCTTTGAGTTTCATTTTGATTCTTTTGTTGTACAGGACAAGGATACGGGGCAAGTAGTGCTAGTGAGGAGAATTAAAAATAGTCTTTA
TCAACTTCAATCTTGTGATCATCTTGCTACTCCTGTTTCATGCTCATCAGTCGCTGCCAGTCTACAGAGTGCAGTTCCTATGTTGAAGTCAAATGCTAATAGTCTTGCTT
TTCTTGTTTTTAGTAATATGAAATATGTTTCCAACTGTGTTTGGCATAGGCAATTAGGGCACCCTTCCCCTAAAATTCTTGATCTTGTAATCAATACCTACTGTTTGTTG
ACCAAGTTTAATGATAATTGTCCTTTTTGCAAAGCATGTCAATTTGGAAAATCATCTGCTCTTTCTTTTCTACGCTCCAATTCTCATGCTACAGCACCATTTGAGTTAGT
TTATTCAAATCTTTTGGGGCCATCTCCAGTTGATTCCATTCCTGGTTTTAAATTTTACATTCACTTTTTGGATGATTTCAGTCGGCATGTGTGGATTTATCCTCTTAAAC
GAAAAATCGGATGGTGGTGGTGGTATAAGCCACTTGCTGCTATTTGTCAGTCTCTTGGCATTGGTTTTCTGTTTGCTTGCCTACATACTCTATGTCCATCTCTAGCTCCT
ACTTCGCCTCTTAATATTGCTCCCTCAATTGCTTTTGAAAATGGAGAATCTGGGACAATTGCTTCTTCTGCAACCTCTATTGCCTTGGCTAATCCTCCTGCTGCTTTTCT
TCAGAATATTCATCCCATGTAG
mRNA sequenceShow/hide mRNA sequence
CTAAGATGTGATAATGCTGTTTTCTTTGAGTTTCATTTTGATTCTTTTGTTGTACAGGACAAGGATACGGGGCAAGTAGTGCTAGTGAGGAGAATTAAAAATAGTCTTTA
TCAACTTCAATCTTGTGATCATCTTGCTACTCCTGTTTCATGCTCATCAGTCGCTGCCAGTCTACAGAGTGCAGTTCCTATGTTGAAGTCAAATGCTAATAGTCTTGCTT
TTCTTGTTTTTAGTAATATGAAATATGTTTCCAACTGTGTTTGGCATAGGCAATTAGGGCACCCTTCCCCTAAAATTCTTGATCTTGTAATCAATACCTACTGTTTGTTG
ACCAAGTTTAATGATAATTGTCCTTTTTGCAAAGCATGTCAATTTGGAAAATCATCTGCTCTTTCTTTTCTACGCTCCAATTCTCATGCTACAGCACCATTTGAGTTAGT
TTATTCAAATCTTTTGGGGCCATCTCCAGTTGATTCCATTCCTGGTTTTAAATTTTACATTCACTTTTTGGATGATTTCAGTCGGCATGTGTGGATTTATCCTCTTAAAC
GAAAAATCGGATGGTGGTGGTGGTATAAGCCACTTGCTGCTATTTGTCAGTCTCTTGGCATTGGTTTTCTGTTTGCTTGCCTACATACTCTATGTCCATCTCTAGCTCCT
ACTTCGCCTCTTAATATTGCTCCCTCAATTGCTTTTGAAAATGGAGAATCTGGGACAATTGCTTCTTCTGCAACCTCTATTGCCTTGGCTAATCCTCCTGCTGCTTTTCT
TCAGAATATTCATCCCATGTAG
Protein sequenceShow/hide protein sequence
LRCDNAVFFEFHFDSFVVQDKDTGQVVLVRRIKNSLYQLQSCDHLATPVSCSSVAASLQSAVPMLKSNANSLAFLVFSNMKYVSNCVWHRQLGHPSPKILDLVINTYCLL
TKFNDNCPFCKACQFGKSSALSFLRSNSHATAPFELVYSNLLGPSPVDSIPGFKFYIHFLDDFSRHVWIYPLKRKIGWWWWYKPLAAICQSLGIGFLFACLHTLCPSLAP
TSPLNIAPSIAFENGESGTIASSATSIALANPPAAFLQNIHPM