CuGenDBv2

Sgr016795 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016795
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: tig00153010:820673..821341
RNA-Seq Expression: Sgr016795
Synteny: Sgr016795
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
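
The genome location field packs a contig name and a coordinate range into one string. The sketch below shows one way to split it and compute the span length. The function names are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


contig, start, end = parse_location("tig00153010:820673..821341")
print(contig, span_length(start, end))  # tig00153010 669
```

Under that assumption the gene spans 669 bp.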


Homology
GenBank top hits | e value | % identity
GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] | 1.7e-29 | 49.65
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+ +  +D   F EACQF K H L F PS+ HV  P  LIHSDVWGP+PI S   FKYY HF+DDF++  W++PL++KSDTI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
         + QFN  +K IQ DGGGEYK +    I   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

KYP31892.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] | 6.3e-29 | 48.44
Query:  LNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI
        L+ V+K CN+ +  +D   F EACQ+ K H L F  S+ H    FEL+H+DVWGP+P+ S   FKYY HFLDDF++  W+YPL+ KSDT   F  F    
Subjt:  LNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI

Query:  QTQFN--LKTIQIDGGGEYKPLISHNVQ
        +  FN  +KTIQ DGGGEYK + +H ++
Subjt:  QTQFN--LKTIQIDGGGEYKPLISHNVQ

MCH94186.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Trifolium medium] | 6.3e-29 | 45.45
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+ +  +D   F EACQ+ K H L F  S+ H   P EL+H+DVWGP+PI +   FKYY HF+DDF++  W+YPL++KS+T+  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
         + QFN  +K IQ DGGGEYKP+    +   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] | 3.3e-30 | 52.03
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K+CN+    +D  KF EACQ  KSH L F  S+ H     ELIH+DVWGP+PI+SI  FKYY HF+DD ++  W+YPL++KSDTI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL
        ++ QFN  +K IQ DGGGE+KP+
Subjt:  IQTQFN--LKTIQIDGGGEYKPL

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense] | 1.7e-29 | 47.55
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+   S+D  KF EACQF K H L F  S  H   P +LIH+DVWGP+PI S   FKYY HF+DDF++  W+YPL++KS+TI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
        ++ QFN  +K +Q DGGGEYK +    +   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

TrEMBL top hits | e value | % identity
A0A151QNR2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) | 3.0e-29 | 48.44
Query:  LNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI
        L+ V+K CN+ +  +D   F EACQ+ K H L F  S+ H    FEL+H+DVWGP+P+ S   FKYY HFLDDF++  W+YPL+ KSDT   F  F    
Subjt:  LNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI

Query:  QTQFN--LKTIQIDGGGEYKPLISHNVQ
        +  FN  +KTIQ DGGGEYK + +H ++
Subjt:  QTQFN--LKTIQIDGGGEYKPLISHNVQ

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-30 | 52.03
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K+CN+    +D  KF EACQ  KSH L F  S+ H     ELIH+DVWGP+PI+SI  FKYY HF+DD ++  W+YPL++KSDTI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL
        ++ QFN  +K IQ DGGGE+KP+
Subjt:  IQTQFN--LKTIQIDGGGEYKPL

A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment) | 8.0e-30 | 47.55
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+   S+D  KF EACQF K H L F  S  H   P +LIH+DVWGP+PI S   FKYY HF+DDF++  W+YPL++KS+TI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
        ++ QFN  +K +Q DGGGEYK +    +   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

A0A2Z6P4D5 Integrase catalytic domain-containing protein | 8.0e-30 | 49.65
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+ +  +D   F EACQF K H L F PS+ HV  P  LIHSDVWGP+PI S   FKYY HF+DDF++  W++PL++KSDTI  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
         + QFN  +K IQ DGGGEYK +    I   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

A0A392N2Z1 Retrovirus-related pol polyprotein from transposon tnt 1-94 (Fragment) | 3.0e-29 | 45.45
Query:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF
        +L+ V+K CN+ +  +D   F EACQ+ K H L F  S+ H   P EL+H+DVWGP+PI +   FKYY HF+DDF++  W+YPL++KS+T+  F  F   
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAF

Query:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS
         + QFN  +K IQ DGGGEYKP+    +   +QF     PY S
Subjt:  IQTQFN--LKTIQIDGGGEYKPL----ISHNVQFNEEGFPYAS

SwissProt top hits | e value | % identity
P04146 Copia protein | 1.6e-06 | 37.5
Query:  HVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFIQTQFNLKTI--QIDGGGEY
        H+  P  ++HSDV GP    ++D   Y+  F+D F      Y ++ KSD    FQ F A  +  FNLK +   ID G EY
Subjt:  HVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFIQTQFNLKTI--QIDGGGEY

P0C2J0 Transposon Ty1-PR2 Gag-Pol polyprotein | 4.5e-06 | 34.78
Query:  PFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLR-RKSDTILP-FQYFNAFIQTQF--NLKTIQIDGGGEYKPLISHNVQFNEEG
        PF+ +H+D++GP          Y+  F D+  K  W+YPL  R+ D+IL  F    AFI+ QF  ++  IQ+D G EY     H     + G
Subjt:  PFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLR-RKSDTILP-FQYFNAFIQTQF--NLKTIQIDGGGEYKPLISHNVQFNEEG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-12 | 35.92
Query:  TLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI--QTQFNLKTIQIDGG
        T+K  + C F K H + F  S+       +L++SDV GP  I+S+   KY+  F+DD ++ +W+Y L+ K      FQ F+A +  +T   LK ++ D G
Subjt:  TLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFI--QTQFNLKTIQIDGG

Query:  GEY
        GEY
Subjt:  GEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.3e-13 | 34.62
Query:  MLNNVIKLCNLPMQSNDTLKFSEA--CQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFN
        +LN+VI   +L +  N + KF     C  +KS+ + FS ST +   P E I+SDVW  SPI S D ++YY  F+D F +  WLYPL++KS     F  F 
Subjt:  MLNNVIKLCNLPMQSNDTLKFSEA--CQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFN

Query:  AFIQTQFNLK--TIQIDGGGEYKPLISHNVQFNEEGFPYASGFGDSLPSSTTPPST
          ++ +F  +  T   D GGE+  L  +   F++ G  +           T+PP T
Subjt:  AFIQTQFNLK--TIQIDGGGEYKPLISHNVQFNEEGFPYASGFGDSLPSSTTPPST

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.3e-16 | 40.98
Query:  MLNNVIKLCNLPM--QSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFN
        +LN+VI   +LP+   S+  L  S+ C  +KSH + FS ST   + P E I+SDVW  SPI SID ++YY  F+D F +  WLYPL++KS     F  F 
Subjt:  MLNNVIKLCNLPM--QSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFN

Query:  AFIQTQFNLK--TIQIDGGGEY
        + ++ +F  +  T+  D GGE+
Subjt:  AFIQTQFNLK--TIQIDGGGEY

Arabidopsis top hits | e value | % identity
No hits found
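
Each alignment above has a middle line in which a letter marks an identical residue and `+` marks a conservative substitution, following the usual BLAST convention. The sketch below shows how a % identity value can be derived from that line, using the P04146 (Copia protein) alignment as input. The function name is hypothetical. Dividing identities by the full alignment length, gaps included, is an assumption about how this database computes the value, although it reproduces the figure listed for this hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over alignment length, in percent."""
    # Letters in the midline mark identities; '+' and spaces are not counted.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Assumption: the denominator is the alignment length including gap columns.
    return 100 * identities / len(query)


# Query and middle lines of the P04146 alignment, copied from the record.
query = "HVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFIQTQFNLKTI--QIDGGGEY"
midline = "H+  P  ++HSDV GP    ++D   Y+  F+D F      Y ++ KSD    FQ F A  +  FNLK +   ID G EY"

print(percent_identity(query, midline))  # 37.5
```

That is 30 identical positions over 80 alignment columns.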

Sequences
CDS sequence
ATGCTTAATAATGTTATTAAACTTTGTAATTTGCCCATGCAATCTAATGATACACTTAAATTTTCTGAGGCATGCCAGTTTGATAAATCCCATGCACTCCTCTTCTCTCC
ATCTACATTTCATGTTGCGCCTCCTTTTGAATTAATACATTCAGATGTTTGGGGGCCATCTCCAATTGACTCCATAGATTGTTTTAAATACTATGCGCATTTTCTCGATG
ATTTCAACAAGCCCGTTTGGCTATATCCACTTAGAAGGAAGAGTGATACTATACTTCCCTTTCAATATTTCAATGCCTTTATTCAAACACAGTTTAATCTGAAGACCATT
CAGATAGATGGGGGCGGTGAATACAAACCCCTTATATCACACAATGTTCAGTTCAATGAGGAGGGTTTTCCATATGCCTCTGGCTTTGGTGACAGTTTGCCATCCTCTAC
CACCCCTCCCAGTACTGCGCCACCCATTTCCACATTGTTTCCAGCACATCCACTCAGTCATATATCTCCTTCACCACAACAACCAACACGTGTCCCTGCTTCTCCCTTGC
CCACTTCTTCTAATTCATCTAATTTTGTTTTACAGGAACATGCCATTAGATCTTCCCCAGTTATTGAAGTGGTGGCTCATCGTCTGCTGACTCTCATCAAGCTATTGAAA
CACCAATAA
mRNA sequence
ATGCTTAATAATGTTATTAAACTTTGTAATTTGCCCATGCAATCTAATGATACACTTAAATTTTCTGAGGCATGCCAGTTTGATAAATCCCATGCACTCCTCTTCTCTCC
ATCTACATTTCATGTTGCGCCTCCTTTTGAATTAATACATTCAGATGTTTGGGGGCCATCTCCAATTGACTCCATAGATTGTTTTAAATACTATGCGCATTTTCTCGATG
ATTTCAACAAGCCCGTTTGGCTATATCCACTTAGAAGGAAGAGTGATACTATACTTCCCTTTCAATATTTCAATGCCTTTATTCAAACACAGTTTAATCTGAAGACCATT
CAGATAGATGGGGGCGGTGAATACAAACCCCTTATATCACACAATGTTCAGTTCAATGAGGAGGGTTTTCCATATGCCTCTGGCTTTGGTGACAGTTTGCCATCCTCTAC
CACCCCTCCCAGTACTGCGCCACCCATTTCCACATTGTTTCCAGCACATCCACTCAGTCATATATCTCCTTCACCACAACAACCAACACGTGTCCCTGCTTCTCCCTTGC
CCACTTCTTCTAATTCATCTAATTTTGTTTTACAGGAACATGCCATTAGATCTTCCCCAGTTATTGAAGTGGTGGCTCATCGTCTGCTGACTCTCATCAAGCTATTGAAA
CACCAATAA
Protein sequence
MLNNVIKLCNLPMQSNDTLKFSEACQFDKSHALLFSPSTFHVAPPFELIHSDVWGPSPIDSIDCFKYYAHFLDDFNKPVWLYPLRRKSDTILPFQYFNAFIQTQFNLKTI
QIDGGGEYKPLISHNVQFNEEGFPYASGFGDSLPSSTTPPSTAPPISTLFPAHPLSHISPSPQQPTRVPASPLPTSSNSSNFVLQEHAIRSSPVIEVVAHRLLTLIKLLK
HQ
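
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows a minimal translation with the standard genetic code, applied to the first and last few codons of the CDS above. The function and constant names are hypothetical, and the use of the standard nuclear code is an assumption, since the record does not say which translation table was used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering:
# the first base varies slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCTTAATAATGTTATTAAA"))  # MLNNVIK
print(translate("AAACACCAATAA"))           # KHQ
```

Both fragments agree with the start (MLNNVIK) and end (KHQ) of the protein sequence listed above. Passing the full CDS, with or without line breaks, translates the whole gene the same way.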