; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025492 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025492
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationtig00006406:1925114..1928426
RNA-Seq ExpressionSgr025492
SyntenySgr025492
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2286048.1 hypothetical protein GH714_009932 [Hevea brasiliensis]1.7e-2638.42Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK
        MKES ++++Y +KL++IVN++RL+G +F D R+V+KI+V+   +FE+ I ++E   DL+ + +AEL+                    + + + + R +++
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK

Query:  ----MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
                  RVFSW+S+KQ++VAQST EA++I A +  NQA+WL KLL DL  + +E TK+  DN++AIAI++NP+
Subjt:  ----MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

KAF2303818.1 hypothetical protein GH714_023602 [Hevea brasiliensis]1.0e-1839.78Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDL-TTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNML
        MK+  +V++Y+ KL+  VN+IRL+GEDF  ++VVEK+++S  +KFE+KI AI   +       +   IR           V    L +    S  AR   
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDL-TTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNML

Query:  KMMIERERV-------FSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
         M      V       FSW SRKQD+VAQ TA+A+Y+  A+AANQAIWL K+L DL    ++   +  DN+SAI++A NP+
Subjt:  KMMIERERV-------FSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

XP_015577125.1 uncharacterized protein LOC107261552 [Ricinus communis]1.5e-1774.24Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        VFSWN++KQDVVAQSTAEA+YI AA+AANQAIWL  LL+DL F+   PTKL C NKSAIAIA NP+
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

XP_022150313.1 uncharacterized protein LOC111018511 [Momordica charantia]2.9e-2184.85Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        VFSW SRKQ+VVAQ TAEA+YILAASAANQAIWL KLLDDL FKPEEPT L CDNKSAIAIA NP+
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

XP_038979882.1 uncharacterized protein LOC120109997 [Phoenix dactylifera]1.9e-2036.84Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK
        MKE+ SV+DY+T+ + +VNQ+++ GE+  D ++VEKI++S   KF+S +  IEE  DL  L + +++ + K   + ++                 R+  K
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK

Query:  MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASN
         +   E  F     KQ  VAQ +AEA+Y+LA  A +QAIWL ++L+ +  K EE  ++ CDNK+AIA+A N
Subjt:  MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASN

TrEMBL top hitse value%identityAlignment
A0A396INR0 Putative RNA-directed DNA polymerase7.9e-1771.21Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        VFSWNS+KQDVVAQS+AEA+YI AA+A+NQAIW+ K+L DL    EEP  L CDNKSAIAIA NP+
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

A0A396INR0 Putative RNA-directed DNA polymerase3.9e-0850.75Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELI
        MKE  SV++YT+KL  +VNQ+RL GE   D +VVEK+++S   KFE+K+ AIEE  DL  L ++E++
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELI

A0A396INR0 Putative RNA-directed DNA polymerase3.0e-1671.21Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        VFSWNS+KQD+VAQSTAEA+Y+ AA AANQAIWL KLL DL  K   PT + CDN SAIAIA NP+
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

A0A6A6KDQ4 Uncharacterized protein8.4e-2738.42Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK
        MKES ++++Y +KL++IVN++RL+G +F D R+V+KI+V+   +FE+ I ++E   DL+ + +AEL+                    + + + + R +++
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLK

Query:  ----MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
                  RVFSW+S+KQ++VAQST EA++I A +  NQA+WL KLL DL  + +E TK+  DN++AIAI++NP+
Subjt:  ----MMIERERVFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

A0A6A6LU86 TPT domain-containing protein5.0e-1939.78Show/hide
Query:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDL-TTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNML
        MK+  +V++Y+ KL+  VN+IRL+GEDF  ++VVEK+++S  +KFE+KI AI   +       +   IR           V    L +    S  AR   
Subjt:  MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDL-TTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNML

Query:  KMMIERERV-------FSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
         M      V       FSW SRKQD+VAQ TA+A+Y+  A+AANQAIWL K+L DL    ++   +  DN+SAI++A NP+
Subjt:  KMMIERERV-------FSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

A0A6J1D946 uncharacterized protein LOC1110185111.4e-2184.85Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        VFSW SRKQ+VVAQ TAEA+YILAASAANQAIWL KLLDDL FKPEEPT L CDNKSAIAIA NP+
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-0741.94Show/hide
Query:  WNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNP
        WN+++Q+ VA S+ EA+Y+    A  +A+WL  LL  +  K E P K+  DN+  I+IA+NP
Subjt:  WNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-0539.06Show/hide
Query:  SWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL
        SW S+ Q  VA ST EA+YI A     + IWL + L +L    +E   + CD++SAI ++ N +
Subjt:  SWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.0e-0942.19Show/hide
Query:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASN
        + SW S+KQ VV++S+AEA+Y   + A ++ +WL +   +L+    +PT L CDN +AI IA+N
Subjt:  VFSWNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAATCAAACTCTGTGAGGGATTATACGACCAAACTAATGACCATTGTGAATCAAATCAGACTAGTTGGTGAAGATTTTCCTGATGAAAGAGTTGTGGAAAAAAT
AATGGTTAGTGATTCCAGTAAATTTGAGTCAAAGATCTTAGCCATAGAGGAGTTTTCTGATCTGACTACTCTCTTTATAGCTGAGTTAATTAGAAATTGCAAGCCCAAGA
ACAAAGGGATACAATGTGTAATAAAGAGCATGTTGAGGGTGCATTTAATACCAAGTCCAAAGGCAAGAAACATGTTGAAAATGATGATAGAAAGGGAAAGGGTCTTCTCC
TGGAATTCAAGAAAACAAGATGTTGTAGCTCAATCTACTGCTGAAGCAAAATATATTTTAGCAGCATCAGCTGCAAATCAAGCAATATGGCTTCACAAATTGCTTGATGA
TTTGAGATTTAAACCAGAGGAACCTACAAAATTAGTTTGTGATAACAAGTCTGCTATTGCTATTGCCTCAAATCCTTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAATCAAACTCTGTGAGGGATTATACGACCAAACTAATGACCATTGTGAATCAAATCAGACTAGTTGGTGAAGATTTTCCTGATGAAAGAGTTGTGGAAAAAAT
AATGGTTAGTGATTCCAGTAAATTTGAGTCAAAGATCTTAGCCATAGAGGAGTTTTCTGATCTGACTACTCTCTTTATAGCTGAGTTAATTAGAAATTGCAAGCCCAAGA
ACAAAGGGATACAATGTGTAATAAAGAGCATGTTGAGGGTGCATTTAATACCAAGTCCAAAGGCAAGAAACATGTTGAAAATGATGATAGAAAGGGAAAGGGTCTTCTCC
TGGAATTCAAGAAAACAAGATGTTGTAGCTCAATCTACTGCTGAAGCAAAATATATTTTAGCAGCATCAGCTGCAAATCAAGCAATATGGCTTCACAAATTGCTTGATGA
TTTGAGATTTAAACCAGAGGAACCTACAAAATTAGTTTGTGATAACAAGTCTGCTATTGCTATTGCCTCAAATCCTTTATAA
Protein sequenceShow/hide protein sequence
MKESNSVRDYTTKLMTIVNQIRLVGEDFPDERVVEKIMVSDSSKFESKILAIEEFSDLTTLFIAELIRNCKPKNKGIQCVIKSMLRVHLIPSPKARNMLKMMIERERVFS
WNSRKQDVVAQSTAEAKYILAASAANQAIWLHKLLDDLRFKPEEPTKLVCDNKSAIAIASNPL