CuGenDBv2

Sgr016433 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016433
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00152909:985608..986332
RNA-Seq Expression: Sgr016433
Synteny: Sgr016433
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
CAD1825827.1 unnamed protein product [Ananas comosus var. bracteatus] | 1.8e-55 | 57.87
Query:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----
        ++EC   +TP+  KEKLQK DG EPADE +               PD M+ VSVLSRF +  S+ HMV AKRVLRYLKGTLS+GIK  +V+ F+LQ    
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----

Query:  --------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIKYYVLRDMQKEGEVLLVHCSSDEQIAD
                            ATAAVNQA WLKKL+ DLHL+QEE ++VFVDNQATL IS NPVFHG+TKHFKIKYY LR++QK GEV LV+CSS++QIAD
Subjt:  --------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIKYYVLRDMQKEGEVLLVHCSSDEQIAD

Query:  IFTKSFSVSRFQALRA
        IFTKSF V RF+ LRA
Subjt:  IFTKSFSVSRFQALRA

KYP62106.1 Copia protein [Cajanus cajan] | 3.8e-45 | 42.74
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----
        M++CKS++TPM +KEK  K DG E  DE                 PDI+  V++LSRF HCAS+ H+ AA+RV+RY+KGT SFGIK  K + F+L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----

Query:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      AT AVNQA+WL+K++ DL+L+Q+E  K+FVDNQA + IS NPVFHG+TKHF IK
Subjt:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         + LR++Q++G V LV+C ++EQI DIFTKS   S+F+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

RVX01687.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 8.5e-45 | 44.4
Query:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------
        M+ECKS +TPM  KEK  K DG E  DE +               PDIM+ VS+LSR+ HCAS+ H  AAKRV+RY+KGT+ +GIK S+V+SF       
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------

Query:  --------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                    + A AAVNQA+WL+KL+ DL +KQE S KVFVDNQAT+ I+ +PVFHG+TKHFKIK
Subjt:  --------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         Y LR++QKEG++ LV+C+++ Q ADI TK+    RF+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

XP_003613757.4 uncharacterized protein LOC11413243 [Medicago truncatula] | 1.1e-44 | 43.57
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----
        M++CKSV+TPM  KEKL K D  E  DE                 PDI+  VS+LSRF HCAS+ H+ AAKRV+RY+KGT++FGIK  K + ++L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----

Query:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      ATAAVNQA+WL+K++ DLHL+Q+E  K+FVDNQA + IS NPVFHG+TKHF IK
Subjt:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         + +R++QK+  V LV+C ++EQIADIFTK    S+F+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

XP_031377996.1 uncharacterized protein LOC116193313 [Punica granatum] | 7.0e-47 | 44.4
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----
        M+ECKSV+TPM  K K+QK DG  P DE                MPDIM+ V+VLSRF  CAS+ HMVAAKRV+RYLKGT S+G+K  +  +F+L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----

Query:  ----------------------------------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      ATAA NQA+WL+KL+ DL +  +   ++FVDNQA L IS NPVFHG+TKHFK+K
Subjt:  ----------------------------------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
        +Y LR++Q+ GE+ LV+C +++Q+AD+FTKSF V RF+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

TrEMBL top hits | e value | % identity | Alignment
A0A151RJK0 Copia protein | 7.1e-45 | 41.91
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----
        M++CKS++TPM +KEK  K DG E  DE                 PDI+  V++LSRF HCAS+ H+ AA+RV+RY+KGT SFGIK  K + F+L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----

Query:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      AT A+NQA+WL+K++ DL+L+Q+E  ++FVDNQA + IS NPVFHG+TKHF IK
Subjt:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         + LR++Q++G V LV+C ++EQIA+IFTKS   S+F+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

A0A151T4Y8 Copia protein | 1.9e-45 | 42.74
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----
        M++CKS++TPM +KEK  K DG E  DE                 PDI+  V++LSRF HCAS+ H+ AA+RV+RY+KGT SFGIK  K + F+L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFEL-----

Query:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      AT AVNQA+WL+K++ DL+L+Q+E  K+FVDNQA + IS NPVFHG+TKHF IK
Subjt:  ---------------------------------------------QATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         + LR++Q++G V LV+C ++EQI DIFTKS   S+F+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

A0A438IYC9 Retrovirus-related Pol polyprotein from transposon RE1 | 4.1e-45 | 44.4
Query:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------
        M+ECKS +TPM  KEK  K DG E  DE +               PDIM+ VS+LSR+ HCAS+ H  AAKRV+RY+KGT+ +GIK S+V+SF       
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------

Query:  --------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                    + A AAVNQA+WL+KL+ DL +KQE S KVFVDNQAT+ I+ +PVFHG+TKHFKIK
Subjt:  --------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         Y LR++QKEG++ LV+C+++ Q ADI TK+    RF+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

A0A6P8C8B8 uncharacterized protein LOC116193313 | 3.4e-47 | 44.4
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----
        M+ECKSV+TPM  K K+QK DG  P DE                MPDIM+ V+VLSRF  CAS+ HMVAAKRV+RYLKGT S+G+K  +  +F+L     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----

Query:  ----------------------------------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                                      ATAA NQA+WL+KL+ DL +  +   ++FVDNQA L IS NPVFHG+TKHFK+K
Subjt:  ----------------------------------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
        +Y LR++Q+ GE+ LV+C +++Q+AD+FTKSF V RF+ LR
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

A0A6V7P4U8 Uncharacterized protein | 8.9e-56 | 57.87
Query:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----
        ++EC   +TP+  KEKLQK DG EPADE +               PD M+ VSVLSRF +  S+ HMV AKRVLRYLKGTLS+GIK  +V+ F+LQ    
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADENM---------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQ----

Query:  --------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIKYYVLRDMQKEGEVLLVHCSSDEQIAD
                            ATAAVNQA WLKKL+ DLHL+QEE ++VFVDNQATL IS NPVFHG+TKHFKIKYY LR++QK GEV LV+CSS++QIAD
Subjt:  --------------------ATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIKYYVLRDMQKEGEVLLVHCSSDEQIAD

Query:  IFTKSFSVSRFQALRA
        IFTKSF V RF+ LRA
Subjt:  IFTKSFSVSRFQALRA

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 2.3e-16 | 25.41
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------
        M+ C +V+TP+  K   +  +  E  +                  PD+   V++LSR+    +       KRVLRYLKGT+   +   K  +FE      
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFE------

Query:  -----------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHF
                                                       +    AV +A+WLK L+  +++K E  +K++ DNQ  + I+ NP  H R KH 
Subjt:  -----------------------------------------------LQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHF

Query:  KIKYYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR
         IKY+  R+  +   + L +  ++ Q+ADIFTK    +RF  LR
Subjt:  KIKYYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.5e-15 | 27.05
Query:  MKECKSVNTPMILKEKLQKADGKEPADE--NM--------------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGT----LSFG-----
        MK  K V+TP+    KL K       +E  NM                    PDI + V V+SRF     K H  A K +LRYL+GT    L FG     
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE--NM--------------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGT----LSFG-----

Query:  ----------------------------------------IKLSKVESFELQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRT
                                                + LS  E+  + AT    + +WLK+ + +L L Q+E V V+ D+Q+ + +S N ++H RT
Subjt:  ----------------------------------------IKLSKVESFELQATAAVNQAVWLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRT

Query:  KHFKIKYYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ
        KH  ++Y+ +R+M  +  + ++  S++E  AD+ TK    ++F+
Subjt:  KHFKIKYYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.7e-16 | 27.73
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA---
        M   K V TPM    KL    G +  D                  PDI Y V+ LS+F H  ++ H+ A KR+LRYL GT + GI L K  +  L A   
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA---

Query:  ---------------------------------------TAAVNQAV--------WLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                               T A  ++V        W+  L+ +L ++      ++ DN     +  NPVFH R KH  I 
Subjt:  ---------------------------------------TAAVNQAV--------WLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ
        Y+ +R+  + G + +VH S+ +Q+AD  TK  S + FQ
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.3e-14 | 26.89
Query:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA---
        M   K V TPM    KL    G +  D                  PD+ Y V+ LS++ H  + +H  A KRVLRYL GT   GI L K  +  L A   
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADE---------------NMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA---

Query:  ---------------------------------------TAAVNQAV--------WLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK
                                               T A  ++V        W+  L+ +L ++      ++ DN     +  NPVFH R KH  + 
Subjt:  ---------------------------------------TAAVNQAV--------WLKKLIHDLHLKQEESVKVFVDNQATLGISLNPVFHGRTKHFKIK

Query:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ
        Y+ +R+  + G + +VH S+ +Q+AD  TK  S   FQ
Subjt:  YYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQ

Arabidopsis top hits | e value | % identity | Alignment
ATMG00810.1 DNA/RNA polymerases superfamily protein | 1.6e-04 | 29.17
Query:  MKECKSVNTPMILKEKLQKADGKEPADENM--------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA
        M +CK ++TP+ LK     +  K P   +               PDI Y V+++ +  H  +       KRVLRY+KGT+  G+ + K     +QA
Subjt:  MKECKSVNTPMILKEKLQKADGKEPADENM--------------PDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQA


Sequences
CDS sequence
ATGAAGGAGTGTAAGAGTGTGAACACTCCAATGATCCTGAAGGAGAAGTTGCAGAAGGCTGATGGAAAGGAACCTGCAGATGAGAATATGCCTGACATCATGTACACAGT
AAGTGTCTTATCTAGATTTTGTCACTGTGCAAGTAAGAATCATATGGTTGCTGCAAAGAGGGTTCTGAGATATCTGAAAGGCACACTGTCTTTTGGGATCAAATTAAGCA
AAGTTGAGAGTTTTGAGTTGCAAGCAACTGCAGCAGTGAACCAAGCTGTTTGGTTGAAGAAGTTGATTCATGACTTGCATCTGAAGCAAGAAGAGAGTGTGAAGGTGTTT
GTGGATAATCAAGCTACTCTAGGTATTTCATTGAATCCAGTTTTTCATGGAAGAACCAAGCATTTTAAGATCAAGTATTATGTTCTGAGAGACATGCAGAAGGAAGGAGA
AGTGCTGCTGGTTCATTGCAGCTCAGATGAGCAAATAGCAGATATCTTCACTAAGTCTTTTAGTGTTAGTCGTTTTCAAGCTCTTAGAGCAAGCTAG
mRNA sequence
ATGAAGGAGTGTAAGAGTGTGAACACTCCAATGATCCTGAAGGAGAAGTTGCAGAAGGCTGATGGAAAGGAACCTGCAGATGAGAATATGCCTGACATCATGTACACAGT
AAGTGTCTTATCTAGATTTTGTCACTGTGCAAGTAAGAATCATATGGTTGCTGCAAAGAGGGTTCTGAGATATCTGAAAGGCACACTGTCTTTTGGGATCAAATTAAGCA
AAGTTGAGAGTTTTGAGTTGCAAGCAACTGCAGCAGTGAACCAAGCTGTTTGGTTGAAGAAGTTGATTCATGACTTGCATCTGAAGCAAGAAGAGAGTGTGAAGGTGTTT
GTGGATAATCAAGCTACTCTAGGTATTTCATTGAATCCAGTTTTTCATGGAAGAACCAAGCATTTTAAGATCAAGTATTATGTTCTGAGAGACATGCAGAAGGAAGGAGA
AGTGCTGCTGGTTCATTGCAGCTCAGATGAGCAAATAGCAGATATCTTCACTAAGTCTTTTAGTGTTAGTCGTTTTCAAGCTCTTAGAGCAAGCTAG
Protein sequence
MKECKSVNTPMILKEKLQKADGKEPADENMPDIMYTVSVLSRFCHCASKNHMVAAKRVLRYLKGTLSFGIKLSKVESFELQATAAVNQAVWLKKLIHDLHLKQEESVKVF
VDNQATLGISLNPVFHGRTKHFKIKYYVLRDMQKEGEVLLVHCSSDEQIADIFTKSFSVSRFQALRAS
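
The CDS and protein sequences listed for Sgr016433 can be spot-checked against each other. The sketch below is an illustration only, not part of the CuGenDB record. It assumes the standard nuclear genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. It translates the first 36 bases and the last 18 bases of the CDS as listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
# Assumption: this gene is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 and last 18 bases of the CDS as listed in this record.
cds_start = "ATGAAGGAGTGTAAGAGTGTGAACACTCCAATGATC"
cds_end = "GCTCTTAGAGCAAGCTAG"

print(translate(cds_start))  # MKECKSVNTPMI
print(translate(cds_end))    # ALRAS
```

Both results agree with the start (MKECKSVNTPMI) and end (ALRAS) of the listed protein sequence, with the final TAG codon read as the stop.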