; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016432 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016432
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationtig00152909:982756..985552
RNA-Seq ExpressionSgr016432
SyntenySgr016432
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW74810.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.4e-0663.27Show/hide
Query:  VIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ
        ++VSLYVDDLLVTGS   QI+ FK+EM  +FEMTDLG M +FLGM++++
Subjt:  VIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ

RVW74810.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.0e-2535.29Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++  P VFDG NY  WA+R++AY++  D W+ +   Y                                + +S  IF +IM LK   EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV
        G ER+KGM+VLNL                            QG     V+  ++++LFV +C +  +  + WL+DSGCTNHMT D+ LFK+L+K+  SKV
Subjt:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV

Query:  KIGN
        +IGN
Subjt:  KIGN

RVW83884.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.7e-2639.78Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++LAP VFDG NY+ WA+R++AY++  D W+ +   Y                                  +S  IF KIM LK A EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKG-----MKVLNLQGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN
        G+ER+KG      K    QG     V+  +E++LFV +C +  +    WL+DSGC NHMT D+ LFK+LDK+  SKV++GN
Subjt:  GDERIKG-----MKVLNLQGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN

RVW83884.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.4e-0649.28Show/hide
Query:  ESLSESILYVKKS---GTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ
        + + E +  +KK        ++VSLYVDDLLVTGS   QI+ FK+EM  +FEMTDLG M +FLGM++++
Subjt:  ESLSESILYVKKS---GTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ

RVW83884.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.0e-2533.78Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++LAP VFDG NY+ WA+R++AY++  D W+ L   Y                                  +S  IF +IM LK A EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKGMKVLNL--------------------------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTS
        G+ER++G++VL+L                                            QG     V+  +E++LFV +C +  +  + WL+DSGCTNHMT 
Subjt:  GDERIKGMKVLNL--------------------------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTS

Query:  DKELFKDLDKSFKSKVKIGNAV
        D+ LFK+LDK+  SKV++GN V
Subjt:  DKELFKDLDKSFKSKVKIGNAV

XP_022158688.1 uncharacterized protein LOC111025149 [Momordica charantia]5.1e-4140Show/hide
Query:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF
        ME  SN+LSSLAP VFDGENY  WAIRIQAYMEG DYW+ +E                                  Y  +SP IFN+IMALK AKEIWEF
Subjt:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF

Query:  LKGQYEGDERIKGMKVLNL---------------------------------------------------------------------------------
        LK +YEGDERIKGMKVLNL                                                                                 
Subjt:  LKGQYEGDERIKGMKVLNL---------------------------------------------------------------------------------

Query:  ---------------------------------QGRAHVAVQREEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN
                                         QG AH AVQ+EED+LFVATC S VTQCD+WLVDSGCTN M SDKELFKDLD+SFKS+VKIGN
Subjt:  ---------------------------------QGRAHVAVQREEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN

XP_038889190.1 uncharacterized protein LOC120079069 [Benincasa hispida]2.6e-3236.43Show/hide
Query:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF
        MES SNSLSSL P VFDGENY AWAIR+QAYME  DYW+ +E                                  Y+ +SP IFN+IMALK  KEIWEF
Subjt:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF

Query:  LKGQYEGDERIKGMKVLNL-------QGRAHVAVQREEDKLF----------------------------------------------------------
        LK +YEGDERIKGMKVLNL       Q + + +++   DKL                                                           
Subjt:  LKGQYEGDERIKGMKVLNL-------QGRAHVAVQREEDKLF----------------------------------------------------------

Query:  ---------------VATC------------------------------VSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN
                        +TC                                L TQCD WLVDSGCTNHMT+DKELFKD+DKSFK +VKIGN
Subjt:  ---------------VATC------------------------------VSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN

TrEMBL top hitse value%identityAlignment
A0A438GRH0 Retrovirus-related Pol polyprotein from transposon RE12.6e-0663.27Show/hide
Query:  VIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ
        ++VSLYVDDLLVTGS   QI+ FK+EM  +FEMTDLG M +FLGM++++
Subjt:  VIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ

A0A438GRH0 Retrovirus-related Pol polyprotein from transposon RE11.5e-2535.29Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++  P VFDG NY  WA+R++AY++  D W+ +   Y                                + +S  IF +IM LK   EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV
        G ER+KGM+VLNL                            QG     V+  ++++LFV +C +  +  + WL+DSGCTNHMT D+ LFK+L+K+  SKV
Subjt:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV

Query:  KIGN
        +IGN
Subjt:  KIGN

A0A438HHG6 Retrovirus-related Pol polyprotein from transposon RE11.3e-2639.78Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++LAP VFDG NY+ WA+R++AY++  D W+ +   Y                                  +S  IF KIM LK A EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKG-----MKVLNLQGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN
        G+ER+KG      K    QG     V+  +E++LFV +C +  +    WL+DSGC NHMT D+ LFK+LDK+  SKV++GN
Subjt:  GDERIKG-----MKVLNLQGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN

A0A438HHG6 Retrovirus-related Pol polyprotein from transposon RE12.6e-0649.28Show/hide
Query:  ESLSESILYVKKS---GTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ
        + + E +  +KK        ++VSLYVDDLLVTGS   QI+ FK+EM  +FEMTDLG M +FLGM++++
Subjt:  ESLSESILYVKKS---GTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQ

A0A438HHG6 Retrovirus-related Pol polyprotein from transposon RE11.5e-2533.78Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++LAP VFDG NY+ WA+R++AY++  D W+ L   Y                                  +S  IF +IM LK A EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKGMKVLNL--------------------------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTS
        G+ER++G++VL+L                                            QG     V+  +E++LFV +C +  +  + WL+DSGCTNHMT 
Subjt:  GDERIKGMKVLNL--------------------------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTS

Query:  DKELFKDLDKSFKSKVKIGNAV
        D+ LFK+LDK+  SKV++GN V
Subjt:  DKELFKDLDKSFKSKVKIGNAV

A0A6J1DWT9 uncharacterized protein LOC1110251492.5e-4140Show/hide
Query:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF
        ME  SN+LSSLAP VFDGENY  WAIRIQAYMEG DYW+ +E                                  Y  +SP IFN+IMALK AKEIWEF
Subjt:  MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLE--------------------------------LAYMQLSPGIFNKIMALKLAKEIWEF

Query:  LKGQYEGDERIKGMKVLNL---------------------------------------------------------------------------------
        LK +YEGDERIKGMKVLNL                                                                                 
Subjt:  LKGQYEGDERIKGMKVLNL---------------------------------------------------------------------------------

Query:  ---------------------------------QGRAHVAVQREEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN
                                         QG AH AVQ+EED+LFVATC S VTQCD+WLVDSGCTN M SDKELFKDLD+SFKS+VKIGN
Subjt:  ---------------------------------QGRAHVAVQREEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGN

A5BDI4 Integrase catalytic domain-containing protein1.5e-2535.29Show/hide
Query:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE
        SL++  P VFDG NY  WA+R++AY++  D W+ +   Y                                + +S  IF +IM LK   EIW FLK +YE
Subjt:  SLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAY--------------------------------MQLSPGIFNKIMALKLAKEIWEFLKGQYE

Query:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV
        G ER+KGM+VLNL                            QG     V+  ++++LFV +C +  +  + WL+DSGCTNHMT D+ LFK+L+K+  SKV
Subjt:  GDERIKGMKVLNL----------------------------QGRAHVAVQR-EEDKLFVATCVSLVTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKV

Query:  KIGN
        +IGN
Subjt:  KIGN

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-0441.54Show/hide
Query:  ESLSESILYVKK-SGTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKI
        ++ S+  +Y K+ S  N +I+ LYVDD+L+ G D   I   K ++ K F+M DLG     LGMKI
Subjt:  ESLSESILYVKK-SGTNLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKI

Arabidopsis top hitse value%identityAlignment
ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-0445.45Show/hide
Query:  LYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQ
        LYVDD+L+TGS    + +   ++   F M DLG +HYFLG++I+
Subjt:  LYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCAAGATCAAATAGTCTTTCTTCACTGGCTCCATCTGTGTTTGATGGTGAAAACTACCATGCATGGGCTATCAGAATACAAGCCTACATGGAGGGTTATGATTA
TTGGCAAAGGCTAGAGCTTGCCTATATGCAGCTGTCTCCTGGCATATTCAACAAAATTATGGCCTTGAAGTTAGCAAAAGAGATTTGGGAGTTCCTCAAAGGTCAGTATG
AAGGAGATGAGAGGATTAAGGGCATGAAAGTGTTGAACTTGCAAGGAAGAGCACATGTTGCAGTACAGCGAGAAGAAGATAAGCTTTTTGTGGCTACCTGCGTCTCATTA
GTCACTCAATGTGACAATTGGTTGGTTGATAGTGGGTGTACCAATCATATGACAAGTGACAAGGAGTTGTTCAAGGACCTTGACAAGTCATTTAAGTCAAAGGTGAAGAT
AGGAAATGCTGTTCAAAATAAAGATGCAAGACAAGAGCTTCTCTTTGGATCCACTCGAGAAGGAGTAGATAGCTTTCAAGTGTCAAGAGAAGTAGAGGTGAAACAACAAG
AAGTTTCTCTGGATCTTGCTGAACTGGTTGATGATGCACCAATTAGAGGGACTAGACTGCTGAGTGAGAGTTTGAGTGAATCCATACTCTATGTGAAGAAATCAGGTACT
AATCTTGTGATTGTATCTCTTTATGTAGATGACTTGTTGGTGACAGGAAGTGATGGTGTTCAAATAGAGGTTTTCAAGCAAGAAATGATGAAGCTGTTTGAGATGACTGA
TCTAGGTTTGATGCATTACTTCTTGGGTATGAAGATTCAGCAAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCAAGATCAAATAGTCTTTCTTCACTGGCTCCATCTGTGTTTGATGGTGAAAACTACCATGCATGGGCTATCAGAATACAAGCCTACATGGAGGGTTATGATTA
TTGGCAAAGGCTAGAGCTTGCCTATATGCAGCTGTCTCCTGGCATATTCAACAAAATTATGGCCTTGAAGTTAGCAAAAGAGATTTGGGAGTTCCTCAAAGGTCAGTATG
AAGGAGATGAGAGGATTAAGGGCATGAAAGTGTTGAACTTGCAAGGAAGAGCACATGTTGCAGTACAGCGAGAAGAAGATAAGCTTTTTGTGGCTACCTGCGTCTCATTA
GTCACTCAATGTGACAATTGGTTGGTTGATAGTGGGTGTACCAATCATATGACAAGTGACAAGGAGTTGTTCAAGGACCTTGACAAGTCATTTAAGTCAAAGGTGAAGAT
AGGAAATGCTGTTCAAAATAAAGATGCAAGACAAGAGCTTCTCTTTGGATCCACTCGAGAAGGAGTAGATAGCTTTCAAGTGTCAAGAGAAGTAGAGGTGAAACAACAAG
AAGTTTCTCTGGATCTTGCTGAACTGGTTGATGATGCACCAATTAGAGGGACTAGACTGCTGAGTGAGAGTTTGAGTGAATCCATACTCTATGTGAAGAAATCAGGTACT
AATCTTGTGATTGTATCTCTTTATGTAGATGACTTGTTGGTGACAGGAAGTGATGGTGTTCAAATAGAGGTTTTCAAGCAAGAAATGATGAAGCTGTTTGAGATGACTGA
TCTAGGTTTGATGCATTACTTCTTGGGTATGAAGATTCAGCAAGGATAG
Protein sequenceShow/hide protein sequence
MESRSNSLSSLAPSVFDGENYHAWAIRIQAYMEGYDYWQRLELAYMQLSPGIFNKIMALKLAKEIWEFLKGQYEGDERIKGMKVLNLQGRAHVAVQREEDKLFVATCVSL
VTQCDNWLVDSGCTNHMTSDKELFKDLDKSFKSKVKIGNAVQNKDARQELLFGSTREGVDSFQVSREVEVKQQEVSLDLAELVDDAPIRGTRLLSESLSESILYVKKSGT
NLVIVSLYVDDLLVTGSDGVQIEVFKQEMMKLFEMTDLGLMHYFLGMKIQQG