CuGenDBv2

Sed0014527 (gene) of Chayote v1 genome

Gene ID: Sed0014527
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase
Genome location: LG05:11071620..11076004
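The genome location string follows a `seqid:start..end` pattern. A minimal Python sketch for splitting it into its parts is given here; the names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, valid only under the 1-based inclusive assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG05:11071620..11076004'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(seqid, start, end)


loc = parse_location("LG05:11071620..11076004")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that coordinate assumption the gene spans 4385 bp on LG05.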
RNA-Seq Expression: Sed0014527
Synteny: Sed0014527
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    %identity    Alignment
KAA0032207.1 pol protein [Cucumis melo var. makuwa]    1.6e-115    49.68
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APVL VP+G+G  ++Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                   
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------

Query:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCVG
                  L+ +P LR+ +             RG A  GQ  +           +R++       + + +  EAH SPF+ HP  VKAP +     + 
Subjt:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCVG

Query:  RLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVCWISERV---
         LS+P WKWE +SMDFI+GL RT R F+ IWV+VDRLTKSA F+PG+ TYT  +WAQL++ +IVR HGV + IV        S   +GL   +  R+   
Subjt:  RLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVCWISERV---

Query:  ---------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
                                          +    MEFA NNSY++ I MAPFEALYGK C
Subjt:  ---------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

KAA0056702.1 pol protein [Cucumis melo var. makuwa]    2.3e-117    49.89
Query:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD
        VGYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APV+ VP+G+G  V+Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD
Subjt:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD

Query:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL------------------
        +ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                  
Subjt:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL------------------

Query:  -----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP--------------
                   L+ +P LR+ +             RG A  GQ  +           +R +       + + +  EAH SPF+ HP              
Subjt:  -----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP--------------

Query:  ------------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH
                          +QVKAP +     +  LS+P WKWE +SMDFI+GL RT R F+ IW +VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR H
Subjt:  ------------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH

Query:  GVLLRIV--------STVIRGLVCWISERV-----GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
        GV + IV        S   +GL   +  ++      +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  GVLLRIV--------STVIRGLVCWISERV-----GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

KAA0056806.1 pol protein [Cucumis melo var. makuwa]    6.6e-117    52.71
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APVL +P+G+G  V+Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL-------S
        ELAAVVFA KIWRHYLYGE+IQ+F+DH SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L++ +  L   L       S
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL-------S

Query:  G--------RGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAPGETSPVCVGRLSVPRWKWEEISMDF
        G        RG A  GQ  +           +R++       + + +  EAH SPF+ HP         ++VKAP +     +  LS+P WKWE +SMDF
Subjt:  G--------RGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAPGETSPVCVGRLSVPRWKWEEISMDF

Query:  ISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQI-------------VRYHGV-------LLRIVSTVIRGLVCWISERVGTRFRSM
        I+GL RT R F+ IWV+VD LTKSAHF+PG+ TYT  +WAQL++ +I               +H         L +++  ++R           +    M
Subjt:  ISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQI-------------VRYHGV-------LLRIVSTVIRGLVCWISERVGTRFRSM

Query:  EFANNNSYRSTIQMAPFEALYGKRC
        EFA NNSY++TI MAPFEALYGK C
Subjt:  EFANNNSYRSTIQMAPFEALYGKRC

KAA0059071.1 pol protein [Cucumis melo var. makuwa]    4.7e-115    48.95
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++S   LK+ L +APVL VP+G+G  V+Y+DAS  GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                   
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------

Query:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAP
                  L+ +P LR+ +             RG A  GQ  +           +R++       I + +  EAH SPF+ HP         ++VKAP
Subjt:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAP

Query:  GETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVC
         +     +  LS+P WKWE +SMDFI+GL RT + F+ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR HGV + IV        S   +GL  
Subjt:  GETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVC

Query:  WISERV------------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
         +  R+                                     +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  WISERV------------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

KAA0063718.1 pol protein [Cucumis melo var. makuwa]    5.6e-116    50.44
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRK   FVW   C++SF  LK+ L +APVL VP+G+G  V+Y+DASK GL CVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL--------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L++ +  L   L        
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL--------

Query:  --------------------------------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------------
                                          RG A  GQ  +           +R++       + + +  EAH SPF+ HP               
Subjt:  --------------------------------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------------

Query:  -----------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHG
                         +QVKAP +     +  LS+P WKWE +SMDFI+GL RT R F+ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR HG
Subjt:  -----------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHG

Query:  VLLRIVS-TVIRGLVCWISERVGTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
        V + IVS    R    +      +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  VLLRIVS-TVIRGLVCWISERVGTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

TrEMBL top hits    e value    %identity    Alignment
A0A5A7SS38 Pol protein    7.9e-116    49.68
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APVL VP+G+G  ++Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                   
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------

Query:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCVG
                  L+ +P LR+ +             RG A  GQ  +           +R++       + + +  EAH SPF+ HP  VKAP +     + 
Subjt:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCVG

Query:  RLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVCWISERV---
         LS+P WKWE +SMDFI+GL RT R F+ IWV+VDRLTKSA F+PG+ TYT  +WAQL++ +IVR HGV + IV        S   +GL   +  R+   
Subjt:  RLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVCWISERV---

Query:  ---------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
                                          +    MEFA NNSY++ I MAPFEALYGK C
Subjt:  ---------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

A0A5A7ULI8 Pol protein    1.1e-117    49.89
Query:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD
        VGYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APV+ VP+G+G  V+Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD
Subjt:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD

Query:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL------------------
        +ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                  
Subjt:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL------------------

Query:  -----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP--------------
                   L+ +P LR+ +             RG A  GQ  +           +R +       + + +  EAH SPF+ HP              
Subjt:  -----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP--------------

Query:  ------------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH
                          +QVKAP +     +  LS+P WKWE +SMDFI+GL RT R F+ IW +VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR H
Subjt:  ------------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH

Query:  GVLLRIV--------STVIRGLVCWISERV-----GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
        GV + IV        S   +GL   +  ++      +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  GVLLRIV--------STVIRGLVCWISERV-----GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

A0A5A7UR62 Reverse transcriptase    3.2e-117    52.71
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++SF  LK+ L +APVL +P+G+G  V+Y+DASK GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL-------S
        ELAAVVFA KIWRHYLYGE+IQ+F+DH SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L++ +  L   L       S
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL-------S

Query:  G--------RGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAPGETSPVCVGRLSVPRWKWEEISMDF
        G        RG A  GQ  +           +R++       + + +  EAH SPF+ HP         ++VKAP +     +  LS+P WKWE +SMDF
Subjt:  G--------RGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAPGETSPVCVGRLSVPRWKWEEISMDF

Query:  ISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQI-------------VRYHGV-------LLRIVSTVIRGLVCWISERVGTRFRSM
        I+GL RT R F+ IWV+VD LTKSAHF+PG+ TYT  +WAQL++ +I               +H         L +++  ++R           +    M
Subjt:  ISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQI-------------VRYHGV-------LLRIVSTVIRGLVCWISERVGTRFRSM

Query:  EFANNNSYRSTIQMAPFEALYGKRC
        EFA NNSY++TI MAPFEALYGK C
Subjt:  EFANNNSYRSTIQMAPFEALYGKRC

A0A5A7V003 Reverse transcriptase    2.3e-115    48.95
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRKG  FVW   C++S   LK+ L +APVL VP+G+G  V+Y+DAS  GLGCVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L                   
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVL-------------------

Query:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAP
                  L+ +P LR+ +             RG A  GQ  +           +R++       I + +  EAH SPF+ HP         ++VKAP
Subjt:  ----------LSPRPDLRESL-----------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------EQVKAP

Query:  GETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVC
         +     +  LS+P WKWE +SMDFI+GL RT + F+ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR HGV + IV        S   +GL  
Subjt:  GETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIV--------STVIRGLVC

Query:  WISERV------------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
         +  R+                                     +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  WISERV------------------------------------GTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

A0A5A7V810 Reverse transcriptase    2.7e-116    50.44
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM
        GYYRRFV++FS IA PLT+LTRK   FVW   C++SF  LK+ L +APVL VP+G+G  V+Y+DASK GL CVLMQ GKV+AYASRQLK +EQNYPTHD+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDM

Query:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL--------
        ELAAVVFA KIWRHYLYGE+IQ+F+DH+SLKY FTQKE NMRQRRWLELVKDYDC+ILYHPGKANVVAD LSRK SHSA L++ +  L   L        
Subjt:  ELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESL--------

Query:  --------------------------------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------------
                                          RG A  GQ  +           +R++       + + +  EAH SPF+ HP               
Subjt:  --------------------------------SGRGCAIVGQWAD----------VQRQVVCSGCCRIASRVALEAHDSPFAWHP---------------

Query:  -----------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHG
                         +QVKAP +     +  LS+P WKWE +SMDFI+GL RT R F+ IWV+VDRLTKSAHF+PG+ TYT  +WAQL++ +IVR HG
Subjt:  -----------------EQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHG

Query:  VLLRIVS-TVIRGLVCWISERVGTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
        V + IVS    R    +      +    MEFA NNSY++TI MAPFEALYGK C
Subjt:  VLLRIVS-TVIRGLVCWISERVGTRFRSMEFANNNSYRSTIQMAPFEALYGKRC

SwissProt top hits    e value    %identity    Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6    6.8e-32    37.86
Query:  PIGCLPVSRKVESCPVGYYRRFVKDFSAIAWPLTRLTRKGVEF-VWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAY
        PI   P   K      GYYR+F+ +F+ IA P+T+  +K ++    +   D +F +LK  ++  P+L VP+ T    +  DAS   LG VL Q G  ++Y
Subjt:  PIGCLPVSRKVESCPVGYYRRFVKDFSAIAWPLTRLTRKGVEF-VWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAY

Query:  ASRQLKPYEQNYPTHDMELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR----------
         SR L  +E NY T + EL A+V+A K +RHYL G   ++ SDHQ L +L+  K+ N +  RW   + ++D  I Y  GK N VAD LSR          
Subjt:  ASRQLKPYEQNYPTHDMELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR----------

Query:  KTSHSA
        +T HSA
Subjt:  KTSHSA

P10394 Retrovirus-related Pol polyprotein from transposon 412    2.3e-24    37.08
Query:  YYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQ----HGKVIAYASRQLKPYEQNYPT
        YYRRF+K+F+  +  +TRL +K V F W   C ++F  LK  L +  +L  P+ +    +  DASK   G VL Q    H   +AYASR     E N  T
Subjt:  YYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQ----HGKVIAYASRQLKPYEQNYPT

Query:  HDMELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKT
         + ELAA+ +A   +R Y+YG+   + +DH+ L YLF+    + +  R    +++Y+  + Y  GK N VAD LSR T
Subjt:  HDMELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKT

P20825 Retrovirus-related Pol polyprotein from transposon 297    6.6e-27    38.51
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEF-VWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD
        GYYR+F+ +++ IA P+T   +K  +         E+F +LK  +   P+L +P+     V+  DAS   LG VL Q+G  I++ SR L  +E NY   +
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEF-VWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHD

Query:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR
         EL A+V+A K +RHYL G +  + SDHQ L++L   KE   +  RW   + +Y  +I Y  GK N VAD LSR
Subjt:  MELAAVVFARKIWRHYLYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus    9.2e-29    37.77
Query:  YYRRFVKDFSAIAWPLTRLTR-----------KGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQ----HGKVIAYASR
        YYR+F++D++ +A PLT LTR             V    D    +SF  LK  L S+ +LA P  T    +  DAS + +G VL Q      + IAY SR
Subjt:  YYRRFVKDFSAIAWPLTRLTR-----------KGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQ----HGKVIAYASR

Query:  QLKPYEQNYPTHDMELAAVVFARKIWRHYLYGE-RIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR
         L   E+NY T + E+ A++++    R YLYG   I++++DHQ L +    + FN + +RW   +++Y+C+++Y PGK+NVVAD LSR
Subjt:  QLKPYEQNYPTHDMELAAVVFARKIWRHYLYGE-RIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSR

Q9UR07 Transposon Tf2-11 polyprotein    4.0e-24    24.94
Query:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGK-----VIAYASRQLKPYEQN
        V Y R+F+   S +  PL  L +K V + W     ++   +K+ L S PVL   + +  +++  DAS   +G VL Q         + Y S ++   + N
Subjt:  VGYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGK-----VIAYASRQLKPYEQN

Query:  YPTHDMELAAVVFARKIWRHYLYG--ERIQMFSDHQSLKYLFTQKE--FNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRE
        Y   D E+ A++ + K WRHYL    E  ++ +DH++L    T +    N R  RW   ++D++ +I Y PG AN +AD LSR    +     P P   E
Subjt:  YPTHDMELAAVVFARKIWRHYLYG--ERIQMFSDHQSLKYLFTQKE--FNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRE

Query:  SLSGRGCAIVGQWADVQRQVVC-------------SGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCV--------GRLSVP-------------R
          S      +    D + QVV              +   R+   + L+  D       +Q+  P +T             G+L  P              
Subjt:  SLSGRGCAIVGQWADVQRQVVC-------------SGCCRIASRVALEAHDSPFAWHPEQVKAPGETSPVCV--------GRLSVP-------------R

Query:  WK---------------------------------------WEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH
        WK                                       WE +SMDFI+ L  +S  ++ ++V+VDR +K A  +P   + T EQ A++F ++++ Y 
Subjt:  WK---------------------------------------WEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYH

Query:  G
        G
Subjt:  G

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    5.0e-06    44.44
Query:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPE
        GYYRRFVK++  I  PLT L +K     W      +F  LK ++T+ PVLA+P+
Subjt:  GYYRRFVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPE


Sequences
CDS sequence
ATGTATATCGTATTGTACCTCGGTTGGGGATTCTTACTGAGCAATTCGTTGCTCACCCTTATTACTGTGGTTTTTTCCACAGGTCATTTTCCTATGATCACTAGCTCTCC
GACAGTTATCAGAGCGCCATCATGCCAACTTCCCGCAGTGTGTGATGTATGTTGTGGCTTGTTGTGGTATCCAGATAGATTCGGATGCCCTGGGACATCGTTCCAGAGGG
AGAGAGTGGTTGCAAAGTTGGTAGCCTTCAAGGAGGTTGATGAGGCATCTTGGTGCTTCGGTAAAGTGTTGTTGTTTGCGGCCAAGAAGCGGAACCCGTCGTGCAAAAGG
GCTAGTTCAGTGGTAATCAAAAGATTTCAAGCGAAAGGAGTGTTCGAGAGGAGCCGGGCCTTTTCCCAGTGTTACTGGAGGTTGAGTAAGTCAAGCAGGTTGACGGTTGG
AGAGTTGAGCATCGTAGAAGCAAGTTGGTTCCAGCTGAGGGACCATGAGTTGGATGCCTATAAGAGCAAGGTTGCCACAGCTTCGAGCATCGTGTGGAGATGGAACTTTG
GGTGTTCCATCGATTGGTTGTATCTTTGCCAGGAGCTAGTGTGGCTTTTGGGAAAAGCATTTGTTAGTCTTGGATCGCTAAATGAAGGTTGGGAGTTGTGCACACAAGTG
TGTTTTGTGTGGTTCGTTCTGTGGCGAGAATTTCAGATCCATGTAGGGTTTGCTTTGGGGATCTGGTGCTCAGTTCTTGGAAAAGTTGGACTCCAGATGTTATGTTGTGG
TGGACGTCTGAGTAGGAGGATGACGCATTCGGTAGCGTTTCGGGTCCTCTTGGTTTCCAAGGAGTTGGAGTTGGCCAGCATGATTGTGTTGCTGGAAAAAGAAGTGTGGA
TTGAAATTGCTAGAAGTCAGTTTGCTCGTGGGGCGAATGGTTCGTGCCCTATTGGTTGTCTTCCTGTTTCTAGGAAAGTTGAGAGTTGCCCCGTTGGTTACTACCGTCGG
TTTGTTAAGGATTTTTCCGCCATAGCCTGGCCCCTCACTCGTCTGACGAGGAAGGGTGTCGAGTTTGTTTGGGACGGTAGGTGTGATGAGAGCTTTGCCAGGCTCAAGCG
CAGTTTGACTTCAGCACCAGTGTTGGCAGTCCCGGAGGGGACAGGTGGTTTGGTGGTCTACAACGATGCTTCTAAGTTCGGGTTGGGTTGTGTATTGATGCAGCACGGGA
AGGTCATTGCTTATGCTTCCCGTCAGCTGAAGCCATATGAGCAGAACTATCCGACTCATGATATGGAATTGGCAGCAGTGGTGTTTGCACGGAAGATTTGGAGGCATTAC
CTGTATGGTGAGCGGATTCAGATGTTTTCCGATCACCAGAGCCTCAAGTATCTGTTTACTCAGAAGGAATTCAATATGAGGCAGAGGAGATGGTTAGAGTTGGTGAAGGA
TTATGACTGCCAGATTTTGTACCATCCGGGTAAAGCCAATGTTGTAGCTGATACTTTGAGTAGGAAGACGTCGCACTCCGCGGTATTGTTGAGTCCTCGGCCAGATTTGC
GTGAGAGTTTGAGCGGTCGTGGTTGCGCGATTGTTGGACAGTGGGCTGACGTACAGAGGCAGGTTGTATGTTCCGGATGTTGCAGGATTGCGAGCAGAGTTGCTCTCGAG
GCCCACGATTCTCCGTTTGCTTGGCACCCGGAGCAAGTGAAGGCACCAGGCGAGACCAGCCCGGTTTGTGTTGGCCGGCTGAGTGTACCTCGGTGGAAATGGGAAGAGAT
CTCCATGGACTTTATTTCTGGTTTGTCGAGGACTTCGAGGAAGTTCTCTCAGATTTGGGTGATAGTCGATCGATTGACTAAGAGCGCACACTTCATTCCCGGTAGAGACA
CTTATACCGTGGAACAGTGGGCGCAGTTGTTTTTGGAACAAATTGTTCGTTATCATGGTGTATTACTGAGGATTGTGTCGACCGTGATTCGAGGGCTTGTGTGTTGGATT
TCGGAGCGAGTTGGGACCAGATTCAGATCGATGGAGTTTGCAAATAACAACAGTTATCGGTCGACGATTCAGATGGCTCCATTCGAGGCTTTGTATGGTAAGCGTTGCTG
A
mRNA sequence
ATGTATATCGTATTGTACCTCGGTTGGGGATTCTTACTGAGCAATTCGTTGCTCACCCTTATTACTGTGGTTTTTTCCACAGGTCATTTTCCTATGATCACTAGCTCTCC
GACAGTTATCAGAGCGCCATCATGCCAACTTCCCGCAGTGTGTGATGTATGTTGTGGCTTGTTGTGGTATCCAGATAGATTCGGATGCCCTGGGACATCGTTCCAGAGGG
AGAGAGTGGTTGCAAAGTTGGTAGCCTTCAAGGAGGTTGATGAGGCATCTTGGTGCTTCGGTAAAGTGTTGTTGTTTGCGGCCAAGAAGCGGAACCCGTCGTGCAAAAGG
GCTAGTTCAGTGGTAATCAAAAGATTTCAAGCGAAAGGAGTGTTCGAGAGGAGCCGGGCCTTTTCCCAGTGTTACTGGAGGTTGAGTAAGTCAAGCAGGTTGACGGTTGG
AGAGTTGAGCATCGTAGAAGCAAGTTGGTTCCAGCTGAGGGACCATGAGTTGGATGCCTATAAGAGCAAGGTTGCCACAGCTTCGAGCATCGTGTGGAGATGGAACTTTG
GGTGTTCCATCGATTGGTTGTATCTTTGCCAGGAGCTAGTGTGGCTTTTGGGAAAAGCATTTGTTAGTCTTGGATCGCTAAATGAAGGTTGGGAGTTGTGCACACAAGTG
TGTTTTGTGTGGTTCGTTCTGTGGCGAGAATTTCAGATCCATGTAGGGTTTGCTTTGGGGATCTGGTGCTCAGTTCTTGGAAAAGTTGGACTCCAGATGTTATGTTGTGG
TGGACGTCTGAGTAGGAGGATGACGCATTCGGTAGCGTTTCGGGTCCTCTTGGTTTCCAAGGAGTTGGAGTTGGCCAGCATGATTGTGTTGCTGGAAAAAGAAGTGTGGA
TTGAAATTGCTAGAAGTCAGTTTGCTCGTGGGGCGAATGGTTCGTGCCCTATTGGTTGTCTTCCTGTTTCTAGGAAAGTTGAGAGTTGCCCCGTTGGTTACTACCGTCGG
TTTGTTAAGGATTTTTCCGCCATAGCCTGGCCCCTCACTCGTCTGACGAGGAAGGGTGTCGAGTTTGTTTGGGACGGTAGGTGTGATGAGAGCTTTGCCAGGCTCAAGCG
CAGTTTGACTTCAGCACCAGTGTTGGCAGTCCCGGAGGGGACAGGTGGTTTGGTGGTCTACAACGATGCTTCTAAGTTCGGGTTGGGTTGTGTATTGATGCAGCACGGGA
AGGTCATTGCTTATGCTTCCCGTCAGCTGAAGCCATATGAGCAGAACTATCCGACTCATGATATGGAATTGGCAGCAGTGGTGTTTGCACGGAAGATTTGGAGGCATTAC
CTGTATGGTGAGCGGATTCAGATGTTTTCCGATCACCAGAGCCTCAAGTATCTGTTTACTCAGAAGGAATTCAATATGAGGCAGAGGAGATGGTTAGAGTTGGTGAAGGA
TTATGACTGCCAGATTTTGTACCATCCGGGTAAAGCCAATGTTGTAGCTGATACTTTGAGTAGGAAGACGTCGCACTCCGCGGTATTGTTGAGTCCTCGGCCAGATTTGC
GTGAGAGTTTGAGCGGTCGTGGTTGCGCGATTGTTGGACAGTGGGCTGACGTACAGAGGCAGGTTGTATGTTCCGGATGTTGCAGGATTGCGAGCAGAGTTGCTCTCGAG
GCCCACGATTCTCCGTTTGCTTGGCACCCGGAGCAAGTGAAGGCACCAGGCGAGACCAGCCCGGTTTGTGTTGGCCGGCTGAGTGTACCTCGGTGGAAATGGGAAGAGAT
CTCCATGGACTTTATTTCTGGTTTGTCGAGGACTTCGAGGAAGTTCTCTCAGATTTGGGTGATAGTCGATCGATTGACTAAGAGCGCACACTTCATTCCCGGTAGAGACA
CTTATACCGTGGAACAGTGGGCGCAGTTGTTTTTGGAACAAATTGTTCGTTATCATGGTGTATTACTGAGGATTGTGTCGACCGTGATTCGAGGGCTTGTGTGTTGGATT
TCGGAGCGAGTTGGGACCAGATTCAGATCGATGGAGTTTGCAAATAACAACAGTTATCGGTCGACGATTCAGATGGCTCCATTCGAGGCTTTGTATGGTAAGCGTTGCTG
A
Protein sequence
MYIVLYLGWGFLLSNSLLTLITVVFSTGHFPMITSSPTVIRAPSCQLPAVCDVCCGLLWYPDRFGCPGTSFQRERVVAKLVAFKEVDEASWCFGKVLLFAAKKRNPSCKR
ASSVVIKRFQAKGVFERSRAFSQCYWRLSKSSRLTVGELSIVEASWFQLRDHELDAYKSKVATASSIVWRWNFGCSIDWLYLCQELVWLLGKAFVSLGSLNEGWELCTQV
CFVWFVLWREFQIHVGFALGIWCSVLGKVGLQMLCCGGRLSRRMTHSVAFRVLLVSKELELASMIVLLEKEVWIEIARSQFARGANGSCPIGCLPVSRKVESCPVGYYRR
FVKDFSAIAWPLTRLTRKGVEFVWDGRCDESFARLKRSLTSAPVLAVPEGTGGLVVYNDASKFGLGCVLMQHGKVIAYASRQLKPYEQNYPTHDMELAAVVFARKIWRHY
LYGERIQMFSDHQSLKYLFTQKEFNMRQRRWLELVKDYDCQILYHPGKANVVADTLSRKTSHSAVLLSPRPDLRESLSGRGCAIVGQWADVQRQVVCSGCCRIASRVALE
AHDSPFAWHPEQVKAPGETSPVCVGRLSVPRWKWEEISMDFISGLSRTSRKFSQIWVIVDRLTKSAHFIPGRDTYTVEQWAQLFLEQIVRYHGVLLRIVSTVIRGLVCWI
SERVGTRFRSMEFANNNSYRSTIQMAPFEALYGKRC
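
The CDS and protein records above can be cross-checked by translating the CDS. The following Python sketch assumes the standard genetic code (NCBI translation table 1); the helper names `translate` and `matches_protein` are hypothetical and are not part of CuGenDB. The two short inputs in the example are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing anything other than A/C/G/T are reported as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """Compare the translated CDS (trailing stop removed) with a protein record."""
    return translate(cds).rstrip("*") == "".join(protein.split())


# First 30 and last 15 nucleotides of the CDS shown above.
print(translate("ATGTATATCGTATTGTACCTCGGTTGGGGA"))  # MYIVLYLGWG
print(translate("GGTAAGCGTTGCTGA"))                 # GKRC*
```

Passing the full CDS and protein text, line breaks included, to `matches_protein` performs the same comparison over the whole record.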