; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21630 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21630
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr3:17919094..17921629
RNA-Seq ExpressionCSPI03G21630
SyntenyCSPI03G21630
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO45752.1 pol protein [Cucumis melo subsp. melo]0.0e+0073.34Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHY+FIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQ LKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I V VG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG------------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSM
        +SSDGGL+ ER LCVP++SA+K +LL EAH+S FSMH G                              +CQQVKAPRQKPA LLQPLS+PEWKWE VSM
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG------------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSM

Query:  DFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL
        DFITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERL
Subjt:  DFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL

Query:  NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        NQ+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRR
Subjt:  NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEI
        K+LEF VGDKVF KVAP+KGV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+
Subjt:  KNLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEI

Query:  LAREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELF
        LAR VK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF
Subjt:  LAREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELF

KAA0026271.1 pol protein [Cucumis melo var. makuwa]0.0e+0072.9Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQ +DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL+ ERCLCVP++SA K +LL EAH+S FSMH G                             +CQQVKAPRQKP  LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLH VPV IVSD+DARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGM PFEALYGKCCR+PVCW EVGEQ LMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEFA+GDKVF KVAP++GV+RFE++ KLSPRFV PFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSY EQPVE+L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD
        AREVK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF++
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.0e+0073.14Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGY+QLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL  E  LCVP++SA+K +LL EAH+S FSMH G                             +CQQVK PRQKPA LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQ DGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYGKCCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEF + DKVF KVAP+KGV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD
        AREVK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF++
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD

KAA0051368.1 pol protein [Cucumis melo var. makuwa]0.0e+0075.43Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLH-------------
        +FSK DLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF+ FLDTFVI+FIDDIL+YSKTEAEHEEHL              
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLH-------------

Query:  ------KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKRGL
              K+  +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ                    SDASK+GL
Subjt:  ------KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKRGL

Query:  GFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADA
        G VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWLELVKD+D EILYHPGKANV+ADA
Subjt:  GFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADA

Query:  LSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIKI
        LSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS+SSDGGL+ ER LCVP++SA+K 
Subjt:  LSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIKI

Query:  DLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLT
        +LL EAH+S FSMH G                             +CQQVKAPRQKPA LLQPLS+PEWKWE VSMDFITGLPRTL+GFTVIWVVVDRLT
Subjt:  DLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLNQ+LEDMLRAC LEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLH

Query:  LMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMRF
        LMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK+LEF +GDKVF KVAP+KGV+RF
Subjt:  LMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMRF

Query:  EKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHR
        E++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+LAREVK LRN+ IPL+KVLW+NHR
Subjt:  EKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHR

Query:  IEEATWEREAEMKTRYPELFQD
        +EEATWERE +M++RYPELF++
Subjt:  IEEATWEREAEMKTRYPELFQD

KAA0066456.1 pol protein [Cucumis melo var. makuwa]0.0e+0072.99Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGL  YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VS+G VT QLA+L VQPTLR+++IDAQ +DPYLVE+R LVE GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL+ ER LCVP++SA+KI+LL EAH+S FSMH G                             +CQQVKAPRQKPA LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEF VGDKVF KVAP++GV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSY E+PV++L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQ
        AREVK LRN+ IPL+KVLW+NHR+EEATWE E +M++RYPELF+
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQ

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein0.0e+0072.9Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQ +DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL+ ERCLCVP++SA K +LL EAH+S FSMH G                             +CQQVKAPRQKP  LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLH VPV IVSD+DARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGM PFEALYGKCCR+PVCW EVGEQ LMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEFA+GDKVF KVAP++GV+RFE++ KLSPRFV PFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSY EQPVE+L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD
        AREVK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF++
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD

A0A5A7U7V9 Reverse transcriptase0.0e+0075.43Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLH-------------
        +FSK DLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF+ FLDTFVI+FIDDIL+YSKTEAEHEEHL              
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLH-------------

Query:  ------KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKRGL
              K+  +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ                    SDASK+GL
Subjt:  ------KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ--------------------SDASKRGL

Query:  GFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADA
        G VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWLELVKD+D EILYHPGKANV+ADA
Subjt:  GFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADA

Query:  LSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIKI
        LSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS+SSDGGL+ ER LCVP++SA+K 
Subjt:  LSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLERCLCVPTNSAIKI

Query:  DLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLT
        +LL EAH+S FSMH G                             +CQQVKAPRQKPA LLQPLS+PEWKWE VSMDFITGLPRTL+GFTVIWVVVDRLT
Subjt:  DLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLNQ+LEDMLRAC LEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLH

Query:  LMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMRF
        LMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK+LEF +GDKVF KVAP+KGV+RF
Subjt:  LMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMRF

Query:  EKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHR
        E++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+LAREVK LRN+ IPL+KVLW+NHR
Subjt:  EKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHR

Query:  IEEATWEREAEMKTRYPELFQD
        +EEATWERE +M++RYPELF++
Subjt:  IEEATWEREAEMKTRYPELFQD

A0A5A7UAA8 Reverse transcriptase0.0e+0073.14Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGY+QLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VSVG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL  E  LCVP++SA+K +LL EAH+S FSMH G                             +CQQVK PRQKPA LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQ DGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYGKCCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEF + DKVF KVAP+KGV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD
        AREVK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF++
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD

A0A5A7VJE2 Reverse transcriptase0.0e+0072.99Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHYEFIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGL  YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQNLKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I VS+G VT QLA+L VQPTLR+++IDAQ +DPYLVE+R LVE GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        +SSDGGL+ ER LCVP++SA+KI+LL EAH+S FSMH G                             +CQQVKAPRQKPA LLQPLS+PEWKWE VSMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG-----------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERLN
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK
        Q+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRRK
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRK

Query:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL
        +LEF VGDKVF KVAP++GV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSY E+PV++L
Subjt:  NLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEIL

Query:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQ
        AREVK LRN+ IPL+KVLW+NHR+EEATWE E +M++RYPELF+
Subjt:  AREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQ

Q84KB0 Pol protein0.0e+0073.34Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +FSKIDLRSGYHQLRI+D ++PKT+FRSRYGHY+FIVMSFGLTNAP VFMDLMNRVF++FLDTFVI+FIDDIL+YSKTEAEHEEHL  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W+RPSTVSEVRSFLGLA YYRRFVE+FS +ATPLTQLTRK   F WS ACEDSFQ LKQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL
                       SDASK+GLG VLMQQ KVVA ASRQLKSHEQNYPTHDLELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELN RQRRWL
Subjt:  ---------------SDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNTRQRRWL

Query:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS
        ELVKD+D EILYHPGKANV+ADALSRK                   RA+I V VG VT QLAQL VQPTLR+++IDAQS+DPYLVE+R L E GQ  EFS
Subjt:  ELVKDHDYEILYHPGKANVIADALSRK-------------------RAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG------------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSM
        +SSDGGL+ ER LCVP++SA+K +LL EAH+S FSMH G                              +CQQVKAPRQKPA LLQPLS+PEWKWE VSM
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLG------------------------------ICQQVKAPRQKPARLLQPLSVPEWKWEYVSM

Query:  DFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL
        DFITGLPRTL+GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLY++EIVRLHGVPV IVSDRDARFTSKFWKGLQ AMGTRLDFSTAFHPQTDGQTERL
Subjt:  DFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERL

Query:  NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        NQ+LEDMLRAC LEFPGSWDSHLHLMEFAYNNS+ ATIGMAPFEALYG+CCR+PVCW EVGEQRLMGPELVQSTNEAIQKIR+RM  A SRQKSYADVRR
Subjt:  NQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEI
        K+LEF VGDKVF KVAP+KGV+RFE++ KLSPRFVGPFEILERIG VAYRLALPPSLS +H+VFHVSMLRKYV D SHVVDYEPLEIDENLSYVEQPVE+
Subjt:  KNLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRKYVADISHVVDYEPLEIDENLSYVEQPVEI

Query:  LAREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELF
        LAR VK LRN+ IPL+KVLW+NHR+EEATWERE +M++RYPELF
Subjt:  LAREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.6e-8429.33Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +F+K+DL+S YH +R+R  +  K +FR   G +E++VM +G++ AP  F   +N +  +  ++ V+ ++DDIL++SK+E+EH +H+  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W +P    E+R FLG   Y R+F+   S L  PL  L +K   + W+P    + +N+KQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--
                       +DAS   +G VL Q+   DK   V   S ++   + NY   D E+  ++ +LK WRHYL    E  +I TDH++L    T +   
Subjt:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--

Query:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS
         N R  RW   ++D ++EI Y PG AN IADALSR   +       S     + + Q+ +    + +V+   ++D  L+     E +R+ E  Q+ D   
Subjt:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        I+S   ++L      P ++ +   ++ + H     +H GI                             CQ  K+   KP   LQP+   E  WE +SMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        Q +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++      +P+      ++     E  Q T +  Q ++  +   + + K Y D++ 
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY
        + + EF  GD V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS L KY
Subjt:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein1.6e-8429.33Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +F+K+DL+S YH +R+R  +  K +FR   G +E++VM +G++ AP  F   +N +  +  ++ V+ ++DDIL++SK+E+EH +H+  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W +P    E+R FLG   Y R+F+   S L  PL  L +K   + W+P    + +N+KQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--
                       +DAS   +G VL Q+   DK   V   S ++   + NY   D E+  ++ +LK WRHYL    E  +I TDH++L    T +   
Subjt:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--

Query:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS
         N R  RW   ++D ++EI Y PG AN IADALSR   +       S     + + Q+ +    + +V+   ++D  L+     E +R+ E  Q+ D   
Subjt:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        I+S   ++L      P ++ +   ++ + H     +H GI                             CQ  K+   KP   LQP+   E  WE +SMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        Q +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++      +P+      ++     E  Q T +  Q ++  +   + + K Y D++ 
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY
        + + EF  GD V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS L KY
Subjt:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein1.6e-8429.33Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +F+K+DL+S YH +R+R  +  K +FR   G +E++VM +G++ AP  F   +N +  +  ++ V+ ++DDIL++SK+E+EH +H+  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W +P    E+R FLG   Y R+F+   S L  PL  L +K   + W+P    + +N+KQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--
                       +DAS   +G VL Q+   DK   V   S ++   + NY   D E+  ++ +LK WRHYL    E  +I TDH++L    T +   
Subjt:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--

Query:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS
         N R  RW   ++D ++EI Y PG AN IADALSR   +       S     + + Q+ +    + +V+   ++D  L+     E +R+ E  Q+ D   
Subjt:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        I+S   ++L      P ++ +   ++ + H     +H GI                             CQ  K+   KP   LQP+   E  WE +SMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        Q +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++      +P+      ++     E  Q T +  Q ++  +   + + K Y D++ 
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY
        + + EF  GD V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS L KY
Subjt:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY

P0CT37 Transposon Tf2-4 polyprotein1.6e-8429.33Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +F+K+DL+S YH +R+R  +  K +FR   G +E++VM +G++ AP  F   +N +  +  ++ V+ ++DDIL++SK+E+EH +H+  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W +P    E+R FLG   Y R+F+   S L  PL  L +K   + W+P    + +N+KQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--
                       +DAS   +G VL Q+   DK   V   S ++   + NY   D E+  ++ +LK WRHYL    E  +I TDH++L    T +   
Subjt:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--

Query:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS
         N R  RW   ++D ++EI Y PG AN IADALSR   +       S     + + Q+ +    + +V+   ++D  L+     E +R+ E  Q+ D   
Subjt:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        I+S   ++L      P ++ +   ++ + H     +H GI                             CQ  K+   KP   LQP+   E  WE +SMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        Q +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++      +P+      ++     E  Q T +  Q ++  +   + + K Y D++ 
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY
        + + EF  GD V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS L KY
Subjt:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein1.6e-8429.33Show/hide
Query:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------
        +F+K+DL+S YH +R+R  +  K +FR   G +E++VM +G++ AP  F   +N +  +  ++ V+ ++DDIL++SK+E+EH +H+  VL+         
Subjt:  MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLE---------

Query:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----
                                         +  W +P    E+R FLG   Y R+F+   S L  PL  L +K   + W+P    + +N+KQ     
Subjt:  ---------------------------------LFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQ-----

Query:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--
                       +DAS   +G VL Q+   DK   V   S ++   + NY   D E+  ++ +LK WRHYL    E  +I TDH++L    T +   
Subjt:  ---------------SDASKRGLGFVLMQQ---DKV--VANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--

Query:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS
         N R  RW   ++D ++EI Y PG AN IADALSR   +       S     + + Q+ +    + +V+   ++D  L+     E +R+ E  Q+ D   
Subjt:  LNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITV----SVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLV-----ERRRLVETGQI-DEFS

Query:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD
        I+S   ++L      P ++ +   ++ + H     +H GI                             CQ  K+   KP   LQP+   E  WE +SMD
Subjt:  ISSDGGLMLERCLCVPTNSAIKIDLLDEAHNSLFSMHLGI-----------------------------CQQVKAPRQKPARLLQPLSVPEWKWEYVSMD

Query:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVRLHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLN

Query:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR
        Q +E +LR      P +W  H+ L++ +YNN+ H+   M PFE ++      +P+      ++     E  Q T +  Q ++  +   + + K Y D++ 
Subjt:  QILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYG-KCCRTPVCWSEVGEQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRR

Query:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY
        + + EF  GD V  K     G +   K  KL+P F GPF +L++ G   Y L LP S+  +  + FHVS L KY
Subjt:  KNL-EFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSAL-HNVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.2e-0738.46Show/hide
Query:  KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLK
        K+  +  W  P   +E+R FLGL  YYRRFV+++  +  PLT+L +K +L  W+     +F+ LK
Subjt:  KVLELFDWSRPSTVSEVRSFLGLAVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCTAAGATTGACTTACGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTAATATTCCTAAGACTTCTTTTCGTTCGAGGTATGGGCATTATGAATTCATTGT
AATGTCTTTTGGTTTGACCAACGCACCTGTAGTATTTATGGATTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAATTTTCATTGATGATATTTTGG
TTTATTCCAAGACCGAGGCCGAACATGAGGAACATTTACATAAAGTATTAGAACTCTTCGATTGGTCTCGACCCTCTACAGTTAGTGAGGTTCGTAGTTTTCTGGGTTTA
GCAGTGTATTACCGGAGGTTTGTGGAGGATTTTTCACATTTGGCTACTCCCTTGACTCAATTGACCAGGAAGAGAACTCTGTTTTTTTGGAGTCCAGCTTGTGAGGATAG
TTTTCAGAACCTTAAGCAAAGTGATGCTTCCAAGAGAGGACTTGGTTTTGTTTTGATGCAGCAAGATAAGGTAGTTGCTAATGCCTCTCGTCAGTTGAAGAGTCATGAGC
AGAACTACCCTACACATGATTTGGAGTTGGCAACAGTGGTTTTTGCACTAAAGATATGGAGACATTACTTGTATGGTGAAAAGATACAAATCTTTACTGACCATAAGAGC
CTGAAGTATTTCTTCACCCAGAAGGAGTTAAATACGAGACAGCGAAGGTGGCTCGAGTTGGTAAAAGATCATGACTATGAGATATTGTACCATCCTGGTAAGGCGAATGT
GATAGCAGATGCTCTTAGTAGAAAAAGAGCTAAGATTACAGTATCCGTGGGGACGGTCACCTCACAGTTGGCTCAGTTAATGGTGCAACCAACGCTGAGGCGGAAGGTTA
TTGATGCTCAGAGTAGTGATCCTTATTTGGTGGAGAGACGTCGCCTCGTTGAAACAGGTCAGATTGATGAGTTTTCCATATCCTCTGATGGTGGACTAATGTTAGAGAGA
TGTTTGTGTGTGCCAACAAACAGTGCAATTAAGATTGACTTACTAGATGAAGCCCATAATTCCCTGTTTTCCATGCATCTTGGTATATGTCAGCAGGTTAAGGCACCTAG
ACAAAAACCAGCACGTTTGTTACAACCTTTAAGTGTGCCAGAGTGGAAGTGGGAGTATGTGTCGATGGATTTTATTACAGGGCTGCCTAGAACCCTGAAGGGTTTCACAG
TGATTTGGGTTGTTGTAGACAGGCTTACGAAATCGGCACATTTCGTACCAGGGAAATCCACTTATACTGATAGTAAGTGGGCACAGTTGTATTTAACTGAGATTGTGAGA
CTGCATGGAGTGCCTGTGCCGATCGTTTCTGACAGGGATGCACGATTTACTTCCAAGTTTTGGAAGGGACTTCAGATTGCCATGGGTACGAGGTTAGATTTCAGTACAGC
TTTTCATCCACAGACTGATGGTCAAACTGAACGTTTGAACCAAATTTTAGAGGATATGCTACGAGCTTGTGTTCTAGAGTTTCCAGGTAGTTGGGACTCTCATTTACATT
TGATGGAGTTTGCTTATAACAACAGTTTCCATGCTACCATTGGTATGGCACCATTTGAGGCTTTATATGGTAAATGTTGTAGAACCCCTGTTTGTTGGAGTGAGGTTGGT
GAACAAAGGTTGATGGGACCTGAGTTGGTTCAGTCCACAAATGAGGCTATTCAGAAGATTAGAACACGTATGCAGATAGCGCACAGTAGACAGAAGAGTTATGCGGATGT
GAGACGGAAAAACCTTGAGTTTGCGGTGGGAGATAAGGTATTCTTTAAGGTAGCACCTATAAAAGGTGTTATGCGTTTTGAGAAGAAAGCGAAGTTGAGTCCTCGTTTTG
TTGGACCGTTTGAGATCTTAGAGAGAATTGGTGTTGTGGCGTATCGCTTGGCGCTACCACCATCTCTCTCCGCACTTCATAATGTTTTCCATGTTTCGATGTTGAGGAAG
TATGTGGCAGATATATCTCATGTAGTGGACTATGAACCCTTGGAGATTGATGAGAATTTGAGCTATGTGGAACAACCTGTGGAGATTCTGGCTAGAGAGGTAAAGATGCT
TCGTAATAGAAGCATTCCATTAATAAAGGTTTTGTGGCAGAATCATCGAATTGAAGAGGCGACATGGGAGCGAGAGGCTGAGATGAAAACTCGATATCCGGAGTTATTTC
AGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTCTAAGATTGACTTACGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTAATATTCCTAAGACTTCTTTTCGTTCGAGGTATGGGCATTATGAATTCATTGT
AATGTCTTTTGGTTTGACCAACGCACCTGTAGTATTTATGGATTTGATGAATAGGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAATTTTCATTGATGATATTTTGG
TTTATTCCAAGACCGAGGCCGAACATGAGGAACATTTACATAAAGTATTAGAACTCTTCGATTGGTCTCGACCCTCTACAGTTAGTGAGGTTCGTAGTTTTCTGGGTTTA
GCAGTGTATTACCGGAGGTTTGTGGAGGATTTTTCACATTTGGCTACTCCCTTGACTCAATTGACCAGGAAGAGAACTCTGTTTTTTTGGAGTCCAGCTTGTGAGGATAG
TTTTCAGAACCTTAAGCAAAGTGATGCTTCCAAGAGAGGACTTGGTTTTGTTTTGATGCAGCAAGATAAGGTAGTTGCTAATGCCTCTCGTCAGTTGAAGAGTCATGAGC
AGAACTACCCTACACATGATTTGGAGTTGGCAACAGTGGTTTTTGCACTAAAGATATGGAGACATTACTTGTATGGTGAAAAGATACAAATCTTTACTGACCATAAGAGC
CTGAAGTATTTCTTCACCCAGAAGGAGTTAAATACGAGACAGCGAAGGTGGCTCGAGTTGGTAAAAGATCATGACTATGAGATATTGTACCATCCTGGTAAGGCGAATGT
GATAGCAGATGCTCTTAGTAGAAAAAGAGCTAAGATTACAGTATCCGTGGGGACGGTCACCTCACAGTTGGCTCAGTTAATGGTGCAACCAACGCTGAGGCGGAAGGTTA
TTGATGCTCAGAGTAGTGATCCTTATTTGGTGGAGAGACGTCGCCTCGTTGAAACAGGTCAGATTGATGAGTTTTCCATATCCTCTGATGGTGGACTAATGTTAGAGAGA
TGTTTGTGTGTGCCAACAAACAGTGCAATTAAGATTGACTTACTAGATGAAGCCCATAATTCCCTGTTTTCCATGCATCTTGGTATATGTCAGCAGGTTAAGGCACCTAG
ACAAAAACCAGCACGTTTGTTACAACCTTTAAGTGTGCCAGAGTGGAAGTGGGAGTATGTGTCGATGGATTTTATTACAGGGCTGCCTAGAACCCTGAAGGGTTTCACAG
TGATTTGGGTTGTTGTAGACAGGCTTACGAAATCGGCACATTTCGTACCAGGGAAATCCACTTATACTGATAGTAAGTGGGCACAGTTGTATTTAACTGAGATTGTGAGA
CTGCATGGAGTGCCTGTGCCGATCGTTTCTGACAGGGATGCACGATTTACTTCCAAGTTTTGGAAGGGACTTCAGATTGCCATGGGTACGAGGTTAGATTTCAGTACAGC
TTTTCATCCACAGACTGATGGTCAAACTGAACGTTTGAACCAAATTTTAGAGGATATGCTACGAGCTTGTGTTCTAGAGTTTCCAGGTAGTTGGGACTCTCATTTACATT
TGATGGAGTTTGCTTATAACAACAGTTTCCATGCTACCATTGGTATGGCACCATTTGAGGCTTTATATGGTAAATGTTGTAGAACCCCTGTTTGTTGGAGTGAGGTTGGT
GAACAAAGGTTGATGGGACCTGAGTTGGTTCAGTCCACAAATGAGGCTATTCAGAAGATTAGAACACGTATGCAGATAGCGCACAGTAGACAGAAGAGTTATGCGGATGT
GAGACGGAAAAACCTTGAGTTTGCGGTGGGAGATAAGGTATTCTTTAAGGTAGCACCTATAAAAGGTGTTATGCGTTTTGAGAAGAAAGCGAAGTTGAGTCCTCGTTTTG
TTGGACCGTTTGAGATCTTAGAGAGAATTGGTGTTGTGGCGTATCGCTTGGCGCTACCACCATCTCTCTCCGCACTTCATAATGTTTTCCATGTTTCGATGTTGAGGAAG
TATGTGGCAGATATATCTCATGTAGTGGACTATGAACCCTTGGAGATTGATGAGAATTTGAGCTATGTGGAACAACCTGTGGAGATTCTGGCTAGAGAGGTAAAGATGCT
TCGTAATAGAAGCATTCCATTAATAAAGGTTTTGTGGCAGAATCATCGAATTGAAGAGGCGACATGGGAGCGAGAGGCTGAGATGAAAACTCGATATCCGGAGTTATTTC
AGGATTAG
Protein sequenceShow/hide protein sequence
MFSKIDLRSGYHQLRIRDSNIPKTSFRSRYGHYEFIVMSFGLTNAPVVFMDLMNRVFKDFLDTFVIIFIDDILVYSKTEAEHEEHLHKVLELFDWSRPSTVSEVRSFLGL
AVYYRRFVEDFSHLATPLTQLTRKRTLFFWSPACEDSFQNLKQSDASKRGLGFVLMQQDKVVANASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYGEKIQIFTDHKS
LKYFFTQKELNTRQRRWLELVKDHDYEILYHPGKANVIADALSRKRAKITVSVGTVTSQLAQLMVQPTLRRKVIDAQSSDPYLVERRRLVETGQIDEFSISSDGGLMLER
CLCVPTNSAIKIDLLDEAHNSLFSMHLGICQQVKAPRQKPARLLQPLSVPEWKWEYVSMDFITGLPRTLKGFTVIWVVVDRLTKSAHFVPGKSTYTDSKWAQLYLTEIVR
LHGVPVPIVSDRDARFTSKFWKGLQIAMGTRLDFSTAFHPQTDGQTERLNQILEDMLRACVLEFPGSWDSHLHLMEFAYNNSFHATIGMAPFEALYGKCCRTPVCWSEVG
EQRLMGPELVQSTNEAIQKIRTRMQIAHSRQKSYADVRRKNLEFAVGDKVFFKVAPIKGVMRFEKKAKLSPRFVGPFEILERIGVVAYRLALPPSLSALHNVFHVSMLRK
YVADISHVVDYEPLEIDENLSYVEQPVEILAREVKMLRNRSIPLIKVLWQNHRIEEATWEREAEMKTRYPELFQD