; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0165381 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0165381
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:17148521..17150211
RNA-Seq ExpressionCmc06g0165381
SyntenyCmc06g0165381
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040547.1 pol protein [Cucumis melo var. makuwa]5.1e-26383.13Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V  RE EVSLSSEPVVREYPDVF D+LPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CESSFQELKQKLV+APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLEL AVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSA LITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVTSQLAQ++VQPTLRQ+II AQLNDPYLVEKR +V+ GQ EDFSISSDDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

KAA0050011.1 pol protein [Cucumis melo var. makuwa]7.2e-26586.67Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V TRE EVSLSSEPVVREYPDVFPD+LPGLP PREIDF IELES TA I R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDY+ELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKT+FR+RY HYEFIVMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL----------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQL
        DLMNRVFKDFLD+F IVFIDDILIYSKTE EHEEHL Q+          G  VDP K+EAVT+WPRPST+SEI SFL L GYYRRFV DFSRIA+PLTQL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL----------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQL

Query:  TRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEK
        T+KGTPFVWSP CESSFQELKQKLV APVL V DGSGSFVIYSDASKKGL CVLMQQ +VVAY SRQLKS EQNYPTHDLELAAVVFALKIWRHYLYGEK
Subjt:  TRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEK

Query:  IQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQR
        IQI TDHKSL+Y FTQKELNMRQRRWLELVKDYD E+LYHPGKANVVADALSRKVAHSAALITKQAPLLRD ERAEIAVS+GEVTSQLAQ++VQPTLRQR
Subjt:  IQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQR

Query:  IIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        IIVAQLNDPYLVEKR LV+ GQ EDFSISSDDGLTF+G L
Subjt:  IIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

KAA0060745.1 pol protein [Cucumis melo var. makuwa]8.8e-26382.95Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V  RE EVSLSSEPVVREYPDVFPD+LPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CESSFQELKQKLV APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSAALITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVT+QLAQ++VQPTLRQ+II AQLNDPYL EKR +V+ GQ EDFSIS DDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

KAA0063403.1 reverse transcriptase [Cucumis melo var. makuwa]1.4e-26890.11Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MKTSKLLSQGTWSILASIV TREL+VSLSSEPVVREYPDVF D+LPGLPSPREIDFAIELE DTAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PWRAPV+FVK KDGSMR+CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFR RYRHYEFIVMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEE------HLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKG
        DLMNRVFKDFLDTF+ +F      +SK E   ++       +   G  VDP+KVEAVTSWPRPSTVSEI SFLGLAGYYRRFVEDFSRIASPLTQLTRKG
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEE------HLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKG

Query:  TPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
        TPFVWSP CESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGL CVLMQQ KVVAY SRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Subjt:  TPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF

Query:  TDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQRIIVA
        TDHKS+EYCFTQKELNMRQRRWL+LVKDYD EILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQ+TVQPTLRQRIIVA
Subjt:  TDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQRIIVA

Query:  QLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        QLNDPYLVEKRLLV+AGQSEDFSISSDDGLTFDG L
Subjt:  QLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

KAA0063793.1 pol protein [Cucumis melo var. makuwa]6.7e-26383.13Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLS GTW ILAS+V  RE EVSLSSEPVVREYPDVFPDKLPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CE SFQELKQKLV+APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSAALITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVT+QLAQ+TVQPTLRQ+II AQLNDPYL EKR +V+  Q EDFSISSDDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

TrEMBL top hitse value%identityAlignment
A0A5A7TG62 Reverse transcriptase2.5e-26383.13Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V  RE EVSLSSEPVVREYPDVF D+LPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CESSFQELKQKLV+APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLEL AVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSA LITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVTSQLAQ++VQPTLRQ+II AQLNDPYLVEKR +V+ GQ EDFSISSDDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

A0A5A7U8T4 Pol protein3.5e-26586.67Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V TRE EVSLSSEPVVREYPDVFPD+LPGLP PREIDF IELES TA I R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDY+ELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKT+FR+RY HYEFIVMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL----------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQL
        DLMNRVFKDFLD+F IVFIDDILIYSKTE EHEEHL Q+          G  VDP K+EAVT+WPRPST+SEI SFL L GYYRRFV DFSRIA+PLTQL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL----------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQL

Query:  TRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEK
        T+KGTPFVWSP CESSFQELKQKLV APVL V DGSGSFVIYSDASKKGL CVLMQQ +VVAY SRQLKS EQNYPTHDLELAAVVFALKIWRHYLYGEK
Subjt:  TRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEK

Query:  IQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQR
        IQI TDHKSL+Y FTQKELNMRQRRWLELVKDYD E+LYHPGKANVVADALSRKVAHSAALITKQAPLLRD ERAEIAVS+GEVTSQLAQ++VQPTLRQR
Subjt:  IQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQR

Query:  IIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        IIVAQLNDPYLVEKR LV+ GQ EDFSISSDDGLTF+G L
Subjt:  IIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

A0A5A7V4E4 Reverse transcriptase4.3e-26382.95Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLSQGTW ILAS+V  RE EVSLSSEPVVREYPDVFPD+LPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CESSFQELKQKLV APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSAALITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVT+QLAQ++VQPTLRQ+II AQLNDPYL EKR +V+ GQ EDFSIS DDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

A0A5A7V5N1 Reverse transcriptase6.8e-26990.11Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MKTSKLLSQGTWSILASIV TREL+VSLSSEPVVREYPDVF D+LPGLPSPREIDFAIELE DTAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PWRAPV+FVK KDGSMR+CIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFR RYRHYEFIVMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEE------HLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKG
        DLMNRVFKDFLDTF+ +F      +SK E   ++       +   G  VDP+KVEAVTSWPRPSTVSEI SFLGLAGYYRRFVEDFSRIASPLTQLTRKG
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEE------HLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKG

Query:  TPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
        TPFVWSP CESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGL CVLMQQ KVVAY SRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Subjt:  TPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF

Query:  TDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQRIIVA
        TDHKS+EYCFTQKELNMRQRRWL+LVKDYD EILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQ+TVQPTLRQRIIVA
Subjt:  TDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQRIIVA

Query:  QLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        QLNDPYLVEKRLLV+AGQSEDFSISSDDGLTFDG L
Subjt:  QLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

A0A5A7V6R2 Reverse transcriptase3.3e-26383.13Show/hide
Query:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS
        MK SKLLS GTW ILAS+V  RE EVSLSSEPVVREYPDVFPDKLPGLP PRE+DFAIELE  TAPI R PYRMAPAELKELKVQL+ELLDKGFIRPSVS
Subjt:  MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVS

Query:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM
        PW APV+FVK KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPKT+FR+RY HYEF+VMSFGLTNAPAVFM
Subjt:  PWRAPVMFVKNKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL
        DLMNRVFKDFLD+F IVFIDDILIYSKTEAEHEEHLHQ+                                 G  VDP K+EAVT+WPRPSTVSEI SFL
Subjt:  DLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFL

Query:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT
        GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSP CE SFQELKQKLV+APVLTV DGSG+FVIYSDASKKGL CVLMQQ KVVAY SRQLK  EQNYPT
Subjt:  GLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQI+TDHKSL+Y FTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVAHSAALITKQ PLLRD ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEI

Query:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL
        AVS+GEVT+QLAQ+TVQPTLRQ+II AQLNDPYL EKR +V+  Q EDFSISSDDGL F+G L
Subjt:  AVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.9e-7136.01Show/hide
Query:  YRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFV-KNKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q++++L++G IR S SP+ +P+  V K +D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFV-KNKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  RDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHE----------------------EHLHQLGSFV-
            + KT+F T++ HYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH                       E L Q  +F+ 
Subjt:  RDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEAEHE----------------------EHLHQLGSFV-

Query:  ----------DPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIY
                  +PEK+EA+  +P P+   EI +FLGL GYYR+F+ +F+ IA P+T+  +K       +P  +S+F++LK  +   P+L V D +  F + 
Subjt:  ----------DPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIY

Query:  SDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPG
        +DAS   L  VL Q    ++Y SR L   E NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  G
Subjt:  SDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPG

Query:  KANVVADALSR
        K N VADALSR
Subjt:  KANVVADALSR

P0CT34 Transposon Tf2-1 polyprotein2.6e-6830.78Show/hide
Query:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV
        + +E+ D+  +     LP P + ++F +EL  +   +    Y + P +++ +  ++ + L  G IR S +    PVMFV  K+G++R+ +DY+ LNK   
Subjt:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R  D  K +FR     +E++VM +G++ APA F   +N +  +  ++  + ++DDILI+SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA

Query:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF
        EH +H+                     HQ     +G  +         E ++ V  W +P    E+  FLG   Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF

Query:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK
         W+PT   + + +KQ LVS PVL   D S   ++ +DAS   +  VL Q+   DK   V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +  P+ +D E   I        + + Q+++    +
Subjt:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR

Query:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL
         +++    ND  L+   LL    +  + +I   DGL
Subjt:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL

P0CT35 Transposon Tf2-2 polyprotein2.6e-6830.78Show/hide
Query:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV
        + +E+ D+  +     LP P + ++F +EL  +   +    Y + P +++ +  ++ + L  G IR S +    PVMFV  K+G++R+ +DY+ LNK   
Subjt:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R  D  K +FR     +E++VM +G++ APA F   +N +  +  ++  + ++DDILI+SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA

Query:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF
        EH +H+                     HQ     +G  +         E ++ V  W +P    E+  FLG   Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF

Query:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK
         W+PT   + + +KQ LVS PVL   D S   ++ +DAS   +  VL Q+   DK   V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +  P+ +D E   I        + + Q+++    +
Subjt:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR

Query:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL
         +++    ND  L+   LL    +  + +I   DGL
Subjt:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL

P0CT41 Transposon Tf2-12 polyprotein2.6e-6830.78Show/hide
Query:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV
        + +E+ D+  +     LP P + ++F +EL  +   +    Y + P +++ +  ++ + L  G IR S +    PVMFV  K+G++R+ +DY+ LNK   
Subjt:  VVREYPDVFPD-KLPGLPSP-REIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA
         N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+R  D  K +FR     +E++VM +G++ APA F   +N +  +  ++  + ++DDILI+SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSKTEA

Query:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF
        EH +H+                     HQ     +G  +         E ++ V  W +P    E+  FLG   Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHL---------------------HQ-----LGSFVD-------PEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF

Query:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK
         W+PT   + + +KQ LVS PVL   D S   ++ +DAS   +  VL Q+   DK   V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQQ---DKV--VAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +  P+ +D E   I        + + Q+++    +
Subjt:  IQIFTDHKSLEYCFTQKE--LNMRQRRWLELVKDYDSEILYHPGKANVVADALSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLR

Query:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL
         +++    ND  L+   LL    +  + +I   DGL
Subjt:  QRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus5.7e-7133.12Show/hide
Query:  EYPDVFPDKLPGLPSPREIDFAIELESDT---APILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNK-----DGSMRLCIDYRELNK
        E+P +F   L G+     ++ A++ E  T    PI    Y        E++ Q+ ELL  G IRPS SP+ +P+  V  K     +   R+ +D++ LN 
Subjt:  EYPDVFPDKLPGLPSPREIDFAIELESDT---APILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVKNK-----DGSMRLCIDYRELNK

Query:  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSK
        VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +++SDIPKT+F T    YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+++S+
Subjt:  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFIDDILIYSK

Query:  TEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTR--
            H ++L  +                                 G   DP+KV A++  P P++V E+  FLG+  YYR+F++D++++A PLT LTR  
Subjt:  TEAEHEEHLHQL---------------------------------GSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTR--

Query:  ---------KGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQ----QDKVVAYTSRQLKSQEQNYPTHDLELAAVVFAL
                    P     T   SF +LK  L S+ +L     +  F + +DAS   +  VL Q    +D+ +AY SR L   E+NY T + E+ A++++L
Subjt:  ---------KGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFVIYSDASKKGLDCVLMQ----QDKVVAYTSRQLKSQEQNYPTHDLELAAVVFAL

Query:  KIWRHYLYGE-KIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSR
           R YLYG   I+++TDH+ L +    +  N + +RW   +++Y+ E++Y PGK+NVVADALSR
Subjt:  KIWRHYLYGE-KIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.9e-1538.38Show/hide
Query:  HEEHLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFV
        H   +   G   DP K+EA+  WP P   +E+  FLGL GYYRRFV+++ +I  PLT+L +K +   W+     +F+ LK  + + PVL + D    FV
Subjt:  HEEHLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACTAGTAAACTACTCAGCCAGGGTACCTGGAGCATCTTGGCAAGCATAGTAGTTACAAGAGAACTAGAAGTTTCCCTGTCTTCTGAACCAGTGGTAAGGGAGTA
CCCTGATGTTTTCCCCGACAAGCTTCCAGGACTTCCATCTCCTAGGGAGATAGACTTCGCCATTGAGTTAGAGTCAGACACTGCTCCTATCTTGAGGATCCCTTACAGAA
TGGCCCCAGCTGAGCTAAAAGAGCTAAAGGTGCAATTGAAGGAGTTACTGGACAAGGGTTTTATTCGACCTAGTGTGTCACCTTGGAGAGCACCAGTGATGTTTGTGAAG
AATAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTTGTTCGATCAGTTGCA
AGGAGCCACCGTCTTTTCTAAGATCGACCTGCGATCAGGCTACCACCAATTGAGGATCAGGGATAGTGATATTCCTAAGACTTCTTTTCGTACAAGATACAGACATTACG
AGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCTGCAGTATTCATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTTTCATAGTTTTCATTGAT
GACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCACCAGTTAGGGAGTTTCGTGGATCCAGAAAAGGTCGAAGCAGTTACCAGTTGGCCTCGACC
GTCTACAGTTAGCGAGATTCATAGTTTTTTAGGTTTGGCAGGTTACTACAGGAGGTTTGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACCCAATTGACCAGGAAGG
GGACTCCTTTTGTTTGGAGCCCAACTTGTGAGAGTAGCTTTCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTTCTAACAGTATCCGATGGATCTGGAAGTTTCGTG
ATCTATAGTGATGCCTCAAAGAAAGGACTGGATTGTGTTCTGATGCAGCAAGATAAGGTAGTTGCTTATACCTCTCGTCAGTTGAAGAGTCAAGAGCAAAACTACCCTAC
CCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCACTGAAGATATGGAGGCACTACCTGTACGGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTAGAGTACTGCT
TCACCCAGAAGGAGTTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTGAAAGACTATGACAGCGAGATTTTATATCACCCAGGTAAGGCAAACGTAGTAGCTGACGCG
TTGAGTAGGAAGGTTGCGCATTCAGCAGCGCTTATCACCAAGCAAGCTCCCTTACTAAGAGATCTCGAGAGAGCTGAGATTGCAGTCTCAATAGGGGAAGTGACCTCACA
ATTGGCTCAGGTTACCGTGCAGCCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAAAGGCAGGGCAAAGTG
AGGATTTCTCCATATCCTCTGATGACGGACTTACTTTCGATGGACATTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACTAGTAAACTACTCAGCCAGGGTACCTGGAGCATCTTGGCAAGCATAGTAGTTACAAGAGAACTAGAAGTTTCCCTGTCTTCTGAACCAGTGGTAAGGGAGTA
CCCTGATGTTTTCCCCGACAAGCTTCCAGGACTTCCATCTCCTAGGGAGATAGACTTCGCCATTGAGTTAGAGTCAGACACTGCTCCTATCTTGAGGATCCCTTACAGAA
TGGCCCCAGCTGAGCTAAAAGAGCTAAAGGTGCAATTGAAGGAGTTACTGGACAAGGGTTTTATTCGACCTAGTGTGTCACCTTGGAGAGCACCAGTGATGTTTGTGAAG
AATAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTTGTTCGATCAGTTGCA
AGGAGCCACCGTCTTTTCTAAGATCGACCTGCGATCAGGCTACCACCAATTGAGGATCAGGGATAGTGATATTCCTAAGACTTCTTTTCGTACAAGATACAGACATTACG
AGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCTGCAGTATTCATGGACTTGATGAACAGGGTGTTTAAGGATTTCTTAGACACGTTTTTCATAGTTTTCATTGAT
GACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCACCAGTTAGGGAGTTTCGTGGATCCAGAAAAGGTCGAAGCAGTTACCAGTTGGCCTCGACC
GTCTACAGTTAGCGAGATTCATAGTTTTTTAGGTTTGGCAGGTTACTACAGGAGGTTTGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGACCCAATTGACCAGGAAGG
GGACTCCTTTTGTTTGGAGCCCAACTTGTGAGAGTAGCTTTCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTTCTAACAGTATCCGATGGATCTGGAAGTTTCGTG
ATCTATAGTGATGCCTCAAAGAAAGGACTGGATTGTGTTCTGATGCAGCAAGATAAGGTAGTTGCTTATACCTCTCGTCAGTTGAAGAGTCAAGAGCAAAACTACCCTAC
CCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCACTGAAGATATGGAGGCACTACCTGTACGGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTAGAGTACTGCT
TCACCCAGAAGGAGTTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTGAAAGACTATGACAGCGAGATTTTATATCACCCAGGTAAGGCAAACGTAGTAGCTGACGCG
TTGAGTAGGAAGGTTGCGCATTCAGCAGCGCTTATCACCAAGCAAGCTCCCTTACTAAGAGATCTCGAGAGAGCTGAGATTGCAGTCTCAATAGGGGAAGTGACCTCACA
ATTGGCTCAGGTTACCGTGCAGCCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTATTTGGTCGAGAAGCGTCTATTAGTAAAGGCAGGGCAAAGTG
AGGATTTCTCCATATCCTCTGATGACGGACTTACTTTCGATGGACATTTGTGA
Protein sequenceShow/hide protein sequence
MKTSKLLSQGTWSILASIVVTRELEVSLSSEPVVREYPDVFPDKLPGLPSPREIDFAIELESDTAPILRIPYRMAPAELKELKVQLKELLDKGFIRPSVSPWRAPVMFVK
NKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTSFRTRYRHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDTFFIVFID
DILIYSKTEAEHEEHLHQLGSFVDPEKVEAVTSWPRPSTVSEIHSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPTCESSFQELKQKLVSAPVLTVSDGSGSFV
IYSDASKKGLDCVLMQQDKVVAYTSRQLKSQEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLEYCFTQKELNMRQRRWLELVKDYDSEILYHPGKANVVADA
LSRKVAHSAALITKQAPLLRDLERAEIAVSIGEVTSQLAQVTVQPTLRQRIIVAQLNDPYLVEKRLLVKAGQSEDFSISSDDGLTFDGHL