; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G015550 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G015550
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr01:11968236..11975154
RNA-Seq ExpressionCmoCh01G015550
SyntenyCmoCh01G015550
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025998.1 pol protein [Cucumis melo var. makuwa]1.0e-20362.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

KAA0053301.1 pol protein [Cucumis melo var. makuwa]1.2e-20462.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLY E+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
        DF I+ DG L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS++S+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

TYK01576.1 pol protein [Cucumis melo var. makuwa]1.0e-20362.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

TYK20443.1 pol protein [Cucumis melo var. makuwa]1.0e-20362.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata]1.2e-20462.82Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R++E+D+PKTAFR+RYGHYE +VMSF LTNAPAVFMELMNRVF++FLD+FVIVFIDDILVYSK+  EH  HLR+VL +LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWLQ+V FLGHVVS  G+TVDPAK+EAV+ W RPTTVTEVRS FLGLAGYYRRFIKDF++++A LTQLT+K K F W++ CE SF ELK+RL
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+G LV+YSDA+  GL CVLMQ G+VIAYASRQLK+YER+YPTHDLELAAVV ALK WRHYLYGER+QVYTDHKSLKYLFTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD+EIL+HPGKANVVADALSRKTAH+SA++TRQ  +Q E +RA I VL R   AQLA +++ PTL+ +II  Q+ D + S++  Q ETE+  
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         + ++ +G L WQ+R CVP D +I ++I++EAH + YT HPG+TKMY DLK  +WWPGMKKDVAE V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHFI-------------------------PVSVVSNRDMRFTSRFWKSLQEAL
               LP+T+ GF VIWV+VDRLTK  HFI                         PV++VS+RD +FTS FWK LQ+AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHFI-------------------------PVSVVSNRDMRFTSRFWKSLQEAL

TrEMBL top hitse value%identityAlignment
A0A5A7SIJ5 Reverse transcriptase4.9e-20462.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

A0A5A7UIB4 Pol protein5.8e-20562.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLY E+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
        DF I+ DG L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS++S+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

A0A5D3BTN0 Reverse transcriptase4.9e-20462.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

A0A5D3C6W3 Reverse transcriptase4.9e-20462.31Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R+++ D+PKTAFR+RYGHYE VVMSF LTNAPAVFM+LMNRVF+DFLDSFVIVFIDDIL+YSKT  EH EHL +VL  LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWL+KV FLGHVVS +G++VDPAK+EAV  W RP+TV+E+RS FLGLAGYYRRF++DF+RIA+PLTQLTRK   F WS ACE SFQELK++L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+GN VIYSDA+K GL CVLMQ G+V+AYASRQLK +E++YPTHDLELAAVV ALK+WRHYLYGE+IQ+YTDHKSLKY FTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD EIL+HPGKANVVADALSRK AHS+A++T+Q  +  +FERA+IAV V +  AQLA +T+ PTL+++II  Q  DPY ++  R +ETEQ +
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         F I+ D  L ++ R CVP D  +K ++L+EAH SP+T+HPG+TKMY DL++ +WW GMK+DVA+ V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL
               LP+T  G+TVIWVVVDRLTK+ HF                         +PVS+VS+RD RFTS+FWK LQ AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHF-------------------------IPVSVVSNRDMRFTSRFWKSLQEAL

A0A6J1EV26 Reverse transcriptase5.8e-20562.82Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A VFSKIDLRSGYHQ+R++E+D+PKTAFR+RYGHYE +VMSF LTNAPAVFMELMNRVF++FLD+FVIVFIDDILVYSK+  EH  HLR+VL +LR  +L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
        YAKFSKCEFWLQ+V FLGHVVS  G+TVDPAK+EAV+ W RPTTVTEVRS FLGLAGYYRRFIKDF++++A LTQLT+K K F W++ CE SF ELK+RL
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR
         +APVL VPDG+G LV+YSDA+  GL CVLMQ G+VIAYASRQLK+YER+YPTHDLELAAVV ALK WRHYLYGER+QVYTDHKSLKYLFTQKEL MRQR
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQR

Query:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK
        RWLELVKDYD+EIL+HPGKANVVADALSRKTAH+SA++TRQ  +Q E +RA I VL R   AQLA +++ PTL+ +II  Q+ D + S++  Q ETE+  
Subjt:  RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIK

Query:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------
         + ++ +G L WQ+R CVP D +I ++I++EAH + YT HPG+TKMY DLK  +WWPGMKKDVAE V                                 
Subjt:  DFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR--------------------------------

Query:  -------LPRTQLGFTVIWVVVDRLTKTTHFI-------------------------PVSVVSNRDMRFTSRFWKSLQEAL
               LP+T+ GF VIWV+VDRLTK  HFI                         PV++VS+RD +FTS FWK LQ+AL
Subjt:  -------LPRTQLGFTVIWVVVDRLTKTTHFI-------------------------PVSVVSNRDMRFTSRFWKSLQEAL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.4e-6741.59Show/hide
Query:  FSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRLYAK
        F+ IDL  G+HQI +  + V KTAF T++GHYE + M F L NAPA F   MN + +  L+   +V++DDI+V+S + DEH + L  V   L K  L  +
Subjt:  FSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRLYAK

Query:  FSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSR-ACESSFQELKERLAS
          KCEF  Q+  FLGHV++ DGI  +P K+EA+  +  PT   E+++ FLGL GYYR+FI +FA IA P+T+  +K  K D +    +S+F++LK  ++ 
Subjt:  FSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSR-ACESSFQELKERLAS

Query:  APVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQRRW
         P+L VPD T    + +DA+   L  VL Q+G  ++Y SR L ++E +Y T + EL A+V A K +RHYL G   ++ +DH+ L +L+  K+   +  RW
Subjt:  APVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQRRW

Query:  LELVKDYDVEILHHPGKANVVADALSR
           + ++D +I +  GK N VADALSR
Subjt:  LELVKDYDVEILHHPGKANVVADALSR

P0CT34 Transposon Tf2-1 polyprotein1.0e-5728.66Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        + +F+K+DL+S YH IRV++ D  K AFR   G +E +VM + ++ APA F   +N +  +  +S V+ ++DDIL++SK+  EH +H++ VL  L+   L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
            +KCEF   +V F+G+ +S+ G T     ++ V+ W +P    E+R  FLG   Y R+FI   +++  PL  L +K  ++ W+     + + +K+ L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNG-----RVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYG--ERIQVYTDHKSLKYLFTQK
         S PVL   D +  +++ +DA+   +  VL Q         + Y S ++   + +Y   D E+ A++ +LK WRHYL    E  ++ TDH++L    T +
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNG-----RVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYG--ERIQVYTDHKSLKYLFTQK

Query:  ELIMRQR--RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVV
             +R  RW   ++D++ EI + PG AN +ADALSR    +  +     +  + F               +  ++I    + Q++     D     ++
Subjt:  ELIMRQR--RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVV

Query:  RQLETEQIKDFWITGDGCL-KWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR
           E +++++     DG L   +++  +PND ++ R I+ + HE    +HPG   + + +   F W G++K + E V+
Subjt:  RQLETEQIKDFWITGDGCL-KWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR

P0CT41 Transposon Tf2-12 polyprotein1.0e-5728.66Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        + +F+K+DL+S YH IRV++ D  K AFR   G +E +VM + ++ APA F   +N +  +  +S V+ ++DDIL++SK+  EH +H++ VL  L+   L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL
            +KCEF   +V F+G+ +S+ G T     ++ V+ W +P    E+R  FLG   Y R+FI   +++  PL  L +K  ++ W+     + + +K+ L
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERL

Query:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNG-----RVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYG--ERIQVYTDHKSLKYLFTQK
         S PVL   D +  +++ +DA+   +  VL Q         + Y S ++   + +Y   D E+ A++ +LK WRHYL    E  ++ TDH++L    T +
Subjt:  ASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQNG-----RVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYG--ERIQVYTDHKSLKYLFTQK

Query:  ELIMRQR--RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVV
             +R  RW   ++D++ EI + PG AN +ADALSR    +  +     +  + F               +  ++I    + Q++     D     ++
Subjt:  ELIMRQR--RWLELVKDYDVEILHHPGKANVVADALSRKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVV

Query:  RQLETEQIKDFWITGDGCL-KWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR
           E +++++     DG L   +++  +PND ++ R I+ + HE    +HPG   + + +   F W G++K + E V+
Subjt:  RQLETEQIKDFWITGDGCL-KWQNRTCVPNDVEIKRKILSEAHESPYTVHPGATKMYHDLKTCFWWPGMKKDVAECVR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.5e-6440.37Show/hide
Query:  FSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRLYAK
        F+ IDL  G+HQI + E+ + KTAF T+ GHYE + M F L NAPA F   MN + +  L+   +V++DDI+++S +  EH   ++ V   L    L  +
Subjt:  FSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRLYAK

Query:  FSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSR-ACESSFQELKERLAS
          KCEF  ++  FLGH+V+ DGI  +P KV+A++ +  PT   E+R+ FLGL GYYR+FI ++A IA P+T   +KR K D  +     +F++LK  +  
Subjt:  FSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSR-ACESSFQELKERLAS

Query:  APVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQRRW
         P+L +PD     V+ +DA+   L  VL QNG  I++ SR L D+E +Y   + EL A+V A K +RHYL G +  + +DH+ L++L   KE   +  RW
Subjt:  APVLIVPDGTGNLVIYSDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQRRW

Query:  LELVKDYDVEILHHPGKANVVADALSR
           + +Y  +I +  GK N VADALSR
Subjt:  LELVKDYDVEILHHPGKANVVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.5e-6139.13Show/hide
Query:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL
        A  F+ +DL SG+HQI +KE D+PKTAF T  G YE + + F L NAPA+F  +++ + ++ +     V+IDDI+V+S+  D H ++LR VL  L K  L
Subjt:  AAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRL

Query:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTR-----------KRKKFDWSRAC
             K  F   +V FLG++V+ DGI  DP KV A+     PT+V E++  FLG+  YYR+FI+D+A++A PLT LTR            +         
Subjt:  YAKFSKCEFWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTR-----------KRKKFDWSRAC

Query:  ESSFQELKERLASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQN----GRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGE-RIQVYTDHK
          SF +LK  L S+ +L  P  T    + +DA+   +  VL Q+     R IAY SR L   E +Y T + E+ A++ +L   R YLYG   I+VYTDH+
Subjt:  ESSFQELKERLASAPVLIVPDGTGNLVIYSDAAKHGLRCVLMQN----GRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGE-RIQVYTDHK

Query:  SLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR
         L +    +    + +RW   +++Y+ E+++ PGK+NVVADALSR
Subjt:  SLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
AT3G47650.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein3.0e-0466.67Show/hide
Query:  GSGVDAVDFFNGQFIAGASCWLCGGNK
        G GV+ +D FNGQF AGA CWLC G K
Subjt:  GSGVDAVDFFNGQFIAGASCWLCGGNK

ATMG00860.1 DNA/RNA polymerases superfamily protein2.9e-2341.27Show/hide
Query:  HLRKVLLVLRKQRLYAKFSKCEFWLQKVVFLG--HVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFD
        HL  VL +  + + YA   KC F   ++ +LG  H++S +G++ DPAK+EA++GW  P   TE+R  FLGL GYYRRF+K++ +I  PLT+L  K+    
Subjt:  HLRKVLLVLRKQRLYAKFSKCEFWLQKVVFLG--HVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFD

Query:  WSRACESSFQELKERLASAPVLIVPD
        W+     +F+ LK  + + PVL +PD
Subjt:  WSRACESSFQELKERLASAPVLIVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCAGCGGTATTTTCGAAGATTGATCTTCGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTA
TGAGCTTGTTGTGATGTCCTTTGACTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTATTCATTG
ATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAGGTTTTGTTGGTTCTGCGTAAACAAAGATTATATGCCAAGTTCTCAAAATGTGAG
TTTTGGCTTCAAAAAGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAGTTATAGGTTGGATTCGACCAACTACAGT
TACTGAGGTGAGAAGTTTTTTTTTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAACGGAAGA
AATTTGATTGGAGTCGAGCTTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTGACGGTACTGGGAACCTAGTAATTTAT
AGTGATGCCGCTAAGCATGGGTTGAGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCCAACTCATGA
TTTAGAATTAGCTGCTGTGGTGGTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTATATACTGATCATAAGAGTCTTAAATATCTGTTCACCC
AGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGATGCCTTGAGT
CGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAACGAGCCCAGATAGCTGTCTTGGTCAGGAAAGCGGCAGCTCAGCTAGC
TCTTATGACTATTTGTCCGACGTTGCAGGAACAAATTATTCGGGGTCAGCAAAGGGACCCTTATTTTTCTCAAGTTGTGAGGCAACTTGAGACTGAACAAATCAAGGATT
TCTGGATAACGGGTGATGGATGTTTGAAGTGGCAGAACCGAACTTGTGTGCCAAATGATGTAGAGATCAAGAGGAAGATTCTATCTGAGGCGCATGAGTCGCCCTATACT
GTGCATCCAGGTGCCACAAAGATGTATCATGATTTGAAAACATGCTTTTGGTGGCCTGGGATGAAGAAAGATGTGGCTGAATGTGTGAGACTACCTAGAACCCAGTTAGG
GTTTACAGTGATATGGGTCGTTGTGGATAGATTAACTAAGACAACCCATTTTATACCGGTTTCTGTTGTGTCAAATAGGGATATGCGGTTTACTTCTCGATTCTGGAAGA
GTCTTCAGGAAGCCCTTGTTGGAGGTTGTGGCAATCAGAAGCTCAACCTGATTAACAATGGCTTCCATTATTCTCCAGTTGCTGGATTCCTTCATCTAAATGGAAGTGGA
GTTGATGCTGTTGATTTCTTCAATGGACAATTCATAGCCGGTGCATCTTGTTGGTTGTGCGGTGGGAACAAAGGAAATGCTGTGTGGGAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGGCAGCGGTATTTTCGAAGATTGATCTTCGTTCTGGTTATCACCAGATAAGAGTCAAAGAAGATGACGTACCGAAGACAGCTTTTCGTACTCGGTATGGGCATTA
TGAGCTTGTTGTGATGTCCTTTGACTTGACTAATGCCCCTGCAGTGTTTATGGAGCTGATGAATCGGGTATTCCAGGATTTTCTGGATTCTTTTGTCATTGTATTCATTG
ATGATATCTTGGTTTATTCCAAGACAAACGATGAACATGCAGAACATTTGAGGAAGGTTTTGTTGGTTCTGCGTAAACAAAGATTATATGCCAAGTTCTCAAAATGTGAG
TTTTGGCTTCAAAAAGTAGTATTCCTTGGTCATGTGGTATCCAAGGATGGTATAACTGTTGATCCAGCAAAGGTGGAGGCAGTTATAGGTTGGATTCGACCAACTACAGT
TACTGAGGTGAGAAGTTTTTTTTTGGGTTTAGCCGGATATTACAGGCGCTTTATTAAAGACTTTGCAAGGATTGCTGCACCACTGACTCAGTTAACCCGAAAACGGAAGA
AATTTGATTGGAGTCGAGCTTGTGAAAGTAGTTTTCAGGAACTCAAGGAAAGATTAGCGTCAGCCCCAGTGCTTATTGTACCTGACGGTACTGGGAACCTAGTAATTTAT
AGTGATGCCGCTAAGCATGGGTTGAGGTGCGTACTTATGCAAAACGGGAGAGTTATTGCTTATGCCTCTCGGCAATTAAAGGATTATGAACGCAGTTACCCAACTCATGA
TTTAGAATTAGCTGCTGTGGTGGTTGCTCTGAAGGTATGGAGACATTATTTGTATGGTGAGAGGATACAAGTATATACTGATCATAAGAGTCTTAAATATCTGTTCACCC
AGAAAGAGCTCATCATGAGGCAACGTCGGTGGTTGGAATTGGTAAAAGATTATGATGTGGAGATCCTACATCATCCTGGTAAAGCTAATGTGGTAGCTGATGCCTTGAGT
CGTAAGACAGCTCACTCATCTGCAATGCTAACGAGACAACACAACATTCAGATGGAGTTTGAACGAGCCCAGATAGCTGTCTTGGTCAGGAAAGCGGCAGCTCAGCTAGC
TCTTATGACTATTTGTCCGACGTTGCAGGAACAAATTATTCGGGGTCAGCAAAGGGACCCTTATTTTTCTCAAGTTGTGAGGCAACTTGAGACTGAACAAATCAAGGATT
TCTGGATAACGGGTGATGGATGTTTGAAGTGGCAGAACCGAACTTGTGTGCCAAATGATGTAGAGATCAAGAGGAAGATTCTATCTGAGGCGCATGAGTCGCCCTATACT
GTGCATCCAGGTGCCACAAAGATGTATCATGATTTGAAAACATGCTTTTGGTGGCCTGGGATGAAGAAAGATGTGGCTGAATGTGTGAGACTACCTAGAACCCAGTTAGG
GTTTACAGTGATATGGGTCGTTGTGGATAGATTAACTAAGACAACCCATTTTATACCGGTTTCTGTTGTGTCAAATAGGGATATGCGGTTTACTTCTCGATTCTGGAAGA
GTCTTCAGGAAGCCCTTGTTGGAGGTTGTGGCAATCAGAAGCTCAACCTGATTAACAATGGCTTCCATTATTCTCCAGTTGCTGGATTCCTTCATCTAAATGGAAGTGGA
GTTGATGCTGTTGATTTCTTCAATGGACAATTCATAGCCGGTGCATCTTGTTGGTTGTGCGGTGGGAACAAAGGAAATGCTGTGTGGGAATTGTAAAGGAGATGGCTTCT
TTGGAGGTTTTTCTCAGCACTTACGATCAATAGAGGTATGTATTGTTAGTTTGTTTATGCTTGTGCATATATTAAGTATCCTGTCATCAACAATGGCGGTAACATTTAGG
CATACTGGTATCATATTCAAGTTCCTGACATTAAGTCCTGGCAATCTTTCATATAAGATTTGGCACAGTTGTGAATCTCAGGAGGTCATGCTCTGATCAATCAGTTTTGC
ATTTTGTACTTTTGGTTTACTAATTGATAAATGC
Protein sequenceShow/hide protein sequence
MTAAVFSKIDLRSGYHQIRVKEDDVPKTAFRTRYGHYELVVMSFDLTNAPAVFMELMNRVFQDFLDSFVIVFIDDILVYSKTNDEHAEHLRKVLLVLRKQRLYAKFSKCE
FWLQKVVFLGHVVSKDGITVDPAKVEAVIGWIRPTTVTEVRSFFLGLAGYYRRFIKDFARIAAPLTQLTRKRKKFDWSRACESSFQELKERLASAPVLIVPDGTGNLVIY
SDAAKHGLRCVLMQNGRVIAYASRQLKDYERSYPTHDLELAAVVVALKVWRHYLYGERIQVYTDHKSLKYLFTQKELIMRQRRWLELVKDYDVEILHHPGKANVVADALS
RKTAHSSAMLTRQHNIQMEFERAQIAVLVRKAAAQLALMTICPTLQEQIIRGQQRDPYFSQVVRQLETEQIKDFWITGDGCLKWQNRTCVPNDVEIKRKILSEAHESPYT
VHPGATKMYHDLKTCFWWPGMKKDVAECVRLPRTQLGFTVIWVVVDRLTKTTHFIPVSVVSNRDMRFTSRFWKSLQEALVGGCGNQKLNLINNGFHYSPVAGFLHLNGSG
VDAVDFFNGQFIAGASCWLCGGNKGNAVWEL