CuGenDBv2

Lag0014385 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014385
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr12:185519..190674
RNA-Seq Expression: Lag0014385
Synteny: Lag0014385
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] (e value: 5.4e-234; identity: 40.28%)
Query:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL
        LAS RW           S EK      T  MK F+ FI   EL+D+PL+   FTW++ +   +   +DRFL S+   Q F  +    LPR TSDH+PI L
Subjt:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL

Query:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS
             +WGPT F+F N WL H SF++    WW      GW GH FM+KL+  K  ++ WN  +FG+    K+D++ +L + D+ E+   L   +  +R  
Subjt:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS

Query:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS
         K +L  L  RE+  WRQ+ + KW+ EGD N+ FFH      R +  I EL + +G  + +  SI+ E + +++ L++  +G  +  +  DW  IS   +
Subjt:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS

Query:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL
          LE PFTEEE+ +A+  +  +K+PGPDGFT   F+  W ++K+D++ VF +F +S IIN + N ++I L+PKK  ++ + D+RPISL + LYKI+A+VL
Subjt:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL

Query:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR
        + R++ VL  TI   Q AFV  RQILDA L+ANE++DE +R  E+ V  K+D EKA+D V W+FLD +  +KGFG  WR+W+RGC+SSV++++++NG  +
Subjt:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR

Query:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM
        G   ASRGLRQGDPLSPFLF +V D LSR+L+KA+ + +++G  V  G +   ++HLQFADDTI FSS  E  +  L   + +F   SGL +N  K+   
Subjt:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM

Query:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR
        GI L+   L  LA+   CK  GWP  YLGLPL G PK+  FW+PVIE+I +RL  W   +LS GGR+T IQ+ L ++P Y+LSLFK P  V  KIE++ R
Subjt:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR

Query:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN
        +FLW G    K  HL+ WD V  P   GGLG   I  +N +LL KW+WR+  E  ALW +VI + YGS             S + PWK I          
Subjt:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN

Query:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN
            VGNG    FW D W G   L   +PRL  +   K+API+     +TR  SW+F  RR L + EIED   L+  L  ++   S+ D   W L  +G 
Subjt:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN

Query:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE
        F+  S    LS           K +W  Q P KVK F+W V+H  +NT D LQ R P+  +SP  C +C    E+V H+F  C      W  L   A  +
Subjt:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE

Query:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW
        W  PRS  I  +LS  F G  F     +LW     A  W +W ERN RIF DK ++  +  +S   L   W+  S  F    L+ L   W
Subjt:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 8.6e-232; identity: 38.32%)
Query:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAK---SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS
        RW  E + + P    M+ FN FI    L+D PL + K+TW++ RA+   S +DRFL +      F   +   L R TSDH+PI L      WGP+ F+F+
Subjt:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAK---SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS

Query:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI
        N +L    +++ ++ WW N    G+ G+ FM++LK     I+ W  +  GK ++ K+  IKE++ ID  E      +   ++R ++K DL  +   E  I
Subjt:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI

Query:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA
        W Q+CK  W+ EGDEN++FFH    A +++  I ++++ SG++ ++D+ I   F+  ++ +++     +   +  DW  IS+  S  L+ PF E E+   
Subjt:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA

Query:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY
        +     NK+PGPDG+  +F +KSW+ +K++I  +F DF  + IIN  +NET I LI KK   ++  D+RPISLT+ +YK++A+ L++RLK+ LP TI+E 
Subjt:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY

Query:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL
        Q AFV  RQI +A L+ANE +D W+ KKE+   IKLDIEKAFD ++W F+D +   K +   WR+ I  CISSV YSI+ING+PRG+   SRG+RQGDPL
Subjt:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL

Query:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR
        SPF+F++ +D LSRLL     +  I G+     S  L++TH+ FADD ++F    + ++ NL   + LFE ASGLNIN  K+    I +      S+AD 
Subjt:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR

Query:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL
        +G   G  P +YLG+PL G+P S +FW+ V++KI+K+L +W    LSKGGR+T I +TL++LPIY +S+FK PK +  KIE  +RNFLW G +      L
Subjt:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL

Query:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK
        ++W+++ +P E+GGLGI  + + N +LL KW+W+   EK  LW+R+I +KY  +     P      S+  PWK +       + NI  KV +G+D SFW 
Subjt:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK

Query:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEIEIEDWTSL-LSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CP
        DNW G++ L    PRL+ LS  K   + +FW   +  W  +  RPL + E   W ++  SL  P+ N+       W L SN  F T S+ + ++ +   P
Subjt:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEIEIEDWTSL-LSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CP

Query:  GDFSV-LYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLL
         +F   LYK +W  ++PKK KFF+W + H CINT D+LQ R P   +SP+ C MC+   E + H+F  CP++   WS  +A   W    + D+ SL+  +
Subjt:  GDFSV-LYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLL

Query:  FMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTL
              +N+K ++         W +WLERN RIF  ++K      E +      WS  S  F NY   ++
Subjt:  FMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTL

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.2e-228; identity: 39.61%)
Query:  SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTF
        S +DRFL S      F   +   L R TSDH+PI L      WGP  F+F+N +L    +++ ++ WW N    G+ G+ FM++LK     I+ W     
Subjt:  SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTF

Query:  GKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDAS
        GK +  K+  IKE+N ID  E      +    +RL++K DL  +   E  IW Q+CK  W+ EGDEN++FFH    A +++  I ++++  G++ ++D+ 
Subjt:  GKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDAS

Query:  IETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLN
        I   F+  ++ +++     +   D  DW  IS+     L+ PF E E+   +     NK+PGPDGFT +F +KSW+ +K +I  +F DF  +  IN  +N
Subjt:  IETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLN

Query:  ETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEF
        ET I LI KK   ++V D+RPISLT+ +YK++A+VL++RLK+ LP+TI+E Q AFV  RQI +A L+ANE +D W+ KKE+   IKLDIEKAFD ++W F
Subjt:  ETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEF

Query:  LDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTI
        +D +   K +   WR  I  CISSV YSI+ING+PRG+   +RG+RQGDPLSPF+F++ +D LS LLI    +G I G++ G     L++TH+ FADD +
Subjt:  LDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTI

Query:  LFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKG
        +F    E ++ NL   + LFE ASGLNIN  K+    I +      S+ D +G   G  P TYLG+PL GKP S +FW+ +++KI+K+L SW    LSKG
Subjt:  LFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKG

Query:  GRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVIST
        GR+T I +TL++LPIY LS+FK PK +  KIE  +RNFLW G +      L++W++V +P E+GGLGI  + + N +LL KW+W+   EK  LW+R+I +
Subjt:  GRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVIST

Query:  KYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEI
        KY  +     P      S+  PWK + +     + NI  KV +G+D SFW DNW G+S L  + PRL+ LS  K   + D W    + W+ +  RPL + 
Subjt:  KYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEI

Query:  EIEDWTSLLSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLS--VSCPGDF-SVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPS
        E   W ++ + L             W L SN  F T S+ K LS   + P +F   LYK +W   +PKK KFF+W + H CINT D+LQ R P   +SP+
Subjt:  EIEDWTSLLSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLS--VSCPGDF-SVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPS

Query:  CCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSL
         C MC+   E + H+F  CP++   WS  QA  +W      D+ SL   +      K +K ++    +    W +WLERN RIF  +KK+     E    
Subjt:  CCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSL

Query:  LAISWSKLSSPFCNYSLSTL
            WS  S  F NY   ++
Subjt:  LAISWSKLSSPFCNYSLSTL

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 9.2e-234; identity: 40.18%)
Query:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL
        LAS RW           S EK      T  MK+F+ FI   EL+D+PL+   FTW++ +   +   +DRFL S+   Q F  +    LPR TSDH+PI L
Subjt:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL

Query:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS
             +WGPT F+F N WL H SF++    WW      GW GH FM+KL+  K  ++ WN  +FG+    K+D++  L + D+ E+   L   +  +R  
Subjt:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS

Query:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS
         K +L  L  RE+  WRQ+ + KW+ EGD N+ FFH      R +  I EL + +G+ + +  SI+ E + +++ L++  +G  +  +  DW  IS   +
Subjt:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS

Query:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL
          LE PFTEEE+ +A+  +  +K+PGPDGFT   F+  W ++K+D++ VF +F +S IIN + N ++I L+PKK  ++ + D+RPISL + LYKI+A+VL
Subjt:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL

Query:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR
        + R+++VL  TI   Q AFV  RQILDA L+ANE++DE +R  E+ V  K+D EKA+D V W+FLD +  +KGFG  WR+W+RGC+SSV++++++NG  +
Subjt:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR

Query:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM
        G   ASRGLRQGDPLSPFLF +V D LSR+L+KA+ + +++G  V  G +   ++HLQFADDTI FSS  E  +  L   + +F   SGL +N  K+   
Subjt:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM

Query:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR
        GI L+   L  LA+   CK  GWP  YLGLPL G PK+  FW+PVIE+I +RL  W   +LS GGR+T IQ+ L ++P Y+LSLFK P  V  KIE++ R
Subjt:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR

Query:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN
        +FLW G    K  HL+ WD V  P   GGLG   I  +N +LL KW+WR+  E  ALW +VI + YGS             S + PWK I          
Subjt:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN

Query:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN
            VGNG    FW D W G   L   +PRL  +   K+API+      TR  SW+F  RR L + EIED   L+     ++   S+ D   W+L S+G 
Subjt:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN

Query:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE
        F+  S    LS           K +W  Q P KVK F+W V+H  +NT D LQ R P+  +SP  C +C    E+V H+F  C      W  L   A  +
Subjt:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE

Query:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW
        W  PRS  I  +L+  F G  F     +LW     A  W +W ERN RIF DK ++  +  +S   L   W+  S  F    L+ L   W
Subjt:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW

XP_020420593.1 uncharacterized protein LOC18774736 [Prunus persica] (e value: 1.0e-232; identity: 41.2%)
Query:  MKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQN
        MKNFN FID   L D  L +  FTW++ R  ++   +DRFL S++    F +     L R+T DH PI+L     +WGP  F+F N W+ +  F++  + 
Subjt:  MKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQN

Query:  WWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDE
        WW    + GW G+ F ++L+  K  I++WN   FG   S K++    +  +D  E    LD+ + K R  +   +  L  +E+  WRQR K +W  +GD 
Subjt:  WWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDE

Query:  NTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGF
        NT FFH   +  R++N I +L       +V +  IE E ++F+K L+S  A   +  +  +W AIS   +  LE PF EEEV RAV D G +KSPGPDGF
Subjt:  NTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGF

Query:  TAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASL
        +   F+  W+I+K+D+M V  DFF   IINA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI+ YQSAFV  RQILDA+L
Subjt:  TAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASL

Query:  VANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRL
        +ANE+++E +R  +  +  K+D+EKA+D V+W F+DE+   KGFG  WR WIRG + + N+S++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR+
Subjt:  VANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRL

Query:  LIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGL
        + KA    +  GL  G G   + ++HLQFADDTI F    + + +NL + ++LF   SG+ IN  K   +GI LD   L  LA  +GC++G WP +YLGL
Subjt:  LIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGL

Query:  PLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGL
        PL G P+++ FW+PV+EK+E RL  W    LSK GRLT IQA L ++PIYY+SLF+ P  V  +IEKL R+FLW G +G K +H + W+ V      GGL
Subjt:  PLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGL

Query:  GIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYG--SQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIF
        G+  ++ ++ +L AKW+WR  NE  ALW +VI + YG  +  +D +P T+   S +  W+ I+S  +L       +VG G    FW+D+W G   + ++F
Subjt:  GIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYG--SQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIF

Query:  PRLYHLSNRKDAPIADF--WCQHTRSWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDS-LDLWWWALESNGNFSTNSLSKHLSVSCPGDFSVLYKHIWAG
        PRL++LS +++  I+ F        SW F  RR L E+EI +   LL LL+ +    S LD   W L+  G F+ +S   H+      +    Y  IW  
Subjt:  PRLYHLSNRKDAPIADF--WCQHTRSWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDS-LDLWWWALESNGNFSTNSLSKHLSVSCPGDFSVLYKHIWAG

Query:  QYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFEWPFPRSGDILSLLSLLFMGHPFKNEKKI
        + P KVK F+W+     +NT D LQ R P+L ISP  C +C+   +SV H+   CPF+   W  L  +    W  P       L S+ F       + KI
Subjt:  QYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFEWPFPRSGDILSLLSLLFMGHPFKNEKKI

Query:  LWLCHVHAFFWNLWLERNGRIFSD-KKKDIGHFIESSSLLAISWSKLSSPF
        LW   + A  WNLW+ER+ RIF D K   +    +     A  W+  S  F
Subjt:  LWLCHVHAFFWNLWLERNGRIFSD-KKKDIGHFIESSSLLAISWSKLSSPF

TrEMBL top hits (e value, % identity, alignment)
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein (e value: 4.5e-234; identity: 40.18%)
Query:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL
        LAS RW           S EK      T  MK+F+ FI   EL+D+PL+   FTW++ +   +   +DRFL S+   Q F  +    LPR TSDH+PI L
Subjt:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL

Query:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS
             +WGPT F+F N WL H SF++    WW      GW GH FM+KL+  K  ++ WN  +FG+    K+D++  L + D+ E+   L   +  +R  
Subjt:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS

Query:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS
         K +L  L  RE+  WRQ+ + KW+ EGD N+ FFH      R +  I EL + +G+ + +  SI+ E + +++ L++  +G  +  +  DW  IS   +
Subjt:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS

Query:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL
          LE PFTEEE+ +A+  +  +K+PGPDGFT   F+  W ++K+D++ VF +F +S IIN + N ++I L+PKK  ++ + D+RPISL + LYKI+A+VL
Subjt:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL

Query:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR
        + R+++VL  TI   Q AFV  RQILDA L+ANE++DE +R  E+ V  K+D EKA+D V W+FLD +  +KGFG  WR+W+RGC+SSV++++++NG  +
Subjt:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR

Query:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM
        G   ASRGLRQGDPLSPFLF +V D LSR+L+KA+ + +++G  V  G +   ++HLQFADDTI FSS  E  +  L   + +F   SGL +N  K+   
Subjt:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM

Query:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR
        GI L+   L  LA+   CK  GWP  YLGLPL G PK+  FW+PVIE+I +RL  W   +LS GGR+T IQ+ L ++P Y+LSLFK P  V  KIE++ R
Subjt:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR

Query:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN
        +FLW G    K  HL+ WD V  P   GGLG   I  +N +LL KW+WR+  E  ALW +VI + YGS             S + PWK I          
Subjt:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN

Query:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN
            VGNG    FW D W G   L   +PRL  +   K+API+      TR  SW+F  RR L + EIED   L+     ++   S+ D   W+L S+G 
Subjt:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN

Query:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE
        F+  S    LS           K +W  Q P KVK F+W V+H  +NT D LQ R P+  +SP  C +C    E+V H+F  C      W  L   A  +
Subjt:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE

Query:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW
        W  PRS  I  +L+  F G  F     +LW     A  W +W ERN RIF DK ++  +  +S   L   W+  S  F    L+ L   W
Subjt:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value: 4.2e-232; identity: 38.32%)
Query:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAK---SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS
        RW  E + + P    M+ FN FI    L+D PL + K+TW++ RA+   S +DRFL +      F   +   L R TSDH+PI L      WGP+ F+F+
Subjt:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAK---SLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS

Query:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI
        N +L    +++ ++ WW N    G+ G+ FM++LK     I+ W  +  GK ++ K+  IKE++ ID  E      +   ++R ++K DL  +   E  I
Subjt:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI

Query:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA
        W Q+CK  W+ EGDEN++FFH    A +++  I ++++ SG++ ++D+ I   F+  ++ +++     +   +  DW  IS+  S  L+ PF E E+   
Subjt:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA

Query:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY
        +     NK+PGPDG+  +F +KSW+ +K++I  +F DF  + IIN  +NET I LI KK   ++  D+RPISLT+ +YK++A+ L++RLK+ LP TI+E 
Subjt:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY

Query:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL
        Q AFV  RQI +A L+ANE +D W+ KKE+   IKLDIEKAFD ++W F+D +   K +   WR+ I  CISSV YSI+ING+PRG+   SRG+RQGDPL
Subjt:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL

Query:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR
        SPF+F++ +D LSRLL     +  I G+     S  L++TH+ FADD ++F    + ++ NL   + LFE ASGLNIN  K+    I +      S+AD 
Subjt:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR

Query:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL
        +G   G  P +YLG+PL G+P S +FW+ V++KI+K+L +W    LSKGGR+T I +TL++LPIY +S+FK PK +  KIE  +RNFLW G +      L
Subjt:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL

Query:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK
        ++W+++ +P E+GGLGI  + + N +LL KW+W+   EK  LW+R+I +KY  +     P      S+  PWK +       + NI  KV +G+D SFW 
Subjt:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK

Query:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEIEIEDWTSL-LSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CP
        DNW G++ L    PRL+ LS  K   + +FW   +  W  +  RPL + E   W ++  SL  P+ N+       W L SN  F T S+ + ++ +   P
Subjt:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEIEIEDWTSL-LSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CP

Query:  GDFSV-LYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLL
         +F   LYK +W  ++PKK KFF+W + H CINT D+LQ R P   +SP+ C MC+   E + H+F  CP++   WS  +A   W    + D+ SL+  +
Subjt:  GDFSV-LYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLL

Query:  FMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTL
              +N+K ++         W +WLERN RIF  ++K      E +      WS  S  F NY   ++
Subjt:  FMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTL

A5BCI7 Reverse transcriptase domain-containing protein (e value: 2.6e-234; identity: 40.28%)
Query:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL
        LAS RW           S EK      T  MK F+ FI   EL+D+PL+   FTW++ +   +   +DRFL S+   Q F  +    LPR TSDH+PI L
Subjt:  LASSRW-----------SWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKL

Query:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS
             +WGPT F+F N WL H SF++    WW      GW GH FM+KL+  K  ++ WN  +FG+    K+D++ +L + D+ E+   L   +  +R  
Subjt:  TLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLS

Query:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS
         K +L  L  RE+  WRQ+ + KW+ EGD N+ FFH      R +  I EL + +G  + +  SI+ E + +++ L++  +G  +  +  DW  IS   +
Subjt:  IKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLS

Query:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL
          LE PFTEEE+ +A+  +  +K+PGPDGFT   F+  W ++K+D++ VF +F +S IIN + N ++I L+PKK  ++ + D+RPISL + LYKI+A+VL
Subjt:  ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVL

Query:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR
        + R++ VL  TI   Q AFV  RQILDA L+ANE++DE +R  E+ V  K+D EKA+D V W+FLD +  +KGFG  WR+W+RGC+SSV++++++NG  +
Subjt:  SERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPR

Query:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM
        G   ASRGLRQGDPLSPFLF +V D LSR+L+KA+ + +++G  V  G +   ++HLQFADDTI FSS  E  +  L   + +F   SGL +N  K+   
Subjt:  GKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFM

Query:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR
        GI L+   L  LA+   CK  GWP  YLGLPL G PK+  FW+PVIE+I +RL  W   +LS GGR+T IQ+ L ++P Y+LSLFK P  V  KIE++ R
Subjt:  GIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYR

Query:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN
        +FLW G    K  HL+ WD V  P   GGLG   I  +N +LL KW+WR+  E  ALW +VI + YGS             S + PWK I          
Subjt:  NFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN

Query:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN
            VGNG    FW D W G   L   +PRL  +   K+API+     +TR  SW+F  RR L + EIED   L+  L  ++   S+ D   W L  +G 
Subjt:  IHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGN

Query:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE
        F+  S    LS           K +W  Q P KVK F+W V+H  +NT D LQ R P+  +SP  C +C    E+V H+F  C      W  L   A  +
Subjt:  FSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYL--QATFE

Query:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW
        W  PRS  I  +LS  F G  F     +LW     A  W +W ERN RIF DK ++  +  +S   L   W+  S  F    L+ L   W
Subjt:  WPFPRSGDILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQW

M5VS59 Reverse transcriptase domain-containing protein (Fragment) (e value: 1.4e-232; identity: 42.4%)
Query:  EAFSLCGMKLSLASS----RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHY
        + +  CG K  L       R+S EKSN+   TK M++FN FI    L D  L +  FTW++ R  ++   +DRFL+S S    F +     LPRITSDH 
Subjt:  EAFSLCGMKLSLASS----RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHY

Query:  PIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSK
        PI+L   + +WGP+ F+F N WL+H  F + ++ WW    + GW G+ FM +LK  K  ++ W+   FG  + D ++    L  +D +E  + LD  +  
Subjt:  PIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSK

Query:  RRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAIS
         R ++ + +  LA +E+  WRQR K KW  EGD NT FFH      R++N I +L       +  DA+IE E + F+K L+S    + +  +  +W  IS
Subjt:  RRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAIS

Query:  DNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIV
           +  LE PF  EEV +AV + G +KSPGPDGF+  FF+  W ++K D+M V  DFF+S I+N   NET+ICLIPKK  +  V D RPISL + LYK++
Subjt:  DNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIV

Query:  ARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIIN
        ++VL+ RL++VL +TI++ Q AFV  RQILDA LVANE+++E +++K K +  K+D EKA+D V+W F+D++   KGFG  WR WI GC+ SVN+SI+IN
Subjt:  ARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIIN

Query:  GKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHK
        GKPRGKF ASRGLRQGDPLSPFLF +V D LSR++ +A    L+ G  + SG   + ++HLQFADDTI      E +  NL + +KLF + SG+ IN  K
Subjt:  GKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHK

Query:  TEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIE
        +  +GI    ++L ++A  +GC++G WP  YLGLPL G P++L+FW PV++K+EKRL  W    LSKGGRLT IQA L ++P YY+SLFK P  V  K+E
Subjt:  TEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIE

Query:  KLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYG--SQHFDLQPGTKSLHSSKGPWKQINSTK
        +L RNFLW G    K  HL++W++V    EEGGLGI  ++ +N +L AKW+WR   E  +LW R+I +KYG  S  +D +   K   S + PW++I+   
Subjt:  KLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYG--SQHFDLQPGTKSLHSSKGPWKQINSTK

Query:  HLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWA
        +         VGNG+   FW+D W+    L+ +FPRL  LS RK+  IA F   H    +W F  RR L E EI +   LL +L  +    S  D   W 
Subjt:  HLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTR--SWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWA

Query:  LESNGNFSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFA-SMYWSYL
        +E  G+FS  S    L +S   D    +  IW  + P K++FF+W  ++  INT D +Q R P + +SPS C +C  +AE++ H+F  C ++  ++W  L
Subjt:  LESNGNFSTNSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFA-SMYWSYL

Query:  QAT-FEW
         A   EW
Subjt:  QAT-FEW

M5WJ76 Reverse transcriptase domain-containing protein (Fragment) | 5.8e-234 | 40.81
Query:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS
        R+S EKSN+   TK M++FN FI    L D  L +  FTW++ R  ++   +DRFL+S S  + F +     LPRITSDH PI+L   + +WGP+ F+F 
Subjt:  RWSWEKSNQRPPTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSL---IDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFS

Query:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI
        N WL+H  F++ ++ WW    + GW G+ FM +LK  K  ++ W+   FG  + D ++    L  +D +E  + LD  +   R ++ + +  LA +E+  
Subjt:  NFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAI

Query:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA
        WRQR K KW  +GD NT FFH      R++N I +L       +  DA+IE E + F+K L+S                                  ++A
Subjt:  WRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRA

Query:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY
        V D G +KSPGPDGF+  FF+  W ++K D+M V  DFF+S I+N   NET+ICLIPKK  +  V DYRPISL + LYK++++VL+  L++VL +TI++ 
Subjt:  VNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEY

Query:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL
        Q AFV  RQILDA LVANE+++E +++K K +  K+D EKA+D V+W F+D++   KGFG  WR WI GC+ SVN+SI+INGKPRGKF ASRGLRQGDPL
Subjt:  QSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPL

Query:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR
        SPFLF +V D LSRL+ +A    L+ G  + SG   + ++HLQFADDTI      E +  NL + +KLF + SG+ IN  K+  +GI      L ++A  
Subjt:  SPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADR

Query:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL
        +GC++G WP  YLGLPL G P++L+FW PV+EK+EKRL  W    LSKGGRLT IQA L ++P YY+SLFK P  V  K+E+L RNFLW G +  K  HL
Subjt:  YGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHL

Query:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK
        ++W++V    EEGGLGI  ++ +  +L AKW+WR   E  +LW R+I +KYG            + S+  PW++I+   +         VGNG+   FW+
Subjt:  LKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWK

Query:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHT--RSWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGNFSTNSLSKHLSVSCP
        D W+    L+ +FPRL  LS RK+  IA F   H    +W F  RR L E EI +   LL +L  +    S  D   W +E  G+FS  S    L +S  
Subjt:  DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHT--RSWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSL-DLWWWALESNGNFSTNSLSKHLSVSCP

Query:  GDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFA-SMYWSYLQAT-FEWPFPRSGDILSLLSL
         D    +  IW  + P K++FF+W  ++  INT D +Q R P + +SPS C +C  +AE++ H+F  C ++  ++W  L A   EW  P+    L  ++L
Subjt:  GDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFA-SMYWSYLQAT-FEWPFPRSGDILSLLSL

Query:  LFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIES----SSLLAISWSKLSSPFCNYSLSTLFNQWRCLL
           G        IL  C VHA FWN+W+ERN RIF   +  IG  +E         A  W+ +S  F +Y  ST+      +L
Subjt:  LFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKKDIGHFIES----SSLLAISWSKLSSPFCNYSLSTLFNQWRCLL

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 1.1e-43 | 24.52
Query:  EKSNQRPPTKGMKNFNKFIDFVELLDI-PLQHGK---FTWTSS--RAKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTL---GKERWGPTTFK
        ++S ++   K  +  N  +   +L+DI    H K   +T+ S+     S ID  + S +   K     +  +    SDH  IKL L      +   TT+K
Subjt:  EKSNQRPPTKGMKNFNKFIDFVELLDI-PLQHGK---FTWTSS--RAKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTL---GKERWGPTTFK

Query:  FSNFWLS----HKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAF--KPFIQEWNINTFGKKD--SDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDL
         +N  L+    H   +  ++ ++  +  +           KA     FI    +N + +K   S    L  +L +++ +E+        +  + S + ++
Subjt:  FSNFWLS----HKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAF--KPFIQEWNINTFGKKD--SDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDL

Query:  LTLAAREDAIWRQRCKFKWLSEGDENTAFFH-----------NYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGA
          + A    I  Q    K L + +E+ ++F              +   R +N I  + +  G    D   I+T   ++YK L++ K     L ++E+   
Subjt:  LTLAAREDAIWRQRCKFKWLSEGDENTAFFH-----------NYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGA

Query:  ISDNLS---------ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKK-IGAKSVGDYR
          D  +          SL  P T  E+   +N L + KSPGPDGFTAEF+++    L   ++ +F    K  I+  +  E  I LIPK         ++R
Subjt:  ISDNLS---------ASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKK-IGAKSVGDYR

Query:  PISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEK-EVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIR
        PISL +   KI+ ++L+ R+++ +   I   Q  F+   Q       +  +I    R K+K  V I +D EKAFD +   F+ +     G    + + IR
Subjt:  PISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEK-EVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIR

Query:  GCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKL
                +II+NG+    F    G RQG PLSP LF +V++ L+R + +   +  IKG+ +G     LS+    FADD I++         NL K I  
Subjt:  GCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKL

Query:  FEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYY
        F + SG  IN  K++      + Q+   +       I      YLG+ L    K L    ++P++++I++  + W +   S  GR+  ++  +    IY 
Subjt:  FEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYY

Query:  LSL--FKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAK--WIWRHHNEKGALWRR
         +    K P     ++EK    F+W  K       +L   K KA    GG+ +   +    + + K  W W + N     W R
Subjt:  LSL--FKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAK--WIWRHHNEKGALWRR

P08548 LINE-1 reverse transcriptase homolog | 1.4e-46 | 24.91
Query:  EKSNQRPPTKGMKNFNKFIDFVELLDI----PLQHGKFTWTSSR--AKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPT---TFK
        ++S+++  +K + + N  I  ++L DI         ++T+ SS     S ID  L   S   KF    +  +P I SDH+ IK+ L   R   T   T+K
Subjt:  EKSNQRPPTKGMKNFNKFIDFVELLDI----PLQHGKFTWTSSR--AKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPT---TFK

Query:  FSNFWLS--------HKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDL
         +N  L          K   + L+   NN+    +       K      FI    +  F KK ++++++   +  +   E+ +  +   S+R+   KI  
Subjt:  FSNFWLS--------HKSFEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDL

Query:  LTLAAREDAIWRQRCKFK-WLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDAS-IETEFVDFYKMLFSKKAGIRFLPDIEDW------GAISD
                 I +Q  K K W  E           +   +R  S++  +      +  D S I+    ++YK L+S K     L +I+ +        +S 
Subjt:  LTLAAREDAIWRQRCKFK-WLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDAS-IETEFVDFYKMLFSKKAGIRFLPDIEDW------GAISD

Query:  NLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIV
             L  P +  E+   + +L   KSPGPDGFT+EF++     L   ++ +F +  K  I+     E  I LIPK         +YRPISL +   KI+
Subjt:  NLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIV

Query:  ARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKE-VCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIII
         ++L+ R+++ +   I   Q  F+   Q       +  +I    + K K+ + + +D EKAFD +   F+    +  G   T+ + I    S    +II+
Subjt:  ARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKE-VCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIII

Query:  NGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCH
        NG     F    G RQG PLSP LF +V++ L+   I    +  IKG+H+GS    LS+    FADD I++          L + IK +   SG  IN H
Subjt:  NGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCH

Query:  KTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSL--FKAPKKV
        K+       + Q+  ++ D     +      YLG+ L    K L    +E + ++I + ++ W +   S  GR+  ++ ++    IY  +    KAP   
Subjt:  KTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSL--FKAPKKV

Query:  TVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAK--WIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQ
           +EK+  +F+W  K       LL  +K KA    GG+ +  ++    S++ K  W W H N +  +W R+       ++ ++ P T            
Subjt:  TVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAK--WIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQ

Query:  INSTKHLIFSNIHIKVGNGKDTSFWK---DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQH
             +LIF      +  GKD+ F K    NW+      ++ P L  L+      I D   +H
Subjt:  INSTKHLIFSNIHIKVGNGKDTSFWK---DNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQH

P0C2F6 Putative ribonuclease H protein At1g65750 | 4.9e-36 | 24.83
Query:  LPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGG
        +P+  K  +   +  ++E++  R+  W  + LS  GRLT  +A L ++P++ +S    P+ +  ++++L R FLW      K  HL+KW KV +P +EGG
Subjt:  LPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGG

Query:  LGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQIN-STKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIF
        LG+   ++ N +L++K  WR   EK +LW  V+  KY               S    W+ I    + ++   +    G+G+   FW D W+    L +  
Subjt:  LGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQIN-STKHLIFSNIHIKVGNGKDTSFWKDNWMGSSNLQQIF

Query:  PRLYHLSNRK-----DAPIA-DFWCQHTRSWSFYPRRPLL--EIEIEDWTSLLSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CPGDFSVL
             L N +     D  +A D W    R W F    P       +E    +L L+    ++ S     W    +G FS  S  + L+V      + +  
Subjt:  PRLYHLSNRK-----DAPIA-DFWCQHTRSWSFYPRRPLL--EIEIEDWTSLLSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVS--CPGDFSVL

Query:  YKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYW-----------SYLQATFEWPFPRSGDILSL
        +  +W  + P++VK FLW V +  + T+++   R    + + + C +C G  ES++H+   CP     W            + ++ FEW +   GD    
Subjt:  YKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYW-----------SYLQATFEWPFPRSGDILSL

Query:  LSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKK
                     + I W        W  W  R G IF +  K
Subjt:  LSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKKK

P11369 LINE-1 retrotransposable element ORF2 protein | 3.2e-35 | 24.74
Query:  IVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSA---------SLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKS
        I ++ +  G    D   I+     FYK L+S K     L ++++     D              L  P + +E+   +N L + KSPGPDGF+AEF++  
Subjt:  IVELLSRSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSA---------SLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKS

Query:  WNILKKDIMGVFNDFFKSAIINANLNETY----ICLIPK-KIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVAN
            K+D++ + +  F    +   L  ++    I LIPK +     + ++RPISL +   KI+ ++L+ R+++ +   I   Q  F+   Q       + 
Subjt:  WNILKKDIMGVFNDFFKSAIINANLNETY----ICLIPK-KIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVAN

Query:  ELIDEWQRKKEK-EVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLI
         +I    + K+K  + I LD EKAFD +   F+ ++    G    +   I+   S    +I +NG+         G RQG PLSP+LF +V++ L+R + 
Subjt:  ELIDEWQRKKEK-EVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLI

Query:  KADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPL
        +   Q  IKG+ +G     +S+     ADD I++ S  +     L   I  F E  G  IN +K+       + Q+   + +     I      YLG+ L
Subjt:  KADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPL

Query:  KGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSL--FKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEG
          + K L    ++ + ++I++ L  W     S  GR+  ++  +    IY  +    K P +   ++E     F+W  K       LLK  +       G
Subjt:  KGKPKSL--SFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSL--FKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEG

Query:  GLGIVGIQNKNGSLLAKWIWRHHNEKGA-LWRRVISTK-----YGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN
        G+ +  ++    +++ K  W  + ++    W R+   +     YG   FD   G K++      WK     K  IF+N
Subjt:  GLGIVGIQNKNGSLLAKWIWRHHNEKGA-LWRRVISTK-----YGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSN

P14381 Transposon TX1 uncharacterized 149 kDa protein | 9.8e-45 | 25.71
Query:  AKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNN--------HPLEGWPGHGFMKKLKAFKP
        ++S IDR  IS     +  ++++   P    +   +++++         + F+N  L  + F + +++ W            L  W   G +      K 
Subjt:  AKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNN--------HPLEGWPGHGFMKKLKAFKP

Query:  FIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDH-MSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLS
          QE+  +  G+++++ + L  E+  +D ++ L   +D  +    L  K  L  +  R+      R + + L + D  + FF+        +  I  L +
Subjt:  FIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDH-MSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLS

Query:  RSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDI--EDWG---AISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMG
          G  L D  +I      FY+ LFS        PD   E W     +S+     LE P T +E+ +A+  +  NKSPG DG T EFF+  W+ L  D   
Subjt:  RSGKSLVDDASIETEFVDFYKMLFSKKAGIRFLPDI--EDWG---AISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMG

Query:  VFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVC
        V  + FK   +  +     + L+PKK   + + ++RP+SL S  YKIVA+ +S RLK VL   I   QS  V  R I D   +  +L+   +R       
Subjt:  VFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVC

Query:  IKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSG
        + LD EKAFD VD ++L    +   FG  +  +++   +S    + IN          RG+RQG PLS  L+ + ++    LL K  L GL+        
Subjt:  IKLDIEKAFDMVDWEFLDEIFRVKGFGHTWRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSG

Query:  SHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPN---TYLGLPLKGK--PKSLSFWE
           + +    +ADD IL  +Q+   L+   +  +++  AS   IN  K+     GL   SL         +   W +    YLG+ L  +  P S +F E
Subjt:  SHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEASGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPN---TYLGLPLKGK--PKSLSFWE

Query:  PVIEKIEKRLHSWG--SQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGS
         + E +  RL  W   ++ LS  GR   I   + +   Y L      ++   KI++   +FLW GK      H +       P++EGG G+V I+++  +
Subjt:  PVIEKIEKRLHSWG--SQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGS

Query:  LLAKWIWRH-HNEKGALW--------RRVISTKYGSQHFDLQP
           + I R+ + +    W        R+V +  Y  Q F ++P
Subjt:  LLAKWIWRH-HNEKGALW--------RRVISTKYGSQHFDLQP

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 4.8e-31 | 25.78
Query:  PTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSLI----DRFLISDSCAQKFGNA-SVNRLPRITSDHYPIKLTL-GKERWGPTTFKFSNFWLSHKS
        P +G++ F   +   +L+DIP +   +TW++ +  + I    DR + +      F +A +V  L  + SDH P  + L    +     F++ +F  +H +
Subjt:  PTKGMKNFNKFIDFVELLDIPLQHGKFTWTSSRAKSLI----DRFLISDSCAQKFGNA-SVNRLPRITSDHYPIKLTL-GKERWGPTTFKFSNFWLSHKS

Query:  FEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLD-----DHMSKRRLSIKIDLLTLAAREDAIWRQ
        F   L   W      G       + LKA K   +  N   FG      ++ +  L  I ++   +  D     +H+++++ +        AA  ++ +RQ
Subjt:  FEQLLQNWWNNHPLEGWPGHGFMKKLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLD-----DHMSKRRLSIKIDLLTLAAREDAIWRQ

Query:  RCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKA------GIRFLPDIEDWGAISDNLSASLEVPFTEEEV
        + + KWL +GD NT FFH  + A + +N I  L       + +   ++   V +Y  L    +       ++ + DI  +   +D L++ L    +++E+
Subjt:  RCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGKSLVDDASIETEFVDFYKMLFSKKA------GIRFLPDIEDWGAISDNLSASLEVPFTEEEV

Query:  HRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIV
          AV  +  NK+PGPD FTAEFF +SW ++K   +    +FF++  +    N T I LIPK  G   +  +RP+S  + +YKI+
Subjt:  HRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 7.0e-06 | 31.65
Query:  ERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKE--VCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTW
        ERLK ++ + I   Q++F+  R   D  +   E +   +RKK  +  + +KLD+EKA+D + W++L++     GF   W
Subjt:  ERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKE--VCIKLDIEKAFDMVDWEFLDEIFRVKGFGHTW

AT4G29090.1 Ribonuclease H-like superfamily protein | 2.0e-29 | 26.13
Query:  LPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPG
        LP Y ++ F  PK V  +I  +  +F WR K  +KG H   WD +     EGG+G   I+  N +LL K +WR  +   +L  +V  ++Y  +   L   
Subjt:  LPIYYLSLFKAPKKVTVKIEKLYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPG

Query:  TKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSS------NLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEI---EIE
          S  S    WK I++++ ++       VGNG+D   W+  W+ S        +Q++ P+ Y  S      ++D   +  R W    R+ ++E+   E+E
Subjt:  TKSLHSSKGPWKQINSTKHLIFSNIHIKVGNGKDTSFWKDNWMGSS------NLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEI---EIE

Query:  DWTSLLSLLQPMNNQDSLDLWWWALESNGNFST-----------NSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWL
            L+  L+P   +  LD + W   S+G+++            N  S    VS P   + +Y+ IW  Q   K++ FLW+   + +     L +R    
Subjt:  DWTSLLSLLQPMNNQDSLDLWWWALESNGNFST-----------NSLSKHLSVSCPGDFSVLYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWL

Query:  VISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGD-----ILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKK
        +   S C  C    E+V H+   C FA + W    A    P P  G+      ++L  +  +G+     +K   L  V    W LW  RN  +F  ++
Subjt:  VISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGD-----ILSLLSLLFMGHPFKNEKKILWLCHVHAFFWNLWLERNGRIFSDKK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 3.9e-12 | 54.29
Query:  IINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDT
        IING P+G    SRGLRQGDPLSP+LFI+  + LS L  +A  QG + G+ V + S    I HL FADDT
Subjt:  IINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDT


Sequences
CDS sequence
ATGCCCATCAAGATTGTCAACACTTTTGAGAAGCTTTTGAGAAATTTCTTTTGGAATGGCTCTAGTCTTGATGGGGCATGCCCAATATTAATTGGGGAAAGACAAAGCTT
CCTCTTGTCTTGGGTGGATGGGGTATTGGCAACATTTGAAGGGGGAATGAAGTCCTTCTTTCAAAATGGGTTTGGAGATTCTTACATGAACCTAAAGCCCTTTGGCACAA
GATGGAGTCTTGTTTCAAAGACTCAAAAAGATGGTGGGCTCGGAATTGGTGTGCTAAAACAAAGAAATATTGCTAATATTGCTTTGTTGGCCAAATGGGGTTGGCGTTTT
AAGATGGAACCTCAATCTTTATGGAGAGAAGTTATTGTTAGCATCCATGGTTCTTGTAATAATGGATGGAATTCAGATTCTTGGAATTTATCTTTTCAACGCCTTTTGAT
TGATGAGGAGATTGTTGATTTTCAGTCTTTGTTACTTAACATTTATGGGGCTTTATTAAATCCATATGGAGCTCGCGTTTTATCGGATGGTCCTCCATTGACGCCATCGG
ATCATCGGGAGGCATTCTCATTATGTGGAATGAAATTATCCTTAGCATCATCGAGATGGTCTTGGGAAAAATCCAATCAGAGGCCTCCTACCAAAGGCATGAAAAATTTC
AACAAATTTATAGATTTTGTGGAGCTTCTGGACATCCCGTTGCAACATGGTAAATTCACATGGACTAGTAGTCGGGCAAAATCCCTCATTGACCGTTTCTTGATATCAGA
TAGTTGCGCTCAGAAATTTGGTAATGCCTCTGTTAATCGCCTCCCTCGCATCACATCTGACCATTATCCTATTAAGCTTACCCTGGGCAAAGAGAGATGGGGGCCAACAA
CTTTTAAATTCTCCAACTTCTGGCTATCTCACAAGTCCTTTGAACAATTGCTTCAGAATTGGTGGAACAATCACCCTCTGGAAGGTTGGCCTGGTCACGGCTTTATGAAA
AAGCTCAAAGCCTTCAAGCCTTTTATCCAAGAATGGAATATCAACACCTTTGGTAAAAAGGATTCTGATAAACAGGACCTTATAAAAGAGCTAAATGATATAGATACCAA
GGAAGAGTTGGACGTATTGGATGATCATATGTCCAAGCGTAGACTATCCATTAAAATAGACCTTTTAACCTTGGCAGCCCGAGAAGATGCTATATGGAGACAACGTTGCA
AATTCAAATGGCTTTCGGAGGGAGATGAGAACACTGCTTTTTTCCACAATTATATGGCTGCCACTCGAAGACAGAACTCCATTGTGGAACTTTTATCGCGATCGGGTAAG
AGTCTAGTTGATGATGCTAGCATTGAAACAGAGTTTGTGGATTTCTACAAGATGTTATTTTCTAAAAAGGCTGGAATCCGGTTTTTACCTGACATAGAAGACTGGGGTGC
CATATCAGACAACCTTAGTGCTAGCTTGGAAGTCCCTTTCACAGAAGAAGAAGTCCATAGAGCTGTCAACGATTTGGGATCGAACAAATCTCCCGGCCCGGATGGTTTCA
CGGCGGAATTCTTTAAAAAATCATGGAACATTCTTAAGAAAGACATTATGGGGGTGTTCAATGATTTTTTTAAGAGTGCTATTATAAACGCCAACCTCAATGAGACTTAT
ATCTGTCTTATCCCAAAGAAAATAGGAGCTAAATCGGTTGGAGACTATAGACCCATTAGCCTTACATCTTGCCTCTACAAAATTGTGGCTCGTGTCTTATCAGAAAGATT
GAAGAAAGTCTTACCCCACACTATCGCCGAATACCAATCTGCCTTTGTTGCAGATAGACAAATCTTAGATGCCTCTCTTGTTGCTAACGAGCTTATTGACGAGTGGCAAA
GGAAAAAGGAAAAAGAAGTATGCATCAAGCTTGATATCGAAAAGGCCTTTGATATGGTTGATTGGGAATTCCTTGACGAGATTTTTCGTGTTAAGGGTTTCGGACACACA
TGGAGGAGATGGATTAGGGGATGTATATCTTCGGTCAACTATTCTATTATCATAAATGGGAAACCAAGAGGAAAATTTGGTGCCTCCCGTGGACTTCGACAAGGTGACCC
TTTATCTCCATTCTTATTTATTATGGTTGTTGATTGCCTTAGTAGGTTGCTTATAAAGGCCGATCTACAAGGTCTTATTAAGGGCCTCCACGTTGGTTCCGGGTCACATG
CTCTCTCCATCACCCATCTACAATTTGCGGATGACACTATCCTTTTCTCCTCCCAAAACGAAGCTCACCTTGACAACCTTTTCAAATCGATAAAGCTTTTTGAGGAGGCA
TCAGGGCTGAATATAAATTGTCATAAAACAGAGTTCATGGGCATTGGTTTGGACCCTCAATCTCTTGGTTCTTTGGCTGATCGTTATGGTTGCAAAATTGGTGGCTGGCC
AAACACTTACTTGGGCCTTCCATTGAAGGGGAAGCCGAAGTCTTTATCTTTCTGGGAGCCTGTTATAGAGAAAATTGAAAAAAGACTTCATTCTTGGGGATCCCAACACC
TCTCGAAAGGAGGTAGACTCACCTTCATTCAAGCTACCCTTCAGAATTTGCCCATTTATTATTTATCCCTCTTTAAAGCCCCAAAAAAGGTCACCGTAAAGATTGAAAAG
TTGTATCGAAACTTTTTATGGCGGGGCAAAAATGGTTCCAAAGGCTCTCATCTTTTGAAATGGGACAAAGTTAAAGCTCCGATTGAAGAAGGTGGTTTGGGCATTGTCGG
TATTCAAAACAAAAACGGCTCTCTCTTAGCAAAATGGATCTGGCGACATCATAATGAAAAAGGCGCTCTATGGCGTAGGGTCATTTCTACAAAATATGGATCCCAACACT
TTGATCTTCAGCCTGGCACTAAATCATTACACTCATCAAAAGGTCCTTGGAAACAGATTAACAGTACGAAGCATCTCATTTTTTCAAATATTCATATCAAAGTGGGGAAT
GGAAAAGACACATCATTCTGGAAGGACAATTGGATGGGGAGCTCTAACCTTCAACAAATTTTCCCTAGACTATATCATCTTTCTAATAGAAAGGATGCACCTATCGCAGA
TTTTTGGTGTCAACATACTCGATCTTGGTCCTTTTATCCTAGAAGACCTTTATTGGAGATTGAAATCGAAGATTGGACCTCCCTCCTCTCATTATTGCAGCCGATGAACA
ATCAAGACAGTTTAGACTTGTGGTGGTGGGCCCTTGAATCTAATGGGAACTTCTCCACCAACTCACTATCAAAGCACCTCTCAGTATCTTGCCCCGGTGATTTTTCAGTC
TTATATAAGCATATATGGGCGGGTCAATATCCAAAGAAGGTGAAATTCTTTCTCTGGGAGGTTAGCCATTCTTGTATCAACACTCAAGACAAGCTCCAACATAGATCTCC
ATGGCTGGTGATTTCTCCTTCTTGCTGCCCGATGTGCCACGGAGATGCAGAATCAGTGATTCACATTTTCAGCACTTGTCCTTTCGCCTCTATGTATTGGAGCTATTTAC
AAGCGACTTTCGAATGGCCCTTCCCTAGATCGGGTGATATCCTCTCTCTCTTATCTCTTCTTTTTATGGGCCACCCCTTTAAGAATGAAAAGAAGATCCTTTGGCTATGT
CATGTTCATGCATTTTTTTGGAACCTATGGTTGGAGCGTAATGGTCGTATCTTTTCTGATAAAAAGAAAGACATTGGTCATTTTATTGAGTCTTCTTCATTATTAGCTAT
CTCTTGGAGTAAATTATCCTCTCCTTTTTGTAATTATAGTCTCTCTACCCTCTTTAATCAATGGAGGTGTCTTTTGTAA
mRNA sequence
ATGCCCATCAAGATTGTCAACACTTTTGAGAAGCTTTTGAGAAATTTCTTTTGGAATGGCTCTAGTCTTGATGGGGCATGCCCAATATTAATTGGGGAAAGACAAAGCTT
CCTCTTGTCTTGGGTGGATGGGGTATTGGCAACATTTGAAGGGGGAATGAAGTCCTTCTTTCAAAATGGGTTTGGAGATTCTTACATGAACCTAAAGCCCTTTGGCACAA
GATGGAGTCTTGTTTCAAAGACTCAAAAAGATGGTGGGCTCGGAATTGGTGTGCTAAAACAAAGAAATATTGCTAATATTGCTTTGTTGGCCAAATGGGGTTGGCGTTTT
AAGATGGAACCTCAATCTTTATGGAGAGAAGTTATTGTTAGCATCCATGGTTCTTGTAATAATGGATGGAATTCAGATTCTTGGAATTTATCTTTTCAACGCCTTTTGAT
TGATGAGGAGATTGTTGATTTTCAGTCTTTGTTACTTAACATTTATGGGGCTTTATTAAATCCATATGGAGCTCGCGTTTTATCGGATGGTCCTCCATTGACGCCATCGG
ATCATCGGGAGGCATTCTCATTATGTGGAATGAAATTATCCTTAGCATCATCGAGATGGTCTTGGGAAAAATCCAATCAGAGGCCTCCTACCAAAGGCATGAAAAATTTC
AACAAATTTATAGATTTTGTGGAGCTTCTGGACATCCCGTTGCAACATGGTAAATTCACATGGACTAGTAGTCGGGCAAAATCCCTCATTGACCGTTTCTTGATATCAGA
TAGTTGCGCTCAGAAATTTGGTAATGCCTCTGTTAATCGCCTCCCTCGCATCACATCTGACCATTATCCTATTAAGCTTACCCTGGGCAAAGAGAGATGGGGGCCAACAA
CTTTTAAATTCTCCAACTTCTGGCTATCTCACAAGTCCTTTGAACAATTGCTTCAGAATTGGTGGAACAATCACCCTCTGGAAGGTTGGCCTGGTCACGGCTTTATGAAA
AAGCTCAAAGCCTTCAAGCCTTTTATCCAAGAATGGAATATCAACACCTTTGGTAAAAAGGATTCTGATAAACAGGACCTTATAAAAGAGCTAAATGATATAGATACCAA
GGAAGAGTTGGACGTATTGGATGATCATATGTCCAAGCGTAGACTATCCATTAAAATAGACCTTTTAACCTTGGCAGCCCGAGAAGATGCTATATGGAGACAACGTTGCA
AATTCAAATGGCTTTCGGAGGGAGATGAGAACACTGCTTTTTTCCACAATTATATGGCTGCCACTCGAAGACAGAACTCCATTGTGGAACTTTTATCGCGATCGGGTAAG
AGTCTAGTTGATGATGCTAGCATTGAAACAGAGTTTGTGGATTTCTACAAGATGTTATTTTCTAAAAAGGCTGGAATCCGGTTTTTACCTGACATAGAAGACTGGGGTGC
CATATCAGACAACCTTAGTGCTAGCTTGGAAGTCCCTTTCACAGAAGAAGAAGTCCATAGAGCTGTCAACGATTTGGGATCGAACAAATCTCCCGGCCCGGATGGTTTCA
CGGCGGAATTCTTTAAAAAATCATGGAACATTCTTAAGAAAGACATTATGGGGGTGTTCAATGATTTTTTTAAGAGTGCTATTATAAACGCCAACCTCAATGAGACTTAT
ATCTGTCTTATCCCAAAGAAAATAGGAGCTAAATCGGTTGGAGACTATAGACCCATTAGCCTTACATCTTGCCTCTACAAAATTGTGGCTCGTGTCTTATCAGAAAGATT
GAAGAAAGTCTTACCCCACACTATCGCCGAATACCAATCTGCCTTTGTTGCAGATAGACAAATCTTAGATGCCTCTCTTGTTGCTAACGAGCTTATTGACGAGTGGCAAA
GGAAAAAGGAAAAAGAAGTATGCATCAAGCTTGATATCGAAAAGGCCTTTGATATGGTTGATTGGGAATTCCTTGACGAGATTTTTCGTGTTAAGGGTTTCGGACACACA
TGGAGGAGATGGATTAGGGGATGTATATCTTCGGTCAACTATTCTATTATCATAAATGGGAAACCAAGAGGAAAATTTGGTGCCTCCCGTGGACTTCGACAAGGTGACCC
TTTATCTCCATTCTTATTTATTATGGTTGTTGATTGCCTTAGTAGGTTGCTTATAAAGGCCGATCTACAAGGTCTTATTAAGGGCCTCCACGTTGGTTCCGGGTCACATG
CTCTCTCCATCACCCATCTACAATTTGCGGATGACACTATCCTTTTCTCCTCCCAAAACGAAGCTCACCTTGACAACCTTTTCAAATCGATAAAGCTTTTTGAGGAGGCA
TCAGGGCTGAATATAAATTGTCATAAAACAGAGTTCATGGGCATTGGTTTGGACCCTCAATCTCTTGGTTCTTTGGCTGATCGTTATGGTTGCAAAATTGGTGGCTGGCC
AAACACTTACTTGGGCCTTCCATTGAAGGGGAAGCCGAAGTCTTTATCTTTCTGGGAGCCTGTTATAGAGAAAATTGAAAAAAGACTTCATTCTTGGGGATCCCAACACC
TCTCGAAAGGAGGTAGACTCACCTTCATTCAAGCTACCCTTCAGAATTTGCCCATTTATTATTTATCCCTCTTTAAAGCCCCAAAAAAGGTCACCGTAAAGATTGAAAAG
TTGTATCGAAACTTTTTATGGCGGGGCAAAAATGGTTCCAAAGGCTCTCATCTTTTGAAATGGGACAAAGTTAAAGCTCCGATTGAAGAAGGTGGTTTGGGCATTGTCGG
TATTCAAAACAAAAACGGCTCTCTCTTAGCAAAATGGATCTGGCGACATCATAATGAAAAAGGCGCTCTATGGCGTAGGGTCATTTCTACAAAATATGGATCCCAACACT
TTGATCTTCAGCCTGGCACTAAATCATTACACTCATCAAAAGGTCCTTGGAAACAGATTAACAGTACGAAGCATCTCATTTTTTCAAATATTCATATCAAAGTGGGGAAT
GGAAAAGACACATCATTCTGGAAGGACAATTGGATGGGGAGCTCTAACCTTCAACAAATTTTCCCTAGACTATATCATCTTTCTAATAGAAAGGATGCACCTATCGCAGA
TTTTTGGTGTCAACATACTCGATCTTGGTCCTTTTATCCTAGAAGACCTTTATTGGAGATTGAAATCGAAGATTGGACCTCCCTCCTCTCATTATTGCAGCCGATGAACA
ATCAAGACAGTTTAGACTTGTGGTGGTGGGCCCTTGAATCTAATGGGAACTTCTCCACCAACTCACTATCAAAGCACCTCTCAGTATCTTGCCCCGGTGATTTTTCAGTC
TTATATAAGCATATATGGGCGGGTCAATATCCAAAGAAGGTGAAATTCTTTCTCTGGGAGGTTAGCCATTCTTGTATCAACACTCAAGACAAGCTCCAACATAGATCTCC
ATGGCTGGTGATTTCTCCTTCTTGCTGCCCGATGTGCCACGGAGATGCAGAATCAGTGATTCACATTTTCAGCACTTGTCCTTTCGCCTCTATGTATTGGAGCTATTTAC
AAGCGACTTTCGAATGGCCCTTCCCTAGATCGGGTGATATCCTCTCTCTCTTATCTCTTCTTTTTATGGGCCACCCCTTTAAGAATGAAAAGAAGATCCTTTGGCTATGT
CATGTTCATGCATTTTTTTGGAACCTATGGTTGGAGCGTAATGGTCGTATCTTTTCTGATAAAAAGAAAGACATTGGTCATTTTATTGAGTCTTCTTCATTATTAGCTAT
CTCTTGGAGTAAATTATCCTCTCCTTTTTGTAATTATAGTCTCTCTACCCTCTTTAATCAATGGAGGTGTCTTTTGTAA
Protein sequence
MPIKIVNTFEKLLRNFFWNGSSLDGACPILIGERQSFLLSWVDGVLATFEGGMKSFFQNGFGDSYMNLKPFGTRWSLVSKTQKDGGLGIGVLKQRNIANIALLAKWGWRF
KMEPQSLWREVIVSIHGSCNNGWNSDSWNLSFQRLLIDEEIVDFQSLLLNIYGALLNPYGARVLSDGPPLTPSDHREAFSLCGMKLSLASSRWSWEKSNQRPPTKGMKNF
NKFIDFVELLDIPLQHGKFTWTSSRAKSLIDRFLISDSCAQKFGNASVNRLPRITSDHYPIKLTLGKERWGPTTFKFSNFWLSHKSFEQLLQNWWNNHPLEGWPGHGFMK
KLKAFKPFIQEWNINTFGKKDSDKQDLIKELNDIDTKEELDVLDDHMSKRRLSIKIDLLTLAAREDAIWRQRCKFKWLSEGDENTAFFHNYMAATRRQNSIVELLSRSGK
SLVDDASIETEFVDFYKMLFSKKAGIRFLPDIEDWGAISDNLSASLEVPFTEEEVHRAVNDLGSNKSPGPDGFTAEFFKKSWNILKKDIMGVFNDFFKSAIINANLNETY
ICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIAEYQSAFVADRQILDASLVANELIDEWQRKKEKEVCIKLDIEKAFDMVDWEFLDEIFRVKGFGHT
WRRWIRGCISSVNYSIIINGKPRGKFGASRGLRQGDPLSPFLFIMVVDCLSRLLIKADLQGLIKGLHVGSGSHALSITHLQFADDTILFSSQNEAHLDNLFKSIKLFEEA
SGLNINCHKTEFMGIGLDPQSLGSLADRYGCKIGGWPNTYLGLPLKGKPKSLSFWEPVIEKIEKRLHSWGSQHLSKGGRLTFIQATLQNLPIYYLSLFKAPKKVTVKIEK
LYRNFLWRGKNGSKGSHLLKWDKVKAPIEEGGLGIVGIQNKNGSLLAKWIWRHHNEKGALWRRVISTKYGSQHFDLQPGTKSLHSSKGPWKQINSTKHLIFSNIHIKVGN
GKDTSFWKDNWMGSSNLQQIFPRLYHLSNRKDAPIADFWCQHTRSWSFYPRRPLLEIEIEDWTSLLSLLQPMNNQDSLDLWWWALESNGNFSTNSLSKHLSVSCPGDFSV
LYKHIWAGQYPKKVKFFLWEVSHSCINTQDKLQHRSPWLVISPSCCPMCHGDAESVIHIFSTCPFASMYWSYLQATFEWPFPRSGDILSLLSLLFMGHPFKNEKKILWLC
HVHAFFWNLWLERNGRIFSDKKKDIGHFIESSSLLAISWSKLSSPFCNYSLSTLFNQWRCLL