; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G18940 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G18940
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr1:14348773..14353597
RNA-Seq ExpressionCSPI01G18940
SyntenyCSPI01G18940
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-23840.82Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        MW+D  + +       F++SI++     +N   WWL+A+YGP+KR+NR  FW ELE +KS CLP W++GGDFN++RW+ ETT KN  +  +  F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LRAQ  LSRLDRFL+T +WE +F  H SK+L RTTSDHFPI LES+++ W  SPFRFTN++LK+  +K+NIE WW NT+Q G+
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
         GYSFM RL QL+  IK+W +  K  N+  ++  +KE++ IDKLEAE + TE+H   RT++K DL+Q+ + EAQ+WAQKCKR+W  EGDENS+F+HKIC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEI--------------------------YN
         RQ++  IS I    G +C  D DI   FI HF +IYTD +    FIENL   PI  +  + L K F E EI                          ++
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEI--------------------------YN

Query:  ALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         +K+NI +IF DFH   IINK+VN T I LIAKKE C   +D+RPISLTT +YKLI K +A+RLK  LPD ISE+Q+AFV+GRQ+ +AILI NEA+DFW+
Subjt:  ALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
         KK +GFV+KLDIEK FDK+NW  ID++L KK +  KW   I +CISSVQYSILINGRP+ +IKP RGIRQGDP+SPFIFVLAMDYLS LLN+L  +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
         GV F+   NLTH+LFADDIL+F+ED DD + N++  L LFE  SGLNINL+KSTI PIN+ T R   +A +WG S   LP  YLG+PLGG+P+S  FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  E------------------------------------------------------------------------------PPEENQQLEI-----------
                                                                                        P+E   L I           
Subjt:  E------------------------------------------------------------------------------PPEENQQLEI-----------

Query:  -------------------------------------------------CFSF---------QRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                         C S+           G+ +SFW   W+  +P + + PRLFALS+ K+ S+ 
Subjt:  -------------------------------------------------CFSF---------QRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYL-------------------------------------------------FP------------
          WN    DW L+  RPLR  EE LW N+KASLP  L                                                 FP            
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYL-------------------------------------------------FP------------

Query:  -CINTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVL-----------------------ERPLLTLNPA-----NIWNERNR
         CINT D LQ+RLP W L+P+WC +C  ++ED  HLF  CP+S +LW   + +L                       ++ L+T N        IW ERN 
Subjt:  -CINTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVL-----------------------ERPLLTLNPA-----NIWNERNR

Query:  RIFKGEEKTVDYVWEDT
        RIFK +EK    +WEDT
Subjt:  RIFKGEEKTVDYVWEDT

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.8e-21638.32Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        +WDDT +KV+D  V N+++S+ I     + N +WWLT+VYGP K  +R   W ELE ++S CLP WL+ GDFNIVRW+ ET AK++    +A F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LR  P  SRLDRFL +  WE  F  H S+ L R  SDHFPI LES  +KW   PFR  NS L++  F++N   WW ++ Q G 
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PGY+F+  L  LS  IK W  +     D  ++ LLKE++ IDKLE +  ++  H   R S+K+DL  +   +AQ+W Q+ ++ WN+ GDEN++++H+IC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------
        + QR++ I SI    G    +  DI +TFI HF  IYT +  +   I+NL   PI  +   +LCK F E EI +                          
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------

Query:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         LK ++L +F DFH+ GI+N  VN+TFIALI+KKE CS PSDYRPISLTT LYK++ K +A RLK+ LPD I+ENQ+AF++GRQ+NDAILI NEA+D WK
Subjt:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
        Q+K KGFV+KLDIEK FDKI+W+ IDYML KK FPHKW  WIK CIS+VQYSIL+NG PK +IK  RGIRQGDP+SPFIFVLAMDYLS LL+HLE +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
        KGVSFN   N++HLLFADD+L+F+ED++  ++N++ AL LFE  SGL  N +KSTISPINI   RT+ +AS +GF   FLP+ YLGVPLGG P SR FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------
        +  E                                                               +N  L    IC S +                  
Subjt:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------

Query:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                                        G +LSFWHS+WH   P +   PRL+ALS+ +  ++ 
Subjt:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------
         +W+    DW++ PRRPL   E+  WD++K SLP          P   P                                                   
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------

Query:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN
           +NTMD +Q+R P+ +LNPSWCI C+++ ED  HLF  CPF+  LW         P++  N  +                            IW  RN
Subjt:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN

Query:  RRIFKGEEKTVDYVWED
          IF  ++ +    WED
Subjt:  RRIFKGEEKTVDYVWED

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.3e-23441.39Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------
        MWDD ++ V DFI  NF+LSI I+ P   +N  WWL+A+YGPS   NR  FW EL ++K+ C P WL+ GDFN+VR+ SET+A+N               
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------

Query:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
           I  P       W +LR  P+LSR+DRFLYT  WE LF  H+SK L R TSDHFPI LES+ + W  SPF+  N  LKE  FK NI  WWKN  Q+GH
Subjt:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PG+SFM +L QLS  I++  +  K  +D+++   +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKICS
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNALKKNILEIFNDFHENGIINKIVNS
         RQRRS IS+I++A G  C+T++ I K F+DHF +IY    +   W I+NL  +PI       LC  F EEEI+ AL        +       ++  +N 
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNALKKNILEIFNDFHENGIINKIVNS

Query:  TFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLI
        T IALIAKKE C+ P+DYRPISLTT +YKLI KVIAERLK  LP  ++ENQ+AFV+ RQ+ DAIL+ NEA+D+W+ KK +GFV+KLDIEK FDK+NW  I
Subjt:  TFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLI

Query:  DYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFME
        D+ML KKG+P KW +WI+ CISSVQYSI+INGRP+ KI+P RGIRQGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+E
Subjt:  DYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFME

Query:  DDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQF---------------------------
        DD+ +I N++  + LF+L SGL+INLNKSTISPIN+D  RT  +AS WG S  FLPI YLGVPLGGK  ++ F                           
Subjt:  DDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQF---------------------------

Query:  -----------------------------------WSEPPEENQ--------------------------------------------------------
                                           W  PPE ++                                                        
Subjt:  -----------------------------------WSEPPEENQ--------------------------------------------------------

Query:  -----------------------------QLEICFSFQRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEAL
                                     Q  + +  + G + SFWH  WH+ SP +   PRL+ALS+ KE+SI +MWN   +DWDL PRR LR  E  L
Subjt:  -----------------------------QLEICFSFQRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEAL

Query:  WDNM-------------------------------------------------------KASLP--------PYLFPCINTMDMLQRRLPTWNLNPSWCI
        W  +                                                       K S+P          L+  +NT + L +RLP     PSWC+
Subjt:  WDNM-------------------------------------------------------KASLP--------PYLFPCINTMDMLQRRLPTWNLNPSWCI

Query:  LCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPA----------------------------NIWNERNRRIFKGEEKTVDYVWED
        +CK  +EDR HLF LCP +  +W+ +   L   +  L+P                             NIW ERN RIF G+EKTV  +WED
Subjt:  LCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPA----------------------------NIWNERNRRIFKGEEKTVDYVWED

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-21538.23Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        +WDDT +KV+D  V N+++S+ I     + N +WWLT+VYGP K  +R   W ELE ++S CLP WL+ GDFNIVRW+ ET AK++    +A F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LR  P  SRLDRFL +  WE  F  H S+ L R  SDHFPI LES  +KW   PFR  NS L++  F++N   WW ++ Q G 
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PGY+F+  L  LS  IK W  +     D  ++ LLKE++ IDKLE +  ++  H   R S+K+DL  +   +AQ+W Q+ ++ WN+ GDEN++++H+IC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------
        + QR++ I SI    G    +  DI +TFI HF  IYT +  +   I+NL   PI  +   +LCK F E EI +                          
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------

Query:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         LK ++L +F DFH+ GI+N  VN+TFIALI+KKE CS PSDYRPISLTT LYK++ K +A RLK+ LPD I+ENQ+AF++GRQ+NDAILI NE +D WK
Subjt:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
        Q+K KGFV+KLDIEK FDKI+W+ IDYML KK FPHKW  WIK CIS+VQYSIL+NG PK +IK  RGIRQGDP+SPFIFVLAMDYLS LL+HLE +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
        KGVSFN   N++HLLFADD+L+F+ED++  ++N++ AL LFE  SGL  N +KSTISPINI   RT+ +AS +GF   FLP+ YLGVPLGG P SR FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------
        +  E                                                               +N  L    IC S +                  
Subjt:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------

Query:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                                        G +LSFWHS+WH   P +   PRL+ALS+ +  ++ 
Subjt:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------
         +W+    DW++ PRRPL   E+  WD++K SLP          P   P                                                   
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------

Query:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN
           +NTMD +Q+R P+ +LNPSWCI C+++ ED  HLF  CPF+  LW         P++  N  +                            IW  RN
Subjt:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN

Query:  RRIFKGEEKTVDYVWED
          IF  ++ +    WED
Subjt:  RRIFKGEEKTVDYVWED

TYK24536.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.0e-21945.6Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------
        MWDD ++ V D I   F+LSI I++P   ++  WWL+A+YGPS   NR  FW EL ++K+ C P WL+ GDFN VR+ SET+ +N               
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------

Query:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
           I  P       W +LR QP+LSR+DRFLYT  WE LF  H+SK L R TSDHFPI LES+ + W  SPF+F N  LKE  FK+N+ +WWKN  Q GH
Subjt:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PG+SFM +L QLS  I+   K  K  ND+E++  +KE+++ID+LEAE N +E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKICS
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA------------------------
         RQRRS IS+I++  G  C+T++ I K F+DHF +IY    +   W I+NL  +PI      +LC  F EEEI+ A                        
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA------------------------

Query:  --LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFW
          LK+ I  IF DFH N IINK VN T IALIAKKE C+ P+DYRP                         I++ENQ+AFV+GRQ+ DAIL+ NEA+D+W
Subjt:  --LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFW

Query:  KQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENL
        + KK +GFV+KLDIEK FDK+NW  ID+ML KKG+P +W  WI+ CISSVQYSI+INGRP+ KI+P RGIRQGDPISPFIFVLAMDY+S LLN + ++  
Subjt:  KQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENL

Query:  IKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFW
        IKGV   G  NLTHLLFADDILLF+EDD+ +I N++  + LF+L SGL+INLNKSTISPIN+   RT  +AS WG S  FLPI YLGVPLGGK  ++ FW
Subjt:  IKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFW

Query:  SEPPEE-NQQLEICFSFQRGDTLSFWHSRWHELSPFTQ---SNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCIN
            E+ N++L    +  +   LS   +  H  SP+ +       +F   S+ ++SI +MWN   +DWDL PRR +R  E  LW  +K SL      C N
Subjt:  SEPPEE-NQQLEICFSFQRGDTLSFWHSRWHELSPFTQ---SNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCIN

Query:  TMDMLQRRLPTWNLNP----SWCILCKAAEEDRQHLFSLCPFSS--KLWKN---------------------VEVVLERPLLTLNPA-------------
          D      PTW LN     +   + KA ++  Q    L   ++   LWK                       ++    P L   P+             
Subjt:  TMDMLQRRLPTWNLNP----SWCILCKAAEEDRQHLFSLCPFSS--KLWKN---------------------VEVVLERPLLTLNPA-------------

Query:  --------NIWNERNRRIFKGEEKTVDYVWED
                NIW ERN RIF G+EKTV  +WED
Subjt:  --------NIWNERNRRIFKGEEKTVDYVWED

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.8e-23840.82Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        MW+D  + +       F++SI++     +N   WWL+A+YGP+KR+NR  FW ELE +KS CLP W++GGDFN++RW+ ETT KN  +  +  F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LRAQ  LSRLDRFL+T +WE +F  H SK+L RTTSDHFPI LES+++ W  SPFRFTN++LK+  +K+NIE WW NT+Q G+
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
         GYSFM RL QL+  IK+W +  K  N+  ++  +KE++ IDKLEAE + TE+H   RT++K DL+Q+ + EAQ+WAQKCKR+W  EGDENS+F+HKIC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEI--------------------------YN
         RQ++  IS I    G +C  D DI   FI HF +IYTD +    FIENL   PI  +  + L K F E EI                          ++
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEI--------------------------YN

Query:  ALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         +K+NI +IF DFH   IINK+VN T I LIAKKE C   +D+RPISLTT +YKLI K +A+RLK  LPD ISE+Q+AFV+GRQ+ +AILI NEA+DFW+
Subjt:  ALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
         KK +GFV+KLDIEK FDK+NW  ID++L KK +  KW   I +CISSVQYSILINGRP+ +IKP RGIRQGDP+SPFIFVLAMDYLS LLN+L  +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
         GV F+   NLTH+LFADDIL+F+ED DD + N++  L LFE  SGLNINL+KSTI PIN+ T R   +A +WG S   LP  YLG+PLGG+P+S  FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  E------------------------------------------------------------------------------PPEENQQLEI-----------
                                                                                        P+E   L I           
Subjt:  E------------------------------------------------------------------------------PPEENQQLEI-----------

Query:  -------------------------------------------------CFSF---------QRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                         C S+           G+ +SFW   W+  +P + + PRLFALS+ K+ S+ 
Subjt:  -------------------------------------------------CFSF---------QRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYL-------------------------------------------------FP------------
          WN    DW L+  RPLR  EE LW N+KASLP  L                                                 FP            
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYL-------------------------------------------------FP------------

Query:  -CINTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVL-----------------------ERPLLTLNPA-----NIWNERNR
         CINT D LQ+RLP W L+P+WC +C  ++ED  HLF  CP+S +LW   + +L                       ++ L+T N        IW ERN 
Subjt:  -CINTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVL-----------------------ERPLLTLNPA-----NIWNERNR

Query:  RIFKGEEKTVDYVWEDT
        RIFK +EK    +WEDT
Subjt:  RIFKGEEKTVDYVWEDT

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein4.2e-21638.32Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        +WDDT +KV+D  V N+++S+ I     + N +WWLT+VYGP K  +R   W ELE ++S CLP WL+ GDFNIVRW+ ET AK++    +A F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LR  P  SRLDRFL +  WE  F  H S+ L R  SDHFPI LES  +KW   PFR  NS L++  F++N   WW ++ Q G 
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PGY+F+  L  LS  IK W  +     D  ++ LLKE++ IDKLE +  ++  H   R S+K+DL  +   +AQ+W Q+ ++ WN+ GDEN++++H+IC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------
        + QR++ I SI    G    +  DI +TFI HF  IYT +  +   I+NL   PI  +   +LCK F E EI +                          
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------

Query:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         LK ++L +F DFH+ GI+N  VN+TFIALI+KKE CS PSDYRPISLTT LYK++ K +A RLK+ LPD I+ENQ+AF++GRQ+NDAILI NEA+D WK
Subjt:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
        Q+K KGFV+KLDIEK FDKI+W+ IDYML KK FPHKW  WIK CIS+VQYSIL+NG PK +IK  RGIRQGDP+SPFIFVLAMDYLS LL+HLE +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
        KGVSFN   N++HLLFADD+L+F+ED++  ++N++ AL LFE  SGL  N +KSTISPINI   RT+ +AS +GF   FLP+ YLGVPLGG P SR FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------
        +  E                                                               +N  L    IC S +                  
Subjt:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------

Query:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                                        G +LSFWHS+WH   P +   PRL+ALS+ +  ++ 
Subjt:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------
         +W+    DW++ PRRPL   E+  WD++K SLP          P   P                                                   
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------

Query:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN
           +NTMD +Q+R P+ +LNPSWCI C+++ ED  HLF  CPF+  LW         P++  N  +                            IW  RN
Subjt:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN

Query:  RRIFKGEEKTVDYVWED
          IF  ++ +    WED
Subjt:  RRIFKGEEKTVDYVWED

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein4.5e-23441.39Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------
        MWDD ++ V DFI  NF+LSI I+ P   +N  WWL+A+YGPS   NR  FW EL ++K+ C P WL+ GDFN+VR+ SET+A+N               
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------

Query:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
           I  P       W +LR  P+LSR+DRFLYT  WE LF  H+SK L R TSDHFPI LES+ + W  SPF+  N  LKE  FK NI  WWKN  Q+GH
Subjt:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PG+SFM +L QLS  I++  +  K  +D+++   +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKICS
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNALKKNILEIFNDFHENGIINKIVNS
         RQRRS IS+I++A G  C+T++ I K F+DHF +IY    +   W I+NL  +PI       LC  F EEEI+ AL        +       ++  +N 
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNALKKNILEIFNDFHENGIINKIVNS

Query:  TFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLI
        T IALIAKKE C+ P+DYRPISLTT +YKLI KVIAERLK  LP  ++ENQ+AFV+ RQ+ DAIL+ NEA+D+W+ KK +GFV+KLDIEK FDK+NW  I
Subjt:  TFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLI

Query:  DYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFME
        D+ML KKG+P KW +WI+ CISSVQYSI+INGRP+ KI+P RGIRQGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+E
Subjt:  DYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFME

Query:  DDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQF---------------------------
        DD+ +I N++  + LF+L SGL+INLNKSTISPIN+D  RT  +AS WG S  FLPI YLGVPLGGK  ++ F                           
Subjt:  DDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQF---------------------------

Query:  -----------------------------------WSEPPEENQ--------------------------------------------------------
                                           W  PPE ++                                                        
Subjt:  -----------------------------------WSEPPEENQ--------------------------------------------------------

Query:  -----------------------------QLEICFSFQRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEAL
                                     Q  + +  + G + SFWH  WH+ SP +   PRL+ALS+ KE+SI +MWN   +DWDL PRR LR  E  L
Subjt:  -----------------------------QLEICFSFQRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEAL

Query:  WDNM-------------------------------------------------------KASLP--------PYLFPCINTMDMLQRRLPTWNLNPSWCI
        W  +                                                       K S+P          L+  +NT + L +RLP     PSWC+
Subjt:  WDNM-------------------------------------------------------KASLP--------PYLFPCINTMDMLQRRLPTWNLNPSWCI

Query:  LCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPA----------------------------NIWNERNRRIFKGEEKTVDYVWED
        +CK  +EDR HLF LCP +  +W+ +   L   +  L+P                             NIW ERN RIF G+EKTV  +WED
Subjt:  LCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPA----------------------------NIWNERNRRIFKGEEKTVDYVWED

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein9.5e-21638.23Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------
        +WDDT +KV+D  V N+++S+ I     + N +WWLT+VYGP K  +R   W ELE ++S CLP WL+ GDFNIVRW+ ET AK++    +A F      
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMF------

Query:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
                      W +LR  P  SRLDRFL +  WE  F  H S+ L R  SDHFPI LES  +KW   PFR  NS L++  F++N   WW ++ Q G 
Subjt:  ------------LVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PGY+F+  L  LS  IK W  +     D  ++ LLKE++ IDKLE +  ++  H   R S+K+DL  +   +AQ+W Q+ ++ WN+ GDEN++++H+IC+
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------
        + QR++ I SI    G    +  DI +TFI HF  IYT +  +   I+NL   PI  +   +LCK F E EI +                          
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA-------------------------

Query:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK
         LK ++L +F DFH+ GI+N  VN+TFIALI+KKE CS PSDYRPISLTT LYK++ K +A RLK+ LPD I+ENQ+AF++GRQ+NDAILI NE +D WK
Subjt:  -LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWK

Query:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI
        Q+K KGFV+KLDIEK FDKI+W+ IDYML KK FPHKW  WIK CIS+VQYSIL+NG PK +IK  RGIRQGDP+SPFIFVLAMDYLS LL+HLE +  I
Subjt:  QKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLI

Query:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS
        KGVSFN   N++HLLFADD+L+F+ED++  ++N++ AL LFE  SGL  N +KSTISPINI   RT+ +AS +GF   FLP+ YLGVPLGG P SR FW 
Subjt:  KGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWS

Query:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------
        +  E                                                               +N  L    IC S +                  
Subjt:  EPPE---------------------------------------------------------------ENQQL---EICFSFQR-----------------

Query:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT
                                                                        G +LSFWHS+WH   P +   PRL+ALS+ +  ++ 
Subjt:  ----------------------------------------------------------------GDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSIT

Query:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------
         +W+    DW++ PRRPL   E+  WD++K SLP          P   P                                                   
Subjt:  NMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLP----------PYLFPC--------------------------------------------------

Query:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN
           +NTMD +Q+R P+ +LNPSWCI C+++ ED  HLF  CPF+  LW         P++  N  +                            IW  RN
Subjt:  ---INTMDMLQRRLPTWNLNPSWCILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPAN----------------------------IWNERN

Query:  RRIFKGEEKTVDYVWED
          IF  ++ +    WED
Subjt:  RRIFKGEEKTVDYVWED

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein2.4e-21945.6Show/hide
Query:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------
        MWDD ++ V D I   F+LSI I++P   ++  WWL+A+YGPS   NR  FW EL ++K+ C P WL+ GDFN VR+ SET+ +N               
Subjt:  MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKN---------------

Query:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH
           I  P       W +LR QP+LSR+DRFLYT  WE LF  H+SK L R TSDHFPI LES+ + W  SPF+F N  LKE  FK+N+ +WWKN  Q GH
Subjt:  ---IVIPWIAMFLVWFHLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGH

Query:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS
        PG+SFM +L QLS  I+   K  K  ND+E++  +KE+++ID+LEAE N +E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKICS
Subjt:  PGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICS

Query:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA------------------------
         RQRRS IS+I++  G  C+T++ I K F+DHF +IY    +   W I+NL  +PI      +LC  F EEEI+ A                        
Subjt:  VRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYT-DKKRDLWFIENLPCTPIEEVAHDDLCKFFYEEEIYNA------------------------

Query:  --LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFW
          LK+ I  IF DFH N IINK VN T IALIAKKE C+ P+DYRP                         I++ENQ+AFV+GRQ+ DAIL+ NEA+D+W
Subjt:  --LKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFW

Query:  KQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENL
        + KK +GFV+KLDIEK FDK+NW  ID+ML KKG+P +W  WI+ CISSVQYSI+INGRP+ KI+P RGIRQGDPISPFIFVLAMDY+S LLN + ++  
Subjt:  KQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENL

Query:  IKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFW
        IKGV   G  NLTHLLFADDILLF+EDD+ +I N++  + LF+L SGL+INLNKSTISPIN+   RT  +AS WG S  FLPI YLGVPLGGK  ++ FW
Subjt:  IKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFW

Query:  SEPPEE-NQQLEICFSFQRGDTLSFWHSRWHELSPFTQ---SNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCIN
            E+ N++L    +  +   LS   +  H  SP+ +       +F   S+ ++SI +MWN   +DWDL PRR +R  E  LW  +K SL      C N
Subjt:  SEPPEE-NQQLEICFSFQRGDTLSFWHSRWHELSPFTQ---SNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCIN

Query:  TMDMLQRRLPTWNLNP----SWCILCKAAEEDRQHLFSLCPFSS--KLWKN---------------------VEVVLERPLLTLNPA-------------
          D      PTW LN     +   + KA ++  Q    L   ++   LWK                       ++    P L   P+             
Subjt:  TMDMLQRRLPTWNLNP----SWCILCKAAEEDRQHLFSLCPFSS--KLWKN---------------------VEVVLERPLLTLNPA-------------

Query:  --------NIWNERNRRIFKGEEKTVDYVWED
                NIW ERN RIF G+EKTV  +WED
Subjt:  --------NIWNERNRRIFKGEEKTVDYVWED

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-2423.06Show/hide
Query:  SKLLPRTTSDHFPITLE---SNSLKWCSSPFRFTNSFLKEV----SFKQNIELWWKNTAQDGHPGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQ-----
        ++++    SDH  I LE    N  +  S+ ++  N  L +       K  I+++++ T ++    Y  +W   +         K I ++  K +Q     
Subjt:  SKLLPRTTSDHFPITLE---SNSLKWCSSPFRFTNSFLKEV----SFKQNIELWWKNTAQDGHPGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQ-----

Query:  -TLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKC--KRLWNMEG-DENSAFYHKICSVRQRRSFISSISTAQGAHCTTDKDIEKT
         TL  +L+ ++K E  ++         T I+ +L ++   E Q   QK    R W  E  ++      ++   ++ ++ I +I   +G   T   +I+ T
Subjt:  -TLLKELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKC--KRLWNMEG-DENSAFYHKICSVRQRRSFISSISTAQGAHCTTDKDIEKT

Query:  FIDHFGEIYTDKKRDL----WFIE---------------NLPCTPIEEVA------------HDDLCKFFYEEEIYNALKKNILEIFNDFHENGII-NKI
          +++  +Y +K  +L     F++               N P T  E VA             D     FY +     L   +L++F    + GI+ N  
Subjt:  FIDHFGEIYTDKKRDL----WFIE---------------NLPCTPIEEVA------------HDDLCKFFYEEEIYNALKKNILEIFNDFHENGII-NKI

Query:  VNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFV-VKLDIEKGFDKIN
          ++ I +       +   ++RPISL     K+++K++A R++  +  +I  +Q+ F+ G Q    I      +    + K K  V + +D EK FDKI 
Subjt:  VNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFV-VKLDIEKGFDKIN

Query:  WTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDIL
           +   L+K G    +   I+        +I++NG+         G RQG P+SP +F +    L +L   + +E  IKG+   GK  +   LFADD++
Subjt:  WTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDIL

Query:  LFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL
        +++E+   +  N+   +  F   SG  IN+ KS     N + Q  + +     F+I    I+YLG+ L
Subjt:  LFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL

P08548 LINE-1 reverse transcriptase homolog6.0e-2623.71Show/hide
Query:  KLLPRTTSDHFPITLESN---SLKWCSSPFRFTNSFLKEVSF-----KQNIELWWKNTAQDGHPGYSFMWRLIQ--LSHTIKSWSKSIKVSNDKERQTLL
        +++P   SDH  I +E N   +L   +  ++  N  LK+        K+  +   +N  QD    Y  +W   +  L     +    +K +  +E   L+
Subjt:  KLLPRTTSDHFPITLESN---SLKWCSSPFRFTNSFLKEVSF-----KQNIELWWKNTAQDGHPGYSFMWRLIQ--LSHTIKSWSKSIKVSNDKERQTLL

Query:  KELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICSVRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGE
          L+ ++K E  +N         T I+ +LN++  K       K K  +  + ++       +   ++ +S ISSI        T   +I+K   +++ +
Subjt:  KELEHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICSVRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGE

Query:  IYTDKKRDLWFIE------NLPCTPIEEV-------AHDDLCKFF-------------YEEEIYNALKKN----ILEIFNDFHENGIINKIVNSTFIALI
        +Y+ K  +L  I+      +LP    +EV       +  ++                 +  E Y   K+     +L +F +  + GI+        I LI
Subjt:  IYTDKKRDLWFIE------NLPCTPIEEV-------AHDDLCKFF-------------YEEEIYNALKKN----ILEIFNDFHENGIINKIVNSTFIALI

Query:  AKK-ETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKG-FVVKLDIEKGFDKINWTLIDYML
         K  +  +   +YRPISL     K+++K++  R++  +  II  +Q+ F+ G Q    I      +    + K K   ++ +D EK FD I    +   L
Subjt:  AKK-ETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKG-FVVKLDIEKGFDKINWTLIDYML

Query:  HKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMR-GIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFMEDDD
         K G    +   I+   S    +I++NG  K K  P+R G RQG P+SP +F + M+ L+I    + +E  IKG+   G   +   LFADD+++++E+  
Subjt:  HKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMR-GIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFMEDDD

Query:  DTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL
        D+   +   +K +   SG  IN +KS       + Q    V  +  F++    ++YLGV L
Subjt:  DTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL

P11369 LINE-1 retrotransposable element ORF2 protein4.3e-2423.68Show/hide
Query:  KLLPRTTSDHFPITLESNSLKWCSSP---FRFTNSFLKEVSFKQNIELWWKNTA---QDGHPGYSFMWRLIQ--LSHTIKSWSKSIKVSNDKERQTLLKE
        +++P   SDH  + L  N+      P   ++  N+ L +   K+ I+   K+     ++    Y  +W  ++  L   + + S S K        +L   
Subjt:  KLLPRTTSDHFPITLESNSLKWCSSP---FRFTNSFLKEVSFKQNIELWWKNTA---QDGHPGYSFMWRLIQ--LSHTIKSWSKSIKVSNDKERQTLLKE

Query:  LEHIDKLEAENNITEMHISCRTSIKTDLNQMAIK-EAQVWAQKCKRLWNMEGDENSAFYHKICSV----------RQRRSFISSISTAQGAHCTTDKDIE
        L+ ++K EA             S K    Q  IK   ++   + +R         S F+ KI  +           + +  I+ I   +G   T  ++I+
Subjt:  LEHIDKLEAENNITEMHISCRTSIKTDLNQMAIK-EAQVWAQKCKRLWNMEGDENSAFYHKICSV----------RQRRSFISSISTAQGAHCTTDKDIE

Query:  KTFIDHFGEIYTDKKRDL----WFIE---------------NLPCTP--IEEVAHDDLCKFF-----YEEEIYNALKKNILEIFND-FHENGIINKIVNS
         T    +  +Y+ K  +L     F++               N P +P  IE V +    K       +  E Y   K++++ I +  FH+  +   + NS
Subjt:  KTFIDHFGEIYTDKKRDL----WFIE---------------NLPCTP--IEEVAHDDLCKFF-----YEEEIYNALKKNILEIFND-FHENGIINKIVNS

Query:  TFIALIA-----KKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKG-FVVKLDIEKGFDK
         + A I      +K+   +  ++RPISL     K+++K++A R++  +  II  +Q+ F+ G Q    I      + +  + K K   ++ LD EK FDK
Subjt:  TFIALIA-----KKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKG-FVVKLDIEKGFDK

Query:  INWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADD
        I    +  +L + G    + N IK   S    +I +NG     I    G RQG P+SP++F +    L +L   + ++  IKG+   GK  +   L ADD
Subjt:  INWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADD

Query:  ILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL
        +++++ D  ++   +   +  F    G  IN NKS       + Q    +  T  FSI    I+YLGV L
Subjt:  ILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPL

P14381 Transposon TX1 uncharacterized 149 kDa protein9.6e-1624.26Show/hide
Query:  YNALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDF
        ++ L  +   +  +  + G +        ++L+ KK    +  ++RP+SL +  YK++ K I+ RLK+VL ++I  +Q   V GR + D + +  + + F
Subjt:  YNALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDF

Query:  WKQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKEN
         ++       + LD EK FD+++   +   L    F  ++  ++KT  +S +  + IN      +   RG+RQG P+S  ++ LA++    LL    ++ 
Subjt:  WKQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKEN

Query:  LIKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKST---ISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGK--P
        L   V       +    +ADD++L  +D  D ++  +   +++   S   IN +KS+      + +D         +W   I    I+YLGV L  +  P
Subjt:  LIKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKST---ISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGK--P

Query:  NSRQF
         S+ F
Subjt:  NSRQF

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.6e-1830.36Show/hide
Query:  LIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLIDYML
        LI K      PS++RPI++ + L +L+ +++A+RL+  +    ++   A + G  VN ++L++       +Q+KT   VV LD+ K FD ++ + I   L
Subjt:  LIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLIDYML

Query:  HKKGFPHKWHNWIKTCISSVQYSILIN-GRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFMEDDD
         + G      N+I   +S    +I +  G   RKI   RG++QGDP+SPF+F   +D    LL  L+    I G    G+  +  L FADD+LL +ED+D
Subjt:  HKKGFPHKWHNWIKTCISSVQYSILIN-GRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSFNGKHNLTHLLFADDILLFMEDDD

Query:  DTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGF
          +      +  F    G+++N  KS    I++      C+  T  F
Subjt:  DTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGF

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.9e-0620.49Show/hide
Query:  IPWIAMFLVWF-HLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFP-ITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGHPGY
        IP   +   W  H    PI+ +LDR +   +W + F    +       SDH P I +  N  K     FR+ +      +F  ++ + W+     G    
Subjt:  IPWIAMFLVWF-HLRAQPILSRLDRFLYTPEWETLFEPHFSKLLPRTTSDHFP-ITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGHPGY

Query:  SFMWRLIQLSHTIKSWSKSIKVSNDKERQTLL-KELEHIDKLEA-----ENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHK
             +  L   +K+  K  K+ N +    +  K  E +D LE+       N ++         +   N  A      + QK +  W  +GD N+ F+HK
Subjt:  SFMWRLIQLSHTIKSWSKSIKVSNDKERQTLL-KELEHIDKLEA-----ENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHK

Query:  ICSVRQRRSFISSISTAQGAHCTTDKDIEKTFIDHF-------GEIYT-DKKRDLWFIENLPCT-----------------------PIEEVAHDD--LC
        +    Q ++ I  +             +++  + ++        +I T D  + +  I    C                        P  +    D    
Subjt:  ICSVRQRRSFISSISTAQGAHCTTDKDIEKTFIDHF-------GEIYT-DKKRDLWFIENLPCT-----------------------PIEEVAHDD--LC

Query:  KFFYEEEIYNALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLI
        +FF+E   +  +K + +    +F   G + K  N+T I LI K       S +RP+S  T +YK+I
Subjt:  KFFYEEEIYNALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-0725Show/hide
Query:  TNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPC-------INTMDMLQRRLPTWNLN-PSWCILCKAAEEDRQHLFSLCPFSSKLWK--NVE
        +N ++A +    L+P+       +A+W   K  +P + F C       ++T D    RL  W L+ P+ C+LC A ++ R HLF  C FS  +W+     
Subjt:  TNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPC-------INTMDMLQRRLPTWNLN-PSWCILCKAAEEDRQHLFSLCPFSSKLWK--NVE

Query:  VVLERPLLTLNPAN-------------------------IWNERNRRIFKGEEKTVDYVWEDTQ
          L  P   ++  N                         IW ERN+R+  G  ++ + + +D Q
Subjt:  VVLERPLLTLNPAN-------------------------IWNERNRRIFKGEEKTVDYVWEDTQ

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.0e-0838.27Show/hide
Query:  IAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKK-TKGF-VVKLDIEKGFDKINWTLIDYMLHKKGFPHKW
        + ERLK ++ ++I   Q +F+ GR   D I+   EAV   ++KK  KG+ ++KLD+EK +D+I W  ++  L   GFP  W
Subjt:  IAERLKTVLPDIISENQLAFVRGRQVNDAILIENEAVDFWKQKK-TKGF-VVKLDIEKGFDKINWTLIDYMLHKKGFPHKW

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.9e-0536.05Show/hide
Query:  EKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCINTMDMLQRRLPT------WNLN-PSWCILCKAAEEDRQHLFSLCPFSSKL
        EKVDW            +A+W   K  +P + F  I+ ++ ++ RLPT      W L+ PS C+LC A +E RQHLF  C F+ ++
Subjt:  EKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCINTMDMLQRRLPT------WNLN-PSWCILCKAAEEDRQHLFSLCPFSSKL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.1e-1041.79Show/hide
Query:  LINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSF-NGKHNLTHLLFADD
        +ING P+  + P RG+RQGDP+SP++F+L  + LS L    +++  + G+   N    + HLLFADD
Subjt:  LINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHLEKENLIKGVSF-NGKHNLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGATGACACTCAATATAAAGTGGATGATTTTATTGTGAAAAACTTCACTCTTTCAATTAAAATATCTTATCCCAAAGATTCGAACAATCTTCATTGGTGGCTAAC
TGCTGTCTATGGGCCATCTAAACGAGAAAATAGAGGAGATTTCTGGATGGAGCTCGAAGAGATTAAATCAAATTGCCTTCCAAGATGGCTTATGGGTGGGGACTTTAATA
TTGTTAGATGGCAATCAGAAACTACAGCGAAAAACATTGTTATCCCCTGGATAGCAATGTTTTTGGTATGGTTTCATCTTAGAGCTCAACCTATTTTATCAAGACTAGAC
AGATTTCTTTATACTCCAGAATGGGAAACTCTTTTCGAACCTCACTTCTCCAAACTGCTTCCTCGAACAACCTCAGACCACTTTCCCATTACACTTGAGTCAAACAGTCT
AAAGTGGTGCTCCTCTCCGTTCAGATTCACAAACTCTTTTCTCAAAGAAGTTTCTTTCAAGCAAAACATTGAGCTTTGGTGGAAAAACACTGCCCAAGACGGACACCCGG
GATACTCCTTTATGTGGAGACTCATACAACTCTCTCATACAATCAAAAGTTGGAGTAAAAGCATAAAAGTTTCCAATGATAAAGAAAGACAGACACTATTGAAAGAACTC
GAGCACATTGATAAACTGGAAGCTGAAAATAATATCACTGAGATGCACATTTCTTGTAGAACATCCATCAAAACTGATTTGAATCAAATGGCCATCAAAGAAGCACAAGT
GTGGGCCCAAAAATGCAAACGGTTATGGAACATGGAAGGTGATGAAAACTCTGCTTTTTATCATAAAATTTGTTCAGTTAGGCAAAGGAGAAGCTTCATATCAAGTATTT
CCACTGCGCAAGGAGCTCACTGTACAACTGATAAGGATATTGAGAAAACATTCATTGATCATTTTGGGGAGATTTATACTGATAAGAAAAGAGATTTATGGTTCATTGAA
AACCTTCCCTGCACTCCTATAGAAGAGGTTGCTCATGATGATCTTTGCAAATTCTTTTATGAGGAGGAGATTTACAACGCTCTTAAAAAAAATATCTTGGAAATCTTTAA
TGACTTCCATGAAAATGGCATCATCAATAAAATTGTGAATTCTACCTTTATTGCTCTTATTGCCAAAAAGGAGACCTGCTCAGTCCCTTCGGACTATAGACCTATAAGTC
TTACAACTGGTCTTTACAAGCTCATAGATAAAGTAATTGCTGAAAGACTTAAAACAGTTCTGCCTGATATAATCTCAGAGAATCAATTAGCTTTCGTCAGAGGGAGGCAG
GTTAATGATGCCATTTTGATTGAAAATGAAGCGGTGGACTTCTGGAAACAGAAAAAAACCAAAGGCTTTGTGGTCAAGCTTGACATTGAAAAAGGTTTTGATAAAATAAA
TTGGACACTCATTGATTATATGCTTCATAAGAAAGGCTTCCCCCACAAATGGCATAATTGGATTAAAACATGTATTTCAAGTGTTCAATACTCCATCCTTATTAATGGCA
GACCCAAACGTAAAATCAAACCCATGAGGGGTATTCGACAAGGAGATCCTATCTCTCCTTTTATATTTGTTCTCGCCATGGATTATCTCAGTATACTTCTCAATCACTTG
GAGAAAGAAAACTTGATAAAAGGTGTAAGTTTCAACGGGAAACACAACCTCACTCACCTTCTTTTTGCTGACGATATCCTACTCTTTATGGAGGATGATGACGACACCAT
TGATAACATGAGATATGCCCTTAAGCTTTTTGAATTGCCCTCAGGTCTCAACATCAACCTCAATAAATCTACGATTTCACCTATCAACATCGATACGCAGAGAACAAATT
GTGTGGCGTCAACATGGGGATTCTCTATAAACTTTCTTCCCATTCAATACTTGGGAGTGCCTTTGGGAGGTAAACCGAATTCTAGACAATTCTGGTCTGAACCTCCAGAA
GAAAATCAACAATTGGAAATATGCTTCTCTTTCCAGAGGGGTGACACCCTCTCTTTCTGGCACAGCCGTTGGCATGAACTTAGTCCGTTCACACAGTCCAACCCGAGATT
ATTTGCTCTTTCTTCTAGAAAAGAAAACTCCATTACAAACATGTGGAATGCAGAAAAAGTTGATTGGGACCTTTACCCTCGCAGACCATTAAGAAGTGTCGAGGAAGCTC
TTTGGGACAATATGAAAGCCTCCCTCCCCCCCTACCTGTTTCCGTGTATAAATACCATGGATATGCTACAAAGAAGACTCCCAACTTGGAATTTAAATCCCTCTTGGTGT
ATTCTTTGCAAAGCTGCTGAGGAAGACAGACAACACTTGTTCTCCCTCTGCCCCTTCTCATCTAAACTCTGGAAAAATGTTGAAGTTGTATTGGAGAGACCTCTTCTCAC
TTTAAATCCCGCTAATATCTGGAATGAAAGAAACCGAAGAATTTTTAAAGGGGAAGAAAAAACAGTTGATTATGTGTGGGAAGACACTCAACCCTTTAAGCTTTTTTCTG
CTTCTTGTCTTTATCTCATATATATTAATGAAGCTGGTTTGATGTGGTTGCATTGGATTGTCTCCCCTGTGGAGATATCCTCTTGCATCTGCCATCCCAAAAAGAGACAA
GAATACAACGGCATATGGATACGTAAGACAAAGAACAAGAGCAAAAATGGCCTAACTGAACTCCAAAAAGGCCTAGCTCAGTTATCAAAAGAGAACCACAAGTTCGATAT
GTTTCTTCATCCGACTCGAAAACCCCAAAAAGATCATGTGCAAAAGTCTTTTCAGCTAACAGTGGTGATGAGAAGAGGAAAATCAAATAAGATTAATTCATATGAAAGTT
CGAGCAAGAGGAGTTTAACAACAGAATATCGGAAGAGTACACAACTGGGCAAGACTCTTGAAACTGAGAAAACGGTGGTTATCACCAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGATGACACTCAATATAAAGTGGATGATTTTATTGTGAAAAACTTCACTCTTTCAATTAAAATATCTTATCCCAAAGATTCGAACAATCTTCATTGGTGGCTAAC
TGCTGTCTATGGGCCATCTAAACGAGAAAATAGAGGAGATTTCTGGATGGAGCTCGAAGAGATTAAATCAAATTGCCTTCCAAGATGGCTTATGGGTGGGGACTTTAATA
TTGTTAGATGGCAATCAGAAACTACAGCGAAAAACATTGTTATCCCCTGGATAGCAATGTTTTTGGTATGGTTTCATCTTAGAGCTCAACCTATTTTATCAAGACTAGAC
AGATTTCTTTATACTCCAGAATGGGAAACTCTTTTCGAACCTCACTTCTCCAAACTGCTTCCTCGAACAACCTCAGACCACTTTCCCATTACACTTGAGTCAAACAGTCT
AAAGTGGTGCTCCTCTCCGTTCAGATTCACAAACTCTTTTCTCAAAGAAGTTTCTTTCAAGCAAAACATTGAGCTTTGGTGGAAAAACACTGCCCAAGACGGACACCCGG
GATACTCCTTTATGTGGAGACTCATACAACTCTCTCATACAATCAAAAGTTGGAGTAAAAGCATAAAAGTTTCCAATGATAAAGAAAGACAGACACTATTGAAAGAACTC
GAGCACATTGATAAACTGGAAGCTGAAAATAATATCACTGAGATGCACATTTCTTGTAGAACATCCATCAAAACTGATTTGAATCAAATGGCCATCAAAGAAGCACAAGT
GTGGGCCCAAAAATGCAAACGGTTATGGAACATGGAAGGTGATGAAAACTCTGCTTTTTATCATAAAATTTGTTCAGTTAGGCAAAGGAGAAGCTTCATATCAAGTATTT
CCACTGCGCAAGGAGCTCACTGTACAACTGATAAGGATATTGAGAAAACATTCATTGATCATTTTGGGGAGATTTATACTGATAAGAAAAGAGATTTATGGTTCATTGAA
AACCTTCCCTGCACTCCTATAGAAGAGGTTGCTCATGATGATCTTTGCAAATTCTTTTATGAGGAGGAGATTTACAACGCTCTTAAAAAAAATATCTTGGAAATCTTTAA
TGACTTCCATGAAAATGGCATCATCAATAAAATTGTGAATTCTACCTTTATTGCTCTTATTGCCAAAAAGGAGACCTGCTCAGTCCCTTCGGACTATAGACCTATAAGTC
TTACAACTGGTCTTTACAAGCTCATAGATAAAGTAATTGCTGAAAGACTTAAAACAGTTCTGCCTGATATAATCTCAGAGAATCAATTAGCTTTCGTCAGAGGGAGGCAG
GTTAATGATGCCATTTTGATTGAAAATGAAGCGGTGGACTTCTGGAAACAGAAAAAAACCAAAGGCTTTGTGGTCAAGCTTGACATTGAAAAAGGTTTTGATAAAATAAA
TTGGACACTCATTGATTATATGCTTCATAAGAAAGGCTTCCCCCACAAATGGCATAATTGGATTAAAACATGTATTTCAAGTGTTCAATACTCCATCCTTATTAATGGCA
GACCCAAACGTAAAATCAAACCCATGAGGGGTATTCGACAAGGAGATCCTATCTCTCCTTTTATATTTGTTCTCGCCATGGATTATCTCAGTATACTTCTCAATCACTTG
GAGAAAGAAAACTTGATAAAAGGTGTAAGTTTCAACGGGAAACACAACCTCACTCACCTTCTTTTTGCTGACGATATCCTACTCTTTATGGAGGATGATGACGACACCAT
TGATAACATGAGATATGCCCTTAAGCTTTTTGAATTGCCCTCAGGTCTCAACATCAACCTCAATAAATCTACGATTTCACCTATCAACATCGATACGCAGAGAACAAATT
GTGTGGCGTCAACATGGGGATTCTCTATAAACTTTCTTCCCATTCAATACTTGGGAGTGCCTTTGGGAGGTAAACCGAATTCTAGACAATTCTGGTCTGAACCTCCAGAA
GAAAATCAACAATTGGAAATATGCTTCTCTTTCCAGAGGGGTGACACCCTCTCTTTCTGGCACAGCCGTTGGCATGAACTTAGTCCGTTCACACAGTCCAACCCGAGATT
ATTTGCTCTTTCTTCTAGAAAAGAAAACTCCATTACAAACATGTGGAATGCAGAAAAAGTTGATTGGGACCTTTACCCTCGCAGACCATTAAGAAGTGTCGAGGAAGCTC
TTTGGGACAATATGAAAGCCTCCCTCCCCCCCTACCTGTTTCCGTGTATAAATACCATGGATATGCTACAAAGAAGACTCCCAACTTGGAATTTAAATCCCTCTTGGTGT
ATTCTTTGCAAAGCTGCTGAGGAAGACAGACAACACTTGTTCTCCCTCTGCCCCTTCTCATCTAAACTCTGGAAAAATGTTGAAGTTGTATTGGAGAGACCTCTTCTCAC
TTTAAATCCCGCTAATATCTGGAATGAAAGAAACCGAAGAATTTTTAAAGGGGAAGAAAAAACAGTTGATTATGTGTGGGAAGACACTCAACCCTTTAAGCTTTTTTCTG
CTTCTTGTCTTTATCTCATATATATTAATGAAGCTGGTTTGATGTGGTTGCATTGGATTGTCTCCCCTGTGGAGATATCCTCTTGCATCTGCCATCCCAAAAAGAGACAA
GAATACAACGGCATATGGATACGTAAGACAAAGAACAAGAGCAAAAATGGCCTAACTGAACTCCAAAAAGGCCTAGCTCAGTTATCAAAAGAGAACCACAAGTTCGATAT
GTTTCTTCATCCGACTCGAAAACCCCAAAAAGATCATGTGCAAAAGTCTTTTCAGCTAACAGTGGTGATGAGAAGAGGAAAATCAAATAAGATTAATTCATATGAAAGTT
CGAGCAAGAGGAGTTTAACAACAGAATATCGGAAGAGTACACAACTGGGCAAGACTCTTGAAACTGAGAAAACGGTGGTTATCACCAGATGA
Protein sequenceShow/hide protein sequence
MWDDTQYKVDDFIVKNFTLSIKISYPKDSNNLHWWLTAVYGPSKRENRGDFWMELEEIKSNCLPRWLMGGDFNIVRWQSETTAKNIVIPWIAMFLVWFHLRAQPILSRLD
RFLYTPEWETLFEPHFSKLLPRTTSDHFPITLESNSLKWCSSPFRFTNSFLKEVSFKQNIELWWKNTAQDGHPGYSFMWRLIQLSHTIKSWSKSIKVSNDKERQTLLKEL
EHIDKLEAENNITEMHISCRTSIKTDLNQMAIKEAQVWAQKCKRLWNMEGDENSAFYHKICSVRQRRSFISSISTAQGAHCTTDKDIEKTFIDHFGEIYTDKKRDLWFIE
NLPCTPIEEVAHDDLCKFFYEEEIYNALKKNILEIFNDFHENGIINKIVNSTFIALIAKKETCSVPSDYRPISLTTGLYKLIDKVIAERLKTVLPDIISENQLAFVRGRQ
VNDAILIENEAVDFWKQKKTKGFVVKLDIEKGFDKINWTLIDYMLHKKGFPHKWHNWIKTCISSVQYSILINGRPKRKIKPMRGIRQGDPISPFIFVLAMDYLSILLNHL
EKENLIKGVSFNGKHNLTHLLFADDILLFMEDDDDTIDNMRYALKLFELPSGLNINLNKSTISPINIDTQRTNCVASTWGFSINFLPIQYLGVPLGGKPNSRQFWSEPPE
ENQQLEICFSFQRGDTLSFWHSRWHELSPFTQSNPRLFALSSRKENSITNMWNAEKVDWDLYPRRPLRSVEEALWDNMKASLPPYLFPCINTMDMLQRRLPTWNLNPSWC
ILCKAAEEDRQHLFSLCPFSSKLWKNVEVVLERPLLTLNPANIWNERNRRIFKGEEKTVDYVWEDTQPFKLFSASCLYLIYINEAGLMWLHWIVSPVEISSCICHPKKRQ
EYNGIWIRKTKNKSKNGLTELQKGLAQLSKENHKFDMFLHPTRKPQKDHVQKSFQLTVVMRRGKSNKINSYESSSKRSLTTEYRKSTQLGKTLETEKTVVITR