CuGenDBv2

Lcy04g006500 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy04g006500
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: Chr04:36399730..36405676
RNA-Seq Expression: Lcy04g006500
Synteny: Lcy04g006500
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value 2.0e-142 | % identity 40.06
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CTAR ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + NLDW PI+   +  L  PF EAE++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDG+ + F +K W+ +K +I  +F DF    I N+VVN T I LI KK      +DFRPISLTTA+YK++AK LA+RLK TL DTIS +Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID VL+ K Y   WR  I +CISSV YSI++NG+PRG I+         PLSPF
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLSRL+     K  ++GV    ++++TH+LFADDIL+FV+D D  + N+  I+  FE ASGL INL KST+  IN+   R   I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP +YLG+PLGG P ++ FW+ +++KIQ+++ NW++  LSKGGR+ LI S L+   + ++S       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I + N+  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
           W+V +G+ I FWL NW+   PL     RL+ LS+NK  +V+E W+ S   W+    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  ++ AP  P + H
Subjt:  TKYARSILSVAPSRPFSSH

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value 1.6e-142 | % identity 40.47
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CTAR ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + NLDW PI+      L  PF E+E++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDGFT+ F +K W+ +K +I  +F DF  N   N+VVN T I LI KK+    +SDFRPISLTTA+YK++AKVLA+RLK TL  TIS  Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID +L+ K Y   WR+ I +CISSV YSI++NG+PRG I+         PLSPF
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLS L+    +KG ++GV  G ++++TH+LFADDIL+FV+D +  + N+  I+  FE ASGL INL KST+  IN+   R + I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP  YLG+PLGG P ++ FW+ +++KIQ+++ +W++  LSKGGR+ LI S L+   + +LS       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I + ++  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
          GW+V +G+ I FWL NW+   PL  V  RL+ LS+NK  +V++ W+ S + WN    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  LS A + P + H
Subjt:  TKYARSILSVAPSRPFSSH

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value 8.6e-141 | % identity 40.06
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CT R ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + N DW PI+      L  PF E+E++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDGFT+ F +K W+ +K +I  +F DF  N   N+VVN T I LI KKN    +SDF+PISLTTA+YK++AKVLA+RLK TL DTIS  Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID +L+ K Y   WR+ I +CISSV YSI++NG+PRG I+         PLS F
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLS L+    +KG ++GV  G ++++TH+LFADDIL+FV+D +  + N+  I+  FE ASGL INL KST+  IN+   R + I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP  YLG+PLGG P ++ FW+ +++KIQ+++ +W++  LSKGGR+ LI S L+   + +LS       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I   ++  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
          GW+V +G+ I FWL NW+   PL     RL+ LS+NK  +V++ W+ S + WN    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  LS A + P + H
Subjt:  TKYARSILSVAPSRPFSSH

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value 9.2e-143 | % identity 41.48
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK
        R KKLWL +GDENS+FFH++CTAR +RN I E+  EEG+   S++ + +  IK F  I+    + + + + NL+W PI      +L +PF E E+   I 
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK

Query:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL
        S+   K  GPDGF I FFK +W +LK  IM +F DF+   + N+ +N+T IALIPKK    N  DFRPISLTT++YKI+AK L+ RLKT+L +TIS NQL
Subjt:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL

Query:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP
        AFV+ RQI+D IL+ANEAVD W+V K KG ++KLD+EKAFD ++WDFID VL  K +P  WR WI+ CIS+V+YSII+NG+P+G I+A        PLSP
Subjt:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP

Query:  FLFVLAMDYLSRLIEAAEKKGLLSGV-VMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS
        FLFV+AMDYLSRL+   E  G + GV    + +I+H+LFADDILLF++D+D  + N+   +  FE+ASGL+INL KS +  +N+++ R  +   FWG  S
Subjt:  FLFVLAMDYLSRLIEAAEKKGLLSGV-VMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS

Query:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTSIKSKVLTKLF----SLNGLGVFSLRI
          LP++YLG+PLGG PK+  FW  + EKIQ+++ NW++  +SKGGRL LI+S L      +LS+ Q   +   +I+      L+    S  G  + +   
Subjt:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTSIKSKVLTKLF----SLNGLGVFSLRI

Query:  TLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEIW----GGWE
                 L IS  +     L T +L   L E       ++      + K +G   G +         +          A  R       W      W+
Subjt:  TLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEIW----GGWE

Query:  VRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFSTKYAR
        + NG  I FW  NWS+ G L     RL+ LS +K++TV++AW+  D  W    RR L DRE  +W ++  +LP P    GS    W+    + FS   A+
Subjt:  VRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFSTKYAR

Query:  SILS
         ++S
Subjt:  SILS

TYK05690.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value 9.2e-143 | % identity 40.75
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK
        R KK+WL +GDENSAFFH++C++R +RN IHE+  EEG    ++N +    + HF  +Y ++ + + +   NL+W PI+    S L +PF E E+   I 
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK

Query:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL
        S   NKA GPDGF I FFK +W++LK  I+ +F DFF   + N+ + +T IALI KK    +  DFRPISLTT++YKI+AK L+ RLK TL DTIS NQL
Subjt:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL

Query:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP
        AF++ RQI+D IL+ANEA+D W+V K KG ++KLD+EKAFD +SWDF D VL  K YP +WR WI  CIS+V+YSII+NGKP+G I+A        PLSP
Subjt:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP

Query:  FLFVLAMDYLSRLIEAAEKKGLLSGVVM-GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS
        FLFV+AMDYLSRL+   E  G + GV +  D +I+H+LFADDILLFV+D+D  + N+   I  FEKASGL+INL KS +  +N++  R  +    W    
Subjt:  FLFVLAMDYLSRLIEAAEKKGLLSGVVM-GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS

Query:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQ-------KLKVVWTS--------------IKSKV
        H LP+ YLG+PLGG PK+  FW  + ++IQ+++ NW++  +SKGGRL LI+S L    + +LS+ Q        ++ +W +              I   +
Subjt:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQ-------KLKVVWTS--------------IKSKV

Query:  LTKLFSLNGLGVFSLRITLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGR
        +TK     GLG+  L+IT     + AL +S +  R +    N L   L+                  K +G+  G +       + +          A  
Subjt:  LTKLFSLNGLGVFSLRITLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGR

Query:  RKKKGEEIW----GGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDV
        R       W     GW++ NG  I FW  NWS  G L     RL+ LS +K  ++++ W+ ++  W  + RR L DRE+  W  +   LP+P + RG   
Subjt:  RKKKGEEIW----GGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDV

Query:  QRWLASEDDFFSTKYARSILSVAP
          W++     FS   A+S +S  P
Subjt:  QRWLASEDDFFSTKYARSILSVAP

TrEMBL top hits | e value | % identity | Alignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein | e value 9.9e-143 | % identity 40.06
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CTAR ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + NLDW PI+   +  L  PF EAE++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDG+ + F +K W+ +K +I  +F DF    I N+VVN T I LI KK      +DFRPISLTTA+YK++AK LA+RLK TL DTIS +Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID VL+ K Y   WR  I +CISSV YSI++NG+PRG I+         PLSPF
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLSRL+     K  ++GV    ++++TH+LFADDIL+FV+D D  + N+  I+  FE ASGL INL KST+  IN+   R   I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP +YLG+PLGG P ++ FW+ +++KIQ+++ NW++  LSKGGR+ LI S L+   + ++S       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I + N+  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
           W+V +G+ I FWL NW+   PL     RL+ LS+NK  +V+E W+ S   W+    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  ++ AP  P + H
Subjt:  TKYARSILSVAPSRPFSSH

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein | e value 7.6e-143 | % identity 40.47
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CTAR ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + NLDW PI+      L  PF E+E++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDGFT+ F +K W+ +K +I  +F DF  N   N+VVN T I LI KK+    +SDFRPISLTTA+YK++AKVLA+RLK TL  TIS  Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID +L+ K Y   WR+ I +CISSV YSI++NG+PRG I+         PLSPF
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLS L+    +KG ++GV  G ++++TH+LFADDIL+FV+D +  + N+  I+  FE ASGL INL KST+  IN+   R + I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP  YLG+PLGG P ++ FW+ +++KIQ+++ +W++  LSKGGR+ LI S L+   + +LS       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I + ++  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
          GW+V +G+ I FWL NW+   PL  V  RL+ LS+NK  +V++ W+ S + WN    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  LS A + P + H
Subjt:  TKYARSILSVAPSRPFSSH

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein | e value 4.2e-141 | % identity 40.06
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS
        +CK++W+++GDENS+FFHK+CT R ++  I ++I+  G + ++D+ +    I+HF  IY  N   +  + N DW PI+      L  PF E+E++  +KS
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA
          +NKA GPDGFT+ F +K W+ +K +I  +F DF  N   N+VVN T I LI KKN    +SDF+PISLTTA+YK++AKVLA+RLK TL DTIS  Q+A
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLA

Query:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF
        FV+ RQI++ IL+ANEA+D WR  K++G +IKLD+EKAFDK++W FID +L+ K Y   WR+ I +CISSV YSI++NG+PRG I+         PLS F
Subjt:  FVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSPF

Query:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH
        +FVLAMDYLS L+    +KG ++GV  G ++++TH+LFADDIL+FV+D +  + N+  I+  FE ASGL INL KST+  IN+   R + I   WG    
Subjt:  LFVLAMDYLSRLIEAAEKKGLLSGVVMG-DISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSH

Query:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL
         LP  YLG+PLGG P ++ FW+ +++KIQ+++ +W++  LSKGGR+ LI S L+   + +LS       + QK++  W +         F  NG      
Subjt:  PLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLS-------LPQKLKVVWTSIKSKVLTKLFSLNGLGVFSL

Query:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI
         I+L   N +        L I   ++  F L   +L   L E+      ++     + K       G    K    N   K      C +   K      
Subjt:  RITLFGGNSL-------ALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEI

Query:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS
          GW+V +G+ I FWL NW+   PL     RL+ LS+NK  +V++ W+ S + WN    RPL D E   W+ +   LP P   RG     W  + ++ F 
Subjt:  WGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFS

Query:  TKYARSILSVAPSRPFSSH
        T   +  LS A + P + H
Subjt:  TKYARSILSVAPSRPFSSH

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein | e value 4.4e-143 | % identity 41.48
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK
        R KKLWL +GDENS+FFH++CTAR +RN I E+  EEG+   S++ + +  IK F  I+    + + + + NL+W PI      +L +PF E E+   I 
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK

Query:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL
        S+   K  GPDGF I FFK +W +LK  IM +F DF+   + N+ +N+T IALIPKK    N  DFRPISLTT++YKI+AK L+ RLKT+L +TIS NQL
Subjt:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL

Query:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP
        AFV+ RQI+D IL+ANEAVD W+V K KG ++KLD+EKAFD ++WDFID VL  K +P  WR WI+ CIS+V+YSII+NG+P+G I+A        PLSP
Subjt:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP

Query:  FLFVLAMDYLSRLIEAAEKKGLLSGV-VMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS
        FLFV+AMDYLSRL+   E  G + GV    + +I+H+LFADDILLF++D+D  + N+   +  FE+ASGL+INL KS +  +N+++ R  +   FWG  S
Subjt:  FLFVLAMDYLSRLIEAAEKKGLLSGV-VMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS

Query:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTSIKSKVLTKLF----SLNGLGVFSLRI
          LP++YLG+PLGG PK+  FW  + EKIQ+++ NW++  +SKGGRL LI+S L      +LS+ Q   +   +I+      L+    S  G  + +   
Subjt:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTSIKSKVLTKLF----SLNGLGVFSLRI

Query:  TLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEIW----GGWE
                 L IS  +     L T +L   L E       ++      + K +G   G +         +          A  R       W      W+
Subjt:  TLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGRRKKKGEEIW----GGWE

Query:  VRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFSTKYAR
        + NG  I FW  NWS+ G L     RL+ LS +K++TV++AW+  D  W    RR L DRE  +W ++  +LP P    GS    W+    + FS   A+
Subjt:  VRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDVQRWLASEDDFFSTKYAR

Query:  SILS
         ++S
Subjt:  SILS

A0A5D3C2W8 LINE-1 retrotransposable element ORF2 protein | e value 4.4e-143 | % identity 40.75
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK
        R KK+WL +GDENSAFFH++C++R +RN IHE+  EEG    ++N +    + HF  +Y ++ + + +   NL+W PI+    S L +PF E E+   I 
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPE-WIVTNLDWAPINLDLASSLISPFREAEVFDCIK

Query:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL
        S   NKA GPDGF I FFK +W++LK  I+ +F DFF   + N+ + +T IALI KK    +  DFRPISLTT++YKI+AK L+ RLK TL DTIS NQL
Subjt:  SIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQL

Query:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP
        AF++ RQI+D IL+ANEA+D W+V K KG ++KLD+EKAFD +SWDF D VL  K YP +WR WI  CIS+V+YSII+NGKP+G I+A        PLSP
Subjt:  AFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAK------IPLSP

Query:  FLFVLAMDYLSRLIEAAEKKGLLSGVVM-GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS
        FLFV+AMDYLSRL+   E  G + GV +  D +I+H+LFADDILLFV+D+D  + N+   I  FEKASGL+INL KS +  +N++  R  +    W    
Subjt:  FLFVLAMDYLSRLIEAAEKKGLLSGVVM-GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGS

Query:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQ-------KLKVVWTS--------------IKSKV
        H LP+ YLG+PLGG PK+  FW  + ++IQ+++ NW++  +SKGGRL LI+S L    + +LS+ Q        ++ +W +              I   +
Subjt:  HPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQ-------KLKVVWTS--------------IKSKV

Query:  LTKLFSLNGLGVFSLRITLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGR
        +TK     GLG+  L+IT     + AL +S +  R +    N L   L+                  K +G+  G +       + +          A  
Subjt:  LTKLFSLNGLGVFSLRITLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNRKTGEKNGEGKGDGGGCCAAGR

Query:  RKKKGEEIW----GGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDV
        R       W     GW++ NG  I FW  NWS  G L     RL+ LS +K  ++++ W+ ++  W  + RR L DRE+  W  +   LP+P + RG   
Subjt:  RKKKGEEIW----GGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPDSFRGSDV

Query:  QRWLASEDDFFSTKYARSILSVAP
          W++     FS   A+S +S  P
Subjt:  QRWLASEDDFFSTKYARSILSVAP

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value 2.0e-36 | % identity 26.7
Query:  RHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIY----DANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKSIGQNKASGPDGFTIKFFKK
        +  +NQI  + +++G        ++T + +++  +Y    +   E +  +       +N +   SL  P   +E+   I S+   K+ GPDGFT +F+++
Subjt:  RHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIY----DANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKSIGQNKASGPDGFTIKFFKK

Query:  FWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNM-AGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLA-NEA
        +   L P ++ +F       I        +I LIPK         +FRPISL     KIL K+LA R++  +   I  +Q+ F+   Q    I  + N  
Subjt:  FWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNM-AGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLA-NEA

Query:  VDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKP------RGNIQAKIPLSPFLFVLAMDYLSRLIEAAE
          + R   K  V+I +D EKAFDKI   F+   L   G    +   I+A     + +IILNG+       +   +   PLSP LF + ++ L+R I   +
Subjt:  VDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKP------RGNIQAKIPLSPFLFVLAMDYLSRLIEAAE

Query:  KKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSHPLPIA-----YLGIPLGG
        +   + G+ +G   +   LFADD+++++++   + +N+  +I +F K SG +IN+ KS     N   Q  S I      G  P  IA     YLGI L  
Subjt:  KKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSHPLPIA-----YLGIPLGG

Query:  TPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGL
          K+   + ++P++++I+     W+ +  S  GR+ +++  +
Subjt:  TPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGL

P08548 LINE-1 reverse transcriptase homolog | e value 2.0e-31 | % identity 27.37
Query:  PFREAEVFDCIKSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNM-AGNISDFRPISLTTALYKILAKVLAERL
        P   +E+   I+++ + K+ GPDGFT +F++ F   L P ++++F +     I        NI LIPK         ++RPISL     KIL K+L  R+
Subjt:  PFREAEVFDCIKSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNM-AGNISDFRPISLTTALYKILAKVLAERL

Query:  KTTLGDTISLNQLAFVRKRQISDPILLA-NEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKP----
        +  +   I  +Q+ F+   Q    I  + N    + ++  K  +++ +D EKAFD I   F+   L   G   T+   I+A  S  + +IILNG      
Subjt:  KTTLGDTISLNQLAFVRKRQISDPILLA-NEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKP----

Query:  --RGNIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQ
          R   +   PLSP LF + M+ L+  I   E+K  + G+ +G   I   LFADD+++++++   +   +  +IK +   SG +IN HKS        +Q
Subjt:  --RGNIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQ

Query:  RTSDITRFWGCGSHPLPIAYLGIPLGGTPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGL
            +         P  + YLG+ L    K+   + +E + ++I   +  W+ +  S  GR+ +++  +
Subjt:  RTSDITRFWGCGSHPLPIAYLGIPLGGTPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGL

P11369 LINE-1 retrotransposable element ORF2 protein | e value 7.5e-31 | % identity 27.01
Query:  INLDLASSLISPFREAEVFDCIKSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPK-KNMAGNISDFRPISLTTALY
        +N D    L SP    E+   I S+   K+ GPDGF+ +F++ F   L P +  +FH                I LIPK +     I +FRPISL     
Subjt:  INLDLASSLISPFREAEVFDCIKSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPK-KNMAGNISDFRPISLTTALY

Query:  KILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVD-LWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYS
        KIL K+LA R++  +   I  +Q+ F+   Q    I  +   +  + ++  K  ++I LD EKAFDKI   F+  VL   G    + + IKA  S    +
Subjt:  KILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVD-LWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYS

Query:  IILNGKPRGNIQAK------IPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHK
        I +NG+    I  K       PLSP+LF + ++ L+R I   ++   + G+ +G   +   L ADD+++++ D   +   +  +I SF +  G +IN +K
Subjt:  IILNGKPRGNIQAK------IPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHK

Query:  STVSGINLTDQRTSDITRFWGCGSHPLPIAYLGIPLGGTPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTS
        S         Q   +I            I YLG+ L    K+   + ++ + ++I+  ++ W+ +  S  GR+ ++          K+++  K    + +
Subjt:  STVSGINLTDQRTSDITRFWGCGSHPLPIAYLGIPLGGTPKN--TQFWEPMIEKIQRRIQNWRFVSLSKGGRLALIQSGLQIWCVGKLSLPQKLKVVWTS

Query:  IKSKVLTKLFS
        I  K+ T+ F+
Subjt:  IKSKVLTKLFS

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value 3.1e-24 | % identity 25.11
Query:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANH-EPEWIVTNLDWAPINLDLASS-LISPFREAEVFDCI
        R +   L D D  S FF+ +   +  R QI  L +E+G  +     +       +  ++  +   P+      D  P+  +     L +P    E+   +
Subjt:  RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANH-EPEWIVTNLDWAPINLDLASS-LISPFREAEVFDCI

Query:  KSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQ
        + +  NK+ G DG TI+FF+ FW+ L P    V  + F             ++L+PKK     I ++RP+SL +  YKI+AK ++ RLK+ L + I  +Q
Subjt:  KSIGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQ

Query:  LAFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKA------CISSVSYSIILNGKPRGNIQAKIPLS
           V  R I D + L  + +   R +      + LD EKAFD++   ++   L    +   +  ++K       C+  +++S+         ++   PLS
Subjt:  LAFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKA------CISSVSYSIILNGKPRGNIQAKIPLS

Query:  PFLFVLAMDYLSRLIEAAEKKGLLSGVVM--GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDI-TRFWG
          L+ LA++    L+     +  L+G+V+   D+ +    +ADD++L  Q D   +E      + +  AS  +IN  KS  SG+     +   +   F  
Subjt:  PFLFVLAMDYLSRLIEAAEKKGLLSGVVM--GDISITHLLFADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDI-TRFWG

Query:  CGSHPLPIAYLGIPLGGT--PKNTQFWEPMIEKIQRRIQNWRFVS--LSKGGRLALIQS--GLQIW
               I YLG+ L     P +  F E + E +  R+  W+  +  LS  GR  +I      QIW
Subjt:  CGSHPLPIAYLGIPLGGT--PKNTQFWEPMIEKIQRRIQNWRFVS--LSKGGRLALIQS--GLQIW

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | e value 1.6e-12 | % identity 28.89
Query:  LIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVL
        LIPK     N S++RPI++ +AL ++L ++LA+RL+  +   +   Q  + R        LL +  +   R  +K   ++ LDV KAFD +S   I   L
Subjt:  LIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVDLWRVSKKKGVLIKLDVEKAFDKISWDFIDCVL

Query:  LNKGYPNTWRDWIKACISSVSYSIILNGKP-------RGNIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDK
           G      ++I   +S  + +I +           R  ++   PLSPFLF   +D L  L       G+  G  +G+  I  L FADD+LL ++D+D 
Subjt:  LNKGYPNTWRDWIKACISSVSYSIILNGKP-------RGNIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLLFADDILLFVQDDDK

Query:  AIENMFFIIKSFEKASGLQINLHKS
         +      + +F +  G+ +N  KS
Subjt:  AIENMFFIIKSFEKASGLQINLHKS

Arabidopsis top hits    e value    % identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    3.0e-19    30.9
Query:  WLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHE---PEWIVTNLDWAPI--NLDLASSLISPFREAEVFDCIKS
        WL DGD N+ FFHKV  A   +N I  L  ++ + + +   ++  ++ ++  +  ++ +   P+ +    D  P   N  LAS L +   + E+   + +
Subjt:  WLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHE---PEWIVTNLDWAPI--NLDLASSLISPFREAEVFDCIKS

Query:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKIL
        + +NKA GPD FT +FF + W ++K S ++   +FF      +  N T I LIPK      +S FRP+S  T +YKI+
Subjt:  IGQNKASGPDGFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKIL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    9.8e-10    39.76
Query:  LAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVDLWRVSKKKGV----LIKLDVEKAFDKISWDFIDCVLLNKGYPNTW
        + ERLK  + + I   Q +F+  R  +D I+   EAV   R  +KKGV    L+KLD+EKA+D+I WD+++  L++ G+P  W
Subjt:  LAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVDLWRVSKKKGV----LIKLDVEKAFDKISWDFIDCVLLNKGYPNTW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    1.1e-05    41.79
Query:  ILNGKPRG------NIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDIS--ITHLLFADD
        I+NG P+G       ++   PLSP+LF+L  + LS L   A+++G L G+ + + S  I HLLFADD
Subjt:  ILNGKPRG------NIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDIS--ITHLLFADD


Sequences
CDS sequence
CGTTGCAAGAAGCTTTGGCTAAATGACGGGGATGAAAACTCAGCTTTTTTCCATAAAGTTTGTACTGCCCGTCATAGAAGGAATCAAATTCATGAGCTCATTTCAGAAGA
AGGGATTAGCATTGTTTCCGACAATATGATGGAAACTGAAGTGATTAAACATTTCTTGGCTATCTATGATGCAAATCATGAACCTGAATGGATTGTGACTAATCTCGACT
GGGCTCCCATCAACCTTGATTTGGCCAGCAGTTTGATATCTCCTTTCAGAGAGGCTGAAGTTTTCGATTGCATCAAGTCTATTGGCCAAAATAAGGCCTCGGGTCCTGAC
GGCTTCACTATCAAATTTTTCAAGAAATTCTGGAACATTCTAAAACCGTCTATCATGTCAGTTTTCCATGATTTCTTTCACAACAAGATTGCCAACCGTGTTGTAAATCA
CACAAATATTGCGCTCATCCCCAAAAAGAATATGGCTGGCAACATTTCCGACTTCCGCCCAATTAGCCTTACCACTGCTCTTTATAAAATCCTTGCTAAGGTTTTGGCGG
AACGTTTGAAGACCACTTTAGGCGATACTATTAGCTTAAATCAATTAGCTTTTGTTCGCAAGAGACAGATCTCTGATCCCATATTGTTAGCTAATGAAGCTGTTGACTTA
TGGCGTGTTTCTAAAAAGAAAGGCGTCCTTATTAAGCTTGATGTGGAGAAAGCTTTCGACAAAATTAGCTGGGATTTCATTGATTGTGTTCTTCTCAATAAGGGTTACCC
TAACACGTGGAGAGATTGGATTAAGGCTTGTATTTCTTCAGTCTCTTACTCTATTATTCTAAATGGTAAGCCTCGAGGCAACATTCAAGCTAAGATCCCTTTATCTCCCT
TTCTTTTTGTTCTCGCCATGGATTATCTTAGTAGATTAATTGAGGCTGCTGAAAAGAAAGGTCTTTTATCTGGTGTAGTCATGGGAGATATCTCTATCACTCATCTCCTC
TTTGCTGATGACATTTTACTTTTTGTACAAGATGATGATAAGGCCATTGAAAACATGTTTTTCATTATTAAATCTTTTGAAAAAGCCTCTGGTCTTCAGATAAATCTTCA
CAAGTCTACTGTTTCTGGCATCAACCTGACAGATCAAAGAACTTCAGATATTACTCGCTTTTGGGGATGTGGCTCTCATCCTCTGCCAATTGCTTACCTTGGCATTCCTT
TAGGCGGCACTCCTAAAAATACTCAGTTTTGGGAGCCCATGATTGAGAAGATTCAACGAAGAATTCAAAATTGGCGCTTTGTATCTCTTTCCAAGGGAGGCCGTCTCGCT
CTTATTCAATCGGGCCTTCAAATTTGGTGCGTTGGGAAATTGTCTCTACCCCAAAAGCTGAAGGTGGTTTGGACATCCATAAAATCAAAAGTACTAACGAAGCTCTTCTC
CTTAAATGGACTTGGCGTTTTTTCACTGAGGATAACTCTCTTTGGAGGAAATTCATTAGCTCTAAATATTTCAGCCTTCAACACAAGAGTTTTCCCTCTAGCAACAAATT
TTCTAGTTCCAGATCTCCTTGAACAACTGGAGAAGGAGGCGACGGTGCTTGCGGGACCTACTCACCGGAGGAAGAAAAACGAAGGGGAAGGCGGCGGCGGCGTGAACAGA
AAAACGGGGGAAAAAAATGGGGAAGGAAAAGGCGACGGCGGCGGCTGCTGTGCGGCTGGGAGAAGAAAAAAAAAGGGAGAAGAGATTTGGGGGGGATGGGAGGTGCGCAA
CGGTAAATCCATCCTTTTTTGGCTCTACAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTTGTGATCGTCTCTATCAGTTGTCTTCAAACAAAAATCTCACAGTTGAGG
AAGCTTGGTCGATTTCGGATAGAGTGTGGAATTTTAGTCCTCGCCGGCCTCTCCTTGATAGAGAAGTTCAAAGATGGAATGAGCTGTCTGGTCTTTTACCCATTCCAGAT
TCTTTTCGAGGTTCTGATGTTCAACGATGGCTTGCTTCTGAAGATGACTTTTTCTCCACAAAATATGCTCGATCCATCCTATCGGTCGCACCTTCCAGACCTTTTTCCAG
CCATGGTTAA
mRNA sequence
CGTTGCAAGAAGCTTTGGCTAAATGACGGGGATGAAAACTCAGCTTTTTTCCATAAAGTTTGTACTGCCCGTCATAGAAGGAATCAAATTCATGAGCTCATTTCAGAAGA
AGGGATTAGCATTGTTTCCGACAATATGATGGAAACTGAAGTGATTAAACATTTCTTGGCTATCTATGATGCAAATCATGAACCTGAATGGATTGTGACTAATCTCGACT
GGGCTCCCATCAACCTTGATTTGGCCAGCAGTTTGATATCTCCTTTCAGAGAGGCTGAAGTTTTCGATTGCATCAAGTCTATTGGCCAAAATAAGGCCTCGGGTCCTGAC
GGCTTCACTATCAAATTTTTCAAGAAATTCTGGAACATTCTAAAACCGTCTATCATGTCAGTTTTCCATGATTTCTTTCACAACAAGATTGCCAACCGTGTTGTAAATCA
CACAAATATTGCGCTCATCCCCAAAAAGAATATGGCTGGCAACATTTCCGACTTCCGCCCAATTAGCCTTACCACTGCTCTTTATAAAATCCTTGCTAAGGTTTTGGCGG
AACGTTTGAAGACCACTTTAGGCGATACTATTAGCTTAAATCAATTAGCTTTTGTTCGCAAGAGACAGATCTCTGATCCCATATTGTTAGCTAATGAAGCTGTTGACTTA
TGGCGTGTTTCTAAAAAGAAAGGCGTCCTTATTAAGCTTGATGTGGAGAAAGCTTTCGACAAAATTAGCTGGGATTTCATTGATTGTGTTCTTCTCAATAAGGGTTACCC
TAACACGTGGAGAGATTGGATTAAGGCTTGTATTTCTTCAGTCTCTTACTCTATTATTCTAAATGGTAAGCCTCGAGGCAACATTCAAGCTAAGATCCCTTTATCTCCCT
TTCTTTTTGTTCTCGCCATGGATTATCTTAGTAGATTAATTGAGGCTGCTGAAAAGAAAGGTCTTTTATCTGGTGTAGTCATGGGAGATATCTCTATCACTCATCTCCTC
TTTGCTGATGACATTTTACTTTTTGTACAAGATGATGATAAGGCCATTGAAAACATGTTTTTCATTATTAAATCTTTTGAAAAAGCCTCTGGTCTTCAGATAAATCTTCA
CAAGTCTACTGTTTCTGGCATCAACCTGACAGATCAAAGAACTTCAGATATTACTCGCTTTTGGGGATGTGGCTCTCATCCTCTGCCAATTGCTTACCTTGGCATTCCTT
TAGGCGGCACTCCTAAAAATACTCAGTTTTGGGAGCCCATGATTGAGAAGATTCAACGAAGAATTCAAAATTGGCGCTTTGTATCTCTTTCCAAGGGAGGCCGTCTCGCT
CTTATTCAATCGGGCCTTCAAATTTGGTGCGTTGGGAAATTGTCTCTACCCCAAAAGCTGAAGGTGGTTTGGACATCCATAAAATCAAAAGTACTAACGAAGCTCTTCTC
CTTAAATGGACTTGGCGTTTTTTCACTGAGGATAACTCTCTTTGGAGGAAATTCATTAGCTCTAAATATTTCAGCCTTCAACACAAGAGTTTTCCCTCTAGCAACAAATT
TTCTAGTTCCAGATCTCCTTGAACAACTGGAGAAGGAGGCGACGGTGCTTGCGGGACCTACTCACCGGAGGAAGAAAAACGAAGGGGAAGGCGGCGGCGGCGTGAACAGA
AAAACGGGGGAAAAAAATGGGGAAGGAAAAGGCGACGGCGGCGGCTGCTGTGCGGCTGGGAGAAGAAAAAAAAAGGGAGAAGAGATTTGGGGGGGATGGGAGGTGCGCAA
CGGTAAATCCATCCTTTTTTGGCTCTACAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTTGTGATCGTCTCTATCAGTTGTCTTCAAACAAAAATCTCACAGTTGAGG
AAGCTTGGTCGATTTCGGATAGAGTGTGGAATTTTAGTCCTCGCCGGCCTCTCCTTGATAGAGAAGTTCAAAGATGGAATGAGCTGTCTGGTCTTTTACCCATTCCAGAT
TCTTTTCGAGGTTCTGATGTTCAACGATGGCTTGCTTCTGAAGATGACTTTTTCTCCACAAAATATGCTCGATCCATCCTATCGGTCGCACCTTCCAGACCTTTTTCCAG
CCATGGTTAA
Protein sequence
RCKKLWLNDGDENSAFFHKVCTARHRRNQIHELISEEGISIVSDNMMETEVIKHFLAIYDANHEPEWIVTNLDWAPINLDLASSLISPFREAEVFDCIKSIGQNKASGPD
GFTIKFFKKFWNILKPSIMSVFHDFFHNKIANRVVNHTNIALIPKKNMAGNISDFRPISLTTALYKILAKVLAERLKTTLGDTISLNQLAFVRKRQISDPILLANEAVDL
WRVSKKKGVLIKLDVEKAFDKISWDFIDCVLLNKGYPNTWRDWIKACISSVSYSIILNGKPRGNIQAKIPLSPFLFVLAMDYLSRLIEAAEKKGLLSGVVMGDISITHLL
FADDILLFVQDDDKAIENMFFIIKSFEKASGLQINLHKSTVSGINLTDQRTSDITRFWGCGSHPLPIAYLGIPLGGTPKNTQFWEPMIEKIQRRIQNWRFVSLSKGGRLA
LIQSGLQIWCVGKLSLPQKLKVVWTSIKSKVLTKLFSLNGLGVFSLRITLFGGNSLALNISAFNTRVFPLATNFLVPDLLEQLEKEATVLAGPTHRRKKNEGEGGGGVNR
KTGEKNGEGKGDGGGCCAAGRRKKKGEEIWGGWEVRNGKSILFWLYNWSVLGPLKYVCDRLYQLSSNKNLTVEEAWSISDRVWNFSPRRPLLDREVQRWNELSGLLPIPD
SFRGSDVQRWLASEDDFFSTKYARSILSVAPSRPFSSHG