; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G26240 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G26240
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr6:23215332..23218001
RNA-Seq ExpressionCSPI06G26240
SyntenyCSPI06G26240
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.1e-27052.03Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKI S RQRRS IS+I++  G  C+T+  I K F+DHF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK
        +IY    +   W ID+L W+PI      +L    + +EI+ AL  F  NKSPGPDGFTMEF K TW+  KE IL IF DFH NCIIN  +N T IALIAK
Subjt:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK

Query:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG
        ++ C  P+DYRPISLTT +YKLIAKVIAERLK  LP  ++ENQ+AFV+GRQI DAIL+ANEA+D+W+ KK +G+V+KLDIEKAFDK+NW  IDFML KKG
Subjt:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG

Query:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN
        +P KWR WI ACI+SV YSI+INGRPRGKI+P+RGI QGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+EDDE +I N
Subjt:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN

Query:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS
        ++N + LF+LA+GL+INLNKSTISPIN+D  RT  +A++WG S  FLPI YLGVPLG                              GGKITLIK++LAS
Subjt:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS

Query:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI
        +P Y +S+FK P S  K+IE   RNFLW+N  +   ++L+ W+ + S   KGGL I+ ++ TNF LL+KW+WR+  E +PLWK+II AKY     G++P 
Subjt:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI

Query:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL
           +SSS++PW SI KG +W    + W IK G S SFWHS WH+ SP +   PRL+ALST K  SI +MWN    DWDL P+R LR  E  LW ++K SL
Subjt:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL

Query:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED
          S  ++G D+P WTLN+NG +TVAS+K       Q+  +      + NLWK++IPK+C FF                +RLP    +PSWC++CK   ED
Subjt:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED

Query:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI
        + HLF  CP +  +W+ +   L   +   +   LC  +   K KTKK     +  A+ LWNIW ERN RIF G+EK    +WEDI+A+
Subjt:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-25549.1Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE+++IDKLEAE + TE+H   RT++K DL Q+ L EAQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS I    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        +IYT+ +    FI++L W PI   + + L K  +  EI+  LK+F KNK+PGPDG+ M+FL+ +W+F K+NI +IF DFH   IIN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        + C   +D+RPISLTT +YKLIAK +A+RLKQ LPD ISE+Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDF+L KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
         QKWRK I +CI+SV YSILINGRPRG+IKP+RGI QGDP+SPFIFVLAMDYLS LLN+L  +  I GV F+   NLTH+LFADDIL+F+ED ++ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R   +A  WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW    +  NI+LI+W+ ++SP  KGGL I+SV STNF LL KW+W+F  EK+PLWKR+I +KY++  +G  P  
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         K+SS+ +PW ++ +   W    I W +  G+ +SFW   W+  +P +   PRLFALST K  S+   WN    DW L+  RPLR  EE LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LP+ G   P W LN+N  F  AS+K A         N     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L+      +  +L +++     + +K     +  A  LW IW ERN RIFK +EK    +WED  A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.9e-25950.56Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE+  IDKLEAE   TE+H   R ++K DL Q+ L EAQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS +    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        EIYT+ K    FID+L W PI  T+   L K  +  EI+  LK+F KNK+PGPDGFTM+FL+ +W+F K NI +IF DFH N  IN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        D C   SD+RPISLTT +YKLIAKV+A+RLKQ LP  ISE Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDFML KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
          KWR  I +CI+SV YSILINGRPRG+IKPTRGI QGDP+SPFIFVLAMDYLS LL +L ++  I GV+F    NLTH+LFADDIL+F+ED E+ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R N +   WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW  T +  NI+LI+W+ V+SP  KGGL I+SV STNF LL KW+W+F  EK PLWKR+I +KY+Q  +G  P +
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         KYSS+ +PW ++     W    I W +  G+ +SFW   W+  SP +   PRLFALST K  S+ ++WN    DW+++  RPLR  E+ LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LPD G   P W LN+N  F  ASIK      +   TN     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L       +  +L +++     KT+K     + +A  LW IW ERN RIFK ++K+   +WEDI A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.7e-25750Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ IDKLEAE   TE+H   R ++K DL Q+ L +AQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS +    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        EIYT+ K    FID+  W PI  T+   L K  +  EI+  LK+F KNK+PGPDGFTM+FL+ +W+F K NI +IF DFH N  IN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        + C   SD++PISLTT +YKLIAKV+A+RLKQ LPD ISE Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDFML KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
          KWR  I +CI+SV YSILINGRPRG+IKPTRGI QGDP+S FIFVLAMDYLS LL +L ++  I GV+F    NLTH+LFADDIL+F+ED E+ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R N +   WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW  T +  NI+LI+W+ V+SP  KGGL I+ V STNF LL KW+W+F  EK PLWKR+I +KY+Q  +G  P +
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         KYSS+ +PW ++     W    I W +  G+ +SFW   W+  SP +   PRLFALST K  S+ ++WN    DW+++  RPLR  E+ LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LPD G   P W LN+N  F  ASIK      +   TN     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L       +  +L +++     KT+K     + +A  LW IW ERN RIFK ++K+   +WEDI A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]5.3e-27052.03Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKI S RQRRS IS+I++  G  C+T+  I K F+DHF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK
        +IY    +   W ID+L W+PI      +L    + +EI+ AL  F  NKSPGPDGFTMEF K TW+  KE IL IF DFH NCIIN  +N T IALIAK
Subjt:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK

Query:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG
        ++ C  P+DYRPISLTT +YKLIAKVIAERLK  LP  ++ENQ+AFV+GRQI DAIL+ANEA+D+W+ KK +G+V+KLDIEKAFDK+NW  IDFML KKG
Subjt:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG

Query:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN
        +P KWR WI ACI+SV YSI+INGRPRGKI+P+RGI QGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+EDDE +I N
Subjt:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN

Query:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS
        ++N + LF+LA+GL+INLNKSTISPIN+D  RT  +A++WG S  FLPI YLGVPLG                              GGKITLIK++LAS
Subjt:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS

Query:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI
        +P Y +S+FK P S  K+IE   RNFLW+N  +   ++L+ W+ + S   KGGL I+ ++ TNF LL+KW+WR+  E +PLWK+II AKY     G++P 
Subjt:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI

Query:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL
           +SSS++PW SI KG +W    + W IK G S SFWHS WH+ SP +   PRL+ALST K  SI +MWN    DWDL P+R LR  E  LW ++K SL
Subjt:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL

Query:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED
          S  ++G D+P WTLN+NG +TVAS+K       Q+  +      + NLWK++IPK+C FF                +RLP    +PSWC++CK   ED
Subjt:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED

Query:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI
        + HLF  CP +  +W+ +   L   +   +   LC  +   K KTKK     +  A+ LWNIW ERN RIF G+EK    +WEDI+A+
Subjt:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein2.6e-27052.03Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKI S RQRRS IS+I++  G  C+T+  I K F+DHF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK
        +IY    +   W ID+L W+PI      +L    + +EI+ AL  F  NKSPGPDGFTMEF K TW+  KE IL IF DFH NCIIN  +N T IALIAK
Subjt:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK

Query:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG
        ++ C  P+DYRPISLTT +YKLIAKVIAERLK  LP  ++ENQ+AFV+GRQI DAIL+ANEA+D+W+ KK +G+V+KLDIEKAFDK+NW  IDFML KKG
Subjt:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG

Query:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN
        +P KWR WI ACI+SV YSI+INGRPRGKI+P+RGI QGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+EDDE +I N
Subjt:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN

Query:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS
        ++N + LF+LA+GL+INLNKSTISPIN+D  RT  +A++WG S  FLPI YLGVPLG                              GGKITLIK++LAS
Subjt:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS

Query:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI
        +P Y +S+FK P S  K+IE   RNFLW+N  +   ++L+ W+ + S   KGGL I+ ++ TNF LL+KW+WR+  E +PLWK+II AKY     G++P 
Subjt:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI

Query:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL
           +SSS++PW SI KG +W    + W IK G S SFWHS WH+ SP +   PRL+ALST K  SI +MWN    DWDL P+R LR  E  LW ++K SL
Subjt:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL

Query:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED
          S  ++G D+P WTLN+NG +TVAS+K       Q+  +      + NLWK++IPK+C FF                +RLP    +PSWC++CK   ED
Subjt:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED

Query:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI
        + HLF  CP +  +W+ +   L   +   +   LC  +   K KTKK     +  A+ LWNIW ERN RIF G+EK    +WEDI+A+
Subjt:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.8e-25549.1Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE+++IDKLEAE + TE+H   RT++K DL Q+ L EAQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS I    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        +IYT+ +    FI++L W PI   + + L K  +  EI+  LK+F KNK+PGPDG+ M+FL+ +W+F K+NI +IF DFH   IIN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        + C   +D+RPISLTT +YKLIAK +A+RLKQ LPD ISE+Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDF+L KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
         QKWRK I +CI+SV YSILINGRPRG+IKP+RGI QGDP+SPFIFVLAMDYLS LLN+L  +  I GV F+   NLTH+LFADDIL+F+ED ++ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R   +A  WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW    +  NI+LI+W+ ++SP  KGGL I+SV STNF LL KW+W+F  EK+PLWKR+I +KY++  +G  P  
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         K+SS+ +PW ++ +   W    I W +  G+ +SFW   W+  +P +   PRLFALST K  S+   WN    DW L+  RPLR  EE LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LP+ G   P W LN+N  F  AS+K A         N     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L+      +  +L +++     + +K     +  A  LW IW ERN RIFK +EK    +WED  A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein9.1e-26050.56Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE+  IDKLEAE   TE+H   R ++K DL Q+ L EAQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS +    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        EIYT+ K    FID+L W PI  T+   L K  +  EI+  LK+F KNK+PGPDGFTM+FL+ +W+F K NI +IF DFH N  IN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        D C   SD+RPISLTT +YKLIAKV+A+RLKQ LP  ISE Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDFML KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
          KWR  I +CI+SV YSILINGRPRG+IKPTRGI QGDP+SPFIFVLAMDYLS LL +L ++  I GV+F    NLTH+LFADDIL+F+ED E+ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R N +   WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW  T +  NI+LI+W+ V+SP  KGGL I+SV STNF LL KW+W+F  EK PLWKR+I +KY+Q  +G  P +
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         KYSS+ +PW ++     W    I W +  G+ +SFW   W+  SP +   PRLFALST K  S+ ++WN    DW+++  RPLR  E+ LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LPD G   P W LN+N  F  ASIK      +   TN     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L       +  +L +++     KT+K     + +A  LW IW ERN RIFK ++K+   +WEDI A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein3.3e-25750Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ IDKLEAE   TE+H   R ++K DL Q+ L +AQ+WAQKCKR+W  EGDENS+F+HKI + RQ++  IS +    G  C  D DI   FI HF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR
        EIYT+ K    FID+  W PI  T+   L K  +  EI+  LK+F KNK+PGPDGFTM+FL+ +W+F K NI +IF DFH N  IN ++N T I LIAK+
Subjt:  EIYTEKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKR

Query:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF
        + C   SD++PISLTT +YKLIAKV+A+RLKQ LPD ISE Q+AFV+GRQIT+AILIANEA+DFW+ KK +G+V+KLDIEKAFDK+NW  IDFML KK +
Subjt:  DTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGF

Query:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM
          KWR  I +CI+SV YSILINGRPRG+IKPTRGI QGDP+S FIFVLAMDYLS LL +L ++  I GV+F    NLTH+LFADDIL+F+ED E+ ++N+
Subjt:  PQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNM

Query:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI
        +  L LFE A+GLNINL+KSTI PIN+ T R N +   WG S   LP  YLG+PLG                              GG+ITLI +TL S+
Subjt:  RNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLASI

Query:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK
        P Y +SVFK PK + + IE   RNFLW  T +  NI+LI+W+ V+SP  KGGL I+ V STNF LL KW+W+F  EK PLWKR+I +KY+Q  +G  P +
Subjt:  PNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPIK

Query:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP
         KYSS+ +PW ++     W    I W +  G+ +SFW   W+  SP +   PRLFALST K  S+ ++WN    DW+++  RPLR  E+ LW ++K SLP
Subjt:  SKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSLP

Query:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK
        + LPD G   P W LN+N  F  ASIK      +   TN     +Y  LWK   PK+CKFF                +RLP WT+ P+WC +C  ++ED 
Subjt:  S-LPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKEDK

Query:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA
         HLF HCP+S  LW K + +L       +  +L +++     KT+K     + +A  LW IW ERN RIFK ++K+   +WEDI A
Subjt:  QHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQA

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein2.0e-27052.03Show/hide
Query:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG
        +KE++ ID+LEAE N++E     RT +K D+     KEAQ+W QK KRLW  EGDEN++F+HKI S RQRRS IS+I++  G  C+T+  I K F+DHF 
Subjt:  MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFG

Query:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK
        +IY    +   W ID+L W+PI      +L    + +EI+ AL  F  NKSPGPDGFTMEF K TW+  KE IL IF DFH NCIIN  +N T IALIAK
Subjt:  EIYT-EKKRDLWFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK

Query:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG
        ++ C  P+DYRPISLTT +YKLIAKVIAERLK  LP  ++ENQ+AFV+GRQI DAIL+ANEA+D+W+ KK +G+V+KLDIEKAFDK+NW  IDFML KKG
Subjt:  RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKG

Query:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN
        +P KWR WI ACI+SV YSI+INGRPRGKI+P+RGI QGDPISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTHLLFADDILLF+EDDE +I N
Subjt:  FPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINN

Query:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS
        ++N + LF+LA+GL+INLNKSTISPIN+D  RT  +A++WG S  FLPI YLGVPLG                              GGKITLIK++LAS
Subjt:  MRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLG------------------------------GGKITLIKATLAS

Query:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI
        +P Y +S+FK P S  K+IE   RNFLW+N  +   ++L+ W+ + S   KGGL I+ ++ TNF LL+KW+WR+  E +PLWK+II AKY     G++P 
Subjt:  IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKYEQSYLGELPI

Query:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL
           +SSS++PW SI KG +W    + W IK G S SFWHS WH+ SP +   PRL+ALST K  SI +MWN    DWDL P+R LR  E  LW ++K SL
Subjt:  KSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKVSL

Query:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED
          S  ++G D+P WTLN+NG +TVAS+K       Q+  +      + NLWK++IPK+C FF                +RLP    +PSWC++CK   ED
Subjt:  -PSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFF----------------RRLPTWTIKPSWCILCKATKED

Query:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI
        + HLF  CP +  +W+ +   L   +   +   LC  +   K KTKK     +  A+ LWNIW ERN RIF G+EK    +WEDI+A+
Subjt:  KQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.1e-3923.76Show/hide
Query:  KELERIDKLEAENNITELHISCRTSIKTDLKQMALK-EAQVWAQKCKRLWNIEGDENSAFYHKISSV----------RQRRSFISSISTAQGALCTTDRD
        +E  +ID L ++  + EL    +T  K   +Q   K  A++   + ++      +  S F+ +I+ +          ++ ++ I +I   +G + T   +
Subjt:  KELERIDKLEAENNITELHISCRTSIKTDLKQMALK-EAQVWAQKCKRLWNIEGDENSAFYHKISSV----------RQRRSFISSISTAQGALCTTDRD

Query:  IEKTFIDHFGEIYTEKKRDL----WFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIIN
        I+ T  +++  +Y  K  +L     F+D+     + + + + L++  +  EI   + + P  KSPGPDGFT EF +         +L++F    K  I+ 
Subjt:  IEKTFIDHFGEIYTEKKRDL----WFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIIN

Query:  SILNSTFIALIAK--RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYV-LKLDIEKAF
        +      I LI K  RDT     ++RPISL     K++ K++A R++Q +  +I  +Q+ F+ G Q    I  +   +    + K K +V + +D EKAF
Subjt:  SILNSTFIALIAK--RDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYV-LKLDIEKAF

Query:  DKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFA
        DKI    +   L+K G    + K I A       +I++NG+         G  QG P+SP +F +    L +L   + ++  IKG+   GK  +   LFA
Subjt:  DKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFA

Query:  DDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLGG--------------------------
        DD+++++E+   +  N+   +  F   +G  IN+ KS     N + Q  + +  +  F++    I+YLG+ L                            
Subjt:  DDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQYLGVPLGG--------------------------

Query:  ------GKITLIKATLAS--IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINK-GGLDINSVQSTNFTLLSKWIWRFYEEKN
              G+I ++K  +    I  ++    K P + + ++E     F+W    ++K   + K  ++LS  NK GG+ +   +      ++K  W +Y+ ++
Subjt:  ------GKITLIKATLAS--IPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINK-GGLDINSVQSTNFTLLSKWIWRFYEEKN

Query:  -PLWKR
           W R
Subjt:  -PLWKR

P08548 LINE-1 reverse transcriptase homolog3.9e-4226.47Show/hide
Query:  KRLWNIEGDENSAFYHKISSV----------RQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDLWFIDS------LPWTPIKETDHDDLS
        KR+        S F+ KI+ +          ++ +S ISSI      + T   +I+K   +++ ++Y+ K  +L  ID       LP    KE +   L+
Subjt:  KRLWNIEGDENSAFYHKISSV----------RQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDLWFIDS------LPWTPIKETDHDDLS

Query:  KKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK--RDTCIAPSDYRPISLTTGLYKLIAKVIAE
        +  S+ EI + ++N PK KSPGPDGFT EF +         +L +F +  K  I+ +      I LI K  +D      +YRPISL     K++ K++  
Subjt:  KKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAK--RDTCIAPSDYRPISLTTGLYKLIAKVIAE

Query:  RLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRG
        R++Q +  II  +Q+ F+ G Q    I  +   +    + K K + +L +D EKAFD I    +   L K G    + K IEA  +    +I++NG    
Subjt:  RLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRG

Query:  KIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINI
              G  QG P+SP +F + M+ L+I    + ++  IKG+   G   +   LFADD+++++E+  ++   +   ++ +   +G  IN +KS       
Subjt:  KIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINI

Query:  DTQRTNYVAAKWGFSVNFLPIQYLGVPLGG--------------------------------GKITLIKATL--ASIPNYHISVFKPPKSVYKDIETIRR
        + Q    V     F+V    ++YLGV L                                  G+I ++K ++   +I N++    K P S +KD+E I  
Subjt:  DTQRTNYVAAKWGFSVNFLPIQYLGVPLGG--------------------------------GKITLIKATL--ASIPNYHISVFKPPKSVYKDIETIRR

Query:  NFLWRNTFDKKNINLIKWSTVLSPINK-GGLDINSVQSTNFTLLSKWIWRFYEEKN-PLWKRI
        +F+W    ++K   + K  T+LS  NK GG+ +  ++    +++ K  W +++ +   +W RI
Subjt:  NFLWRNTFDKKNINLIKWSTVLSPINK-GGLDINSVQSTNFTLLSKWIWRFYEEKN-PLWKRI

P0C2F6 Putative ribonuclease H protein At1g657501.1e-2827.44Show/hide
Query:  GKITLIKATLASIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITA
        G++TL KA L+S+P + +S    P+S+   ++ + R FLW +T +KK  +L+KWS V SP  +GGL + + +S N  L+SK  WR  +EKN LW  ++  
Subjt:  GKITLIKATLASIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITA

Query:  KYEQSYLGELPIKSKYSSSKAPWMSIIKG-ADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRS
        KY    + +        S  + W SI  G  D V   + W    G  + FW  RW    P  + +       T     +A    +    WD     P  +
Subjt:  KYEQSYLGELPIKSKYSSSKAPWMSIIKG-ADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRS

Query:  VEEVLWEDMKVSLPSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDD-GEIYNNLWKSTIPKRCKFF-------------RRLPTWTIKPSW
            L  +++  +  L     D   W  + +G F+V S   A  +   DE    +    +N LWK  +P+R K F              R        + 
Subjt:  VEEVLWEDMKVSLPSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDD-GEIYNNLWKSTIPKRCKFF-------------RRLPTWTIKPSW

Query:  CILCKATKEDKQHLFTHCPFSTLLWKKV
        C +CK   E   H+   CP    +W +V
Subjt:  CILCKATKEDKQHLFTHCPFSTLLWKKV

P11369 LINE-1 retrotransposable element ORF2 protein7.2e-3624.68Show/hide
Query:  KISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDL----WFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTM
        +++   + +  I+ I   +G + T   +I+ T    +  +Y+ K  +L     F+D      + +   D L+   S KEI   + + P  KSPGPDGF+ 
Subjt:  KISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDL----WFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTM

Query:  EFLKGTWNFTKENILEIFND-FHKNCIINSILNSTFIALIA------KRDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQI
        EF    +   KE+++ I +  FHK  +  ++ NS + A I       K  T I   ++RPISL     K++ K++A R+++ +  II  +Q+ F+ G Q 
Subjt:  EFLKGTWNFTKENILEIFND-FHKNCIINSILNSTFIALIA------KRDTCIAPSDYRPISLTTGLYKLIAKVIAERLKQVLPDIISENQLAFVRGRQI

Query:  TDAILIANEAVDFWKQKKTKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAM
           I  +   + +  + K K + ++ LD EKAFDKI    +  +L + G    +   I+A  +    +I +NG     I    G  QG P+SP++F +  
Subjt:  TDAILIANEAVDFWKQKKTKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRGKIKPTRGIWQGDPISPFIFVLAM

Query:  DYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQY
          L +L   + +Q  IKG+   GK  +   L ADD+++++ D + +   + N +  F    G  IN NKS       + Q    +     FS+    I+Y
Subjt:  DYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWGFSVNFLPIQY

Query:  LGVPLGG--------------------------------GKITLIKATL--ASIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLS
        LGV L                                  G+I ++K  +   +I  ++    K P   + ++E     F+W N   +   +L+K      
Subjt:  LGVPLGG--------------------------------GKITLIKATL--ASIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLS

Query:  PINKGGLDINSVQSTNFTLLSKWIWRFYEEKN-PLWKRI
            GG+ +  ++     ++ K  W +Y ++    W RI
Subjt:  PINKGGLDINSVQSTNFTLLSKWIWRFYEEKN-PLWKRI

P14381 Transposon TX1 uncharacterized 149 kDa protein4.0e-2621.82Show/hide
Query:  KTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTD---RDIEKTFIDHF---GEIYTEKKRDLWFIDSLPWTPI
        K  L+ M  ++A+    + +     + D  S F++ +   +  R  I+ +    G         RD  ++F  +      I  +   +LW  D LP   +
Subjt:  KTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTD---RDIEKTFIDHF---GEIYTEKKRDLWFIDSLPWTPI

Query:  KETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKRDTCIAPSDYRPISLTTGLYKL
         E   + L    +  E+  AL+  P NKSPG DG T+EF +  W+    +   +  +  K   +        ++L+ K+       ++RP+SL +  YK+
Subjt:  KETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKRDTCIAPSDYRPISLTTGLYKL

Query:  IAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILI
        +AK I+ RLK VL ++I  +Q   V GR I D + +  + + F ++       L LD EKAFD+++   +   L    F  ++  +++    S    + I
Subjt:  IAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILI

Query:  NGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKST
        N      +   RG+ QG P+S  ++ LA++    LL    ++ L   V       +    +ADD++L  +D  + +   +    ++  A+   IN +KS+
Subjt:  NGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKST

Query:  ---------------ISPINIDTQRTNYV-----AAKWGFSVNFLPIQ------------YLGVPLGGGKITLIKATLASIPNYHISVFKPPKSVYKDIE
                          I+ +++   Y+     A ++  S NF+ ++            +  V    G+  +I   +AS   Y +    P +     I+
Subjt:  ---------------ISPINIDTQRTNYV-----AAKWGFSVNFLPIQ------------YLGVPLGGGKITLIKATLASIPNYHISVFKPPKSVYKDIE

Query:  TIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRF-YEEKNPLWKRIITAKYEQ
            +FLW         + +       P+ +GG  +  ++S   T   + I R+ Y + +P W  + ++ Y Q
Subjt:  TIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRF-YEEKNPLWKRIITAKYEQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.4e-1227.51Show/hide
Query:  WAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDLWFIDSLPWTPIKETD----HDDLSKKNSA--
        + QK +  W  +GD N+ F+HK+    Q ++ I  +             +++  + ++  +      D+   DS+    IK+      +D L+ + SA  
Subjt:  WAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDLWFIDSLPWTPIKETD----HDDLSKKNSA--

Query:  --KEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKRDTCIAPSDYRPISLTTGLYKLI
          KEI  A+   P+NK+PGPD FT EF   +W   K++ +    +F +   +    N+T I LI K       S +RP+S  T +YK+I
Subjt:  --KEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKRDTCIAPSDYRPISLTTGLYKLI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.5e-0727.68Show/hide
Query:  RLPTWTIK-PSWCILCKATKEDKQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLV-AATLWNIWNERNKRIFKGEEKK
        RL  W +  P+ C+LC A  + + HLF  C FS ++W+      +  L  P     C +   +  + K       L   + ++ IW ERN+R+  G  + 
Subjt:  RLPTWTIK-PSWCILCKATKEDKQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLV-AATLWNIWNERNKRIFKGEEKK

Query:  ADTVWEDIQAIM
         +++ +DIQ I+
Subjt:  ADTVWEDIQAIM

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.1e-1041.98Show/hide
Query:  IAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKK-TKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKW
        + ERLK ++ ++I   Q +F+ GR  TD I+   EAV   ++KK  KG+ +LKLD+EKA+D+I W  ++  L   GFP+ W
Subjt:  IAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKK-TKGY-VLKLDIEKAFDKINWTIIDFMLHKKGFPQKW

AT4G29090.1 Ribonuclease H-like superfamily protein4.6e-2223.66Show/hide
Query:  SIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKY-EQSYLGEL
        ++P Y ++ F  PK+V K I ++  +F WRN  + K ++   W  +     +GG+    +++ N  LL K +WR       L  ++  ++Y  +S     
Subjt:  SIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPLWKRIITAKY-EQSYLGEL

Query:  PIKSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKV
        P+ S+ S     W SI    + +    +  +  G+ +  W  +W +  P      R+  +  ++  S++++  V     D   +   + V E+L+ +++ 
Subjt:  PIKSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEVLWEDMKV

Query:  SL-PSLPDSG---LDNPRWTLNNNGSFTVAS-----IKLARPLNNQDETNEDD-GEIYNNLWKS-TIPKRCKFFRRLPTWTI------------KPSWCI
         L   L   G   LD+  W   ++G +TV S      ++    ++  E +E     IY  +WKS T PK   F  +  + ++            K S CI
Subjt:  SL-PSLPDSG---LDNPRWTLNNNGSFTVAS-----IKLARPLNNQDETNEDD-GEIYNNLWKS-TIPKRCKFFRRLPTWTI------------KPSWCI

Query:  LCKATKEDKQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKT----KGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTV
         C + KE   HL   C F+ L W    + +  PL    + ++  +L+       G  + +  +Q LV   LW +W  RN+ +F+G E  A  V
Subjt:  LCKATKEDKQHLFTHCPFSTLLWKKVEVILDKPLLFPNSTALCKDLFKT----KGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.2e-1143.28Show/hide
Query:  LINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSF-NGKHNLTHLLFADD
        +ING P+G + P+RG+ QGDP+SP++F+L  + LS L    ++Q  + G+   N    + HLLFADD
Subjt:  LINGRPRGKIKPTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSF-NGKHNLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAACTCGAGCGCATAGACAAACTGGAAGCAGAAAACAATATCACTGAACTACACATCTCTTGCAGAACTTCCATCAAAACTGACTTGAAACAAATGGCCTTGAA
AGAAGCACAAGTTTGGGCCCAAAAATGCAAACGGTTATGGAACATTGAAGGTGACGAAAATTCAGCTTTTTATCATAAAATCAGCTCTGTTAGGCAAAGAAGAAGCTTCA
TATCAAGCATTTCCACCGCTCAAGGTGCTCTGTGTACAACTGATAGGGACATTGAGAAAACCTTCATCGATCATTTTGGGGAAATCTACACGGAAAAGAAAAGAGATCTT
TGGTTCATCGACAGCCTCCCCTGGACTCCTATAAAAGAGACAGACCATGATGACCTGAGCAAAAAAAATTCTGCAAAAGAAATATACAATGCTCTAAAAAACTTCCCCAA
AAATAAATCTCCGGGACCAGATGGCTTTACTATGGAGTTTCTCAAGGGCACTTGGAATTTCACAAAAGAGAATATATTGGAAATCTTCAATGACTTCCATAAAAATTGCA
TAATCAACAGTATTTTGAATTCTACCTTTATTGCTCTCATTGCAAAAAGGGATACATGCATAGCCCCTTCGGATTATAGGCCCATAAGCCTTACAACTGGTTTGTACAAG
CTTATTGCTAAAGTTATCGCCGAAAGACTAAAACAAGTCCTGCCTGATATAATCTCAGAGAACCAATTAGCTTTCGTCAGAGGGAGACAGATTACGGATGCCATTTTGAT
TGCAAATGAAGCGGTGGATTTCTGGAAGCAGAAAAAAACAAAAGGATATGTGCTCAAGCTTGACATCGAAAAGGCCTTTGATAAAATCAATTGGACTATCATCGATTTTA
TGCTGCATAAAAAAGGCTTTCCACAAAAATGGCGTAAATGGATTGAAGCTTGTATTACTAGTGTTCATTACTCCATCCTTATCAACGGAAGACCAAGAGGTAAAATTAAA
CCCACAAGAGGCATCTGGCAAGGTGATCCAATCTCTCCGTTCATTTTTGTTCTCGCGATGGATTATCTCAGCATACTTCTCAATCACTTGGAGAAACAAAACTTGATAAA
AGGAGTTAGTTTCAATGGGAAACATAACCTTACTCATCTCCTCTTTGCTGACGATATCCTTCTTTTTATGGAGGACGATGAAGAAACCATTAATAATATGAGAAATGCCC
TTCGGCTTTTTGAATTGGCCACAGGTCTCAACATCAACCTCAACAAATCCACGATCTCTCCTATCAACATTGATACGCAGAGAACAAATTATGTGGCGGCTAAATGGGGC
TTCTCAGTTAACTTCCTACCTATCCAATATTTAGGAGTGCCTTTGGGTGGTGGCAAGATAACTCTCATTAAAGCTACATTAGCTAGCATTCCTAATTATCACATTTCGGT
TTTCAAGCCCCCCAAATCTGTCTACAAAGATATTGAAACAATAAGGAGAAATTTCCTCTGGAGAAACACCTTTGATAAGAAAAACATCAATCTCATCAAATGGTCAACGG
TGTTGTCTCCTATCAACAAAGGTGGCCTGGACATTAACAGCGTTCAAAGTACAAATTTCACTCTACTAAGCAAGTGGATCTGGAGATTTTATGAAGAAAAAAATCCTCTG
TGGAAACGCATAATCACTGCTAAATATGAACAATCATACTTGGGTGAACTTCCAATCAAAAGCAAATATAGCAGCTCCAAAGCACCTTGGATGTCTATCATTAAAGGAGC
TGACTGGGTCCTTCCTCAAATCAAATGGTCCATTAAAAGGGGAGACTCCCTCTCGTTCTGGCACAGCCGATGGCATGAACTTAGTCCATTCACACAGACCAACCCAAGAT
TATTTGCTCTTTCTACCAGGAAAGGAGACTCCATTGCAAACATGTGGAATGTTGAAAAAGCCGACTGGGACCTCTACCCTCAAAGACCCTTAAGAAGTGTCGAGGAAGTT
CTTTGGGAAGATATGAAAGTCTCCCTCCCTTCCTTGCCTGATTCAGGATTGGACAATCCTCGTTGGACTTTAAACAACAATGGCAGCTTCACTGTGGCTTCCATCAAACT
TGCAAGACCTCTAAACAACCAAGACGAGACCAACGAAGATGATGGGGAAATCTATAACAACTTGTGGAAATCTACCATCCCCAAAAGATGCAAATTCTTCAGAAGGCTCC
CAACGTGGACTATAAAGCCCTCTTGGTGCATCCTCTGCAAAGCTACTAAGGAAGACAAACAACATCTTTTCACCCATTGCCCCTTCTCAACATTGCTCTGGAAGAAAGTT
GAAGTTATTCTGGACAAACCCCTGCTCTTTCCAAATTCCACTGCCTTATGCAAAGATCTCTTCAAAACAAAAGGTAAAACAAAAAAGCAAACCTTCAATCAACACCTGGT
GGCTGCTACCCTTTGGAACATTTGGAATGAAAGAAACAAAAGAATTTTTAAGGGGGAAGAAAAAAAAGCTGATACCGTATGGGAAGACATACAAGCCATAATGGATTTTT
AG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAACTCGAGCGCATAGACAAACTGGAAGCAGAAAACAATATCACTGAACTACACATCTCTTGCAGAACTTCCATCAAAACTGACTTGAAACAAATGGCCTTGAA
AGAAGCACAAGTTTGGGCCCAAAAATGCAAACGGTTATGGAACATTGAAGGTGACGAAAATTCAGCTTTTTATCATAAAATCAGCTCTGTTAGGCAAAGAAGAAGCTTCA
TATCAAGCATTTCCACCGCTCAAGGTGCTCTGTGTACAACTGATAGGGACATTGAGAAAACCTTCATCGATCATTTTGGGGAAATCTACACGGAAAAGAAAAGAGATCTT
TGGTTCATCGACAGCCTCCCCTGGACTCCTATAAAAGAGACAGACCATGATGACCTGAGCAAAAAAAATTCTGCAAAAGAAATATACAATGCTCTAAAAAACTTCCCCAA
AAATAAATCTCCGGGACCAGATGGCTTTACTATGGAGTTTCTCAAGGGCACTTGGAATTTCACAAAAGAGAATATATTGGAAATCTTCAATGACTTCCATAAAAATTGCA
TAATCAACAGTATTTTGAATTCTACCTTTATTGCTCTCATTGCAAAAAGGGATACATGCATAGCCCCTTCGGATTATAGGCCCATAAGCCTTACAACTGGTTTGTACAAG
CTTATTGCTAAAGTTATCGCCGAAAGACTAAAACAAGTCCTGCCTGATATAATCTCAGAGAACCAATTAGCTTTCGTCAGAGGGAGACAGATTACGGATGCCATTTTGAT
TGCAAATGAAGCGGTGGATTTCTGGAAGCAGAAAAAAACAAAAGGATATGTGCTCAAGCTTGACATCGAAAAGGCCTTTGATAAAATCAATTGGACTATCATCGATTTTA
TGCTGCATAAAAAAGGCTTTCCACAAAAATGGCGTAAATGGATTGAAGCTTGTATTACTAGTGTTCATTACTCCATCCTTATCAACGGAAGACCAAGAGGTAAAATTAAA
CCCACAAGAGGCATCTGGCAAGGTGATCCAATCTCTCCGTTCATTTTTGTTCTCGCGATGGATTATCTCAGCATACTTCTCAATCACTTGGAGAAACAAAACTTGATAAA
AGGAGTTAGTTTCAATGGGAAACATAACCTTACTCATCTCCTCTTTGCTGACGATATCCTTCTTTTTATGGAGGACGATGAAGAAACCATTAATAATATGAGAAATGCCC
TTCGGCTTTTTGAATTGGCCACAGGTCTCAACATCAACCTCAACAAATCCACGATCTCTCCTATCAACATTGATACGCAGAGAACAAATTATGTGGCGGCTAAATGGGGC
TTCTCAGTTAACTTCCTACCTATCCAATATTTAGGAGTGCCTTTGGGTGGTGGCAAGATAACTCTCATTAAAGCTACATTAGCTAGCATTCCTAATTATCACATTTCGGT
TTTCAAGCCCCCCAAATCTGTCTACAAAGATATTGAAACAATAAGGAGAAATTTCCTCTGGAGAAACACCTTTGATAAGAAAAACATCAATCTCATCAAATGGTCAACGG
TGTTGTCTCCTATCAACAAAGGTGGCCTGGACATTAACAGCGTTCAAAGTACAAATTTCACTCTACTAAGCAAGTGGATCTGGAGATTTTATGAAGAAAAAAATCCTCTG
TGGAAACGCATAATCACTGCTAAATATGAACAATCATACTTGGGTGAACTTCCAATCAAAAGCAAATATAGCAGCTCCAAAGCACCTTGGATGTCTATCATTAAAGGAGC
TGACTGGGTCCTTCCTCAAATCAAATGGTCCATTAAAAGGGGAGACTCCCTCTCGTTCTGGCACAGCCGATGGCATGAACTTAGTCCATTCACACAGACCAACCCAAGAT
TATTTGCTCTTTCTACCAGGAAAGGAGACTCCATTGCAAACATGTGGAATGTTGAAAAAGCCGACTGGGACCTCTACCCTCAAAGACCCTTAAGAAGTGTCGAGGAAGTT
CTTTGGGAAGATATGAAAGTCTCCCTCCCTTCCTTGCCTGATTCAGGATTGGACAATCCTCGTTGGACTTTAAACAACAATGGCAGCTTCACTGTGGCTTCCATCAAACT
TGCAAGACCTCTAAACAACCAAGACGAGACCAACGAAGATGATGGGGAAATCTATAACAACTTGTGGAAATCTACCATCCCCAAAAGATGCAAATTCTTCAGAAGGCTCC
CAACGTGGACTATAAAGCCCTCTTGGTGCATCCTCTGCAAAGCTACTAAGGAAGACAAACAACATCTTTTCACCCATTGCCCCTTCTCAACATTGCTCTGGAAGAAAGTT
GAAGTTATTCTGGACAAACCCCTGCTCTTTCCAAATTCCACTGCCTTATGCAAAGATCTCTTCAAAACAAAAGGTAAAACAAAAAAGCAAACCTTCAATCAACACCTGGT
GGCTGCTACCCTTTGGAACATTTGGAATGAAAGAAACAAAAGAATTTTTAAGGGGGAAGAAAAAAAAGCTGATACCGTATGGGAAGACATACAAGCCATAATGGATTTTT
AG
Protein sequenceShow/hide protein sequence
MKELERIDKLEAENNITELHISCRTSIKTDLKQMALKEAQVWAQKCKRLWNIEGDENSAFYHKISSVRQRRSFISSISTAQGALCTTDRDIEKTFIDHFGEIYTEKKRDL
WFIDSLPWTPIKETDHDDLSKKNSAKEIYNALKNFPKNKSPGPDGFTMEFLKGTWNFTKENILEIFNDFHKNCIINSILNSTFIALIAKRDTCIAPSDYRPISLTTGLYK
LIAKVIAERLKQVLPDIISENQLAFVRGRQITDAILIANEAVDFWKQKKTKGYVLKLDIEKAFDKINWTIIDFMLHKKGFPQKWRKWIEACITSVHYSILINGRPRGKIK
PTRGIWQGDPISPFIFVLAMDYLSILLNHLEKQNLIKGVSFNGKHNLTHLLFADDILLFMEDDEETINNMRNALRLFELATGLNINLNKSTISPINIDTQRTNYVAAKWG
FSVNFLPIQYLGVPLGGGKITLIKATLASIPNYHISVFKPPKSVYKDIETIRRNFLWRNTFDKKNINLIKWSTVLSPINKGGLDINSVQSTNFTLLSKWIWRFYEEKNPL
WKRIITAKYEQSYLGELPIKSKYSSSKAPWMSIIKGADWVLPQIKWSIKRGDSLSFWHSRWHELSPFTQTNPRLFALSTRKGDSIANMWNVEKADWDLYPQRPLRSVEEV
LWEDMKVSLPSLPDSGLDNPRWTLNNNGSFTVASIKLARPLNNQDETNEDDGEIYNNLWKSTIPKRCKFFRRLPTWTIKPSWCILCKATKEDKQHLFTHCPFSTLLWKKV
EVILDKPLLFPNSTALCKDLFKTKGKTKKQTFNQHLVAATLWNIWNERNKRIFKGEEKKADTVWEDIQAIMDF