CuGenDBv2

IVF0006653 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0006653
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr03:1658676..1663950
RNA-Seq Expression: IVF0006653
Synteny: IVF0006653
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR025558 - Domain of unknown function DUF4283
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 0.0; % identity: 71.88
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEG EKS WVSFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR++  SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEK LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
        LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF +QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQ
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G+EAISPD L+T S  RKS +P+QP ALKSVIIK  + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------
        K KRKVSFNSP NKT  FNPDSAPANHSP     EKK++VSRER+                                                       
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------

Query:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
             HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+S  TDSSGATTS
Subjt:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNSG--------------LASKGIGALGTS--------------------------ILQNVEQFHHQQSS-----------------------
        TN L +QL S               L S+  G    S                          + +++   HH  SS                       
Subjt:  TNVLINQLNSG--------------LASKGIGALGTS--------------------------ILQNVEQFHHQQSS-----------------------

Query:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK
          SS                L NNR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Subjt:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK

Query:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL
        RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK  SLT AK++I+REVDSIDK ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL
Subjt:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL

Query:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT
        +EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+  FS+IYR STK DPLFI+NL+WNPI++S+WS LCAPF E+EIKGVI S DG K 
Subjt:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT

Query:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ
        PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Subjt:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ

Query:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM
        ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P  WRKWIRGCISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAM
Subjt:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM

Query:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECAS
        DYLSRLLSHLES+GAIKGV L ++CNISHILFADDILLF+EDND+FLNNLRMA+SLFE+AS LKINL KSA+VP+NVS +RA ECAS
Subjt:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECAS

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]; e value: 0.0; % identity: 61.92
Query:  MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPRSSP+ RLSPPIDYHKRSYA+ VTEGR   TSDSSDSY +SDSSHSSGNSFCDSPS DLLENTVV+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNL

Query:  RKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEET
        RKQTEESFTYNAFHAEKALVHF+SNIP NLLCQNKGWTTVGKYSV+FEKWS AYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEET

Query:  RSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQF----FFEGMEAISPDFLSTSSDG
        R++  LV+                                                          +D+F+   E       F+G EAISPDFLSTSS  
Subjt:  RSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQF----FFEGMEAISPDFLSTSSDG

Query:  RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNP
        RKS+TPDQP ALKSVIIK D+ ATSP++LNEEVVNDSNLHATANKS+ EIL GI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSP NKTNIFNP
Subjt:  RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNP

Query:  DSAPANHSPSLSSPEKKQKVSRERN--------------------------------------------------------HHSSDNAEVIDITNTEVVP
        DSAPANHS SLSSPEKKQKVSRER+                                                        HHSS NAEVIDITNTEVVP
Subjt:  DSAPANHSPSLSSPEKKQKVSRERN--------------------------------------------------------HHSSDNAEVIDITNTEVVP

Query:  ETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKA--------FKKQLASW---------------------------LKENGLKIS
        ETPEMKM VNENSNSSSEANYRKPKHVH+R+YYYRKK  K +    +          K +L +W                           L E  LKI+
Subjt:  ETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKA--------FKKQLASW---------------------------LKENGLKIS

Query:  T----------------VTDSSGATTSTNVLINQLNSGLASK--GIGALGTSILQNVEQ----------------------FHHQQ--------------
                         V ++SG++    +L +  +  L S+   I +L  +   N                          H+ Q              
Subjt:  T----------------VTDSSGATTSTNVLINQLNSGLASK--GIGALGTSILQNVEQ----------------------FHHQQ--------------

Query:  ------------SSDRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVP
                    SS  SS                L NNRFTWSNLRNP TFSRIDRFLYNS+WENLFSPHTTRTLPR TSDHFPLVCE+SN KL WGP P
Subjt:  ------------SSDRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVP

Query:  FRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK
        FRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKLHSL +AK++I+REVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLK
Subjt:  FRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK

Query:  ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLED
        ESQF                       C                                    IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE+
Subjt:  ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLED

Query:  EIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPD
        EIKGVINS DGKK P  DGFPISFFK+YW+LLKEDI+DIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLP 
Subjt:  EIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPD

Query:  TISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR
        TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F  LNWDFID+VL KKNFP  WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLR
Subjt:  TISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR

Query:  QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECA
        QGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN+NCNISHILFADDILLF+EDND FLNNL MALSLFE+AS LKINLLKSALVP+NVS+NRAKECA
Subjt:  QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECA

Query:  SIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGS
        S WGI CHSL LSYLGVPLGG+                                                                        NGS GS
Subjt:  SIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGS

Query:  HLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISF
        HLINWTKV KSKEEGGLG SRL VTNKALL+KWLWRY SEPNALWRRLIQCKYKG +PGDIPSN SS +SKAPWRSIIDNIDWFKSNQSW+LNNGDQISF
Subjt:  HLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISF

Query:  WYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        WYSNWS EG LSTAYPRLFALTLDKEISVKDAWNT DN+
Subjt:  WYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 0.0; % identity: 78.3
Query:  SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNT
        + LLL VKRSLS PVLF AFHLPSL F        +FKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSP+DLDWIRCTLKSLIATPNT
Subjt:  SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNT

Query:  NRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTE
        NRFFLETRDSEQ IWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGP+KSGWVSFLSMITPKVEVKAKTRPTFLPR+SPD RLSPPIDYHKRSYA+AVTE
Subjt:  NRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTE

Query:  GR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY
        GR  ATSDSSDSYD+SDSSHSS NSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGW+TVGKY
Subjt:  GR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY

Query:  SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHP
        SV+FEKWS  YHATPKLIPSYGGWTTFRGIPLHLWNM TFQQ+GKAC GLIKVAEETRSAKNL++ARIKVRYNYSGFLPANVRIFDNEGNKF +QVVTHP
Subjt:  SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHP

Query:  EGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSK
        EGKWLIERNVRLHGTFKRQAAA+FD+FNPESEQFFFEG EAISPDFLSTSSDGRKS+TPDQP ALKSVIIK DR AT PSFLNEE+VNDSNLHATANKSK
Subjt:  EGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSK

Query:  SEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN----------------------
         EIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSP NKTNIFNPDSAPANHSPSL+SPEKKQKVSRER+                      
Subjt:  SEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN----------------------

Query:  --------------------------------------HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD
                                              HH+SDNAEV+DITNTEVVPETPEMKM VNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD
Subjt:  --------------------------------------HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD

Query:  PDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGLA--------------------------------------------SKGIGALGTS
        PDS+AFKKQL SWLK+NGLK+ST TDSSGATTSTNVL+NQ+NSGL                                              +G+ +L  +
Subjt:  PDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGLA--------------------------------------------SKGIGALGTS

Query:  ILQNVE----------------------QFHHQQ--------------------------SSDRSS----------------LINNRFTWSNLRNPPTFS
         L N                        + H+ Q                          SS  +S                L NNRFTWSNLRNPPTFS
Subjt:  ILQNVE----------------------QFHHQQ--------------------------SSDRSS----------------LINNRFTWSNLRNPPTFS

Query:  RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEK
        RIDRFLYNS+WENLFSPHTTRTLPRSTSDHFPLVCE+SN KLSWGP+PFRLNSI LSDPEFKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEK
Subjt:  RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEK

Query:  LHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNI
        LHSLT+AK++I+REVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEG IQNTN 
Subjt:  LHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNI

Query:  SISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN
        SISTAFIKFFS+IYRSSTKSDPLFI+NLDWNPI  SEWSHLCAPFLE EIKGVINS DGKKTPG DGFPISFFK++W                       
Subjt:  SISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN

Query:  MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNW
                                                 LKTTLP+TISGNQLAFVKNRQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFD LN 
Subjt:  MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNW

Query:  DFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILL
        DFID VLEKKNFP  WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLN NCNISHILFADDILL
Subjt:  DFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILL

Query:  FIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
        FIEDND FL NLRMALSLFERAS LKINLLKSALVP+NVS+ RAKECAS WGI CHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
Subjt:  FIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG

Query:  RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCK
        RLTLIKSTLSSLPIYQLSVFQAPS+TCKNIEK WRKFLWKGNNGS GSHLINWTKVSKSKEEGGLG SRL+VTNKALL+KWLWRYLSEPNALWRRLIQCK
Subjt:  RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCK

Query:  YKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        YKG +PGDIPSNISS TSKAPWRSIID+ DWFKSNQSW+LNNGDQISFWYSNWS EGRLSTAYPRLFALTLDKEISVKDAWNTFDN+
Subjt:  YKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 0.0; % identity: 69.66
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEG EKS WVSFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR++  SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEK LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
        LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF +QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQ
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G+EAISPD L+T S  RKS +P+QP ALKSVIIK  + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------
        K KRKVSFNSP NKT  FNPDSAPANHSP     EKK++VSRER+                                                       
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------

Query:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
             HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+S  TDSSGATTS
Subjt:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNSG--------------LASKGIGALGTS--------------------------ILQNVEQFHHQQSS-----------------------
        TN L +QL S               L S+  G    S                          + +++   HH  SS                       
Subjt:  TNVLINQLNSG--------------LASKGIGALGTS--------------------------ILQNVEQFHHQQSS-----------------------

Query:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK
          SS                L NNR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Subjt:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK

Query:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL
        RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK  SLT AK++I+REVDSIDK ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL
Subjt:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL

Query:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT
        +EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+  FS+IYR STK DPLFI+NL+WNPI++S+WS LCAPF E+EIKGVI S DG K 
Subjt:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT

Query:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ
        PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Subjt:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ

Query:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM
        ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P  WRKWIRGCISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAM
Subjt:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM

Query:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSY
        DYLSRLLSHLES+GAIKG                                                                       GI CH+LPL+Y
Subjt:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSY

Query:  LGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE
        LGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIY+LSVFQAPS T KNIEK WR FLWKG+ G  GSHLINW+ V+K KEE
Subjt:  LGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE

Query:  GGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTA
        GGLG SRL VTN+ALL+KWLWRY SEPN+LWRRLI  KYKG +PGD+PSNISS +SKAPWRSII+NIDWFKSNQ W+LNNGDQISFWYSNWS EG LSTA
Subjt:  GGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTA

Query:  YPRLFALTLDKEISVKDAWNTFDNR
        YPRLFAL++DKE S+KD WN+ +N+
Subjt:  YPRLFALTLDKEISVKDAWNTFDNR

TYK05808.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 0.0; % identity: 64.22
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEGPEKSG VSFLSMITPKVEVKAKTRPTFLPRSSP+ RLSPPIDYHKRSY +AV++GR++  SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+                                 AL+HF+SN+PANLLCQNKGWTTV KY V+                              
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
                                   K+L                                                                      
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
          F+G+EAISPD L+T S  RKSN+ +QP ALKSVIIK  R ATSP+ LNEEVVND++LHAT  KS+ +IL GISNDG LDKGKQKVDI  Q  SA   D
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------
        K KRKVSFNSP NKT  FN DSAP NHSP LSSPEKKQ+VSRER+                                                       
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN-------------------------------------------------------

Query:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
             HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+ST TDSSGATTS
Subjt:  -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNSGLASKGIGALGTS--ILQNVEQFHHQ------------------QSSDRSS----------------LINNRFTWSNLRNPPTFSRIDRF
        TN L +QL S ++     A+ +S  IL   +  HH                    SS  SS                L NNR+TWSNLRNPPTFSR+DRF
Subjt:  TNVLINQLNSGLASKGIGALGTS--ILQNVEQFHHQ------------------QSSDRSS----------------LINNRFTWSNLRNPPTFSRIDRF

Query:  LYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT
        LYNS WE LF+PH TRTL R TSDHFPLVCE+S + L WGP PFRLNSIAL+DP+FKRNM RWWE S+Q+GHPGFSFI+RLKSLAN IKPWQKEK HSLT
Subjt:  LYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT

Query:  HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTA
         AK++I+REVDSIDK ELDTPL+QEESNRRLALKA+LS+LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS A
Subjt:  HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTA

Query:  FIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTY
        F+  FS IYR STK DPLFI+NL+WNPI++S+WS LCAPFLE+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KG          
Subjt:  FIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTY

Query:  IALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF
                                   IIAKTLSNRLK TLPDTISGNQLAF+KNRQITDAIL ANEA+D+WKVKKIK FILKLDIEKAFD LNWDFIDF
Subjt:  IALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF

Query:  VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDN
        VL+KKN+P  WRKWIRGCISNVTYS+IVN +PQ RIKANRGLRQGDPLSPFLFV AMDYLSRLLSHLESSGAIKGV L ++CNISHILFADDILLF+EDN
Subjt:  VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDN

Query:  DYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLI
        D+FLNNLRMALSLFE+AS LKINL KSA+VP+NVS +RA ECAS WGI CH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQKKLNNWKYA ISKGGRLTLI
Subjt:  DYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLI

Query:  KSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY
        KSTLSSL IYQLSVFQAP  T KNIEK WR FLWKG+ G  GSHLINW+ V+K KEEGGLG SRL V N+ALL+KWLWRY SEPN+LWRRLI  KYKG +
Subjt:  KSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY

Query:  PGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        PGDIPSNISS +SKAPW+SII+NIDWFKSNQ W+LNN DQISFWYSNWS EG LSTAYPRLFAL++DK+ S+KD WN+ +N+
Subjt:  PGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein; e value: 0.0e+00; % identity: 71.88
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEG EKS WVSFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR+  +SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEK LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
        LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF +QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQ
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G+EAISPD L+T S  RKS +P+QP ALKSVIIK  + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------
        K KRKVSFNSP NKT  FNPDSAPANH     SPEKK++VSRER                                                        
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------

Query:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
            +HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+S  TDSSGATTS
Subjt:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNS--------------GLASKGIGALGTS--------------------------ILQNVEQFHHQQS-----------------------S
        TN L +QL S               L S+  G    S                          + +++   HH  S                       S
Subjt:  TNVLINQLNS--------------GLASKGIGALGTS--------------------------ILQNVEQFHHQQS-----------------------S

Query:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK
          SS                L NNR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Subjt:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK

Query:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL
        RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK  SLT AK++I+REVDSIDK ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL
Subjt:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL

Query:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT
        +EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+  FS+IYR STK DPLFI+NL+WNPI++S+WS LCAPF E+EIKGVI S DG K 
Subjt:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT

Query:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ
        PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Subjt:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ

Query:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM
        ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P  WRKWIRGCISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAM
Subjt:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM

Query:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECAS
        DYLSRLLSHLES+GAIKGV L ++CNISHILFADDILLF+EDND+FLNNLRMA+SLFE+AS LKINL KSA+VP+NVS +RA ECAS
Subjt:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECAS

A0A5A7UV84 Reverse transcriptase domain-containing protein; e value: 0.0e+00; % identity: 61.92
Query:  MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNL
        MITPKVEVK KTRPTFLPRSSP+ RLSPPIDYHKRSYA+ VTEGR   TSDSSDSY +SDSSHSSGNSFCDSPS DLLENTVV+VRRFFHDDW KILQNL
Subjt:  MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNL

Query:  RKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEET
        RKQTEESFTYNAFHAEKALVHF+SNIP NLLCQNKGWTTVGKYSV+FEKWS AYHATPKLIPSYGGWTTF+                             
Subjt:  RKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEET

Query:  RSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ----FFFEGMEAISPDFLSTSSDG
        R++  LV+                                                          +D+F+   E       F+G EAISPDFLSTSS  
Subjt:  RSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ----FFFEGMEAISPDFLSTSSDG

Query:  RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNP
        RKS+TPDQP ALKSVIIK D+ ATSP++LNEEVVNDSNLHATANKS+ EIL GI NDGVLDKGKQKVDIQL PNSALNL+K KRKVSFNSP NKTNIFNP
Subjt:  RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNP

Query:  DSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------NHHSSDNAEVIDITNTEVVP
        DSAPANHS SLSSPEKKQKVSRER                                                        +HHSS NAEVIDITNTEVVP
Subjt:  DSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------NHHSSDNAEVIDITNTEVVP

Query:  ETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKA--------FKKQLASW---------------------------LKENGLKIS
        ETPEMKM VNENSNSSSEANYRKPKHVH+R+YYYRKK  K +    +          K +L +W                           L E  LKI+
Subjt:  ETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKA--------FKKQLASW---------------------------LKENGLKIS

Query:  T----------------VTDSSGATTSTNVLINQLNSGLAS--KGIGALGTSILQNVE----------------------QFHHQQ--------------
                         V ++SG++    +L +  +  L S  + I +L  +   N                          H+ Q              
Subjt:  T----------------VTDSSGATTSTNVLINQLNSGLAS--KGIGALGTSILQNVE----------------------QFHHQQ--------------

Query:  ------------SSDRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVP
                    SS  SS                L NNRFTWSNLRNP TFSRIDRFLYNS+WENLFSPHTTRTLPR TSDHFPLVCE+SN KL WGP P
Subjt:  ------------SSDRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVP

Query:  FRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK
        FRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKLHSL +AK++I+REVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLK
Subjt:  FRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK

Query:  ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLED
        ESQF                       C                                    IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE+
Subjt:  ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLED

Query:  EIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPD
        EIKGVINS DGKK P  DGFPISFFK+YW+LLKEDI+DIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLP 
Subjt:  EIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPD

Query:  TISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR
        TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F  LNWDFID+VL KKNFP  WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLR
Subjt:  TISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR

Query:  QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECA
        QGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN+NCNISHILFADDILLF+EDND FLNNL MALSLFE+AS LKINLLKSALVP+NVS+NRAKECA
Subjt:  QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECA

Query:  SIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGS
        S WGI CHSL LSYLGVPLG                                                                        G+NGS GS
Subjt:  SIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGS

Query:  HLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISF
        HLINWTKV KSKEEGGLG SRL VTNKALL+KWLWRY SEPNALWRRLIQCKYKG +PGDIPSN SS +SKAPWRSIIDNIDWFKSNQSW+LNNGDQISF
Subjt:  HLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISF

Query:  WYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        WYSNWS EG LSTAYPRLFALTLDKEISVKDAWNT DN+
Subjt:  WYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein | 0.0e+00 | 69.66
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEG EKS WVSFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR+  +SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEK LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
        LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF +QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQ
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
        F F+G+EAISPD L+T S  RKS +P+QP ALKSVIIK  + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG LDKGKQKVDI  Q  SA    
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------
        K KRKVSFNSP NKT  FNPDSAPANH     SPEKK++VSRER                                                        
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------

Query:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
            +HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+S  TDSSGATTS
Subjt:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNS--------------GLASKGIGALGTS--------------------------ILQNVEQFHHQQS-----------------------S
        TN L +QL S               L S+  G    S                          + +++   HH  S                       S
Subjt:  TNVLINQLNS--------------GLASKGIGALGTS--------------------------ILQNVEQFHHQQS-----------------------S

Query:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK
          SS                L NNR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Subjt:  DRSS----------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK

Query:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL
        RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK  SLT AK++I+REVDSIDK ELDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL
Subjt:  RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWL

Query:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT
        +EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+  FS+IYR STK DPLFI+NL+WNPI++S+WS LCAPF E+EIKGVI S DG K 
Subjt:  REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKT

Query:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ
        PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Subjt:  PGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ

Query:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM
        ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P  WRKWIRGCISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAM
Subjt:  ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAM

Query:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSY
        DYLSRLLSHLES+GAIKG                                                                       GI CH+LPL+Y
Subjt:  DYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSY

Query:  LGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE
        LGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIY+LSVFQAPS T KNIEK WR FLWKG+ G  GSHLINW+ V+K KEE
Subjt:  LGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE

Query:  GGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTA
        GGLG SRL VTN+ALL+KWLWRY SEPN+LWRRLI  KYKG +PGD+PSNISS +SKAPWRSII+NIDWFKSNQ W+LNNGDQISFWYSNWS EG LSTA
Subjt:  GGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTA

Query:  YPRLFALTLDKEISVKDAWNTFDNR
        YPRLFAL++DKE S+KD WN+ +N+
Subjt:  YPRLFALTLDKEISVKDAWNTFDNR

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 0.0e+00 | 78.3
Query:  SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNT
        + LLL VKRSLS PVLF AFHLPSL F        +FKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSP+DLDWIRCTLKSLIATPNT
Subjt:  SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNT

Query:  NRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTE
        NRFFLETRDSEQ IWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGP+KSGWVSFLSMITPKVEVKAKTRPTFLPR+SPD RLSPPIDYHKRSYA+AVTE
Subjt:  NRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTE

Query:  GR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY
        GR  ATSDSSDSYD+SDSSHSS NSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGW+TVGKY
Subjt:  GR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY

Query:  SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHP
        SV+FEKWS  YHATPKLIPSYGGWTTFRGIPLHLWNM TFQQ+GKAC GLIKVAEETRSAKNL++ARIKVRYNYSGFLPANVRIFDNEGNKF +QVVTHP
Subjt:  SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHP

Query:  EGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSK
        EGKWLIERNVRLHGTFKRQAAA+FD+FNPESEQFFFEG EAISPDFLSTSSDGRKS+TPDQP ALKSVIIK DR AT PSFLNEE+VNDSNLHATANKSK
Subjt:  EGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSK

Query:  SEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER-----------------------
         EIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSP NKTNIFNPDSAPANHSPSL+SPEKKQKVSRER                       
Subjt:  SEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER-----------------------

Query:  -------------------------------------NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD
                                             +HH+SDNAEV+DITNTEVVPETPEMKM VNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD
Subjt:  -------------------------------------NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKD

Query:  PDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGL--------------------------------------------ASKGIGALGTS
        PDS+AFKKQL SWLK+NGLK+ST TDSSGATTSTNVL+NQ+NSGL                                              +G+ +L  +
Subjt:  PDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGL--------------------------------------------ASKGIGALGTS

Query:  ILQNVE----------------------QFHHQQ--------------------------SSDRSS----------------LINNRFTWSNLRNPPTFS
         L N                        + H+ Q                          SS  +S                L NNRFTWSNLRNPPTFS
Subjt:  ILQNVE----------------------QFHHQQ--------------------------SSDRSS----------------LINNRFTWSNLRNPPTFS

Query:  RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEK
        RIDRFLYNS+WENLFSPHTTRTLPRSTSDHFPLVCE+SN KLSWGP+PFRLNSI LSDPEFKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEK
Subjt:  RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEK

Query:  LHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNI
        LHSLT+AK++I+REVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEG IQNTN 
Subjt:  LHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNI

Query:  SISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN
        SISTAFIKFFS+IYRSSTKSDPLFI+NLDWNPI  SEWSHLCAPFLE EIKGVINS DGKKTPG DGFPISFFK++W                       
Subjt:  SISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN

Query:  MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNW
                                                 LKTTLP+TISGNQLAFVKNRQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFD LN 
Subjt:  MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNW

Query:  DFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILL
        DFID VLEKKNFP  WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLN NCNISHILFADDILL
Subjt:  DFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILL

Query:  FIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
        FIEDND FL NLRMALSLFERAS LKINLLKSALVP+NVS+ RAKECAS WGI CHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
Subjt:  FIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG

Query:  RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCK
        RLTLIKSTLSSLPIYQLSVFQAPS+TCKNIEK WRKFLWKGNNGS GSHLINWTKVSKSKEEGGLG SRL+VTNKALL+KWLWRYLSEPNALWRRLIQCK
Subjt:  RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCK

Query:  YKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        YKG +PGDIPSNISS TSKAPWRSIID+ DWFKSNQSW+LNNGDQISFWYSNWS EGRLSTAYPRLFALTLDKEISVKDAWNTFDN+
Subjt:  YKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

A0A5D3C3M3 LINE-1 retrotransposable element ORF2 protein | 0.0e+00 | 64.1
Query:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD
        MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKSLI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD
Subjt:  MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVD

Query:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD
         KNRKSCILVPEGPEKSG VSFLSMITPKVEVKAKTRPTFLPRSSP+ RLSPPIDYHKRSY +AV++GR+  +SDSSDSY +SDSS SSGNS CDSP   
Subjt:  QKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSYDTSDSSHSSGNSFCDSPSSD

Query:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH
        LLENTVV+                                 AL+HF+SN+PANLLCQNKGWTTV KY V+                              
Subjt:  LLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH

Query:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ
                                                                                                           +
Subjt:  LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ

Query:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD
          F+G+EAISPD L+T S  RKSN+ +QP ALKSVIIK  R ATSP+ LNEEVVND++LHAT  KS+ +IL GISNDG LDKGKQKVDI  Q  SA   D
Subjt:  FFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLD

Query:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------
        K KRKVSFNSP NKT  FN DSAP NHSP LSSPEKKQ+VSRER                                                        
Subjt:  KSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER--------------------------------------------------------

Query:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS
            +HHSSDNAEVIDITNTEVVPETPE+KM   E SNSS E NYRK KH H+R++YYRKKE+KEKD +S+AFK QL +WLKENGLK+ST TDSSGATTS
Subjt:  ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTS

Query:  TNVLINQLNSGLASKGIGALGTS--ILQNVEQFHHQ------------------QSSDRSS----------------LINNRFTWSNLRNPPTFSRIDRF
        TN L +QL S ++     A+ +S  IL   +  HH                    SS  SS                L NNR+TWSNLRNPPTFSR+DRF
Subjt:  TNVLINQLNSGLASKGIGALGTS--ILQNVEQFHHQ------------------QSSDRSS----------------LINNRFTWSNLRNPPTFSRIDRF

Query:  LYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT
        LYNS WE LF+PH TRTL R TSDHFPLVCE+S + L WGP PFRLNSIAL+DP+FKRNM RWWE S+Q+GHPGFSFI+RLKSLAN IKPWQKEK HSLT
Subjt:  LYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT

Query:  HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTA
         AK++I+REVDSIDK ELDTPL+QEESNRRLALKA+LS+LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS A
Subjt:  HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTA

Query:  FIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTY
        F+  FS IYR STK DPLFI+NL+WNPI++S+WS LCAPFLE+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KG          
Subjt:  FIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTY

Query:  IALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF
                                   IIAKTLSNRLK TLPDTISGNQLAF+KNRQITDAIL ANEA+D+WKVKKIK FILKLDIEKAFD LNWDFIDF
Subjt:  IALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF

Query:  VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDN
        VL+KKN+P  WRKWIRGCISNVTYS+IVN +PQ RIKANRGLRQGDPLSPFLFV AMDYLSRLLSHLESSGAIKGV L ++CNISHILFADDILLF+EDN
Subjt:  VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDN

Query:  DYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLI
        D+FLNNLRMALSLFE+AS LKINL KSA+VP+NVS +RA ECAS WGI CH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQKKLNNWKYA ISKGGRLTLI
Subjt:  DYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLI

Query:  KSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY
        KSTLSSL IYQLSVFQAP  T KNIEK WR FLWKG+ G  GSHLINW+ V+K KEEGGLG SRL V N+ALL+KWLWRY SEPN+LWRRLI  KYKG +
Subjt:  KSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY

Query:  PGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR
        PGDIPSNISS +SKAPW+SII+NIDWFKSNQ W+LNN DQISFWYSNWS EG LSTAYPRLFAL++DK+ S+KD WN+ +N+
Subjt:  PGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 1.4e-48 | 25.86
Query:  TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWG-PVPFRLNSIALSD----PEFKRNMGRWWE------NSIQDGHPGFSFIQRL
        T+S+ID  + +     L     T  +    SDH  +  E     L+      ++LN++ L+D     E K  +  ++E       + Q+    F  + R 
Subjt:  TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWG-PVPFRLNSIALSD----PEFKRNMGRWWE------NSIQDGHPGFSFIQRL

Query:  KSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK---ESQFW-YQRAKKLWLREGDENSSFFHRICSSRQKR
        K +A  +  +++++  S      S L+E++  ++        QE +  R  LK   ++ +L+   ES+ W ++R  K+             R+   ++++
Subjt:  KSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK---ESQFW-YQRAKKLWLREGDENSSFFHRICSSRQKR

Query:  SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRS---STKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYL
        + I  I++++G I      I T   +++  +Y +   + +    F+D      +   E   L  P    EI  +INSL  KK+PG DGF   F++ Y   
Subjt:  SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRS---STKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYL

Query:  LKEDILDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWK
        L   +L +F+    +G++  +     I LIPK  +D +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   +    
Subjt:  LKEDILDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWK

Query:  VKKIKG-FILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGA
          K K   I+ +D EKAFDK+   F+   L K     ++ K IR      T ++I+NG+         G RQG PLSP LF I ++ L+R    +     
Subjt:  VKKIKG-FILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGA

Query:  IKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIP--CHSLPLSYLGVPLGGNPKSNL
        IKG+ L     +   LFADD+++++E+      NL   +S F + S  KIN+ KS     N   NR  E   +  +P    S  + YLG+ L  + K +L
Subjt:  IKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIP--CHSLPLSYLGVPLGGNPKSNL

Query:  FWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRL
        F  N +    +I++  N WK    S  GR+ ++K  +    IY+ +    + P      +EK+  KF+W      +   ++     S+  + GG+     
Subjt:  FWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRL

Query:  HVTNKALLTK--WLWRYLSEPNALWRR
         +  KA +TK  W W Y +     W R
Subjt:  HVTNKALLTK--WLWRYLSEPNALWRR

P08548 LINE-1 reverse transcriptase homolog | 3.7e-46 | 25.75
Query:  TVTDSSGATTSTNVLINQLNSGLA---SKGIGALGTSILQNVEQFHHQQSSDRSSLIN---NRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLP
        T+TD S   +ST++++   N+ LA         L   IL       H   +D     +     +T+ +  +  T+S+ID  L + +  NL        +P
Subjt:  TVTDSSGATTSTNVLINQLNSGLA---SKGIGALGTSILQNVEQFHHQQSSDRSSLIN---NRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLP

Query:  RSTSDHFPLVCE-NSNTKLSWGPVPFRLNSIALSD----PEFKRNMGRWWE-NSIQDGH-----PGFSFIQRLK--SLANFIKPWQKEKLHSLTHAKDSI
           SDH  +  E N+N  L      ++LN++ L D     E K+ + ++ E N+ QD +          + R K  +L  F+K  ++E++++L       
Subjt:  RSTSDHFPLVCE-NSNTKLSWGPVPFRLNSIALSD----PEFKRNMGRWWE-NSIQDGH-----PGFSFIQRLK--SLANFIKPWQKEKLHSLTHAKDSI

Query:  LREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKF
        +  +  ++K+E   P   + S R+    ++A+L+E+  K       ++K  +  + ++       +   ++ +S I  I++    I      I     ++
Subjt:  LREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKF

Query:  FSKIYR---SSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYI
        + K+Y     + K    +++      +   E   L  P    EI   I +L  KK+PG DGF   F++T+   L   +L++F++   +G++        I
Subjt:  FSKIYR---SSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYI

Query:  ALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMA-NEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFID
         LIPK  KD +  +++RPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  + N      K+K     IL +D EKAFD +   F+ 
Subjt:  ALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMA-NEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFID

Query:  FVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIED
          L+K      + K I    S  T ++I+NG          G RQG PLSP LF I M+ L+     +    AIKG+ + S   I   LFADD+++++E+
Subjt:  FVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIED

Query:  NDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLP--LSYLGVPLGGNPKSNLFWRNVE---DKIQKKLNNWKYAQISKG
               L   +  +   S  KIN  KS  V    + N   E      IP   +P  + YLGV L  + K +L+  N E    +I + +N WK    S  
Subjt:  NDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLP--LSYLGVPLGGNPKSNLFWRNVE---DKIQKKLNNWKYAQISKG

Query:  GRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEG-GLGTSRLHVTNKALLTKWLWRYLSEPNALWRRL
        GR+ ++K ++    IY  +    +AP    K++EK    F+W      +   L++    +K+K  G  L   RL+  +  + T W W    E + +W R+
Subjt:  GRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEG-GLGTSRLHVTNKALLTKWLWRYLSEPNALWRRL

P0C2F6 Putative ribonuclease H protein At1g65750 | 4.8e-22 | 32.95
Query:  DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKW
        +++  +++ W+   +S  GRLTL K+ LSS+P++ +S    P      +++  R FLW         HL+ W+KV   K+EGGLG       N+AL++K 
Subjt:  DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKW

Query:  LWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNI-DWFKSNQSWELNNGDQISFWYSNW
         WR L E N+LW  ++Q KY      D    I   +  + WRSI   + D       W   +G QI FW   W
Subjt:  LWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNI-DWFKSNQSWELNNGDQISFWYSNW

P11369 LINE-1 retrotransposable element ORF2 protein | 2.7e-49 | 24.47
Query:  TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPL-VCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPW
        TFS+ID  + + T  N +       +P   SDH  L +  N+N         ++LN+  L+D   K  + +  ++ ++      +      +L + +K +
Subjt:  TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPL-VCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPW

Query:  QKEKLHSLTHAK--------DSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEI
         + KL +L+ +K         S+   + +++KKE ++P  +      + L+ +++++  + +     + +  +  + ++      R+    + +  I++I
Subjt:  QKEKLHSLTHAK--------DSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEI

Query:  QDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPL-----FIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKED
        ++E+G I      I      F+ ++Y  STK + L     F+D      +   +  HL +P    EI+ VINSL  KK+PG DGF   F++T+    KED
Subjt:  QDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPL-----FIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKED

Query:  ILDIFKDFYDKGVINKNMNNTY----IALIPK-KKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFW-
        ++ I    + K  +   + N++    I LIPK +KD +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   + +  
Subjt:  ILDIFKDFYDKGVINKNMNNTY----IALIPK-KKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFW-

Query:  KVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGA
        K+K     I+ LD EKAFDK+   F+  VLE+      +   I+   S    ++ VNG     I    G RQG PLSP+LF I ++ L+R +   +    
Subjt:  KVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGA

Query:  IKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKS--NL
        IKG+ +     +   L ADD++++I D       L   ++ F      KIN  KS       +    KE          +  + YLGV L    K   + 
Subjt:  IKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKS--NL

Query:  FWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVT
         +++++ +I++ L  WK    S  GR+ ++K  +    IY+ +    + P+     +E +  KF+W      +   L+   + S     GG+    L + 
Subjt:  FWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVT

Query:  NKALL--TKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDW
         +A++  T W W Y       W R+   +   +  G +  +  + T +    SI +N  W
Subjt:  NKALL--TKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDW

P14381 Transposon TX1 uncharacterized 149 kDa protein | 2.2e-35 | 23.78
Query:  SRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWE--NSIQDGHPGFSFIQRLKSLAN-FIKPW
        SRIDR   +S   +     T R  P   SDH  +    S          +  N+  L D  F +++   W    + QD    F+ + +   +    +K  
Subjt:  SRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWE--NSIQDGHPGFSFIQRLKSLAN-FIKPW

Query:  QKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRR------LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQD
         +E   S++  +++   E+++++ + LD       S  +      L  K  L  +  ++++  + R++   L + D  S FF+ +   +  R  I  +  
Subjt:  QKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRR------LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQD

Query:  EEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNP-IEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFK
        E+G       +I      F+  ++     S     +  D  P +       L  P   DE+   +  +   K+PGLDG  I FF+ +W  L  D   +  
Subjt:  EEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNP-IEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFK

Query:  DFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKL
        + + KG +  +     ++L+PKK D    K++RP+SL ++ YKI+AK +S RLK+ L + I  +Q   V  R I D + +  + + F +   +    L L
Subjt:  DFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKL

Query:  DIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNI
        D EKAFD+++  ++   L+  +F   +  +++   ++    V +N      +   RG+RQG PLS  L+ +A++    LL    +   +K      +  +
Subjt:  DIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNI

Query:  SHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSA-LVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQK
            +ADD++L  +D    L   +    ++  ASS +IN  KS+ L+  ++ V+      +   I   S  + YLGV L     P S  F   +E+ +  
Subjt:  SHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSA-LVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQK

Query:  KLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWR
        +L  WK +A++ S  GR  +I   ++S   Y+L            I++    FLW      +G H ++    S   +EGG G   +         + + R
Subjt:  KLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWR

Query:  YL-SEPNALWRRLIQCKYK
        YL ++P+  W  L    Y+
Subjt:  YL-SEPNALWRRLIQCKYK

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.6e-25 | 26.1
Query:  LGTSI-LQNVEQFHH-QQSSDRSSLINN--RFTWSNLRNP-PTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFP--LVCENSNTKLSWGPVPFRLNS
        L TSI ++ +E+F +  + SD   + +    +TWSN ++  P   ++DR + N  W + F            SDH P  ++ EN   +       FR  S
Subjt:  LGTSI-LQNVEQFHH-QQSSDRSSLINN--RFTWSNLRNP-PTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFP--LVCENSNTKLSWGPVPFRLNS

Query:  IALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFW
           + P F  ++   WE  I  G   FS  + LK+     K   ++   ++ H     L  ++SI  + L  P         +A K      +  ES F+
Subjt:  IALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFW

Query:  YQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNL----DWNPIEHSEW--SHLCAPFLE
         Q+++  WL++GD N+ FFH++  + Q ++ I  ++ ++ +       +    + +++ +  S   SD L  D++    D +P   ++   S L A   +
Subjt:  YQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNL----DWNPIEHSEW--SHLCAPFLE

Query:  DEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKII
         EI   + ++   K PG D F   FF   W+++K+  +   K+F+  G + K  N T I LIPK         FRP+S  T +YKII
Subjt:  DEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 5.3e-16 | 26.42
Query:  SLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKV
        +LP+ YLG+PL     +   +  + +KI+ ++  W    +S  GRL LI S + SL  + +S F+ PS   K I+     FLW G   +     + W+ V
Subjt:  SLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKV

Query:  SKSKEEGGLGTSRLHVTNK---------ALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQIS
           K+EGGLG   L   NK           L  W+W+ + +  AL    ++                                        +++NG   S
Subjt:  SKSKEEGGLGTSRLHVTNK---------ALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQIS

Query:  FWYSNWSLEGRL
        FW+ NWS  GRL
Subjt:  FWYSNWSLEGRL

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.0e-11 | 29.01
Query:  SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKY-----KGNY
        +LP Y ++ F  P   CK I      F W+    + G H   W  +S  K EGG+G   +   N ALL K +WR LS P +L  ++ + +Y       N 
Subjt:  SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKY-----KGNY

Query:  P-GDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRL
        P G  PS +        W+SI  + +  +      + NG+ I  W   W L+ + ++A  R+
Subjt:  P-GDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.5e-10 | 28.38
Query:  SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKE-EGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDI
        +LP+Y +S F+   + CK +  +  +F W           + W K+ KSKE +GGLG   L   N+ALL K  +R + +P+ L  RL++ +Y   +P   
Subjt:  SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKE-EGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDI

Query:  PSNISSITSKA-PWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLE
            S  T  +  WRSII   +         + +G     W   W ++
Subjt:  PSNISSITSKA-PWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 4.7e-12 | 47.76
Query:  IVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNC-NISHILFADD
        I+NG PQG +  +RGLRQGDPLSP+LF++  + LS L    +  G + G+ +++N   I+H+LFADD
Subjt:  IVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNC-NISHILFADD


Sequences
CDS sequence
ATGGCGGCTCCTTTCATTATACTAAATTCACCTGAATTCACCCACTGTATTAACTACAAATCCAGCCTTTTACTTGCTGTTAAAAGGTCACTTTCATCTCCAGTTCTTTT
CACTGCTTTTCACCTTCCTTCCTTGGTTTTTCCTACAATCTCCCAAACAATGGCCTACTTCAAATCACTTCCTAGATCATGTAAGGTTGAAAGAAAGGAATTTGTCCTTC
ACCTTGACAAATATTCAAAGCACACTCATTATTGGTTAACAGAAACTGGAGCTCATAAGGCTTTCTCCATTGAAGTTTCCCCAAAAGACTTGGACTGGATAAGATGTACT
CTGAAATCCTTGATCGCAACCCCAAATACAAACCGATTCTTCCTTGAGACCCGTGACTCTGAGCAATGCATCTGGATCAGGAAAACAAGAAATAGTAAAGGATGTACTGC
AGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATCCTAGTTCCGGAAGGACCTGAGAAAAGTGGCTGGGTCTCCTTCCTATCCATGATTACCCCAAAAGTAG
AAGTGAAAGCAAAAACAAGACCAACTTTTTTGCCAAGGTCCAGTCCTGACGGTCGTCTTTCTCCTCCCATTGACTACCACAAGCGATCATACGCAAGAGCTGTCACAGAA
GGAAGAGCTACAAGTGACTCAAGTGACTCTTATGATACAAGCGATTCGAGTCATTCATCAGGTAATAGTTTTTGTGACTCTCCCTCATCTGATCTTCTTGAAAATACAGT
GGTGATAGTTAGACGATTCTTTCATGATGACTGGCACAAAATCCTTCAAAACCTGAGAAAACAAACAGAGGAATCTTTCACTTATAATGCTTTCCATGCTGAAAAAGCTT
TAGTTCATTTTAGCTCAAACATACCTGCAAACCTTCTCTGCCAGAACAAAGGATGGACCACGGTAGGAAAGTACTCGGTAAAATTTGAAAAATGGTCCTCTGCGTATCAC
GCCACTCCAAAACTTATTCCTAGTTATGGAGGATGGACAACTTTCAGAGGAATTCCACTACACTTGTGGAATATGACGACTTTTCAACAACTTGGAAAAGCCTGCGGAGG
TTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTGGTAAAAGCAAGGATAAAGGTTAGATACAATTACTCAGGCTTCTTACCAGCAAATGTAAGGATTTTTG
ATAATGAGGGAAACAAATTTTCCATCCAAGTAGTTACTCACCCAGAAGGCAAATGGTTAATAGAAAGGAATGTCAGATTACATGGCACCTTCAAGAGACAAGCTGCAGCC
GCCTTTGATGAATTCAATCCTGAATCAGAGCAATTCTTCTTCGAAGGAATGGAGGCCATATCGCCGGACTTCCTTTCCACTAGCTCCGACGGCCGTAAAAGCAACACACC
GGATCAGCCACCTGCATTAAAATCTGTTATCATTAAATCTGACAGAGTTGCCACGTCGCCAAGCTTCTTAAATGAAGAGGTAGTTAATGATAGTAATTTGCATGCAACGG
CTAACAAATCCAAATCAGAGATATTACCTGGGATATCAAATGATGGCGTGTTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCCAATTCAGCATTAAATTTG
GATAAATCCAAAAGGAAAGTCTCCTTCAACTCTCCCTGTAATAAAACCAACATCTTCAATCCGGATTCTGCTCCAGCCAATCATTCTCCATCATTAAGTTCCCCTGAAAA
AAAACAGAAAGTAAGTAGAGAGAGAAACCACCACAGCTCTGATAATGCAGAAGTTATTGATATAACAAACACTGAAGTGGTTCCTGAGACACCTGAAATGAAAATGCAAG
TTAATGAGAATTCAAATTCTTCTTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATACTACTACAGGAAAAAAGAAGAAAAGGAGAAGGATCCGGAC
TCAAAGGCCTTCAAAAAACAACTTGCTTCCTGGTTGAAGGAAAATGGTCTGAAAATCTCTACGGTCACTGACTCTTCAGGGGCAACTACTTCAACAAATGTTTTGATAAA
TCAATTGAATTCTGGGTTAGCTTCAAAGGGGATAGGGGCTTTGGGGACATCTATTCTCCAGAATGTTGAACAATTTCATCACCAACAATCTTCTGATAGATCCTCCCTCA
TAAACAATAGATTCACTTGGTCAAACTTACGGAATCCTCCTACTTTTTCCAGAATTGATAGATTCCTTTACAACTCAACTTGGGAAAATCTCTTCAGTCCCCACACAACA
AGGACCCTTCCTAGATCTACTTCAGATCACTTTCCTCTGGTCTGTGAAAACTCCAACACCAAGCTTAGTTGGGGTCCTGTCCCATTCCGTTTAAACTCCATAGCTCTCAG
TGACCCAGAATTCAAAAGAAACATGGGAAGATGGTGGGAAAACTCGATCCAAGATGGTCATCCAGGTTTCTCCTTCATCCAAAGGCTAAAGTCTTTAGCAAATTTTATCA
AACCTTGGCAAAAGGAGAAATTACACTCTCTCACCCATGCTAAAGATAGCATTTTAAGGGAAGTGGACTCTATTGACAAAAAGGAATTGGATACTCCTTTGACTCAAGAG
GAAAGTAATCGTCGTCTAGCCCTAAAAGCTGATCTCAGCGAGCTATCTCTCAAGGAGTCCCAATTCTGGTACCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGA
AAACTCCTCCTTCTTTCATAGAATTTGCTCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGTTTGATTCAGAATACAAACATCAGTATATCAA
CTGCTTTTATAAAATTCTTTTCAAAGATTTATAGAAGCTCTACAAAAAGTGATCCTCTTTTTATAGATAATCTAGATTGGAATCCGATAGAGCATTCTGAGTGGTCGCAC
CTTTGTGCCCCTTTTTTGGAAGATGAGATTAAAGGGGTTATAAACTCTTTAGATGGAAAAAAGACTCCTGGTCTAGACGGCTTCCCTATCTCCTTCTTTAAAACTTACTG
GTATCTTCTAAAAGAGGATATCTTGGACATATTCAAGGATTTTTATGACAAAGGTGTTATCAACAAGAATATGAATAACACCTACATTGCTTTGATCCCAAAAAAGAAGG
ATTATTCTCATCCCAAAGACTTCAGACCAATCAGCCTAACAACGTCCATCTATAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCACCCTTCCTGACACCATC
TCAGGAAACCAGCTAGCTTTTGTCAAGAATCGCCAAATTACTGATGCTATCCTAATGGCAAATGAAGCTGTGGATTTTTGGAAGGTGAAGAAGATAAAGGGCTTTATTTT
GAAGCTTGACATTGAAAAGGCTTTTGACAAGTTAAATTGGGATTTCATCGATTTTGTCCTGGAGAAAAAGAATTTTCCAATCCTTTGGAGAAAGTGGATAAGAGGATGTA
TAAGCAATGTCACTTACTCTGTTATTGTCAACGGAAGACCTCAAGGTCGTATTAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCTTTTCTATTTGTTATT
GCCATGGATTACCTTAGTCGTCTCTTATCACATCTGGAAAGTTCTGGTGCAATTAAAGGGGTATCTCTCAACAGTAATTGCAACATCTCTCACATCCTTTTCGCTGATGA
TATTCTTCTTTTCATAGAAGATAATGATTATTTTCTGAATAACCTTAGAATGGCTTTATCTCTGTTCGAAAGAGCTTCGAGTCTCAAAATAAACTTATTGAAATCAGCTC
TGGTGCCAATGAATGTGTCCGTGAATAGAGCTAAAGAATGTGCTTCGATTTGGGGTATTCCTTGCCACTCTCTCCCCCTCTCCTACTTGGGAGTTCCTCTTGGTGGCAAT
CCAAAATCCAACCTTTTTTGGCGCAACGTTGAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCTCAGATATCAAAAGGTGGAAGACTCACTTTAATCAAGTC
TACCCTTAGCAGTCTTCCTATTTATCAACTATCTGTTTTCCAAGCTCCTTCCATGACGTGCAAAAACATTGAAAAATCCTGGAGAAAGTTCCTTTGGAAAGGTAATAACG
GATCTGTAGGATCCCACCTAATCAACTGGACTAAAGTCTCTAAATCTAAAGAGGAGGGTGGGCTGGGTACCTCAAGGCTTCATGTGACAAATAAAGCCCTCTTAACTAAG
TGGCTCTGGCGTTATCTCTCGGAACCTAATGCCCTTTGGAGGAGACTGATTCAATGCAAATATAAAGGCAACTATCCAGGAGACATTCCATCAAACATCTCCTCTATTAC
TTCTAAAGCCCCGTGGAGATCTATCATTGACAACATTGATTGGTTCAAAAGTAATCAAAGTTGGGAACTGAATAATGGAGATCAAATCTCCTTTTGGTATTCTAATTGGT
CTCTAGAAGGTCGTCTCTCAACTGCCTATCCTAGACTTTTTGCTCTTACTCTTGACAAAGAAATCTCAGTTAAAGATGCGTGGAACACATTCGATAACCGATGA
mRNA sequence
ATGGCGGCTCCTTTCATTATACTAAATTCACCTGAATTCACCCACTGTATTAACTACAAATCCAGCCTTTTACTTGCTGTTAAAAGGTCACTTTCATCTCCAGTTCTTTT
CACTGCTTTTCACCTTCCTTCCTTGGTTTTTCCTACAATCTCCCAAACAATGGCCTACTTCAAATCACTTCCTAGATCATGTAAGGTTGAAAGAAAGGAATTTGTCCTTC
ACCTTGACAAATATTCAAAGCACACTCATTATTGGTTAACAGAAACTGGAGCTCATAAGGCTTTCTCCATTGAAGTTTCCCCAAAAGACTTGGACTGGATAAGATGTACT
CTGAAATCCTTGATCGCAACCCCAAATACAAACCGATTCTTCCTTGAGACCCGTGACTCTGAGCAATGCATCTGGATCAGGAAAACAAGAAATAGTAAAGGATGTACTGC
AGAAATTTTTAGAGTTGATCAAAAAAACAGAAAATCATGTATCCTAGTTCCGGAAGGACCTGAGAAAAGTGGCTGGGTCTCCTTCCTATCCATGATTACCCCAAAAGTAG
AAGTGAAAGCAAAAACAAGACCAACTTTTTTGCCAAGGTCCAGTCCTGACGGTCGTCTTTCTCCTCCCATTGACTACCACAAGCGATCATACGCAAGAGCTGTCACAGAA
GGAAGAGCTACAAGTGACTCAAGTGACTCTTATGATACAAGCGATTCGAGTCATTCATCAGGTAATAGTTTTTGTGACTCTCCCTCATCTGATCTTCTTGAAAATACAGT
GGTGATAGTTAGACGATTCTTTCATGATGACTGGCACAAAATCCTTCAAAACCTGAGAAAACAAACAGAGGAATCTTTCACTTATAATGCTTTCCATGCTGAAAAAGCTT
TAGTTCATTTTAGCTCAAACATACCTGCAAACCTTCTCTGCCAGAACAAAGGATGGACCACGGTAGGAAAGTACTCGGTAAAATTTGAAAAATGGTCCTCTGCGTATCAC
GCCACTCCAAAACTTATTCCTAGTTATGGAGGATGGACAACTTTCAGAGGAATTCCACTACACTTGTGGAATATGACGACTTTTCAACAACTTGGAAAAGCCTGCGGAGG
TTTGATTAAAGTGGCTGAGGAAACAAGATCAGCAAAAAACCTGGTAAAAGCAAGGATAAAGGTTAGATACAATTACTCAGGCTTCTTACCAGCAAATGTAAGGATTTTTG
ATAATGAGGGAAACAAATTTTCCATCCAAGTAGTTACTCACCCAGAAGGCAAATGGTTAATAGAAAGGAATGTCAGATTACATGGCACCTTCAAGAGACAAGCTGCAGCC
GCCTTTGATGAATTCAATCCTGAATCAGAGCAATTCTTCTTCGAAGGAATGGAGGCCATATCGCCGGACTTCCTTTCCACTAGCTCCGACGGCCGTAAAAGCAACACACC
GGATCAGCCACCTGCATTAAAATCTGTTATCATTAAATCTGACAGAGTTGCCACGTCGCCAAGCTTCTTAAATGAAGAGGTAGTTAATGATAGTAATTTGCATGCAACGG
CTAACAAATCCAAATCAGAGATATTACCTGGGATATCAAATGATGGCGTGTTGGACAAAGGAAAACAGAAGGTTGACATTCAGCTTCAACCCAATTCAGCATTAAATTTG
GATAAATCCAAAAGGAAAGTCTCCTTCAACTCTCCCTGTAATAAAACCAACATCTTCAATCCGGATTCTGCTCCAGCCAATCATTCTCCATCATTAAGTTCCCCTGAAAA
AAAACAGAAAGTAAGTAGAGAGAGAAACCACCACAGCTCTGATAATGCAGAAGTTATTGATATAACAAACACTGAAGTGGTTCCTGAGACACCTGAAATGAAAATGCAAG
TTAATGAGAATTCAAATTCTTCTTCTGAAGCCAACTACAGGAAACCAAAACATGTTCATAAAAGAAAATACTACTACAGGAAAAAAGAAGAAAAGGAGAAGGATCCGGAC
TCAAAGGCCTTCAAAAAACAACTTGCTTCCTGGTTGAAGGAAAATGGTCTGAAAATCTCTACGGTCACTGACTCTTCAGGGGCAACTACTTCAACAAATGTTTTGATAAA
TCAATTGAATTCTGGGTTAGCTTCAAAGGGGATAGGGGCTTTGGGGACATCTATTCTCCAGAATGTTGAACAATTTCATCACCAACAATCTTCTGATAGATCCTCCCTCA
TAAACAATAGATTCACTTGGTCAAACTTACGGAATCCTCCTACTTTTTCCAGAATTGATAGATTCCTTTACAACTCAACTTGGGAAAATCTCTTCAGTCCCCACACAACA
AGGACCCTTCCTAGATCTACTTCAGATCACTTTCCTCTGGTCTGTGAAAACTCCAACACCAAGCTTAGTTGGGGTCCTGTCCCATTCCGTTTAAACTCCATAGCTCTCAG
TGACCCAGAATTCAAAAGAAACATGGGAAGATGGTGGGAAAACTCGATCCAAGATGGTCATCCAGGTTTCTCCTTCATCCAAAGGCTAAAGTCTTTAGCAAATTTTATCA
AACCTTGGCAAAAGGAGAAATTACACTCTCTCACCCATGCTAAAGATAGCATTTTAAGGGAAGTGGACTCTATTGACAAAAAGGAATTGGATACTCCTTTGACTCAAGAG
GAAAGTAATCGTCGTCTAGCCCTAAAAGCTGATCTCAGCGAGCTATCTCTCAAGGAGTCCCAATTCTGGTACCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGA
AAACTCCTCCTTCTTTCATAGAATTTGCTCATCAAGACAAAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGTTTGATTCAGAATACAAACATCAGTATATCAA
CTGCTTTTATAAAATTCTTTTCAAAGATTTATAGAAGCTCTACAAAAAGTGATCCTCTTTTTATAGATAATCTAGATTGGAATCCGATAGAGCATTCTGAGTGGTCGCAC
CTTTGTGCCCCTTTTTTGGAAGATGAGATTAAAGGGGTTATAAACTCTTTAGATGGAAAAAAGACTCCTGGTCTAGACGGCTTCCCTATCTCCTTCTTTAAAACTTACTG
GTATCTTCTAAAAGAGGATATCTTGGACATATTCAAGGATTTTTATGACAAAGGTGTTATCAACAAGAATATGAATAACACCTACATTGCTTTGATCCCAAAAAAGAAGG
ATTATTCTCATCCCAAAGACTTCAGACCAATCAGCCTAACAACGTCCATCTATAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCACCCTTCCTGACACCATC
TCAGGAAACCAGCTAGCTTTTGTCAAGAATCGCCAAATTACTGATGCTATCCTAATGGCAAATGAAGCTGTGGATTTTTGGAAGGTGAAGAAGATAAAGGGCTTTATTTT
GAAGCTTGACATTGAAAAGGCTTTTGACAAGTTAAATTGGGATTTCATCGATTTTGTCCTGGAGAAAAAGAATTTTCCAATCCTTTGGAGAAAGTGGATAAGAGGATGTA
TAAGCAATGTCACTTACTCTGTTATTGTCAACGGAAGACCTCAAGGTCGTATTAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCTTTTCTATTTGTTATT
GCCATGGATTACCTTAGTCGTCTCTTATCACATCTGGAAAGTTCTGGTGCAATTAAAGGGGTATCTCTCAACAGTAATTGCAACATCTCTCACATCCTTTTCGCTGATGA
TATTCTTCTTTTCATAGAAGATAATGATTATTTTCTGAATAACCTTAGAATGGCTTTATCTCTGTTCGAAAGAGCTTCGAGTCTCAAAATAAACTTATTGAAATCAGCTC
TGGTGCCAATGAATGTGTCCGTGAATAGAGCTAAAGAATGTGCTTCGATTTGGGGTATTCCTTGCCACTCTCTCCCCCTCTCCTACTTGGGAGTTCCTCTTGGTGGCAAT
CCAAAATCCAACCTTTTTTGGCGCAACGTTGAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCTCAGATATCAAAAGGTGGAAGACTCACTTTAATCAAGTC
TACCCTTAGCAGTCTTCCTATTTATCAACTATCTGTTTTCCAAGCTCCTTCCATGACGTGCAAAAACATTGAAAAATCCTGGAGAAAGTTCCTTTGGAAAGGTAATAACG
GATCTGTAGGATCCCACCTAATCAACTGGACTAAAGTCTCTAAATCTAAAGAGGAGGGTGGGCTGGGTACCTCAAGGCTTCATGTGACAAATAAAGCCCTCTTAACTAAG
TGGCTCTGGCGTTATCTCTCGGAACCTAATGCCCTTTGGAGGAGACTGATTCAATGCAAATATAAAGGCAACTATCCAGGAGACATTCCATCAAACATCTCCTCTATTAC
TTCTAAAGCCCCGTGGAGATCTATCATTGACAACATTGATTGGTTCAAAAGTAATCAAAGTTGGGAACTGAATAATGGAGATCAAATCTCCTTTTGGTATTCTAATTGGT
CTCTAGAAGGTCGTCTCTCAACTGCCTATCCTAGACTTTTTGCTCTTACTCTTGACAAAGAAATCTCAGTTAAAGATGCGTGGAACACATTCGATAACCGATGA
Protein sequence
MAAPFIILNSPEFTHCINYKSSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCT
LKSLIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTE
GRATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYH
ATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQAAA
AFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNL
DKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERNHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPD
SKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGLASKGIGALGTSILQNVEQFHHQQSSDRSSLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTT
RTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQE
ESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSH
LCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTI
SGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVI
AMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGN
PKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTK
WLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRLFALTLDKEISVKDAWNTFDNR