CuGenDBv2

Cmc09g0256071 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0256071
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: CMiso1.1chr09:21088878..21091334
RNA-Seq Expression: Cmc09g0256071
Synteny: Cmc09g0256071
Gene Ontology terms:
    GO:0007165 - signal transduction (biological process)
    GO:0003824 - catalytic activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (e value; % identity; alignment)
KAA0037445.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 97.55)
Query:  MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR
        MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFS HTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR
Subjt:  MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR

Query:  WWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE
        WWENSIQDGHPGFSFIQRLKSLANFI PWQKEKL SLTY KDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE
Subjt:  WWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE

Query:  NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDG
        NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGIST FIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEW HLCAPFLEGEIKGVINSLDGKKTPGPDG
Subjt:  NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDG

Query:  FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI
        FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKK+YSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI
Subjt:  FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI

Query:  LMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR
        LMANEA+DFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR
Subjt:  LMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR

Query:  LLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPL
        LLSHL++SGAIKGVSLNSNCNISHILF DDILLFIEDNDYFL NLRMALSLFE+ASGLKINLL+SALVPVNVSVNRAK+CAS WGISCHSLPLSYLGVPL
Subjt:  LLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPL

Query:  GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 81.54)
Query:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN
        MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLL  EE +FSLSANF  NNN
Subjt:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN

Query:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL
         SWWLTGLYGP KRR+RIHFW +LHNLQHLNS PW L  DLNVIRMREE+TS+ SSSHSSRMLNNFI+NNLLIDPPL NNRFTWSNLRNP TFSRIDRFL
Subjt:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL

Query:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY
        YNS+WENLFS HTTRTLPR TSDHFPLVCEDSNPKL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKL SL Y
Subjt:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY

Query:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF
         K+ IIREVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLKESQF                       C                              
Subjt:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF

Query:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI
              IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE EIKGVINS DGKK P PDGFPISFFKSYW+LLKEDIMDIFKDF++KGVINKNMNNTYI
Subjt:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI

Query:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV
        ALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT LP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F NLNWDFID+V
Subjt:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV

Query:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND
        L KKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHL+ SGAIKGVSLN+NCNISHILF DDILLF+EDND
Subjt:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND

Query:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN
         FLNNL MALSLFE+ASGLKINLL+SALVPVNVS+NRAK+CAS WGISCHSL LSYLGVPLGG+
Subjt:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 85.81)
Query:  LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILG
        LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLL  EEGLFSLSANFL NNN SWWLTGLYGPVKRRERIHFW ELHNLQHLNS PWILG
Subjt:  LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILG

Query:  GDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSW
        GDLNVIRMREESTSV SSSH+SRMLNNFI+NNLLIDPPL NNRFTWSNLRNPPTFSRIDRFLYNS+WENLFS HTTRTLPRSTSDHFPLVCEDSNPKLSW
Subjt:  GDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSW

Query:  GPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSE
        GP+PFRLNSI LSDPEFKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEKL SLTY K+ IIREVDSIDKKELDTPLTQEESNRRLALKADLSE
Subjt:  GPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSE

Query:  LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAP
        LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEG IQNTNN IST FIKFFSRIYRSSTKSDPLFI+NLDWNPI  SEWSHLCAP
Subjt:  LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAP

Query:  FLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT
        FLEGEIKGVINS DGKKTPGPDGFPISFFKS+W                                                                LKT
Subjt:  FLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT

Query:  ALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKAN
         LP+TISGNQLAFVKNRQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFDNLN DFID VLEKKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKAN
Subjt:  ALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKAN

Query:  RGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRA
        RGLRQGDPLSPFLFVIAMDYLSRLLSHL++SGAIKGVSLN NCNISHILF DDILLFIEDND FL NLRMALSLFERASGLKINLL+SALVPVNVS+ RA
Subjt:  RGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRA

Query:  KDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        K+CAS WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  KDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

TYK03140.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 81.07)
Query:  VILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLN
        +ILTETRLKITNKRIIKSLWPSNSINWIAKNA GSSGGILILWDAQNHSLL HEEG+FSLSANFLFNNNLSWWLTGLYGPVKRRERIHFW ELHNLQHLN
Subjt:  VILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLN

Query:  SLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCED
        S PWILGGDLNV R+REESTSVSSSSHSSRMLNNFI NNLL+DPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFS HTTRTLPRSTSDHFPLVCED
Subjt:  SLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCED

Query:  SNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLA
        SNPKLSWGPVPFRLNSIAL+DP+FKRNMGR                                           IIREVDSIDKKELDTPL+QEESNRRLA
Subjt:  SNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLA

Query:  LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSE
        LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEE                                 DNLDWN IEHSE
Subjt:  LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSE

Query:  WSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKT
        WSHLCAPFLE EIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYS+PKDFRPIS TTSIYKIIAKT
Subjt:  WSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKT

Query:  LSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP
        LSNRLKT+LPDTISGNQLAFVKNRQITDAILMANEAVDFWK+KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP
Subjt:  LSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP

Query:  QGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPV
        QGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHL++SGAIKGVSL++NCNISHILF DDILLFI+DNDYFLNNLRMALSLFERASGLKINLL+SALVPV
Subjt:  QGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPV

Query:  NVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        NVS NRAK+CAS W                            V   I      W++  ISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  NVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 81.41)
Query:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN
        MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLL  EE +FSLSANF  NNN
Subjt:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN

Query:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL
         SWWLTGLYGP KRR+RIHFW +LHNLQHLNS PW L  DLNVIRMREE+TS+ SSSHSSRMLNNFI+NNLLIDPPL NNRFTWSNLRNP TFSRIDRFL
Subjt:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL

Query:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY
        YNS+WENLFS HTTRTLPR TSDHFPLVCEDSNPKL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GH GFSFIQRLKSLAN IKPWQKEKL SL Y
Subjt:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY

Query:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF
         K+ IIREVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLKESQF                       C                              
Subjt:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF

Query:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI
              IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE EIKGVINS DGKK P PDGFPISFFKSYW+LLKEDIMDIFKDF++KGVINKNMNNTYI
Subjt:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI

Query:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV
        ALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT LP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F NLNWDFID+V
Subjt:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV

Query:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND
        L KKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHL+ SGAIKGVSLN+NCNISHILF DDILLF+EDND
Subjt:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND

Query:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN
         FLNNL MALSLFE+ASGLKINLL+SALVPVNVS+NRAK+CAS WGISCHSL LSYLGVPLGG+
Subjt:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN

TrEMBL top hits (e value; % identity; alignment)
A0A5A7UV84 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 81.54)
Query:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN
        MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLL  EE +FSLSANF  NNN
Subjt:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN

Query:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL
         SWWLTGLYGP KRR+RIHFW +LHNLQHLNS PW L  DLNVIRMREE+TS+ SSSHSSRMLNNFI+NNLLIDPPL NNRFTWSNLRNP TFSRIDRFL
Subjt:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL

Query:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY
        YNS+WENLFS HTTRTLPR TSDHFPLVCEDSNPKL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKSLAN IKPWQKEKL SL Y
Subjt:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY

Query:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF
         K+ IIREVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLKESQF                       C                              
Subjt:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF

Query:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI
              IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE EIKGVINS DGKK P PDGFPISFFKSYW+LLKEDIMDIFKDF++KGVINKNMNNTYI
Subjt:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI

Query:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV
        ALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT LP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F NLNWDFID+V
Subjt:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV

Query:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND
        L KKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHL+ SGAIKGVSLN+NCNISHILF DDILLF+EDND
Subjt:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND

Query:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN
         FLNNL MALSLFE+ASGLKINLL+SALVPVNVS+NRAK+CAS WGISCHSL LSYLGVPLGG+
Subjt:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; % identity: 85.81)
Query:  LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILG
        LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLL  EEGLFSLSANFL NNN SWWLTGLYGPVKRRERIHFW ELHNLQHLNS PWILG
Subjt:  LKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILG

Query:  GDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSW
        GDLNVIRMREESTSV SSSH+SRMLNNFI+NNLLIDPPL NNRFTWSNLRNPPTFSRIDRFLYNS+WENLFS HTTRTLPRSTSDHFPLVCEDSNPKLSW
Subjt:  GDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSW

Query:  GPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSE
        GP+PFRLNSI LSDPEFKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEKL SLTY K+ IIREVDSIDKKELDTPLTQEESNRRLALKADLSE
Subjt:  GPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSE

Query:  LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAP
        LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEG IQNTNN IST FIKFFSRIYRSSTKSDPLFI+NLDWNPI  SEWSHLCAP
Subjt:  LSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAP

Query:  FLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT
        FLEGEIKGVINS DGKKTPGPDGFPISFFKS+W                                                                LKT
Subjt:  FLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT

Query:  ALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKAN
         LP+TISGNQLAFVKNRQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFDNLN DFID VLEKKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKAN
Subjt:  ALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKAN

Query:  RGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRA
        RGLRQGDPLSPFLFVIAMDYLSRLLSHL++SGAIKGVSLN NCNISHILF DDILLFIEDND FL NLRMALSLFERASGLKINLL+SALVPVNVS+ RA
Subjt:  RGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRA

Query:  KDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        K+CAS WGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  KDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

A0A5D3BUZ3 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; % identity: 97.55)
Query:  MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR
        MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFS HTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR
Subjt:  MLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGR

Query:  WWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE
        WWENSIQDGHPGFSFIQRLKSLANFI PWQKEKL SLTY KDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE
Subjt:  WWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDE

Query:  NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDG
        NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGIST FIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEW HLCAPFLEGEIKGVINSLDGKKTPGPDG
Subjt:  NSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDG

Query:  FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI
        FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKK+YSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI
Subjt:  FPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAI

Query:  LMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR
        LMANEA+DFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR
Subjt:  LMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSR

Query:  LLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPL
        LLSHL++SGAIKGVSLNSNCNISHILF DDILLFIEDNDYFL NLRMALSLFE+ASGLKINLL+SALVPVNVSVNRAK+CAS WGISCHSLPLSYLGVPL
Subjt:  LLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPL

Query:  GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  GGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

A0A5D3BVM7 LINE-1 retrotransposable element ORF2 protein (e value: 0.0e+00; % identity: 81.07)
Query:  VILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLN
        +ILTETRLKITNKRIIKSLWPSNSINWIAKNA GSSGGILILWDAQNHSLL HEEG+FSLSANFLFNNNLSWWLTGLYGPVKRRERIHFW ELHNLQHLN
Subjt:  VILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLN

Query:  SLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCED
        S PWILGGDLNV R+REESTSVSSSSHSSRMLNNFI NNLL+DPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFS HTTRTLPRSTSDHFPLVCED
Subjt:  SLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCED

Query:  SNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLA
        SNPKLSWGPVPFRLNSIAL+DP+FKRNMGR                                           IIREVDSIDKKELDTPL+QEESNRRLA
Subjt:  SNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRLA

Query:  LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSE
        LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEE                                 DNLDWN IEHSE
Subjt:  LKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSE

Query:  WSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKT
        WSHLCAPFLE EIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYS+PKDFRPIS TTSIYKIIAKT
Subjt:  WSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKT

Query:  LSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP
        LSNRLKT+LPDTISGNQLAFVKNRQITDAILMANEAVDFWK+KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP
Subjt:  LSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRP

Query:  QGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPV
        QGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHL++SGAIKGVSL++NCNISHILF DDILLFI+DNDYFLNNLRMALSLFERASGLKINLL+SALVPV
Subjt:  QGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPV

Query:  NVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV
        NVS NRAK+CAS W                            V   I      W++  ISKGGRLTLIKSTLSSLPIYQLSV
Subjt:  NVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV

A0A5D3CI86 Reverse transcriptase domain-containing protein (e value: 0.0e+00; % identity: 81.41)
Query:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN
        MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKITNKRIIKS WPSNSINWI KNASGSSGGILILWDAQ+HSLL  EE +FSLSANF  NNN
Subjt:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNN

Query:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL
         SWWLTGLYGP KRR+RIHFW +LHNLQHLNS PW L  DLNVIRMREE+TS+ SSSHSSRMLNNFI+NNLLIDPPL NNRFTWSNLRNP TFSRIDRFL
Subjt:  LSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFL

Query:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY
        YNS+WENLFS HTTRTLPR TSDHFPLVCEDSNPKL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GH GFSFIQRLKSLAN IKPWQKEKL SL Y
Subjt:  YNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTY

Query:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF
         K+ IIREVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLKESQF                       C                              
Subjt:  VKDNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVF

Query:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI
              IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE EIKGVINS DGKK P PDGFPISFFKSYW+LLKEDIMDIFKDF++KGVINKNMNNTYI
Subjt:  IKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYI

Query:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV
        ALI KKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKT LP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILKLDIEK F NLNWDFID+V
Subjt:  ALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFV

Query:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND
        L KKNFPN WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDY SRLLSHL+ SGAIKGVSLN+NCNISHILF DDILLF+EDND
Subjt:  LEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND

Query:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN
         FLNNL MALSLFE+ASGLKINLL+SALVPVNVS+NRAK+CAS WGISCHSL LSYLGVPLGG+
Subjt:  YFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN

SwissProt top hits (e value; % identity; alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 4.7e-48; % identity: 24.26)
Query:  LLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRL--KITNKRIIKSLWPSNSINWIAKNASGSSGGILIL----WDAQNHSLLCHEEGLFSLSANFL
        +LT N  GL SP KR  + + I S  P    + ET L  + T++  IK        N   K A     G+ IL     D +   +   +EG + +    +
Subjt:  LLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRL--KITNKRIIKSLWPSNSINWIAKNASGSSGGILIL----WDAQNHSLLCHEEGLFSLSANFL

Query:  FNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID-PPLINNRFTWSNLRNPP--TF
            L+  +  +Y P     R      L +LQ       ++ GD N      + ++    +  ++ LN+ +    LID    ++ + T     + P  T+
Subjt:  FNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID-PPLINNRFTWSNLRNPP--TF

Query:  SRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWG-PVPFRLNSIALSD----PEFKRNMGRWWE------NSIQDGHPGFSFIQRLKS
        S+ID  + +     L     T  +    SDH  +  E     L+      ++LN++ L+D     E K  +  ++E       + Q+    F  + R K 
Subjt:  SRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCEDSNPKLSWG-PVPFRLNSIALSD----PEFKRNMGRWWE------NSIQDGHPGFSFIQRLKS

Query:  LANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR---LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFI
        +A  +  +++++ +S     D +  ++  ++K+E     T  +++RR     ++A+L E+  +++      ++  +    ++      R+   +++++ I
Subjt:  LANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR---LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFI

Query:  HEIQDEEGLIQNTNNGISTVFIKFFSRIYRS---STKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKE
          I++++G I      I T   +++  +Y +   + +    F+D      +   E   L  P    EI  +INSL  KK+PGPDGF   F++ Y   L  
Subjt:  HEIQDEEGLIQNTNNGISTVFIKFFSRIYRS---STKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKE

Query:  DIMDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKK
         ++ +F+    +G++  +     I LIPK  +D +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   +      K
Subjt:  DIMDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKK

Query:  IKG-FILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKG
         K   I+ +D EKAFD +   F+   L K     ++ K IR      T ++I+NG+         G RQG PLSP LF I ++ L+R +   K    IKG
Subjt:  IKG-FILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKG

Query:  VSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNV
        + L     +   LF DD+++++E+      NL   +S F + SG KIN+ +S     N +             +  S  + YLG+ L  + K +LF  N 
Subjt:  VSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWRNV

Query:  E---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS
        +    +I++  N WK    S  GR+ ++K  +    IY+ +
Subjt:  E---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS

P08548 LINE-1 reverse transcriptase homolog (e value: 4.1e-44; % identity: 24.56)
Query:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNK-RIIKSLWPSNSINWIAKNASGSSGGILILW-DA---QNHSLLCHEEGLFSLSANF
        + + + N  GL  P KR  + + I    PD   + E+ L + +K R+    W S        N      GI IL+ DA   +   +   ++G F      
Subjt:  MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNK-RIIKSLWPSNSINWIAKNASGSSGGILILW-DA---QNHSLLCHEEGLFSLSANF

Query:  LFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID--PPLINNRFTWSNLRNP-PT
           + +S  +  +Y P     +    T L ++ +L S   I+ GD N      + +S    S     LN+ I +  L D       N+  ++   +   T
Subjt:  LFNNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID--PPLINNRFTWSNLRNP-PT

Query:  FSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCE-DSNPKLSWGPVPFRLNSIALSD----PEFKRNMGRWWE-NSIQDGH-----PGFSFIQRLK
        +S+ID  L + +  NL        +P   SDH  +  E ++N  L      ++LN++ L D     E K+ + ++ E N+ QD +          + R K
Subjt:  FSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFPLVCE-DSNPKLSWGPVPFRLNSIALSD----PEFKRNMGRWWE-NSIQDGH-----PGFSFIQRLK

Query:  --SLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS
          +L  F+K  ++E++       +N++  +  ++K+E   P   + S R+    ++A+L+E+  K       ++K  +  + ++       +   ++ +S
Subjt:  --SLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS

Query:  FIHEIQDEEGLIQNTNNGISTVFIKFFSRIYR---SSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLL
         I  I++    I    + I  +  +++ ++Y     + K    +++      +   E   L  P    EI   I +L  KK+PGPDGF   F++++   L
Subjt:  FIHEIQDEEGLIQNTNNGISTVFIKFFSRIYR---SSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLL

Query:  KEDIMDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMA-NEAVDFWK
           ++++F++   +G++        I LIPK  KD +  +++RPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  + N      K
Subjt:  KEDIMDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMA-NEAVDFWK

Query:  VKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAI
        +K     IL +D EKAFDN+   F+   L+K      + K I    S  T ++I+NG          G RQG PLSP LF I M+ L+  +   K   AI
Subjt:  VKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAI

Query:  KGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWR
        KG+ + S   I   LF DD+++++E+       L   +  +   SG KIN  +S       +    K        +     + YLGV L  + K +L+  
Subjt:  KGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLFWR

Query:  NVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS
        N E    +I + +N WK    S  GR+ ++K ++    IY  +
Subjt:  NVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 2.1e-48; % identity: 23.94)
Query:  LLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRII-----KSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLF
        L++ N  GL SP KR  + + +    P F  L ET L+  ++  +     K+++ +N +    K  +G +  I    D Q   +   +EG F L    + 
Subjt:  LLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRII-----KSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLF

Query:  NNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID------PPLINNRFTWSNLRNPP
           LS  +  +Y P   R        L  L+   +   I+ GD N     ++ +     +  +  L   +    L D      P      +T+ +  +  
Subjt:  NNNLSWWLTGLYGPVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLID------PPLINNRFTWSNLRNPP

Query:  TFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDH------FPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLAN
        TFS+ID  + + T  N +       +P   SDH      F     +  P  +W     +LN+  L+D   K  + +  ++ ++      +      +L +
Subjt:  TFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDH------FPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLAN

Query:  FIKPWQKEKLQSLTYVK--------DNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS
         +K + + KL +L+  K         ++   + +++KKE ++P  +      + L+ +++++  + +     + +  +  + ++      R+    + + 
Subjt:  FIKPWQKEKLQSLTYVK--------DNIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRS

Query:  FIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPL-----FIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWY
         I++I++E+G I      I      F+ R+Y  STK + L     F+D      +   +  HL +P    EI+ VINSL  KK+PGPDGF   F++++  
Subjt:  FIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPL-----FIDNLDWNPIEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWY

Query:  LLKEDIMDIFKDFYDKGVINKNMNNTY----IALIPK-KKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEA
          KED++ I    + K  +   + N++    I LIPK +KD +  ++FRPISL     KI+ K L+NR++  +   I  +Q+ F+   Q    I  +   
Subjt:  LLKEDIMDIFKDFYDKGVINKNMNNTY----IALIPK-KKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEA

Query:  VDFW-KVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL
        + +  K+K     I+ LD EKAFD +   F+  VLE+      +   I+   S    ++ VNG     I    G RQG PLSP+LF I ++ L+R +   
Subjt:  VDFW-KVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHL

Query:  KNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPK
        K    IKG+ +     +   L  DD++++I D       L   ++ F    G KIN  +S       +    K+       S  +  + YLGV L    K
Subjt:  KNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPK

Query:  S--NLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS
           +  +++++ +I++ L  WK    S  GR+ ++K  +    IY+ +
Subjt:  S--NLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLS

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 3.6e-32; % identity: 23.47)
Query:  LTGLYGPVKRRERIHFWTELHNLQHL--NSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINN----RFTWSNLRN-PPTFSRID
        L  +Y P    ER  F+  L        +    I+GGD N      +         S  +L   I +  L+D     N     FT+  +R+   + SRID
Subjt:  LTGLYGPVKRRERIHFWTELHNLQHL--NSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINN----RFTWSNLRN-PPTFSRID

Query:  RFLYNSTWENLFSTHTTRTLPRSTSDH----FPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE--NSIQDGHPGFSFIQRLKSLAN-FIKPW
        R   +S   +   + T R  P   SDH      +    S PK ++    +  N+  L D  F +++   W    + QD    F+ + +   +    +K  
Subjt:  RFLYNSTWENLFSTHTTRTLPRSTSDH----FPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWE--NSIQDGHPGFSFIQRLKSLAN-FIKPW

Query:  QKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR------LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQD
         +E  +S++  ++    E+++++ + LD       S  +      L  K  L  +  ++++  + R++   L + D  S FF+ +   +  R  I  +  
Subjt:  QKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRR------LALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQD

Query:  EEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNP-IEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFK
        E+G        I      F+  ++     S     +  D  P +       L  P    E+   +  +   K+PG DG  I FF+ +W  L  D   +  
Subjt:  EEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNP-IEHSEWSHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFK

Query:  DFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKL
        + + KG +  +     ++L+PKK D    K++RP+SL ++ YKI+AK +S RLK+ L + I  +Q   V  R I D + +  + + F +   +    L L
Subjt:  DFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKL

Query:  DIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNI
        D EKAFD ++  ++   L+  +F   +  +++   ++    V +N      +   RG+RQG PLS  L+ +A++    LL        +K      +  +
Subjt:  DIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNI

Query:  SHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESA-LVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQK
            + DD++L  +D    L   +    ++  AS  +IN  +S+ L+  ++ V+      +   IS  S  + YLGV L     P S  F   +E+ +  
Subjt:  SHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESA-LVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQK

Query:  KLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQL
        +L  WK +A++ S  GR  +I   ++S   Y+L
Subjt:  KLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQL

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) (e value: 9.5e-17; % identity: 28.21)
Query:  LIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVL
        LIPK  D  +P ++RPI++ +++ +++ + L+ RL+ A+    +    A +    +    L+ +  +   + ++    ++ LD+ KAFD ++   I   L
Subjt:  LIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVL

Query:  EKKNFPNLWRKWIRGCISNVTYSVIVN-GRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND
        ++         +I G +S+ T ++ V  G    +I   RG++QGDPLSPFLF   +D    LL  L+++  I G        I  + F DD+LL +EDND
Subjt:  EKKNFPNLWRKWIRGCISNVTYSVIVN-GRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDND

Query:  YFLNNLRMALSLFERASGLKINLLESALVPVNVS
          L      ++ F R  G+ +N  +S  + V  S
Subjt:  YFLNNLRMALSLFERASGLKINLLESALVPVNVS

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.0e-29; % identity: 26.91)
Query:  ILGGDLNVIRMREESTSVSSSSHSSRMLNNF---ITNNLLIDPPLINNRFTWSNLRNP-PTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFP-LVCE
        IL GD + I    +  SV  +S   R L  F   + ++ L+D P     +TWSN ++  P   ++DR + N  W + F +          SDH P ++  
Subjt:  ILGGDLNVIRMREESTSVSSSSHSSRMLNNF---ITNNLLIDPPLINNRFTWSNLRNP-PTFSRIDRFLYNSTWENLFSTHTTRTLPRSTSDHFP-LVCE

Query:  DSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRL
        ++ PK S     FR  S   + P F  ++   WE  I  G   FS  + LK+     K   ++   ++ +     +  ++SI  + L  P         +
Subjt:  DSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRL

Query:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQ-DEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNL----DWN
        A K      +  ES F+ Q+++  WL++GD N+ FFH++  + Q ++ I  ++ D++  ++N    +  + + +++ +  S   SD L  D++    D +
Subjt:  ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQ-DEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNL----DWN

Query:  PIEHSEW--SHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTS
        P   ++   S L A   + EI   + ++   K PGPD F   FF   W+++K+  +   K+F+  G + K  N T I LIPK         FRP+S  T 
Subjt:  PIEHSEW--SHLCAPFLEGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTS

Query:  IYKII
        +YKII
Subjt:  IYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 9.7e-09; % identity: 39.74)
Query:  RLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKK-IKGF-ILKLDIEKAFDNLNWDFIDFVLEKKNFPNLW
        RLK  + + I   Q +F+  R  TD I+   EAV   + KK +KG+ +LKLD+EKA+D + WD+++  L    FP +W
Subjt:  RLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKK-IKGF-ILKLDIEKAFDNLNWDFIDFVLEKKNFPNLW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 9.4e-12; % identity: 46.27)
Query:  IVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNC-NISHILFTDD
        I+NG PQG +  +RGLRQGDPLSP+LF++  + LS L    +  G + G+ +++N   I+H+LF DD
Subjt:  IVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLKNSGAIKGVSLNSNC-NISHILFTDD
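The % identity values listed with each hit can be checked against the alignments. As a rough sketch, assuming % identity is the number of identical positions (the letters on the middle line of an alignment) divided by the aligned length including gaps, the AT4G20520.1 alignment above gives the listed 39.74. The function name `percent_identity` is illustrative only and is not part of any BLAST tool.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style pairwise alignment.

    Assumption: letters on the midline mark identical residues,
    '+' marks conservative substitutions, and spaces mark mismatches
    or gaps. The aligned length is the query line length, gaps included.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and middle line of the AT4G20520.1 alignment, copied from above.
query = ("RLKTALPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKK-IKGF-ILKLDIEKAFDNL"
         "NWDFIDFVLEKKNFPNLW")
midline = ("RLK  + + I   Q +F+  R  TD I+   EAV   + KK +KG+ +LKLD+EKA+D +"
           " WD+++  L    FP +W")

identity = round(percent_identity(query, midline), 2)
print(identity)
```

For multi-block alignments, the same count would be taken over all blocks of a hit before dividing.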


Sequences
CDS sequence
ATGAAATTGCTTACTTGGAATGCAAGAGGTTTAGGCTCCCCTTCTAAAAGAGCCCTAATAAAAAATACAATAATTTCATACTCCCCTGACTTTGTGATTCTGACTGAAAC
TAGGCTTAAGATCACAAACAAGAGAATCATTAAGTCCCTTTGGCCCTCTAATAGCATTAATTGGATTGCTAAAAATGCTTCGGGTAGTTCTGGAGGGATCTTAATTCTTT
GGGATGCCCAGAATCATTCTCTTTTATGTCATGAGGAAGGGCTTTTTAGCCTTTCAGCAAATTTTTTGTTCAACAACAATTTGTCTTGGTGGCTAACAGGTCTTTATGGT
CCAGTCAAAAGGAGGGAAAGAATCCATTTTTGGACAGAGCTTCATAATCTTCAACATCTTAATTCCCTCCCTTGGATCTTAGGAGGTGATCTTAATGTCATCAGAATGAG
AGAGGAATCAACGTCAGTTTCTAGCTCTTCTCACAGCTCCAGAATGTTGAACAATTTCATCACCAACAATCTTCTGATAGATCCTCCTCTCATAAACAATAGATTCACTT
GGTCAAACTTAAGGAATCCTCCTACTTTTTCCCGAATTGATAGATTCCTTTACAATTCAACTTGGGAAAATCTCTTCAGTACCCACACAACAAGGACCCTTCCTAGATCT
ACTTCAGACCACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAGCTTAGTTGGGGTCCTGTCCCATTCCGTTTGAACTCCATAGCACTTAGTGACCCAGAATTCAAAAG
AAATATGGGAAGATGGTGGGAAAACTCGATCCAAGATGGTCACCCAGGATTCTCCTTCATCCAAAGGCTAAAGTCCTTAGCAAATTTTATCAAACCTTGGCAAAAGGAGA
AATTACAGTCCCTCACCTATGTTAAAGATAACATTATAAGGGAAGTGGACTCTATTGACAAAAAGGAATTGGATACTCCTTTGACTCAAGAGGAAAGTAATCGTCGACTA
GCTCTAAAAGCCGATCTCAGCGAGTTATCTCTCAAGGAGTCCCAATTCTGGTACCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGAAAACTCCTCCTTCTTTCA
TAGAATTTGCTCATCAAGACAGAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGTTTGATTCAGAATACAAACAACGGTATATCAACTGTTTTTATAAAATTCT
TTTCAAGGATTTATAGAAGCTCTACAAAAAGTGATCCTCTTTTTATCGATAATCTAGATTGGAATCCGATTGAGCATTCTGAGTGGTCGCACCTTTGTGCCCCTTTTTTG
GAAGGTGAGATTAAAGGGGTTATAAACTCTTTAGATGGAAAAAAGACTCCTGGTCCAGACGGCTTTCCTATCTCCTTCTTTAAATCTTACTGGTATCTTCTAAAAGAGGA
TATCATGGACATATTCAAGGATTTTTATGACAAAGGTGTTATCAACAAGAATATGAATAACACCTACATTGCTTTGATCCCAAAAAAGAAGGACTATTCTCATCCTAAAG
ACTTCAGACCAATCAGCCTAACAACGTCCATCTATAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCGCCCTTCCTGACACCATCTCAGGAAACCAGCTAGCT
TTTGTCAAGAATCGCCAAATTACTGATGCTATCCTAATGGCAAATGAAGCTGTGGATTTTTGGAAGGTGAAGAAGATAAAGGGCTTTATTTTGAAGCTTGACATTGAAAA
GGCTTTTGACAATTTAAATTGGGATTTCATCGATTTTGTCCTCGAGAAAAAGAATTTTCCAAACCTTTGGAGAAAGTGGATAAGAGGATGTATAAGCAATGTCACTTACT
CTGTTATTGTCAACGGAAGACCCCAAGGACGTATTAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCTTTTCTGTTTGTTATTGCCATGGATTACCTTAGT
CGTCTTTTATCCCATCTGAAAAATTCTGGTGCAATTAAAGGGGTATCTCTCAACAGTAATTGCAACATCTCCCACATCCTCTTCACTGATGATATTCTTCTTTTCATAGA
AGATAATGATTACTTCCTGAATAACCTTAGAATGGCTTTATCTCTATTTGAAAGAGCTTCGGGTCTCAAAATCAACTTATTGGAATCAGCTCTGGTGCCAGTGAATGTGT
CTGTGAATAGAGCTAAAGATTGTGCTTCGCTTTGGGGTATTTCTTGCCACTCTCTCCCCCTCTCCTACTTGGGAGTTCCTCTTGGTGGCAATCCAAAATCCAACCTTTTT
TGGCGCAACGTTGAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCTCAGATATCAAAAGGCGGAAGACTCACTTTAATCAAGTCTACCCTTAGCAGTCTTCC
TATTTATCAACTATCTGTTTCCAAGCTCCTTCCTTGA
mRNA sequence
ATGAAATTGCTTACTTGGAATGCAAGAGGTTTAGGCTCCCCTTCTAAAAGAGCCCTAATAAAAAATACAATAATTTCATACTCCCCTGACTTTGTGATTCTGACTGAAAC
TAGGCTTAAGATCACAAACAAGAGAATCATTAAGTCCCTTTGGCCCTCTAATAGCATTAATTGGATTGCTAAAAATGCTTCGGGTAGTTCTGGAGGGATCTTAATTCTTT
GGGATGCCCAGAATCATTCTCTTTTATGTCATGAGGAAGGGCTTTTTAGCCTTTCAGCAAATTTTTTGTTCAACAACAATTTGTCTTGGTGGCTAACAGGTCTTTATGGT
CCAGTCAAAAGGAGGGAAAGAATCCATTTTTGGACAGAGCTTCATAATCTTCAACATCTTAATTCCCTCCCTTGGATCTTAGGAGGTGATCTTAATGTCATCAGAATGAG
AGAGGAATCAACGTCAGTTTCTAGCTCTTCTCACAGCTCCAGAATGTTGAACAATTTCATCACCAACAATCTTCTGATAGATCCTCCTCTCATAAACAATAGATTCACTT
GGTCAAACTTAAGGAATCCTCCTACTTTTTCCCGAATTGATAGATTCCTTTACAATTCAACTTGGGAAAATCTCTTCAGTACCCACACAACAAGGACCCTTCCTAGATCT
ACTTCAGACCACTTTCCTCTGGTCTGTGAAGATTCCAACCCCAAGCTTAGTTGGGGTCCTGTCCCATTCCGTTTGAACTCCATAGCACTTAGTGACCCAGAATTCAAAAG
AAATATGGGAAGATGGTGGGAAAACTCGATCCAAGATGGTCACCCAGGATTCTCCTTCATCCAAAGGCTAAAGTCCTTAGCAAATTTTATCAAACCTTGGCAAAAGGAGA
AATTACAGTCCCTCACCTATGTTAAAGATAACATTATAAGGGAAGTGGACTCTATTGACAAAAAGGAATTGGATACTCCTTTGACTCAAGAGGAAAGTAATCGTCGACTA
GCTCTAAAAGCCGATCTCAGCGAGTTATCTCTCAAGGAGTCCCAATTCTGGTACCAAAGGGCTAAAAAGCTTTGGCTTAGGGAGGGAGATGAAAACTCCTCCTTCTTTCA
TAGAATTTGCTCATCAAGACAGAAGAGAAGTTTCATTCATGAAATCCAGGATGAAGAAGGTTTGATTCAGAATACAAACAACGGTATATCAACTGTTTTTATAAAATTCT
TTTCAAGGATTTATAGAAGCTCTACAAAAAGTGATCCTCTTTTTATCGATAATCTAGATTGGAATCCGATTGAGCATTCTGAGTGGTCGCACCTTTGTGCCCCTTTTTTG
GAAGGTGAGATTAAAGGGGTTATAAACTCTTTAGATGGAAAAAAGACTCCTGGTCCAGACGGCTTTCCTATCTCCTTCTTTAAATCTTACTGGTATCTTCTAAAAGAGGA
TATCATGGACATATTCAAGGATTTTTATGACAAAGGTGTTATCAACAAGAATATGAATAACACCTACATTGCTTTGATCCCAAAAAAGAAGGACTATTCTCATCCTAAAG
ACTTCAGACCAATCAGCCTAACAACGTCCATCTATAAGATCATTGCCAAAACTCTTTCAAACAGGTTAAAAACCGCCCTTCCTGACACCATCTCAGGAAACCAGCTAGCT
TTTGTCAAGAATCGCCAAATTACTGATGCTATCCTAATGGCAAATGAAGCTGTGGATTTTTGGAAGGTGAAGAAGATAAAGGGCTTTATTTTGAAGCTTGACATTGAAAA
GGCTTTTGACAATTTAAATTGGGATTTCATCGATTTTGTCCTCGAGAAAAAGAATTTTCCAAACCTTTGGAGAAAGTGGATAAGAGGATGTATAAGCAATGTCACTTACT
CTGTTATTGTCAACGGAAGACCCCAAGGACGTATTAAAGCTAACAGAGGTCTTAGACAAGGTGATCCCCTTTCCCCTTTTCTGTTTGTTATTGCCATGGATTACCTTAGT
CGTCTTTTATCCCATCTGAAAAATTCTGGTGCAATTAAAGGGGTATCTCTCAACAGTAATTGCAACATCTCCCACATCCTCTTCACTGATGATATTCTTCTTTTCATAGA
AGATAATGATTACTTCCTGAATAACCTTAGAATGGCTTTATCTCTATTTGAAAGAGCTTCGGGTCTCAAAATCAACTTATTGGAATCAGCTCTGGTGCCAGTGAATGTGT
CTGTGAATAGAGCTAAAGATTGTGCTTCGCTTTGGGGTATTTCTTGCCACTCTCTCCCCCTCTCCTACTTGGGAGTTCCTCTTGGTGGCAATCCAAAATCCAACCTTTTT
TGGCGCAACGTTGAAGATAAGATCCAAAAAAAGCTCAATAATTGGAAATATGCTCAGATATCAAAAGGCGGAAGACTCACTTTAATCAAGTCTACCCTTAGCAGTCTTCC
TATTTATCAACTATCTGTTTCCAAGCTCCTTCCTTGA
Protein sequence
MKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLCHEEGLFSLSANFLFNNNLSWWLTGLYG
PVKRRERIHFWTELHNLQHLNSLPWILGGDLNVIRMREESTSVSSSSHSSRMLNNFITNNLLIDPPLINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSTHTTRTLPRS
TSDHFPLVCEDSNPKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLQSLTYVKDNIIREVDSIDKKELDTPLTQEESNRRL
ALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNNGISTVFIKFFSRIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFL
EGEIKGVINSLDGKKTPGPDGFPISFFKSYWYLLKEDIMDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTALPDTISGNQLA
FVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNFPNLWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLS
RLLSHLKNSGAIKGVSLNSNCNISHILFTDDILLFIEDNDYFLNNLRMALSLFERASGLKINLLESALVPVNVSVNRAKDCASLWGISCHSLPLSYLGVPLGGNPKSNLF
WRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVSKLLP
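
The protein sequence above corresponds to a translation of the CDS sequence. A minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1) and using only the first 36 and last 15 nucleotides of the CDS as listed, is shown below. The helper name `translate` is hypothetical and is not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 15 nt of the CDS sequence listed above.
cds_start = "ATGAAATTGCTTACTTGGAATGCAAGAGGTTTAGGC"
cds_end = "AAGCTCCTTCCTTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Joining the CDS lines into one string and passing it to `translate` would, under the same assumption, give the full protein for comparison.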