; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021125 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021125
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr12:24034054..24037230
RNA-Seq ExpressionPay0021125
SyntenyPay0021125
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0087.98Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        M+RFN+FI+NCNL DPPL+NAK+TWSNLRAQATLSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYKKNIEF
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WWGNTSQPGYAGYSFM RLKQLAL IK WG++KKGK+EASKKA IKEID IDKLEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIW+HEGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY
        NSSFFHKICTARQKKCLISKIINNSGQNCL DSDIADAFIQHFE+IYTDNRN+ LFI+NLDWCPISN NS+LLDKPFNE EIW TLK FAKNKAPGPDGY
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY

Query:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL
         MDFLQKSW+FMKQNICDIFKDFHSTHIINKVVNETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGRQITEAIL
Subjt:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL

Query:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------
        IANEALDFWR+KKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYS+LINGRPRGRIKPSRGIRQ                   
Subjt:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------

Query:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
                                          EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
Subjt:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG

Query:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIH
        G+PSS+ FWDNVLQKIQKKLS+WKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKEKGGLGIH
Subjt:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIH

Query:  SVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA
        SVNSTNF+LLCKWLWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSPWKAVTECISWF+KNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA
Subjt:  SVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA

Query:  LSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKK
        LSTNKKGSVK+FWNPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPL NRG PKPLW LNSNNIFDTASVKR+++EA  SPANFHPNLYKTLWKVEFPKK
Subjt:  LSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKK

Query:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL
        CKFFIWTLIHGCINTADRLQKRLPNW L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKAKALLNWN TP DVQSL+QNICSLNI  QKGLITFNT AT+
Subjt:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL

Query:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV
        LWKIWLERNNRIFKQQ K  Q+LWED LAQ GLWSCKSKLFSNYDCCSIALNISAFV
Subjt:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0085.76Show/hide
Query:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLALKI
        NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESSSISWGP PFRFTNAYLKDPDYK+NIEFWWGNTSQPG+AGYSFM RLKQLA+KI
Subjt:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLALKI

Query:  KAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSG
        KAWGKEKKGKDE SKKAWIKEI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIW+HEGDENSSFFHKICTARQKKCLISK+INN G
Subjt:  KAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSG

Query:  QNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHST
        QNCL DSDI DAFIQHFEEIYTDN+N+ LFIDNLDWCPISNTN  LLDKPFNE EIW TLK F KNKAPGPDG+TMDFLQKSW+FMK NICDIFKDFHS 
Subjt:  QNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHST

Query:  HIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA
        H INKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE QMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA
Subjt:  HIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------------------
        FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYS+LINGRPRGRIKP+RGIRQ                                             
Subjt:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------------------

Query:  --------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYS
                ED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSS+ FWDN+LQKIQKKLSSWKYS
Subjt:  --------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYS

Query:  QLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWK
        QLSKGGRITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIHSV+STNF+LLCKWLWKFLTEK+PLWK
Subjt:  QLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWK

Query:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR
        RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWF+KNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKKGSVKD WNPS  DW++H+NR
Subjt:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR

Query:  PLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW
        PLRDHE+NLWHNIKASLPTPL +RG  KPLWKLNSNNIFDTAS+K+ LSEASASP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Subjt:  PLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW

Query:  ALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQQGKDFQELWED
         L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKA+ALL WN TPNDV+SL QNICSLNI TQKGLITFNTIA LLWKIWLERNNRIFKQQ K+FQ+LWED
Subjt:  ALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQQGKDFQELWED

Query:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0084.84Show/hide
Query:  MHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKK
        M RLKQLA+KIKAWGKEKKGKDE SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+WAQKCKRIW+HEGDENSSFFHKICT RQKK
Subjt:  MHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKK

Query:  CLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQN
        CLISK+INN GQNCL DSDI DAFIQHFEEIYTDN+N+ LFIDN DWCPISNTN  LLDKPFNE EIW TLK F KNKAPGPDG+TMDFLQKSW+FMK N
Subjt:  CLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQN

Query:  ICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER
        ICDIFKDFHS H INKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLIAK LADRLKQTLPDTISE QMAFVKGRQITEAILIANEALDFWRNKKER
Subjt:  ICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER

Query:  GFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ----------------------------------
        GFVIKLDIEKAFDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYS+LINGRPRGRIKP+RGIRQ                                  
Subjt:  GFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ----------------------------------

Query:  -------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQK
                           ED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSS+ FWDN+LQK
Subjt:  -------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQK

Query:  IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLW
        IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNF+LLCKWLW
Subjt:  IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLW

Query:  KFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNP
        KFLTEK+PLWKRLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWF+KNI WKVNDGEDISFWLDNWNGN+PLSLAVPRLFALSTNKKGSVKD WNP
Subjt:  KFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNP

Query:  SSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINT
        S  DW++H+NRPLRDHE+NLWHNIKASLPTPL +RG  KPLWKLNSNNIFDTAS+K+ LSEASASP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINT
Subjt:  SSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINT

Query:  ADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ
        ADRLQKRLPNW L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKA+ALL WN TPNDV+SL QNICSLNI TQKGLITFNTIA LLWKIWLERNNRIFKQ
Subjt:  ADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ

Query:  QGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        Q K+FQ+LWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  QGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

TYJ99326.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0085.45Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        MKRFNTFI+NCNL DPPLTNAKFTWSNLRAQATLSRLDRFLFST WENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYK+NIEF
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WWGNTSQPG+AGYSFMHRLKQLA+KIKAWG+EKKGKDEASKKAWIKEIDLI+KLEAEG++TEIHREKRIALKADLSQITLTEAQIWAQKCKRIW+HEGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY
        NSSFFHKICTARQKKCLISKIIN  GQNCL DSDI DAFIQHFEEIYTDNRN+HLFIDNLDWCPISNTNS LLDKPFNE EIW TLK FAKNKAPGPDG+
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY

Query:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL
        TMDFLQKSW+FMKQNICDIFKDFHS H INKVVNETLIT IAKKENCETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGRQITEAIL
Subjt:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL

Query:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------
        IANEALD WRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYS+LINGRPRGRIKPSRGIRQ                   
Subjt:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------

Query:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
                                          ED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISKGHLPTSYLGMPLG
Subjt:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG

Query:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK
        GKPSS+ FWDNVLQKIQKKLSSWKYSQLSKG RITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Subjt:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]0.0e+0051.7Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        M+ FN FI + NL DPPL+NAKFTWSNLR    LSR+DRFL++T+WEN+F  H SK L+R TSDHFPIVLESS ISWGPSPF+  N +LK+P +K N+  
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WW N  Q G+ G+SFM +LKQL+  I+   ++ K   +  K AWIKEID ID+LEAEG+ +E    +R  LKAD+      EAQIW QK KR+WI EGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYT-DNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDG
        N+SFFHKIC+ARQ++ +IS I +  G  C  +  IA AF+ HFE+IY      +   IDNL+W PIS   +  L   F E+EI   L  F+ NK+PGPDG
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYT-DNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDG

Query:  YTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAI
        +TM+F + +W+ +K+ I +IF+DFHS  IINK VN T I LIAKKE C   AD+RPISLTT+IYKLIAK +A+RLK TLP T++E+QMAFVKGRQI +AI
Subjt:  YTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAI

Query:  LIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ------------------
        L+ANEA+D+WR KK +GFVIKLDIEKAFDKLNWRFIDF+LMKK Y  KWR  I +CISSVQYS++INGRPRG+I+PSRGIRQ                  
Subjt:  LIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ------------------

Query:  ---------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG
                                         ED +  + NLK I++LF+ ASGL+INL+KSTI PINV   R + IA  WGIS   LP +YLG+PLGG
Subjt:  ---------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG

Query:  KPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHS
        K  +  FW NV +KI KKL+SWKYS LSKGG+ITLI S+L SLP YQ+S+FKVP    + IE +WRNFLW      H + L+ W +I S KEKGGLGI  
Subjt:  KPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHS

Query:  VNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFAL
        +  TNF+LL KWLW+++ E  PLWK++I +KY     G  P     SS+ SPW ++ + + WF +++SWK+ +G   SFW  +W+ N+PLS   PRL+AL
Subjt:  VNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFAL

Query:  STNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFH-PNLYKTLWKVEFPKK
        STNK+ S++D WN +  DW L+  R LR+ E  LW  +K SL       G   P+W LNSN ++  ASVK++L +   +  +F   N +K LWK   PKK
Subjt:  STNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFH-PNLYKTLWKVEFPKK

Query:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL
        C FFIWTL++  +NTA++L KRLPN    P+WC MC ++ ED  HLFI CP ++ +W    + L+ N      + L   +CS    T+K +I FNT A+ 
Subjt:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL

Query:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAF
        LW IWLERN RIF  + K   E+WEDI A  GLW+ +S LFSNY   SIALN++AF
Subjt:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAF

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein0.0e+0051.7Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        M+ FN FI + NL DPPL+NAKFTWSNLR    LSR+DRFL++T+WEN+F  H SK L+R TSDHFPIVLESS ISWGPSPF+  N +LK+P +K N+  
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WW N  Q G+ G+SFM +LKQL+  I+   ++ K   +  K AWIKEID ID+LEAEG+ +E    +R  LKAD+      EAQIW QK KR+WI EGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYT-DNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDG
        N+SFFHKIC+ARQ++ +IS I +  G  C  +  IA AF+ HFE+IY      +   IDNL+W PIS   +  L   F E+EI   L  F+ NK+PGPDG
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYT-DNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDG

Query:  YTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAI
        +TM+F + +W+ +K+ I +IF+DFHS  IINK VN T I LIAKKE C   AD+RPISLTT+IYKLIAK +A+RLK TLP T++E+QMAFVKGRQI +AI
Subjt:  YTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAI

Query:  LIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ------------------
        L+ANEA+D+WR KK +GFVIKLDIEKAFDKLNWRFIDF+LMKK Y  KWR  I +CISSVQYS++INGRPRG+I+PSRGIRQ                  
Subjt:  LIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ------------------

Query:  ---------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG
                                         ED +  + NLK I++LF+ ASGL+INL+KSTI PINV   R + IA  WGIS   LP +YLG+PLGG
Subjt:  ---------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG

Query:  KPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHS
        K  +  FW NV +KI KKL+SWKYS LSKGG+ITLI S+L SLP YQ+S+FKVP    + IE +WRNFLW      H + L+ W +I S KEKGGLGI  
Subjt:  KPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHS

Query:  VNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFAL
        +  TNF+LL KWLW+++ E  PLWK++I +KY     G  P     SS+ SPW ++ + + WF +++SWK+ +G   SFW  +W+ N+PLS   PRL+AL
Subjt:  VNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFAL

Query:  STNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFH-PNLYKTLWKVEFPKK
        STNK+ S++D WN +  DW L+  R LR+ E  LW  +K SL       G   P+W LNSN ++  ASVK++L +   +  +F   N +K LWK   PKK
Subjt:  STNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFH-PNLYKTLWKVEFPKK

Query:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL
        C FFIWTL++  +NTA++L KRLPN    P+WC MC ++ ED  HLFI CP ++ +W    + L+ N      + L   +CS    T+K +I FNT A+ 
Subjt:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL

Query:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAF
        LW IWLERN RIF  + K   E+WEDI A  GLW+ +S LFSNY   SIALN++AF
Subjt:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAF

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein0.0e+0087.98Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        M+RFN+FI+NCNL DPPL+NAK+TWSNLRAQATLSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYKKNIEF
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WWGNTSQPGYAGYSFM RLKQLAL IK WG++KKGK+EASKKA IKEID IDKLEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIW+HEGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY
        NSSFFHKICTARQKKCLISKIINNSGQNCL DSDIADAFIQHFE+IYTDNRN+ LFI+NLDWCPISN NS+LLDKPFNE EIW TLK FAKNKAPGPDGY
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY

Query:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL
         MDFLQKSW+FMKQNICDIFKDFHSTHIINKVVNETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGRQITEAIL
Subjt:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL

Query:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------
        IANEALDFWR+KKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYS+LINGRPRGRIKPSRGIRQ                   
Subjt:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------

Query:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
                                          EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
Subjt:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG

Query:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIH
        G+PSS+ FWDNVLQKIQKKLS+WKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKEKGGLGIH
Subjt:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIH

Query:  SVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA
        SVNSTNF+LLCKWLWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSPWKAVTECISWF+KNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA
Subjt:  SVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFA

Query:  LSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKK
        LSTNKKGSVK+FWNPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPL NRG PKPLW LNSNNIFDTASVKR+++EA  SPANFHPNLYKTLWKVEFPKK
Subjt:  LSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKK

Query:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL
        CKFFIWTLIHGCINTADRLQKRLPNW L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKAKALLNWN TP DVQSL+QNICSLNI  QKGLITFNT AT+
Subjt:  CKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATL

Query:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV
        LWKIWLERNNRIFKQQ K  Q+LWED LAQ GLWSCKSKLFSNYDCCSIALNISAFV
Subjt:  LWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein0.0e+0085.76Show/hide
Query:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLALKI
        NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESSSISWGP PFRFTNAYLKDPDYK+NIEFWWGNTSQPG+AGYSFM RLKQLA+KI
Subjt:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLALKI

Query:  KAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSG
        KAWGKEKKGKDE SKKAWIKEI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIW+HEGDENSSFFHKICTARQKKCLISK+INN G
Subjt:  KAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSG

Query:  QNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHST
        QNCL DSDI DAFIQHFEEIYTDN+N+ LFIDNLDWCPISNTN  LLDKPFNE EIW TLK F KNKAPGPDG+TMDFLQKSW+FMK NICDIFKDFHS 
Subjt:  QNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHST

Query:  HIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA
        H INKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE QMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA
Subjt:  HIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------------------
        FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYS+LINGRPRGRIKP+RGIRQ                                             
Subjt:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------------------

Query:  --------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYS
                ED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSS+ FWDN+LQKIQKKLSSWKYS
Subjt:  --------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYS

Query:  QLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWK
        QLSKGGRITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIHSV+STNF+LLCKWLWKFLTEK+PLWK
Subjt:  QLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWK

Query:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR
        RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWF+KNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKKGSVKD WNPS  DW++H+NR
Subjt:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR

Query:  PLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW
        PLRDHE+NLWHNIKASLPTPL +RG  KPLWKLNSNNIFDTAS+K+ LSEASASP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Subjt:  PLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW

Query:  ALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQQGKDFQELWED
         L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKA+ALL WN TPNDV+SL QNICSLNI TQKGLITFNTIA LLWKIWLERNNRIFKQQ K+FQ+LWED
Subjt:  ALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQQGKDFQELWED

Query:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein0.0e+0084.84Show/hide
Query:  MHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKK
        M RLKQLA+KIKAWGKEKKGKDE SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+WAQKCKRIW+HEGDENSSFFHKICT RQKK
Subjt:  MHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKK

Query:  CLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQN
        CLISK+INN GQNCL DSDI DAFIQHFEEIYTDN+N+ LFIDN DWCPISNTN  LLDKPFNE EIW TLK F KNKAPGPDG+TMDFLQKSW+FMK N
Subjt:  CLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQN

Query:  ICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER
        ICDIFKDFHS H INKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLIAK LADRLKQTLPDTISE QMAFVKGRQITEAILIANEALDFWRNKKER
Subjt:  ICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER

Query:  GFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ----------------------------------
        GFVIKLDIEKAFDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYS+LINGRPRGRIKP+RGIRQ                                  
Subjt:  GFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ----------------------------------

Query:  -------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQK
                           ED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSS+ FWDN+LQK
Subjt:  -------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQK

Query:  IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLW
        IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNF+LLCKWLW
Subjt:  IQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLW

Query:  KFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNP
        KFLTEK+PLWKRLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWF+KNI WKVNDGEDISFWLDNWNGN+PLSLAVPRLFALSTNKKGSVKD WNP
Subjt:  KFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNP

Query:  SSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINT
        S  DW++H+NRPLRDHE+NLWHNIKASLPTPL +RG  KPLWKLNSNNIFDTAS+K+ LSEASASP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINT
Subjt:  SSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINT

Query:  ADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ
        ADRLQKRLPNW L+PNWCYMCNKSQEDINHLFIHCPYSQ+LWSKA+ALL WN TPNDV+SL QNICSLNI TQKGLITFNTIA LLWKIWLERNNRIFKQ
Subjt:  ADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ

Query:  QGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        Q K+FQ+LWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  QGKDFQELWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

A0A5D3BJP3 LINE-1 retrotransposable element ORF2 protein0.0e+0085.45Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF
        MKRFNTFI+NCNL DPPLTNAKFTWSNLRAQATLSRLDRFLFST WENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYK+NIEF
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEF

Query:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE
        WWGNTSQPG+AGYSFMHRLKQLA+KIKAWG+EKKGKDEASKKAWIKEIDLI+KLEAEG++TEIHREKRIALKADLSQITLTEAQIWAQKCKRIW+HEGDE
Subjt:  WWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDE

Query:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY
        NSSFFHKICTARQKKCLISKIIN  GQNCL DSDI DAFIQHFEEIYTDNRN+HLFIDNLDWCPISNTNS LLDKPFNE EIW TLK FAKNKAPGPDG+
Subjt:  NSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGY

Query:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL
        TMDFLQKSW+FMKQNICDIFKDFHS H INKVVNETLIT IAKKENCETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGRQITEAIL
Subjt:  TMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAIL

Query:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------
        IANEALD WRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYS+LINGRPRGRIKPSRGIRQ                   
Subjt:  IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ-------------------

Query:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG
                                          ED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISKGHLPTSYLGMPLG
Subjt:  ----------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLG

Query:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK
        GKPSS+ FWDNVLQKIQKKLSSWKYSQLSKG RITLINSTLESLPIYQ+SVFKVPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Subjt:  GKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.0e-2522.67Show/hide
Query:  SKVLTRTTSDHFPIVLE--------SSSISWGPSPFRFTNAYLKDPDYKKNIEFWW------GNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEAS
        ++++T   SDH  I LE        S S +W  +     N Y    + K  I+ ++        T Q  +  +  + R K +AL      +E+   D  +
Subjt:  SKVLTRTTSDHFPIVLE--------SSSISWGPSPFRFTNAYLKDPDYKKNIEFWW------GNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEAS

Query:  KKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKC--KRIWIHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIAD
         +  +KE++  ++  ++ S     R++   ++A+L +I   E Q   QK    R W  E  ++      ++   +++K  I  I N+ G      ++I  
Subjt:  KKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKC--KRIWIHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIAD

Query:  AFIQHFEEIYTDNRNN----HLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVV
           ++++ +Y +   N      F+D      ++    + L++P    EI   +      K+PGPDG+T +F Q+    +   +  +F+      I+    
Subjt:  AFIQHFEEIYTDNRNN----HLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVV

Query:  NETLITLIAKKENCETVAD-FRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQ----ITEAILIANEALDFWRNKKERGFVIKLDIEKAFD
         E  I LI K     T  + FRPISL     K++ K LA+R++Q +   I   Q+ F+ G Q    I ++I   N      R K +   +I +D EKAFD
Subjt:  NETLITLIAKKENCETVAD-FRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQ----ITEAILIANEALDFWRNKKERGFVIKLDIEKAFD

Query:  KLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLING------------RPRGRIKP----------SRGIRQEDR-----------------DDYV-
        K+   F+   L K      + K+I +       ++++NG            R    + P          +R IRQE                   DD + 
Subjt:  KLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLING------------RPRGRIKP----------SRGIRQEDR-----------------DDYV-

Query:  ---------SNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSAKFWDNVLQKIQKKLSSWKYSQLSK
                  NL  ++  F   SG  IN+ KS  F  N        I      +       YLG+ L    K    + +  +L++I++  + WK    S 
Subjt:  ---------SNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSAKFWDNVLQKIQKKLSSWKYSQLSK

Query:  GGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWN
         GRI ++   +    IY+ +    K+P     ++E +   F+WN
Subjt:  GGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWN

P08548 LINE-1 reverse transcriptase homolog5.6e-2721.85Show/hide
Query:  NTFITNCNLTD------PPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLE---SSSISWGPSPFRFTNAYLKD----
        N+ I + +LTD      P  T   F  S   A  T S++D  L   H  N+      +++    SDH  I +E   + ++      ++  N  LKD    
Subjt:  NTFITNCNLTD------PPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLE---SSSISWGPSPFRFTNAYLKD----

Query:  PDYKKNI-EFWWGNTSQPGYAGYSFMHRLKQLALK-----IKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIA-LKADLSQITLTEAQ
         + KK I +F   N +Q     Y  +    +  L+     ++A+ K+ + ++  +    +K+++     + E S  +  R K I  ++A+L++I      
Subjt:  PDYKKNI-EFWWGNTSQPGYAGYSFMHRLKQLALK-----IKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIA-LKADLSQITLTEAQ

Query:  IWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDN-LDWC---PISNTNSDLLDKPFNED
            K K  +  + ++       +   ++ K LIS I N + +     S+I     ++++++Y+    N   ID  L+ C    +S    ++L++P +  
Subjt:  IWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDN-LDWC---PISNTNSDLLDKPFNED

Query:  EIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLP
        EI  T++   K K+PGPDG+T +F Q     +   + ++F++     I+     E  ITLI K     T   ++RPISL     K++ K L +R++Q + 
Subjt:  EIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLP

Query:  DTISESQMAFVKGRQ----ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKP
          I   Q+ F+ G Q    I ++I +  + ++  +NK     ++ +D EKAFD +   F+   L K      + K+I +  S    ++++NG        
Subjt:  DTISESQMAFVKGRQ----ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKP

Query:  SRGIRQ-------------------------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSI
          G RQ                                                 E+  D  + L  ++  + + SG  IN  KS  F         K++
Subjt:  SRGIRQ-------------------------------------------------EDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSI

Query:  ADSWGISKGHLPTSYLGMPL--GGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWNGTS
         DS   +       YLG+ L    K    + ++ + ++I + ++ WK    S  GRI ++  ++    IY  +    K P    + +E    +F+WN   
Subjt:  ADSWGISKGHLPTSYLGMPL--GGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWNGTS

Query:  NGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCK--WLWKFLTEKDPLWKRL
               I    + +  + GG+ +  +     S++ K  W W    E D +W R+
Subjt:  NGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCK--WLWKFLTEKDPLWKRL

P0C2F6 Putative ribonuclease H protein At1g657501.1e-3524.65Show/hide
Query:  MPLGGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGG
        MP+  K  +   +  +L+++  ++S W+   LS  GR+TL  + L S+P++ MS   +P+ I  +++   R FLW  T+      L++W+++ SPK++GG
Subjt:  MPLGGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGG

Query:  LGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECI-SWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAV
        LG+ +  S N +L+ K  W+ L EK+ LW  ++  KY   ++          S +S W+++   +       + W   DG+ I FW D W    PL L +
Subjt:  LGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECI-SWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAV

Query:  PRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKV
              +       KD W P    W      P   +   L   ++A +   L+     +  WK + +  F   S    L+       N   + +  LWKV
Subjt:  PRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKV

Query:  EFPKKCKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGL--IT
          P++ K F+W + +  + T +   +R  + +   N C +C    E + H+   CP    +W +   ++   R        +      N+  + G   I 
Subjt:  EFPKKCKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGL--IT

Query:  FNTI-ATLLWKIWLERNNRIFKQQGK
        ++TI A ++W  W  R   IF +  K
Subjt:  FNTI-ATLLWKIWLERNNRIFKQQGK

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-2722.6Show/hide
Query:  KIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEG-DENSSFFHKICTARQKKCLISKIIN
        K+ A    KK ++ A   +    +  ++K EA  S     R++ I L+ +++Q+  T   I      R W  E  ++      ++    + K LI+KI N
Subjt:  KIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEG-DENSSFFHKICTARQKKCLISKIIN

Query:  NSGQNCLKDSDIADAFIQHFEEIYTDNRNN----HLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDI
          G       +I +     ++ +Y+    N      F+D      ++    D L+ P +  EI   +      K+PGPDG++ +F Q    F +  I  +
Subjt:  NSGQNCLKDSDIADAFIQHFEEIYTDNRNN----HLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDI

Query:  FKDFHSTHIINKVVN---ETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER
         K FH   +   + N   E  ITLI K +   T + +FRPISL     K++ K LA+R+++ +   I   Q+ F+ G Q    I  +   + +    K++
Subjt:  FKDFHSTHIINKVVN---ETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKER

Query:  G-FVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------
           +I LD EKAFDK+   F+  VL +      +  MI +  S    ++ +NG     I    G RQ                                 
Subjt:  G-FVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQ---------------------------------

Query:  ---------EDRDDYVSN-------LKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG--KPSSAKFWDNVLQKI
                 +D   Y+S+       L  +++ F    G  IN +KS  F         K I ++   S       YLG+ L    K    K + ++ ++I
Subjt:  ---------EDRDDYVSN-------LKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG--KPSSAKFWDNVLQKI

Query:  QKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWL
        ++ L  WK    S  GRI ++   +    IY+ +    K+P     ++E +   F+WN        SL++       +  GG+ +  +     +++ K  
Subjt:  QKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSV--FKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWL

Query:  WKFLTEKD-PLWKRL
        W +  ++    W R+
Subjt:  WKFLTEKD-PLWKRL

P14381 Transposon TX1 uncharacterized 149 kDa protein4.0e-2522.16Show/hide
Query:  FTWSNLR-AQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSP--FRFTNAYLKDPDYKKNIEFWW--GNTSQPGYAGYSFMH
        FT+  +R    + SR+DR   S+H   +    +S +     SDH  + L  S     P    + F N+ L+D  + K++   W      Q  +A  +   
Subjt:  FTWSNLR-AQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSP--FRFTNAYLKDPDYKKNIEFWW--GNTSQPGYAGYSFMH

Query:  RLKQLALKI--KAWGKEKKGKDEASKKAWIKEI-DLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQK
         + ++ LK+  + + K   G+  A  +A   E+ DL  +L   GS  +  + + +  K  L  +   +A+    + +   + + D  S FF+ +   +  
Subjt:  RLKQLALKI--KAWGKEKKGKDEASKKAWIKEI-DLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQK

Query:  KCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNL-DWCP-ISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFM
        +  I+ +    G        I D     ++ +++ +  +    + L D  P +S    + L+ P   DE+   L+    NK+PG DG T++F Q  W  +
Subjt:  KCLISKIINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNL-DWCP-ISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFM

Query:  KQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNK
          +   +  +      +       +++L+ KK +   + ++RP+SL +  YK++AKA++ RLK  L + I   Q   V GR I + + +  + L F R  
Subjt:  KQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNK

Query:  KERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQE----------DRDDYVSNL-KMILHLFESA
              + LD EKAFD+++ +++   L   ++  ++   + +  +S +  V IN      +   RG+RQ             + ++  L K +  L    
Subjt:  KERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQE----------DRDDYVSNL-KMILHLFESA

Query:  SGLNINLSKSTIFPINVPTD---------------RAKSIADSWGISKGHLPTS---------------------YLGMPLGGK--PSSAKFWDNVLQKI
          + + LS      I V  D                A S   +W  S G L  S                     YLG+ L  +  P S  F + + + +
Subjt:  SGLNINLSKSTIFPINVPTD---------------RAKSIADSWGISKGHLPTS---------------------YLGMPLGGK--PSSAKFWDNVLQKI

Query:  QKKLSSWK--YSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWL
          +L  WK     LS  GR  +IN  + S   Y++      +    KI+    +FLW G    H +S         P ++GG G+  + S   +   + +
Subjt:  QKKLSSWK--YSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWL

Query:  WKFL-TEKDPLWKRLIISKYDQ
         ++L  +  P W  L  S Y Q
Subjt:  WKFL-TEKDPLWKRLIISKYDQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.5e-2424.27Show/hide
Query:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSR-LDRFLFSTHWENIFPGHTSKVLTRTTSDHFP-IVLESSSISWGPSPFRFTNAYLKDPDYKKNI
        ++ F   + + +L D P     +TWSN +    + R LDR + +  W + FP   +       SDH P I++  +        FR+ +     P +  ++
Subjt:  MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSR-LDRFLFSTHWENIFPGHTSKVLTRTTSDHFP-IVLESSSISWGPSPFRFTNAYLKDPDYKKNI

Query:  EFWWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEG
           W      G   +S    LK      K   ++  G  +   K  +  ++ I        +  + R + +A K   +         + QK +  W+ +G
Subjt:  EFWWGNTSQPGYAGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEG

Query:  DENSSFFHKICTARQKKCLISKIINNSG---QNCLKDSDIADAFIQHF----EEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAK
        D N+ FFHK+  A Q K LI  +  +     +N  +  ++  A+  H      +I T +      I ++     ++T +  L    ++ EI   +    +
Subjt:  DENSSFFHKICTARQKKCLISKIINNSG---QNCLKDSDIADAFIQHF----EEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAK

Query:  NKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLI
        NKAPGPD +T +F  +SW  +K +     K+F  T  + K  N T ITLI K    + ++ FRP+S  T +YK+I
Subjt:  NKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLI

AT2G02650.1 Ribonuclease H-like superfamily protein7.5e-1125.17Show/hide
Query:  LSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALL-NWNRTPND
        L E +  P      + + +WK+    K K F+W  + G + T  RL+ R  N   +P  C  C   +E I+H+  +CPY+Q +W  A  ++ N    P+ 
Subjt:  LSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALNPNWCYMCNKSQEDINHLFIHCPYSQKLWSKAKALL-NWNRTPND

Query:  VQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ--QGKDFQ
         +  +  +  L+ +     +       ++W++W  RN  +F+Q  Q  D++
Subjt:  VQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQ--QGKDFQ

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-2724.87Show/hide
Query:  IADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGH
        I  S+  + G LP  YLG+PL  K  +   +  +++KI+ ++  W    LS  GR+ LI+S + SL  + MS F++P    ++I++   +FLW+G     
Subjt:  IADSWGISKGHLPTSYLGMPLGGKPSSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGH

Query:  NISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDI
          + + W+ + +PK++GGLGI S+   N              K   W                   G  +  +  WK + +  +     +   +++G + 
Subjt:  NISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDI

Query:  SFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHIN-RPLRDHEQNLW--HNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLS
        SFW DNW+        + RL  + T  +G +       ++     +N RP R     L    ++ A +    L  G     WK N  +IF      +   
Subjt:  SFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHIN-RPLRDHEQNLW--HNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLS

Query:  EASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW-ALNPNWCYMCNKSQEDINHLFIHCPYSQKL
         A+  P     N YK +W      K     W  I   + T DR+     +W A   + C +C+   E  +HLF  CPYS ++
Subjt:  EASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW-ALNPNWCYMCNKSQEDINHLFIHCPYSQKL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.7e-0731.4Show/hide
Query:  LADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKK--ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA
        + +RLK  + + I  +Q +F+ GR  T+ I+   EA+   R KK  +   ++KLD+EKA+D++ W +++  L+   + + W   IA
Subjt:  LADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKK--ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.4e-1126.39Show/hide
Query:  SLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKE-KGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRF
        +LP+Y MS F++ K + +K+ ++   F W+   N   IS + W ++   KE  GGLG   +   N +LL K  ++ + +   L  RL+ S+Y        
Subjt:  SLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKE-KGGLGIHSVNSTNFSLLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRF

Query:  PSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNW
         S G  +  +  W+++        + +   + DG     WLD W
Subjt:  PSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGATTCAACACTTTCATTACCAATTGTAATCTGACTGATCCTCCCCTCACCAATGCAAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACT
GGACAGATTTCTTTTCTCGACCCATTGGGAAAATATTTTCCCAGGCCATACTTCAAAAGTGTTAACCCGAACTACTTCAGACCATTTTCCCATCGTTCTTGAGTCGTCTT
CGATCTCTTGGGGACCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGACCCAGACTACAAGAAAAACATAGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTAT
GCAGGTTATTCCTTTATGCACAGACTAAAGCAGTTGGCTCTGAAAATCAAAGCTTGGGGAAAAGAAAAAAAAGGAAAAGATGAAGCTTCAAAAAAGGCCTGGATCAAAGA
AATCGACCTAATTGATAAGCTAGAGGCTGAAGGATCTGCAACAGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTCTCCCAAATTACTCTCACTGAAGCTC
AAATATGGGCCCAAAAATGCAAAAGAATATGGATCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAAAAGTGTTTGATCTCCAAG
ATTATAAACAACAGTGGACAGAATTGCCTAAAGGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAGAAACAACCATTTGTTTAT
TGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGATGAAATTTGGCACACTTTAAAGCCCTTTGCAAAGAATAAAG
CTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGGCTTTTATGAAGCAAAACATTTGTGATATCTTCAAAGATTTTCATAGCACCCATATCATCAAT
AAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGCGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGCTATCTACAAATTAATCGC
AAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGACACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGCAGACAAATTACAGAGGCCATTCTTATTGCAAATG
AAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGCTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTCATAGACTTTGTGCTTATG
AAAAAGAACTACTCTCAGAAATGGAGGAAAATGATTGCCAGCTGCATCTCTAGTGTCCAATACTCTGTTCTTATCAATGGTAGACCGAGAGGCAGAATCAAACCTTCTAG
AGGAATCCGACAGGAGGATAGGGATGACTACGTATCAAACCTCAAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCT
TTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGACAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCT
TCATCAGCAAAATTCTGGGACAATGTACTTCAGAAAATCCAGAAAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGTGGCAGAATCACTCTGATAAACTCAAC
TCTTGAAAGCCTCCCCATATATCAAATGTCGGTCTTCAAGGTCCCCAAAGGTATAGCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATG
GACACAACATTAGCCTCATCAGATGGAACCAAATTGTTTCCCCAAAAGAGAAAGGAGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTTCCCTTCTCTGTAAATGG
CTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCAAATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAA
TAACAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCCATAAAAACATCAGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATG
GAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAAAAAGGGGTCTGTTAAAGATTTTTGGAATCCTTCATCTAATGACTGGCATCTCCAT
ATCAATCGGCCCCTTCGTGACCATGAACAAAACTTGTGGCACAATATTAAAGCCTCTCTTCCAACTCCCTTACTGAATAGGGGGCTTCCAAAACCATTATGGAAACTAAA
TTCAAACAACATATTCGATACCGCGTCCGTAAAAAGGAGCCTATCTGAAGCTTCAGCCTCTCCAGCTAACTTTCATCCAAATCTCTACAAAACTCTGTGGAAGGTGGAAT
TTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCACGGTTGCATTAATACAGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAACCCCAACTGGTGT
TATATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCTACAGTCAGAAGTTATGGAGTAAGGCCAAAGCTCTCCTCAATTGGAATAGAACTCC
AAATGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTCAACATAAGTACTCAAAAAGGGCTGATAACATTCAATACCATTGCTACCCTCCTTTGGAAGATTTGGCTGG
AAAGAAACAATAGAATCTTCAAACAACAGGGAAAAGATTTTCAAGAGCTTTGGGAAGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAAT
TATGATTGTTGCTCCATAGCGTTAAATATCTCTGCTTTTGTAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGATTCAACACTTTCATTACCAATTGTAATCTGACTGATCCTCCCCTCACCAATGCAAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACT
GGACAGATTTCTTTTCTCGACCCATTGGGAAAATATTTTCCCAGGCCATACTTCAAAAGTGTTAACCCGAACTACTTCAGACCATTTTCCCATCGTTCTTGAGTCGTCTT
CGATCTCTTGGGGACCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGACCCAGACTACAAGAAAAACATAGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTAT
GCAGGTTATTCCTTTATGCACAGACTAAAGCAGTTGGCTCTGAAAATCAAAGCTTGGGGAAAAGAAAAAAAAGGAAAAGATGAAGCTTCAAAAAAGGCCTGGATCAAAGA
AATCGACCTAATTGATAAGCTAGAGGCTGAAGGATCTGCAACAGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTCTCCCAAATTACTCTCACTGAAGCTC
AAATATGGGCCCAAAAATGCAAAAGAATATGGATCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAAAAGTGTTTGATCTCCAAG
ATTATAAACAACAGTGGACAGAATTGCCTAAAGGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAGAAACAACCATTTGTTTAT
TGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGATGAAATTTGGCACACTTTAAAGCCCTTTGCAAAGAATAAAG
CTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGGCTTTTATGAAGCAAAACATTTGTGATATCTTCAAAGATTTTCATAGCACCCATATCATCAAT
AAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGCGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGCTATCTACAAATTAATCGC
AAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGACACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGCAGACAAATTACAGAGGCCATTCTTATTGCAAATG
AAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGCTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTCATAGACTTTGTGCTTATG
AAAAAGAACTACTCTCAGAAATGGAGGAAAATGATTGCCAGCTGCATCTCTAGTGTCCAATACTCTGTTCTTATCAATGGTAGACCGAGAGGCAGAATCAAACCTTCTAG
AGGAATCCGACAGGAGGATAGGGATGACTACGTATCAAACCTCAAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCT
TTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGACAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCT
TCATCAGCAAAATTCTGGGACAATGTACTTCAGAAAATCCAGAAAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGTGGCAGAATCACTCTGATAAACTCAAC
TCTTGAAAGCCTCCCCATATATCAAATGTCGGTCTTCAAGGTCCCCAAAGGTATAGCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATG
GACACAACATTAGCCTCATCAGATGGAACCAAATTGTTTCCCCAAAAGAGAAAGGAGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTTCCCTTCTCTGTAAATGG
CTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCAAATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAA
TAACAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCCATAAAAACATCAGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATG
GAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAAAAAGGGGTCTGTTAAAGATTTTTGGAATCCTTCATCTAATGACTGGCATCTCCAT
ATCAATCGGCCCCTTCGTGACCATGAACAAAACTTGTGGCACAATATTAAAGCCTCTCTTCCAACTCCCTTACTGAATAGGGGGCTTCCAAAACCATTATGGAAACTAAA
TTCAAACAACATATTCGATACCGCGTCCGTAAAAAGGAGCCTATCTGAAGCTTCAGCCTCTCCAGCTAACTTTCATCCAAATCTCTACAAAACTCTGTGGAAGGTGGAAT
TTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCACGGTTGCATTAATACAGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAACCCCAACTGGTGT
TATATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCTACAGTCAGAAGTTATGGAGTAAGGCCAAAGCTCTCCTCAATTGGAATAGAACTCC
AAATGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTCAACATAAGTACTCAAAAAGGGCTGATAACATTCAATACCATTGCTACCCTCCTTTGGAAGATTTGGCTGG
AAAGAAACAATAGAATCTTCAAACAACAGGGAAAAGATTTTCAAGAGCTTTGGGAAGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAAT
TATGATTGTTGCTCCATAGCGTTAAATATCTCTGCTTTTGTAAAATAG
Protein sequenceShow/hide protein sequence
MKRFNTFITNCNLTDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGY
AGYSFMHRLKQLALKIKAWGKEKKGKDEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWIHEGDENSSFFHKICTARQKKCLISK
IINNSGQNCLKDSDIADAFIQHFEEIYTDNRNNHLFIDNLDWCPISNTNSDLLDKPFNEDEIWHTLKPFAKNKAPGPDGYTMDFLQKSWAFMKQNICDIFKDFHSTHIIN
KVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLM
KKNYSQKWRKMIASCISSVQYSVLINGRPRGRIKPSRGIRQEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKP
SSAKFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFSLLCKW
LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFHKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLH
INRPLRDHEQNLWHNIKASLPTPLLNRGLPKPLWKLNSNNIFDTASVKRSLSEASASPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALNPNWC
YMCNKSQEDINHLFIHCPYSQKLWSKAKALLNWNRTPNDVQSLVQNICSLNISTQKGLITFNTIATLLWKIWLERNNRIFKQQGKDFQELWEDILAQTGLWSCKSKLFSN
YDCCSIALNISAFVK