CuGenDBv2

IVF0006035 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0006035
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr05:711257..715470
RNA-Seq Expression: IVF0006035
Synteny: IVF0006035
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 92.22
Query:  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNL
        ND+  SIL+     GAFSVSIQVGSNNGA WWLSAIYGPAKRKNRPLFWEELE+LKSIC PTWILGGDFNVIRWKEET+TKNPA LSM+RFN+FISNCNL
Subjt:  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNL

Query:  IDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAG
        IDPPL+NAK+TWSNLRAQATLSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA  
Subjt:  IDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAG

Query:  FENQSL----------GKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
           + L          G+ KKGKNEASKKA IKEID IDKLEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
Subjt:  FENQSL----------GKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ

Query:  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMK
        KKCLISKIINNSGQNCLNDSDIADAFIQHFE+IYTDNRNS LFI+NLDWCPISN NS+LLDKPFNEAEIWLTLKSFAKNKAPGPDGY MDFLQKSWSFMK
Subjt:  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMK

Query:  QNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKK
        QNICDIFKDFHSTH INKVVNETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALDFWR+KK
Subjt:  QNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKK

Query:  ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
        ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
Subjt:  ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV

Query:  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVL
         FSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG+PSSSNFWDNVL
Subjt:  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVL

Query:  QKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW
        QKIQKKLS+WKYSQLSKGGRITLINSTLESLP     +  VPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW
Subjt:  QKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW

Query:  LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW
        LWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVK+FW
Subjt:  LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW

Query:  NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI
        NPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPLPNRG PKPLW LNSNNIFDTASVKR ++EAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI
Subjt:  NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI

Query:  NTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIF
        NTADRLQKRLPNW LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALL WN TPTDVQSL+QNICSLNIRNQKGLITFNT+AT+LWKIWLERNNRIF
Subjt:  NTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIF

Query:  KQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV
        KQQ K  QDLWED LAQ GLWSCKSKLFSNYDCCSIALNISAFV
Subjt:  KQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 88.95
Query:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFEN
        NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESS+ISWGP PFRFTNAYLKDPDYK+NIEFWWGNTSQPG+A             + 
Subjt:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFEN

Query:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG
        ++ GK KKGK+E SKKAWIKEI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISK+INN G
Subjt:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG

Query:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST
        QNCLNDSDI DAFIQHFEEIYTDN+NS LFIDNLDWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS 
Subjt:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST

Query:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA
        HTINKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKA
Subjt:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF
        FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLSPFIFVLAMDYLS LL NLA+K KINGVNF PNLNLTHILF
Subjt:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF

Query:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS
        ADDILIFVED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYS
Subjt:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS

Query:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK
        QLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIHSV+STNFALLCKWLWKFLTEK+PLWK
Subjt:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK

Query:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR
        RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKKGSVKD WNPS  DW++H+NR
Subjt:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR

Query:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW
        PLRDHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Subjt:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW

Query:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED
         LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWED
Subjt:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED

Query:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 89.81
Query:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG
        ++ GK KKGK+E SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+WAQKCKRIWVHEGDENSSFFHKICT RQKKCLISK+INN G
Subjt:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG

Query:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST
        QNCLNDSDI DAFIQHFEEIYTDN+NS LFIDN DWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS 
Subjt:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST

Query:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA
        HTINKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLIAK LADRLKQTLPDTISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKA
Subjt:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF
        FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLS FIFVLAMDYLS LL NLA+K KINGVNF PNLNLTHILF
Subjt:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF

Query:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS
        ADDILIFVED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYS
Subjt:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS

Query:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK
        QLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNFALLCKWLWKFLTEK+PLWK
Subjt:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK

Query:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR
        RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSLAVPRLFALSTNKKGSVKD WNPS  DW++H+NR
Subjt:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR

Query:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW
        PLRDHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Subjt:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW

Query:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED
         LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWED
Subjt:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED

Query:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

TYJ99326.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 91.4
Query:  GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN
        G FSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN
Subjt:  GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN

Query:  LRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQ
        LRAQATLSRLDRFLFST WENIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYK+NIEFWWGNTSQPG+A             + +
Subjt:  LRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQ

Query:  SLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ
        + G+ KKGK+EASKKAWIKEIDLI+KLEAEG++TEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIIN  GQ
Subjt:  SLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ

Query:  NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH
        NCLNDSDI DAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNS LLDKPFNEAEIWLTLKSFAKNKAPGPDG+TMDFLQKSWSFMKQNICDIFKDFHS H
Subjt:  NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH

Query:  TINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAF
        TINKVVNETLIT IAKKENCETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALD WRNKKERGFVIKLDIEKAF
Subjt:  TINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAF

Query:  DKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA
        DKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADK KINGVNF PNLNLTHILFA
Subjt:  DKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA

Query:  DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ
        DDILIFVED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ
Subjt:  DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ

Query:  LSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK
        LSKG RITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Subjt:  LSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK

TYK29577.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 97.07
Query:  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH
        MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH
Subjt:  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH

Query:  EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE
        EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE
Subjt:  EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE

Query:  LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
        LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
Subjt:  LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD

Query:  HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIH
        HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA             + ++ G+ KKGKNEASKKAWIKEIDLIDKLEAEGSATEIH
Subjt:  HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIH

Query:  REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP
        REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP
Subjt:  REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP

Query:  ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
        ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
Subjt:  ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK

Query:  LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK
        LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK
Subjt:  LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein | e value: 0.0e+00 | % identity: 92.13
Query:  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNL
        ND+  SIL+     GAFSVSIQVGSNNGA WWLSAIYGPAKRKNRPLFWEELE+LKSIC PTWILGGDFNVIRWKEET+TKNPA LSM+RFN+FISNCNL
Subjt:  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNL

Query:  IDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA-
        IDPPL+NAK+TWSNLRAQATLSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA  
Subjt:  IDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA-

Query:  ---------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
                     ++ G+ KKGKNEASKKA IKEID IDKLEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
Subjt:  ---------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ

Query:  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMK
        KKCLISKIINNSGQNCLNDSDIADAFIQHFE+IYTDNRNS LFI+NLDWCPISN NS+LLDKPFNEAEIWLTLKSFAKNKAPGPDGY MDFLQKSWSFMK
Subjt:  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMK

Query:  QNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKK
        QNICDIFKDFHSTH INKVVNETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALDFWR+KK
Subjt:  QNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKK

Query:  ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
        ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
Subjt:  ERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV

Query:  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVL
         FSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG+PSSSNFWDNVL
Subjt:  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVL

Query:  QKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW
        QKIQKKLS+WKYSQLSKGGRITLINSTLESLP     +  VPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW
Subjt:  QKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW

Query:  LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW
        LWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVK+FW
Subjt:  LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW

Query:  NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI
        NPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPLPNRG PKPLW LNSNNIFDTASVKR ++EAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI
Subjt:  NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCI

Query:  NTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIF
        NTADRLQKRLPNW LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALL WN TPTDVQSL+QNICSLNIRNQKGLITFNT+AT+LWKIWLERNNRIF
Subjt:  NTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIF

Query:  KQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV
        KQQ K  QDLWED LAQ GLWSCKSKLFSNYDCCSIALNISAFV
Subjt:  KQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFV

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein | e value: 0.0e+00 | % identity: 88.95
Query:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFEN
        NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESS+ISWGP PFRFTNAYLKDPDYK+NIEFWWGNTSQPG+A             + 
Subjt:  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFEN

Query:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG
        ++ GK KKGK+E SKKAWIKEI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISK+INN G
Subjt:  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSG

Query:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST
        QNCLNDSDI DAFIQHFEEIYTDN+NS LFIDNLDWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS 
Subjt:  QNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST

Query:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA
        HTINKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKA
Subjt:  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF
        FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLSPFIFVLAMDYLS LL NLA+K KINGVNF PNLNLTHILF
Subjt:  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILF

Query:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS
        ADDILIFVED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYS
Subjt:  ADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS

Query:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK
        QLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIHSV+STNFALLCKWLWKFLTEK+PLWK
Subjt:  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK

Query:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR
        RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKKGSVKD WNPS  DW++H+NR
Subjt:  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINR

Query:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW
        PLRDHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Subjt:  PLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW

Query:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED
         LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWED
Subjt:  ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED

Query:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
        ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein | e value: 0.0e+00 | % identity: 90.1
Query:  GKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNC
        GK KKGK+E SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+WAQKCKRIWVHEGDENSSFFHKICT RQKKCLISK+INN GQNC
Subjt:  GKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNC

Query:  LNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTI
        LNDSDI DAFIQHFEEIYTDN+NS LFIDN DWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS HTI
Subjt:  LNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTI

Query:  NKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK
        NKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLIAK LADRLKQTLPDTISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK
Subjt:  NKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK

Query:  LNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADD
        LNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLS FIFVLAMDYLS LL NLA+K KINGVNF PNLNLTHILFADD
Subjt:  LNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADD

Query:  ILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLS
        ILIFVED++DYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYSQLS
Subjt:  ILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLS

Query:  KGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLI
        KGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNFALLCKWLWKFLTEK+PLWKRLI
Subjt:  KGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLI

Query:  ISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLR
        ISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSLAVPRLFALSTNKKGSVKD WNPS  DW++H+NRPLR
Subjt:  ISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLR

Query:  DHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALS
        DHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW LS
Subjt:  DHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALS

Query:  PNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILA
        PNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWEDILA
Subjt:  PNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILA

Query:  QTGLWSCKSKLFSNYDCCSIALNISAFVK
        QTGLWSCKSKLFSNYDCCSIALNISAFVK
Subjt:  QTGLWSCKSKLFSNYDCCSIALNISAFVK

A0A5D3BJP3 LINE-1 retrotransposable element ORF2 protein | e value: 0.0e+00 | % identity: 91.4
Query:  GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN
        G FSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN
Subjt:  GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSN

Query:  LRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQ
        LRAQATLSRLDRFLFST WENIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYK+NIEFWWGNTSQPG+A             + +
Subjt:  LRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQ

Query:  SLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ
        + G+ KKGK+EASKKAWIKEIDLI+KLEAEG++TEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIIN  GQ
Subjt:  SLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ

Query:  NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH
        NCLNDSDI DAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNS LLDKPFNEAEIWLTLKSFAKNKAPGPDG+TMDFLQKSWSFMKQNICDIFKDFHS H
Subjt:  NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH

Query:  TINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAF
        TINKVVNETLIT IAKKENCETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALD WRNKKERGFVIKLDIEKAF
Subjt:  TINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAF

Query:  DKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA
        DKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADK KINGVNF PNLNLTHILFA
Subjt:  DKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA

Query:  DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ
        DDILIFVED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ
Subjt:  DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQ

Query:  LSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK
        LSKG RITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Subjt:  LSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK

A0A5D3E0F6 LINE-1 retrotransposable element ORF2 protein | e value: 0.0e+00 | % identity: 97.07
Query:  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH
        MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH
Subjt:  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAH

Query:  EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE
        EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE
Subjt:  EDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEE

Query:  LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
        LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
Subjt:  LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD

Query:  HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIH
        HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA             + ++ G+ KKGKNEASKKAWIKEIDLIDKLEAEGSATEIH
Subjt:  HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIH

Query:  REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP
        REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP
Subjt:  REKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCP

Query:  ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
        ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
Subjt:  ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK

Query:  LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK
        LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK
Subjt:  LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQK

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 6.8e-38 | % identity: 23.48
Query:  ILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLT----NAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLE-
        ++ GDFN      + ST+   +   +  N+ +   +LID   T    + ++T+ +     T S++D  + S     +     ++++T   SDH  I LE 
Subjt:  ILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLT----NAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLE-

Query:  -------SSTISWGPSPFRFTNAYLKDPDYKKNIEFWW-----GNTSQPGYAAGFENQSLGK-----RKKGKNEASK----KAWIKEIDLIDKLEAEGSA
               S + +W  +     N Y    + K  I+ ++      +T+       F+    GK       K K E SK     + +KE++  ++  ++ S 
Subjt:  -------SSTISWGPSPFRFTNAYLKDPDYKKNIEFWW-----GNTSQPGYAAGFENQSLGK-----RKKGKNEASK----KAWIKEIDLIDKLEAEGSA

Query:  TEIHREKRIALKADLSQITLTEAQIWAQKC--KRIWVHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDN----RNS
            R++   ++A+L +I   E Q   QK    R W  E  ++      ++   +++K  I  I N+ G    + ++I     ++++ +Y +        
Subjt:  TEIHREKRIALKADLSQITLTEAQIWAQKC--KRIWVHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDN----RNS

Query:  HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVAD-
          F+D      ++    + L++P   +EI   + S    K+PGPDG+T +F Q+    +   +  +F+       +     E  I LI K     T  + 
Subjt:  HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVAD-

Query:  FRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKG-------RKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYS
        FRPISL     K++ K LA+R++Q +   I   Q+ F+ G       RK    I   N A      K +   +I +D EKAFDK+   F+   L K    
Subjt:  FRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKG-------RKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYS

Query:  QKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLK
          + K+I +       +I++NG+         G RQG PLSP +F + ++ L+R +     +++I G+       +   LFADD+++++E+      NL 
Subjt:  QKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLK

Query:  MILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLIN-STLE
         ++  F   SG  IN+ KS  F  N        I      +       YLG+ L    K      +  +L++I++  + WK    S  GRI ++  + L 
Subjt:  MILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLIN-STLE

Query:  SLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK-GGLGIHSVNSTNFALLCKWLWKFLTEKD-PLWKR
         + Y  N     +P     ++E +   F+WN       I+      I+S K K GG+ +        A + K  W +   +D   W R
Subjt:  SLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK-GGLGIHSVNSTNFALLCKWLWKFLTEKD-PLWKR

P08548 LINE-1 reverse transcriptase homolog    8.3e-44    24.14
Query:  IYGPAKRKNRPLFWEE-LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID------PPLTNAKFTWSNLRAQATLSRLDRFL
        IY P    N P F  E L ++ ++   T I+ GDFN      + S+K   S  +   N+ I + +L D      P  T   F  S   A  T S++D  L
Subjt:  IYGPAKRKNRPLFWEE-LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID------PPLTNAKFTWSNLRAQATLSRLDRFL

Query:  FSTHWENIFPGHTSKVLTRTTSDHFPIVLE--------SSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAGFEN-QSLGKRKKGKNEASKK
           H  N+      +++    SDH  I +E        + T +W  +     + ++ D   K+  +F   N +Q      ++N     K        + +
Subjt:  FSTHWENIFPGHTSKVLTRTTSDHFPIVLE--------SSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAGFEN-QSLGKRKKGKNEASKK

Query:  AWIKEIDLIDKLEAEGSATEIHREKRIALK-ADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKI----------CTARQKKCLISKIINNSGQNCLN
        A++K+ +  +     G   ++ +E+    K +   +IT   A++   + KRI        S FF KI             ++ K LIS I N + +   +
Subjt:  AWIKEIDLIDKLEAEGSATEIHREKRIALK-ADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKI----------CTARQKKCLISKIINNSGQNCLN

Query:  DSDIADAFIQHFEEIYTDNRNSHLFIDN-LDWC---PISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH
         S+I     ++++++Y+    +   ID  L+ C    +S    ++L++P + +EI  T+++  K K+PGPDG+T +F Q     +   + ++F++     
Subjt:  DSDIADAFIQHFEEIYTDNRNSHLFIDN-LDWC---PISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH

Query:  TINKVVNETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKG-------RKITEAILIANEALDFWRNKKERGFVI
         +     E  ITLI K     T   ++RPISL     K++ K L +R++Q +   I   Q+ F+ G       RK    I   N+       K +   ++
Subjt:  TINKVVNETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKG-------RKITEAILIANEALDFWRNKKERGFVI

Query:  KLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNF-SPN
         +D EKAFD +   F+   L K      + K+I +  S    +I++NG          G RQG PLSP +F + M+ L+  +    +++ I G++  S  
Subjt:  KLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNF-SPN

Query:  LNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSSNFWDNVLQKI
        + L+  LFADD+++++E+  D  + L  ++  + + SG  IN  KS  F         K++ DS   +       YLG+ L    K      ++ + ++I
Subjt:  LNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKPSSSNFWDNVLQKI

Query:  QKKLSSWKYSQLSKGGRITLIN-STLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCK--
         + ++ WK    S  GRI ++  S L    Y  N      P    + +E    +F+WN          I    + +  + GG+ +  +     +++ K  
Subjt:  QKKLSSWKYSQLSKGGRITLIN-STLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCK--

Query:  WLWKFLTEKDPLWKRL
        W W    E D +W R+
Subjt:  WLWKFLTEKDPLWKRL

P0C2F6 Putative ribonuclease H protein At1g65750    1.0e-33    24.94
Query:  MPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYISN----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGG
        MP+  K  + + +  +L+++  ++S W+   LS  GR+TL  + L S+P  S     +P+ I  +++   R FLW  T+      L++W+++ SPK++GG
Subjt:  MPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYISN----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGG

Query:  LGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVT----ECISWFYKNISWKVNDGEDISFWLDNWNGNAPLS
        LG+ +  S N AL+ K  W+ L EK+ LW  ++  KY   ++          S +S W+++     + +S     + W   DG+ I FW D W    PL 
Subjt:  LGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVT----ECISWFYKNISWKVNDGEDISFWLDNWNGNAPLS

Query:  LAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTL
        L +      +       KD W P    W      P   +   L   ++A +   L      +  WK + +  F   S   +L+   +   N   + +  L
Subjt:  LAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTL

Query:  WKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSK-----------AKALLKW
        WKV  P++ K F+W + +  + T +   +R      + N C +C    E + H+   CP    +W +           +K+L +W
Subjt:  WKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSK-----------AKALLKW

P11369 LINE-1 retrotransposable element ORF2 protein    5.1e-41    23.3
Query:  IYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID-----PPLTNAKFTWSNLRAQATLSRLDRFLFS
        IY P  R       + L  LK+   P  I+ GDFN     ++ S K   +    +    +   +L D      P T     +S      T S++D  +  
Subjt:  IYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID-----PPLTNAKFTWSNLRAQATLSRLDRFLFS

Query:  THWENIFPGHTSKVLTRTTSDHFPI-VLESSTISWGPSPF--RFTNAYLKDPDYKKNI-----EFWWGNTSQPGYAAGFENQ-------------SLGKR
         H   +      +++    SDH  + ++ ++ I+ G   F  +  N  L D   K+ I     +F   N ++   A  + N              +L   
Subjt:  THWENIFPGHTSKVLTRTTSDHFPI-VLESSTISWGPSPF--RFTNAYLKDPDYKKNI-----EFWWGNTSQPGYAAGFENQ-------------SLGKR

Query:  KKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLN
        KK +  A   +    +  ++K EA  S     R++ I L+ +++Q+  T   I      R W  E  ++      ++    + K LI+KI N  G    +
Subjt:  KKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLN

Query:  DSDIADAFIQHFEEIYTDNRNS----HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH
          +I +     ++ +Y+    +      F+D      ++    D L+ P +  EI   + S    K+PGPDG++ +F Q   +F +  I  + K FH   
Subjt:  DSDIADAFIQHFEEIYTDNRNS----HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTH

Query:  TINKVVN---ETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERG-FVIKLD
            + N   E  ITLI K +   T + +FRPISL     K++ K LA+R+++ +   I   Q+ F+ G +    I  +   + +    K++   +I LD
Subjt:  TINKVVN---ETLITLIAKKENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERG-FVIKLD

Query:  IEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLT
         EKAFDK+   F+  VL +      +  MI +  S    +I +NG     I    G RQG PLSP++F + ++ L+R +     +++I G+       + 
Subjt:  IEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLT

Query:  HILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG--KPSSSNFWDNVLQKIQKKL
          L ADD+++++ D  +    L  +++ F    G  IN +KS  F         K I ++   S       YLG+ L    K      + ++ ++I++ L
Subjt:  HILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGG--KPSSSNFWDNVLQKIQKKL

Query:  SSWKYSQLSKGGRITLIN-STLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFL
          WK    S  GRI ++  + L    Y  N     +P     ++E +   F+WN        SL++       +  GG+ +  +     A++ K  W + 
Subjt:  SSWKYSQLSKGGRITLIN-STLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFL

Query:  TEKD-PLWKRL
         ++    W R+
Subjt:  TEKD-PLWKRL

P14381 Transposon TX1 uncharacterized 149 kDa protein    1.8e-38    23.97
Query:  NGASWWLSAIYGPAKRKNRPLFWEEL----ENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID-----PPLTNAKFTWSNLR-
        +G ++ L  +Y P     R  F+E L    E + S      I+GGDFN      + +       S       I++ +L+D      P T A FT+  +R 
Subjt:  NGASWWLSAIYGPAKRKNRPLFWEEL----ENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLID-----PPLTNAKFTWSNLR-

Query:  AQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSP--FRFTNAYLKDPDYKKNIEFWWGN---------TSQPGYAAGFEN--
           + SR+DR   S+H   +    +S +     SDH  + L  S     P    + F N+ L+D  + K++   W           T    +  G  +  
Subjt:  AQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSP--FRFTNAYLKDPDYKKNIEFWWGN---------TSQPGYAAGFEN--

Query:  ---QSLGKRKKGKNEASKKAWIKEI-DLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKII
           Q   K   G+  A  +A   E+ DL  +L   GS  +  + + +  K  L  +   +A+    + +   + + D  S FF+ +   +  +  I+ + 
Subjt:  ---QSLGKRKKGKNEASKKAWIKEI-DLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKII

Query:  NNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNL-DWCP-ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIF
           G    +   I D     ++ +++ +  S    + L D  P +S    + L+ P    E+   L+    NK+PG DG T++F Q  W  +  +   + 
Subjt:  NNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNL-DWCP-ISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIF

Query:  KDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIK
         +      +       +++L+ KK +   + ++RP+SL +  YK++AKA++ RLK  L + I   Q   V GR I + + +  + L F R        + 
Subjt:  KDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIK

Query:  LDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLN
        LD EKAFD+++ +++   L   ++  ++   + +  +S +  + IN      +   RG+RQG PLS  ++ LA++    LL     KR    V   P++ 
Subjt:  LDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLN

Query:  LTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKST-IFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGK--PSSSNFWDNVLQKIQ
        +    +ADD+++  +D  D +   +    ++ +AS   IN SKS+ +   ++  D          IS       YLG+ L  +  P S NF + + + + 
Subjt:  LTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKST-IFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGK--PSSSNFWDNVLQKIQ

Query:  KKLSSWK--YSQLSKGGRITLINSTLES-----LPYISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWL
         +L  WK     LS  GR  +IN  + S     L  +S   + IA KI+    +FLW G    H +S         P ++GG G+  + S       + +
Subjt:  KKLSSWK--YSQLSKGGRITLINSTLES-----LPYISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWL

Query:  WKFL-TEKDPLWKRLIISKYDQ
         ++L  +  P W  L  S Y Q
Subjt:  WKFL-TEKDPLWKRLIISKYDQ

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    6.8e-25    24.5
Query:  ILGGDFNVIRWKEETSTKNPASLSMK---RFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSR-LDRFLFSTHWENIFPGHTSKVLTRTTSDHFP-IVLE
        IL GDF+ I    +  +    S+ M+    F   + + +L+D P     +TWSN +    + R LDR + +  W + FP   +       SDH P I++ 
Subjt:  ILGGDFNVIRWKEETSTKNPASLSMK---RFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSR-LDRFLFSTHWENIFPGHTSKVLTRTTSDHFP-IVLE

Query:  SSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAGFENQSLGKRKKGKNEASK--------KAWIKEIDLIDKLEAEGS------ATEIHREK
         +        FR+ +     P +  ++   W    +     G    SLG+  K   +  K            K  + +D LE+  S      +  + R +
Subjt:  SSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAGFENQSLGKRKKGKNEASK--------KAWIKEIDLIDKLEAEGS------ATEIHREK

Query:  RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNR-----NSHLFIDNLDW
         +A K   +         + QK +  W+ +GD N+ FFHK+  A Q K LI  +  +      N + + +  + ++  +   +      +S   I ++  
Subjt:  RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNR-----NSHLFIDNLDW

Query:  CPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAI
           ++T +  L    ++ EI   + +  +NKAPGPD +T +F  +SW  +K +     K+F  T  + K  N T ITLI K    + ++ FRP+S  T +
Subjt:  CPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAI

Query:  YKLI
        YK+I
Subjt:  YKLI

AT2G02650.1 Ribonuclease H-like superfamily protein    5.1e-12    25.17
Query:  ASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALL--K
        A+ + +L E  I P      + + +WK+    K K F+W  + G + T  RL+ R  N    P  C  C   +E I+H+  +CPY+Q +W  A  ++  +
Subjt:  ASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALL--K

Query:  WNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQ
        W   P+  +  +  +  L+       +       ++W++W  RN  +F+Q+
Subjt:  WNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQ

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    6.4e-23    22.98
Query:  IADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYI----SNVPKGIAQKIEASWRNFLWNGTSNGH
        I  S+  + G LP  YLG+PL  K  +++ +  +++KI+ ++  W    LS  GR+ LI+S + SL         +P    ++I++   +FLW+G     
Subjt:  IADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYI----SNVPKGIAQKIEASWRNFLWNGTSNGH

Query:  NISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDI
          + + W+ + +PK++GGLGI S+   N              K   W                   G  +  +  WK + +  +     +   +++G + 
Subjt:  NISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDI

Query:  SFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPK----PLWKLNSNNIFDTASVKRIL
        SFW DNW+        + RL  + T  +G +       ++     +N   R H  +    I+  +   + ++GL        WK N +      + K   
Subjt:  SFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPK----PLWKLNSNNIFDTASVKRIL

Query:  SEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW-ALSPNWCYMCNKSQEDINHLFIHCPYSQQL
        + A         N YK +W      K     W  I   + T DR+     +W A + + C +C+   E  +HLF  CPYS ++
Subjt:  SEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW-ALSPNWCYMCNKSQEDINHLFIHCPYSQQL

AT4G29090.1 Ribonuclease H-like superfamily protein    7.0e-22    21.72
Query:  VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNS-
        +PK + ++I +   +F W        +    W+ +   K +GG+G   + + N ALL K +W+ L+  + L  ++  S+Y  +     P      S  S 
Subjt:  VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNS-

Query:  PWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLA-----VPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPL-RDHEKNLWHNIKASLPTP
         WK++        +     V +GEDI  W   W  + P S A     VP     S +    V D  + S  +W   +   L  + E+ L   ++     P
Subjt:  PWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLA-----VPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPL-RDHEKNLWHNIKASLPTP

Query:  LPNRGLPKPLWKLNSNNIFDTAS--------VKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNK
           R L    W   S+  +   S        + +  S   +S  + +P +Y+ +WK +   K + F+W  +   +  A  L  R        + C  C  
Subjt:  LPNRGLPKPLWKLNSNNIFDTAS--------VKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNK

Query:  SQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQN---ICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDL-------WEDIL
         +E +NHL   C +++  W+ +   +       D  S+  N   + +L   N +          LLW++W  RN  +F+ +  ++Q++        E+  
Subjt:  SQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQN---ICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDL-------WEDIL

Query:  AQTGLWSCKSKLFSNYDCC
         +T   SC +K   N   C
Subjt:  AQTGLWSCKSKLFSNYDCC

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    8.6e-12    46.27
Query:  LINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNL-NLTHILFADD
        +ING P+G + PSRG+RQGDPLSP++F+L  + LS L     ++ ++ G+  S N   + H+LFADD
Subjt:  LINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNL-NLTHILFADD


Sequences
CDS sequence
ATGAGTTATGCAAAAATGGAAGCTAAATCAAAAAGGCCTACTCAGTCCATAAAGAAAAAAGTCTATAGAGTCAAGAGCCGAAGCATGGAAAGGGAAACTCCTCAAACAAG
CAGGCAAAAAGATAAAGAAAAAATTGACCCGAATGAGTTTGAACTGGTGGTAGACCTTGGCCACATCTCCCCCCTCTCAGATACTGATTTCTCTTGTCCTGAAAGTCCTT
CGTACATCCCTTCACCAACATCTCCTACTGAGTCAGACATTGTAAAGGACAGCCTGGCCTCTATGATGACTTGTGCTCATGAAGATAGAGAAAAGAAGAAAAAGGAGAAC
TTAAGGGAAGAGACCGAAGATGATGAAGTTAGCTTCAAGAGGAAACTTACAGATTGGCTGAAAGAAAACAACCTCAGACTGGCGGCAGATTTTAATTCACAGTTTAATTC
TGTTACAAATGATAGGATGATTTCTATTTTAAATGGGCCACCAAATGTAGGGGCTTTCTCTGTCTCCATCCAAGTTGGCTCCAACAATGGTGCTTCTTGGTGGCTTTCTG
CCATTTACGGCCCAGCTAAAAGAAAAAATAGGCCTTTATTTTGGGAGGAACTTGAAAATCTAAAATCCATTTGCTTTCCAACCTGGATTCTTGGTGGAGATTTTAACGTT
ATCAGATGGAAGGAGGAGACGTCCACCAAAAATCCAGCCTCGCTAAGCATGAAAAGATTCAACACTTTCATAAGCAATTGTAATCTGATTGATCCTCCCCTCACCAATGC
AAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACTGGACAGATTTCTTTTCTCTACCCATTGGGAAAATATTTTCCCGGGCCATACTTCAAAAGTGT
TAACCCGAACTACTTCAGACCATTTTCCCATTGTTCTCGAGTCGTCTACGATCTCTTGGGGTCCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGATCCAGACTAC
AAGAAAAACATTGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTATGCAGCTGGCTTTGAAAATCAAAGCTTGGGGAAGAGAAAAAAAGGAAAAAATGAAGCTTCTAA
AAAGGCCTGGATCAAAGAAATCGATCTAATTGACAAACTAGAGGCTGAAGGATCTGCAACTGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTTTCCCAAA
TTACTCTCACTGAAGCTCAAATATGGGCCCAAAAATGCAAAAGAATATGGGTCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAA
AAGTGTTTGATCTCCAAGATAATAAACAACAGTGGACAGAATTGCCTAAATGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAG
AAACAGCCATCTGTTTATTGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGCTGAAATTTGGCTCACTTTAAAGT
CTTTTGCAAAGAATAAAGCTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGTCTTTTATGAAGCAAAACATTTGTGATATCTTCAAGGATTTTCAC
AGCACCCATACCATCAATAAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGTGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGC
TATCTACAAATTAATCGCAAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGATACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGAAGAAAAATTACAGAGG
CCATTCTTATTGCAAATGAAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGTTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTC
ATAGACTTTGTGCTTATGAAAAAGAACTACTCCCAGAAATGGAGGAAAATGATTGCCAGTTGCATCTCTAGTGTCCAATACTCTATTCTTATCAATGGTAGACCGAGAGG
CAGAATCAAACCTTCTAGAGGAATCCGACAGGGTGACCCCCTTTCACCCTTCATCTTTGTTTTGGCTATGGACTATCTCAGCCGTCTTTTGAACAACTTAGCAGATAAAA
GAAAAATCAATGGAGTCAATTTCAGTCCCAACCTTAATCTTACCCACATCCTATTTGCGGATGACATCCTCATCTTTGTAGAGGATAGGGATGACTACGTATCAAACCTC
AAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCTTTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGA
CAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCTTCCTCATCAAACTTCTGGGACAATGTGCTTCAGAAAATCCAGA
AAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGCGGCAGAATCACTCTGATAAACTCAACTCTTGAAAGCCTTCCATATATATCAAATGTCCCCAAAGGTATA
GCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATGGCCACAACATTAGCCTCATCAGATGGAACCAAATTGTCTCCCCAAAAGAGAAAGG
AGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTGCCCTCCTCTGTAAATGGCTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCA
AATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAATAATAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCTATAAAAACATC
AGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATGGAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAA
AAAGGGGTCTGTTAAAGATTTTTGGAATCCCTCATCTAATGACTGGCATCTCCATATCAATCGGCCCCTCCGTGACCATGAAAAAAATTTGTGGCACAATATTAAAGCCT
CTCTTCCAACTCCCTTACCGAATAGGGGCCTCCCAAAGCCTTTATGGAAACTAAATTCAAACAACATCTTCGATACCGCTTCCGTAAAAAGGATCCTATCTGAAGCTCCA
ATCTCTCCAGCAAACTTTCATCCTAATCTCTACAAAACTCTGTGGAAGGTGGAGTTTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCATGGTTGCATTAATAC
AGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAGTCCCAACTGGTGTTACATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCT
ATAGTCAGCAGTTATGGAGTAAGGCCAAAGCTCTCCTCAAATGGAATAGAACTCCAACTGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTTAACATAAGAAATCAA
AAAGGGCTGATAACATTCAATACCAGTGCTACCCTCCTTTGGAAGATTTGGCTGGAAAGAAACAATAGAATCTTCAAGCAACAGGGAAAAGATTCTCAAGATCTTTGGGA
AGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAATTATGATTGTTGCTCCATAGCGTTAAACATCTCTGCTTTTGTAAAATAG
mRNA sequence
ATGAGTTATGCAAAAATGGAAGCTAAATCAAAAAGGCCTACTCAGTCCATAAAGAAAAAAGTCTATAGAGTCAAGAGCCGAAGCATGGAAAGGGAAACTCCTCAAACAAG
CAGGCAAAAAGATAAAGAAAAAATTGACCCGAATGAGTTTGAACTGGTGGTAGACCTTGGCCACATCTCCCCCCTCTCAGATACTGATTTCTCTTGTCCTGAAAGTCCTT
CGTACATCCCTTCACCAACATCTCCTACTGAGTCAGACATTGTAAAGGACAGCCTGGCCTCTATGATGACTTGTGCTCATGAAGATAGAGAAAAGAAGAAAAAGGAGAAC
TTAAGGGAAGAGACCGAAGATGATGAAGTTAGCTTCAAGAGGAAACTTACAGATTGGCTGAAAGAAAACAACCTCAGACTGGCGGCAGATTTTAATTCACAGTTTAATTC
TGTTACAAATGATAGGATGATTTCTATTTTAAATGGGCCACCAAATGTAGGGGCTTTCTCTGTCTCCATCCAAGTTGGCTCCAACAATGGTGCTTCTTGGTGGCTTTCTG
CCATTTACGGCCCAGCTAAAAGAAAAAATAGGCCTTTATTTTGGGAGGAACTTGAAAATCTAAAATCCATTTGCTTTCCAACCTGGATTCTTGGTGGAGATTTTAACGTT
ATCAGATGGAAGGAGGAGACGTCCACCAAAAATCCAGCCTCGCTAAGCATGAAAAGATTCAACACTTTCATAAGCAATTGTAATCTGATTGATCCTCCCCTCACCAATGC
AAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACTGGACAGATTTCTTTTCTCTACCCATTGGGAAAATATTTTCCCGGGCCATACTTCAAAAGTGT
TAACCCGAACTACTTCAGACCATTTTCCCATTGTTCTCGAGTCGTCTACGATCTCTTGGGGTCCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGATCCAGACTAC
AAGAAAAACATTGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTATGCAGCTGGCTTTGAAAATCAAAGCTTGGGGAAGAGAAAAAAAGGAAAAAATGAAGCTTCTAA
AAAGGCCTGGATCAAAGAAATCGATCTAATTGACAAACTAGAGGCTGAAGGATCTGCAACTGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTTTCCCAAA
TTACTCTCACTGAAGCTCAAATATGGGCCCAAAAATGCAAAAGAATATGGGTCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAA
AAGTGTTTGATCTCCAAGATAATAAACAACAGTGGACAGAATTGCCTAAATGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAG
AAACAGCCATCTGTTTATTGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGCTGAAATTTGGCTCACTTTAAAGT
CTTTTGCAAAGAATAAAGCTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGTCTTTTATGAAGCAAAACATTTGTGATATCTTCAAGGATTTTCAC
AGCACCCATACCATCAATAAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGTGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGC
TATCTACAAATTAATCGCAAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGATACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGAAGAAAAATTACAGAGG
CCATTCTTATTGCAAATGAAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGTTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTC
ATAGACTTTGTGCTTATGAAAAAGAACTACTCCCAGAAATGGAGGAAAATGATTGCCAGTTGCATCTCTAGTGTCCAATACTCTATTCTTATCAATGGTAGACCGAGAGG
CAGAATCAAACCTTCTAGAGGAATCCGACAGGGTGACCCCCTTTCACCCTTCATCTTTGTTTTGGCTATGGACTATCTCAGCCGTCTTTTGAACAACTTAGCAGATAAAA
GAAAAATCAATGGAGTCAATTTCAGTCCCAACCTTAATCTTACCCACATCCTATTTGCGGATGACATCCTCATCTTTGTAGAGGATAGGGATGACTACGTATCAAACCTC
AAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCTTTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGA
CAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCTTCCTCATCAAACTTCTGGGACAATGTGCTTCAGAAAATCCAGA
AAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGCGGCAGAATCACTCTGATAAACTCAACTCTTGAAAGCCTTCCATATATATCAAATGTCCCCAAAGGTATA
GCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATGGCCACAACATTAGCCTCATCAGATGGAACCAAATTGTCTCCCCAAAAGAGAAAGG
AGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTGCCCTCCTCTGTAAATGGCTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCA
AATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAATAATAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCTATAAAAACATC
AGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATGGAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAA
AAAGGGGTCTGTTAAAGATTTTTGGAATCCCTCATCTAATGACTGGCATCTCCATATCAATCGGCCCCTCCGTGACCATGAAAAAAATTTGTGGCACAATATTAAAGCCT
CTCTTCCAACTCCCTTACCGAATAGGGGCCTCCCAAAGCCTTTATGGAAACTAAATTCAAACAACATCTTCGATACCGCTTCCGTAAAAAGGATCCTATCTGAAGCTCCA
ATCTCTCCAGCAAACTTTCATCCTAATCTCTACAAAACTCTGTGGAAGGTGGAGTTTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCATGGTTGCATTAATAC
AGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAGTCCCAACTGGTGTTACATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCT
ATAGTCAGCAGTTATGGAGTAAGGCCAAAGCTCTCCTCAAATGGAATAGAACTCCAACTGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTTAACATAAGAAATCAA
AAAGGGCTGATAACATTCAATACCAGTGCTACCCTCCTTTGGAAGATTTGGCTGGAAAGAAACAATAGAATCTTCAAGCAACAGGGAAAAGATTCTCAAGATCTTTGGGA
AGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAATTATGATTGTTGCTCCATAGCGTTAAACATCTCTGCTTTTGTAAAATAG
Protein sequence
MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKEN
LREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNV
IRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDY
KKNIEFWWGNTSQPGYAAGFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQK
KCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFH
STHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRF
IDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNL
KMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYISNVPKGI
AQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNI
SWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAP
ISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQ
KGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK