; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0022079 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0022079
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr05:4761025..4767731
RNA-Seq ExpressionIVF0022079
SyntenyIVF0022079
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035247.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.082.54Show/hide
Query:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFRSSAPTKRIRSEI
        MDITPDTL WIRNCFKDLLDTSTTKHFFAE+R EDNCMWVRKTKNKSKTSIT EIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFR SAPTKRI SEI
Subjt:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFRSSAPTKRIRSEI

Query:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP
        RKE VS +SDSFSSDSDSSRKSYAK LSDSSE++NKKRYK+TSDDSSSRRSS+IGFKPFTLSG+SFEKTVI+TRRCFHDDWNRIMFSLRKQSEIAFSYKP
Subjt:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP

Query:  FQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA
        FQADKA LFLNPDHAKLLCSNK ANGWSTVGNYQ    S+ +    S H        W     IP                     V KETMQM+KLIDA
Subjt:  FQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA

Query:  KIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH
        KIKVRYNYIGFVPAS+LITDNQGENFI+TTV P +ARWLVERNVRVHG+F+TKAADEFDQHN LAE YTYNGFQAIP E TRT GDYS  NSDKHS+S H
Subjt:  KIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH

Query:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFEKRWSPRQKTKTK
        TQAKKNNSSESEYDPFDQQLS+RRKEKGK IL+INDQDHGHYSKRSKRISNRKVSFLSPG IQS SSNTE N +GKSLEISTIND FEKRWSPRQK+K K
Subjt:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFEKRWSPRQKTKTK

Query:  LTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS
        LTYRIK DP E  ED KLSLKE GEGSKQMNLSVDMGPISPL+SM+QSENNHG D+ NNQT     K+    E +N T SVKEG D NKS SRST +G S
Subjt:  LTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS

Query:  KDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNGNWW
        KDAKT SELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPV+VSDQN D++GHGPLGDKG I+VLWDDT FKVN+IKVG++SIS+N+L+TNGNWW
Subjt:  KDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNGNWW

Query:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKG
        LTSVYGPYK NDRT LW ELE+LQ+LC+PNWLI GDFNIVRW+ E NAKSLD+RNMANFNNFISVNELIDPPPLNN +TWSNLR+NPTYS LDRFLLSKG
Subjt:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKG

Query:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL
        WEN FGLHTSRT+ER +SDHFPI+LESPQIKWGPCPFRLNNSSL+DKEFQKNF +WWN+SKQ GFPGYAFIQSL SLSK IKEWQHNKVNLYDA +K LL
Subjt:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL

Query:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN
        +EID IDKLE QG MST HHQKRISLKS+LLSIENNQA IWHQR+RQRWNLLGDENN++FHR+CTINQRKN IKSICDP GTSLDSI DISR FISHFQN
Subjt:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN

Query:  IYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKE
        IYTKE+YEEILIDNL+WNPIS   QSELCKPFDE EIKSTIMS SNEKAPGPDGYT+LFYKKHW DLK DLLNV KDFHK GIVNNNVNNTFIALISKKE
Subjt:  IYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKE

Query:  KCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN
        KC+ PSDYRPISLTTSLYKIMAKALANRLKS LPDT+AEN
Subjt:  KCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.098.74Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ+HGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPP LNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLS
        DISRTFISHFQNIYTKE+YEEILIDNLS
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLS

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.091.85Show/hide
Query:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQ
        MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQ
Subjt:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQ

Query:  MAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
        MAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLD+EKAFDKISWSFID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
Subjt:  MAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS

Query:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
        PFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
Subjt:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ

Query:  TKFLPVNYLGVPLGGNPRSRSFGI-------KQLN---------------------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
        TKFLPVNYLGVPLGGNPRSRSF         K+LN                           LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
Subjt:  TKFLPVNYLGVPLGGNPRSRSFGI-------KQLN---------------------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN

Query:  ICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWH
        ICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWH
Subjt:  ICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWH

Query:  NNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNW
        NNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQQTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNW
Subjt:  NNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNW

Query:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTG
        EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ 
Subjt:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTG

Query:  RNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK-KYLKKINR
        RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK K LK  ++
Subjt:  RNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK-KYLKKINR

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.097.57Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
        DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLDIEKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS

Query:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
        FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
Subjt:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGI-------KQLN------------
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSF         K+LN            
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGI-------KQLN------------

Query:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
                       LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSK-KYLKKINR
        SWSSK K LK  ++
Subjt:  SWSSK-KYLKKINR

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.097.57Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
        DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS

Query:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
        FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIF
Subjt:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGI-------KQLN------------
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSF         K+LN            
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGI-------KQLN------------

Query:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
                       LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSK-KYLKKINR
        SWSSK K LK  ++
Subjt:  SWSSK-KYLKKINR

TrEMBL top hitse value%identityAlignment
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein0.0e+0097.57Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
        DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLDIEKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS

Query:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
        FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
Subjt:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFG-------IKQLN------------
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSF         K+LN            
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFG-------IKQLN------------

Query:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
                       LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSK-KYLKKINR
        SWSSK K LK  ++
Subjt:  SWSSK-KYLKKINR

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein0.0e+0091.85Show/hide
Query:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQ
        MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQ
Subjt:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQ

Query:  MAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
        MAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLD+EKAFDKISWSFID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
Subjt:  MAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS

Query:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
        PFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
Subjt:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ

Query:  TKFLPVNYLGVPLGGNPRSRSFGI-------KQLN---------------------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
        TKFLPVNYLGVPLGGNPRSRSF         K+LN                           LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
Subjt:  TKFLPVNYLGVPLGGNPRSRSFGI-------KQLN---------------------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN

Query:  ICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWH
        ICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWH
Subjt:  ICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWH

Query:  NNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNW
        NNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQQTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNW
Subjt:  NNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNW

Query:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTG
        EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ 
Subjt:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTG

Query:  RNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK-KYLKKINR
        RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK K LK  ++
Subjt:  RNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSK-KYLKKINR

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein0.0e+0098.74Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ+HGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPP LNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLS
        DISRTFISHFQNIYTKE+YEEILIDNLS
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLS

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein0.0e+0097.57Show/hide
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFE

Query:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  KRWSPRQKTKTKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
        KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI
Subjt:  KSASRSTAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSI

Query:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
        SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT
Subjt:  SLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPT

Query:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
        YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  YSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
        DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV
Subjt:  DISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWS

Query:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
        FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIF
Subjt:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFG-------IKQLN------------
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSF         K+LN            
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFG-------IKQLN------------

Query:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
                       LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  ---------------LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSK-KYLKKINR
        SWSSK K LK  ++
Subjt:  SWSSK-KYLKKINR

A0A5D3DFM9 LINE-1 retrotransposable element ORF2 protein0.0e+0082.54Show/hide
Query:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFRSSAPTKRIRSEI
        MDITPDTL WIRNCFKDLLDTSTTKHFFAE+R EDNCMWVRKTKNKSKTSIT EIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFR SAPTKRI SEI
Subjt:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKSFLALITFRSSAPTKRIRSEI

Query:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP
        RKE VS +SDSFSSDSDSSRKSYAK LSDSSE++NKKRYK+TSDDSSSRRSS+IGFKPFTLSG+SFEKTVI+TRRCFHDDWNRIMFSLRKQSEIAFSYKP
Subjt:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP

Query:  FQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA
        FQADKA LFLNPDHAKLLCSNK ANGWSTVGNYQ    S+ +    S H        W     IP                     V KETMQM+KLIDA
Subjt:  FQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA

Query:  KIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH
        KIKVRYNYIGFVPAS+LITDNQGENFI+TTV P +ARWLVERNVRVHG+F+TKAADEFDQHN LAE YTYNGFQAIP E TRT GDYS  NSDKHS+S H
Subjt:  KIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH

Query:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFEKRWSPRQKTKTK
        TQAKKNNSSESEYDPFDQQLS+RRKEKGK IL+INDQDHGHYSKRSKRISNRKVSFLSPG IQS SSNTE N +GKSLEISTIND FEKRWSPRQK+K K
Subjt:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFEKRWSPRQKTKTK

Query:  LTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS
        LTYRIK DP E  ED KLSLKE GEGSKQMNLSVDMGPISPL+SM+QSENNHG D+ NNQT     K+    E +N T SVKEG D NKS SRST +G S
Subjt:  LTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS

Query:  KDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNGNWW
        KDAKT SELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPV+VSDQN D++GHGPLGDK GI+VLWDDT FKVN+IKVG++SIS+N+L+TNGNWW
Subjt:  KDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNGNWW

Query:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKG
        LTSVYGPYK NDRT LW ELE+LQ+LC+PNWLI GDFNIVRW+ E NAKSLD+RNMANFNNFISVNELIDPPPLNN +TWSNLR+NPTYS LDRFLLSKG
Subjt:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKG

Query:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL
        WEN FGLHTSRT+ER +SDHFPI+LESPQIKWGPCPFRLNNSSL+DKEFQKNF +WWN+SKQ GFPGYAFIQSL SLSK IKEWQHNKVNLYDA +K LL
Subjt:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL

Query:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN
        +EID IDKLE QG MST HHQKRISLKS+LLSIENNQA IWHQR+RQRWNLLGDENN++FHR+CTINQRKN IKSICDP GTSLDSI DISR FISHFQN
Subjt:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN

Query:  IYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKE
        IYTKE+YEEILIDNL+WNPIS   QSELCKPFDE EIKSTIMS SNEKAPGPDGYT+LFYKKHW DLK DLLNV KDFHK GIVNNNVNNTFIALISKKE
Subjt:  IYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKE

Query:  KCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN
        KC+ PSDYRPISLTTSLYKIMAKALANRLKS LPDT+AEN
Subjt:  KCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.9e-4326.14Show/hide
Query:  KGGILVLWDD-TNFKVNDI---KVGNYSISLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANF
        K G+ +L  D T+FK   I   K G+Y +    +       + ++Y P     R  +   L  LQ     + LI GDFN      + + +    ++    
Subjt:  KGGILVLWDD-TNFKVNDI---KVGNYSISLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANF

Query:  NNFISVNELID----PPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLE--------SPQIKWGPCPFRLNNSSLRD-
        N+ +   +LID      P +  +T+ +   + TYS++D  + SK   +      +  +   +SDH  I LE        S    W     +LNN  L D 
Subjt:  NNFISVNELID----PPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLE--------SPQIKWGPCPFRLNNSSLRD-

Query:  ------KEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQI
              K   K F    N +K   +      Q+L    K +   +   +N Y   +K    +ID +     + E     H K  S + ++  I     +I
Subjt:  ------KEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQI

Query:  WHQRARQRWNLLGDENNSYFHRICTIN----------QRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTK--ESYEEI--LIDNLSWNPISRLCQS
          Q+  Q+ N   +  + +F RI  I+          + KN I +I +  G       +I  T   +++++Y    E+ EE+   +D  +   +++    
Subjt:  WHQRARQRWNLLGDENNSYFHRICTIN----------QRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTK--ESYEEI--LIDNLSWNPISRLCQS

Query:  ELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKAL
         L +P   SEI + I S   +K+PGPDG+T  FY+++  +L   LL +F+   K GI+ N+     I LI K  +  +K  ++RPISL     KI+ K L
Subjt:  ELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKAL

Query:  ANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFV-LKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAP
        ANR++  +   I  +Q+ FI G Q    I  +  VI    + K K  V + +D EKAFDKI   F+   L K      + K I+A       +I+LNG  
Subjt:  ANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKGFV-LKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAP

Query:  KGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPI
              + G RQG PLSP +F + ++ L+R    +  +  IKG+       +   LFADD+++++E+      NL   ++ F K SG   N  KS     
Subjt:  KGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPI

Query:  NISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGIKQLNLSTFKAPVSVYKEIEKHWRD
        N +     QI     F      + YLG+ L       +  +K L    +K  +   KE    W++
Subjt:  NISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGIKQLNLSTFKAPVSVYKEIEKHWRD

P08548 LINE-1 reverse transcriptase homolog4.2e-4127.08Show/hide
Query:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDP----PPLNNNFTWSNLRVNPTYSRLDRFL
        + ++Y P  +N    +   L  + +L     ++ GDFN      + ++K    + + + N+ I   +L D      P    +T+ +   + TYS++D  L
Subjt:  LTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDP----PPLNNNFTWSNLRVNPTYSRLDRFL

Query:  LSKGWENAFGLHTSRTLERNISDHFPILLE---SPQIKWGPCPFRLNNSSLRD--------KEFQK----------NFINWWNSSKQAGFPGYAFIQSLN
          K   N         +    SDH  I +E   +  +      ++LNN  L+D        KE  K          N+ N W+++K A   G  FI    
Subjt:  LSKGWENAFGLHTSRTLERNISDHFPILLE---SPQIKWGPCPFRLNNSSLRD--------KEFQK----------NFINWWNSSKQAGFPGYAFIQSLN

Query:  SLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQ------R
        +L  F+K+ +  +VN    + K L KE                H   + S + ++  I     +I ++R  Q+ N      + +F +I  I++      R
Subjt:  SLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQ------R

Query:  KNLIKSICDPAGTSLDSI----DDISRTFISHFQNIYTKESYEEI--LIDNLSWNPISRLCQSE---LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFY
        K  +KS+        D I     +I +    +++ +Y+   YE +  +   L    + RL Q E   L +P   SEI STI +   +K+PGPDG+T  FY
Subjt:  KNLIKSICDPAGTSLDSI----DDISRTFISHFQNIYTKESYEEI--LIDNLSWNPISRLCQSE---LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFY

Query:  KKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE
        +    +L   LLN+F++  K GI+ N      I LI K  K  ++  +YRPISL     KI+ K L NR++  +   I  +Q+ FI G Q    I  +  
Subjt:  KKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE

Query:  VIDTWKQRKIKG-FVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSH
        VI    + K K   +L +D EKAFD I   F+   L K      + K I+A  S    +I+LNG          G RQG PLSP +F + M+ L+     
Subjt:  VIDTWKQRKIKG-FVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSH

Query:  LESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST--ISPINISAGRTDQIASFFGFQTKFLPVNYLGVPL
        +  + AIKG+   +   I   LFADD+++++E+       L   +  +   SG   N  KS   I   N  A +T  +     F      + YLGV L
Subjt:  LESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST--ISPINISAGRTDQIASFFGFQTKFLPVNYLGVPL

P0C2F6 Putative ribonuclease H protein At1g657501.4e-2826.6Show/hide
Query:  NLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRN
        ++ST   P S+   +++  R FLWG + +K+  HL+ W+   SPK+ GGLG+   K  N+AL+ K  WR   E NSLW   +  KY      D   +   
Subjt:  NLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRN

Query:  SSANSPWNAIK-KWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRI
         S +S W +I    +D     + W+P DG  + FW  +W +  PL L++      ++  +   K++W  G   W+     P      +  +   + L  +
Subjt:  SSANSPWNAIK-KWKDWYESKISWLPNDGSSLSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRI

Query:  HNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNH
           R   + +W  S   +++V SA ++      +P+  N       LW+  +P++ K F+W + +Q + T +   +R+ S S   + C  C+   E M H
Subjt:  HNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNH

Query:  LFIFCPFARSLW
        +   CP    +W
Subjt:  LFIFCPFARSLW

P11369 LINE-1 retrotransposable element ORF2 protein9.7e-3824.96Show/hide
Query:  LQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDP----PPLNNNFTWSNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLE---R
        L++   P+ +I GDFN     ++ + K    R+       +   +L D      P    +T+ +   + T+S++D  +  K      GL+  + +E    
Subjt:  LQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDP----PPLNNNFTWSNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLE---R

Query:  NISDHFPI-LLESPQIKWGPCPF--RLNNSSLRD-------KEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKK--------
         +SDH  + L+ +  I  G   F  +LNN+ L D       K+  K+F+  +N ++   +P         +L   +K +   K+    A+KK        
Subjt:  NISDHFPI-LLESPQIKWGPCPF--RLNNSSLRD-------KEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKK--------

Query:  ALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY---FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTF
        +L   +  ++K E       +  Q+ I L+ ++  +E  +     QR  Q  +   ++ N       R+   ++ K LI  I +  G      ++I  T 
Subjt:  ALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY---FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTF

Query:  ISHFQNIYTK--ESYEEI--LIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNN
         S ++ +Y+   E+ +E+   +D      +++     L  P    EI++ I S   +K+PGPDG++  FY+    DL   L  +F      G + N+   
Subjt:  ISHFQNIYTK--ESYEEI--LIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNN

Query:  TFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKG-FVLKLDIEKAFDKISWS
          I LI K +K  +K  ++RPISL     KI+ K LANR++  +   I  +Q+ FI G Q    I  +  VI    + K K   ++ LD EKAFDKI   
Subjt:  TFIALISKKEK-CSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRKIKG-FVLKLDIEKAFDKISWS

Query:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF
        F+  +L +      +   IKA  S    +I +NG     I  + G RQG PLSP++F + ++ L+R    +  +  IKG+       +   L ADD++++
Subjt:  FIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGIKQLNLSTFKAPVSVYKEIEKHWR
        + D +     L   +  F +  G   N++KS       +     +I     F      + YLGV L       +  +K L    FK+     KE  + W+
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGIKQLNLSTFKAPVSVYKEIEKHWR

Query:  D
        D
Subjt:  D

P14381 Transposon TX1 uncharacterized 149 kDa protein1.2e-3222.02Show/hide
Query:  ISLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPN--WLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELID----PPPLNNNFTWS
        + L +  +   + L +VY P    +R + +  L         +   +I GDFN     R+ N       + +     I+   L+D      P    FT+ 
Subjt:  ISLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPN--WLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELID----PPPLNNNFTWS

Query:  NLR-VNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLE---SPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSL
         +R  + + SR+DR  +S    +     T R      SDH  + L    +P +      +  NNS L D+ F K+  + W   +       AF     +L
Subjt:  NLR-VNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLE---SPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSL

Query:  SKFIKEWQHNKVNL--------------YDANKKALLKEIDIIDKLEFQGEMSTTHHQ----KRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY
        +++   W   KV+L               +A  +AL  E+     L+ +  +S +  Q    + +  K  L ++E  QA+    R+R +     D  + +
Subjt:  SKFIKEWQHNKVNL--------------YDANKKALLKEIDIIDKLEFQGEMSTTHHQ----KRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY

Query:  FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWN---PISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYT
        F+ +      +  I  +    GT L+  + I     S +QN+++ +       + L W+    +S   +  L  P    E+   +    + K+PG DG T
Subjt:  FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWN---PISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYT

Query:  MLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILI
        + F++  W  L  D   V  +  K G +  +     ++L+ KK       ++RP+SL ++ YKI+AKA++ RLKS L + I  +Q   + GR I D + +
Subjt:  MLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILI

Query:  ANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLL
          +++   ++  +    L LD EKAFD++   ++   L    F  ++  ++K   ++ +  + +N +    +   RG+RQG PLS  ++ LA++    LL
Subjt:  ANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLL

Query:  SHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFF---GFQTKFLPVNYLGVP
            +   +K         +    +ADDV++  +D    L   Q    ++  AS    N SKS  S +   + + D +   F    +++K   + YLGV 
Subjt:  SHLESKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFF---GFQTKFLPVNYLGVP

Query:  LGG------------------------------NPRSRSFGIKQL-------NLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELG
        L                                + R R+  I QL        L           +I++   DFLW G       H ++  + + P + G
Subjt:  LGG------------------------------NPRSRSFGIKQL-------NLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELG

Query:  GLGISKLKDTNQALLCKWLWRY
        G G+  ++        + + RY
Subjt:  GLGISKLKDTNQALLCKWLWRY

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein7.4e-2523.43Show/hide
Query:  LIAGDFNIVRWERETNA---KSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLR-VNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFP--ILL
        ++ GDF+ +    +  +    S+  R +  F N +  ++L+D P    ++TWSN +  NP   +LDR + +  W ++F    +      +SDH P  I+L
Subjt:  LIAGDFNIVRWERETNA---KSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLR-VNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFP--ILL

Query:  ESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKV-NLYDANKKALLKEIDIIDKLEFQ------GEMSTT
        E+   +   C FR  +       F  +    W      G   ++  + L +  K  K        N+    K+AL    D ++ ++ Q        +   
Subjt:  ESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKV-NLYDANKKALLKEIDIIDKLEFQ------GEMSTT

Query:  HHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWN
         H  R        ++E+     + Q++R +W   GD N  +FH++   NQ KNLIK +       ++++  +    ++++ ++   +S      D L+ +
Subjt:  HHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWN

Query:  PISRL-------CQSELCKPF----DESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSD
         + R+       C   L         + EI + + +    KAPGPD +T  F+ + W  +KD  +   K+F + G +    N T I LI K     + S 
Subjt:  PISRL-------CQSELCKPF----DESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSD

Query:  YRPISLTTSLYKIM
        +RP+S  T +YKI+
Subjt:  YRPISLTTSLYKIM

AT3G25270.1 Ribonuclease H-like superfamily protein2.0e-0927.04Show/hide
Query:  LWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLW---NMWSSETGTPMVNTNVK-DLCLQLCRQTGRN
        +W+     K K F+W ++   L T DN+++R+     N   C  C   +E   HLF  C +A+ +W    +   E  T  +    K +L L  C      
Subjt:  LWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLW---NMWSSETGTPMVNTNVK-DLCLQLCRQTGRN

Query:  AKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSW-SSKKYLKKINR
         +    FN AI  LW +W  RN L+F  K  S+ N  +        W  +  Y++ +N+
Subjt:  AKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSW-SSKKYLKKINR

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.3e-0835.8Show/hide
Query:  LANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRK-IKGF-VLKLDIEKAFDKISWSFIDYMLAKKHFPHKW
        +  RLK  + + I   Q +FI GR   D I+   E + + +++K +KG+ +LKLD+EKA+D+I W +++  L    FP  W
Subjt:  LANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRK-IKGF-VLKLDIEKAFDKISWSFIDYMLAKKHFPHKW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-0523.91Show/hide
Query:  LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINW-NICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRN
        +S F+    + K++     +F W   E+K+    + W  +C S ++ GGLG   L   NQALL K  +R  ++ ++L  + + ++Y  +       VG  
Subjt:  LSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINW-NICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRN

Query:  SSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKW
         S    W +I   ++     +     DG     W  +W
Subjt:  SSANSPWNAIKKWKDWYESKISWLPNDGSSLSFWHSKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.2e-1247.76Show/hide
Query:  LLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSF-NNCCNISHLLFADD
        ++NGAP+G +   RG+RQGDPLSP++F+L  + LS L    + +G + G+   NN   I+HLLFADD
Subjt:  LLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSF-NNCCNISHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCATCAAACAACTACCCAAATCCTGCCAAATCGAGCAAAAGGAATTCATACTATCTTTGGATAGTAAACCAAGCTTCATCCGTATGTGGTTGACCGAAGTTTG
TCAACACAAATCTTTCTCCATGGACATCACTCCTGACACGCTGGCGTGGATCAGAAACTGCTTCAAAGACTTGTTGGACACTTCAACAACCAAACACTTTTTTGCTGAAA
GAAGGATGGAAGATAACTGCATGTGGGTAAGAAAGACCAAAAACAAGAGCAAAACTAGTATAACTGCTGAAATCTTCAGAATTGATAACAAAGGAAGAAAATGTAGCATA
TTGGTACCAGAAGGGCCTGACAGCTTCGGATGGAAGTCCTTCTTAGCCTTGATCACTTTCAGATCTTCAGCTCCAACTAAGAGAATTCGATCAGAAATAAGAAAGGAGCC
CGTCTCAACTTTTTCAGACTCCTTCTCATCAGACTCAGACTCTTCCAGAAAATCTTATGCAAAAGTTCTTTCTGATAGCAGTGAAGATGACAACAAGAAGAGGTACAAAG
CAACATCAGACGACAGTTCTAGTAGAAGAAGTTCATCGATTGGTTTTAAGCCTTTTACTCTCTCAGGAAATTCCTTTGAAAAAACGGTTATCATTACCAGGCGATGTTTC
CATGATGACTGGAACAGAATCATGTTTTCCCTAAGAAAGCAATCTGAAATAGCTTTCTCTTACAAGCCATTCCAAGCTGACAAAGCTATCCTATTCCTGAACCCAGACCA
TGCTAAACTTCTGTGTAGCAACAAAGGTGCAAATGGATGGTCTACAGTGGGAAATTACCAGGTTAAATTCGAAAGCTGGGATTCAAATTTACACTCTTTTCACTCTGTTA
TCCCCAGCTATGGAGGCTGGCTTCGTTTTAGAGGAATTCCGCTTCATTTATGGAACTACAATACTTTCCAACACATTGGATCGGCTTGTGGAGGTTTCCTGGATGTTGCT
AAGGAAACAATGCAAATGGATAAGCTCATTGATGCTAAAATCAAAGTCCGTTACAATTACATTGGCTTTGTTCCAGCCTCGATCCTGATCACTGATAACCAAGGCGAAAA
TTTTATAGTCACTACTGTTCAACCAGCCGAAGCCAGATGGCTCGTTGAAAGAAACGTTAGAGTTCATGGTTCCTTTAGAACTAAAGCTGCTGATGAATTTGACCAACACA
ACCATTTAGCAGAAACTTACACTTACAATGGATTCCAAGCCATCCCGCCGGAACCAACAAGAACTCACGGTGACTACAGCATCCACAACTCTGACAAACACTCTATCTCA
TATCACACACAAGCCAAGAAAAATAACTCCTCAGAATCTGAATATGATCCTTTTGATCAACAGCTCAGTGATAGAAGGAAAGAGAAAGGGAAAGCTATCCTCATCATAAA
TGATCAGGATCATGGCCATTACTCCAAAAGATCAAAAAGGATCTCCAACAGAAAAGTTTCTTTCTTATCTCCAGGTGGTATCCAATCAAACAGTTCAAATACTGAGATTA
ATACTAAAGGAAAATCATTGGAGATCTCGACCATTAATGATCAATTTGAAAAAAGATGGAGTCCGCGCCAAAAAACCAAGACTAAACTCACATACAGAATCAAAAAGGAT
CCTCAAGAATCTACCGAAGATCATAAGCTGTCTTTAAAAGAAACTGGTGAAGGAAGCAAACAAATGAATCTTTCGGTGGATATGGGCCCCATCTCCCCTCTGGAATCTAT
GATACAATCAGAAAACAATCATGGACTTGACACTTTCAACAATCAGACACCTGATGGAAATTCAAAATCAACAGATAGTGCAGAAGCAAAAAATCTGACTGTCTCGGTTA
AAGAAGGAGCTGATCAAAACAAGTCTGCTTCAAGATCCACAGCTGAGGGAAATTCAAAGGATGCAAAGACTGGCAGTGAGCTGGAAATTGACAGAGCCTTCAAGGAAAAA
CTTGTTATTTGGCTAAAGGAAAACGAACTCAAACTGTCTCCAAAATATACCAATGATGTACCTAGTTCCTCATATTTTCCTGTCATTGTATCTGACCAAAATATGGATAT
TGCAGGCCATGGGCCTTTGGGGGATAAGGGTGGCATTTTAGTCTTATGGGATGATACCAATTTCAAAGTCAATGACATCAAAGTCGGCAATTACTCCATTTCTTTAAATA
TTCTCAATACAAATGGGAATTGGTGGCTTACCTCTGTTTATGGCCCTTACAAATACAATGACAGAACAAAATTGTGGCCTGAATTAGAGATCCTTCAATCTCTTTGCCTT
CCCAACTGGTTGATTGCAGGGGACTTCAATATAGTCAGATGGGAGAGGGAGACAAATGCAAAATCTCTTGATAAGAGAAACATGGCTAATTTCAACAACTTCATCTCAGT
GAATGAGCTTATTGATCCCCCTCCTTTAAACAACAATTTCACTTGGTCTAATCTCAGAGTAAATCCAACTTACTCTCGTCTTGATCGTTTTCTACTCTCAAAGGGTTGGG
AAAATGCCTTTGGCTTACATACGTCTAGGACGTTGGAAAGAAATATCTCGGACCATTTCCCTATTCTTTTGGAATCCCCTCAAATCAAATGGGGCCCCTGCCCTTTCAGA
CTCAATAACTCCTCCTTAAGGGACAAAGAATTCCAGAAAAACTTCATAAATTGGTGGAACAGCTCCAAACAAGCAGGCTTCCCGGGCTACGCTTTTATTCAAAGTCTAAA
TTCTCTATCAAAGTTCATTAAAGAGTGGCAACATAACAAAGTCAACCTATATGATGCCAACAAAAAAGCCCTCCTGAAAGAGATTGACATAATTGACAAGTTAGAATTCC
AAGGAGAAATGTCTACCACTCATCATCAAAAGAGAATTTCTCTCAAATCGGACTTGTTAAGCATTGAAAACAATCAAGCTCAGATATGGCATCAAAGAGCAAGACAAAGG
TGGAACCTGTTGGGGGACGAAAATAACTCTTACTTCCACAGAATCTGCACCATAAACCAAAGAAAAAATCTAATCAAATCCATCTGTGACCCAGCCGGAACTTCTCTAGA
CTCAATTGATGATATTTCGAGGACATTCATCTCTCATTTTCAGAATATATACACTAAAGAAAGCTATGAAGAAATCCTTATAGATAACTTGAGCTGGAATCCTATCTCTC
GTTTATGCCAATCAGAGTTATGCAAGCCTTTTGATGAATCTGAAATAAAAAGCACAATTATGTCCTTTAGCAACGAAAAGGCCCCAGGCCCGGATGGCTACACTATGCTC
TTCTACAAGAAGCACTGGCCTGATCTCAAGGATGACTTGCTGAACGTATTTAAGGATTTCCACAAGGCAGGCATCGTTAATAACAATGTAAACAACACTTTCATAGCCCT
CATTAGCAAGAAAGAGAAGTGCAGCAAGCCCTCAGACTATCGCCCCATCAGCCTAACAACTTCCCTCTACAAGATAATGGCAAAAGCCTTAGCTAACAGACTCAAATCTG
CTCTTCCTGATACTATTGCGGAAAATCAAATGGCTTTCATTAAAGGAAGACAAATCAATGATGCAATTCTTATTGCAAATGAAGTAATTGACACTTGGAAACAAAGGAAA
ATAAAAGGTTTTGTCCTAAAGCTTGATATCGAAAAAGCCTTCGATAAAATTAGCTGGAGCTTCATTGATTATATGCTTGCAAAAAAGCACTTTCCACATAAATGGAGGAA
ATGGATTAAAGCCTGTATAAGCAATGTCCAATACTCCATTCTGCTAAATGGAGCCCCTAAAGGTAGAATCAAGGCGGAAAGAGGTATTAGGCAGGGAGATCCTCTTTCTC
CTTTCATCTTTGTTTTAGCCATGGACTACTTAAGCAGATTGCTATCCCACCTGGAATCAAAAGGGGCTATCAAAGGGGTCTCCTTCAACAACTGCTGCAACATATCACAT
CTTCTATTTGCGGATGATGTTCTCATCTTTGTTGAAGACAATGAAAGGTATTTAAATAATTTACAAATGGCGCTCACCCTCTTTGAGAAAGCCTCTGGGCTGACTTTCAA
CAACTCCAAGTCAACAATTAGCCCCATAAACATTTCAGCTGGAAGAACAGATCAGATTGCCAGCTTCTTCGGGTTTCAAACTAAATTCCTTCCTGTTAACTACCTAGGAG
TTCCGCTGGGAGGAAACCCAAGATCCAGATCTTTTGGGATCAAACAATTGAATCTCTCAACATTCAAAGCTCCTGTCTCAGTATACAAAGAAATTGAAAAACATTGGCGA
GATTTCTTGTGGGGAGGAAGTGAGGATAAACAAAATGCTCACCTAATCAACTGGAACATCTGTACCTCCCCTAAGGAGCTAGGAGGCCTGGGAATCAGTAAGCTAAAAGA
TACAAACCAAGCCCTCCTATGCAAATGGCTATGGAGATACCACAATGAGTCAAACTCTCTTTGGAAAAAGTGTATAGATGCAAAATACACAAAAAATCATCAAGGAGACA
TTCCAGTGGTAGGCAGAAACAGTAGTGCAAACTCTCCTTGGAATGCAATCAAAAAATGGAAAGATTGGTATGAATCTAAAATAAGCTGGCTACCTAATGATGGCTCCTCT
CTCTCCTTTTGGCACAGCAAGTGGCATAATAACATCCCTTTGTCCCTACAAATCCCCAGACTCTACGCTCTCTCGAATATGCAATCAGCTACTGTAAAGGAAATTTGGGA
CCAAGGTTCCGATGATTGGAACATGAAACCAAGAAGACCTTTAAATGAAAGGGAACAACAAACATGGGATTCTATCAAAATGAGTTTACCTCGGATTCACAACAACAGAG
GGATGTGCAAGCCTACTTGGAATCCAAGTGACAGCAAGAAGTACACAGTGGCTTCAGCAAAAGATATAGCATTCAAGGAAAGCTCAATTCCAAAGGAAACAAACTGGGAA
AAGGAGCTCAAACATCTTTGGAGATCTCACATTCCGCAAAAATGTAAATTTTTCATTTGGACCATGGTTCATCAGAAACTCAACACTATGGATAACATCCAAAAGAGGAA
CCCAAGCATGAGCCTCAACCCAAGCTGGTGCATTTCCTGCCGCTCCTCCAACGAAGACATGAACCATTTATTCATTTTCTGTCCCTTTGCCCGCAGCCTTTGGAATATGT
GGAGTTCGGAAACAGGTACTCCCATGGTCAACACAAACGTAAAAGACCTCTGTTTACAATTATGTAGGCAAACAGGCAGAAATGCAAAGAATATCATCAGCTTCAATTCA
GCTATAGCTACCTTATGGACAATTTGGATTCGAAGAAACAATCTAATCTTCGCGGATAAGGACTCATCCTATCTAAATGCTTGGGAAGACATATGCACTCTCACTGGAAG
TTGGTCTTCAAAAAAATATCTAAAGAAGATTAACCGATTGCTGTGGGATGATTTTGCTGGCGTGGAAGTGGAGGAGTCACTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCATCAAACAACTACCCAAATCCTGCCAAATCGAGCAAAAGGAATTCATACTATCTTTGGATAGTAAACCAAGCTTCATCCGTATGTGGTTGACCGAAGTTTG
TCAACACAAATCTTTCTCCATGGACATCACTCCTGACACGCTGGCGTGGATCAGAAACTGCTTCAAAGACTTGTTGGACACTTCAACAACCAAACACTTTTTTGCTGAAA
GAAGGATGGAAGATAACTGCATGTGGGTAAGAAAGACCAAAAACAAGAGCAAAACTAGTATAACTGCTGAAATCTTCAGAATTGATAACAAAGGAAGAAAATGTAGCATA
TTGGTACCAGAAGGGCCTGACAGCTTCGGATGGAAGTCCTTCTTAGCCTTGATCACTTTCAGATCTTCAGCTCCAACTAAGAGAATTCGATCAGAAATAAGAAAGGAGCC
CGTCTCAACTTTTTCAGACTCCTTCTCATCAGACTCAGACTCTTCCAGAAAATCTTATGCAAAAGTTCTTTCTGATAGCAGTGAAGATGACAACAAGAAGAGGTACAAAG
CAACATCAGACGACAGTTCTAGTAGAAGAAGTTCATCGATTGGTTTTAAGCCTTTTACTCTCTCAGGAAATTCCTTTGAAAAAACGGTTATCATTACCAGGCGATGTTTC
CATGATGACTGGAACAGAATCATGTTTTCCCTAAGAAAGCAATCTGAAATAGCTTTCTCTTACAAGCCATTCCAAGCTGACAAAGCTATCCTATTCCTGAACCCAGACCA
TGCTAAACTTCTGTGTAGCAACAAAGGTGCAAATGGATGGTCTACAGTGGGAAATTACCAGGTTAAATTCGAAAGCTGGGATTCAAATTTACACTCTTTTCACTCTGTTA
TCCCCAGCTATGGAGGCTGGCTTCGTTTTAGAGGAATTCCGCTTCATTTATGGAACTACAATACTTTCCAACACATTGGATCGGCTTGTGGAGGTTTCCTGGATGTTGCT
AAGGAAACAATGCAAATGGATAAGCTCATTGATGCTAAAATCAAAGTCCGTTACAATTACATTGGCTTTGTTCCAGCCTCGATCCTGATCACTGATAACCAAGGCGAAAA
TTTTATAGTCACTACTGTTCAACCAGCCGAAGCCAGATGGCTCGTTGAAAGAAACGTTAGAGTTCATGGTTCCTTTAGAACTAAAGCTGCTGATGAATTTGACCAACACA
ACCATTTAGCAGAAACTTACACTTACAATGGATTCCAAGCCATCCCGCCGGAACCAACAAGAACTCACGGTGACTACAGCATCCACAACTCTGACAAACACTCTATCTCA
TATCACACACAAGCCAAGAAAAATAACTCCTCAGAATCTGAATATGATCCTTTTGATCAACAGCTCAGTGATAGAAGGAAAGAGAAAGGGAAAGCTATCCTCATCATAAA
TGATCAGGATCATGGCCATTACTCCAAAAGATCAAAAAGGATCTCCAACAGAAAAGTTTCTTTCTTATCTCCAGGTGGTATCCAATCAAACAGTTCAAATACTGAGATTA
ATACTAAAGGAAAATCATTGGAGATCTCGACCATTAATGATCAATTTGAAAAAAGATGGAGTCCGCGCCAAAAAACCAAGACTAAACTCACATACAGAATCAAAAAGGAT
CCTCAAGAATCTACCGAAGATCATAAGCTGTCTTTAAAAGAAACTGGTGAAGGAAGCAAACAAATGAATCTTTCGGTGGATATGGGCCCCATCTCCCCTCTGGAATCTAT
GATACAATCAGAAAACAATCATGGACTTGACACTTTCAACAATCAGACACCTGATGGAAATTCAAAATCAACAGATAGTGCAGAAGCAAAAAATCTGACTGTCTCGGTTA
AAGAAGGAGCTGATCAAAACAAGTCTGCTTCAAGATCCACAGCTGAGGGAAATTCAAAGGATGCAAAGACTGGCAGTGAGCTGGAAATTGACAGAGCCTTCAAGGAAAAA
CTTGTTATTTGGCTAAAGGAAAACGAACTCAAACTGTCTCCAAAATATACCAATGATGTACCTAGTTCCTCATATTTTCCTGTCATTGTATCTGACCAAAATATGGATAT
TGCAGGCCATGGGCCTTTGGGGGATAAGGGTGGCATTTTAGTCTTATGGGATGATACCAATTTCAAAGTCAATGACATCAAAGTCGGCAATTACTCCATTTCTTTAAATA
TTCTCAATACAAATGGGAATTGGTGGCTTACCTCTGTTTATGGCCCTTACAAATACAATGACAGAACAAAATTGTGGCCTGAATTAGAGATCCTTCAATCTCTTTGCCTT
CCCAACTGGTTGATTGCAGGGGACTTCAATATAGTCAGATGGGAGAGGGAGACAAATGCAAAATCTCTTGATAAGAGAAACATGGCTAATTTCAACAACTTCATCTCAGT
GAATGAGCTTATTGATCCCCCTCCTTTAAACAACAATTTCACTTGGTCTAATCTCAGAGTAAATCCAACTTACTCTCGTCTTGATCGTTTTCTACTCTCAAAGGGTTGGG
AAAATGCCTTTGGCTTACATACGTCTAGGACGTTGGAAAGAAATATCTCGGACCATTTCCCTATTCTTTTGGAATCCCCTCAAATCAAATGGGGCCCCTGCCCTTTCAGA
CTCAATAACTCCTCCTTAAGGGACAAAGAATTCCAGAAAAACTTCATAAATTGGTGGAACAGCTCCAAACAAGCAGGCTTCCCGGGCTACGCTTTTATTCAAAGTCTAAA
TTCTCTATCAAAGTTCATTAAAGAGTGGCAACATAACAAAGTCAACCTATATGATGCCAACAAAAAAGCCCTCCTGAAAGAGATTGACATAATTGACAAGTTAGAATTCC
AAGGAGAAATGTCTACCACTCATCATCAAAAGAGAATTTCTCTCAAATCGGACTTGTTAAGCATTGAAAACAATCAAGCTCAGATATGGCATCAAAGAGCAAGACAAAGG
TGGAACCTGTTGGGGGACGAAAATAACTCTTACTTCCACAGAATCTGCACCATAAACCAAAGAAAAAATCTAATCAAATCCATCTGTGACCCAGCCGGAACTTCTCTAGA
CTCAATTGATGATATTTCGAGGACATTCATCTCTCATTTTCAGAATATATACACTAAAGAAAGCTATGAAGAAATCCTTATAGATAACTTGAGCTGGAATCCTATCTCTC
GTTTATGCCAATCAGAGTTATGCAAGCCTTTTGATGAATCTGAAATAAAAAGCACAATTATGTCCTTTAGCAACGAAAAGGCCCCAGGCCCGGATGGCTACACTATGCTC
TTCTACAAGAAGCACTGGCCTGATCTCAAGGATGACTTGCTGAACGTATTTAAGGATTTCCACAAGGCAGGCATCGTTAATAACAATGTAAACAACACTTTCATAGCCCT
CATTAGCAAGAAAGAGAAGTGCAGCAAGCCCTCAGACTATCGCCCCATCAGCCTAACAACTTCCCTCTACAAGATAATGGCAAAAGCCTTAGCTAACAGACTCAAATCTG
CTCTTCCTGATACTATTGCGGAAAATCAAATGGCTTTCATTAAAGGAAGACAAATCAATGATGCAATTCTTATTGCAAATGAAGTAATTGACACTTGGAAACAAAGGAAA
ATAAAAGGTTTTGTCCTAAAGCTTGATATCGAAAAAGCCTTCGATAAAATTAGCTGGAGCTTCATTGATTATATGCTTGCAAAAAAGCACTTTCCACATAAATGGAGGAA
ATGGATTAAAGCCTGTATAAGCAATGTCCAATACTCCATTCTGCTAAATGGAGCCCCTAAAGGTAGAATCAAGGCGGAAAGAGGTATTAGGCAGGGAGATCCTCTTTCTC
CTTTCATCTTTGTTTTAGCCATGGACTACTTAAGCAGATTGCTATCCCACCTGGAATCAAAAGGGGCTATCAAAGGGGTCTCCTTCAACAACTGCTGCAACATATCACAT
CTTCTATTTGCGGATGATGTTCTCATCTTTGTTGAAGACAATGAAAGGTATTTAAATAATTTACAAATGGCGCTCACCCTCTTTGAGAAAGCCTCTGGGCTGACTTTCAA
CAACTCCAAGTCAACAATTAGCCCCATAAACATTTCAGCTGGAAGAACAGATCAGATTGCCAGCTTCTTCGGGTTTCAAACTAAATTCCTTCCTGTTAACTACCTAGGAG
TTCCGCTGGGAGGAAACCCAAGATCCAGATCTTTTGGGATCAAACAATTGAATCTCTCAACATTCAAAGCTCCTGTCTCAGTATACAAAGAAATTGAAAAACATTGGCGA
GATTTCTTGTGGGGAGGAAGTGAGGATAAACAAAATGCTCACCTAATCAACTGGAACATCTGTACCTCCCCTAAGGAGCTAGGAGGCCTGGGAATCAGTAAGCTAAAAGA
TACAAACCAAGCCCTCCTATGCAAATGGCTATGGAGATACCACAATGAGTCAAACTCTCTTTGGAAAAAGTGTATAGATGCAAAATACACAAAAAATCATCAAGGAGACA
TTCCAGTGGTAGGCAGAAACAGTAGTGCAAACTCTCCTTGGAATGCAATCAAAAAATGGAAAGATTGGTATGAATCTAAAATAAGCTGGCTACCTAATGATGGCTCCTCT
CTCTCCTTTTGGCACAGCAAGTGGCATAATAACATCCCTTTGTCCCTACAAATCCCCAGACTCTACGCTCTCTCGAATATGCAATCAGCTACTGTAAAGGAAATTTGGGA
CCAAGGTTCCGATGATTGGAACATGAAACCAAGAAGACCTTTAAATGAAAGGGAACAACAAACATGGGATTCTATCAAAATGAGTTTACCTCGGATTCACAACAACAGAG
GGATGTGCAAGCCTACTTGGAATCCAAGTGACAGCAAGAAGTACACAGTGGCTTCAGCAAAAGATATAGCATTCAAGGAAAGCTCAATTCCAAAGGAAACAAACTGGGAA
AAGGAGCTCAAACATCTTTGGAGATCTCACATTCCGCAAAAATGTAAATTTTTCATTTGGACCATGGTTCATCAGAAACTCAACACTATGGATAACATCCAAAAGAGGAA
CCCAAGCATGAGCCTCAACCCAAGCTGGTGCATTTCCTGCCGCTCCTCCAACGAAGACATGAACCATTTATTCATTTTCTGTCCCTTTGCCCGCAGCCTTTGGAATATGT
GGAGTTCGGAAACAGGTACTCCCATGGTCAACACAAACGTAAAAGACCTCTGTTTACAATTATGTAGGCAAACAGGCAGAAATGCAAAGAATATCATCAGCTTCAATTCA
GCTATAGCTACCTTATGGACAATTTGGATTCGAAGAAACAATCTAATCTTCGCGGATAAGGACTCATCCTATCTAAATGCTTGGGAAGACATATGCACTCTCACTGGAAG
TTGGTCTTCAAAAAAATATCTAAAGAAGATTAACCGATTGCTGTGGGATGATTTTGCTGGCGTGGAAGTGGAGGAGTCACTATAG
Protein sequenceShow/hide protein sequence
MAFIKQLPKSCQIEQKEFILSLDSKPSFIRMWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSI
LVPEGPDSFGWKSFLALITFRSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCF
HDDWNRIMFSLRKQSEIAFSYKPFQADKAILFLNPDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
KETMQMDKLIDAKIKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSIS
YHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSLEISTINDQFEKRWSPRQKTKTKLTYRIKKD
PQESTEDHKLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNSKDAKTGSELEIDRAFKEK
LVIWLKENELKLSPKYTNDVPSSSYFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCL
PNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFR
LNNSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQR
WNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTML
FYKKHWPDLKDDLLNVFKDFHKAGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEVIDTWKQRK
IKGFVLKLDIEKAFDKISWSFIDYMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNCCNISH
LLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFGIKQLNLSTFKAPVSVYKEIEKHWR
DFLWGGSEDKQNAHLINWNICTSPKELGGLGISKLKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWLPNDGSS
LSFWHSKWHNNIPLSLQIPRLYALSNMQSATVKEIWDQGSDDWNMKPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPTWNPSDSKKYTVASAKDIAFKESSIPKETNWE
KELKHLWRSHIPQKCKFFIWTMVHQKLNTMDNIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARSLWNMWSSETGTPMVNTNVKDLCLQLCRQTGRNAKNIISFNS
AIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKKYLKKINRLLWDDFAGVEVEESL