CuGenDBv2

IVF0007313 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0007313
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr12:22358777..22364473
RNA-Seq Expression: IVF0007313
Synteny: IVF0007313
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR025558 - Domain of unknown function DUF4283
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035247.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]  e value: 0.0  % identity: 68.33
Query:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITFRSSAPTKRIRSEI
        MDITPDTL WIRNCFKDLLDTSTTKHFFAE+R EDNCMWVRKTKNKSKTSIT EIFRIDNKGRKCSILVPEGPDSFGWK FLALITFR SAPTKRI SEI
Subjt:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITFRSSAPTKRIRSEI

Query:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP
        RKE VS +SDSFSSDSDSSRKSYAK LSDSSE++NKKRYK+TSDDSSSRRSS+IGFKPFTLSG+SFEKTVI+TRRCFHDDWNRIMFSLRKQSEIAFSYKP
Subjt:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP

Query:  FQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA
        FQADKA LFLN DHAKLLCSNK ANGWSTVGNYQ    S+ +    S H        W     IP                     V KETMQM+KLIDA
Subjt:  FQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA

Query:  KIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH
        KIKVRYNY GFVPAS+LITDNQGENFI+TTV P +ARWLVERNVRVHG+F+TKAADEFDQHN LAE YTYNGFQAIP E TRT GDYS  NSDKHS+S H
Subjt:  KIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH

Query:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ------------------------------------------IMAIT-PKDQKGSPTEKF---
        TQAKKNNSSESEYDPFDQQLS+RRKEKGK IL+INDQ                                          I  I  P +++ SP +K    
Subjt:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ------------------------------------------IMAIT-PKDQKGSPTEKF---

Query:  LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS
        L+Y +  DP E  ED  LSLKE GEGSKQMNLSVDMGPISPL+SM+QSENNHG D+LNNQT     K+    E +N T SVKEG D NKS SRST +G S
Subjt:  LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS

Query:  KDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------------------
        KDAKT SE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPV+VSDQN D++GHGPLGDKG I+VLWDDTKFKVN+IK                  
Subjt:  KDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------------------

Query:  ---------------------------------------------------------------------------------------------------G
                                                                                                           G
Subjt:  ---------------------------------------------------------------------------------------------------G

Query:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL
        WEN FGLHTSRT+ER +SDHFPI+LESPQIKWGPCPFRLNNSSL+DK+FQKNF +WWNNSKQ GFPGYAFIQSL SLSK IKEWQHNKVNLYDA +K LL
Subjt:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL

Query:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN
        +EID IDKLE QG MST HHQKRISLKS+LLSIENNQA IWHQR+RQRWNLLGDENN++FHR+CTINQRKN IKSICDP GTSLDSI DISR FISHFQN
Subjt:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN

Query:  IYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKE
        IYTKENYEEILIDNL           +LCKPFDE EIKSTIMS SNEKAPGPDGYT+LFYKKHW DLK DLLNV KDFHK GIVNNNVNNTFIALISKKE
Subjt:  IYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKE

Query:  KCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAEN
        KC+ PSDYRPISLTTSLYK+MAKALANRLKS LPDT+AEN
Subjt:  KCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAEN

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]  e value: 0.0  % identity: 82.38
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------
        KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK      
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                   GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL
        DISRTFISHFQNIYTKENYEEILIDNL
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]  e value: 0.0  % identity: 100
Query:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ
        MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ
Subjt:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ

Query:  MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
        MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
Subjt:  MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS

Query:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
        PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
Subjt:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ

Query:  TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
        TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
Subjt:  TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN

Query:  ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH
        ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH
Subjt:  ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH

Query:  NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW
        NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW
Subjt:  NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW

Query:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD
        EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD
Subjt:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD

Query:  RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM
        RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]  e value: 0.0  % identity: 87.81
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDIK      
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                   GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV
        DISRTFISHFQNIYTKE+YEEILIDNL           +LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNV
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLD+EKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS

Query:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
        FID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIF
Subjt:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFW QTIECIHKKLNGWKYSQISKGGR
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR

Query:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
        LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWHNNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSKSKTLKNYSQATIALNIKALCNLPM
        SWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  SWSSKSKTLKNYSQATIALNIKALCNLPM

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]  e value: 0.0  % identity: 87.81
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDIK      
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                   GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  -----------GWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV
        DISRTFISHFQNIYTKE+YEEILIDNL           +LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNV
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLD+EKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS

Query:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
        FID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
Subjt:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFW QTIECIHKKLNGWKYSQISKGGR
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR

Query:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
        LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWHNNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSKSKTLKNYSQATIALNIKALCNLPM
        SWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  SWSSKSKTLKNYSQATIALNIKALCNLPM

TrEMBL top hits (e value, % identity, alignment)
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein  e value: 0.0e+00  % identity: 87.81
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDI       
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                  KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV
        DISRTFISHFQNIYTKE+YEEILIDNL           +LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNV
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLD+EKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS

Query:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
        FID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNN CNISHLLFADDVLIF
Subjt:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFW QTIECIHKKLNGWKYSQISKGGR
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR

Query:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
        LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWHNNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSKSKTLKNYSQATIALNIKALCNLPM
        SWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  SWSSKSKTLKNYSQATIALNIKALCNLPM

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein  e value: 0.0e+00  % identity: 100
Query:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ
        MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ
Subjt:  MSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQ

Query:  MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
        MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS
Subjt:  MAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLS

Query:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
        PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ
Subjt:  PFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQ

Query:  TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
        TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN
Subjt:  TKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWN

Query:  ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH
        ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH
Subjt:  ICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWH

Query:  NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW
        NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW
Subjt:  NNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNW

Query:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD
        EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD
Subjt:  EKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSD

Query:  RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM
        RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  RNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALCNLPM

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein  e value: 0.0e+00  % identity: 82.38
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------
        KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI       
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                  KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL
        DISRTFISHFQNIYTKENYEEILIDNL
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein    0.0e+00    87.81
Query:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF
        MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWK FLALITF
Subjt:  MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITF

Query:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
        RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS
Subjt:  RSSAPTKRIRSEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFS

Query:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
        LRKQSEIAFSYKPFQADKAILFLN DHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA
Subjt:  LRKQSEIAFSYKPFQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVA

Query:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
        KETMQMDKLIDAKIKVRYNY GFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS
Subjt:  KETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYS

Query:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D
        IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ                +  ++P                            +
Subjt:  IHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ----------------IMAITPK---------------------------D

Query:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
        ++ SP +K    L+Y +  DPQESTEDH LSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDT NNQTPDGNSKSTDSAEAKNLTVSVKEGADQN
Subjt:  QKGSPTEKF---LSYPL-VDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQN

Query:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------
        KSASRSTAEGNSKDAKTGSE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPVIVSDQNMDIAGHGPLGDKGGILVLWDDT FKVNDI       
Subjt:  KSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK
                  KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDK+FQKNFINWWN+SKQAGFPGYAFIQSLNSLSKFIKEWQHNK
Subjt:  ----------KGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK

Query:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
        VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID
Subjt:  VNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSID

Query:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV
        DISRTFISHFQNIYTKE+YEEILIDNL           +LCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK GIVNNNV
Subjt:  DISRTFISHFQNIYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNV

Query:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS
        NNTFIALISKKEKCSKPSDYRPISLTTSLYK+MAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANE IDTWKQRKIKGFVLKLD+EKAFDKISWS
Subjt:  NNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWS

Query:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
        FID+MLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF
Subjt:  FIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIF

Query:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR
        VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFW QTIECIHKKLNGWKYSQISKGGR
Subjt:  VEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKLNGWKYSQISKGGR

Query:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY
        LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISK+KDTNQALLCKWLWRYHNESNSLWKKCIDAKY
Subjt:  LTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKY

Query:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ
        TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW+ NDGSSLSFWHSKWHNNIPLSLQ PRLYALSNMQSATVKEIWDQGSDDWNM+PRRPLNEREQ
Subjt:  TKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQ

Query:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW
        QTWDSIKMSLPRIHNNRGMCKP+WNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMD IQKRNPSMSLNPSW
Subjt:  QTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSW

Query:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
        CISCRSSNEDMNHLFIFCPFAR+LWNMWSSETGTPM  TNVKDLCLQLCRQ+ RN KNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG
Subjt:  CISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTG

Query:  SWSSKSKTLKNYSQATIALNIKALCNLPM
        SWSSKSKTLKNYSQATIALNIKALCNLPM
Subjt:  SWSSKSKTLKNYSQATIALNIKALCNLPM

A0A5D3DFM9 LINE-1 retrotransposable element ORF2 protein    0.0e+00    68.33
Query:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITFRSSAPTKRIRSEI
        MDITPDTL WIRNCFKDLLDTSTTKHFFAE+R EDNCMWVRKTKNKSKTSIT EIFRIDNKGRKCSILVPEGPDSFGWK FLALITFR SAPTKRI SEI
Subjt:  MDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITFRSSAPTKRIRSEI

Query:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP
        RKE VS +SDSFSSDSDSSRKSYAK LSDSSE++NKKRYK+TSDDSSSRRSS+IGFKPFTLSG+SFEKTVI+TRRCFHDDWNRIMFSLRKQSEIAFSYKP
Subjt:  RKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKP

Query:  FQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA
        FQADKA LFLN DHAKLLCSNK ANGWSTVGNYQ    S+ +    S H        W     IP                     V KETMQM+KLIDA
Subjt:  FQADKAILFLNSDHAKLLCSNKGANGWSTVGNYQVKFESW-DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDA

Query:  KIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH
        KIKVRYNY GFVPAS+LITDNQGENFI+TTV P +ARWLVERNVRVHG+F+TKAADEFDQHN LAE YTYNGFQAIP E TRT GDYS  NSDKHS+S H
Subjt:  KIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYH

Query:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ------------------------------------------IMAIT-PKDQKGSPTEKF---
        TQAKKNNSSESEYDPFDQQLS+RRKEKGK IL+INDQ                                          I  I  P +++ SP +K    
Subjt:  TQAKKNNSSESEYDPFDQQLSDRRKEKGKAILIINDQ------------------------------------------IMAIT-PKDQKGSPTEKF---

Query:  LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS
        L+Y +  DP E  ED  LSLKE GEGSKQMNLSVDMGPISPL+SM+QSENNHG D+LNNQT     K+    E +N T SVKEG D NKS SRST +G S
Subjt:  LSYPLV-DPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASRSTAEGNS

Query:  KDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------------------
        KDAKT SE+EIDRAFKEKLVIWLKENELKLSPKYTNDVPSSS FPV+VSDQN D++GHGPLGDK GI+VLWDDTKFKVN+I                   
Subjt:  KDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDI-------------------

Query:  --------------------------------------------------------------------------------------------------KG
                                                                                                          KG
Subjt:  --------------------------------------------------------------------------------------------------KG

Query:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL
        WEN FGLHTSRT+ER +SDHFPI+LESPQIKWGPCPFRLNNSSL+DK+FQKNF +WWNNSKQ GFPGYAFIQSL SLSK IKEWQHNKVNLYDA +K LL
Subjt:  WENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL

Query:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN
        +EID IDKLE QG MST HHQKRISLKS+LLSIENNQA IWHQR+RQRWNLLGDENN++FHR+CTINQRKN IKSICDP GTSLDSI DISR FISHFQN
Subjt:  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQN

Query:  IYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKE
        IYTKENYEEILIDNL           +LCKPFDE EIKSTIMS SNEKAPGPDGYT+LFYKKHW DLK DLLNV KDFHK GIVNNNVNNTFIALISKKE
Subjt:  IYTKENYEEILIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKE

Query:  KCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAEN
        KC+ PSDYRPISLTTSLYK+MAKALANRLKS LPDT+AEN
Subjt:  KCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAEN

SwissProt top hits    e value    %identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    4.4e-45    26.32
Query:  ISDHFPILLE--------SPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK-------VNLYDANKKALLK
        +SDH  I LE        S    W     +LNN  L D         W +N  +A    + F  + N  + +   W   K       + L    +K    
Subjt:  ISDHFPILLE--------SPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNK-------VNLYDANKKALLK

Query:  EIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTIN----------QRKNLIKSICDPAGTSLDSIDDIS
        +ID +     + E     H K  S + ++  I     +I  Q+  Q+ N   +  + +F RI  I+          + KN I +I +  G       +I 
Subjt:  EIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTIN----------QRKNLIKSICDPAGTSLDSIDDIS

Query:  RTFISHFQNIYTK--ENYEEI--LIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNN
         T   +++++Y    EN EE+   +D              L +P   SEI + I S   +K+PGPDG+T  FY+++  +L   LL +F+   K GI+ N+
Subjt:  RTFISHFQNIYTK--ENYEEI--LIDNL-----------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNN

Query:  VNNTFIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFV-LKLDLEKAFDKI
             I LI K  +  +K  ++RPISL     K++ K LANR++  +   I  +Q+ FI G Q    I  +   I    + K K  V + +D EKAFDKI
Subjt:  VNNTFIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFV-LKLDLEKAFDKI

Query:  SWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDV
           F+   L K      + K I+A       +I+LNG        + G RQG PLSP +F + ++ L+R    +  +  IKG+       +   LFADD+
Subjt:  SWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDV

Query:  LIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRS--RSFWSQTIECIHKKLNGWKYSQI
        ++++E+      NL   ++ F K SG   N  KS     N +     QI     F      + YLG+ L  + +   +  +   ++ I +  N WK    
Subjt:  LIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRS--RSFWSQTIECIHKKLNGWKYSQI

Query:  SKGGRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCK--WLWRYHNESNSL
        S  GR+ ++K ++     Y+ +    K P++ + E+EK    F+W      Q    I  +I +   + GG+ +   K   +A + K  W W Y N     
Subjt:  SKGGRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCK--WLWRYHNESNSL

Query:  WKK
        W +
Subjt:  WKK

P08548 LINE-1 reverse transcriptase homolog    1.7e-44    25.23
Query:  NFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNL
        N+ N W+ +K A   G  FI    +L  F+K+ +  +VN    + K L KE                H   + S + ++  I     +I ++R  Q+ N 
Subjt:  NFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNL

Query:  LGDENNSYFHRICTINQ------RKNLIKSICDPAGTSLDSI----DDISRTFISHFQNIYTK--ENYEEILIDNLKLC--------------KPFDESE
             + +F +I  I++      RK  +KS+        D I     +I +    +++ +Y+   EN +EI    L+ C              +P   SE
Subjt:  LGDENNSYFHRICTINQ------RKNLIKSICDPAGTSLDSI----DDISRTFISHFQNIYTK--ENYEEILIDNLKLC--------------KPFDESE

Query:  IKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPD
        I STI +   +K+PGPDG+T  FY+    +L   LLN+F++  K GI+ N      I LI K  K  ++  +YRPISL     K++ K L NR++  +  
Subjt:  IKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPD

Query:  TIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKG-FVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGI
         I  +Q+ FI G Q    I  +   I    + K K   +L +D EKAFD I   F+   L K      + K I+A  S    +I+LNG          G 
Subjt:  TIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKG-FVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGI

Query:  RQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST--ISPINISAGRTD
        RQG PLSP +F + M+ L+     +  + AIKG+   +   I   LFADD+++++E+       L   +  +   SG   N  KS   I   N  A +T 
Subjt:  RQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST--ISPINISAGRTD

Query:  QIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKL----NGWKYSQISKGGRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLW
         +     F      + YLGV L      +  + +  E + K++    N WK    S  GR+ ++K S+     Y  +    KAP+S +K++EK    F+W
Subjt:  QIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHKKL----NGWKYSQISKGGRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLW

Query:  GGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYH-NESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW
              Q    I   + ++  + GG+ +  ++   ++++ K  W +H N    +W +       +N + D P        + P   I+  KD   +K  W
Subjt:  GGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYH-NESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISW

Query:  MANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKM-------------SLPRIHNNRGMCKPSW
        +             W          P L  L+ + S  +K++      +   E  + L E   +T + I +             ++ +IH         W
Subjt:  MANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDSIKM-------------SLPRIHNNRGMCKPSW

Query:  NPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNP
        +    K +   +AK+I  K S  P E  WEK    ++  +   K    + T +H++L  ++K + R+P
Subjt:  NPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNP

P0C2F6 Putative ribonuclease H protein At1g65750    6.8e-38    28.01
Query:  SRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKD
        ++  + + +E +  +++GW+   +S  GRLTL KA LSS+P + +ST   P S+   +++  R FLWG + +K+  HL+ W+   SPK+ GGLG+   K 
Subjt:  SRSFWSQTIECIHKKLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKD

Query:  TNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIK-KWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSN
         N+AL+ K  WR   E NSLW   +  KY      D   +    S +S W +I    +D     + W+  DG  + FW  +W +  PL L+       ++
Subjt:  TNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIPVVGRNSSANSPWNAIK-KWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSN

Query:  MQSATVKEIW--DQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQK
          +   K++W   +G D   ++P    N R +    ++ + L     +R     SW  S   +++V SA ++      +P+  N       LW+  +P++
Subjt:  MQSATVKEIW--DQGSDDWNMEPRRPLNEREQQTWDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQK

Query:  CKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLW
         K F+W + +Q + T ++  +R+ S S   + C  C+   E M H+   CP    +W
Subjt:  CKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLW

P11369 LINE-1 retrotransposable element ORF2 protein    1.0e-38    24.65
Query:  ISDHFPI-LLESPQIKWGPCPF--RLNNSSLRD-------KDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKK--------A
        +SDH  + L+ +  I  G   F  +LNN+ L D       K   K+F+  +N ++   +P         +L   +K +   K+    A+KK        +
Subjt:  ISDHFPI-LLESPQIKWGPCPF--RLNNSSLRD-------KDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKK--------A

Query:  LLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY---FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFI
        L   +  ++K E       +  Q+ I L+ ++  +E  +     QR  Q  +   ++ N       R+   ++ K LI  I +  G      ++I  T  
Subjt:  LLKEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSY---FHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFI

Query:  SHFQNIYTK--ENYEEI--LIDNLKLCK-----------PFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNT
        S ++ +Y+   EN +E+   +D  ++ K           P    EI++ I S   +K+PGPDG++  FY+    DL   L  +F      G + N+    
Subjt:  SHFQNIYTK--ENYEEI--LIDNLKLCK-----------PFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNT

Query:  FIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKG-FVLKLDLEKAFDKISWSF
         I LI K +K  +K  ++RPISL     K++ K LANR++  +   I  +Q+ FI G Q    I  +   I    + K K   ++ LD EKAFDKI   F
Subjt:  FIALISKKEK-CSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKG-FVLKLDLEKAFDKISWSF

Query:  IDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFV
        +  +L +      +   IKA  S    +I +NG     I  + G RQG PLSP++F + ++ L+R    +  +  IKG+       +   L ADD+++++
Subjt:  IDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFV

Query:  EDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPR---SRSFWSQTIECIHKKLNGWKYSQISKG
         D +     L   +  F +  G   N++KS       +     +I     F      + YLGV L    +    ++F S   E I + L  WK    S  
Subjt:  EDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPR---SRSFWSQTIECIHKKLNGWKYSQISKG

Query:  GRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCK--WLWRYHNESNSLWKK
        GR+ ++K ++     Y+ +    K P   + E+E     F+W   + +     I  ++    +  GG+ +  +K   +A++ K  W W Y +     W +
Subjt:  GRLTLLKASLSSLPTYQLST--FKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCK--WLWRYHNESNSLWKK

Query:  CIDAKYTKNHQGDI
          D +   +  G +
Subjt:  CIDAKYTKNHQGDI

P14381 Transposon TX1 uncharacterized 149 kDa protein    1.6e-34    23.84
Query:  FRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNL--------------YDANKKALLKEIDIIDKLEFQGEMSTTHHQ-
        +  NNS L D+ F K+  + W   +       AF     +L+++   W   KV+L               +A  +AL  E+     L+ +  +S +  Q 
Subjt:  FRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNL--------------YDANKKALLKEIDIIDKLEFQGEMSTTHHQ-

Query:  ---KRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKENYE----EILIDN
           + +  K  L ++E  QA+    R+R +     D  + +F+ +      +  I  +    GT L+  + I     S +QN+++ +       E L D 
Subjt:  ---KRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKENYE----EILIDN

Query:  L---------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTS
        L         +L  P    E+   +    + K+PG DG T+ F++  W  L  D   V  +  K G +  +     ++L+ KK       ++RP+SL ++
Subjt:  L---------KLCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTS

Query:  LYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQY
         YK++AKA++ RLKS L + I  +Q   + GR I D + +  + +   ++  +    L LD EKAFD++   ++   L    F  ++  ++K   ++ + 
Subjt:  LYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQY

Query:  SILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMD----YLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGL
         + +N +    +   RG+RQG PLS  ++ LA++     L + L+ L  K     V  + Y        ADDV++  +D    L   Q    ++  AS  
Subjt:  SILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMD----YLSRLLSHLESKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGL

Query:  TFNNSKSTISPINISAGRTDQIASFF---GFQTKFLPVNYLGVPLGGN--PRSRSFWSQTIECIHKKLNGWK--YSQISKGGRLTLLKASLSSLPTYQLS
          N SKS  S +   + + D +   F    +++K   + YLGV L     P S++F  +  EC+  +L  WK     +S  GR  ++   ++S   Y+L 
Subjt:  TFNNSKSTISPINISAGRTDQIASFF---GFQTKFLPVNYLGVPLGGN--PRSRSFWSQTIECIHKKLNGWK--YSQISKGGRLTLLKASLSSLPTYQLS

Query:  TFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRY
                  +I++   DFLW G       H ++  + + P + GG G+  ++        + + RY
Subjt:  TFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRY

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    3.4e-16    26.6
Query:  WHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKENYEEIL----IDNLKLCKPF--------------D
        + Q++R +W   GD N  +FH++   NQ KNLIK +       ++++  +    ++++ ++   ++  +IL    +  +K   PF               
Subjt:  WHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKENYEEIL----IDNLKLCKPF--------------D

Query:  ESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLM
        + EI + + +    KAPGPD +T  F+ + W  +KD  +   K+F +TG +    N T I LI K     + S +RP+S  T +YK++
Subjt:  ESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLM

AT3G25270.1 Ribonuclease H-like superfamily protein    5.2e-09    27.06
Query:  LWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNV-KDLCLQLCRQSDRNTKN
        +W+     K K F+W ++   L T D +++R+     N   C  C   +E   HLF  C +A+ +W   S      + TT +  +  ++L   S    + 
Subjt:  LWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNV-KDLCLQLCRQSDRNTKN

Query:  IISFNSAIATLWTIWIRRNNLIFADKDSSYLNA----------WEDICTLTGSWSSKSKTLKNYSQATIA
           FN AI  LW +W  RN L+F  K  S+ N           WED  T   S + +  +   + Q T+A
Subjt:  IISFNSAIATLWTIWIRRNNLIFADKDSSYLNA----------WEDICTLTGSWSSKSKTLKNYSQATIA

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    1.2e-08    38.27
Query:  LANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRK-IKGF-VLKLDLEKAFDKISWSFIDFMLAKKHFPHKW
        +  RLK  + + I   Q +FI GR   D I+   EA+ + +++K +KG+ +LKLDLEKA+D+I W +++  L    FP  W
Subjt:  LANRLKSALPDTIAENQMAFIKGRQINDAILIANEAIDTWKQRK-IKGF-VLKLDLEKAFDKISWSFIDFMLAKKHFPHKW

AT4G29090.1 Ribonuclease H-like superfamily protein    2.4e-22    25.13
Query:  SLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIP
        +LPTY ++ F  P +V K+I     DF W   ++ +  H   W+  +  K  GG+G   ++  N ALL K +WR  +   SL  K   ++Y   H+ D  
Subjt:  SLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKCIDAKYTKNHQGDIP

Query:  VVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQF------PRLYALSNMQSATVKEIWDQGSDDWNMEPRRPL-NEREQQT
             S  +  W +I   ++        +  +G  +  W  KW ++ P S         P+ YA S      V ++ D+   +W  +    L  E E++ 
Subjt:  VVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQF------PRLYALSNMQSATVKEIWDQGSDDWNMEPRRPL-NEREQQT

Query:  WDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASA----KDIAFKESSIPKETNWEKEL----KHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSM
           ++    RI ++      +W+ + S  YTV S       I  K SS P+E + E  L    + +W+S    K + F+W  +   L     +  R+ S 
Subjt:  WDSIKMSLPRIHNNRGMCKPSWNPSDSKKYTVASA----KDIAFKESSIPKETNWEKEL----KHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSM

Query:  SLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQL---CRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKD
            S CI C S  E +NHL   C FAR  W +  S    P+       + + L       + N +   +       LW +W  RN L+F  ++
Subjt:  SLNPSWCISCRSSNEDMNHLFIFCPFARNLWNMWSSETGTPMATTNVKDLCLQL---CRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    3.8e-12    47.76
Query:  LLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSF-NNYCNISHLLFADD
        ++NGAP+G +   RG+RQGDPLSP++F+L  + LS L    + +G + G+   NN   I+HLLFADD
Subjt:  LLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKGAIKGVSF-NNYCNISHLLFADD


Sequences
CDS sequence
ATGTGGTTGACCGAAGTTTGTCAACACAAATCCTTCTCCATGGACATCACTCCTGACACGCTGGCATGGATCAGAAACTGCTTCAAAGACTTGTTGGACACTTCAACAAC
CAAACACTTTTTTGCTGAAAGAAGGATGGAAGATAACTGCATGTGGGTAAGAAAGACCAAAAACAAGAGCAAAACTAGTATAACTGCTGAAATCTTCAGAATTGATAACA
AAGGGAGAAAATGTAGCATATTGGTACCAGAAGGGCCTGACAGCTTCGGATGGAAGTGCTTCTTAGCCTTGATCACTTTCAGATCTTCAGCTCCAACAAAGAGAATTCGA
TCAGAAATAAGGAAGGAGCCCGTCTCAACATTTTCAGACTCCTTCTCATCAGACTCAGACTCTTCCAGAAAATCTTATGCAAAAGTTCTTTCTGATAGCAGTGAAGATGA
CAACAAGAAGAGGTACAAAGCAACATCAGACGACAGTTCTAGCAGAAGAAGTTCATCGATTGGTTTTAAGCCTTTTACTCTCTCAGGAAATTCCTTTGAAAAAACTGTTA
TTATTACCAGGCGATGTTTCCATGATGACTGGAACAGAATCATGTTTTCCCTAAGAAAACAATCTGAAATAGCTTTCTCTTACAAACCGTTCCAAGCTGACAAAGCCATC
CTATTCCTGAATTCAGACCATGCTAAACTTCTGTGTAGCAACAAAGGTGCAAACGGATGGTCTACAGTGGGAAATTACCAGGTTAAATTCGAAAGCTGGGATTCAAACTT
ACACTCTTTTCACTCTGTTATCCCTAGTTATGGAGGCTGGCTTCGGTTTAGAGGAATTCCGCTTCATTTATGGAACTACAATACTTTCCAACACATTGGTTCGGCTTGTG
GAGGTTTCCTGGATGTTGCTAAGGAAACAATGCAAATGGATAAGCTCATTGATGCTAAAATCAAAGTCCGTTACAATTACACTGGCTTTGTTCCAGCCTCGATCCTGATC
ACTGATAACCAAGGCGAAAATTTTATAGTTACTACTGTTCAACCAGCCGAAGCCAGATGGCTCGTTGAAAGAAATGTTAGAGTTCATGGTTCCTTTAGAACAAAAGCTGC
TGATGAATTTGACCAACACAACCATTTAGCAGAAACTTACACTTACAATGGATTCCAAGCCATCCCGCCGGAACCAACAAGAACCCACGGTGACTACAGCATCCACAACT
CTGACAAACACTCTATCTCATATCACACACAAGCCAAGAAAAATAACTCCTCAGAATCTGAATACGATCCTTTTGATCAACAGCTCAGTGATAGAAGGAAAGAGAAAGGG
AAAGCTATCCTCATCATAAATGATCAAATCATGGCCATTACTCCAAAAGATCAAAAAGGATCTCCAACAGAAAAGTTTCTTTCTTATCCCCTGGTGGATCCTCAAGAATC
TACTGAAGATCATAATCTGTCTTTAAAAGAAACTGGTGAAGGTAGCAAACAAATGAATCTTTCGGTGGATATGGGCCCCATCTCCCCTCTGGAATCTATGATACAATCAG
AAAACAATCATGGACTTGACACTCTCAACAATCAGACACCTGATGGAAATTCAAAATCAACAGATAGTGCAGAAGCAAAAAATCTGACTGTCTCGGTTAAAGAAGGAGCT
GATCAAAACAAGTCTGCTTCAAGATCCACAGCTGAGGGAAATTCAAAGGATGCAAAGACTGGCAGTGAAATGGAAATCGACAGAGCCTTTAAGGAAAAACTTGTTATTTG
GCTGAAGGAAAACGAACTCAAATTGTCTCCTAAATATACCAATGATGTACCTAGTTCCTCATCTTTTCCTGTCATTGTATCTGACCAAAATATGGATATTGCAGGCCATG
GGCCTCTAGGGGATAAGGGTGGCATTTTAGTCTTATGGGATGATACCAAATTCAAAGTCAACGACATCAAAGGTTGGGAAAATGCCTTTGGCTTACATACGTCTCGGACG
TTGGAAAGAAACATCTCGGACCATTTCCCTATTCTTTTGGAGTCCCCTCAAATCAAATGGGGCCCCTGCCCTTTCAGACTCAATAACTCCTCCCTAAGGGACAAAGATTT
CCAGAAAAACTTCATAAATTGGTGGAACAACTCCAAACAAGCAGGCTTCCCGGGCTACGCCTTCATTCAAAGCCTAAATTCTCTATCAAAGTTCATTAAAGAGTGGCAAC
ATAACAAAGTCAACCTATATGATGCCAACAAAAAAGCCCTCCTGAAAGAGATTGACATAATTGACAAGTTAGAATTCCAAGGAGAAATGTCTACCACTCATCATCAAAAG
AGAATTTCTCTAAAATCGGACTTGTTAAGCATTGAAAACAATCAAGCTCAGATATGGCATCAAAGAGCAAGACAAAGATGGAACCTGTTGGGGGACGAAAACAACTCTTA
CTTCCACAGAATCTGCACCATTAACCAAAGGAAAAATCTAATCAAATCCATCTGTGACCCAGCCGGAACTTCTCTAGACTCAATTGATGATATTTCGAGGACATTCATCT
CTCATTTTCAGAATATATACACTAAAGAGAACTATGAAGAAATCCTTATAGATAACTTGAAGTTATGCAAGCCTTTCGATGAATCTGAAATAAAAAGCACAATTATGTCC
TTTAGCAACGAAAAGGCCCCAGGCCCGGATGGTTACACTATGCTCTTCTACAAGAAGCACTGGCCTGATCTCAAGGACGACTTGCTGAACGTTTTTAAGGATTTCCACAA
GACAGGCATTGTAAATAACAATGTTAACAACACTTTCATAGCCCTCATTAGCAAGAAAGAGAAGTGCAGCAAGCCCTCAGACTATCGCCCCATCAGCCTAACAACTTCCC
TATACAAGTTAATGGCAAAAGCTTTAGCTAACAGACTCAAATCCGCTCTTCCAGATACTATTGCGGAAAATCAAATGGCTTTCATTAAAGGAAGACAAATCAATGATGCA
ATTCTTATTGCAAATGAAGCAATTGACACTTGGAAACAAAGGAAAATAAAAGGTTTTGTCCTAAAGCTTGATCTTGAAAAAGCCTTCGATAAAATTAGCTGGAGCTTCAT
TGATTTTATGCTTGCAAAGAAGCACTTTCCACATAAATGGAGGAAATGGATTAAAGCCTGTATAAGCAATGTCCAATACTCCATTCTGCTAAATGGAGCCCCTAAAGGAA
GAATCAAGGCGGAAAGAGGTATTAGGCAGGGAGATCCTCTTTCTCCTTTCATCTTTGTTTTAGCCATGGACTACTTAAGCAGATTGCTTTCCCACCTGGAATCAAAAGGG
GCTATCAAAGGGGTCTCCTTCAATAACTATTGCAACATATCACACCTTCTATTTGCGGATGACGTTCTCATCTTTGTTGAAGACAATGAAAGGTATTTAAATAATTTACA
GATGGCGCTCACCCTCTTTGAGAAAGCCTCTGGGCTGACTTTCAACAACTCCAAGTCAACAATTAGCCCCATAAACATTTCAGCTGGAAGAACAGATCAGATTGCCAGCT
TCTTCGGGTTTCAAACTAAATTCCTTCCTGTTAACTACCTAGGAGTTCCGCTGGGAGGAAACCCAAGATCCAGGTCTTTTTGGAGTCAAACAATTGAATGTATCCACAAA
AAACTAAATGGTTGGAAATACTCACAAATTTCCAAAGGAGGAAGGCTTACTCTCTTAAAAGCTTCTCTGTCTAGTCTTCCTACCTATCAGCTCTCAACATTCAAAGCTCC
TGTCTCAGTATACAAAGAAATTGAAAAACATTGGCGAGATTTCTTGTGGGGAGGAAGTGAGGATAAACAAAATGCTCACCTAATCAACTGGAACATCTGTACCTCCCCTA
AGGAGCTAGGAGGCCTGGGAATCAGTAAGGTAAAAGACACAAACCAAGCCCTCCTATGCAAATGGCTATGGAGATACCACAATGAGTCAAACTCTCTTTGGAAAAAGTGT
ATAGATGCAAAATACACAAAAAATCATCAAGGAGACATTCCAGTGGTAGGCAGAAACAGTAGTGCAAACTCTCCTTGGAATGCAATCAAAAAATGGAAAGATTGGTATGA
ATCTAAAATAAGCTGGATGGCAAATGATGGCTCCTCTCTCTCCTTTTGGCACAGCAAGTGGCATAATAACATCCCTTTGTCCCTACAATTCCCCAGACTCTACGCTCTCT
CGAATATGCAATCAGCTACTGTAAAGGAAATTTGGGACCAAGGTTCTGACGATTGGAACATGGAACCAAGAAGACCTTTAAATGAAAGGGAACAACAAACATGGGATTCT
ATCAAAATGAGTTTACCTCGGATTCACAACAACAGAGGGATGTGCAAGCCTTCTTGGAATCCAAGTGACAGCAAGAAGTACACAGTGGCTTCAGCAAAAGATATAGCATT
CAAGGAAAGCTCAATTCCAAAGGAAACAAACTGGGAAAAGGAGCTCAAACATCTTTGGAGATCTCACATTCCGCAAAAATGTAAATTTTTCATTTGGACCATGGTTCATC
AGAAACTCAACACTATGGATAAAATCCAAAAGAGGAACCCAAGCATGAGCCTCAATCCAAGCTGGTGCATTTCCTGCCGCTCCTCAAACGAAGACATGAACCATTTATTC
ATTTTCTGTCCCTTTGCCCGCAACCTTTGGAATATGTGGAGTTCGGAAACAGGTACTCCCATGGCCACCACAAACGTAAAAGACCTCTGTTTACAACTATGTAGGCAATC
AGACAGAAATACTAAGAATATCATCAGCTTCAATTCAGCTATAGCTACCTTATGGACAATTTGGATTCGAAGAAATAATCTTATCTTTGCGGATAAGGACTCATCCTATC
TAAATGCTTGGGAAGACATATGCACTCTCACTGGAAGTTGGTCTTCAAAGAGTAAAACTCTCAAAAATTATAGTCAAGCTACTATAGCACTAAACATAAAGGCATTATGT
AACCTCCCCATGTAA
mRNA sequence
ATGTGGTTGACCGAAGTTTGTCAACACAAATCCTTCTCCATGGACATCACTCCTGACACGCTGGCATGGATCAGAAACTGCTTCAAAGACTTGTTGGACACTTCAACAAC
CAAACACTTTTTTGCTGAAAGAAGGATGGAAGATAACTGCATGTGGGTAAGAAAGACCAAAAACAAGAGCAAAACTAGTATAACTGCTGAAATCTTCAGAATTGATAACA
AAGGGAGAAAATGTAGCATATTGGTACCAGAAGGGCCTGACAGCTTCGGATGGAAGTGCTTCTTAGCCTTGATCACTTTCAGATCTTCAGCTCCAACAAAGAGAATTCGA
TCAGAAATAAGGAAGGAGCCCGTCTCAACATTTTCAGACTCCTTCTCATCAGACTCAGACTCTTCCAGAAAATCTTATGCAAAAGTTCTTTCTGATAGCAGTGAAGATGA
CAACAAGAAGAGGTACAAAGCAACATCAGACGACAGTTCTAGCAGAAGAAGTTCATCGATTGGTTTTAAGCCTTTTACTCTCTCAGGAAATTCCTTTGAAAAAACTGTTA
TTATTACCAGGCGATGTTTCCATGATGACTGGAACAGAATCATGTTTTCCCTAAGAAAACAATCTGAAATAGCTTTCTCTTACAAACCGTTCCAAGCTGACAAAGCCATC
CTATTCCTGAATTCAGACCATGCTAAACTTCTGTGTAGCAACAAAGGTGCAAACGGATGGTCTACAGTGGGAAATTACCAGGTTAAATTCGAAAGCTGGGATTCAAACTT
ACACTCTTTTCACTCTGTTATCCCTAGTTATGGAGGCTGGCTTCGGTTTAGAGGAATTCCGCTTCATTTATGGAACTACAATACTTTCCAACACATTGGTTCGGCTTGTG
GAGGTTTCCTGGATGTTGCTAAGGAAACAATGCAAATGGATAAGCTCATTGATGCTAAAATCAAAGTCCGTTACAATTACACTGGCTTTGTTCCAGCCTCGATCCTGATC
ACTGATAACCAAGGCGAAAATTTTATAGTTACTACTGTTCAACCAGCCGAAGCCAGATGGCTCGTTGAAAGAAATGTTAGAGTTCATGGTTCCTTTAGAACAAAAGCTGC
TGATGAATTTGACCAACACAACCATTTAGCAGAAACTTACACTTACAATGGATTCCAAGCCATCCCGCCGGAACCAACAAGAACCCACGGTGACTACAGCATCCACAACT
CTGACAAACACTCTATCTCATATCACACACAAGCCAAGAAAAATAACTCCTCAGAATCTGAATACGATCCTTTTGATCAACAGCTCAGTGATAGAAGGAAAGAGAAAGGG
AAAGCTATCCTCATCATAAATGATCAAATCATGGCCATTACTCCAAAAGATCAAAAAGGATCTCCAACAGAAAAGTTTCTTTCTTATCCCCTGGTGGATCCTCAAGAATC
TACTGAAGATCATAATCTGTCTTTAAAAGAAACTGGTGAAGGTAGCAAACAAATGAATCTTTCGGTGGATATGGGCCCCATCTCCCCTCTGGAATCTATGATACAATCAG
AAAACAATCATGGACTTGACACTCTCAACAATCAGACACCTGATGGAAATTCAAAATCAACAGATAGTGCAGAAGCAAAAAATCTGACTGTCTCGGTTAAAGAAGGAGCT
GATCAAAACAAGTCTGCTTCAAGATCCACAGCTGAGGGAAATTCAAAGGATGCAAAGACTGGCAGTGAAATGGAAATCGACAGAGCCTTTAAGGAAAAACTTGTTATTTG
GCTGAAGGAAAACGAACTCAAATTGTCTCCTAAATATACCAATGATGTACCTAGTTCCTCATCTTTTCCTGTCATTGTATCTGACCAAAATATGGATATTGCAGGCCATG
GGCCTCTAGGGGATAAGGGTGGCATTTTAGTCTTATGGGATGATACCAAATTCAAAGTCAACGACATCAAAGGTTGGGAAAATGCCTTTGGCTTACATACGTCTCGGACG
TTGGAAAGAAACATCTCGGACCATTTCCCTATTCTTTTGGAGTCCCCTCAAATCAAATGGGGCCCCTGCCCTTTCAGACTCAATAACTCCTCCCTAAGGGACAAAGATTT
CCAGAAAAACTTCATAAATTGGTGGAACAACTCCAAACAAGCAGGCTTCCCGGGCTACGCCTTCATTCAAAGCCTAAATTCTCTATCAAAGTTCATTAAAGAGTGGCAAC
ATAACAAAGTCAACCTATATGATGCCAACAAAAAAGCCCTCCTGAAAGAGATTGACATAATTGACAAGTTAGAATTCCAAGGAGAAATGTCTACCACTCATCATCAAAAG
AGAATTTCTCTAAAATCGGACTTGTTAAGCATTGAAAACAATCAAGCTCAGATATGGCATCAAAGAGCAAGACAAAGATGGAACCTGTTGGGGGACGAAAACAACTCTTA
CTTCCACAGAATCTGCACCATTAACCAAAGGAAAAATCTAATCAAATCCATCTGTGACCCAGCCGGAACTTCTCTAGACTCAATTGATGATATTTCGAGGACATTCATCT
CTCATTTTCAGAATATATACACTAAAGAGAACTATGAAGAAATCCTTATAGATAACTTGAAGTTATGCAAGCCTTTCGATGAATCTGAAATAAAAAGCACAATTATGTCC
TTTAGCAACGAAAAGGCCCCAGGCCCGGATGGTTACACTATGCTCTTCTACAAGAAGCACTGGCCTGATCTCAAGGACGACTTGCTGAACGTTTTTAAGGATTTCCACAA
GACAGGCATTGTAAATAACAATGTTAACAACACTTTCATAGCCCTCATTAGCAAGAAAGAGAAGTGCAGCAAGCCCTCAGACTATCGCCCCATCAGCCTAACAACTTCCC
TATACAAGTTAATGGCAAAAGCTTTAGCTAACAGACTCAAATCCGCTCTTCCAGATACTATTGCGGAAAATCAAATGGCTTTCATTAAAGGAAGACAAATCAATGATGCA
ATTCTTATTGCAAATGAAGCAATTGACACTTGGAAACAAAGGAAAATAAAAGGTTTTGTCCTAAAGCTTGATCTTGAAAAAGCCTTCGATAAAATTAGCTGGAGCTTCAT
TGATTTTATGCTTGCAAAGAAGCACTTTCCACATAAATGGAGGAAATGGATTAAAGCCTGTATAAGCAATGTCCAATACTCCATTCTGCTAAATGGAGCCCCTAAAGGAA
GAATCAAGGCGGAAAGAGGTATTAGGCAGGGAGATCCTCTTTCTCCTTTCATCTTTGTTTTAGCCATGGACTACTTAAGCAGATTGCTTTCCCACCTGGAATCAAAAGGG
GCTATCAAAGGGGTCTCCTTCAATAACTATTGCAACATATCACACCTTCTATTTGCGGATGACGTTCTCATCTTTGTTGAAGACAATGAAAGGTATTTAAATAATTTACA
GATGGCGCTCACCCTCTTTGAGAAAGCCTCTGGGCTGACTTTCAACAACTCCAAGTCAACAATTAGCCCCATAAACATTTCAGCTGGAAGAACAGATCAGATTGCCAGCT
TCTTCGGGTTTCAAACTAAATTCCTTCCTGTTAACTACCTAGGAGTTCCGCTGGGAGGAAACCCAAGATCCAGGTCTTTTTGGAGTCAAACAATTGAATGTATCCACAAA
AAACTAAATGGTTGGAAATACTCACAAATTTCCAAAGGAGGAAGGCTTACTCTCTTAAAAGCTTCTCTGTCTAGTCTTCCTACCTATCAGCTCTCAACATTCAAAGCTCC
TGTCTCAGTATACAAAGAAATTGAAAAACATTGGCGAGATTTCTTGTGGGGAGGAAGTGAGGATAAACAAAATGCTCACCTAATCAACTGGAACATCTGTACCTCCCCTA
AGGAGCTAGGAGGCCTGGGAATCAGTAAGGTAAAAGACACAAACCAAGCCCTCCTATGCAAATGGCTATGGAGATACCACAATGAGTCAAACTCTCTTTGGAAAAAGTGT
ATAGATGCAAAATACACAAAAAATCATCAAGGAGACATTCCAGTGGTAGGCAGAAACAGTAGTGCAAACTCTCCTTGGAATGCAATCAAAAAATGGAAAGATTGGTATGA
ATCTAAAATAAGCTGGATGGCAAATGATGGCTCCTCTCTCTCCTTTTGGCACAGCAAGTGGCATAATAACATCCCTTTGTCCCTACAATTCCCCAGACTCTACGCTCTCT
CGAATATGCAATCAGCTACTGTAAAGGAAATTTGGGACCAAGGTTCTGACGATTGGAACATGGAACCAAGAAGACCTTTAAATGAAAGGGAACAACAAACATGGGATTCT
ATCAAAATGAGTTTACCTCGGATTCACAACAACAGAGGGATGTGCAAGCCTTCTTGGAATCCAAGTGACAGCAAGAAGTACACAGTGGCTTCAGCAAAAGATATAGCATT
CAAGGAAAGCTCAATTCCAAAGGAAACAAACTGGGAAAAGGAGCTCAAACATCTTTGGAGATCTCACATTCCGCAAAAATGTAAATTTTTCATTTGGACCATGGTTCATC
AGAAACTCAACACTATGGATAAAATCCAAAAGAGGAACCCAAGCATGAGCCTCAATCCAAGCTGGTGCATTTCCTGCCGCTCCTCAAACGAAGACATGAACCATTTATTC
ATTTTCTGTCCCTTTGCCCGCAACCTTTGGAATATGTGGAGTTCGGAAACAGGTACTCCCATGGCCACCACAAACGTAAAAGACCTCTGTTTACAACTATGTAGGCAATC
AGACAGAAATACTAAGAATATCATCAGCTTCAATTCAGCTATAGCTACCTTATGGACAATTTGGATTCGAAGAAATAATCTTATCTTTGCGGATAAGGACTCATCCTATC
TAAATGCTTGGGAAGACATATGCACTCTCACTGGAAGTTGGTCTTCAAAGAGTAAAACTCTCAAAAATTATAGTCAAGCTACTATAGCACTAAACATAAAGGCATTATGT
AACCTCCCCATGTAA
Protein sequence
MWLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSKTSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLALITFRSSAPTKRIR
SEIRKEPVSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGNSFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKPFQADKAI
LFLNSDHAKLLCSNKGANGWSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDAKIKVRYNYTGFVPASILI
TDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKHSISYHTQAKKNNSSESEYDPFDQQLSDRRKEKG
KAILIINDQIMAITPKDQKGSPTEKFLSYPLVDPQESTEDHNLSLKETGEGSKQMNLSVDMGPISPLESMIQSENNHGLDTLNNQTPDGNSKSTDSAEAKNLTVSVKEGA
DQNKSASRSTAEGNSKDAKTGSEMEIDRAFKEKLVIWLKENELKLSPKYTNDVPSSSSFPVIVSDQNMDIAGHGPLGDKGGILVLWDDTKFKVNDIKGWENAFGLHTSRT
LERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFINWWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQK
RISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKENYEEILIDNLKLCKPFDESEIKSTIMS
FSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKTGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKLMAKALANRLKSALPDTIAENQMAFIKGRQINDA
ILIANEAIDTWKQRKIKGFVLKLDLEKAFDKISWSFIDFMLAKKHFPHKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLESKG
AIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKSTISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWSQTIECIHK
KLNGWKYSQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINWNICTSPKELGGLGISKVKDTNQALLCKWLWRYHNESNSLWKKC
IDAKYTKNHQGDIPVVGRNSSANSPWNAIKKWKDWYESKISWMANDGSSLSFWHSKWHNNIPLSLQFPRLYALSNMQSATVKEIWDQGSDDWNMEPRRPLNEREQQTWDS
IKMSLPRIHNNRGMCKPSWNPSDSKKYTVASAKDIAFKESSIPKETNWEKELKHLWRSHIPQKCKFFIWTMVHQKLNTMDKIQKRNPSMSLNPSWCISCRSSNEDMNHLF
IFCPFARNLWNMWSSETGTPMATTNVKDLCLQLCRQSDRNTKNIISFNSAIATLWTIWIRRNNLIFADKDSSYLNAWEDICTLTGSWSSKSKTLKNYSQATIALNIKALC
NLPM