; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004695 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004695
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr03:1142458..1145900
RNA-Seq ExpressionPay0004695
SyntenyPay0004695
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0080.7Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FI+DSNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNNV N
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WWKNLRQEGHPGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L+  FKEAQIWYQKSKRLW TEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQNLCS FTEEEIH ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEF+K+ W VLK++I NIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
        LLNSVGE IKGVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----
        KQ TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG     
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----

Query:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                           SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K
        STN++SSI+DMWN TLMDWDL PRRQLR+WEHPLWAELKNSLNASFCENG DSP W LNS+G Y+VAS                               K
Subjt:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K

Query:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA
        C FFIWTLLYDSVNT EQL KR+PNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHL+SNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASA
Subjt:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0076.72Show/hide
Query:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD
        MWDDLRFNVTDFIEG +SLSININ PDGPS+SAWWLSAIYGP+GGRNR SFWAELLDLKNKCSPT                      K+SMR FN FI+D
Subjt:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD

Query:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH
        SNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNN+ NWWKNLRQEGH
Subjt:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH

Query:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS
        PGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L   FKEAQIWYQKSKRLW TEGDENTSFFHKICS
Subjt:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS

Query:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW
        ARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQ LCS FTEEEIH ALTAFS+NKSP     T        
Subjt:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW

Query:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW
                           ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVK RQIIDAILVANEAIDYW
Subjt:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK
        R KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE IK
Subjt:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK

Query:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN
        GVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGGKQTTKAFWKN
Subjt:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN

Query:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG---------------
        ++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG               
Subjt:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG---------------

Query:  ------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKD
                                 SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWH HWHQNSPLS HYPRL+ALSTN++SSI+D
Subjt:  ------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKD

Query:  MWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLY
        MWN TLMDWDL PRRQLR+WE PLWAELKNS+NASFCENG+DSP W LNS+G Y+VAS                               KC FFIWTLLY
Subjt:  MWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLY

Query:  DSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNA
        DSVNT EQL KR+P LCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHLNSNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASALWNIWLERNA
Subjt:  DSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNA

Query:  RIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        RIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  RIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0077.5Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FISDSNLIDPPLSNAK+TWSNL+ Q ILSRIDRFLYT DWENLFTAHYSK                 ++ WGPLPFK INVHLKEPWFKNN+RN
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WW NLR EGHPGFSFMKKLK+LSVIIRDEQKKNNR +DE+KRAW+KEVDNIDRLEAEGNL EELSLRRTK KADILL DFK AQIWYQKSKRLWNTEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAK FLDHFEGIYNGGGVENPWLIENLSWSPIST+QAQNLCS F+EEEIH+ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEFFKAAWFVLKDDIFNIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPS VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGF                         GYPIKWRRWIKACISSVQYSIIINGRPR                             
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
                                         EDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQW ITTKFFPINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGD
        K TTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVST K IEKSWRNFFWKN SETHKLHL                   
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGD

Query:  IPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK
                                   VSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQD+SIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK
Subjt:  IPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK

Query:  NSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRN
        NSLNASFCENGRDSPTW+LNSDGFYSVAS                               KC FFIWTLLYDSVNT +QLTKRMPNLCSRPSWCVMCKRN
Subjt:  NSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRN

Query:  DEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSS
        +EDRIHLFILCPIAKSIWNLISSHL SNVNCLSPK LC TMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSR  
Subjt:  DEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSS

Query:  LFTNYQATSIALNLNAFS
        LFTNYQATSIALNLNAF+
Subjt:  LFTNYQATSIALNLNAFS

TYK24536.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0066.88Show/hide
Query:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD
        MWDDLRF+VTD IEG +SLSININFPDGPSSSAWWLSAIYGP+GGRNR SFWAELLDLKNKCSPT                      K+SMR FN FISD
Subjt:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD

Query:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH
        SNLIDPPLSNAKFTWSNLRVQ +LSRIDRFLYT +WENLFTAHYSK                  + WGP PFKFINVHLKEPWFK NV  WWKNLRQ GH
Subjt:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH

Query:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS
        PGFSFMKKLK LS IIRDEQKKN   +DEEK+AWIKE+DNIDRLEAEGN SEELSLRRTK+KAD+L+  FKEAQIWYQKSKRLW TEGDENTSFFHKICS
Subjt:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS

Query:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW
        ARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NLSWSPISTTQAQNLCS FTEEEIH ALTAFSNNKSPGPDGFTMEF+K+ W
Subjt:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW

Query:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW
         VLK++IFNIFRDFH NCIINKAVN+TNIALIAKKEKCAEPADYRPI                         +AENQ+AFVKGRQIIDAILVANEAIDYW
Subjt:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK
        RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP +WR+WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE IK
Subjt:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK

Query:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN
        GVKM  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+ AART+QIASQW I+TKF PINYLGVPLGGKQTTK+FWKN
Subjt:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN

Query:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSS
        ++EKI+KKL SWKYSMLSKG          A+ P        +P +      K W  F                                          
Subjt:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSS

Query:  RSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCEN
                                                       +F   +  DSSI+DMWN TLMDWDL PRRQ+RDWE+PLWAELKNSLN SFCEN
Subjt:  RSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCEN

Query:  GRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFIL
        G+DSPTW LNSDGFY+VAS                               KC FFIWTLLYDSVNT EQLTKRMPNLCSRPS                  
Subjt:  GRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFIL

Query:  CPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSI
                                         WKQKTKKN+IL+NTYASALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLFTNYQA+SI
Subjt:  CPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSI

Query:  ALNLNAFS
        ALNLNAFS
Subjt:  ALNLNAFS

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]0.0e+0080.61Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FI+DSNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNNV N
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WWKNLRQEGHPGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L+  FKEAQIWYQKSKRLW TEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQNLCS FTEEEIH ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEF+K+ W VLK++I NIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
        LLNSVGE IKGVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----
        KQ TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFK PVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG     
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----

Query:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                           SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K
        STN++SSI+DMWN TLMDWDL PRRQLR+WEHPLWAELKNSLNASFCENG DSP W LNS+G Y+VAS                               K
Subjt:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K

Query:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA
        C FFIWTLLYDSVNT EQL KR+PNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHL+SNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASA
Subjt:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein0.0e+0080.61Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FI+DSNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNNV N
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WWKNLRQEGHPGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L+  FKEAQIWYQKSKRLW TEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQNLCS FTEEEIH ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEF+K+ W VLK++I NIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
        LLNSVGE IKGVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----
        KQ TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFK PVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG     
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----

Query:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                           SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K
        STN++SSI+DMWN TLMDWDL PRRQLR+WEHPLWAELKNSLNASFCENG DSP W LNS+G Y+VAS                               K
Subjt:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K

Query:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA
        C FFIWTLLYDSVNT EQL KR+PNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHL+SNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASA
Subjt:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein0.0e+0076.72Show/hide
Query:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD
        MWDDLRFNVTDFIEG +SLSININ PDGPS+SAWWLSAIYGP+GGRNR SFWAELLDLKNKCSPT                      K+SMR FN FI+D
Subjt:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD

Query:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH
        SNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNN+ NWWKNLRQEGH
Subjt:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH

Query:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS
        PGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L   FKEAQIWYQKSKRLW TEGDENTSFFHKICS
Subjt:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS

Query:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW
        ARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQ LCS FTEEEIH ALTAFS+NKSP     T        
Subjt:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW

Query:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW
                           ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVK RQIIDAILVANEAIDYW
Subjt:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK
        R KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE IK
Subjt:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK

Query:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN
        GVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGGKQTTKAFWKN
Subjt:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN

Query:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG---------------
        ++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG               
Subjt:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG---------------

Query:  ------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKD
                                 SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWH HWHQNSPLS HYPRL+ALSTN++SSI+D
Subjt:  ------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKD

Query:  MWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLY
        MWN TLMDWDL PRRQLR+WE PLWAELKNS+NASFCENG+DSP W LNS+G Y+VAS                               KC FFIWTLLY
Subjt:  MWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLY

Query:  DSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNA
        DSVNT EQL KR+P LCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHLNSNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASALWNIWLERNA
Subjt:  DSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNA

Query:  RIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        RIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  RIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein0.0e+0077.5Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FISDSNLIDPPLSNAK+TWSNL+ Q ILSRIDRFLYT DWENLFTAHYSK                 ++ WGPLPFK INVHLKEPWFKNN+RN
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WW NLR EGHPGFSFMKKLK+LSVIIRDEQKKNNR +DE+KRAW+KEVDNIDRLEAEGNL EELSLRRTK KADILL DFK AQIWYQKSKRLWNTEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAK FLDHFEGIYNGGGVENPWLIENLSWSPIST+QAQNLCS F+EEEIH+ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEFFKAAWFVLKDDIFNIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPS VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGF                         GYPIKWRRWIKACISSVQYSIIINGRPR                             
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
                                         EDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQW ITTKFFPINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGD
        K TTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVST K IEKSWRNFFWKN SETHKLHL                   
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGD

Query:  IPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK
                                   VSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQD+SIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK
Subjt:  IPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELK

Query:  NSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRN
        NSLNASFCENGRDSPTW+LNSDGFYSVAS                               KC FFIWTLLYDSVNT +QLTKRMPNLCSRPSWCVMCKRN
Subjt:  NSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRN

Query:  DEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSS
        +EDRIHLFILCPIAKSIWNLISSHL SNVNCLSPK LC TMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSR  
Subjt:  DEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSS

Query:  LFTNYQATSIALNLNAFS
        LFTNYQATSIALNLNAF+
Subjt:  LFTNYQATSIALNLNAFS

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein0.0e+0066.88Show/hide
Query:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD
        MWDDLRF+VTD IEG +SLSININFPDGPSSSAWWLSAIYGP+GGRNR SFWAELLDLKNKCSPT                      K+SMR FN FISD
Subjt:  MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPT---------------------CKYSMRNFNNFISD

Query:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH
        SNLIDPPLSNAKFTWSNLRVQ +LSRIDRFLYT +WENLFTAHYSK                  + WGP PFKFINVHLKEPWFK NV  WWKNLRQ GH
Subjt:  SNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGH

Query:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS
        PGFSFMKKLK LS IIRDEQKKN   +DEEK+AWIKE+DNIDRLEAEGN SEELSLRRTK+KAD+L+  FKEAQIWYQKSKRLW TEGDENTSFFHKICS
Subjt:  PGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICS

Query:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW
        ARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NLSWSPISTTQAQNLCS FTEEEIH ALTAFSNNKSPGPDGFTMEF+K+ W
Subjt:  ARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAW

Query:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW
         VLK++IFNIFRDFH NCIINKAVN+TNIALIAKKEKCAEPADYRPI                         +AENQ+AFVKGRQIIDAILVANEAIDYW
Subjt:  FVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK
        RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP +WR+WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE IK
Subjt:  RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIK

Query:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN
        GVKM  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+ AART+QIASQW I+TKF PINYLGVPLGGKQTTK+FWKN
Subjt:  GVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAFWKN

Query:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSS
        ++EKI+KKL SWKYSMLSKG          A+ P        +P +      K W  F                                          
Subjt:  IDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSS

Query:  RSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCEN
                                                       +F   +  DSSI+DMWN TLMDWDL PRRQ+RDWE+PLWAELKNSLN SFCEN
Subjt:  RSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCEN

Query:  GRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFIL
        G+DSPTW LNSDGFY+VAS                               KC FFIWTLLYDSVNT EQLTKRMPNLCSRPS                  
Subjt:  GRDSPTWVLNSDGFYSVAS-------------------------------KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFIL

Query:  CPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSI
                                         WKQKTKKN+IL+NTYASALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLFTNYQA+SI
Subjt:  CPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSI

Query:  ALNLNAFS
        ALNLNAFS
Subjt:  ALNLNAFS

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein0.0e+0080.7Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN
        MR FN FI+DSNLIDPPLSNAKFTWSNLRV  +LSRIDRFLYTT+WENLFTAHYSK                  + WGP PFK INVHLKEPWFKNNV N
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFLYTTDWENLFTAHYSK----------------PSLDWGPLPFKFINVHLKEPWFKNNVRN

Query:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE
        WWKNLRQEGHPGFSFM+KLK LS IIR+EQ+KN   SDE+K AWIKE+D+IDRLEAEGNLSEELSLRRT++KAD+L+  FKEAQIWYQKSKRLW TEGDE
Subjt:  WWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDE

Query:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG
        NTSFFHKICSARQRRSIISNINS DGVPC+TNE+IAKAFLDHFE IY GGG E+PWLI+NL+WSPIST QAQNLCS FTEEEIH ALTAFSNNKSPGPDG
Subjt:  NTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDG

Query:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI
        FTMEF+K+ W VLK++I NIFRDFH NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP  VAENQ+AFVKGRQIIDAI
Subjt:  FTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
        LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYP KWR WI+ACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG
        LLNSVGE IKGVK+  N+NLTHLLFADDILLFVEDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+RT+QIASQW I+TKF PINYLGVPLGG
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGG

Query:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----
        KQ TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVSTCK IEK+WRNF WKNP ETHKLHLV+WAKITS KE+GG     
Subjt:  KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG-----

Query:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                           SKGDIPCVCNHSSSRSPWFSICKGL WFQRHVSWKIKNGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ----------------------------------QSKGDIPCVCNHSSSRSPWFSICKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K
        STN++SSI+DMWN TLMDWDL PRRQLR+WEHPLWAELKNSLNASFCENG DSP W LNS+G Y+VAS                               K
Subjt:  STNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSVAS-------------------------------K

Query:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA
        C FFIWTLLYDSVNT EQL KR+PNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW  ISSHL+SNVNCLSPK LC TMCSWKQKTKKN+ILFNTYASA
Subjt:  CNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSRSSLF+NYQA+SIALNLNAFS
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.5e-3823.23Show/hide
Query:  QKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPC
        ++K  R   +   + +KE++  ++  ++ +  +E+    TKI+A++   + ++      +S+  +    ++      ++   ++ ++ I  I +D G   
Subjt:  QKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPC

Query:  TTNENIAKAFLDHFEGIYNGGGVEN----PWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFH
        T    I     ++++ +Y    +EN       ++  +   ++  + ++L    T  EI   + +    KSPGPDGFT EF++     L   +  +F+   
Subjt:  TTNENIAKAFLDHFEGIYNGGGVEN----PWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFH

Query:  MNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLD
           I+  +    +I LI K  +   +  ++RPISL     K++ K++A R+++ +  ++  +Q+ F+ G Q    I  +   I +  R K     +I +D
Subjt:  MNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLD

Query:  IEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKM-NDNLNLTH
         EKAFDK+   F+   L K G    + + I+A       +II+NG+         G RQG P+SP +F + ++ ++R +    E IKG+++  + + L+ 
Subjt:  IEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKM-NDNLNLTH

Query:  LLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPL--GGKQTTKAFWKNIDEKISKKLA
         LFADD+++++E+   S QNL  +I+ F   SG  IN+ KS     N +     QI  +   T     I YLG+ L    K   K  +K + ++I +   
Subjt:  LLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPL--GGKQTTKAFWKNIDEKISKKLA

Query:  SWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSSRSPWFSIC
         WK    S  G+I ++K  +     Y+ +    K P++    +EK+   F W       K   ++ + ++   + GG +  D       + +++ W+   
Subjt:  SWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSSRSPWFSIC

Query:  KGLAWFQ
            W+Q
Subjt:  KGLAWFQ

P08548 LINE-1 reverse transcriptase homolog1.4e-3623.24Show/hide
Query:  FKFINVHLKEPWFKNNV-RNWWKNLRQEGHPGFSFMKKLKHLSVIIRDE-------QKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKA
        +K  N+ LK+ W  + + +   K L Q  +   ++         ++R +        KK  R+        +K+++  +    + +  +E+    TKI+A
Subjt:  FKFINVHLKEPWFKNNV-RNWWKNLRQEGHPGFSFMKKLKHLSVIIRDE-------QKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKA

Query:  DILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYN---GGGVENPWLIENLSWSPISTTQ
        ++   + K       KSK  +  + ++       +   ++ +S+IS+I + +    T    I K   ++++ +Y+       E    +E      +S  +
Subjt:  DILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYN---GGGVENPWLIENLSWSPISTTQ

Query:  AQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAK
         + L    +  EI + +      KSPGPDGFT EF++     L   + N+F++     I+       NI LI K  K      +YRPISL     K++ K
Subjt:  AQNLCSFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAK

Query:  VIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIING
        ++  R+++ +  ++  +Q+ F+ G Q    I  +   I +  ++K     ++ +D EKAFD +   F+   L K G    + + I+A  S    +II+NG
Subjt:  VIAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIING

Query:  RPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKM-NDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKST--I
                  G RQG P+SP +F + M+ ++  +    + IKG+ + ++ + L+  LFADD+++++E+   S   L  +I  +   SG  IN +KS   I
Subjt:  RPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKM-NDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKST--I

Query:  SPINIDAARTDQIASQWRITTKFFPINYLGVPL--GGKQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKII
           N  A +T + +  + +  K   + YLGV L    K   K  ++ + ++I++ +  WK    S  G+I ++K ++     Y  +    KAP+S  K +
Subjt:  SPINIDAARTDQIASQWRITTKFFPINYLGVPL--GGKQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKII

Query:  EKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSSRSPWF
        EK   +F W       K   ++   +++  + GG +  D+         ++ W+
Subjt:  EKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSSRSPWF

P0C2F6 Putative ribonuclease H protein At1g657507.8e-1624.02Show/hide
Query:  VPLGGKQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG
        +P+  K+  K  +  I E++S +++ W+   LS  G++TL K+ L+S+P + +S    P S    +++  R F W + +E  K HLV W+K+ SPK+ GG
Subjt:  VPLGGKQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGG

Query:  ----------------------QSKG-----------------DIPCVCNHSSSRSPWFSICKGLAWFQRH-VSWKIKNGRNFSFWHSHWHQNSPLSLHY
                              Q K                  D   +    S  S W SI  GL     H V W   +G+   FW   W    PL L  
Subjt:  ----------------------QSKG-----------------DIPCVCNHSSSRSPWFSICKGLAWFQRH-VSWKIKNGRNFSFWHSHWHQNSPLSLHY

Query:  PRLFALSTNQDSSIKDMWNTTLMDWDL---------KPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVL-----------NSDGFYS------VAS
              +       KD+W      WD            R +LR     L    ++ L+  F ++G+ S                N   F++      V  
Subjt:  PRLFALSTNQDSSIKDMWNTTLMDWDL---------KPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVL-----------NSDGFYS------VAS

Query:  KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW
        +   F+W +   +V T E+  +R     S  + C +CK   E  +H+   CP    IW
Subjt:  KCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-3625.24Show/hide
Query:  KICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVEN----PWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFT
        ++    + + +I+ I ++ G   T  E I       ++ +Y+   +EN       ++      ++  Q  +L S  + +EI   + +    KSPGPDGF+
Subjt:  KICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVEN----PWLIENLSWSPISTTQAQNLCSFFTEEEIHTALTAFSNNKSPGPDGFT

Query:  MEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAIL
         EF++     L   +  +F    +   +  +     I LI K +K   +  ++RPISL     K++ K++A R++E + +++  +Q+ F+ G Q    I 
Subjt:  MEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQLAFVKGRQIIDAIL

Query:  VANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR
         +   I Y  ++K     +I LD EKAFDK+   F+  +L + G    +   IKA  S    +I +NG     I    G RQG P+SP++F + ++ ++R
Subjt:  VANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISR

Query:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFF----PINYLGV
         +    E IKG+++     +   L ADD+++++ D ++S + L N+IN F    G  IN NKS    +     +  Q   + R TT F      I YLGV
Subjt:  LLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFF----PINYLGV

Query:  PLGG--KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKE
         L    K      +K++ ++I + L  WK    S  G+I ++K  +     Y+ +    K P      +E +   F W N     K   ++ + +   + 
Subjt:  PLGG--KQTTKAFWKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSI--FKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKE

Query:  RGGQSKGDIPCVCNHSSSRSPWF
         GG +  D+         ++ W+
Subjt:  RGGQSKGDIPCVCNHSSSRSPWF

P14381 Transposon TX1 uncharacterized 149 kDa protein9.8e-3523.64Show/hide
Query:  FINVHLKEPWFKNNVRNWWKNLRQEGHPGFSFMKK-----LKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLY
        F N  L++  F  +VR+ W+  R      F+ + +       HL ++ ++  K  + Q + E  A   EV ++++    G+  + L     + K  +   
Subjt:  FINVHLKEPWFKNNVRNWWKNLRQEGHPGFSFMKK-----LKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLRRTKIKADILLY

Query:  DFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWS---PISTTQAQNLC
        + ++A+  + +S+     + D  + FF+ +   +  R  I+ + ++DG P    E I       ++ +++   + +P   E L W     +S  + + L 
Subjt:  DFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWS---PISTTQAQNLC

Query:  SFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERL
        +  T +E+  AL    +NKSPG DG T+EFF+  W  L  D   +  +      +  +     ++L+ KK       ++RP+SL ++ YK++AK I+ RL
Subjt:  SFFTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERL

Query:  KETLPSMVAENQLAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQ
        K  L  ++  +Q   V GR I D + +  + + + R   +    + LD EKAFD+++ +++   L    +  ++  ++K   +S +  + IN      + 
Subjt:  KETLPSMVAENQLAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQ

Query:  PSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKMND-NLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAAR
          RG+RQG P+S  ++ LA++    LL    + + G+ + + ++ +    +ADD++L V  D   L+  +    ++  AS   IN +KS  S +   + +
Subjt:  PSRGIRQGDPISPFIFVLAMDYISRLLNSVGENIKGVKMND-NLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAAR

Query:  TDQIASQWR-ITTKFFPINYLGVPLGGKQ-TTKAFWKNIDEKISKKLASWK--YSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFW
         D +   +R I+ +   I YLGV L  ++      +  ++E +  +L  WK    +LS  G+  +I   +AS   Y+L            I++   +F W
Subjt:  TDQIASQWR-ITTKFFPINYLGVPLGGKQ-TTKAFWKNIDEKISKKLASWK--YSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFW

Query:  KNPSETHKLHLVSWAKITSPKERGGQ
                 H VS    + P + GGQ
Subjt:  KNPSETHKLHLVSWAKITSPKERGGQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.2e-2628.53Show/hide
Query:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQ-ILSRIDRFLYTTDWENLFTAHYSKPSL----DWGP-------LP---------FKFINVHLKEPWFK
        +  F N + DS+L+D P     +TWSN +    I+ ++DR +   DW + F +  +   L    D  P       LP         F F++ H   P F 
Subjt:  MRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQ-ILSRIDRFLYTTDWENLFTAHYSKPSL----DWGP-------LP---------FKFINVHLKEPWFK

Query:  NNVRNWWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKE-VDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEA--QIWYQKSKR
         ++   W+     G   FS  + LK          K  NRQ     +   KE +D+++ ++++   +   SL R +  A      F  A    + QKS+ 
Subjt:  NNVRNWWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKE-VDNIDRLEAEGNLSEELSLRRTKIKADILLYDFKEA--QIWYQKSKR

Query:  LWNTEGDENTSFFHKICSARQRRSIISNINSDDGV---PCTTNENIAKAFLDHFEG----IYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHT
         W  +GD NT FFHK+  A Q +++I  +  DD V     T  + +  A+  H  G    I     V+    I+++     + T A  L +  +++EI  
Subjt:  LWNTEGDENTSFFHKICSARQRRSIISNINSDDGV---PCTTNENIAKAFLDHFEG----IYNGGGVENPWLIENLSWSPISTTQAQNLCSFFTEEEIHT

Query:  ALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLI
        A+ A   NK+PGPD FT EFF  +WFV+KD      ++F     + K  N T I LI K     + + +RP+S  T +YK+I
Subjt:  ALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLI

AT1G45063.1 copper ion binding;electron carriers3.1e-0727.05Show/hide
Query:  TLLYDSVNTTEQLTKRMPNLCSR--------PSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWK----QKTKKNIILF
        TL    ++  E+LT+   ++C R        PS C++C   DE R H+F  CP +  +W+   S+       ++P  +      W     +  K   IL 
Subjt:  TLLYDSVNTTEQLTKRMPNLCSR--------PSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWK----QKTKKNIILF

Query:  NTYASALWNIWLERNARIFNGK
          Y +++++IW ERN R+ + K
Subjt:  NTYASALWNIWLERNARIFNGK

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1)1.1e-0424.3Show/hide
Query:  SRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKK----NIILFNTYASALWNIWLERNARIFNGKEKTVADL
        S PS CV+C  N E R HLF  C ++ ++W       +     L+P +L     +W     +    ++I+   + ++++ +W ERN R+ +   ++   +
Subjt:  SRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKK----NIILFNTYASALWNIWLERNARIFNGKEKTVADL

Query:  WEDIKAL
         ++IK +
Subjt:  WEDIKAL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.7e-0834.57Show/hide
Query:  IAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPIKW
        + ERLK  + +++   Q +F+ GR   D I+   EA+   R KK ++G+ ++KLD+EKA+D++ W +++  L+  G+P  W
Subjt:  IAERLKETLPSMVAENQLAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPIKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.8e-1146.27Show/hide
Query:  IINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE--NIKGVKMNDNL-NLTHLLFADD
        IING P+G + PSRG+RQGDP+SP++F+L  + +S L     E   + G+++++N   + HLLFADD
Subjt:  IINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE--NIKGVKMNDNL-NLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGACGATCTCAGATTTAATGTTACTGATTTCATTGAAGGAGAATATTCTTTATCCATTAACATAAACTTCCCTGATGGGCCTTCGAGTTCGGCCTGGTGGTTGTC
GGCTATATATGGGCCTACGGGTGGTCGAAACAGAAACTCTTTTTGGGCTGAACTGCTGGACCTAAAAAACAAATGCTCCCCCACTTGTAAATACAGTATGAGAAATTTTA
ACAATTTCATCTCTGACAGCAATCTTATTGACCCGCCCCTCTCAAATGCAAAATTTACTTGGTCTAATCTCAGAGTTCAACAAATTCTCTCCAGAATTGACAGATTCCTT
TATACAACCGATTGGGAAAATTTATTTACTGCTCACTATTCAAAACCCTCTCTAGATTGGGGCCCTCTTCCTTTCAAGTTTATAAACGTCCATCTGAAAGAGCCGTGGTT
CAAAAACAACGTTAGAAATTGGTGGAAAAATTTGAGACAGGAGGGACACCCGGGCTTTTCTTTTATGAAAAAGCTAAAGCACTTATCCGTCATTATAAGAGATGAACAAA
AAAAGAATAATCGTCAAAGTGATGAAGAAAAGAGAGCTTGGATCAAAGAAGTCGACAACATAGACAGATTAGAAGCTGAAGGAAACTTATCTGAAGAGCTTAGCCTTCGT
AGGACAAAAATAAAAGCTGATATCCTTCTGTACGACTTCAAAGAAGCACAAATTTGGTACCAAAAAAGCAAGAGACTGTGGAACACTGAGGGAGATGAAAATACCTCTTT
CTTTCACAAAATTTGCTCTGCCAGACAAAGAAGGAGTATTATTTCAAATATTAACTCTGACGATGGTGTTCCTTGTACGACAAATGAGAACATTGCAAAGGCCTTCTTAG
ACCACTTTGAAGGAATCTATAATGGGGGTGGAGTGGAAAACCCCTGGCTTATTGAAAATCTCAGTTGGTCCCCTATATCCACTACCCAAGCTCAAAATTTATGTTCTTTT
TTCACGGAGGAGGAAATTCATACAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCAGATGGCTTTACCATGGAATTCTTCAAAGCAGCTTGGTTTGTTCTCAA
GGATGATATCTTCAATATCTTCAGAGACTTCCACATGAACTGTATCATCAATAAAGCAGTAAATATCACAAATATTGCTCTGATTGCCAAGAAAGAGAAGTGTGCGGAGC
CGGCGGATTACAGACCTATTAGTCTAACGACTTCCATTTACAAACTTATTGCCAAAGTCATTGCGGAAAGACTAAAAGAAACTCTTCCCTCAATGGTGGCAGAGAACCAA
TTGGCTTTTGTAAAGGGAAGACAAATCATTGATGCTATCTTAGTTGCAAATGAAGCCATAGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATTAAGCTGGATAT
TGAAAAAGCCTTTGATAAACTAAATTGGAGATTCATTGATTTTATGCTTATGAAAAAGGGCTACCCCATTAAATGGAGGAGATGGATAAAAGCTTGTATCAGTAGTGTTC
AGTACTCTATTATCATCAACGGCAGACCTAGAGGTAAAATTCAACCTTCCCGTGGTATCCGACAAGGAGACCCTATATCCCCTTTTATATTTGTCCTAGCAATGGACTAT
ATAAGCAGGCTGCTGAACTCTGTGGGCGAAAACATCAAAGGGGTGAAAATGAATGACAACCTAAATCTGACACACTTACTTTTTGCAGATGATATTCTGCTTTTTGTAGA
AGATGATGAGCACTCCCTACAAAATTTAAAGAATATCATCAATCTCTTCCAGCTAGCATCGGGGTTGAATATCAATCTCAATAAGTCCACCATCTCCCCTATAAATATTG
ATGCTGCAAGAACCGATCAGATAGCTTCTCAATGGAGAATTACTACTAAATTTTTTCCAATCAACTACCTTGGAGTCCCTCTCGGAGGCAAACAAACAACAAAGGCTTTT
TGGAAGAACATTGATGAAAAGATAAGCAAAAAACTTGCCAGCTGGAAATATTCCATGTTATCCAAAGGTGGAAAAATTACCTTGATTAAATCTACTTTGGCCAGCCTTCC
TACTTATCAACTATCAATTTTCAAAGCCCCTGTATCAACATGCAAAATCATTGAGAAATCTTGGAGAAATTTCTTCTGGAAGAACCCATCCGAGACCCACAAACTGCACC
TAGTTAGTTGGGCGAAGATTACTTCTCCAAAAGAGAGAGGGGGCCAATCTAAAGGGGACATCCCATGTGTTTGCAATCATAGTAGCAGCCGCTCCCCTTGGTTTTCCATC
TGCAAAGGATTGGCCTGGTTTCAAAGGCATGTTTCCTGGAAAATTAAAAATGGTAGAAACTTCTCTTTTTGGCATAGCCACTGGCATCAAAATAGTCCTCTTTCATTACA
CTACCCCAGATTATTTGCTCTCTCTACAAACCAGGACAGCTCCATAAAAGATATGTGGAACACTACTTTGATGGATTGGGATCTAAAACCAAGAAGACAGCTAAGAGATT
GGGAACATCCTCTGTGGGCTGAGTTAAAAAACTCTCTAAATGCCAGTTTTTGCGAAAATGGCAGAGACTCTCCAACTTGGGTCCTAAACTCTGATGGCTTTTACTCTGTT
GCCTCAAAATGCAATTTTTTCATCTGGACTCTGCTCTATGATAGTGTGAATACCACCGAACAACTTACAAAGAGAATGCCAAATCTCTGTTCTAGACCAAGCTGGTGTGT
GATGTGTAAAAGGAATGACGAAGACAGAATCCACCTCTTTATCCTTTGCCCCATTGCAAAGTCCATCTGGAATTTAATATCATCACACTTAAACAGCAATGTAAACTGTC
TCAGTCCAAAAGTTCTATGTACTACCATGTGCAGCTGGAAACAGAAAACCAAAAAGAATATCATTCTCTTCAATACCTATGCCTCTGCCCTTTGGAACATATGGCTGGAG
AGGAATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCTGACTTATGGGAAGATATAAAAGCTCTTGCAGGACTGTGGACCAGTAGATCTTCACTATTTACAAATTA
TCAAGCTACTTCCATAGCCCTAAACCTTAATGCTTTTAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGACGATCTCAGATTTAATGTTACTGATTTCATTGAAGGAGAATATTCTTTATCCATTAACATAAACTTCCCTGATGGGCCTTCGAGTTCGGCCTGGTGGTTGTC
GGCTATATATGGGCCTACGGGTGGTCGAAACAGAAACTCTTTTTGGGCTGAACTGCTGGACCTAAAAAACAAATGCTCCCCCACTTGTAAATACAGTATGAGAAATTTTA
ACAATTTCATCTCTGACAGCAATCTTATTGACCCGCCCCTCTCAAATGCAAAATTTACTTGGTCTAATCTCAGAGTTCAACAAATTCTCTCCAGAATTGACAGATTCCTT
TATACAACCGATTGGGAAAATTTATTTACTGCTCACTATTCAAAACCCTCTCTAGATTGGGGCCCTCTTCCTTTCAAGTTTATAAACGTCCATCTGAAAGAGCCGTGGTT
CAAAAACAACGTTAGAAATTGGTGGAAAAATTTGAGACAGGAGGGACACCCGGGCTTTTCTTTTATGAAAAAGCTAAAGCACTTATCCGTCATTATAAGAGATGAACAAA
AAAAGAATAATCGTCAAAGTGATGAAGAAAAGAGAGCTTGGATCAAAGAAGTCGACAACATAGACAGATTAGAAGCTGAAGGAAACTTATCTGAAGAGCTTAGCCTTCGT
AGGACAAAAATAAAAGCTGATATCCTTCTGTACGACTTCAAAGAAGCACAAATTTGGTACCAAAAAAGCAAGAGACTGTGGAACACTGAGGGAGATGAAAATACCTCTTT
CTTTCACAAAATTTGCTCTGCCAGACAAAGAAGGAGTATTATTTCAAATATTAACTCTGACGATGGTGTTCCTTGTACGACAAATGAGAACATTGCAAAGGCCTTCTTAG
ACCACTTTGAAGGAATCTATAATGGGGGTGGAGTGGAAAACCCCTGGCTTATTGAAAATCTCAGTTGGTCCCCTATATCCACTACCCAAGCTCAAAATTTATGTTCTTTT
TTCACGGAGGAGGAAATTCATACAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCAGATGGCTTTACCATGGAATTCTTCAAAGCAGCTTGGTTTGTTCTCAA
GGATGATATCTTCAATATCTTCAGAGACTTCCACATGAACTGTATCATCAATAAAGCAGTAAATATCACAAATATTGCTCTGATTGCCAAGAAAGAGAAGTGTGCGGAGC
CGGCGGATTACAGACCTATTAGTCTAACGACTTCCATTTACAAACTTATTGCCAAAGTCATTGCGGAAAGACTAAAAGAAACTCTTCCCTCAATGGTGGCAGAGAACCAA
TTGGCTTTTGTAAAGGGAAGACAAATCATTGATGCTATCTTAGTTGCAAATGAAGCCATAGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATTAAGCTGGATAT
TGAAAAAGCCTTTGATAAACTAAATTGGAGATTCATTGATTTTATGCTTATGAAAAAGGGCTACCCCATTAAATGGAGGAGATGGATAAAAGCTTGTATCAGTAGTGTTC
AGTACTCTATTATCATCAACGGCAGACCTAGAGGTAAAATTCAACCTTCCCGTGGTATCCGACAAGGAGACCCTATATCCCCTTTTATATTTGTCCTAGCAATGGACTAT
ATAAGCAGGCTGCTGAACTCTGTGGGCGAAAACATCAAAGGGGTGAAAATGAATGACAACCTAAATCTGACACACTTACTTTTTGCAGATGATATTCTGCTTTTTGTAGA
AGATGATGAGCACTCCCTACAAAATTTAAAGAATATCATCAATCTCTTCCAGCTAGCATCGGGGTTGAATATCAATCTCAATAAGTCCACCATCTCCCCTATAAATATTG
ATGCTGCAAGAACCGATCAGATAGCTTCTCAATGGAGAATTACTACTAAATTTTTTCCAATCAACTACCTTGGAGTCCCTCTCGGAGGCAAACAAACAACAAAGGCTTTT
TGGAAGAACATTGATGAAAAGATAAGCAAAAAACTTGCCAGCTGGAAATATTCCATGTTATCCAAAGGTGGAAAAATTACCTTGATTAAATCTACTTTGGCCAGCCTTCC
TACTTATCAACTATCAATTTTCAAAGCCCCTGTATCAACATGCAAAATCATTGAGAAATCTTGGAGAAATTTCTTCTGGAAGAACCCATCCGAGACCCACAAACTGCACC
TAGTTAGTTGGGCGAAGATTACTTCTCCAAAAGAGAGAGGGGGCCAATCTAAAGGGGACATCCCATGTGTTTGCAATCATAGTAGCAGCCGCTCCCCTTGGTTTTCCATC
TGCAAAGGATTGGCCTGGTTTCAAAGGCATGTTTCCTGGAAAATTAAAAATGGTAGAAACTTCTCTTTTTGGCATAGCCACTGGCATCAAAATAGTCCTCTTTCATTACA
CTACCCCAGATTATTTGCTCTCTCTACAAACCAGGACAGCTCCATAAAAGATATGTGGAACACTACTTTGATGGATTGGGATCTAAAACCAAGAAGACAGCTAAGAGATT
GGGAACATCCTCTGTGGGCTGAGTTAAAAAACTCTCTAAATGCCAGTTTTTGCGAAAATGGCAGAGACTCTCCAACTTGGGTCCTAAACTCTGATGGCTTTTACTCTGTT
GCCTCAAAATGCAATTTTTTCATCTGGACTCTGCTCTATGATAGTGTGAATACCACCGAACAACTTACAAAGAGAATGCCAAATCTCTGTTCTAGACCAAGCTGGTGTGT
GATGTGTAAAAGGAATGACGAAGACAGAATCCACCTCTTTATCCTTTGCCCCATTGCAAAGTCCATCTGGAATTTAATATCATCACACTTAAACAGCAATGTAAACTGTC
TCAGTCCAAAAGTTCTATGTACTACCATGTGCAGCTGGAAACAGAAAACCAAAAAGAATATCATTCTCTTCAATACCTATGCCTCTGCCCTTTGGAACATATGGCTGGAG
AGGAATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCTGACTTATGGGAAGATATAAAAGCTCTTGCAGGACTGTGGACCAGTAGATCTTCACTATTTACAAATTA
TCAAGCTACTTCCATAGCCCTAAACCTTAATGCTTTTAGTTAG
Protein sequenceShow/hide protein sequence
MWDDLRFNVTDFIEGEYSLSININFPDGPSSSAWWLSAIYGPTGGRNRNSFWAELLDLKNKCSPTCKYSMRNFNNFISDSNLIDPPLSNAKFTWSNLRVQQILSRIDRFL
YTTDWENLFTAHYSKPSLDWGPLPFKFINVHLKEPWFKNNVRNWWKNLRQEGHPGFSFMKKLKHLSVIIRDEQKKNNRQSDEEKRAWIKEVDNIDRLEAEGNLSEELSLR
RTKIKADILLYDFKEAQIWYQKSKRLWNTEGDENTSFFHKICSARQRRSIISNINSDDGVPCTTNENIAKAFLDHFEGIYNGGGVENPWLIENLSWSPISTTQAQNLCSF
FTEEEIHTALTAFSNNKSPGPDGFTMEFFKAAWFVLKDDIFNIFRDFHMNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSMVAENQ
LAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPIKWRRWIKACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY
ISRLLNSVGENIKGVKMNDNLNLTHLLFADDILLFVEDDEHSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARTDQIASQWRITTKFFPINYLGVPLGGKQTTKAF
WKNIDEKISKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSIFKAPVSTCKIIEKSWRNFFWKNPSETHKLHLVSWAKITSPKERGGQSKGDIPCVCNHSSSRSPWFSI
CKGLAWFQRHVSWKIKNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKDMWNTTLMDWDLKPRRQLRDWEHPLWAELKNSLNASFCENGRDSPTWVLNSDGFYSV
ASKCNFFIWTLLYDSVNTTEQLTKRMPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWNLISSHLNSNVNCLSPKVLCTTMCSWKQKTKKNIILFNTYASALWNIWLE
RNARIFNGKEKTVADLWEDIKALAGLWTSRSSLFTNYQATSIALNLNAFS