; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0008060 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0008060
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr07:29059962..29063403
RNA-Seq ExpressionPay0008060
SyntenyPay0008060
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0071.71Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MR FN FI+DSNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WW NLRQ+GHPGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++LM  FKEA IW+QKSKRLW TEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQNLCS F+EEEIH AL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEF+K+TW  LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
        +L+S+GE IKGV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------
        K ITK FWKN++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FKAP S  K+IEK+WRNF WK+  E HKLHL                   
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------

Query:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                                                          VSWKI+NGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK
        STN++SSI++MWN+TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG D+P W LNS+G Y+VASVKK + QP+Q +L  Q+QNT KNLWKT+IPKK
Subjt:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK

Query:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA
                                                  RIHLFILCPIA  IW  ISSHL+SNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASA
Subjt:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0069.92Show/hide
Query:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD
        MWDDLRFNV+DFIEG+FSLSI IN PDGP  SAWWLSAIYGP+GGRN+ SFWAELLDLKNKCSP WLLAGDFNVVRF S+TSAQN   +SMR FN FI+D
Subjt:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD

Query:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH
        SNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +WW NLRQ+GH
Subjt:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH

Query:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS
        PGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++L   FKEA IW+QKSKRLW TEGDENTSFFH+ICS
Subjt:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS

Query:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW
        ARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQ LCS F+EEEIH AL AFS+NKSP     T        
Subjt:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW

Query:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW
                           ++  +N+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVK RQIIDAILVANEAIDYW
Subjt:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK
        R KKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR+L+S+GE IK
Subjt:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK

Query:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN
        GV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGGK  TKAFWKN
Subjt:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN

Query:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-----------------------------
        ++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FKAP S  K+IEK+WRNF WK+  E HKLHL                             
Subjt:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-----------------------------

Query:  --------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKN
                                                                VSWKI+NGR+FSFWH HWHQNSPLS HYPRL+ALSTN++SSI++
Subjt:  --------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKN

Query:  MWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKKY---------
        MWN+TLMDWDL PRRQLR+WE PLWAELKNS+NASFCENG+D+P W LNS+G Y+VASVKKA+ QP+Q +L LQ+QNT KNLWKT+IPKK          
Subjt:  MWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKKY---------

Query:  --------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNA
                                        RIHLFILCPIA  IW  ISSHLNSNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASALWNIWLERNA
Subjt:  --------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNA

Query:  RIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        RIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  RIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0077.16Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MRKFN FISDSNLIDPPL NAKYTWSNL+AQPILSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS ISWGPLPF+LINVHLKEPWFKNN R+
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WWNNLR +GHPGFS MKK KNLS+IIRDEQK+NNR+NDE+KRAWVKEVDNIDR+EAEGNL EELSLRRTK KA+IL+ DFK A IW+QKSKRLWNTEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINSDDG+PCTTNENI K FLDHFEGIYN GGV+N WLIENL+WSPI +SQAQNLCS FSEEEIHSAL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEFFKA W+ LKDDI NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLKETLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GF                         GYPIKWRRWIKACISSVQYSIIING PR                             
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
                                         EDDE SLQNLKNIINLFQLASGLNINLNKSTISPINIDAAR DQIASQW I+TKFFPINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWH
        KP TKAFWKNIDEKI+KKLASWKYSMLSKGGKITLIKSTLASLPTYQLS+FKAP S YKSIEKSWRNFFWK+ +E HKLHLVSWKI+NGRNFSFWHSHWH
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWH

Query:  QNSPLSLHYPRLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQN
        QNSPLSLHYPRLFALSTNQD+SIK+MWN+TLMDWDLKPRRQLRDWE+PLWAELKNSLNASFCENGRD+PTW+LNSDGFYSVASVKKA+ QPDQGIL LQN
Subjt:  QNSPLSLHYPRLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQN

Query:  QNTLKNLWKTNIPKKY-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQ
        QNT KNLWK++IPKK                                          RIHLFILCPIA  IW+LISSHL SNVNCLSPKDLCITMCSWKQ
Subjt:  QNTLKNLWKTNIPKKY-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQ

Query:  KSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        K+KKN ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTS  LLF+NYQATS+ALNL+AFT
Subjt:  KSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

TYK24536.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0070.55Show/hide
Query:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD
        MWDDLRF+V+D IEG+FSLSI IN PDGP  SAWWLSAIYGP+GGRN+ SFWAELLDLKNKCSP WLLAGDFN VRFSS+TS QN   +SMR FN FISD
Subjt:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD

Query:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH
        SNLIDPPL NAK+TWSNLR QP+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI LESSMISWGP PF+ INVHLKEPWFK N   WW NLRQ GH
Subjt:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH

Query:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS
        PGFS MKK K LS IIRDEQK+N  YNDEEK+AW+KE+DNIDR+EAEGN SEELSLRRTK+KA++LM  FKEA IW+QKSKRLW TEGDENTSFFH+ICS
Subjt:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS

Query:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW
        ARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NL+WSPI ++QAQNLCS F+EEEIH+AL AFSNNKSPGPD FTMEF+K+TW
Subjt:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW

Query:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW
          LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPI                         +AENQMAFVKGRQIIDAILVANEAIDYW
Subjt:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK
        RVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP +WR+WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR+L+S+GE IK
Subjt:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK

Query:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN
        GV+M+ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+ AAR +QIASQW ISTKF PINYLGVPLGGK  TK+FWKN
Subjt:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN

Query:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWHQNSPLSLHYP
        ++EKINKKL SWKYSMLSKG                      A         K W  F                                          
Subjt:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWHQNSPLSLHYP

Query:  RLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKT
         +F   +  DSSI++MWN+TLMDWDL PRRQ+RDWEYPLWAELKNSLN SFCENG+D+PTW LNSDGFY+VASVKKA+ QPDQ  L LQ+QNT KNLWKT
Subjt:  RLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKT

Query:  NIPKKYRIHLFILCPIAIFIWSLISSHLNSNVNCLS-PKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWT
        +IPKK             FIW+L+   +N+         +LC     WKQK+KKN IL+NTYASALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWT
Subjt:  NIPKKYRIHLFILCPIAIFIWSLISSHLNSNVNCLS-PKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWT

Query:  SISLLFSNYQATSLALNLHAFT
        S S LF+NYQA+S+ALNL+AF+
Subjt:  SISLLFSNYQATSLALNLHAFT

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]0.0e+0071.62Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MR FN FI+DSNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WW NLRQ+GHPGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++LM  FKEA IW+QKSKRLW TEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQNLCS F+EEEIH AL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEF+K+TW  LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
        +L+S+GE IKGV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------
        K ITK FWKN++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FK P S  K+IEK+WRNF WK+  E HKLHL                   
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------

Query:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                                                          VSWKI+NGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK
        STN++SSI++MWN+TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG D+P W LNS+G Y+VASVKK + QP+Q +L  Q+QNT KNLWKT+IPKK
Subjt:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK

Query:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA
                                                  RIHLFILCPIA  IW  ISSHL+SNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASA
Subjt:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein0.0e+0071.62Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MR FN FI+DSNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WW NLRQ+GHPGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++LM  FKEA IW+QKSKRLW TEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQNLCS F+EEEIH AL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEF+K+TW  LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
        +L+S+GE IKGV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------
        K ITK FWKN++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FK P S  K+IEK+WRNF WK+  E HKLHL                   
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------

Query:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                                                          VSWKI+NGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK
        STN++SSI++MWN+TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG D+P W LNS+G Y+VASVKK + QP+Q +L  Q+QNT KNLWKT+IPKK
Subjt:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK

Query:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA
                                                  RIHLFILCPIA  IW  ISSHL+SNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASA
Subjt:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein0.0e+0069.92Show/hide
Query:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD
        MWDDLRFNV+DFIEG+FSLSI IN PDGP  SAWWLSAIYGP+GGRN+ SFWAELLDLKNKCSP WLLAGDFNVVRF S+TSAQN   +SMR FN FI+D
Subjt:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD

Query:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH
        SNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +WW NLRQ+GH
Subjt:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH

Query:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS
        PGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++L   FKEA IW+QKSKRLW TEGDENTSFFH+ICS
Subjt:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS

Query:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW
        ARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQ LCS F+EEEIH AL AFS+NKSP     T        
Subjt:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW

Query:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW
                           ++  +N+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVK RQIIDAILVANEAIDYW
Subjt:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK
        R KKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR+L+S+GE IK
Subjt:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK

Query:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN
        GV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGGK  TKAFWKN
Subjt:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN

Query:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-----------------------------
        ++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FKAP S  K+IEK+WRNF WK+  E HKLHL                             
Subjt:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-----------------------------

Query:  --------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKN
                                                                VSWKI+NGR+FSFWH HWHQNSPLS HYPRL+ALSTN++SSI++
Subjt:  --------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKN

Query:  MWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKKY---------
        MWN+TLMDWDL PRRQLR+WE PLWAELKNS+NASFCENG+D+P W LNS+G Y+VASVKKA+ QP+Q +L LQ+QNT KNLWKT+IPKK          
Subjt:  MWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKKY---------

Query:  --------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNA
                                        RIHLFILCPIA  IW  ISSHLNSNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASALWNIWLERNA
Subjt:  --------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNA

Query:  RIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        RIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  RIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein0.0e+0077.16Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MRKFN FISDSNLIDPPL NAKYTWSNL+AQPILSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS ISWGPLPF+LINVHLKEPWFKNN R+
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WWNNLR +GHPGFS MKK KNLS+IIRDEQK+NNR+NDE+KRAWVKEVDNIDR+EAEGNL EELSLRRTK KA+IL+ DFK A IW+QKSKRLWNTEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINSDDG+PCTTNENI K FLDHFEGIYN GGV+N WLIENL+WSPI +SQAQNLCS FSEEEIHSAL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEFFKA W+ LKDDI NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLKETLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GF                         GYPIKWRRWIKACISSVQYSIIING PR                             
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
                                         EDDE SLQNLKNIINLFQLASGLNINLNKSTISPINIDAAR DQIASQW I+TKFFPINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWH
        KP TKAFWKNIDEKI+KKLASWKYSMLSKGGKITLIKSTLASLPTYQLS+FKAP S YKSIEKSWRNFFWK+ +E HKLHLVSWKI+NGRNFSFWHSHWH
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWH

Query:  QNSPLSLHYPRLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQN
        QNSPLSLHYPRLFALSTNQD+SIK+MWN+TLMDWDLKPRRQLRDWE+PLWAELKNSLNASFCENGRD+PTW+LNSDGFYSVASVKKA+ QPDQGIL LQN
Subjt:  QNSPLSLHYPRLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQN

Query:  QNTLKNLWKTNIPKKY-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQ
        QNT KNLWK++IPKK                                          RIHLFILCPIA  IW+LISSHL SNVNCLSPKDLCITMCSWKQ
Subjt:  QNTLKNLWKTNIPKKY-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQ

Query:  KSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        K+KKN ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTS  LLF+NYQATS+ALNL+AFT
Subjt:  KSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein0.0e+0070.55Show/hide
Query:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD
        MWDDLRF+V+D IEG+FSLSI IN PDGP  SAWWLSAIYGP+GGRN+ SFWAELLDLKNKCSP WLLAGDFN VRFSS+TS QN   +SMR FN FISD
Subjt:  MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISD

Query:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH
        SNLIDPPL NAK+TWSNLR QP+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI LESSMISWGP PF+ INVHLKEPWFK N   WW NLRQ GH
Subjt:  SNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGH

Query:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS
        PGFS MKK K LS IIRDEQK+N  YNDEEK+AW+KE+DNIDR+EAEGN SEELSLRRTK+KA++LM  FKEA IW+QKSKRLW TEGDENTSFFH+ICS
Subjt:  PGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICS

Query:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW
        ARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NL+WSPI ++QAQNLCS F+EEEIH+AL AFSNNKSPGPD FTMEF+K+TW
Subjt:  ARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATW

Query:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW
          LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPI                         +AENQMAFVKGRQIIDAILVANEAIDYW
Subjt:  YDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW

Query:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK
        RVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP +WR+WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR+L+S+GE IK
Subjt:  RVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIK

Query:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN
        GV+M+ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+ AAR +QIASQW ISTKF PINYLGVPLGGK  TK+FWKN
Subjt:  GVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKN

Query:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWHQNSPLSLHYP
        ++EKINKKL SWKYSMLSKG                      A         K W  F                                          
Subjt:  IDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNGRNFSFWHSHWHQNSPLSLHYP

Query:  RLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKT
         +F   +  DSSI++MWN+TLMDWDL PRRQ+RDWEYPLWAELKNSLN SFCENG+D+PTW LNSDGFY+VASVKKA+ QPDQ  L LQ+QNT KNLWKT
Subjt:  RLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKT

Query:  NIPKKYRIHLFILCPIAIFIWSLISSHLNSNVNCLS-PKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWT
        +IPKK             FIW+L+   +N+         +LC     WKQK+KKN IL+NTYASALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWT
Subjt:  NIPKKYRIHLFILCPIAIFIWSLISSHLNSNVNCLS-PKDLCITMCSWKQKSKKN-ILFNTYASALWNIWLERNARIFNGKEKTVADLWEDIKALAGLWT

Query:  SISLLFSNYQATSLALNLHAFT
        S S LF+NYQA+S+ALNL+AF+
Subjt:  SISLLFSNYQATSLALNLHAFT

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein0.0e+0071.71Show/hide
Query:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS
        MR FN FI+DSNLIDPPL NAK+TWSNLR  P+LSRIDRFLYT  WENLFTAHYSKTLSRVTSDHFPI+LESS+ISWGP PF+LINVHLKEPWFKNN  +
Subjt:  MRKFNNFISDSNLIDPPLLNAKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRS

Query:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE
        WW NLRQ+GHPGFS M+K K LS IIR+EQ++N  Y+DE+K AW+KE+D+IDR+EAEGNLSEELSLRRT++KA++LM  FKEA IW+QKSKRLW TEGDE
Subjt:  WWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDE

Query:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD
        NTSFFH+ICSARQR SIISNINS DG+PC+TNE+I KAFLDHFE IY  GG ++ WLI+NLNWSPI ++QAQNLCS F+EEEIH AL AFSNNKSPGPD 
Subjt:  NTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDD

Query:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI
        FTMEF+K+TW  LK++I NIF+DF+ NCIINKAVN+TNI+LIAKKEKC  PADYRPISLTTSIYK+IAKVIAERLK+TLP TVAENQMAFVKGRQIIDAI
Subjt:  FTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAI

Query:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
        LVANEAIDYWRVKKI+GFVIKLDIEKAFDKLNWRFIDF+LMKKGYP KWR WI+ACISSVQYSIIING PRGKIQPSRGIRQGDPISPFIFVLAMDY+SR
Subjt:  LVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG
        +L+S+GE IKGV+++ N+NLTHLLFADDILLFVEDDE S+QNLKNIINLFQLASGL+INLNKSTISPIN+DA+R +QIASQW ISTKF PINYLGVPLGG
Subjt:  MLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLGG

Query:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------
        K ITK FWKN++EKINKKLASWKYSMLSKGGKITLIKS+LASLPTYQLS+FKAP S  K+IEK+WRNF WK+  E HKLHL                   
Subjt:  KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLHL-------------------

Query:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL
                                                                          VSWKI+NGR+FSFWHSHWHQNSPLS HYPRL+AL
Subjt:  ------------------------------------------------------------------VSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFAL

Query:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK
        STN++SSI++MWN+TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG D+P W LNS+G Y+VASVKK + QP+Q +L  Q+QNT KNLWKT+IPKK
Subjt:  STNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQPDQGILTLQNQNTLKNLWKTNIPKK

Query:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA
                                                  RIHLFILCPIA  IW  ISSHL+SNVNCLSPK+LCITMCSWKQK+KKN ILFNTYASA
Subjt:  Y-----------------------------------------RIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKN-ILFNTYASA

Query:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT
        LWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTS S LFSNYQA+S+ALNL+AF+
Subjt:  LWNIWLERNARIFNGKEKTVADLWEDIKALAGLWTSISLLFSNYQATSLALNLHAFT

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.5e-4023.08Show/hide
Query:  IYGPAGGRNKNSFWAELL-DLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLID-PPLLNAKYTWSNLRAQP--ILSRIDRFLYTV
        IY P  G  +  F  ++L DL+       L+ GDFN        S +   +   ++ N+ +  ++LID    L+ K T     + P    S+ID   + V
Subjt:  IYGPAGGRNKNSFWAELL-DLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLID-PPLLNAKYTWSNLRAQP--ILSRIDRFLYTV

Query:  GWENLFT-AHYSKTLSRVTSDHFPIILE---SSMISWGPLPFRLINVHLKEPWFKNNFRS----WWNNLRQDGHPGFSIMKKFKNLS----IIIRDEQKR
        G + L +    ++ ++   SDH  I LE    ++       ++L N+ L + W  N  ++    ++           ++   FK +     I +   +++
Subjt:  GWENLFT-AHYSKTLSRVTSDHFPIILE---SSMISWGPLPFRLINVHLKEPWFKNNFRS----WWNNLRQDGHPGFSIMKKFKNLS----IIIRDEQKR

Query:  NNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIPCTTN
          R   +   + +KE++  ++  ++ +  +E+    TKI+AE+   + ++      +S+  +    ++      R+   ++  + I  I +D G   T  
Subjt:  NNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIPCTTN

Query:  ENIVKAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINC
          I     ++++ +Y    ++NL      ++      +   + ++L    +  EI + + +    KSPGPD FT EF++    +L   +  +F+      
Subjt:  ENIVKAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINC

Query:  IINKAVNVTNISLIAKKEKCVVPAD-YRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIKGFVIKLDIEK
        I+  +    +I LI K  +     + +RPISL     KI+ K++A R+++ +   +  +Q+ F+ G Q    I  +   I +  R K     +I +D EK
Subjt:  IINKAVNVTNISLIAKKEKCVVPAD-YRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIKGFVIKLDIEK

Query:  AFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRM-KDNLNLTHLLF
        AFDK+   F+   L K G    + + I+A       +II+NG          G RQG P+SP +F + ++ ++R +    E IKG+++ K+ + L+  LF
Subjt:  AFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRM-KDNLNLTHLLF

Query:  ADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPL--GGKPITKAFWKNIDEKINKKLASWK
        ADD+++++E+   S QNL  +I+ F   SG  IN+ KS     N +     QI  +   +     I YLG+ L    K + K  +K + ++I +    WK
Subjt:  ADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPL--GGKPITKAFWKNIDEKINKKLASWK

Query:  YSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNG----RNFSFWHS--------HWHQN
            S  G+I ++K  +     Y+ +    K P + +  +EK+   F W  +       ++S K + G     +F  ++         +W+QN
Subjt:  YSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNG----RNFSFWHS--------HWHQN

P08548 LINE-1 reverse transcriptase homolog1.1e-3824.35Show/hide
Query:  IYGPAGGRNKNSFWAE-LLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLIDPPLL----NAKYTWSNLRAQPILSRIDRFLYT
        IY P    N   F  E L D+ N  S   ++ GDFN        S++      +   N+ I   +L D          +YT+ +  A    S+ID  L  
Subjt:  IYGPAGGRNKNSFWAE-LLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLIDPPLL----NAKYTWSNLRAQPILSRIDRFLYT

Query:  VGWENLFTAHYSKTLSRVTSDHFPIILE---SSMISWGPLPFRLINVHLKEPWFKNNFRSWWNN-LRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDE
            NL      + +  + SDH  I +E   +  +      ++L N+ LK+ W  +  +      L Q+ +   +    +     ++R +      +  +
Subjt:  VGWENLFTAHYSKTLSRVTSDHFPIILE---SSMISWGPLPFRLINVHLKEPWFKNNFRSWWNN-LRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDE

Query:  EKRAWVKE-VDNIDRVEAEGNLSEELSLRR--TKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIPCTTNENIV
         +R  V   + ++ ++E E + + + S R+  TKI+AE+   + K       KSK  +  + ++       +   ++  S+IS+I + +    T    I 
Subjt:  EKRAWVKE-VDNIDRVEAEGNLSEELSLRR--TKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIPCTTNENIV

Query:  KAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINK
        K   ++++ +Y+    +NL      +E  +   +   + + L    S  EI S +      KSPGPD FT EF++    +L   + N+F++     I+  
Subjt:  KAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINK

Query:  AVNVTNISLIAKKEKCVV-PADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKG---FVIKLDIEKAF
             NI+LI K  K      +YRPISL     KI+ K++  R+++ +   +  +Q+ F+ G Q    I  +   I +  + K+K     ++ +D EKAF
Subjt:  AVNVTNISLIAKKEKCVV-PADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKG---FVIKLDIEKAF

Query:  DKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRM-KDNLNLTHLLFAD
        D +   F+   L K G    + + I+A  S    +II+NG          G RQG P+SP +F + M+ ++  +    + IKG+ +  + + L+  LFAD
Subjt:  DKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRM-KDNLNLTHLLFAD

Query:  DILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKST--ISPINIDAARIDQIASQWRISTKFFPINYLGVPL--GGKPITKAFWKNIDEKINKKLASWK
        D+++++E+   S   L  +I  +   SG  IN +KS   I   N  A +  + +  + +  K   + YLGV L    K + K  ++ + ++I + +  WK
Subjt:  DILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKST--ISPINIDAARIDQIASQWRISTKFFPINYLGVPL--GGKPITKAFWKNIDEKINKKLASWK

Query:  YSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNG
            S  G+I ++K ++     Y  +    KAP S +K +EK   +F W  +       L+S K + G
Subjt:  YSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQTEAHKLHLVSWKIRNG

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-3726.99Show/hide
Query:  RICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFT
        R+    +   +I+ I ++ G   T  E I       ++ +Y+   ++NL      ++      +   Q  +L S  S +EI + + +    KSPGPD F+
Subjt:  RICSARQRMSIISNINSDDGIPCTTNENIVKAFLDHFEGIYNEGGVDNL----WLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFT

Query:  MEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPAD-YRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAIL
         EF++    DL   +  +F    +   +  +     I+LI K +K     + +RPISL     KI+ K++A R++E +   +  +Q+ F+ G Q    I 
Subjt:  MEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPAD-YRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAIL

Query:  VANEAIDYWRVKKIKG-FVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR
         +   I Y    K K   +I LD EKAFDK+   F+  VL + G    +   IKA  S    +I +NG     I    G RQG P+SP++F + ++ ++R
Subjt:  VANEAIDYWRVKKIKG-FVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSR

Query:  MLDSMGENIKGVRM-KDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLG
         +    E IKG+++ K+ + ++  L ADD+++++ D + S + L N+IN F    G  IN NKS       +     +I      S     I YLGV L 
Subjt:  MLDSMGENIKGVRM-KDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWRISTKFFPINYLGVPLG

Query:  G--KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQ
           K +    +K++ ++I + L  WK    S  G+I ++K  +     Y+ +    K P   +  +E +   F W ++
Subjt:  G--KPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSV--FKAPASIYKSIEKSWRNFFWKSQ

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-3422.7Show/hide
Query:  LSAIYGPAGGRNKNSFWAELLDLKN--KCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLID----PPLLNAKYTWSNLRAQPI-LSRID
        L  +Y P  G  +  F+  L             ++ GDFN    +   +    R  S       I+  +L+D           +T+  +R   +  SRID
Subjt:  LSAIYGPAGGRNKNSFWAELLDLKN--KCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLID----PPLLNAKYTWSNLRAQPI-LSRID

Query:  RFLYTVGWENLFTAHYSKTLSRVT-SDHFPIILESSMISWGPLP--FRLINVHLKEPWFKNNFRSWWNNLR--QDGHPGFSIMKKF-----KNLSIIIRD
        R   +    +L +   S T+     SDH  + L  S+    P    +   N  L++  F  + R  W   R  QD    F+ + ++      +L ++ ++
Subjt:  RFLYTVGWENLFTAHYSKTLSRVT-SDHFPIILESSMISWGPLP--FRLINVHLKEPWFKNNFRSWWNNLR--QDGHPGFSIMKKF-----KNLSIIIRD

Query:  EQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIP
          K  +   + E  A   EV ++++    G+  + L     + K  +   + ++A     +S+     + D  + FF+ +   +     I+ + ++DG P
Subjt:  EQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIP

Query:  CTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSS-QAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYIN
            E I       ++ +++   +      E  +  P+ S  + + L +  + +E+  AL    +NKSPG D  T+EFF+  W  L  D   +  + +  
Subjt:  CTTNENIVKAFLDHFEGIYNEGGVDNLWLIENLNWSPIPSS-QAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYIN

Query:  CIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKGFVIKLDIEKA
          +  +     +SL+ KK    +  ++RP+SL ++ YKI+AK I+ RLK  L   +  +Q   V GR I D + +  + + + R   +    + LD EKA
Subjt:  CIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKGFVIKLDIEKA

Query:  FDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRMKD-NLNLTHLLFA
        FD+++ +++   L    +  ++  ++K   +S +  + IN +    +   RG+RQG P+S  ++ LA   +   L  + + + G+ +K+ ++ +    +A
Subjt:  FDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRMKD-NLNLTHLLFA

Query:  DDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWR-ISTKFFPINYLGVPLGGK--PITKAFWKNIDEKINKKLASWK
        DD++L V  D   L+  +    ++  AS   IN +KS  S +   + ++D +   +R IS +   I YLGV L  +  P+++ F + ++E +  +L  WK
Subjt:  DDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINIDAARIDQIASQWR-ISTKFFPINYLGVPLGGK--PITKAFWKNIDEKINKKLASWK

Query:  --YSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFW
            +LS  G+  +I   +AS   Y+L            I++   +F W
Subjt:  --YSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFW

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.3e-1526.01Show/hide
Query:  QAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAK
        + +NL    S +EI    V      + GPD  T       W  + + I ++F     +    +    +   LI K+   + PA +RP+S+ +   +   +
Subjt:  QAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRPISLTTSIYKIIAK

Query:  VIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKG-FVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIING
        ++A R+ E   L     Q AF+    + +   + +  I   R+ KIKG ++  LD++KAFD +  R I   L +K  P++ R +I     + +  + +  
Subjt:  VIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKG-FVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSVQYSIIING

Query:  TPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNI----------------INLFQL
        T    I+P+RG+RQGDP+SP +F   MD V R L      + G        +  L+FADD++L  E  E    +L  I                  L  +
Subjt:  TPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNI----------------INLFQL

Query:  ASGLNINLNKSTISPINIDAARIDQI--ASQWRISTKFFPINYLGV
         SG    +   T  P  +    I Q+  A QW+         YLGV
Subjt:  ASGLNINLNKSTISPINIDAARIDQI--ASQWRISTKFFPINYLGV

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein4.8e-0523.64Show/hide
Query:  KNSFWAELLDLKNK---CSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFIS----------DSNLIDPPLLNAKYTWSN-LRAQPILSRIDRFLYT
        + S W ++  L      C+  WL+ GDFN +       A    HYS+   N  +           DS+L+D P     YTWSN  +  PIL ++DR +  
Subjt:  KNSFWAELLDLKNK---CSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFIS----------DSNLIDPPLLNAKYTWSN-LRAQPILSRIDRFLYT

Query:  VGWENLFTAHYSKTLSRVTSDHFP--IILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEK
          W   F    +       SDH    +IL +S        F+  +     P F ++  + W      G   FS+ +  K      R   +R   +++ + 
Subjt:  VGWENLFTAHYSKTLSRVTSDHFP--IILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGHPGFSIMKKFKNLSIIIRDEQKRNNRYNDEEK

Query:  RAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGD
        +      D + R       +E ++ +     A  L   +K      QKS+  W  EGD
Subjt:  RAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGD

AT1G43760.1 DNAse I-like superfamily protein2.5e-3026.52Show/hide
Query:  KCSPIWLLAGDFNVVRFSS---KTSAQNLRHYSMRKFNNFISDSNLIDPPLLNAKYTWSNLR-AQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHF
        K   + +L GDF+ +  +S        ++    + +F N + DS+L+D P     YTWSN +   PI+ ++DR +    W + F +  +       SDH 
Subjt:  KCSPIWLLAGDFNVVRFSS---KTSAQNLRHYSMRKFNNFISDSNLIDPPLLNAKYTWSNLR-AQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHF

Query:  P-IILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGHPGFSI---MKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSE
        P II+  ++       FR  +     P F  +    W      G   FS+   +K  K    ++  +   N ++  +E       +D+++ ++++   + 
Subjt:  P-IILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGHPGFSI---MKKFKNLSIIIRDEQKRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSE

Query:  ELSLRRTKIKAEILMYDFKEA--HIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGI---PCTTNENIVKAFLDHFEGIYNE-GGVDNLW
          SL R +  A      F  A    + QKS+  W  +GD NT FFH++  A Q  ++I  +  DD +     T  + ++ A+  H  G  ++    D++ 
Subjt:  ELSLRRTKIKAEILMYDFKEA--HIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGI---PCTTNENIVKAFLDHFEGIYNE-GGVDNLW

Query:  LIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRP
         I++++      + A  L +  S++EI +A+ A   NK+PGPD FT EFF  +W+ +KD      K+F+    + K  N T I+LI K       + +RP
Subjt:  LIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVVPADYRP

Query:  ISLTTSIYKII
        +S  T +YKII
Subjt:  ISLTTSIYKII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-0921.51Show/hide
Query:  PINYLGVPLGGKPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFW-------------------
        P+ YLG+PL  K +T + +  + EKI  ++  W    LS  G++ LI S + SL  + +S F+ P++  K I+    +F W                   
Subjt:  PINYLGVPLGGKPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFW-------------------

Query:  ---------KSQTEAHK---------LHLVSW------------------KIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQ---DSSIKNMWNSTL
                 +S  EA+K           L SW                   I NG N SFW  +W +         RL  ++ ++   D  I    +   
Subjt:  ---------KSQTEAHK---------LHLVSW------------------KIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQ---DSSIKNMWNSTL

Query:  MDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKK---AIHQPDQGILTLQNQNTLKNLWKTNIPKKYRIHLFILCPIAI
           + +PRR   D    +  ++   +      +G DT  W  N D F    + K+   A  +P   +      N  K +W ++   KY          ++
Subjt:  MDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKK---AIHQPDQGILTLQNQNTLKNLWKTNIPKKYRIHLFILCPIAI

Query:  FIWSLISSHLNSNVNCLS---PKDLCITMCSWKQKSKKNILFN-------------TYASALWNIWLERNAR
          W  I + L +    LS     D    +C    +++ ++ F              T+   L ++W ERN R
Subjt:  FIWSLISSHLNSNVNCLS---PKDLCITMCSWKQKSKKNILFN-------------TYASALWNIWLERNAR

AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.9e-0835.8Show/hide
Query:  IAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IKGF-VIKLDIEKAFDKLNWRFIDFVLMKKGYPIKW
        + ERLK  +   +   Q +F+ GR   D I+   EA+   R KK +KG+ ++KLD+EKA+D++ W +++  L+  G+P  W
Subjt:  IAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IKGF-VIKLDIEKAFDKLNWRFIDFVLMKKGYPIKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.9e-1146.27Show/hide
Query:  IINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGE--NIKGVRMKDNL-NLTHLLFADD
        IING P+G + PSRG+RQGDP+SP++F+L  + +S +     E   + G+R+ +N   + HLLFADD
Subjt:  IINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGE--NIKGVRMKDNL-NLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGACGATCTCAGATTCAATGTTTCTGACTTCATTGAAGGTTCCTTCTCTTTATCCATAAAAATAAACATCCCTGATGGGCCTCTGTGTTCAGCCTGGTGGTTGTC
GGCTATATACGGGCCTGCTGGTGGCAGAAACAAAAATTCCTTTTGGGCGGAACTTCTGGACCTAAAAAATAAATGTTCTCCTATCTGGCTTTTAGCGGGAGATTTTAATG
TAGTGAGATTCTCCTCTAAGACATCTGCCCAAAATCTCAGACACTACAGTATGAGGAAATTTAACAACTTCATCTCTGACAGCAATCTTATTGACCCACCCCTCTTAAAT
GCAAAATATACATGGTCAAACCTCAGAGCTCAACCGATTCTCTCCAGAATTGACAGATTCTTATATACAGTGGGTTGGGAAAACTTATTCACTGCCCACTATTCAAAAAC
ACTCTCCCGAGTTACATCAGACCACTTCCCGATCATCCTTGAATCATCCATGATTAGTTGGGGCCCTCTACCTTTCAGGCTTATAAATGTACATTTAAAAGAGCCGTGGT
TTAAAAACAACTTTAGAAGTTGGTGGAATAATTTGAGACAGGATGGACATCCAGGCTTCTCTATCATGAAGAAGTTCAAAAATTTATCCATCATTATAAGGGATGAACAA
AAAAGGAATAACCGTTACAATGATGAAGAAAAGAGAGCTTGGGTCAAAGAAGTTGATAACATTGACAGAGTAGAAGCTGAAGGAAATTTATCTGAAGAGCTTAGCCTTCG
CAGAACAAAAATAAAAGCTGAGATCCTTATGTATGACTTCAAAGAAGCACACATATGGCACCAAAAAAGCAAGAGACTTTGGAACACTGAAGGAGATGAAAACACTTCAT
TCTTTCACAGAATTTGCTCTGCTAGACAACGAATGAGTATCATTTCAAATATTAATTCAGATGATGGAATCCCTTGTACTACAAATGAAAACATTGTCAAAGCCTTCTTA
GACCATTTTGAAGGAATCTACAATGAAGGTGGTGTGGACAACCTCTGGCTTATTGAAAATCTCAATTGGTCCCCTATACCCTCATCCCAAGCTCAAAACTTATGCTCATT
TTTCTCGGAGGAGGAAATACATTCAGCCCTTGTTGCTTTCTCAAACAATAAAAGCCCGGGCCCAGACGACTTTACTATGGAATTCTTCAAAGCAACTTGGTATGACCTCA
AGGATGATATTTGTAATATATTCAAAGACTTCTACATCAACTGTATCATCAATAAAGCAGTGAATGTAACAAATATTTCCCTGATTGCTAAGAAAGAAAAGTGTGTGGTG
CCAGCGGATTACAGACCTATTAGTCTAACGACTTCCATTTACAAGATTATTGCCAAAGTCATTGCTGAAAGACTTAAAGAAACTCTTCCTTTAACGGTGGCAGAGAATCA
AATGGCCTTTGTAAAAGGAAGACAAATCATTGATGCTATCTTAGTTGCAAATGAAGCTATTGACTATTGGAGAGTAAAGAAAATTAAAGGCTTTGTTATTAAGCTGGACA
TTGAAAAGGCCTTTGATAAGCTAAATTGGAGATTCATAGACTTTGTGCTTATGAAAAAGGGCTATCCCATTAAATGGAGGAGATGGATAAAAGCTTGTATCAGTAGTGTT
CAGTATTCTATTATCATCAATGGCACACCAAGAGGCAAAATTCAACCATCCCGTGGTATTCGTCAAGGAGACCCAATCTCCCCTTTTATTTTTGTCCTAGCAATGGACTA
TGTAAGCAGGATGCTGGATTCAATGGGTGAGAACATCAAAGGGGTGAGAATGAAAGACAACCTTAACCTCACTCACTTACTTTTTGCAGATGATATTCTGCTTTTTGTAG
AAGATGATGAGCCCTCCCTACAAAATTTAAAAAATATCATCAATCTTTTCCAGCTAGCTTCGGGTTTGAATATCAATCTGAACAAGTCCACCATCTCCCCTATAAATATT
GATGCTGCCAGAATTGATCAGATAGCTTCACAATGGAGAATTTCTACAAAATTTTTCCCAATCAATTATCTTGGAGTGCCACTCGGAGGTAAACCAATAACAAAGGCTTT
CTGGAAGAACATTGATGAAAAAATCAACAAAAAACTCGCCAGCTGGAAATACTCTATGTTATCCAAAGGTGGGAAAATCACCTTGATTAAATCTACTTTGGCCAGCCTTC
CTACTTATCAATTATCAGTGTTCAAAGCCCCTGCATCAATCTACAAAAGCATTGAGAAATCTTGGAGGAACTTCTTCTGGAAGAGCCAAACCGAGGCCCACAAACTACAT
TTGGTTTCCTGGAAAATTAGAAATGGAAGAAACTTCTCCTTTTGGCACAGCCACTGGCATCAAAATAGTCCTCTTTCGTTACACTACCCTAGATTATTTGCTCTCTCTAC
AAACCAAGACAGCTCCATAAAAAACATGTGGAACTCAACTTTGATGGATTGGGATCTAAAACCAAGAAGACAGTTGAGGGATTGGGAATATCCTCTGTGGGCCGAGCTAA
AAAACTCTCTAAATGCCAGTTTTTGCGAAAATGGCAGAGACACACCTACTTGGGTCCTCAACTCTGATGGCTTTTACTCTGTTGCTTCGGTAAAGAAAGCTATTCACCAA
CCTGATCAAGGCATTTTAACTCTCCAAAATCAAAACACTTTAAAAAATCTTTGGAAAACCAACATTCCAAAGAAATACAGAATTCATCTCTTTATCTTATGCCCTATAGC
AATTTTCATTTGGAGCTTAATATCATCCCACTTAAACAGCAATGTCAACTGTCTCAGTCCCAAGGATCTCTGTATTACTATGTGCAGCTGGAAACAGAAAAGCAAAAAGA
ACATCCTCTTCAACACCTATGCCTCTGCCCTTTGGAACATATGGTTGGAAAGAAATGCCCGCATTTTCAATGGCAAAGAAAAAACAGTTGCGGACTTATGGGAAGATATA
AAAGCTCTTGCAGGACTATGGACTAGCATATCTTTATTGTTTTCAAATTATCAAGCTACATCCTTAGCTCTAAACCTTCATGCGTTTACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGACGATCTCAGATTCAATGTTTCTGACTTCATTGAAGGTTCCTTCTCTTTATCCATAAAAATAAACATCCCTGATGGGCCTCTGTGTTCAGCCTGGTGGTTGTC
GGCTATATACGGGCCTGCTGGTGGCAGAAACAAAAATTCCTTTTGGGCGGAACTTCTGGACCTAAAAAATAAATGTTCTCCTATCTGGCTTTTAGCGGGAGATTTTAATG
TAGTGAGATTCTCCTCTAAGACATCTGCCCAAAATCTCAGACACTACAGTATGAGGAAATTTAACAACTTCATCTCTGACAGCAATCTTATTGACCCACCCCTCTTAAAT
GCAAAATATACATGGTCAAACCTCAGAGCTCAACCGATTCTCTCCAGAATTGACAGATTCTTATATACAGTGGGTTGGGAAAACTTATTCACTGCCCACTATTCAAAAAC
ACTCTCCCGAGTTACATCAGACCACTTCCCGATCATCCTTGAATCATCCATGATTAGTTGGGGCCCTCTACCTTTCAGGCTTATAAATGTACATTTAAAAGAGCCGTGGT
TTAAAAACAACTTTAGAAGTTGGTGGAATAATTTGAGACAGGATGGACATCCAGGCTTCTCTATCATGAAGAAGTTCAAAAATTTATCCATCATTATAAGGGATGAACAA
AAAAGGAATAACCGTTACAATGATGAAGAAAAGAGAGCTTGGGTCAAAGAAGTTGATAACATTGACAGAGTAGAAGCTGAAGGAAATTTATCTGAAGAGCTTAGCCTTCG
CAGAACAAAAATAAAAGCTGAGATCCTTATGTATGACTTCAAAGAAGCACACATATGGCACCAAAAAAGCAAGAGACTTTGGAACACTGAAGGAGATGAAAACACTTCAT
TCTTTCACAGAATTTGCTCTGCTAGACAACGAATGAGTATCATTTCAAATATTAATTCAGATGATGGAATCCCTTGTACTACAAATGAAAACATTGTCAAAGCCTTCTTA
GACCATTTTGAAGGAATCTACAATGAAGGTGGTGTGGACAACCTCTGGCTTATTGAAAATCTCAATTGGTCCCCTATACCCTCATCCCAAGCTCAAAACTTATGCTCATT
TTTCTCGGAGGAGGAAATACATTCAGCCCTTGTTGCTTTCTCAAACAATAAAAGCCCGGGCCCAGACGACTTTACTATGGAATTCTTCAAAGCAACTTGGTATGACCTCA
AGGATGATATTTGTAATATATTCAAAGACTTCTACATCAACTGTATCATCAATAAAGCAGTGAATGTAACAAATATTTCCCTGATTGCTAAGAAAGAAAAGTGTGTGGTG
CCAGCGGATTACAGACCTATTAGTCTAACGACTTCCATTTACAAGATTATTGCCAAAGTCATTGCTGAAAGACTTAAAGAAACTCTTCCTTTAACGGTGGCAGAGAATCA
AATGGCCTTTGTAAAAGGAAGACAAATCATTGATGCTATCTTAGTTGCAAATGAAGCTATTGACTATTGGAGAGTAAAGAAAATTAAAGGCTTTGTTATTAAGCTGGACA
TTGAAAAGGCCTTTGATAAGCTAAATTGGAGATTCATAGACTTTGTGCTTATGAAAAAGGGCTATCCCATTAAATGGAGGAGATGGATAAAAGCTTGTATCAGTAGTGTT
CAGTATTCTATTATCATCAATGGCACACCAAGAGGCAAAATTCAACCATCCCGTGGTATTCGTCAAGGAGACCCAATCTCCCCTTTTATTTTTGTCCTAGCAATGGACTA
TGTAAGCAGGATGCTGGATTCAATGGGTGAGAACATCAAAGGGGTGAGAATGAAAGACAACCTTAACCTCACTCACTTACTTTTTGCAGATGATATTCTGCTTTTTGTAG
AAGATGATGAGCCCTCCCTACAAAATTTAAAAAATATCATCAATCTTTTCCAGCTAGCTTCGGGTTTGAATATCAATCTGAACAAGTCCACCATCTCCCCTATAAATATT
GATGCTGCCAGAATTGATCAGATAGCTTCACAATGGAGAATTTCTACAAAATTTTTCCCAATCAATTATCTTGGAGTGCCACTCGGAGGTAAACCAATAACAAAGGCTTT
CTGGAAGAACATTGATGAAAAAATCAACAAAAAACTCGCCAGCTGGAAATACTCTATGTTATCCAAAGGTGGGAAAATCACCTTGATTAAATCTACTTTGGCCAGCCTTC
CTACTTATCAATTATCAGTGTTCAAAGCCCCTGCATCAATCTACAAAAGCATTGAGAAATCTTGGAGGAACTTCTTCTGGAAGAGCCAAACCGAGGCCCACAAACTACAT
TTGGTTTCCTGGAAAATTAGAAATGGAAGAAACTTCTCCTTTTGGCACAGCCACTGGCATCAAAATAGTCCTCTTTCGTTACACTACCCTAGATTATTTGCTCTCTCTAC
AAACCAAGACAGCTCCATAAAAAACATGTGGAACTCAACTTTGATGGATTGGGATCTAAAACCAAGAAGACAGTTGAGGGATTGGGAATATCCTCTGTGGGCCGAGCTAA
AAAACTCTCTAAATGCCAGTTTTTGCGAAAATGGCAGAGACACACCTACTTGGGTCCTCAACTCTGATGGCTTTTACTCTGTTGCTTCGGTAAAGAAAGCTATTCACCAA
CCTGATCAAGGCATTTTAACTCTCCAAAATCAAAACACTTTAAAAAATCTTTGGAAAACCAACATTCCAAAGAAATACAGAATTCATCTCTTTATCTTATGCCCTATAGC
AATTTTCATTTGGAGCTTAATATCATCCCACTTAAACAGCAATGTCAACTGTCTCAGTCCCAAGGATCTCTGTATTACTATGTGCAGCTGGAAACAGAAAAGCAAAAAGA
ACATCCTCTTCAACACCTATGCCTCTGCCCTTTGGAACATATGGTTGGAAAGAAATGCCCGCATTTTCAATGGCAAAGAAAAAACAGTTGCGGACTTATGGGAAGATATA
AAAGCTCTTGCAGGACTATGGACTAGCATATCTTTATTGTTTTCAAATTATCAAGCTACATCCTTAGCTCTAAACCTTCATGCGTTTACCTAG
Protein sequenceShow/hide protein sequence
MWDDLRFNVSDFIEGSFSLSIKINIPDGPLCSAWWLSAIYGPAGGRNKNSFWAELLDLKNKCSPIWLLAGDFNVVRFSSKTSAQNLRHYSMRKFNNFISDSNLIDPPLLN
AKYTWSNLRAQPILSRIDRFLYTVGWENLFTAHYSKTLSRVTSDHFPIILESSMISWGPLPFRLINVHLKEPWFKNNFRSWWNNLRQDGHPGFSIMKKFKNLSIIIRDEQ
KRNNRYNDEEKRAWVKEVDNIDRVEAEGNLSEELSLRRTKIKAEILMYDFKEAHIWHQKSKRLWNTEGDENTSFFHRICSARQRMSIISNINSDDGIPCTTNENIVKAFL
DHFEGIYNEGGVDNLWLIENLNWSPIPSSQAQNLCSFFSEEEIHSALVAFSNNKSPGPDDFTMEFFKATWYDLKDDICNIFKDFYINCIINKAVNVTNISLIAKKEKCVV
PADYRPISLTTSIYKIIAKVIAERLKETLPLTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIKGFVIKLDIEKAFDKLNWRFIDFVLMKKGYPIKWRRWIKACISSV
QYSIIINGTPRGKIQPSRGIRQGDPISPFIFVLAMDYVSRMLDSMGENIKGVRMKDNLNLTHLLFADDILLFVEDDEPSLQNLKNIINLFQLASGLNINLNKSTISPINI
DAARIDQIASQWRISTKFFPINYLGVPLGGKPITKAFWKNIDEKINKKLASWKYSMLSKGGKITLIKSTLASLPTYQLSVFKAPASIYKSIEKSWRNFFWKSQTEAHKLH
LVSWKIRNGRNFSFWHSHWHQNSPLSLHYPRLFALSTNQDSSIKNMWNSTLMDWDLKPRRQLRDWEYPLWAELKNSLNASFCENGRDTPTWVLNSDGFYSVASVKKAIHQ
PDQGILTLQNQNTLKNLWKTNIPKKYRIHLFILCPIAIFIWSLISSHLNSNVNCLSPKDLCITMCSWKQKSKKNILFNTYASALWNIWLERNARIFNGKEKTVADLWEDI
KALAGLWTSISLLFSNYQATSLALNLHAFT