; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g019680 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g019680
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr06:44320068..44323505
RNA-Seq ExpressionLcy06g019680
SyntenyLcy06g019680
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.5e-28747.12Show/hide
Query:  MSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEH
        M  FNKFI+D++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  NV +
Subjt:  MSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEH

Query:  WWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDE
        WW +   EG PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DE
Subjt:  WWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDE

Query:  NSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDG
        N++FFHKIC+AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  A+ L S F ++E+ E + +   NK P PDG
Subjt:  NSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDG

Query:  FTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAI
        FT+EF+K  W+  K  I+++F DF+ + +IN+ +N TNIALI K+    + +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAI
Subjt:  FTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAI

Query:  LLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSR
        L+ANEA+D+WRV + +G +IKLD+EKAFDK++W FID +L+KKGYP  WR WIRACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+SR
Subjt:  LLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSR

Query:  LIEAVEKKGLVSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL
        L+ +V +K  + GV + G+I++THLLFADDILLFV+DD+ +I+N+  II  F+  SGL INLNKSTIS IN+   RT +IAS WG +   LPI+YLG PL
Subjt:  LIEAVEKKGLVSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL

Query:  GGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGI
        GG      FW+ + EKI  ++ +W++  LSKGG++TLI S L S+P Y LSIFKAP S C  IEK +R FLW+    +   +LV W  ++SSK +GGLGI
Subjt:  GGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGI

Query:  HKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLY
         ++K+TN ALL KW+WR+ +E++ LW+  IN KY SL     P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY
Subjt:  HKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLY

Query:  QLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIP
         LS+NK  SI ++W+++   W+  PRR L + +   W+E    L     + G+D   W ++ +GL+T  S +  L            +    NLW   IP
Subjt:  QLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIP

Query:  KKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF
        KK   FIW+L + S+NT+++L     N    PS C++C +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N +
Subjt:  KKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF

Query:  IATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF
         + LW +W ERN RIF  K +T  ++WEDI +LA LW ++S +FS+Y AS IALN  +F
Subjt:  IATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0045.91Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C   W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPPL N  YTW+NLR++  +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  +  N+E WW +T   G+ G+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        SF+RRLK LA  +K W        +  KKA   EID+ID LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++  L  PF + E++  +KS  +NK P PDG+ ++F +K W+  K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
         +I  +F DF+ + +IN+ +N T I LI K+      +DFRPISLTT++YK++AK LA+RLK TL DTIS +Q AFV+ RQI++AIL+ANEA+DFWR  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
        ++G +IKLD+EKAFDK++W FID VL+KK Y   WR+ I +CISSV YSI++NG+PRG I+  RGIRQGDPLSPF+FVLAMDYLSRL+  +  K  ++GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMG-DISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
            ++++TH+LFADDIL+FV+D D  + N+ +I+  FE  SGL INL+KSTI  IN+   R   IA SWG +   LP  YLG PLGG P ++ FW+ ++
Subjt:  VMG-DISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        +KIQ ++ NW++  LSKGGR+TLI+S L S+P+Y +S+FK P  I  KIE  +R FLW GAS+  + +L+RW  + S K +GGLGIH +  TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +W+F  E++ LW+  I +KY       FPS  K SS+ SPW A+++    F+ N  W + +G    FW DNW+   PL     RL+ LS+NK  S+ E W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH
        + S   W+    RPL D +   W      LPTP P RG     W ++ + +F T S +  ++  P  P  FH     +   LW  + PKK K FIW+L H
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH

Query:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN
          INT+DRLQ    N   +P+ C +C K+ EDI+HLFIHC  +    +K    L  +   P  + S   ++ +    +Q+ L+  N     LW +W ERN
Subjt:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN

Query:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
         RIF+ + +    LWED ++   LW+ KSK+FS+Y    IALN  +F+
Subjt:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.8e-29044.06Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        +WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +   GFPG+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        +FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++FH+ICT  +
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+ +    L  PF + E+   I S    K P PDG+T+ F+KK W   K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
          +++VF DF+ + ++N N+N+T IALI K+    K SD+RPISLTTSLYKIMAK LA RLKS L DTI+ NQ AF++ RQI+DAIL+ANEA+D W+  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
         KG ++KLD+EKAFDKISW+FID +L KK +P  WR+WI+ACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLSRL+  +E KG + GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
           +  +++HLLFADD+L+FV+D+++ + N+ + +  FE  SGL  N +KSTIS IN+S  RT +IAS +G     LP++YLG PLGG P++  FW+  I
Subjt:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        E I  +++ W++  +SKGGRLTL+ + L+S+P Y LS FKAP S+  +IEK +R FLW G+    + +L+ W I +S K  GGLGI K+K+TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ ++ ++ E+W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS
              WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K FIW++ H+ 
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS

Query:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR
        +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW +W  RN  
Subjt:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR

Query:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-30046.04Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD
        MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL   C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD

Query:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF
        ++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  N+ +WW +   EG 
Subjt:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF

Query:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT
        PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Subjt:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT

Query:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFW
        AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  A+ L S F ++E+ E + +   NK P              
Subjt:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFW

Query:  NTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW
                        ++ ++  +N TNIALI K+    + +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAIL+ANEA+D+W
Subjt:  NTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW

Query:  RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGL
        R  + +G +IKLD+EKAFDK++W FID +L+KKGYP  WR WIRACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+SRL+ +V +K  
Subjt:  RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGL

Query:  VSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFW
        + GV + G+I++THLLFADDILLFV+DD+ +I+N+  II  F+  SGL INLNKSTIS IN+   RT +IAS WG +   LPI+YLG PLGG      FW
Subjt:  VSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFW

Query:  EPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDAL
        + + EKI  ++ +W++  LSKGG++TLI S L S+P Y LSIFKAP S C  IEK +R FLW+    +   +LV W  ++SSK +GGLGI ++K+TN AL
Subjt:  EPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDAL

Query:  LLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSI
        L KW+WR+ +E++ LW+  IN KY SL     P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY LS+NK  SI
Subjt:  LLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSI

Query:  AEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSL
         ++W+++   W+  PRR L + +L  W+E    +     + G D   W ++ +GL+T  S +  L            +    NLW   IPKK   FIW+L
Subjt:  AEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSL

Query:  FHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNE
         + S+NT+++L          PS C++C +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N + + LW +W E
Subjt:  FHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNE

Query:  RNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF
        RN RIF  K +T  ++WEDI +LA LW ++S +FS+Y AS IALN  +F
Subjt:  RNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-28943.97Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        +WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +   GFPG+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        +FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++FH+ICT  +
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+ +    L  PF + E+   I S    K P PDG+T+ F+KK W   K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
          +++VF DF+ + ++N N+N+T IALI K+    K SD+RPISLTTSLYKIMAK LA RLKS L DTI+ NQ AF++ RQI+DAIL+ANE +D W+  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
         KG ++KLD+EKAFDKISW+FID +L KK +P  WR+WI+ACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLSRL+  +E KG + GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
           +  +++HLLFADD+L+FV+D+++ + N+ + +  FE  SGL  N +KSTIS IN+S  RT +IAS +G     LP++YLG PLGG P++  FW+  I
Subjt:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        E I  +++ W++  +SKGGRLTL+ + L+S+P Y LS FKAP S+  +IEK +R FLW G+    + +L+ W I +S K  GGLGI K+K+TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ ++ ++ E+W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS
              WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K FIW++ H+ 
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS

Query:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR
        +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW +W  RN  
Subjt:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR

Query:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein0.0e+0045.91Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C   W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPPL N  YTW+NLR++  +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  +  N+E WW +T   G+ G+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        SF+RRLK LA  +K W        +  KKA   EID+ID LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++  L  PF + E++  +KS  +NK P PDG+ ++F +K W+  K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
         +I  +F DF+ + +IN+ +N T I LI K+      +DFRPISLTT++YK++AK LA+RLK TL DTIS +Q AFV+ RQI++AIL+ANEA+DFWR  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
        ++G +IKLD+EKAFDK++W FID VL+KK Y   WR+ I +CISSV YSI++NG+PRG I+  RGIRQGDPLSPF+FVLAMDYLSRL+  +  K  ++GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMG-DISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
            ++++TH+LFADDIL+FV+D D  + N+ +I+  FE  SGL INL+KSTI  IN+   R   IA SWG +   LP  YLG PLGG P ++ FW+ ++
Subjt:  VMG-DISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        +KIQ ++ NW++  LSKGGR+TLI+S L S+P+Y +S+FK P  I  KIE  +R FLW GAS+  + +L+RW  + S K +GGLGIH +  TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +W+F  E++ LW+  I +KY       FPS  K SS+ SPW A+++    F+ N  W + +G    FW DNW+   PL     RL+ LS+NK  S+ E W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH
        + S   W+    RPL D +   W      LPTP P RG     W ++ + +F T S +  ++  P  P  FH     +   LW  + PKK K FIW+L H
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH

Query:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN
          INT+DRLQ    N   +P+ C +C K+ EDI+HLFIHC  +    +K    L  +   P  + S   ++ +    +Q+ L+  N     LW +W ERN
Subjt:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN

Query:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
         RIF+ + +    LWED ++   LW+ KSK+FS+Y    IALN  +F+
Subjt:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein2.3e-29044.06Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        +WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +   GFPG+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        +FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++FH+ICT  +
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+ +    L  PF + E+   I S    K P PDG+T+ F+KK W   K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
          +++VF DF+ + ++N N+N+T IALI K+    K SD+RPISLTTSLYKIMAK LA RLKS L DTI+ NQ AF++ RQI+DAIL+ANEA+D W+  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
         KG ++KLD+EKAFDKISW+FID +L KK +P  WR+WI+ACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLSRL+  +E KG + GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
           +  +++HLLFADD+L+FV+D+++ + N+ + +  FE  SGL  N +KSTIS IN+S  RT +IAS +G     LP++YLG PLGG P++  FW+  I
Subjt:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        E I  +++ W++  +SKGGRLTL+ + L+S+P Y LS FKAP S+  +IEK +R FLW G+    + +L+ W I +S K  GGLGI K+K+TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ ++ ++ E+W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS
              WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K FIW++ H+ 
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS

Query:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR
        +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW +W  RN  
Subjt:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR

Query:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein6.5e-30146.04Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD
        MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL   C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD

Query:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF
        ++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  N+ +WW +   EG 
Subjt:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF

Query:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT
        PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Subjt:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT

Query:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFW
        AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  A+ L S F ++E+ E + +   NK P              
Subjt:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFW

Query:  NTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW
                        ++ ++  +N TNIALI K+    + +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAIL+ANEA+D+W
Subjt:  NTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW

Query:  RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGL
        R  + +G +IKLD+EKAFDK++W FID +L+KKGYP  WR WIRACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+SRL+ +V +K  
Subjt:  RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGL

Query:  VSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFW
        + GV + G+I++THLLFADDILLFV+DD+ +I+N+  II  F+  SGL INLNKSTIS IN+   RT +IAS WG +   LPI+YLG PLGG      FW
Subjt:  VSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFW

Query:  EPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDAL
        + + EKI  ++ +W++  LSKGG++TLI S L S+P Y LSIFKAP S C  IEK +R FLW+    +   +LV W  ++SSK +GGLGI ++K+TN AL
Subjt:  EPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDAL

Query:  LLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSI
        L KW+WR+ +E++ LW+  IN KY SL     P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY LS+NK  SI
Subjt:  LLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSI

Query:  AEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSL
         ++W+++   W+  PRR L + +L  W+E    +     + G D   W ++ +GL+T  S +  L            +    NLW   IPKK   FIW+L
Subjt:  AEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSL

Query:  FHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNE
         + S+NT+++L          PS C++C +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N + + LW +W E
Subjt:  FHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNE

Query:  RNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF
        RN RIF  K +T  ++WEDI +LA LW ++S +FS+Y AS IALN  +F
Subjt:  RNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein6.7e-29043.97Show/hide
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        +WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +   GFPG+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        +FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++FH+ICT  +
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+ +    L  PF + E+   I S    K P PDG+T+ F+KK W   K
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
          +++VF DF+ + ++N N+N+T IALI K+    K SD+RPISLTTSLYKIMAK LA RLKS L DTI+ NQ AF++ RQI+DAIL+ANE +D W+  +
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
         KG ++KLD+EKAFDKISW+FID +L KK +P  WR+WI+ACIS+V YSI+LNG P+G I+A+RGIRQGDPLSPF+FVLAMDYLSRL+  +E KG + GV
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI
           +  +++HLLFADD+L+FV+D+++ + N+ + +  FE  SGL  N +KSTIS IN+S  RT +IAS +G     LP++YLG PLGG P++  FW+  I
Subjt:  VMGD-ISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMI

Query:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
        E I  +++ W++  +SKGGRLTL+ + L+S+P Y LS FKAP S+  +IEK +R FLW G+    + +L+ W I +S K  GGLGI K+K+TN ALL KW
Subjt:  EKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ ++ ++ E+W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS
              WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K FIW++ H+ 
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRS

Query:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR
        +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW +W  RN  
Subjt:  INTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRR

Query:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  IFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein3.1e-28747.12Show/hide
Query:  MSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEH
        M  FNKFI+D++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  NV +
Subjt:  MSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEH

Query:  WWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDE
        WW +   EG PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DE
Subjt:  WWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDE

Query:  NSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDG
        N++FFHKIC+AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  A+ L S F ++E+ E + +   NK P PDG
Subjt:  NSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDG

Query:  FTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAI
        FT+EF+K  W+  K  I+++F DF+ + +IN+ +N TNIALI K+    + +D+RPISLTTS+YK++AKV+AERLK TL  T++ NQ AFV+ RQI DAI
Subjt:  FTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAI

Query:  LLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSR
        L+ANEA+D+WRV + +G +IKLD+EKAFDK++W FID +L+KKGYP  WR WIRACISSV YSII+NG+PRG IQ  RGIRQGDP+SPF+FVLAMDY+SR
Subjt:  LLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSR

Query:  LIEAVEKKGLVSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL
        L+ +V +K  + GV + G+I++THLLFADDILLFV+DD+ +I+N+  II  F+  SGL INLNKSTIS IN+   RT +IAS WG +   LPI+YLG PL
Subjt:  LIEAVEKKGLVSGVVM-GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL

Query:  GGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGI
        GG      FW+ + EKI  ++ +W++  LSKGG++TLI S L S+P Y LSIFKAP S C  IEK +R FLW+    +   +LV W  ++SSK +GGLGI
Subjt:  GGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGI

Query:  HKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLY
         ++K+TN ALL KW+WR+ +E++ LW+  IN KY SL     P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY
Subjt:  HKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLY

Query:  QLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIP
         LS+NK  SI ++W+++   W+  PRR L + +   W+E    L     + G+D   W ++ +GL+T  S +  L            +    NLW   IP
Subjt:  QLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIP

Query:  KKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF
        KK   FIW+L + S+NT+++L     N    PS C++C +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N +
Subjt:  KKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF

Query:  IATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF
         + LW +W ERN RIF  K +T  ++WEDI +LA LW ++S +FS+Y AS IALN  +F
Subjt:  IATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.1e-5024.87Show/hide
Query:  LLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLD-
        L+ GDFN      + S+         + N  +  TDL+D      P    YT+ +       S++D  + S +   K     ++ ++   SDH  I L+ 
Subjt:  LLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLD-

Query:  -------DSSSTWGPCPFRFDNYLLDN------KSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSG
                 S+TW       ++Y + N      K F    E+  T T+   +  F  + R KF+A           +++K K+     E  +ID L S  
Subjt:  -------DSSSTWGPCPFRFDNYLLDN------KSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSG

Query:  SLDDMAKQLRKSLKAD-LQETALLEARYWNQRCKKLWLSDRDENSAFFHKI----------CTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY
         L ++ KQ +   KA   QE   + A       +K      +  S FF +I             +R +NQI  +   +G        ++  + +++  +Y
Subjt:  SLDDMAKQLRKSLKAD-LQETALLEARYWNQRCKKLWLSDRDENSAFFHKI----------CTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY

Query:  ----DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPK
            +  +  +  +       +N      L  P    E+   I S+   K P PDGFT EF++++     P ++ +F       ++  +    +I LIPK
Subjt:  ----DYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPK

Query:  RNM-TGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQ----ISDAILLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCV
            T K  +FRPISL     KI+ K+LA R++  ++  I  +Q  F+   Q    I  +I   N      R   K  V+I +D EKAFDKI   F+   
Subjt:  RNM-TGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQ----ISDAILLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCV

Query:  LLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDISVTHLLFADDILLFVQDDDK
        L K G   ++ + IRA     + +IILNG+       K G RQG PLSP LF + ++ L+R   A+ ++  + G+ +G   V   LFADD+++++++   
Subjt:  LLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDISVTHLLFADDILLFVQDDDK

Query:  AIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLF---WEPMIEKIQHRIHNWRFVSLSKGGRLTL
        + +N+  +I +F   SG +IN+ KS     N + Q  S+I       +    I YLG  L    K +LF   ++P++++I+   + W+ +  S  GR+ +
Subjt:  AIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLF---WEPMIEKIQHRIHNWRFVSLSKGGRLTL

Query:  IHSVLNSMPLYILSI--FKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEEN
        +   +    +Y  +    K P +   ++EK   KF+W    +      +   I+S     GG+ +   K    A + K  W ++   +
Subjt:  IHSVLNSMPLYILSI--FKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEEN

P08548 LINE-1 reverse transcriptase homolog2.4e-4223.05Show/hide
Query:  IYGPASRKKRKSFWRE-LYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPVMSRLDRFLFS
        IY P        F RE L D+  L     ++ GDFN      + SS       +   N  I   DL D      P    YT+ +  +    S++D  L  
Subjt:  IYGPASRKKRKSFWRE-LYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPVMSRLDRFLFS

Query:  NSWCIK----------FNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSN
         S   K          F+DHH  K+    + +    L   + TW     + +N +L +   I  ++   T    +     +  + L   A+ V   K   
Subjt:  NSWCIK----------FNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSN

Query:  TDSF--KEKKKAISN---EIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKI---------CTARRRRNQIH
          +F  K +++ ++N    + +++  E S       K++ K ++A+L E   +E +   Q+  K         S FF KI          T ++R   + 
Subjt:  TDSF--KEKKKAISN---EIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKI---------CTARRRRNQIH

Query:  ELFTKEGISIVSD-FLLEKEVIDHFADIYDYNQNS----EWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKP
                 I +D   ++K + +++  +Y +   +    +  +   +   ++      L  P    E+   I+++ + K P PDGFT EF++ F     P
Subjt:  ELFTKEGISIVSD-FLLEKEVIDHFADIYDYNQNS----EWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKP

Query:  SIMSVFHDFYHSKVINRNMNHTNIALIPK--RNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQ----ISDAILLANEAVDF
         ++++F +     ++       NI LIPK  ++ T K  ++RPISL     KI+ K+L  R++  ++  I  +Q  F+   Q    I  +I   N     
Subjt:  SIMSVFHDFYHSKVINRNMNHTNIALIPK--RNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQ----ISDAILLANEAVDF

Query:  WRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKG
         ++  K  +++ +D EKAFD I   F+   L K G    + + I A  S  + +IILNG    +   + G RQG PLSP LF + M+ L+    A+ ++ 
Subjt:  WRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKG

Query:  LVSGVVMGDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL----GGIPKN
         + G+ +G   +   LFADD+++++++   +   +  +IK + + SG +IN +KS       +NQ    +  S    + P  + YLG  L      + K 
Subjt:  LVSGVVMGDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPL----GGIPKN

Query:  NLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSI--FKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIK
        N  +E + ++I   ++ W+ +  S  GR+ ++   +    +Y  +    KAP S    +EKI   F+W           +   ++S+    GG+ +  ++
Subjt:  NLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSI--FKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIK

Query:  ETNDALLLKWIWRFF-NEENTLWRNFINNKY--SSLHFECFPSSSK
            ++++K  W +  N E  +W    N +   ++ H+  F    K
Subjt:  ETNDALLLKWIWRFF-NEENTLWRNFINNKY--SSLHFECFPSSSK

P0C2F6 Putative ribonuclease H protein At1g657501.8e-3427.12Show/hide
Query:  MIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLL
        ++E++  R+  WR  +LS  GRLTL  +VL+SMP++ +S    P SI N+++++ R FLW   +     +LV+W  V S K EGGLG+   K  N AL+ 
Subjt:  MIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLL

Query:  KWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAIS-KLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIA
        K  WR   E+N+LW   +  KY               S  S W +I+  L+D+      W   +G+   FW D W S  PL  + N   +  ++ +  +A
Subjt:  KWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAIS-KLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIA

Query:  EVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSV--LPSRPFHSPGEKILNNLWTADIPKKIKVFIWS
        +      R W+F    P +  +  R    A +L      R  D   W  S DG F+ +SA  +L+V  +P RP  +      N LW   +P+++K F+W 
Subjt:  EVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSV--LPSRPFHSPGEKILNNLWTADIPKKIKVFIWS

Query:  LFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCA-DLF------TSKAISQRQLLRRNVFIA
        + ++++ T +      +  L   +VC +C    E + H+   C           L + + +VP      F +  LF               +    +F  
Subjt:  LFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCA-DLF------TSKAISQRQLLRRNVFIA

Query:  TLWLLWNERNRRIFEDKARTRNQL
         +W  W  R   IF +  + R+++
Subjt:  TLWLLWNERNRRIFEDKARTRNQL

P11369 LINE-1 retrotransposable element ORF2 protein4.5e-4126.99Show/hide
Query:  INSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPK-RNMTGKISDFRPISLTTSLY
        +N      L SP    E+   I S+   K P PDGF+ EF++ F     P +  +FH       +  +     I LIPK +    KI +FRPISL     
Subjt:  INSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPK-RNMTGKISDFRPISLTTSLY

Query:  KIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW-RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYS
        KI+ K+LA R++  ++  I  +Q  F+   Q    I  +   + +  ++  K  ++I LD EKAFDKI   F+  VL + G    +   I+A  S    +
Subjt:  KIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFW-RVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYS

Query:  IILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNK
        I +NG+    I  K G RQG PLSP+LF + ++ L+R   A+ ++  + G+ +G   V   L ADD+++++ D   +   +  +I SF    G +IN NK
Subjt:  IILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNK

Query:  STISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKN--NLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSI--FKAPTSIC
        S       + Q   EI  +   ++    I YLG  L    K+  +  ++ + ++I+  +  W+ +  S  GR+ ++   +    +Y  +    K PT   
Subjt:  STISGINLSNQRTSEIASSWGCNLHPLPIDYLGAPLGGIPKN--NLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSI--FKAPTSIC

Query:  NKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH
        N++E    KF+W       + +L++       +  GG+ +  +K    A+++K  W ++ +      N I +   + H
Subjt:  NKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH

P14381 Transposon TX1 uncharacterized 149 kDa protein4.5e-4124.31Show/hide
Query:  GFSWWLSGIYGPASRKKRKSFWREL--YDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPV-
        G ++ L  +Y P +  +R  F+  L  Y       +  ++GGDFN    + + +       S S   + I+   L+D      P    +T+  +R   V 
Subjt:  GFSWWLSGIYGPASRKKRKSFWREL--YDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGP----YTWTNLRSEPV-

Query:  MSRLDRFLFSNSWCIK----------FNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWW------TDTFCEGFPGFSFIR
         SR+DR   S+    +          F+DH+   L    +   P      ++ W      F+N LL+++ F  +V   W       D F      +   +
Subjt:  MSRLDRFLFSNSWCIK----------FNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWW------TDTFCEGFPGFSFIR

Query:  -RLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRN
          LK L ++   +  S +     + +A++ E+  ++    SGS D   +      K  L+     +AR    R +   L D D  S FF+ +   +  R 
Subjt:  -RLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRN

Query:  QIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSAR---GLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK
        QI  LF ++G  +     +       + +++  +  S      L W+ +  VS R    L +P   DE+ + ++ +  NK P  DG TIEFF+ FW+T  
Subjt:  QIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSAR---GLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFK

Query:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR
        P    V  + +    +  +     ++L+PK+     I ++RP+SL ++ YKI+AK ++ RLKS L + I  +QS  V  R I D + L  + + F R + 
Subjt:  PSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSR

Query:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV
             + LD EKAFD++   ++   L    +   +  +++   +S    + +N      +   RG+RQG PLS  L+ LA++    L+     +  ++G+
Subjt:  KKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSIILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGV

Query:  VM--GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIAS-----SWGCNLHPLPIDYLGAPLGG--IPKN
        V+   D+ V    +ADD++L  Q D   +E      + +   S  RIN +KS  SG+   + +   +       SW   +    I YLG  L     P +
Subjt:  VM--GDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQRTSEIAS-----SWGCNLHPLPIDYLGAPLGG--IPKN

Query:  NLFWEPMIEKIQHRIHNWRFVS--LSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIK
          F E + E +  R+  W+  +  LS  GR  +I+ ++ S   Y L           KI++    FLW G       + V   + S    EGG G+  I+
Subjt:  NLFWEPMIEKIQHRIHNWRFVS--LSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIK

Query:  ETNDALLLKWIWRF-FNEENTLWRNFINNKY
               L+ I R+ + + +  W    ++ Y
Subjt:  ETNDALLLKWIWRF-FNEENTLWRNFINNKY

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.2e-2825.99Show/hide
Query:  LLGGDFNVFRHSSETSSNNPAKLSM---SKFNKFISDTDLLDPPLINGPYTWTNLRSE-PVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFP--ILL
        +L GDF+    +S+  S     + M    +F   + D+DL+D P     YTW+N + + P++ +LDR + +  W   F    +       SDH P  I+L
Subjt:  LLGGDFNVFRHSSETSSNNPAKLSM---SKFNKFISDTDLLDPPLINGPYTWTNLRSE-PVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFP--ILL

Query:  DDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALES---SGSLDDMAKQL
        ++       C FR+ ++L  + +F+ ++   W +    G   FS    LK  A K K  KL N   F   +      +D +++++S   +   D + +  
Subjt:  DDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALES---SGSLDDMAKQL

Query:  RKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQ-----NSEWIIVNLNW
          + K      A LE+ ++ Q+ +  WL D D N+ FFHK+  A + +N I  L   + + + +   +++ ++ ++  +   +      +S   I +++ 
Subjt:  RKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQ-----NSEWIIVNLNW

Query:  EPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSL
           N   A  L +   D E+   + ++ +NK P PD FT EFF + W   K S ++   +F+ +  + +  N T I LIPK     ++S FRP+S  T +
Subjt:  EPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDFRPISLTTSL

Query:  YKIM
        YKI+
Subjt:  YKIM

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-2928.11Show/hide
Query:  LPIDYLGAPLGGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVS
        LP+ YLG PL         + P++EKI+ RI  W    LS  GRL LI SV++S+  + +S F+ P++   +I+ I   FLW G   +     V W  V 
Subjt:  LPIDYLGAPLGGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVS

Query:  SSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFG
        + K EGGLGI  +KE N              + + W                  S   +     W  I K + +     + DI NG +T FW DNWS  G
Subjt:  SSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFG

Query:  PLKFVCNRLYQLSSNK---NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGD---GLFTTKSARAILSVLPSRPFH
               RL  ++ ++   ++ I    S ++ + N +PRR   D  L+     AE +       G D  RW  +GD     F TK   A           
Subjt:  PLKFVCNRLYQLSSNK---NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGD---GLFTTKSARAILSVLPSRPFH

Query:  SPGEKI--LNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC
         P  K+     +W +    K  V  W      + T DR+ +    +    S C+LC    E  DHLF  C
Subjt:  SPGEKI--LNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC

AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-2223.8Show/hide
Query:  SMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFP
        ++P Y ++ F  P ++C +I  +   F W     +   +   W+ +S  KAEGG+G   I+  N ALL K +WR  +   +L      ++Y     +  P
Subjt:  SMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFP

Query:  SSSKVSSSRS-PWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSD------RMWNFQPRRPLFDRDLQR
         ++ + S  S  W +I   Q+I     R  + NG   + W   W    P      R+ ++   +  S++ +   SD      R W       LF  +++R
Subjt:  SSSKVSSSRS-PWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSD------RMWNFQPRRPLFDRDLQR

Query:  WSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGE-------KILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSL
             EL   P  +R  D   W  +  G +T KS   +L+ + ++   SP E        I   +W +    KI+ F+W     S+  +    A+    L
Subjt:  WSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGE-------KILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSL

Query:  HNPSVCILCWKNSEDIDHLFIHC--SRASFFRNKINLALGLSMVPPATIDSFCADLF--TSKAISQRQLLRRNVFIA-TLWLLWNERNRRIFEDK
           S CI C    E ++HL   C  +R ++  + I + LG         DS   +L+   +      Q  + +  +   LW LW  RN  +F  +
Subjt:  HNPSVCILCWKNSEDIDHLFIHC--SRASFFRNKINLALGLSMVPPATIDSFCADLF--TSKAISQRQLLRRNVFIA-TLWLLWNERNRRIFEDK

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-1223.57Show/hide
Query:  RWDIRNGRSTLFWHDNWSSFGPLKFVCNRL--YQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFT
        R D+ NG S  FW+D W+ FG L          QL   ++  + E   + D  W     R     + Q +     + P P+  RG D   W  +      
Subjt:  RWDIRNGRSTLFWHDNWSSFGPLKFVCNRL--YQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFT

Query:  TKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRA----SFFRNKIN
        + S+R     +     HSP       +W  +   +  +  W  F   + T DRL+    N    PS  +LC    E   HLF  CS +     FF +K  
Subjt:  TKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRA----SFFRNKIN

Query:  LALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL
         +      PP  + +  + +      S    + + +  + ++ +W ERN RIF   + + + L
Subjt:  LALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.5e-1146.27Show/hide
Query:  ILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDIS--VTHLLFADD
        I+NG P+G +   RG+RQGDPLSP+LF+L  + LS L    +++G + G+ + + S  + HLLFADD
Subjt:  ILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDIS--VTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGGTCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTA
TGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAACTTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGC
ATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATGTCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATAT
ACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGACAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCG
TTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTTCTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCA
ATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCTGGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAAC
ACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGATTGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATC
TCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTTATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATA
AAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTATTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTT
GCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGTCAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGTCTGATATCTCCTTTTAGGGATGA
TGAGGTTTTCGAATGTATTAAGTCCATTGGCCAAAATAAAGTTCCGTGCCCAGATGGTTTCACCATTGAGTTTTTTAAGAAATTCTGGAATACTTTCAAACCTTCTATTA
TGTCAGTCTTCCACGATTTCTACCACAGCAAGGTTATCAATCGAAATATGAATCACACCAATATTGCGCTCATCCCCAAAAGGAACATGACTGGGAAAATTTCAGATTTC
CGTCCTATTAGCCTCACCACCTCTCTTTACAAAATTATGGCTAAGGTTTTGGCGGAGCGCTTAAAGTCCACTTTGGAAGACACTATAAGCTTGAATCAGTCGGCTTTTGT
TCGAAAGAGACAAATCTCTGACGCCATTCTGCTAGCTAACGAAGCTGTTGACTTCTGGAGAGTTTCTAGAAAGAAAGGTGTCCTTATAAAGCTTGATGTGGAGAAAGCTT
TTGACAAAATTAGTTGGAATTTCATCGATTGTGTTCTCCTTAAAAAAGGATACCCGACAATTTGGCGAGAATGGATTAGAGCTTGCATATCTTCAGTTTCTTATTCTATC
ATTCTCAACGGCAAGCCTCGAGGTAACATTCAAGCCAAAAGAGGAATTAGACAAGGCGATCCTCTATCTCCTTTTCTTTTTGTCCTTGCCATGGATTACCTTAGTAGATT
AATCGAAGCTGTCGAAAAAAAAGGGCTCGTTTCTGGAGTTGTTATGGGGGATATTTCTGTCACACACCTTCTATTTGCTGATGATATTTTACTTTTTGTTCAAGATGATG
ACAAAGCCATTGAAAATATGTATCTTATCATTAAGTCTTTTGAACATGGTTCTGGTTTGCGTATAAATCTCAACAAATCCACTATCTCTGGTATCAACCTATCGAACCAA
AGAACATCTGAGATCGCATCCTCTTGGGGCTGTAATCTTCACCCCCTACCCATTGATTATCTTGGCGCTCCTTTGGGTGGAATTCCTAAGAATAACCTGTTTTGGGAGCC
CATGATAGAGAAGATTCAGCATAGAATTCACAATTGGCGGTTTGTATCTCTTTCTAAAGGAGGTCGTCTCACTCTTATTCACTCGGTTCTCAATAGTATGCCCCTCTACA
TCCTCTCGATTTTCAAAGCCCCAACGTCTATCTGCAACAAAATTGAAAAAATTTTCCGAAAATTTCTTTGGGAGGGAGCTTCTTCTTCAGGCTCCACTAATCTTGTGAGA
TGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACAAAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGA
AAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCATTTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCT
CTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAATGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTC
TGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTGAAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAG
AGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAATCCTCAAAGGGGTTCGGATATTCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTA
AATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAGTCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTC
TTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTCAAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTC
TGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTCAGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTT
TTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCTCAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATT
TTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCTCTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATAT
TGCTTTAAATTGGAAATCTTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGGTCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTA
TGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAACTTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGC
ATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATGTCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATAT
ACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGACAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCG
TTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTTCTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCA
ATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCTGGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAAC
ACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGATTGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATC
TCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTTATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATA
AAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTATTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTT
GCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGTCAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGTCTGATATCTCCTTTTAGGGATGA
TGAGGTTTTCGAATGTATTAAGTCCATTGGCCAAAATAAAGTTCCGTGCCCAGATGGTTTCACCATTGAGTTTTTTAAGAAATTCTGGAATACTTTCAAACCTTCTATTA
TGTCAGTCTTCCACGATTTCTACCACAGCAAGGTTATCAATCGAAATATGAATCACACCAATATTGCGCTCATCCCCAAAAGGAACATGACTGGGAAAATTTCAGATTTC
CGTCCTATTAGCCTCACCACCTCTCTTTACAAAATTATGGCTAAGGTTTTGGCGGAGCGCTTAAAGTCCACTTTGGAAGACACTATAAGCTTGAATCAGTCGGCTTTTGT
TCGAAAGAGACAAATCTCTGACGCCATTCTGCTAGCTAACGAAGCTGTTGACTTCTGGAGAGTTTCTAGAAAGAAAGGTGTCCTTATAAAGCTTGATGTGGAGAAAGCTT
TTGACAAAATTAGTTGGAATTTCATCGATTGTGTTCTCCTTAAAAAAGGATACCCGACAATTTGGCGAGAATGGATTAGAGCTTGCATATCTTCAGTTTCTTATTCTATC
ATTCTCAACGGCAAGCCTCGAGGTAACATTCAAGCCAAAAGAGGAATTAGACAAGGCGATCCTCTATCTCCTTTTCTTTTTGTCCTTGCCATGGATTACCTTAGTAGATT
AATCGAAGCTGTCGAAAAAAAAGGGCTCGTTTCTGGAGTTGTTATGGGGGATATTTCTGTCACACACCTTCTATTTGCTGATGATATTTTACTTTTTGTTCAAGATGATG
ACAAAGCCATTGAAAATATGTATCTTATCATTAAGTCTTTTGAACATGGTTCTGGTTTGCGTATAAATCTCAACAAATCCACTATCTCTGGTATCAACCTATCGAACCAA
AGAACATCTGAGATCGCATCCTCTTGGGGCTGTAATCTTCACCCCCTACCCATTGATTATCTTGGCGCTCCTTTGGGTGGAATTCCTAAGAATAACCTGTTTTGGGAGCC
CATGATAGAGAAGATTCAGCATAGAATTCACAATTGGCGGTTTGTATCTCTTTCTAAAGGAGGTCGTCTCACTCTTATTCACTCGGTTCTCAATAGTATGCCCCTCTACA
TCCTCTCGATTTTCAAAGCCCCAACGTCTATCTGCAACAAAATTGAAAAAATTTTCCGAAAATTTCTTTGGGAGGGAGCTTCTTCTTCAGGCTCCACTAATCTTGTGAGA
TGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACAAAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGA
AAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCATTTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCT
CTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAATGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTC
TGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTGAAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAG
AGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAATCCTCAAAGGGGTTCGGATATTCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTA
AATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAGTCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTC
TTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTCAAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTC
TGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTCAGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTT
TTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCTCAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATT
TTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCTCTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATAT
TGCTTTAAATTGGAAATCTTTTCTGTAG
Protein sequenceShow/hide protein sequence
MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPY
TWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSN
TDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHF
ADIYDYNQNSEWIIVNLNWEPINSVSARGLISPFRDDEVFECIKSIGQNKVPCPDGFTIEFFKKFWNTFKPSIMSVFHDFYHSKVINRNMNHTNIALIPKRNMTGKISDF
RPISLTTSLYKIMAKVLAERLKSTLEDTISLNQSAFVRKRQISDAILLANEAVDFWRVSRKKGVLIKLDVEKAFDKISWNFIDCVLLKKGYPTIWREWIRACISSVSYSI
ILNGKPRGNIQAKRGIRQGDPLSPFLFVLAMDYLSRLIEAVEKKGLVSGVVMGDISVTHLLFADDILLFVQDDDKAIENMYLIIKSFEHGSGLRINLNKSTISGINLSNQ
RTSEIASSWGCNLHPLPIDYLGAPLGGIPKNNLFWEPMIEKIQHRIHNWRFVSLSKGGRLTLIHSVLNSMPLYILSIFKAPTSICNKIEKIFRKFLWEGASSSGSTNLVR
WEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFV
CNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKV
FIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRI
FEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL