; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003204 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003204
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr4:48971067..48974588
RNA-Seq ExpressionLag0003204
SyntenyLag0003204
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66330.1 hypothetical protein VITISV_000598 [Vitis vinifera]4.2e-19737.72Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGT---------------SGGI---------AIIWNE
        M  ISWN+RGLGS  KR +VKDFL   NP +V+ QETK    DR+ + S+WS RN  WA L A G                 GG+            W E
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGT---------------SGGI---------AIIWNE

Query:  SS---------------FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISD
         S               F+V+            T +MR F+ FI  + L D PL N  FTWS+ + +P    +DR+L S+   + F  +    L R  SD
Subjt:  SS---------------FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISD

Query:  HFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLD
        H+PI L     +WGPTPFRF N WL H  F +S  SWWR     GW GH F++KL+ +K +LK WN N FG   +++ S++ E+  ID  E+ G +S   
Subjt:  HFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLD

Query:  ISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTKLNSSPPLPTID----
        ++ R   K +L  +   EEI W+QK+K+KW  EGD N+  FH++    R K+ I  + +  G  + S E I +E + +++KL+    S PP  +      
Subjt:  ISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTKLNSSPPLPTID----

Query:  DWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTT
        DWSPIS E  + L++PF EAEI+ A+  L  +K PGPDGFT   F+  W+++K D+  VF +F  SGIIN + N ++I L+PKK  +K + D+RPISL T
Subjt:  DWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTT

Query:  CLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVN
        CLYKIIA+VLS RL+ VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V+WDFLD+VL+ KGF  RWR W+R C+SSV+
Subjt:  CLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVN

Query:  YSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRL
        Y+I++NG  +G +KA RGLRQ   +        +  LSRMLL  E+   +EGF VG N+    ++HLQFADDTILF+SP++D +  L +++  F  IS L
Subjt:  YSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRL

Query:  NINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIW
         +N +KS   GI L Q  +  LAS    K   WP++YLGLPL GNP    FW P+IE+I +RL  W    +S G     G +++  +H    S +  +  
Subjt:  NINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIW

Query:  RFYQEPEALWRKIIATKYGVARNPLKLGN-HSISSSKGPWKAIHSLQHFIYDNIDSRVGKEAFIADLWNPSTG-SWNLHLRRDLNENEILEWAILSHHLT
          ++ P ++  KI   +     + +  G+   I   +  W     L+          + K   I+ +   S   SWN + RR+L ++EI +   L H L 
Subjt:  RFYQEPEALWRKIIATKYGVARNPLKLGN-HSISSSKGPWKAIHSLQHFIYDNIDSRVGKEAFIADLWNPSTG-SWNLHLRRDLNENEILEWAILSHHLT

Query:  NFSLK-DEEDSWSWQLGNIDSFTTGSL---TKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQ
          +L     D+ SW L +   FT  S      ++  SSP   T+    +WK  +  KVK  IW ++H  +NT D++Q R P  ++SP+ C +C +  ++ 
Subjt:  NFSLK-DEEDSWSWQLGNIDSFTTGSL---TKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQ

Query:  AHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
         HLF  C      W  L Q     +  P  I  ++ ++  G   +K   ++WQ    A +W +W ERNAR+
Subjt:  AHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera]1.0e-19835.12Show/hide
Query:  ISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE----------
        ISWN RGLGS KKR +VKDFL S  P +V++QETK A+ DR+ + S+W++RN  WA L A G SGGI +IW+            SFSV            
Subjt:  ISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE----------

Query:  -----------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSD
                                                             T +M+  + FI    L D PL +  FTWS+ + +P    +DR+L S+
Subjt:  -----------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSD

Query:  NIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSL
             F  +    L R  SDH+PI L     +WGPTPFRF N WL H +F +    WWR     GW GH F++KL+ LK +LK+WN N FG   +++  +
Subjt:  NIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSL

Query:  NQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQ
          ++   D  E+ G +S   + +R   K +L  +   EEI WRQK+++KW  EGD N+  FH++    R +  I  + +  G  L + + I++E + +++
Subjt:  NQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQ

Query:  KLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKK
        KL+   +         DWSPIS E  + LE+PFTE EIYKA+  +  +  PGPDGFT   F+  W+++K D+  VF++F +SGIIN S N ++I L+PKK
Subjt:  KLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKK

Query:  IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGF
          AK + +YRPISL T LYKIIA+VL+ RL+ +L +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD+V++ KGF
Subjt:  IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGF

Query:  GQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLI
            RKWIR C+SSV+++I++NG  +G +K  RGLRQ   +        +   S MLL AE+    EGF VG N+    ++HLQFADDTI FSS +++ +
Subjt:  GQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLI

Query:  TNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------
          L +V+  F  IS L +N +KS   GI L Q  +  LA     K   WP++YLGLPL GNPK  SFW P+IE+I  RL  W    +S G          
Subjt:  TNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------

Query:  -------------PTD------------------------------------EGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLK
                     P                                      +GGL +  I  +N++LL KW+WR+ +E  ALW ++I + YG   N   
Subjt:  -------------PTD------------------------------------EGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLK

Query:  LGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW--NPSTG---------------------------SWNLHLRRDLNENEILEW
               S + PWKAI            FI  + D    +  F  DLW  + S G                           SWN + RR+L+++EI + 
Subjt:  LGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW--NPSTG---------------------------SWNLHLRRDLNENEILEW

Query:  AILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHK
          L   L +  L     D  SW L +   FT  S    L+  S          +W   +P K+KFF+W ++H  +NT D++Q R P  +LSP  C++C +
Subjt:  AILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHK

Query:  ASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
          +T  HLF  C      W  L Q     +  P  ++ ++S++  G   +K   ++WQ    A LW +W ERNAR+
Subjt:  ASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]8.5e-19835.06Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE-------
        M  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+            SFSV         
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE-------

Query:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL
                                                                T +MR F+ FI    L D PL N  FTWS+ + +P    +DR+L
Subjt:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL

Query:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR
         S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F ++   WW     +GW GH F+++L+ +K +LK+WN + FG+  +K+
Subjt:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR

Query:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS
                                      K +L  +   EEI WRQK+K+KW  EGD N+ F+H++    R +  I E+ +  G  L + E I +E + 
Subjt:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS

Query:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI
        +++KL+T            DWSPIS E    LE+PFTE EI KA+  L  +K PGPDGFT   F++ W+++K D+  VF +F +SGIIN S N ++I LI
Subjt:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI

Query:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA
        PKK  +K + D+RPISL T LYKIIA+VLS RL+ VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD++L+ 
Subjt:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA

Query:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ
        KGF  RWRKW+  C+SSV+++I++NG  +G +KASRGLRQ   +        +  LSRML+ AE+   +EGF VG N+    ++HLQFADD I FS+ ++
Subjt:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ

Query:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------
        + +  L +++  F  I  L +N NKS   GI L Q  +  LA     K   WP++YLGLPL GNPK   FW P++E+I  RL  W    +S G       
Subjt:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------

Query:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN
                                                            P   GGL + +I ++N +LL KW+WR+ +E  ALW ++I + YG   N
Subjt:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN

Query:  PLKLGNHSISSSKGPWKAIHSL-QHF--IYDNIDSRVGKEAFIADLW--------------------NPSTGS---------WNLHLRRDLNENEILEWA
                  S + PWKAI  + Q F  I   +     +  F  DLW                    N S  S         WNL+ RR+L+++EI +  
Subjt:  PLKLGNHSISSSKGPWKAIHSL-QHF--IYDNIDSRVGKEAFIADLW--------------------NPSTGS---------WNLHLRRDLNENEILEWA

Query:  ILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKA
         L   L +  L     D+  W L +   F+  S    L+ SS ++       +W   +P KVK F+W ++H  +NT D++Q R P  +LSP  C++C K 
Subjt:  ILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKA

Query:  SKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
         ++  HLF  C      W  L Q     +  P  IY ++S+   G   +K   ++WQ    A +  +W ERNAR+
Subjt:  SKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.6e-20735.85Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNESSFSVLE-----------------
        M  ISWN RGLGS KKR +VKDFL S  P +V+ QETK  + DR+ + S+W++RN  WA+L A G SGGI IIW+    S  E                 
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNESSFSVLE-----------------

Query:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL
                                                                T +M+ F+ FI    L D+PL +  FTWS+ + NP    +DR+L
Subjt:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL

Query:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR
         S+     F  +    L R  SDH+PI L     +WGPTPFRF N WL H +F ++   WWR     GW GH F++KL+ +K +LK WN   FG+ S ++
Subjt:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR

Query:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS
          +   L+  D  E+ G +S   +++R   K +L  +   EEI WRQK+++KW  EGD N+ FFH++    R +  I E+ + NG  + + E I++E + 
Subjt:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS

Query:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI
        +++KL+T  +         DWSPIS E    LE+PFTE EI KA+  +  +K PGPDGFT   F+  W ++K D+  VF +F +SGIIN S N ++I L+
Subjt:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI

Query:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA
        PKK  ++ + D+RPISL T LYKIIA+VL+ R+++VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD+V++ 
Subjt:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA

Query:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ
        KGFG RWRKW+R C+SSV++++++NG  +G +KASRGLRQ   +        +  LSRMLL AE+   +EGF VG N+    ++HLQFADDTI FSS ++
Subjt:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ

Query:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------
        + +  L NV+  F  IS L +N +KS   GI L Q  +  LA     K   WP++YLGLPL GNPK   FW P+IE+I +RL  W    +S G       
Subjt:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------

Query:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN
                                                            P   GGL    I  +N +LL KW+WR+ +E  ALW ++I + YG   N
Subjt:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN

Query:  PLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW-----------------------------NPSTGSWNLHLRRDLNENEI
           + N    S + PWKAI            F+  N D    +  F  DLW                             +    SWN   RR+L+++EI
Subjt:  PLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW-----------------------------NPSTGSWNLHLRRDLNENEI

Query:  --LEWAILSHHLTNFSLKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCV
          LE  + S    + S     D  SW L +   FT  S    L+  S +        +W   +P KVK F+W ++H  +NT D++Q R P  +LSP  C 
Subjt:  --LEWAILSHHLTNFSLKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCV

Query:  MCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
        +C K  +T  HLF  C      W  L Q+    +  P  I  +L+ +  G  F+K   ++WQ    A +W +W ERNAR+
Subjt:  MCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

RVX23556.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.9e-19936.3Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTS---------------------GGIAIIW-NESS
        M  ISWNVRGLGS  KR ++KDFL    P +V++QETK  K DR+L+ S+W+ RN  W  L A G S                     G    +W     
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTS---------------------GGIAIIW-NESS

Query:  FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP
        F+V+            T +MR F+ FI  + L D PL N  FTWS+ + +P    +DR+L S+     F   +   L R  SDH+PI L      WGPTP
Subjt:  FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP

Query:  FRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAAT
        FRF N WL H +F ++  +WWR     GW GH F+++L+ +K + K+WN   FG  ++K+ S+ ++L  +D  E++G ++   + +R   K +L ++   
Subjt:  FRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAAT

Query:  EEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTK-LNSSPPLPTIDDWSPISSEQQAVLEAPFTEA
        EEI WRQK+++KW  EGD N+ FFH++    R +  I  + +  G  L +   I +E + +++KL+   +  S  +  + DWSPIS E    L APFTE 
Subjt:  EEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTK-LNSSPPLPTIDDWSPISSEQQAVLEAPFTEA

Query:  EIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPD
        EI KA+  +  +K PGPDGFT   F+  W+++K D+  VF +F +SG+IN S N ++I L+PKK   K + D+RPISL T LYKIIA+VLS RL+ VL +
Subjt:  EIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPD

Query:  TITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLR
        TI   Q  FV  +QIMDA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD VL+ KGF  +WRKW+  C+SSV+Y++++NG  +G +KASRGLR
Subjt:  TITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLR

Query:  QSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQID
        Q   +        +  LSRML+ AE+   +EGF VG N+    ++HLQFADDTI FS+ +++ +  L +++  F  IS L +N +KS   GI L    + 
Subjt:  QSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQID

Query:  LLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------------------------------------------
         LA   G K   WP++YLGLPL GNP+   FW P+IE+I +RL  W    +S G                                              
Subjt:  LLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------------------------------------------

Query:  -------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDS
                     P + GGL   +I  +N +LL KW+WR+ +E  ALW ++I + YG   N          S + PWKAI        S   F+  N + 
Subjt:  -------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDS

Query:  RVGKEAFIADL--WNPSTGSWNLHLRRDLNENEILEWAILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKV
           +  F  DL  W+ S   WNL+ RR+L+++EI +   L   L    L     D+ SW L +   F+  S    L+             +W   +P KV
Subjt:  RVGKEAFIADL--WNPSTGSWNLHLRRDLNENEILEWAILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKV

Query:  KFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYA
        + FIW ++H  +NT D++Q R P  +LSP  C++C K  ++  H+F  C      W  L Q     +  P  I  ++ +   G   +K    +WQ    A
Subjt:  KFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYA

Query:  FLWNLWLERNARV
         +  +W ERNAR+
Subjt:  FLWNLWLERNARV

TrEMBL top hitse value%identityAlignment
A0A438BYX6 Transposon TX1 uncharacterized 149 kDa protein4.1e-19835.06Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE-------
        M  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+            SFSV         
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE-------

Query:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL
                                                                T +MR F+ FI    L D PL N  FTWS+ + +P    +DR+L
Subjt:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL

Query:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR
         S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F ++   WW     +GW GH F+++L+ +K +LK+WN + FG+  +K+
Subjt:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR

Query:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS
                                      K +L  +   EEI WRQK+K+KW  EGD N+ F+H++    R +  I E+ +  G  L + E I +E + 
Subjt:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS

Query:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI
        +++KL+T            DWSPIS E    LE+PFTE EI KA+  L  +K PGPDGFT   F++ W+++K D+  VF +F +SGIIN S N ++I LI
Subjt:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI

Query:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA
        PKK  +K + D+RPISL T LYKIIA+VLS RL+ VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD++L+ 
Subjt:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA

Query:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ
        KGF  RWRKW+  C+SSV+++I++NG  +G +KASRGLRQ   +        +  LSRML+ AE+   +EGF VG N+    ++HLQFADD I FS+ ++
Subjt:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ

Query:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------
        + +  L +++  F  I  L +N NKS   GI L Q  +  LA     K   WP++YLGLPL GNPK   FW P++E+I  RL  W    +S G       
Subjt:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------

Query:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN
                                                            P   GGL + +I ++N +LL KW+WR+ +E  ALW ++I + YG   N
Subjt:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN

Query:  PLKLGNHSISSSKGPWKAIHSL-QHF--IYDNIDSRVGKEAFIADLW--------------------NPSTGS---------WNLHLRRDLNENEILEWA
                  S + PWKAI  + Q F  I   +     +  F  DLW                    N S  S         WNL+ RR+L+++EI +  
Subjt:  PLKLGNHSISSSKGPWKAIHSL-QHF--IYDNIDSRVGKEAFIADLW--------------------NPSTGS---------WNLHLRRDLNENEILEWA

Query:  ILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKA
         L   L +  L     D+  W L +   F+  S    L+ SS ++       +W   +P KVK F+W ++H  +NT D++Q R P  +LSP  C++C K 
Subjt:  ILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKA

Query:  SKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
         ++  HLF  C      W  L Q     +  P  IY ++S+   G   +K   ++WQ    A +  +W ERNAR+
Subjt:  SKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein1.3e-20735.85Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNESSFSVLE-----------------
        M  ISWN RGLGS KKR +VKDFL S  P +V+ QETK  + DR+ + S+W++RN  WA+L A G SGGI IIW+    S  E                 
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNESSFSVLE-----------------

Query:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL
                                                                T +M+ F+ FI    L D+PL +  FTWS+ + NP    +DR+L
Subjt:  --------------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYL

Query:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR
         S+     F  +    L R  SDH+PI L     +WGPTPFRF N WL H +F ++   WWR     GW GH F++KL+ +K +LK WN   FG+ S ++
Subjt:  LSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKR

Query:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS
          +   L+  D  E+ G +S   +++R   K +L  +   EEI WRQK+++KW  EGD N+ FFH++    R +  I E+ + NG  + + E I++E + 
Subjt:  NSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFIS

Query:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI
        +++KL+T  +         DWSPIS E    LE+PFTE EI KA+  +  +K PGPDGFT   F+  W ++K D+  VF +F +SGIIN S N ++I L+
Subjt:  FYQKLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLI

Query:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA
        PKK  ++ + D+RPISL T LYKIIA+VL+ R+++VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD+V++ 
Subjt:  PKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQA

Query:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ
        KGFG RWRKW+R C+SSV++++++NG  +G +KASRGLRQ   +        +  LSRMLL AE+   +EGF VG N+    ++HLQFADDTI FSS ++
Subjt:  KGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQ

Query:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------
        + +  L NV+  F  IS L +N +KS   GI L Q  +  LA     K   WP++YLGLPL GNPK   FW P+IE+I +RL  W    +S G       
Subjt:  DLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA------

Query:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN
                                                            P   GGL    I  +N +LL KW+WR+ +E  ALW ++I + YG   N
Subjt:  ----------------------------------------------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARN

Query:  PLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW-----------------------------NPSTGSWNLHLRRDLNENEI
           + N    S + PWKAI            F+  N D    +  F  DLW                             +    SWN   RR+L+++EI
Subjt:  PLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW-----------------------------NPSTGSWNLHLRRDLNENEI

Query:  --LEWAILSHHLTNFSLKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCV
          LE  + S    + S     D  SW L +   FT  S    L+  S +        +W   +P KVK F+W ++H  +NT D++Q R P  +LSP  C 
Subjt:  --LEWAILSHHLTNFSLKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCV

Query:  MCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
        +C K  +T  HLF  C      W  L Q+    +  P  I  +L+ +  G  F+K   ++WQ    A +W +W ERNAR+
Subjt:  MCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

A0A438KQT0 Transposon TX1 uncharacterized 149 kDa protein2.8e-19936.3Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTS---------------------GGIAIIW-NESS
        M  ISWNVRGLGS  KR ++KDFL    P +V++QETK  K DR+L+ S+W+ RN  W  L A G S                     G    +W     
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTS---------------------GGIAIIW-NESS

Query:  FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP
        F+V+            T +MR F+ FI  + L D PL N  FTWS+ + +P    +DR+L S+     F   +   L R  SDH+PI L      WGPTP
Subjt:  FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP

Query:  FRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAAT
        FRF N WL H +F ++  +WWR     GW GH F+++L+ +K + K+WN   FG  ++K+ S+ ++L  +D  E++G ++   + +R   K +L ++   
Subjt:  FRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAAT

Query:  EEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTK-LNSSPPLPTIDDWSPISSEQQAVLEAPFTEA
        EEI WRQK+++KW  EGD N+ FFH++    R +  I  + +  G  L +   I +E + +++KL+   +  S  +  + DWSPIS E    L APFTE 
Subjt:  EEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTK-LNSSPPLPTIDDWSPISSEQQAVLEAPFTEA

Query:  EIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPD
        EI KA+  +  +K PGPDGFT   F+  W+++K D+  VF +F +SG+IN S N ++I L+PKK   K + D+RPISL T LYKIIA+VLS RL+ VL +
Subjt:  EIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPD

Query:  TITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLR
        TI   Q  FV  +QIMDA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD VL+ KGF  +WRKW+  C+SSV+Y++++NG  +G +KASRGLR
Subjt:  TITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLR

Query:  QSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQID
        Q   +        +  LSRML+ AE+   +EGF VG N+    ++HLQFADDTI FS+ +++ +  L +++  F  IS L +N +KS   GI L    + 
Subjt:  QSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQID

Query:  LLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------------------------------------------
         LA   G K   WP++YLGLPL GNP+   FW P+IE+I +RL  W    +S G                                              
Subjt:  LLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------------------------------------------

Query:  -------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDS
                     P + GGL   +I  +N +LL KW+WR+ +E  ALW ++I + YG   N          S + PWKAI        S   F+  N + 
Subjt:  -------------PTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLKLGNHSISSSKGPWKAI-------HSLQHFIYDNIDS

Query:  RVGKEAFIADL--WNPSTGSWNLHLRRDLNENEILEWAILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKV
           +  F  DL  W+ S   WNL+ RR+L+++EI +   L   L    L     D+ SW L +   F+  S    L+             +W   +P KV
Subjt:  RVGKEAFIADL--WNPSTGSWNLHLRRDLNENEILEWAILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKV

Query:  KFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYA
        + FIW ++H  +NT D++Q R P  +LSP  C++C K  ++  H+F  C      W  L Q     +  P  I  ++ +   G   +K    +WQ    A
Subjt:  KFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYA

Query:  FLWNLWLERNARV
         +  +W ERNAR+
Subjt:  FLWNLWLERNARV

A5BPI6 Uncharacterized protein4.8e-19935.12Show/hide
Query:  ISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE----------
        ISWN RGLGS KKR +VKDFL S  P +V++QETK A+ DR+ + S+W++RN  WA L A G SGGI +IW+            SFSV            
Subjt:  ISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNE----------SSFSVLE----------

Query:  -----------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSD
                                                             T +M+  + FI    L D PL +  FTWS+ + +P    +DR+L S+
Subjt:  -----------------------------------------------------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSD

Query:  NIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSL
             F  +    L R  SDH+PI L     +WGPTPFRF N WL H +F +    WWR     GW GH F++KL+ LK +LK+WN N FG   +++  +
Subjt:  NIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSL

Query:  NQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQ
          ++   D  E+ G +S   + +R   K +L  +   EEI WRQK+++KW  EGD N+  FH++    R +  I  + +  G  L + + I++E + +++
Subjt:  NQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQ

Query:  KLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKK
        KL+   +         DWSPIS E  + LE+PFTE EIYKA+  +  +  PGPDGFT   F+  W+++K D+  VF++F +SGIIN S N ++I L+PKK
Subjt:  KLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKK

Query:  IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGF
          AK + +YRPISL T LYKIIA+VL+ RL+ +L +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V WDFLD+V++ KGF
Subjt:  IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGF

Query:  GQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLI
            RKWIR C+SSV+++I++NG  +G +K  RGLRQ   +        +   S MLL AE+    EGF VG N+    ++HLQFADDTI FSS +++ +
Subjt:  GQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLI

Query:  TNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------
          L +V+  F  IS L +N +KS   GI L Q  +  LA     K   WP++YLGLPL GNPK  SFW P+IE+I  RL  W    +S G          
Subjt:  TNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGA---------

Query:  -------------PTD------------------------------------EGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLK
                     P                                      +GGL +  I  +N++LL KW+WR+ +E  ALW ++I + YG   N   
Subjt:  -------------PTD------------------------------------EGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNPLK

Query:  LGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW--NPSTG---------------------------SWNLHLRRDLNENEILEW
               S + PWKAI            FI  + D    +  F  DLW  + S G                           SWN + RR+L+++EI + 
Subjt:  LGNHSISSSKGPWKAI-------HSLQHFIYDNIDSRVGKEAFIADLW--NPSTG---------------------------SWNLHLRRDLNENEILEW

Query:  AILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHK
          L   L +  L     D  SW L +   FT  S    L+  S          +W   +P K+KFF+W ++H  +NT D++Q R P  +LSP  C++C +
Subjt:  AILSHHLTNFSLKDE-EDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHK

Query:  ASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
          +T  HLF  C      W  L Q     +  P  ++ ++S++  G   +K   ++WQ    A LW +W ERNAR+
Subjt:  ASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

A5C3T9 Uncharacterized protein2.0e-19737.72Show/hide
Query:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGT---------------SGGI---------AIIWNE
        M  ISWN+RGLGS  KR +VKDFL   NP +V+ QETK    DR+ + S+WS RN  WA L A G                 GG+            W E
Subjt:  MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGT---------------SGGI---------AIIWNE

Query:  SS---------------FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISD
         S               F+V+            T +MR F+ FI  + L D PL N  FTWS+ + +P    +DR+L S+   + F  +    L R  SD
Subjt:  SS---------------FSVLE-----------TRAMRRFNRFIETAALKDIPLTNGKFTWSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISD

Query:  HFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLD
        H+PI L     +WGPTPFRF N WL H  F +S  SWWR     GW GH F++KL+ +K +LK WN N FG   +++ S++ E+  ID  E+ G +S   
Subjt:  HFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLD

Query:  ISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTKLNSSPPLPTID----
        ++ R   K +L  +   EEI W+QK+K+KW  EGD N+  FH++    R K+ I  + +  G  + S E I +E + +++KL+    S PP  +      
Subjt:  ISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLFTKLNSSPPLPTID----

Query:  DWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTT
        DWSPIS E  + L++PF EAEI+ A+  L  +K PGPDGFT   F+  W+++K D+  VF +F  SGIIN + N ++I L+PKK  +K + D+RPISL T
Subjt:  DWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTT

Query:  CLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVN
        CLYKIIA+VLS RL+ VL +TI   Q  FV  +QI+DA LIANE++DE  R   +GV+ K+D EKA+D V+WDFLD+VL+ KGF  RWR W+R C+SSV+
Subjt:  CLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVN

Query:  YSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRL
        Y+I++NG  +G +KA RGLRQ   +        +  LSRMLL  E+   +EGF VG N+    ++HLQFADDTILF+SP++D +  L +++  F  IS L
Subjt:  YSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRL

Query:  NINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIW
         +N +KS   GI L Q  +  LAS    K   WP++YLGLPL GNP    FW P+IE+I +RL  W    +S G     G +++  +H    S +  +  
Subjt:  NINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIW

Query:  RFYQEPEALWRKIIATKYGVARNPLKLGN-HSISSSKGPWKAIHSLQHFIYDNIDSRVGKEAFIADLWNPSTG-SWNLHLRRDLNENEILEWAILSHHLT
          ++ P ++  KI   +     + +  G+   I   +  W     L+          + K   I+ +   S   SWN + RR+L ++EI +   L H L 
Subjt:  RFYQEPEALWRKIIATKYGVARNPLKLGN-HSISSSKGPWKAIHSLQHFIYDNIDSRVGKEAFIADLWNPSTG-SWNLHLRRDLNENEILEWAILSHHLT

Query:  NFSLK-DEEDSWSWQLGNIDSFTTGSL---TKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQ
          +L     D+ SW L +   FT  S      ++  SSP   T+    +WK  +  KVK  IW ++H  +NT D++Q R P  ++SP+ C +C +  ++ 
Subjt:  NFSLK-DEEDSWSWQLGNIDSFTTGSL---TKKLASSSPTTGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQ

Query:  AHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV
         HLF  C      W  L Q     +  P  I  ++ ++  G   +K   ++WQ    A +W +W ERNAR+
Subjt:  AHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKLIWQIFLYAFLWNLWLERNARV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-3326.59Show/hide
Query:  QQSDKRNSLNQELMIIDKKEENGLISDLDISRR---TEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTF--FHRLMANNRRKSSINEILSANGNSLT
        Q+  K ++L  +L  ++K+E+    +    SRR   T+I+A+L  I  T++ + +      WF E  +N       RL+   R K+ I+ I +  G+  T
Subjt:  QQSDKRNSLNQELMIIDKKEENGLISDLDISRR---TEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTF--FHRLMANNRRKSSINEILSANGNSLT

Query:  SDEDIEKEFISFYQKLF-TKLNSSPPLPTIDD---WSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKS
           +I+     +Y+ L+  KL +   + T  D      ++ E+   L  P T +EI   +  L + K+PGPDGFTAEF+++    L   +  +F    K 
Subjt:  SDEDIEKEFISFYQKLF-TKLNSSPPLPTIDD---WSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKS

Query:  GIINASLNETYICLIPKK-IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKLDIE
        GI+  S  E  I LIPK         ++RPISL     KI+ ++L++R+++ +   I   Q  F+   Q       +  +I   NR +++  VII +D E
Subjt:  GIINASLNETYICLIPKK-IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKLDIE

Query:  KAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLL-----SAEQNGSIEGFWVGINQ
        KAFD +   F+   L   G    + K IR        +II+NG+         G RQ   +        SP L  ++L     +  Q   I+G  +G  +
Subjt:  KAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLL-----SAEQNGSIEGFWVGINQ

Query:  QPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRI--SFWTPMIE
          +++    FADD I++         NL  +I NF  +S   IN  KS+       +Q    +       + S  + YLG+ L  + K +    + P+++
Subjt:  QPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGLTQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRI--SFWTPMIE

Query:  KIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIWRFYQEP
        +I++       TN  K  P    G     I+    ++L K I+RF   P
Subjt:  KIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIWRFYQEP

P08548 LINE-1 reverse transcriptase homolog9.2e-3026.83Show/hide
Query:  QSDKRNSLNQELMIIDKKEENGLISDLDISRR---TEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSA--NGNS--L
        + ++ N+L   L  ++K+E     S+   SRR   T+I+A+L N    + II +      WF E         + +AN  RK  +  ++S+  NGN    
Subjt:  QSDKRNSLNQELMIIDKKEENGLISDLDISRR---TEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSA--NGNS--L

Query:  TSDEDIEKEFISFYQKLFTKLNSSPPLPTIDDW------SPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDF
        T   +I+K    +Y+KL++  +    L  ID +        +S ++  +L  P + +EI   + +L   K+PGPDGFT+EF++     L   +  +F + 
Subjt:  TSDEDIEKEFISFYQKLFTKLNSSPPLPTIDDW------SPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDF

Query:  FKSGIINASLNETYICLIPKK-IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKL
         K GI+  +  E  I LIPK         +YRPISL     KI+ ++L++R+++ +   I   Q  F+   Q       +  +I   N+ +N+  +I+ +
Subjt:  FKSGIINASLNETYICLIPKK-IGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKL

Query:  DIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRML-----LSAEQNGSIEGFWVG
        D EKAFD +   F+   L+  G    + K I    S    +II+NG          G RQ   +        SP L  ++     ++  +  +I+G  +G
Subjt:  DIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRML-----LSAEQNGSIEGFWVG

Query:  INQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS
          +  +++    FADD I++    +D  T L  VI  +  +S   IN +KS
Subjt:  INQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS

P11369 LINE-1 retrotransposable element ORF2 protein2.2e-3128.8Show/hide
Query:  NSLNQELMIIDKKEENGLISDLDISRRTEI--KADLINIAATEEIIWRQKSKLKWFLEGDVNTTF--FHRLMANNRRKSSINEILSANGNSLTSDEDIEK
        +SL   L  ++KKE N        SRR EI      IN   T   I R      WF E  +N       RL   +R K  IN+I +  G+  T  E+I+ 
Subjt:  NSLNQELMIIDKKEENGLISDLDISRRTEI--KADLINIAATEEIIWRQKSKLKWFLEGDVNTTF--FHRLMANNRRKSSINEILSANGNSLTSDEDIEK

Query:  EFISFYQKLF-TKLNSSPPLPTIDD---WSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASL
           SFY++L+ TKL +   +    D      ++ +Q   L +P +  EI   +  L + K+PGPDGF+AEF++     L   +  +F+     G +  S 
Subjt:  EFISFYQKLF-TKLNSSPPLPTIDD---WSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASL

Query:  NETYICLIPK-KIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKLDIEKAFDTVD
         E  I LIPK +     + ++RPISL     KI+ ++L++R+++ +   I P Q  F+   Q       +  +I   N+ +++  +II LD EKAFD + 
Subjt:  NETYICLIPK-KIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNR-RNRKGVIIKLDIEKAFDTVD

Query:  WDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLL-----SAEQNGSIEGFWVGINQQPINITH
          F+  VL+  G    +   I+   S    +I +NG+    I    G RQ   +        SP L  ++L     +  Q   I+G  +G  +  I++  
Subjt:  WDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLL-----SAEQNGSIEGFWVGINQQPINITH

Query:  LQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS
           ADD I++ S  ++    L N+I++F  +    IN NKS
Subjt:  LQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS

P14381 Transposon TX1 uncharacterized 149 kDa protein3.3e-4325.38Show/hide
Query:  SLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP--FRFINAWLTHRTFLQSVESWWR--------TNSLLGWPGHGFIQKLKGL
        S S IDR  +S ++  +  S+++R    P SDH  + L +      P    + F N+ L    F +SV   WR          +L  W   G +     L
Subjt:  SLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTP--FRFINAWLTHRTFLQSVESWWR--------TNSLLGWPGHGFIQKLKGL

Query:  KKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEIL
        K   +++  +V GQ++ +  +LN E++ ++++        L      E K  L N+   +      +S+++   + D  + FF+ L      +  I  + 
Subjt:  KKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEIL

Query:  SANGNSLTSDEDIEKEFISFYQKLFTKLNSSP-PLPTIDDWSPISSEQQAV-LEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVF
        + +G  L   E I     SFYQ LF+    SP     + D  P+ SE++   LE P T  E+ +A+  +  NK+PG DG T EFF+  W+ L  D   V 
Subjt:  SANGNSLTSDEDIEKEFISFYQKLFTKLNSSP-PLPTIDDWSPISSEQQAV-LEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVF

Query:  NDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIK
         + FK G +  S     + L+PKK   + + ++RP+SL +  YKI+A+ +S RLK VL + I P QS  V  + I D   +  +L+    R       + 
Subjt:  NDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIK

Query:  LDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRM--LLSAEQNGSIEGFWVGIN
        LD EKAFD VD  +L   LQA  FG ++  +++   +S    + IN      +   RG+RQ   +    G+  S  +     LL     G +      + 
Subjt:  LDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRM--LLSAEQNGSIEGFWVGIN

Query:  QQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS---------------------------EFLGIGLTQQQIDL------LASG
        +  + +    +ADD IL +    DL          + A S   IN +KS                           ++LG+ L+ ++  +      L   
Subjt:  QQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKS---------------------------EFLGIGLTQQQIDL------LASG

Query:  FGGKVGSWPMIYLGLPLNGNPKRIS------FW---------TPMIEKIQKRLI-------NWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIWRF-
           ++G W      L + G    I+       W            I KIQ+RL+       +W S  +S   P  EGG  +  I  +  +   + I R+ 
Subjt:  FGGKVGSWPMIYLGLPLNGNPKRIS------FW---------TPMIEKIQKRLI-------NWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIWRF-

Query:  YQEPEALWRKIIATKYGVARN
        Y +P   W  + ++ Y   RN
Subjt:  YQEPEALWRKIIATKYGVARN

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.9e-1122.46Show/hide
Query:  SSEQQAVLEAPFTEAEIYKAVTD-------LGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISL
        SS    V++   +   ++ A+T+       +  + +PGPDG T    K +  +    +  + N     G +  S+       IPK + AK   D+RPIS+
Subjt:  SSEQQAVLEAPFTEAEIYKAVTD-------LGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISL

Query:  TTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISS
         + L + +  +L+ RL   +     P Q  F+      D + I + ++   ++  R   I  LD+ KAFD++    + + L+A G  + +  ++++    
Subjt:  TTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISS

Query:  VNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAIS
           S+  +G    +   +RG++Q        G+  SP L  +++              +     N     FADD +LF+  +  L   L   + +F +I 
Subjt:  VNYSIIINGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAIS

Query:  RLNINRNKSEFLGI-GLTQQQIDLL
         L +N +K   +GI G  +Q+  +L
Subjt:  RLNINRNKSEFLGI-GLTQQQIDLL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.6e-3529.2Show/hide
Query:  FSVLET----RAMRRFNRFIETAALKDIPLTNGKFTWSSFR-PNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTL-GKERWGPTPFRFIN
        +SVL+T    R +  F   +  + L DIP     +TWS+ +  NP +  +DR + + +    F SA        +SDH P  + L    +     FR+ +
Subjt:  FSVLET----RAMRRFNRFIETAALKDIPLTNGKFTWSSFR-PNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTL-GKERWGPTPFRFIN

Query:  AWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTE--IKADLINIAATEEI
           TH TFL S+   W     +G       + LK  KK  K  N   FG    K     + L  ++  +   L +  D   R E   +      AA  E 
Subjt:  AWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVFGQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTE--IKADLINIAATEEI

Query:  IWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLF---TKLNSSPPLPTIDDWSPISSEQQAV--LEAPFTE
         +RQKS++KW  +GD NT FFH+++  N+ K+ I  +   +   + +   +++  +++Y  L    + + +   +  I D  P          L A  ++
Subjt:  IWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQKLF---TKLNSSPPLPTIDDWSPISSEQQAV--LEAPFTE

Query:  AEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKII
         EI  AV  +  NK PGPD FTAEFF +SW ++K+       +FF++G +    N T I LIPK  G   +  +RP+S  T +YKII
Subjt:  AEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYRPISLTTCLYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.8e-0833.33Show/hide
Query:  DRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGV----IIKLDIEKAFDTVDWDFLDNVLQAKGFGQRW
        +RLK ++ + I P Q++F+  +   D  +   E +   + R +KGV    ++KLD+EKA+D + WD+L++ L + GF + W
Subjt:  DRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGV----IIKLDIEKAFDTVDWDFLDNVLQAKGFGQRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTTATCTCGTGGAATGTTAGAGGATTGGGATCATGGAAGAAGAGAGCTCTTGTCAAAGATTTTCTCTCCTCTCATAATCCATCATTGGTGATTCTTCAAGAGAC
CAAGCTGGCCAAGATTGATAGGAAGTTGATTAAATCTATATGGAGTTCCAGAAACATTGCATGGGCTTCTCTTGATGCTGAGGGAACATCAGGTGGGATAGCCATTATCT
GGAATGAATCATCCTTCTCGGTGCTTGAGACCCGAGCCATGAGGAGATTTAATCGCTTCATTGAAACAGCAGCATTAAAGGACATCCCCCTCACTAATGGTAAATTCACA
TGGTCAAGCTTCAGACCGAACCCTTCCCTGTCCCTTATCGACAGATACTTGCTCTCAGATAACATTCCTGTTAAATTCGGATCAGCCTCAGTTCGTAAGCTTGAAAGACC
CATATCTGACCACTTCCCTATATGTCTCACTTTGGGAAAGGAACGATGGGGACCAACTCCATTCAGATTCATCAATGCTTGGCTTACCCATAGAACCTTTCTACAATCGG
TAGAATCTTGGTGGCGGACAAACTCCTTGTTAGGGTGGCCTGGACACGGTTTCATACAAAAGTTAAAAGGCCTAAAAAAAGAACTGAAACAGTGGAACCACAATGTTTTT
GGTCAGCAATCTGATAAGAGGAACAGCCTTAATCAGGAGCTAATGATCATAGACAAGAAAGAGGAAAATGGCCTTATATCTGATTTAGACATCTCCAGAAGGACAGAGAT
AAAGGCTGACTTGATCAACATAGCAGCAACCGAGGAGATTATCTGGCGCCAAAAAAGCAAATTAAAATGGTTCTTGGAAGGGGATGTCAACACAACTTTTTTCCACCGAT
TAATGGCTAACAACAGAAGAAAAAGTTCCATTAATGAAATTTTATCAGCCAATGGAAACAGCCTCACTTCAGATGAAGATATTGAAAAAGAATTTATATCTTTCTATCAG
AAGCTCTTTACAAAACTGAACAGTAGTCCTCCCTTGCCTACCATTGATGACTGGAGTCCTATATCATCTGAACAACAAGCAGTTTTAGAAGCCCCTTTCACTGAAGCAGA
AATTTATAAGGCAGTAACAGATTTAGGCTCGAACAAGACCCCGGGTCCGGACGGTTTCACGGCCGAATTCTTTAAAAAATCATGGAACATCCTCAAGAATGACATAAAGG
GGGTGTTCAATGATTTTTTTAAGAGTGGTATTATTAATGCAAGCCTCAATGAAACTTACATTTGCCTAATCCCGAAGAAGATCGGTGCCAAAGCAGTAGGTGACTATCGT
CCTATTAGTTTGACAACGTGCTTATACAAGATCATCGCCCGAGTCCTCTCTGATCGCCTTAAAAAAGTCCTCCCAGACACAATCACTCCCTACCAATCGACTTTTGTGGG
AAACAAACAGATCATGGATGCCTCCCTAATCGCCAATGAATTAATAGATGAATGGAATAGAAGAAATCGAAAGGGGGTGATCATCAAGCTTGACATTGAGAAAGCCTTTG
ACACGGTGGATTGGGATTTCCTTGACAATGTCCTTCAAGCCAAAGGCTTCGGCCAAAGGTGGAGGAAATGGATTAGGGACTGTATTTCATCGGTCAACTATTCCATCATT
ATTAATGGCAAGCCGAGAGGAAAAATCAAAGCATCCAGAGGTCTTAGACAATCAACCACTATGATAAATAAGAAGGGGGAGAGAGGGTCACCTTGTCTTAGTAGAATGCT
TCTATCGGCTGAACAGAATGGCTCCATCGAAGGTTTTTGGGTCGGCATTAATCAACAGCCCATAAATATCACTCATCTCCAATTCGCAGATGATACCATTTTATTTTCCT
CCCCAAAGCAGGATCTTATTACAAACCTCTTCAACGTTATACACAACTTTGAGGCGATATCGAGGCTGAATATAAACCGTAACAAGTCTGAATTTCTGGGCATAGGATTG
ACTCAACAACAGATTGACCTCTTGGCATCGGGTTTTGGTGGCAAGGTTGGCTCATGGCCGATGATCTACTTAGGCCTTCCTCTAAATGGCAACCCGAAAAGGATCAGTTT
TTGGACACCAATGATTGAGAAAATTCAAAAGAGATTGATCAATTGGGGATCCACCAATATCTCCAAAGGCGCTCCTACTGACGAAGGGGGATTAAGCATTTATGATATTC
ACAAGAAAAATACTTCTCTCTTGGCCAAATGGATTTGGAGATTTTACCAAGAACCGGAGGCTTTATGGAGGAAAATCATCGCTACAAAATATGGGGTGGCTCGAAATCCC
CTAAAATTGGGTAACCACTCCATTAGCTCCTCCAAAGGACCATGGAAAGCTATCCACAGCTTGCAACATTTTATCTATGATAACATTGATTCCAGAGTTGGGAAGGAAGC
ATTCATTGCTGATTTATGGAATCCTAGTACTGGGTCTTGGAACCTTCATTTGAGGCGAGATCTCAATGAGAATGAAATCCTTGAGTGGGCCATTTTATCTCATCATCTTA
CCAACTTCTCCCTCAAAGACGAGGAAGACTCATGGAGTTGGCAGCTCGGAAACATTGACAGCTTCACCACGGGGTCTCTCACGAAAAAATTGGCTTCCTCCTCTCCCACA
ACCGGAACAGAATTATATAGTGGACTGTGGAAAGGTCCTATGCCGAAAAAGGTTAAATTCTTCATATGGGAACTCAGTCATGCTAGCATCAACACCGCAGACATCATTCA
GAAAAGATATCCTGGAAATAGTCTTTCTCCTAGCTGCTGTGTCATGTGCCACAAAGCTAGCAAGACCCAAGCTCATCTTTTCAGTAGCTGCGATTTTGCATCAGAATTTT
GGTCTCATCTCCAGCAAGCTTTTGGGTGGCTTTTTGCTCGTCCGGGTGACATCTACACCCTTCTCTCTTTATCTGTTTGTGGACCTCCATTTACAAAAGACAAAAAGTTG
ATATGGCAGATTTTTTTGTATGCTTTCCTATGGAATCTTTGGCTGGAAAGAAACGCTAGAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTTATCTCGTGGAATGTTAGAGGATTGGGATCATGGAAGAAGAGAGCTCTTGTCAAAGATTTTCTCTCCTCTCATAATCCATCATTGGTGATTCTTCAAGAGAC
CAAGCTGGCCAAGATTGATAGGAAGTTGATTAAATCTATATGGAGTTCCAGAAACATTGCATGGGCTTCTCTTGATGCTGAGGGAACATCAGGTGGGATAGCCATTATCT
GGAATGAATCATCCTTCTCGGTGCTTGAGACCCGAGCCATGAGGAGATTTAATCGCTTCATTGAAACAGCAGCATTAAAGGACATCCCCCTCACTAATGGTAAATTCACA
TGGTCAAGCTTCAGACCGAACCCTTCCCTGTCCCTTATCGACAGATACTTGCTCTCAGATAACATTCCTGTTAAATTCGGATCAGCCTCAGTTCGTAAGCTTGAAAGACC
CATATCTGACCACTTCCCTATATGTCTCACTTTGGGAAAGGAACGATGGGGACCAACTCCATTCAGATTCATCAATGCTTGGCTTACCCATAGAACCTTTCTACAATCGG
TAGAATCTTGGTGGCGGACAAACTCCTTGTTAGGGTGGCCTGGACACGGTTTCATACAAAAGTTAAAAGGCCTAAAAAAAGAACTGAAACAGTGGAACCACAATGTTTTT
GGTCAGCAATCTGATAAGAGGAACAGCCTTAATCAGGAGCTAATGATCATAGACAAGAAAGAGGAAAATGGCCTTATATCTGATTTAGACATCTCCAGAAGGACAGAGAT
AAAGGCTGACTTGATCAACATAGCAGCAACCGAGGAGATTATCTGGCGCCAAAAAAGCAAATTAAAATGGTTCTTGGAAGGGGATGTCAACACAACTTTTTTCCACCGAT
TAATGGCTAACAACAGAAGAAAAAGTTCCATTAATGAAATTTTATCAGCCAATGGAAACAGCCTCACTTCAGATGAAGATATTGAAAAAGAATTTATATCTTTCTATCAG
AAGCTCTTTACAAAACTGAACAGTAGTCCTCCCTTGCCTACCATTGATGACTGGAGTCCTATATCATCTGAACAACAAGCAGTTTTAGAAGCCCCTTTCACTGAAGCAGA
AATTTATAAGGCAGTAACAGATTTAGGCTCGAACAAGACCCCGGGTCCGGACGGTTTCACGGCCGAATTCTTTAAAAAATCATGGAACATCCTCAAGAATGACATAAAGG
GGGTGTTCAATGATTTTTTTAAGAGTGGTATTATTAATGCAAGCCTCAATGAAACTTACATTTGCCTAATCCCGAAGAAGATCGGTGCCAAAGCAGTAGGTGACTATCGT
CCTATTAGTTTGACAACGTGCTTATACAAGATCATCGCCCGAGTCCTCTCTGATCGCCTTAAAAAAGTCCTCCCAGACACAATCACTCCCTACCAATCGACTTTTGTGGG
AAACAAACAGATCATGGATGCCTCCCTAATCGCCAATGAATTAATAGATGAATGGAATAGAAGAAATCGAAAGGGGGTGATCATCAAGCTTGACATTGAGAAAGCCTTTG
ACACGGTGGATTGGGATTTCCTTGACAATGTCCTTCAAGCCAAAGGCTTCGGCCAAAGGTGGAGGAAATGGATTAGGGACTGTATTTCATCGGTCAACTATTCCATCATT
ATTAATGGCAAGCCGAGAGGAAAAATCAAAGCATCCAGAGGTCTTAGACAATCAACCACTATGATAAATAAGAAGGGGGAGAGAGGGTCACCTTGTCTTAGTAGAATGCT
TCTATCGGCTGAACAGAATGGCTCCATCGAAGGTTTTTGGGTCGGCATTAATCAACAGCCCATAAATATCACTCATCTCCAATTCGCAGATGATACCATTTTATTTTCCT
CCCCAAAGCAGGATCTTATTACAAACCTCTTCAACGTTATACACAACTTTGAGGCGATATCGAGGCTGAATATAAACCGTAACAAGTCTGAATTTCTGGGCATAGGATTG
ACTCAACAACAGATTGACCTCTTGGCATCGGGTTTTGGTGGCAAGGTTGGCTCATGGCCGATGATCTACTTAGGCCTTCCTCTAAATGGCAACCCGAAAAGGATCAGTTT
TTGGACACCAATGATTGAGAAAATTCAAAAGAGATTGATCAATTGGGGATCCACCAATATCTCCAAAGGCGCTCCTACTGACGAAGGGGGATTAAGCATTTATGATATTC
ACAAGAAAAATACTTCTCTCTTGGCCAAATGGATTTGGAGATTTTACCAAGAACCGGAGGCTTTATGGAGGAAAATCATCGCTACAAAATATGGGGTGGCTCGAAATCCC
CTAAAATTGGGTAACCACTCCATTAGCTCCTCCAAAGGACCATGGAAAGCTATCCACAGCTTGCAACATTTTATCTATGATAACATTGATTCCAGAGTTGGGAAGGAAGC
ATTCATTGCTGATTTATGGAATCCTAGTACTGGGTCTTGGAACCTTCATTTGAGGCGAGATCTCAATGAGAATGAAATCCTTGAGTGGGCCATTTTATCTCATCATCTTA
CCAACTTCTCCCTCAAAGACGAGGAAGACTCATGGAGTTGGCAGCTCGGAAACATTGACAGCTTCACCACGGGGTCTCTCACGAAAAAATTGGCTTCCTCCTCTCCCACA
ACCGGAACAGAATTATATAGTGGACTGTGGAAAGGTCCTATGCCGAAAAAGGTTAAATTCTTCATATGGGAACTCAGTCATGCTAGCATCAACACCGCAGACATCATTCA
GAAAAGATATCCTGGAAATAGTCTTTCTCCTAGCTGCTGTGTCATGTGCCACAAAGCTAGCAAGACCCAAGCTCATCTTTTCAGTAGCTGCGATTTTGCATCAGAATTTT
GGTCTCATCTCCAGCAAGCTTTTGGGTGGCTTTTTGCTCGTCCGGGTGACATCTACACCCTTCTCTCTTTATCTGTTTGTGGACCTCCATTTACAAAAGACAAAAAGTTG
ATATGGCAGATTTTTTTGTATGCTTTCCTATGGAATCTTTGGCTGGAAAGAAACGCTAGAGTCTGA
Protein sequenceShow/hide protein sequence
MIFISWNVRGLGSWKKRALVKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIAWASLDAEGTSGGIAIIWNESSFSVLETRAMRRFNRFIETAALKDIPLTNGKFT
WSSFRPNPSLSLIDRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQSVESWWRTNSLLGWPGHGFIQKLKGLKKELKQWNHNVF
GQQSDKRNSLNQELMIIDKKEENGLISDLDISRRTEIKADLINIAATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKSSINEILSANGNSLTSDEDIEKEFISFYQ
KLFTKLNSSPPLPTIDDWSPISSEQQAVLEAPFTEAEIYKAVTDLGSNKTPGPDGFTAEFFKKSWNILKNDIKGVFNDFFKSGIINASLNETYICLIPKKIGAKAVGDYR
PISLTTCLYKIIARVLSDRLKKVLPDTITPYQSTFVGNKQIMDASLIANELIDEWNRRNRKGVIIKLDIEKAFDTVDWDFLDNVLQAKGFGQRWRKWIRDCISSVNYSII
INGKPRGKIKASRGLRQSTTMINKKGERGSPCLSRMLLSAEQNGSIEGFWVGINQQPINITHLQFADDTILFSSPKQDLITNLFNVIHNFEAISRLNINRNKSEFLGIGL
TQQQIDLLASGFGGKVGSWPMIYLGLPLNGNPKRISFWTPMIEKIQKRLINWGSTNISKGAPTDEGGLSIYDIHKKNTSLLAKWIWRFYQEPEALWRKIIATKYGVARNP
LKLGNHSISSSKGPWKAIHSLQHFIYDNIDSRVGKEAFIADLWNPSTGSWNLHLRRDLNENEILEWAILSHHLTNFSLKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPT
TGTELYSGLWKGPMPKKVKFFIWELSHASINTADIIQKRYPGNSLSPSCCVMCHKASKTQAHLFSSCDFASEFWSHLQQAFGWLFARPGDIYTLLSLSVCGPPFTKDKKL
IWQIFLYAFLWNLWLERNARV