CuGenDBv2

Lag0006641 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006641
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:44453005..44456913
RNA-Seq Expression: Lag0006641
Synteny: Lag0006641
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 8.7e-173; % identity: 35.89)
Query:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
        L SWGP PF+  N  L  P F +N+  WW      GHPG+SF+R+LK L+  +++ ++ N     E K     +I ++D LE+ G L +    +R  LK+
Subjt:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS

Query:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
        D+      EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I  + S + +    +  + K  L HF  IY    E  P LI DN++W PI+  +A 
Subjt:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS

Query:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
         L   F+E+E++E + +  +NK+PGPDGFT+EFYK  W  LK  I+ +F DF    I+N+ VN T IALI KK    + + YRPISLTT++YK++AKV+A
Subjt:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA

Query:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
        ERLK  L  T++  Q AFV GRQI DAIL+ANEA+D W+  K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR 
Subjt:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD

Query:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
        +  PS                                          +THLLFADDILLF++DD+  I N   II  F+ ASGL INL+KS ++ INV  
Subjt:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN

Query:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
         R+ +IA++W    + LP +YLGVPLGG     TFW  + EKI +++  W+++MLSKGG++TL++S L ++P Y LS+FKAP  +            WK 
Subjt:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE

Query:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
          E                                                           +S G +         R+P        E F ++ +W+++
Subjt:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR

Query:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
        +G+S  FW   W     L S   RL+ LS  K   + D W+     W++ PRR L + E   WA   + L       G D   W  +  G++T  S +  
Subjt:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI

Query:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
        L+     +L    ++ F NLW+ SIPK                                     +R+ E   HLF+ C  A  +    +        C L
Subjt:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL

Query:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
        S   LC    ++K K++++++  N + +  W IW ERNAR+F G   ++ +IW+D  +LA LW+S S   S+   +S+
Subjt:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.3e-173; % identity: 36.43)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGP PFR ++  L +P F  N+ RWW  S   GHPGYSFI+RLKSLA  +K W+K   +SF   K  +  ++ ++D  E    L      +RLALK+DL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
        E++L E+++W QR KKLWL +GDEN+++FH++CTAR++RN I E+     L   ++  +    +  FS I+    + DP   +DN++W PI H +   L 
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI

Query:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
         PF E E+   I S+   K PGPDGF I F+K +W  LK  IM++F DF+ K ++N+N+N+TYIALIPKK +      +RPISLTT++YKI+AK L+ RL
Subjt:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL

Query:  KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
        K+ L +TIS  Q AFV  RQI+DAIL+ANEAVD WK  K +GF++KLDIEKAFD +NW FIDFVL KK FP  WR+WI  CIS+V+YSI +NGRP+    
Subjt:  KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---

Query:  ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
              QG P                                   +I+H+LFADDILLF++D+D +++N    +  FE+ASGL+INL KSA+  +NV   
Subjt:  ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ

Query:  RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
        R+ E A+ W    Q LP SYLGVPLGGNP    FW  + EKI+++++ W++A +SKGGRLTL++S L+++P Y LSVF+APSL+            WK +
Subjt:  RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES

Query:  --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
                                                                            S ISS   +AP        + F  N +W+L +
Subjt:  --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD

Query:  GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
        G  I FW+  W+    L +   RLF LS +K + V DAW+    +W I+ RR L DRE   WA     LP+P  + G     WIP  +  F+  SA+ ++
Subjt:  GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL

Query:  RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
           R    S G+    +   +W+++IP                                      ++ +ES  HLF+HC      W S+L+  F+LA   
Subjt:  RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL

Query:  QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
             LS D L  + F ++     + +++ + C    +A  W IW ERN R+F   S   +   +W++   L   W S      +   A++
Subjt:  QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 4.8e-171; % identity: 35.79)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGPCPFR +N  L +  F  N   WW+ S  +G PGY+FI+ L SL+K +K+W+    + +   K+ L  +I  +D LE  G +  ++ QKR++LKSDL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
         I   +A+ W QR ++ W   GDEN +YFH++CT  +R+N I  +      SL +  D+ +  ++HF +IY  E    +++DN+ W PI+ +  S L KP
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP

Query:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
        F E E+   I S  + KAPGPDG+T+ FYKK W  LK  ++ VF DF K  IVN NVN+T+IALI KK    K S YRPISLTT+LYKI+AK LA RLKS
Subjt:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS

Query:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
         L DTI+  Q AF+ GRQI+DAILIANEA+D WK  K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+      
Subjt:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----

Query:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
            QG P                                   +I+HLLFADD+L+F++D+++Y++N    +  FE+ASGL  N SKS ++ IN+   R+
Subjt:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS

Query:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
         +IA+ +    + LP +YLGVPLGGNP   +FWD  IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP                     
Subjt:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------

Query:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
                                                  + S + WK+                   +S  +S      + ++ +    +W   DG 
Subjt:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK

Query:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
        S+ FW  KW     L     RL+ LS  +S  V + W   S  WN+KPRR L +RE Q+W +    LPR   ++G     W PS    +T  SA++I  +
Subjt:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR

Query:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
            P  ++ E    +LW++ IP+                                     R S E ++HLF+ C +A  L   ++   G       ++ 
Subjt:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID

Query:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
         LC +      ++ ++I+  N  +A  W IW  RN  +F     S    W+D  +L   WSS SK   +   A++
Subjt:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.4e-170; % identity: 35.69)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGPCPFR +N  L +  F  N   WW+ S  +G PGY+FI+ L SL+K +K+W+    + +   K+ L  +I  +D LE  G +  ++ QKR++LKSDL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
         I   +A+ W QR ++ W   GDEN +YFH++CT  +R+N I  +      SL +  D+ +  ++HF +IY  E    +++DN+ W PI+ +  S L KP
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP

Query:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
        F E E+   I S  + KAPGPDG+T+ FYKK W  LK  ++ VF DF K  IVN NVN+T+IALI KK    K S YRPISLTT+LYKI+AK LA RLKS
Subjt:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS

Query:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
         L DTI+  Q AF+ GRQI+DAILIANE +D WK  K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+      
Subjt:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----

Query:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
            QG P                                   +I+HLLFADD+L+F++D+++Y++N    +  FE+ASGL  N SKS ++ IN+   R+
Subjt:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS

Query:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
         +IA+ +    + LP +YLGVPLGGNP   +FWD  IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP                     
Subjt:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------

Query:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
                                                  + S + WK+                   +S  +S      + ++ +    +W   DG 
Subjt:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK

Query:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
        S+ FW  KW     L     RL+ LS  +S  V + W   S  WN+KPRR L +RE Q+W +    LPR   ++G     W PS    +T  SA++I  +
Subjt:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR

Query:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
            P  ++ E    +LW++ IP+                                     R S E ++HLF+ C +A  L   ++   G       ++ 
Subjt:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID

Query:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
         LC +      ++ ++I+  N  +A  W IW  RN  +F     S    W+D  +L   WSS SK   +   A++
Subjt:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo] (e value: 2.5e-172; % identity: 35.79)
Query:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
        L SWGP PF+  N  L  P F +N+  WW      GHPG+SF+R+LK L+  +++ ++ N     E K     +I ++D LE+ G L +    +R  LK+
Subjt:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS

Query:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
        D+      EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I  + S + +    +  + K  L HF  IY    E  P LI DN++W PI+  +A 
Subjt:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS

Query:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
         L   F+E+E++E + +  +NK+PGPDGFT+EFYK  W  LK  I+ +F DF    I+N+ VN T IALI KK    + + YRPISLTT++YK++AKV+A
Subjt:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA

Query:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
        ERLK  L  T++  Q AFV GRQI DAIL+ANEA+D W+  K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR 
Subjt:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD

Query:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
        +  PS                                          +THLLFADDILLF++DD+  I N   II  F+ ASGL INL+KS ++ INV  
Subjt:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN

Query:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
         R+ +IA++W    + LP +YLGVPLGG     TFW  + EKI +++  W+++MLSKGG++TL++S L ++P Y LS+FK P  +            WK 
Subjt:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE

Query:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
          E                                                           +S G +         R+P        E F ++ +W+++
Subjt:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR

Query:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
        +G+S  FW   W     L S   RL+ LS  K   + D W+     W++ PRR L + E   WA   + L       G D   W  +  G++T  S +  
Subjt:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI

Query:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
        L+     +L    ++ F NLW+ SIPK                                     +R+ E   HLF+ C  A  +    +        C L
Subjt:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL

Query:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
        S   LC    ++K K++++++  N + +  W IW ERNAR+F G   ++ +IW+D  +LA LW+S S   S+   +S+
Subjt:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-172; % identity: 35.79)
Query:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
        L SWGP PF+  N  L  P F +N+  WW      GHPG+SF+R+LK L+  +++ ++ N     E K     +I ++D LE+ G L +    +R  LK+
Subjt:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS

Query:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
        D+      EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I  + S + +    +  + K  L HF  IY    E  P LI DN++W PI+  +A 
Subjt:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS

Query:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
         L   F+E+E++E + +  +NK+PGPDGFT+EFYK  W  LK  I+ +F DF    I+N+ VN T IALI KK    + + YRPISLTT++YK++AKV+A
Subjt:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA

Query:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
        ERLK  L  T++  Q AFV GRQI DAIL+ANEA+D W+  K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR 
Subjt:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD

Query:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
        +  PS                                          +THLLFADDILLF++DD+  I N   II  F+ ASGL INL+KS ++ INV  
Subjt:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN

Query:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
         R+ +IA++W    + LP +YLGVPLGG     TFW  + EKI +++  W+++MLSKGG++TL++S L ++P Y LS+FK P  +            WK 
Subjt:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE

Query:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
          E                                                           +S G +         R+P        E F ++ +W+++
Subjt:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR

Query:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
        +G+S  FW   W     L S   RL+ LS  K   + D W+     W++ PRR L + E   WA   + L       G D   W  +  G++T  S +  
Subjt:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI

Query:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
        L+     +L    ++ F NLW+ SIPK                                     +R+ E   HLF+ C  A  +    +        C L
Subjt:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL

Query:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
        S   LC    ++K K++++++  N + +  W IW ERNAR+F G   ++ +IW+D  +LA LW+S S   S+   +S+
Subjt:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein (e value: 6.5e-174; % identity: 36.43)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGP PFR ++  L +P F  N+ RWW  S   GHPGYSFI+RLKSLA  +K W+K   +SF   K  +  ++ ++D  E    L      +RLALK+DL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
        E++L E+++W QR KKLWL +GDEN+++FH++CTAR++RN I E+     L   ++  +    +  FS I+    + DP   +DN++W PI H +   L 
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI

Query:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
         PF E E+   I S+   K PGPDGF I F+K +W  LK  IM++F DF+ K ++N+N+N+TYIALIPKK +      +RPISLTT++YKI+AK L+ RL
Subjt:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL

Query:  KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
        K+ L +TIS  Q AFV  RQI+DAIL+ANEAVD WK  K +GF++KLDIEKAFD +NW FIDFVL KK FP  WR+WI  CIS+V+YSI +NGRP+    
Subjt:  KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---

Query:  ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
              QG P                                   +I+H+LFADDILLF++D+D +++N    +  FE+ASGL+INL KSA+  +NV   
Subjt:  ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ

Query:  RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
        R+ E A+ W    Q LP SYLGVPLGGNP    FW  + EKI+++++ W++A +SKGGRLTL++S L+++P Y LSVF+APSL+            WK +
Subjt:  RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES

Query:  --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
                                                                            S ISS   +AP        + F  N +W+L +
Subjt:  --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD

Query:  GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
        G  I FW+  W+    L +   RLF LS +K + V DAW+    +W I+ RR L DRE   WA     LP+P  + G     WIP  +  F+  SA+ ++
Subjt:  GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL

Query:  RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
           R    S G+    +   +W+++IP                                      ++ +ES  HLF+HC      W S+L+  F+LA   
Subjt:  RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL

Query:  QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
             LS D L  + F ++     + +++ + C    +A  W IW ERN R+F   S   +   +W++   L   W S      +   A++
Subjt:  QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 2.3e-171; % identity: 35.79)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGPCPFR +N  L +  F  N   WW+ S  +G PGY+FI+ L SL+K +K+W+    + +   K+ L  +I  +D LE  G +  ++ QKR++LKSDL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
         I   +A+ W QR ++ W   GDEN +YFH++CT  +R+N I  +      SL +  D+ +  ++HF +IY  E    +++DN+ W PI+ +  S L KP
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP

Query:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
        F E E+   I S  + KAPGPDG+T+ FYKK W  LK  ++ VF DF K  IVN NVN+T+IALI KK    K S YRPISLTT+LYKI+AK LA RLKS
Subjt:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS

Query:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
         L DTI+  Q AF+ GRQI+DAILIANEA+D WK  K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+      
Subjt:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----

Query:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
            QG P                                   +I+HLLFADD+L+F++D+++Y++N    +  FE+ASGL  N SKS ++ IN+   R+
Subjt:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS

Query:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
         +IA+ +    + LP +YLGVPLGGNP   +FWD  IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP                     
Subjt:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------

Query:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
                                                  + S + WK+                   +S  +S      + ++ +    +W   DG 
Subjt:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK

Query:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
        S+ FW  KW     L     RL+ LS  +S  V + W   S  WN+KPRR L +RE Q+W +    LPR   ++G     W PS    +T  SA++I  +
Subjt:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR

Query:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
            P  ++ E    +LW++ IP+                                     R S E ++HLF+ C +A  L   ++   G       ++ 
Subjt:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID

Query:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
         LC +      ++ ++I+  N  +A  W IW  RN  +F     S    W+D  +L   WSS SK   +   A++
Subjt:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein (e value: 6.7e-171; % identity: 35.69)
Query:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
        WGPCPFR +N  L +  F  N   WW+ S  +G PGY+FI+ L SL+K +K+W+    + +   K+ L  +I  +D LE  G +  ++ QKR++LKSDL 
Subjt:  WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ

Query:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
         I   +A+ W QR ++ W   GDEN +YFH++CT  +R+N I  +      SL +  D+ +  ++HF +IY  E    +++DN+ W PI+ +  S L KP
Subjt:  EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP

Query:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
        F E E+   I S  + KAPGPDG+T+ FYKK W  LK  ++ VF DF K  IVN NVN+T+IALI KK    K S YRPISLTT+LYKI+AK LA RLKS
Subjt:  FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS

Query:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
         L DTI+  Q AF+ GRQI+DAILIANE +D WK  K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+      
Subjt:  CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----

Query:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
            QG P                                   +I+HLLFADD+L+F++D+++Y++N    +  FE+ASGL  N SKS ++ IN+   R+
Subjt:  ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS

Query:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
         +IA+ +    + LP +YLGVPLGGNP   +FWD  IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP                     
Subjt:  MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------

Query:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
                                                  + S + WK+                   +S  +S      + ++ +    +W   DG 
Subjt:  ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK

Query:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
        S+ FW  KW     L     RL+ LS  +S  V + W   S  WN+KPRR L +RE Q+W +    LPR   ++G     W PS    +T  SA++I  +
Subjt:  SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR

Query:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
            P  ++ E    +LW++ IP+                                     R S E ++HLF+ C +A  L   ++   G       ++ 
Subjt:  GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID

Query:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
         LC +      ++ ++I+  N  +A  W IW  RN  +F     S    W+D  +L   WSS SK   +   A++
Subjt:  HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein (e value: 4.2e-173; % identity: 35.89)
Query:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
        L SWGP PF+  N  L  P F +N+  WW      GHPG+SF+R+LK L+  +++ ++ N     E K     +I ++D LE+ G L +    +R  LK+
Subjt:  LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS

Query:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
        D+      EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I  + S + +    +  + K  L HF  IY    E  P LI DN++W PI+  +A 
Subjt:  DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS

Query:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
         L   F+E+E++E + +  +NK+PGPDGFT+EFYK  W  LK  I+ +F DF    I+N+ VN T IALI KK    + + YRPISLTT++YK++AKV+A
Subjt:  ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA

Query:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
        ERLK  L  T++  Q AFV GRQI DAIL+ANEA+D W+  K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR 
Subjt:  ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD

Query:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
        +  PS                                          +THLLFADDILLF++DD+  I N   II  F+ ASGL INL+KS ++ INV  
Subjt:  QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN

Query:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
         R+ +IA++W    + LP +YLGVPLGG     TFW  + EKI +++  W+++MLSKGG++TL++S L ++P Y LS+FKAP  +            WK 
Subjt:  QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE

Query:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
          E                                                           +S G +         R+P        E F ++ +W+++
Subjt:  SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR

Query:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
        +G+S  FW   W     L S   RL+ LS  K   + D W+     W++ PRR L + E   WA   + L       G D   W  +  G++T  S +  
Subjt:  DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI

Query:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
        L+     +L    ++ F NLW+ SIPK                                     +R+ E   HLF+ C  A  +    +        C L
Subjt:  LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL

Query:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
        S   LC    ++K K++++++  N + +  W IW ERNAR+F G   ++ +IW+D  +LA LW+S S   S+   +S+
Subjt:  SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 3.0e-27 | 23.89
Query:  SNRQKRLALKSDLQEI-------ALFEARYW-SQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY----DVE
        S RQ+   ++++L+EI        + E+R W  +R  K+     D   A   ++   +R +N I  + +          +++  I  ++  +Y    +  
Subjt:  SNRQKRLALKSDLQEI-------ALFEARYW-SQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY----DVE

Query:  PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEK
         +    +D      +N  +  +L +P +  E+   I S+ + K+PGPDGFT EFY+++ + L   ++++F    K+ I+  +     I LIPK   ++ K
Subjt:  PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEK

Query:  ISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFP
           +RPISL     KIL K+LA R++  +   I   Q  F+ G Q    I  +I   N    + +   K   +I +D EKAFDKI   F+   L K G  
Subjt:  ISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFP

Query:  FQWREWIMACISSVSYSIFLNGRPRD---------QGHP-------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIK
          + + I A     + +I LNG+  +         QG P                                +   LFADD+++++++      N   +I 
Subjt:  FQWREWIMACISSVSYSIFLNGRPRD---------QGHP-------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIK

Query:  SFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVL
        +F + SG +IN+ KS     N   Q   +I       +      YLG+ L  +   L    + P++++IK   + W+    S  GR+ +++  +
Subjt:  SFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVL

P08548 LINE-1 reverse transcriptase homolog | 7.0e-24 | 23.2
Query:  RRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDN-IDWCPINHI---KASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKK
        +R ++ I  + + N       ++++K +  ++  +Y  + +    +D  ++ C +  +   +   L +P S  E+   I+++   K+PGPDGFT EFY+ 
Subjt:  RRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDN-IDWCPINHI---KASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKK

Query:  FWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIA
        F + L   ++ +F +  K+ I+        I LIPK   +  +   YRPISL     KIL K+L  R++  +   I   Q  F+ G Q    I  +I   
Subjt:  FWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIA

Query:  NEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNG--------------------------------RPR
        N    + K   K   ++ +D EKAFD I   F+   L K G    + + I A  S  + +I LNG                                  R
Subjt:  NEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNG--------------------------------RPR

Query:  DQ--------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL
        ++        G   I   LFADD+++++++          +IK +   SG +IN  KS        NQ    +       + P    YLGV L  +   L
Subjt:  DQ--------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL

Query:  --TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSV--FKAPSLSANQWKESSEISSGRV---RAPQLQESFIQNSAWELRDGKSILFW
            ++ + ++I   V+ W+    S  GR+ +++  +    +Y  +    KAP    + +K+  +I    +   + PQ+ ++ + N              
Subjt:  --TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSV--FKAPSLSANQWKESSEISSGRV---RAPQLQESFIQNSAWELRDGKSILFW

Query:  FDKWAGPDSLCSINNRLFHLSEEKSLLVSDAW----SPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKD--FLKW
            AG  +L  +  RL++    KS+++  AW    + E   WN +     +D     +  F  D P  ++  GKD  F KW
Subjt:  FDKWAGPDSLCSINNRLFHLSEEKSLLVSDAW----SPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKD--FLKW

P11369 LINE-1 retrotransposable element ORF2 protein | 1.7e-22 | 23.94
Query:  SNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKV-----CTARRRRNHIHELLSTNSLSLVAD-----ADLEKEILTHFSSIYDVE--
        S RQ+ + L+ ++ ++   E R   QR  +         + +F K+       AR  + H  ++L     +   D      +++  I + +  +Y  +  
Subjt:  SNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKV-----CTARRRRNHIHELLSTNSLSLVAD-----ADLEKEILTHFSSIYDVE--

Query:  --PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPK-KANS
           +    +D      +N  +   L  P S +E+   I S+ + K+PGPDGF+ EFY+ F + L   + ++FH    +  +  +     I LIPK + + 
Subjt:  --PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPK-KANS

Query:  EKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKG
         KI  +RPISL     KIL K+LA R++  +   I P Q  F+ G Q    I  +I + +    + K   K   +I LD EKAFDKI   F+  VL + G
Subjt:  EKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKG

Query:  FPFQWREWIMACISSVSYSIFLNGRPRD---------QGHPSITHL-------------------------------LFADDILLFMQDDDKYIDNFFFI
            +   I A  S    +I +NG   +         QG P   +L                               L ADD+++++ D          +
Subjt:  FPFQWREWIMACISSVSYSIFLNGRPRD---------QGHPSITHL-------------------------------LFADDILLFMQDDDKYIDNFFFI

Query:  IKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIP
        I SF +  G +IN +KS         Q   EI       +      YLGV L      L    +  + ++IK  +  W+    S  GR+ +++  +    
Subjt:  IKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIP

Query:  LYTLSV--FKAPSLSANQ
        +Y  +    K P+   N+
Subjt:  LYTLSV--FKAPSLSANQ

P14381 Transposon TX1 uncharacterized 149 kDa protein | 3.1e-24 | 25.96
Query:  AGLLDDSNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEP-DPGL
        +G  D + + + L  K  L+ +   +AR    R +   L D D  + +F+ +   +  R  I  L + +   L     +     + + +++  +P  P  
Subjt:  AGLLDDSNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEP-DPGL

Query:  IVDNIDWCP-INHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYR
          +  D  P ++  +   L  P +  E+ + ++ +  NK+PG DG TIEF++ FW TL      V  + FKK  +  +     ++L+PKK +   I  +R
Subjt:  IVDNIDWCP-INHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYR

Query:  PISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMA
        P+SL +  YKI+AK ++ RLKS L + I P QS  V GR I D + +  + +   + +      + LD EKAFD+++  ++   L    F  Q+  ++  
Subjt:  PISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMA

Query:  CISSVSYSIFLN
          +S    + +N
Subjt:  CISSVSYSIFLN

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | 9.2e-08 | 22.6
Query:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIA----LIPKKANSEKISKYRPISLTTALYKILAKVL
        +P + +E+   IK      APG DG T++   +           +  +F +  ++  +V   + A    LIPK  + E  S +RPI++ +AL ++L ++L
Subjt:  KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIA----LIPKKANSEKISKYRPISLTTALYKILAKVL

Query:  AERLKSCLVDTISPFQSAF--VCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSI-----
        A+RL++ +   + P Q  +  + G  ++   L+ +  +   +  +K   ++ LD+ KAFD ++ S I   L + G       +I   +S  + +I     
Subjt:  AERLKSCLVDTISPFQSAF--VCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSI-----

Query:  --------------------FLNGRPRDQ---------------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKS
                            FL     D+               G   I  L FADD+LL ++D+D  +      + +F +  G+ +N  KS
Subjt:  --------------------FLNGRPRDQ---------------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKS

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.3e-25 | 29.01
Query:  FRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSN---RQKRLALKSDLQEI
        FR+ +FL  +P+F  ++   W E    G   +S    LK+ AKK    K LN   F   +    + + +L+ ++S  L + S+   R + +A K      
Subjt:  FRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSN---RQKRLALKSDLQEI

Query:  ALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYD-----VEPDPGLIVDNIDWCPINHIKASAL
        A  E+ ++ Q+ +  WL DGD NT +FHKV  A + +N I  L   + + +     +++ I+ +++ +       + PD    + +I     N   AS L
Subjt:  ALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYD-----VEPDPGLIVDNIDWCPINHIKASAL

Query:  IKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKIL
            S++E+   + ++  NKAPGPD FT EF+ + W  +K S +    +FF+   + +  N T I LIPK    +++S +RP+S  T +YKI+
Subjt:  IKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKIL

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 6.1e-07 | 37.31
Query:  LPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPS
        LP  YLG+PL       + + P++EKI+ R+  W    LS  GRL L+ SV++++  + +S F+ PS
Subjt:  LPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 3.5e-10 | 39.51
Query:  LAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSK--KRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQW
        + ERLK  + + I P Q++F+ GR  +D I+   EAV   +  K  K   L+KLD+EKA+D+I W +++  L+  GFP  W
Subjt:  LAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSK--KRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQW


Sequences
CDS sequence
ATGGTTTTTAGGGGGTTTAGGGTTTTAGTGGTTAAGATTTTGGCTTTAGAGTTTTACGGTTTTAGGATTATAGTTTTAAGTTCTTGGGGTCCTTGCCCTTTTCGGTTTGA
TAATTTTTTGCTGGCTAATCCATCCTTCACTTCTAATATCGAAAGATGGTGGTCCGAATCTACTGCCTCTGGCCATCCGGGCTATTCGTTTATCCGCAGGCTTAAATCTC
TAGCCAAGAAAGTTAAGGATTGGAAAAAATTAAACACAGATTCGTTTAAAGAAAAAAAACGTTGCCTAGCTGATGACATTCAGAATCTTGACATCCTTGAATCAGCTGGT
CTTTTAGACGACTCGAACAGACAGAAGCGTTTGGCCCTTAAATCGGATCTCCAGGAGATTGCTCTCTTTGAAGCCCGGTACTGGAGTCAACGTTGCAAAAAATTATGGCT
CAGTGATGGCGATGAGAATACTGCCTATTTCCACAAAGTTTGTACAGCCCGTCGGAGAAGGAATCATATCCACGAACTTTTATCCACAAATAGTCTTAGCTTGGTGGCTG
ATGCAGATCTTGAAAAGGAAATTCTTACTCACTTCTCCTCCATTTATGATGTGGAGCCCGACCCAGGTTTGATTGTTGATAATATTGATTGGTGCCCTATTAACCACATC
AAAGCCTCTGCTCTAATTAAGCCTTTCAGCGAGCAGGAAGTGTACGAGGGAATCAAAAGCATTGGTTCAAATAAAGCTCCGGGACCAGACGGATTCACCATTGAATTTTA
CAAAAAGTTTTGGAAGACTCTCAAGCATTCTATTATGGAAGTGTTCCACGATTTTTTCAAAAAGAAGATTGTGAACCGCAACGTCAATCATACCTACATTGCCCTCATCC
CAAAGAAAGCCAACTCAGAGAAAATTTCAAAATACAGACCAATTAGTTTGACCACTGCTCTCTACAAAATCCTTGCTAAGGTGCTTGCCGAAAGACTAAAATCTTGTCTG
GTGGATACAATCAGCCCTTTCCAATCGGCTTTTGTGTGCGGTAGACAAATATCCGATGCTATCTTGATTGCTAACGAAGCGGTGGACCTTTGGAAATGTTCGAAAAAGAG
AGGATTCCTCATAAAGCTAGACATAGAGAAAGCTTTTGACAAAATCAATTGGAGCTTCATTGATTTTGTTCTTATGAAGAAAGGTTTTCCCTTTCAGTGGAGGGAGTGGA
TCATGGCTTGCATCTCTTCAGTTTCATATTCAATTTTCTTGAACGGAAGACCGAGAGATCAGGGGCACCCTTCCATCACTCACTTACTCTTTGCAGACGATATTTTGCTC
TTTATGCAGGATGATGACAAATATATTGATAACTTTTTCTTCATCATCAAATCTTTCGAACAAGCCTCGGGCCTTCGAATCAACTTATCTAAGTCTGCAGTGACAGGTAT
AAATGTTCCTAATCAGAGATCTATGGAGATTGCTGCTAGATGGAATTGTTTGCTCCAGCCACTCCCTACTTCATATCTCGGAGTGCCCTTGGGAGGCAATCCTACAAAAC
TCACGTTCTGGGATCCTATGATCGAGAAAATCAAGCGCAGGGTTGACGGTTGGCGCTTTGCGATGCTGTCTAAAGGTGGTCGTCTGACCCTCCTTCAATCAGTGCTTAAC
AATATTCCTTTATACACTCTCTCAGTTTTCAAAGCCCCATCTCTGTCTGCAAATCAATGGAAAGAATCTTCCGAAATTTCCTCTGGAAGGGTTCGGGCTCCTCAACTGCA
AGAATCCTTTATTCAAAATTCCGCATGGGAGCTTAGAGACGGTAAATCCATTCTTTTTTGGTTTGATAAATGGGCTGGTCCCGATTCTCTATGTTCCATCAACAATCGGC
TTTTTCATCTATCTGAGGAGAAAAGCTTATTGGTGTCTGATGCTTGGTCCCCTGAATCTCAAAGGTGGAACATTAAGCCTAGAAGAAATCTTCTTGACAGAGAGTTGCAA
TCTTGGGCTGCTTTCACCTCTGATTTACCTAGGCCTGATGTTTCCAAAGGCAAAGATTTTCTCAAATGGATCCCTTCAAAGGAGGGCATATTTACAACCAAGTCTGCTCG
GAATATTTTACGAGGTCACAGAGCCCCCGTCCTCAGTCATGGGGAATCAGTTTTTAATAATCTCTGGCAAGCCAGTATTCCTAAGAGAAGAAGCACTGAATCTATAGACC
ACCTCTTCGTTCATTGCAGTTGGGCTTCCTACCTTCGGTTTAAATTTAATTTAGCGGCTGGTCTTCAAGCTCCATGCCCTCTCTCGATTGACCATCTCTGTGCCGAAGCC
TTCGCGTATAAAGCTAAATCCCAAAGAGATATCCTTTGTCGAAATTTCTTTGTTGCTTATACTTGGTATATTTGGAAGGAGAGGAACGCTAGAGTCTTTCAGGGAACCTC
TTGCTCTATCTATCAGATCTGGGATGACTCCATCTCTCTTGCTGCGCTTTGGTCCTCTAACTCCAAGGATCCTTCTCATTCGGATGTGGCCTCAGTTCATTCATGTACCC
CTCCTAACTCGGGTGTTACATGCCCACCAGCTTCCGCCTTGGTTCGTCCCCGAACCACATCTTACTGGGAGAGGTTCCGCTCTGATACCATCTGTAACGCCCCAGGCCCA
GGATTCGGAATCCGGATTCGGCCCTTAACGGCCCCGGCAATCCCCTGCGACTTGGTCGCGCCACCATACTTCTCTGATACTACTTCTAAGATTGAAGACTATCCCCACAA
ACCAACACGGGGTCCTTTTAGCATGCTTTGTCCTCACTCACATGCTTCCTAG
mRNA sequence
ATGGTTTTTAGGGGGTTTAGGGTTTTAGTGGTTAAGATTTTGGCTTTAGAGTTTTACGGTTTTAGGATTATAGTTTTAAGTTCTTGGGGTCCTTGCCCTTTTCGGTTTGA
TAATTTTTTGCTGGCTAATCCATCCTTCACTTCTAATATCGAAAGATGGTGGTCCGAATCTACTGCCTCTGGCCATCCGGGCTATTCGTTTATCCGCAGGCTTAAATCTC
TAGCCAAGAAAGTTAAGGATTGGAAAAAATTAAACACAGATTCGTTTAAAGAAAAAAAACGTTGCCTAGCTGATGACATTCAGAATCTTGACATCCTTGAATCAGCTGGT
CTTTTAGACGACTCGAACAGACAGAAGCGTTTGGCCCTTAAATCGGATCTCCAGGAGATTGCTCTCTTTGAAGCCCGGTACTGGAGTCAACGTTGCAAAAAATTATGGCT
CAGTGATGGCGATGAGAATACTGCCTATTTCCACAAAGTTTGTACAGCCCGTCGGAGAAGGAATCATATCCACGAACTTTTATCCACAAATAGTCTTAGCTTGGTGGCTG
ATGCAGATCTTGAAAAGGAAATTCTTACTCACTTCTCCTCCATTTATGATGTGGAGCCCGACCCAGGTTTGATTGTTGATAATATTGATTGGTGCCCTATTAACCACATC
AAAGCCTCTGCTCTAATTAAGCCTTTCAGCGAGCAGGAAGTGTACGAGGGAATCAAAAGCATTGGTTCAAATAAAGCTCCGGGACCAGACGGATTCACCATTGAATTTTA
CAAAAAGTTTTGGAAGACTCTCAAGCATTCTATTATGGAAGTGTTCCACGATTTTTTCAAAAAGAAGATTGTGAACCGCAACGTCAATCATACCTACATTGCCCTCATCC
CAAAGAAAGCCAACTCAGAGAAAATTTCAAAATACAGACCAATTAGTTTGACCACTGCTCTCTACAAAATCCTTGCTAAGGTGCTTGCCGAAAGACTAAAATCTTGTCTG
GTGGATACAATCAGCCCTTTCCAATCGGCTTTTGTGTGCGGTAGACAAATATCCGATGCTATCTTGATTGCTAACGAAGCGGTGGACCTTTGGAAATGTTCGAAAAAGAG
AGGATTCCTCATAAAGCTAGACATAGAGAAAGCTTTTGACAAAATCAATTGGAGCTTCATTGATTTTGTTCTTATGAAGAAAGGTTTTCCCTTTCAGTGGAGGGAGTGGA
TCATGGCTTGCATCTCTTCAGTTTCATATTCAATTTTCTTGAACGGAAGACCGAGAGATCAGGGGCACCCTTCCATCACTCACTTACTCTTTGCAGACGATATTTTGCTC
TTTATGCAGGATGATGACAAATATATTGATAACTTTTTCTTCATCATCAAATCTTTCGAACAAGCCTCGGGCCTTCGAATCAACTTATCTAAGTCTGCAGTGACAGGTAT
AAATGTTCCTAATCAGAGATCTATGGAGATTGCTGCTAGATGGAATTGTTTGCTCCAGCCACTCCCTACTTCATATCTCGGAGTGCCCTTGGGAGGCAATCCTACAAAAC
TCACGTTCTGGGATCCTATGATCGAGAAAATCAAGCGCAGGGTTGACGGTTGGCGCTTTGCGATGCTGTCTAAAGGTGGTCGTCTGACCCTCCTTCAATCAGTGCTTAAC
AATATTCCTTTATACACTCTCTCAGTTTTCAAAGCCCCATCTCTGTCTGCAAATCAATGGAAAGAATCTTCCGAAATTTCCTCTGGAAGGGTTCGGGCTCCTCAACTGCA
AGAATCCTTTATTCAAAATTCCGCATGGGAGCTTAGAGACGGTAAATCCATTCTTTTTTGGTTTGATAAATGGGCTGGTCCCGATTCTCTATGTTCCATCAACAATCGGC
TTTTTCATCTATCTGAGGAGAAAAGCTTATTGGTGTCTGATGCTTGGTCCCCTGAATCTCAAAGGTGGAACATTAAGCCTAGAAGAAATCTTCTTGACAGAGAGTTGCAA
TCTTGGGCTGCTTTCACCTCTGATTTACCTAGGCCTGATGTTTCCAAAGGCAAAGATTTTCTCAAATGGATCCCTTCAAAGGAGGGCATATTTACAACCAAGTCTGCTCG
GAATATTTTACGAGGTCACAGAGCCCCCGTCCTCAGTCATGGGGAATCAGTTTTTAATAATCTCTGGCAAGCCAGTATTCCTAAGAGAAGAAGCACTGAATCTATAGACC
ACCTCTTCGTTCATTGCAGTTGGGCTTCCTACCTTCGGTTTAAATTTAATTTAGCGGCTGGTCTTCAAGCTCCATGCCCTCTCTCGATTGACCATCTCTGTGCCGAAGCC
TTCGCGTATAAAGCTAAATCCCAAAGAGATATCCTTTGTCGAAATTTCTTTGTTGCTTATACTTGGTATATTTGGAAGGAGAGGAACGCTAGAGTCTTTCAGGGAACCTC
TTGCTCTATCTATCAGATCTGGGATGACTCCATCTCTCTTGCTGCGCTTTGGTCCTCTAACTCCAAGGATCCTTCTCATTCGGATGTGGCCTCAGTTCATTCATGTACCC
CTCCTAACTCGGGTGTTACATGCCCACCAGCTTCCGCCTTGGTTCGTCCCCGAACCACATCTTACTGGGAGAGGTTCCGCTCTGATACCATCTGTAACGCCCCAGGCCCA
GGATTCGGAATCCGGATTCGGCCCTTAACGGCCCCGGCAATCCCCTGCGACTTGGTCGCGCCACCATACTTCTCTGATACTACTTCTAAGATTGAAGACTATCCCCACAA
ACCAACACGGGGTCCTTTTAGCATGCTTTGTCCTCACTCACATGCTTCCTAG
Protein sequence
MVFRGFRVLVVKILALEFYGFRIIVLSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAG
LLDDSNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHI
KASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKSCL
VDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRDQGHPSITHLLFADDILL
FMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLN
NIPLYTLSVFKAPSLSANQWKESSEISSGRVRAPQLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQ
SWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNILRGHRAPVLSHGESVFNNLWQASIPKRRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSIDHLCAEA
FAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASVHSCTPPNSGVTCPPASALVRPRTTSYWERFRSDTICNAPGP
GFGIRIRPLTAPAIPCDLVAPPYFSDTTSKIEDYPHKPTRGPFSMLCPHSHAS
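
The protein sequence above appears to be the conceptual translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the function name `translate` and the constants are illustrative and not part of CuGenDB. Unrecognised codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and whitespace so wrapped sequence lines can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGGTTTTTAGGGGGTTTAGGGTTTTAGTG"))
```

Pasting the full CDS into `translate` should reproduce the protein sequence listed above if the assumption about the genetic code holds.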