| GenBank top hits | e value | %identity | Alignment |
|---|---|---|---|
| KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 8.7e-173 | 35.89 | |
Query: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
L SWGP PF+ N L P F +N+ WW GHPG+SF+R+LK L+ +++ ++ N E K +I ++D LE+ G L + +R LK+
Subjt: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
Query: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
D+ EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I + S + + + + K L HF IY E P LI DN++W PI+ +A
Subjt: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
Query: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
L F+E+E++E + + +NK+PGPDGFT+EFYK W LK I+ +F DF I+N+ VN T IALI KK + + YRPISLTT++YK++AKV+A
Subjt: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
Query: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
ERLK L T++ Q AFV GRQI DAIL+ANEA+D W+ K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR
Subjt: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
Query: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
+ PS +THLLFADDILLF++DD+ I N II F+ ASGL INL+KS ++ INV
Subjt: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
Query: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
R+ +IA++W + LP +YLGVPLGG TFW + EKI +++ W+++MLSKGG++TL++S L ++P Y LS+FKAP + WK
Subjt: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
Query: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
E +S G + R+P E F ++ +W+++
Subjt: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
Query: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
+G+S FW W L S RL+ LS K + D W+ W++ PRR L + E WA + L G D W + G++T S +
Subjt: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
Query: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
L+ +L ++ F NLW+ SIPK +R+ E HLF+ C A + + C L
Subjt: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
Query: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
S LC ++K K++++++ N + + W IW ERNAR+F G ++ +IW+D +LA LW+S S S+ +S+
Subjt: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.3e-173 | 36.43 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGP PFR ++ L +P F N+ RWW S GHPGYSFI+RLKSLA +K W+K +SF K + ++ ++D E L +RLALK+DL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
E++L E+++W QR KKLWL +GDEN+++FH++CTAR++RN I E+ L ++ + + FS I+ + DP +DN++W PI H + L
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
Query: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
PF E E+ I S+ K PGPDGF I F+K +W LK IM++F DF+ K ++N+N+N+TYIALIPKK + +RPISLTT++YKI+AK L+ RL
Subjt: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
Query: KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
K+ L +TIS Q AFV RQI+DAIL+ANEAVD WK K +GF++KLDIEKAFD +NW FIDFVL KK FP WR+WI CIS+V+YSI +NGRP+
Subjt: KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
Query: ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
QG P +I+H+LFADDILLF++D+D +++N + FE+ASGL+INL KSA+ +NV
Subjt: ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
Query: RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
R+ E A+ W Q LP SYLGVPLGGNP FW + EKI+++++ W++A +SKGGRLTL++S L+++P Y LSVF+APSL+ WK +
Subjt: RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
Query: --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
S ISS +AP + F N +W+L +
Subjt: --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
Query: GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
G I FW+ W+ L + RLF LS +K + V DAW+ +W I+ RR L DRE WA LP+P + G WIP + F+ SA+ ++
Subjt: GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
Query: RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
R S G+ + +W+++IP ++ +ES HLF+HC W S+L+ F+LA
Subjt: RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
Query: QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LS D L + F ++ + +++ + C +A W IW ERN R+F S + +W++ L W S + A++
Subjt: QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 4.8e-171 | 35.79 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGPCPFR +N L + F N WW+ S +G PGY+FI+ L SL+K +K+W+ + + K+ L +I +D LE G + ++ QKR++LKSDL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
I +A+ W QR ++ W GDEN +YFH++CT +R+N I + SL + D+ + ++HF +IY E +++DN+ W PI+ + S L KP
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
Query: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
F E E+ I S + KAPGPDG+T+ FYKK W LK ++ VF DF K IVN NVN+T+IALI KK K S YRPISLTT+LYKI+AK LA RLKS
Subjt: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
Query: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
L DTI+ Q AF+ GRQI+DAILIANEA+D WK K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+
Subjt: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
Query: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
QG P +I+HLLFADD+L+F++D+++Y++N + FE+ASGL N SKS ++ IN+ R+
Subjt: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
Query: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
+IA+ + + LP +YLGVPLGGNP +FWD IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP
Subjt: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
Query: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
+ S + WK+ +S +S + ++ + +W DG
Subjt: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
Query: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
S+ FW KW L RL+ LS +S V + W S WN+KPRR L +RE Q+W + LPR ++G W PS +T SA++I +
Subjt: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
Query: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
P ++ E +LW++ IP+ R S E ++HLF+ C +A L ++ G ++
Subjt: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
Query: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LC + ++ ++I+ N +A W IW RN +F S W+D +L WSS SK + A++
Subjt: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.4e-170 | 35.69 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGPCPFR +N L + F N WW+ S +G PGY+FI+ L SL+K +K+W+ + + K+ L +I +D LE G + ++ QKR++LKSDL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
I +A+ W QR ++ W GDEN +YFH++CT +R+N I + SL + D+ + ++HF +IY E +++DN+ W PI+ + S L KP
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
Query: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
F E E+ I S + KAPGPDG+T+ FYKK W LK ++ VF DF K IVN NVN+T+IALI KK K S YRPISLTT+LYKI+AK LA RLKS
Subjt: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
Query: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
L DTI+ Q AF+ GRQI+DAILIANE +D WK K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+
Subjt: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
Query: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
QG P +I+HLLFADD+L+F++D+++Y++N + FE+ASGL N SKS ++ IN+ R+
Subjt: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
Query: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
+IA+ + + LP +YLGVPLGGNP +FWD IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP
Subjt: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
Query: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
+ S + WK+ +S +S + ++ + +W DG
Subjt: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
Query: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
S+ FW KW L RL+ LS +S V + W S WN+KPRR L +RE Q+W + LPR ++G W PS +T SA++I +
Subjt: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
Query: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
P ++ E +LW++ IP+ R S E ++HLF+ C +A L ++ G ++
Subjt: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
Query: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LC + ++ ++I+ N +A W IW RN +F S W+D +L WSS SK + A++
Subjt: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo] | 2.5e-172 | 35.79 | |
Query: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
L SWGP PF+ N L P F +N+ WW GHPG+SF+R+LK L+ +++ ++ N E K +I ++D LE+ G L + +R LK+
Subjt: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
Query: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
D+ EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I + S + + + + K L HF IY E P LI DN++W PI+ +A
Subjt: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
Query: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
L F+E+E++E + + +NK+PGPDGFT+EFYK W LK I+ +F DF I+N+ VN T IALI KK + + YRPISLTT++YK++AKV+A
Subjt: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
Query: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
ERLK L T++ Q AFV GRQI DAIL+ANEA+D W+ K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR
Subjt: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
Query: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
+ PS +THLLFADDILLF++DD+ I N II F+ ASGL INL+KS ++ INV
Subjt: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
Query: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
R+ +IA++W + LP +YLGVPLGG TFW + EKI +++ W+++MLSKGG++TL++S L ++P Y LS+FK P + WK
Subjt: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
Query: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
E +S G + R+P E F ++ +W+++
Subjt: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
Query: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
+G+S FW W L S RL+ LS K + D W+ W++ PRR L + E WA + L G D W + G++T S +
Subjt: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
Query: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
L+ +L ++ F NLW+ SIPK +R+ E HLF+ C A + + C L
Subjt: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
Query: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
S LC ++K K++++++ N + + W IW ERNAR+F G ++ +IW+D +LA LW+S S S+ +S+
Subjt: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|---|---|---|
| A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein | 1.2e-172 | 35.79 | |
Query: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
L SWGP PF+ N L P F +N+ WW GHPG+SF+R+LK L+ +++ ++ N E K +I ++D LE+ G L + +R LK+
Subjt: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
Query: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
D+ EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I + S + + + + K L HF IY E P LI DN++W PI+ +A
Subjt: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
Query: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
L F+E+E++E + + +NK+PGPDGFT+EFYK W LK I+ +F DF I+N+ VN T IALI KK + + YRPISLTT++YK++AKV+A
Subjt: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
Query: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
ERLK L T++ Q AFV GRQI DAIL+ANEA+D W+ K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR
Subjt: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
Query: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
+ PS +THLLFADDILLF++DD+ I N II F+ ASGL INL+KS ++ INV
Subjt: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
Query: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
R+ +IA++W + LP +YLGVPLGG TFW + EKI +++ W+++MLSKGG++TL++S L ++P Y LS+FK P + WK
Subjt: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
Query: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
E +S G + R+P E F ++ +W+++
Subjt: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
Query: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
+G+S FW W L S RL+ LS K + D W+ W++ PRR L + E WA + L G D W + G++T S +
Subjt: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
Query: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
L+ +L ++ F NLW+ SIPK +R+ E HLF+ C A + + C L
Subjt: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
Query: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
S LC ++K K++++++ N + + W IW ERNAR+F G ++ +IW+D +LA LW+S S S+ +S+
Subjt: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein | 6.5e-174 | 36.43 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGP PFR ++ L +P F N+ RWW S GHPGYSFI+RLKSLA +K W+K +SF K + ++ ++D E L +RLALK+DL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
E++L E+++W QR KKLWL +GDEN+++FH++CTAR++RN I E+ L ++ + + FS I+ + DP +DN++W PI H + L
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKASALI
Query: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
PF E E+ I S+ K PGPDGF I F+K +W LK IM++F DF+ K ++N+N+N+TYIALIPKK + +RPISLTT++YKI+AK L+ RL
Subjt: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERL
Query: KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
K+ L +TIS Q AFV RQI+DAIL+ANEAVD WK K +GF++KLDIEKAFD +NW FIDFVL KK FP WR+WI CIS+V+YSI +NGRP+
Subjt: KSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD---
Query: ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
QG P +I+H+LFADDILLF++D+D +++N + FE+ASGL+INL KSA+ +NV
Subjt: ------QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQ
Query: RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
R+ E A+ W Q LP SYLGVPLGGNP FW + EKI+++++ W++A +SKGGRLTL++S L+++P Y LSVF+APSL+ WK +
Subjt: RSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKES
Query: --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
S ISS +AP + F N +W+L +
Subjt: --------------------------------------------------------------------SEISSGRVRAPQLQ-----ESFIQNSAWELRD
Query: GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
G I FW+ W+ L + RLF LS +K + V DAW+ +W I+ RR L DRE WA LP+P + G WIP + F+ SA+ ++
Subjt: GKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNIL
Query: RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
R S G+ + +W+++IP ++ +ES HLF+HC W S+L+ F+LA
Subjt: RGHRAPVLSHGE---SVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCS-----WASYLRFKFNLAAGL
Query: QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LS D L + F ++ + +++ + C +A W IW ERN R+F S + +W++ L W S + A++
Subjt: QAPCPLSIDHLCAEAFAYK-----AKSQRDILCRNFFVAYTWYIWKERNARVFQGTS--CSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| A0A5A7US62 LINE-1 retrotransposable element ORF2 protein | 2.3e-171 | 35.79 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGPCPFR +N L + F N WW+ S +G PGY+FI+ L SL+K +K+W+ + + K+ L +I +D LE G + ++ QKR++LKSDL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
I +A+ W QR ++ W GDEN +YFH++CT +R+N I + SL + D+ + ++HF +IY E +++DN+ W PI+ + S L KP
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
Query: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
F E E+ I S + KAPGPDG+T+ FYKK W LK ++ VF DF K IVN NVN+T+IALI KK K S YRPISLTT+LYKI+AK LA RLKS
Subjt: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
Query: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
L DTI+ Q AF+ GRQI+DAILIANEA+D WK K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+
Subjt: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
Query: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
QG P +I+HLLFADD+L+F++D+++Y++N + FE+ASGL N SKS ++ IN+ R+
Subjt: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
Query: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
+IA+ + + LP +YLGVPLGGNP +FWD IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP
Subjt: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
Query: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
+ S + WK+ +S +S + ++ + +W DG
Subjt: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
Query: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
S+ FW KW L RL+ LS +S V + W S WN+KPRR L +RE Q+W + LPR ++G W PS +T SA++I +
Subjt: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
Query: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
P ++ E +LW++ IP+ R S E ++HLF+ C +A L ++ G ++
Subjt: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
Query: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LC + ++ ++I+ N +A W IW RN +F S W+D +L WSS SK + A++
Subjt: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein | 6.7e-171 | 35.69 | |
Query: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
WGPCPFR +N L + F N WW+ S +G PGY+FI+ L SL+K +K+W+ + + K+ L +I +D LE G + ++ QKR++LKSDL
Subjt: WGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKSDLQ
Query: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
I +A+ W QR ++ W GDEN +YFH++CT +R+N I + SL + D+ + ++HF +IY E +++DN+ W PI+ + S L KP
Subjt: EIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDNIDWCPINHIKASALIKP
Query: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
F E E+ I S + KAPGPDG+T+ FYKK W LK ++ VF DF K IVN NVN+T+IALI KK K S YRPISLTT+LYKI+AK LA RLKS
Subjt: FSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLAERLKS
Query: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
L DTI+ Q AF+ GRQI+DAILIANE +D WK K +GF++KLDIEKAFDKI+WSFID++L KK FP +WR+WI ACIS+V YSI LNG P+
Subjt: CLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD-----
Query: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
QG P +I+HLLFADD+L+F++D+++Y++N + FE+ASGL N SKS ++ IN+ R+
Subjt: ----QGHP-----------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRS
Query: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
+IA+ + + LP +YLGVPLGGNP +FWD IE I ++++GW+++ +SKGGRLTLL++ L+++P Y LS FKAP
Subjt: MEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAP---------------------
Query: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
+ S + WK+ +S +S + ++ + +W DG
Subjt: ------------------------------------------SLSANQWKE-------------------SSEISSGRVRAPQLQESFIQNSAWELRDGK
Query: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
S+ FW KW L RL+ LS +S V + W S WN+KPRR L +RE Q+W + LPR ++G W PS +T SA++I +
Subjt: SILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI-LR
Query: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
P ++ E +LW++ IP+ R S E ++HLF+ C +A L ++ G ++
Subjt: GHRAPVLSHGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPLSID
Query: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
LC + ++ ++I+ N +A W IW RN +F S W+D +L WSS SK + A++
Subjt: HLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein | 4.2e-173 | 35.89 | |
Query: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
L SWGP PF+ N L P F +N+ WW GHPG+SF+R+LK L+ +++ ++ N E K +I ++D LE+ G L + +R LK+
Subjt: LSSWGPCPFRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSNRQKRLALKS
Query: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
D+ EA+ W Q+ K+LW+++GDENT++FHK+C+AR+RR+ I + S + + + + K L HF IY E P LI DN++W PI+ +A
Subjt: DLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY--DVEPDPGLIVDNIDWCPINHIKAS
Query: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
L F+E+E++E + + +NK+PGPDGFT+EFYK W LK I+ +F DF I+N+ VN T IALI KK + + YRPISLTT++YK++AKV+A
Subjt: ALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKILAKVLA
Query: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
ERLK L T++ Q AFV GRQI DAIL+ANEA+D W+ K +GF+IKLDIEKAFDK+NW FIDF+LMKKG+PF+WR WI ACISSV YSI +NGRPR
Subjt: ERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNGRPRD
Query: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
+ PS +THLLFADDILLF++DD+ I N II F+ ASGL INL+KS ++ INV
Subjt: QGHPS------------------------------------------ITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPN
Query: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
R+ +IA++W + LP +YLGVPLGG TFW + EKI +++ W+++MLSKGG++TL++S L ++P Y LS+FKAP + WK
Subjt: QRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPSLSANQ---------WKE
Query: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
E +S G + R+P E F ++ +W+++
Subjt: SSE-----------------------------------------------------------ISSGRV---------RAPQLQ-----ESFIQNSAWELR
Query: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
+G+S FW W L S RL+ LS K + D W+ W++ PRR L + E WA + L G D W + G++T S +
Subjt: DGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLKWIPSKEGIFTTKSARNI
Query: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
L+ +L ++ F NLW+ SIPK +R+ E HLF+ C A + + C L
Subjt: LRGHRAPVLS-HGESVFNNLWQASIPK-------------------------------------RRSTESIDHLFVHCSWASYLRFKFNLAAGLQAPCPL
Query: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
S LC ++K K++++++ N + + W IW ERNAR+F G ++ +IW+D +LA LW+S S S+ +S+
Subjt: SIDHLCAEAFAYKAKSQRDILCRNFFVAYTWYIWKERNARVFQGTSCSIYQIWDDSISLAALWSSNSKDPSHSDVASV
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|---|---|---|
| O00370 LINE-1 retrotransposable element ORF2 protein | 3.0e-27 | 23.89 | |
Query: SNRQKRLALKSDLQEI-------ALFEARYW-SQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY----DVE
S RQ+ ++++L+EI + E+R W +R K+ D A ++ +R +N I + + +++ I ++ +Y +
Subjt: SNRQKRLALKSDLQEI-------ALFEARYW-SQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIY----DVE
Query: PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEK
+ +D +N + +L +P + E+ I S+ + K+PGPDGFT EFY+++ + L ++++F K+ I+ + I LIPK ++ K
Subjt: PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEK
Query: ISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFP
+RPISL KIL K+LA R++ + I Q F+ G Q I +I N + + K +I +D EKAFDKI F+ L K G
Subjt: ISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFP
Query: FQWREWIMACISSVSYSIFLNGRPRD---------QGHP-------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIK
+ + I A + +I LNG+ + QG P + LFADD+++++++ N +I
Subjt: FQWREWIMACISSVSYSIFLNGRPRD---------QGHP-------------------------------SITHLLFADDILLFMQDDDKYIDNFFFIIK
Query: SFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVL
+F + SG +IN+ KS N Q +I + YLG+ L + L + P++++IK + W+ S GR+ +++ +
Subjt: SFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVL
|
|
| P08548 LINE-1 reverse transcriptase homolog | 7.0e-24 | 23.2 | |
Query: RRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDN-IDWCPINHI---KASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKK
+R ++ I + + N ++++K + ++ +Y + + +D ++ C + + + L +P S E+ I+++ K+PGPDGFT EFY+
Subjt: RRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEPDPGLIVDN-IDWCPINHI---KASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKK
Query: FWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIA
F + L ++ +F + K+ I+ I LIPK + + YRPISL KIL K+L R++ + I Q F+ G Q I +I
Subjt: FWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKA-NSEKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIA
Query: NEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNG--------------------------------RPR
N + K K ++ +D EKAFD I F+ L K G + + I A S + +I LNG R
Subjt: NEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSIFLNG--------------------------------RPR
Query: DQ--------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL
++ G I LFADD+++++++ +IK + SG +IN KS NQ + + P YLGV L + L
Subjt: DQ--------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL
Query: --TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSV--FKAPSLSANQWKESSEISSGRV---RAPQLQESFIQNSAWELRDGKSILFW
++ + ++I V+ W+ S GR+ +++ + +Y + KAP + +K+ +I + + PQ+ ++ + N
Subjt: --TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSV--FKAPSLSANQWKESSEISSGRV---RAPQLQESFIQNSAWELRDGKSILFW
Query: FDKWAGPDSLCSINNRLFHLSEEKSLLVSDAW----SPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKD--FLKW
AG +L + RL++ KS+++ AW + E WN + +D + F D P ++ GKD F KW
Subjt: FDKWAGPDSLCSINNRLFHLSEEKSLLVSDAW----SPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKD--FLKW
|
|
| P11369 LINE-1 retrotransposable element ORF2 protein | 1.7e-22 | 23.94 | |
Query: SNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKV-----CTARRRRNHIHELLSTNSLSLVAD-----ADLEKEILTHFSSIYDVE--
S RQ+ + L+ ++ ++ E R QR + + +F K+ AR + H ++L + D +++ I + + +Y +
Subjt: SNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKV-----CTARRRRNHIHELLSTNSLSLVAD-----ADLEKEILTHFSSIYDVE--
Query: --PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPK-KANS
+ +D +N + L P S +E+ I S+ + K+PGPDGF+ EFY+ F + L + ++FH + + + I LIPK + +
Subjt: --PDPGLIVDNIDWCPINHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPK-KANS
Query: EKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKG
KI +RPISL KIL K+LA R++ + I P Q F+ G Q I +I + + + K K +I LD EKAFDKI F+ VL + G
Subjt: EKISKYRPISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQ----ISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKG
Query: FPFQWREWIMACISSVSYSIFLNGRPRD---------QGHPSITHL-------------------------------LFADDILLFMQDDDKYIDNFFFI
+ I A S +I +NG + QG P +L L ADD+++++ D +
Subjt: FPFQWREWIMACISSVSYSIFLNGRPRD---------QGHPSITHL-------------------------------LFADDILLFMQDDDKYIDNFFFI
Query: IKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIP
I SF + G +IN +KS Q EI + YLGV L L + + ++IK + W+ S GR+ +++ +
Subjt: IKSFEQASGLRINLSKSAVTGINVPNQRSMEIAARWNCLLQPLPTSYLGVPLGGNPTKL--TFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIP
Query: LYTLSV--FKAPSLSANQ
+Y + K P+ N+
Subjt: LYTLSV--FKAPSLSANQ
|
|
| P14381 Transposon TX1 uncharacterized 149 kDa protein | 3.1e-24 | 25.96 | |
Query: AGLLDDSNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEP-DPGL
+G D + + + L K L+ + +AR R + L D D + +F+ + + R I L + + L + + + +++ +P P
Subjt: AGLLDDSNRQKRLALKSDLQEIALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYDVEP-DPGL
Query: IVDNIDWCP-INHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYR
+ D P ++ + L P + E+ + ++ + NK+PG DG TIEF++ FW TL V + FKK + + ++L+PKK + I +R
Subjt: IVDNIDWCP-INHIKASALIKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYR
Query: PISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMA
P+SL + YKI+AK ++ RLKS L + I P QS V GR I D + + + + + + + LD EKAFD+++ ++ L F Q+ ++
Subjt: PISLTTALYKILAKVLAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMA
Query: CISSVSYSIFLN
+S + +N
Subjt: CISSVSYSIFLN
|
|
| Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | 9.2e-08 | 22.6 | |
Query: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIA----LIPKKANSEKISKYRPISLTTALYKILAKVL
+P + +E+ IK APG DG T++ + + +F + ++ +V + A LIPK + E S +RPI++ +AL ++L ++L
Subjt: KPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIA----LIPKKANSEKISKYRPISLTTALYKILAKVL
Query: AERLKSCLVDTISPFQSAF--VCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSI-----
A+RL++ + + P Q + + G ++ L+ + + + +K ++ LD+ KAFD ++ S I L + G +I +S + +I
Subjt: AERLKSCLVDTISPFQSAF--VCGRQISDAILIANEAVDLWKCSKKRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQWREWIMACISSVSYSI-----
Query: --------------------FLNGRPRDQ---------------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKS
FL D+ G I L FADD+LL ++D+D + + +F + G+ +N KS
Subjt: --------------------FLNGRPRDQ---------------GHPSITHLLFADDILLFMQDDDKYIDNFFFIIKSFEQASGLRINLSKS
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|---|---|---|
| AT1G43760.1 DNAse I-like superfamily protein | 1.3e-25 | 29.01 | |
Query: FRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSN---RQKRLALKSDLQEI
FR+ +FL +P+F ++ W E G +S LK+ AKK K LN F + + + +L+ ++S L + S+ R + +A K
Subjt: FRFDNFLLANPSFTSNIERWWSESTASGHPGYSFIRRLKSLAKKVKDWKKLNTDSFKEKKRCLADDIQNLDILESAGLLDDSN---RQKRLALKSDLQEI
Query: ALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYD-----VEPDPGLIVDNIDWCPINHIKASAL
A E+ ++ Q+ + WL DGD NT +FHKV A + +N I L + + + +++ I+ +++ + + PD + +I N AS L
Subjt: ALFEARYWSQRCKKLWLSDGDENTAYFHKVCTARRRRNHIHELLSTNSLSLVADADLEKEILTHFSSIYD-----VEPDPGLIVDNIDWCPINHIKASAL
Query: IKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKIL
S++E+ + ++ NKAPGPD FT EF+ + W +K S + +FF+ + + N T I LIPK +++S +RP+S T +YKI+
Subjt: IKPFSEQEVYEGIKSIGSNKAPGPDGFTIEFYKKFWKTLKHSIMEVFHDFFKKKIVNRNVNHTYIALIPKKANSEKISKYRPISLTTALYKIL
|
|
| AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 6.1e-07 | 37.31 | |
Query: LPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPS
LP YLG+PL + + P++EKI+ R+ W LS GRL L+ SV++++ + +S F+ PS
Subjt: LPTSYLGVPLGGNPTKLTFWDPMIEKIKRRVDGWRFAMLSKGGRLTLLQSVLNNIPLYTLSVFKAPS
|
|
| AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 3.5e-10 | 39.51 | |
Query: LAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSK--KRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQW
+ ERLK + + I P Q++F+ GR +D I+ EAV + K K L+KLD+EKA+D+I W +++ L+ GFP W
Subjt: LAERLKSCLVDTISPFQSAFVCGRQISDAILIANEAVDLWKCSK--KRGFLIKLDIEKAFDKINWSFIDFVLMKKGFPFQW
|
|