| GenBank top hits | E-value | % identity | Alignment |
|---|---|---|---|
| GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] | 1.1e-95 | 30.41 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
L + +SVKLD N+ LW+ +VL ++ G K+DGY+L T P E + ++ + ++ Q D +++ T + ++ +TS+++W
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
Query: LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
+ + GA ++++I ++ + +KG MKM +YL MK NL + L L P + ++ LN N +V+ T + V + A+++
Subjt: LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
Query: VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
+ + + L +++ S + K+S W G R G S + ++ GL +I + +N+ G++ G + +LAS
Subjt: VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
Query: LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
+ D W DS A+NHVT +++GK L V N KL + G+S + S + L DIL+VP+I +NL+S+++L ADNN VEF +C
Subjt: LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
Query: GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
+ K + V + LL L + + S VS K WH+ LGH + KV + VL SC ++ SF + CQY K H LPF S S
Subjt: GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
Query: SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
++ PLEL+H D+WGP+P+ ++ G++YY+ FVD++SR T+IYPLK KS+ + F Q+K L EN+F+K++K +Q D E++ + + I R CP+
Subjt: SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
Query: TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
T QQNG ERKH + E GLTL+AQA +PL +WW+AFS+AV++ NRLP+QV SP+ + D L+ FGCAC+PCL+P
Subjt: TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
Query: --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
K NS+ + ++ L T+S TT + P + + N+ P+ +P + + ++S +E +++ PS
Subjt: --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
Query: S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
E +LD + TR+KSG+ KPK ++ + + EP KEAL+ PLW++AM E AL N+TW LV N
Subjt: S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
Query: LIGNKWIFKLK
++ +KW+FK K
Subjt: LIGNKWIFKLK
|
|
| GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] | 2.7e-94 | 30.15 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
L +++SVKLD N+ LW+ +VL+++ G K+DGY+L T P + V A+ ++V+ F ++ Q L L + ++I +TS+
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
Query: EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
++W + + GA +K+RI ++ N +KG MKM EYL MK S+ L N L+ T N +V+ + V + A+++A
Subjt: EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
Query: -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
+ SGL ++++ + + +F+ S W S RGM +G ++ + +R Y + + G S
Subjt: -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
Query: YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
++ASP D +W DS A NHVT ++NGK L V N KL + G S + + L D+L+VP I +NL+S+++L ADNN VEF
Subjt: YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
Query: HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
D ++ Q LL LS+ ++ +S WH+ LGH + KV + VL+ CN ++ SF + CQ+ K H LP
Subjt: HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
Query: FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
F S S + PL LIH D+WGP+P+ S G++YY+ F+D++SR T+I+PLK KSD F Q+K L EN+F+KK+K +Q D E+++ I + I
Subjt: FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
Query: RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
R CP+T QQNG ERKH V E+GLTL+AQA +PL++WW+AFS+AV++ NRLP+ V SP+ F+ D + L+ FGCAC+PCL+P
Subjt: RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
Query: --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
K NS+ T+ + S+++ + +TT + + + T N S S +
Subjt: --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
Query: IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
+ S ++ S EA +S+ S ++ D S TR+K G+ KPK ++ + + EP+++KEAL P+W++AM
Subjt: IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
Query: EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
E AL N TW+LV N+I +KWIFK K
Subjt: EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| PNY01489.1 copia-like polyprotein, partial [Trifolium pratense] | 1.1e-95 | 31.24 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
L +++SVKLD N+ LW+ +VL ++ G K DGY+L T P + V A+ + + + + A + ++ ++ +TS+++W +
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
Query: EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
+ GA +K+RI ++ N +KG MKM EYL MK S+ L N L+ T N +V+ + V + A+++A + +
Subjt: EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
Query: TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
SGL ++++ HS ++ S G RG G +S G G Y + + G S ++ASP D
Subjt: TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
Query: PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
+W DS A+NHVT ++NGK L V N KL + G S + + L D+L+VP I +NL+S+++L ADNN FVEF D
Subjt: PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
Query: YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
++ Q LL LS ++ + + V +S WH+ LGH + KV VL+ CN ++ SF + CQ+ K H LPF S S +
Subjt: YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
Query: PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
PL LIH D+WGP+P+ S G++YY+ F+D++SR T+I+PLK KSD F Q+K L EN+F+KK+K +Q D E+++ I + I R CP+T QQ
Subjt: PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
Query: NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
NG ERKH VVE+GLTL+AQA +PL++WW+AFS+AV++ NRL + V SP+ F+ D + L+ FGCAC+PCL+P +
Subjt: NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
Query: PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
K STT + S + T N S S + + ++GS + + S D++ TR+K+
Subjt: PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
Query: GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
G+ KPK ++ + + EP ++KEAL P+W++AM E AL N TW+LV N+I +KWIFK K
Subjt: GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| RVW85836.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 7.9e-94 | 30.78 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREV
L+ L VKLD N++LW+ + ++ + ++ + P S ++ A + R D +S + + A + +S
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREV
Query: WKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL--NLNHMVIPTRILGVAIGAEVVAVII---
W ALE+ + ++S+ARI +R LQ+ KKG++ M++Y+ +K A+ +L + P + LL L + N +V I I E V ++
Subjt: WKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL--NLNHMVIPTRILGVAIGAEVVAVII---
Query: ---VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR----------------------------------------YICA
+E+ S + S ++ + S + ++G RG + + + + G GR Y +
Subjt: ---VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR----------------------------------------YICA
Query: PNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQR
++N +N G+ +AS L+D W DS A++H+T NL + Y G K+T+ N L++ N G+ + S+ + LK + HVP I
Subjt: PNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQR
Query: NLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAY
NLIS+A+ +DNNA +EF + + Q L L+ +AF A + +S C ++WH LGHAS + +++SCN S
Subjt: NLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAY
Query: VNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKT
N+ + CQ +KSHRLP SLS + PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ YPL+ K VF ++K VEN+FD K+K
Subjt: VNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKT
Query: LQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFL
LQSD EFRSF +FL + I R CP+ QNG VERKH VVE GL L+A ASLP++FW AF +A F+ NR+P++VL +SP+ F+ V D L
Subjt: LQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFL
Query: RVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNI----ASPTLSPVSRPL
RVFGC C+P +RP LT P+V P + S + S T +P I +PT P
Subjt: RVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNI----ASPTLSPVSRPL
Query: SPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALY
S ++ SP++ +S S +S ++ PSS + +TR G+ + K S ++SEP T+K+AL P W QAM E AL+
Subjt: SPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALY
Query: HNRTWSLVSPPPNTNLIGNKWIFKLK
N+TW LV PP NLIG KW++KLK
Subjt: HNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| RVX23584.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 7.9e-94 | 30.68 | |
Query: SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
SSP S+ +I S HP L+ L VKLD N++LW+ + ++ + ++ + P S ++ A + R D
Subjt: SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
Query: AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
+S + + A + +S W ALE+++ ++S+ARI +R LQ+ KKG++ M++Y+ +K A+++L + P + LL L
Subjt: AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
Query: -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
+ N +V I I E V ++ +E+ S + S ++ + S + ++G RG + + + + G GR
Subjt: -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
Query: -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
Y + ++N +N G+ +AS L+D W DS A++H+T NL + Y G K+T+ N L++
Subjt: -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
Query: NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
N G+ + S+ + LK + HVP I NLIS+A+ +DNNA +EF + + Q L L+ +AF A + +S C
Subjt: NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
Query: SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
++WH LGHAS + +++SCN S N+ + CQ +KSHRLP SLS + PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ Y
Subjt: SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
Query: PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
PL+ K VF ++K VEN+FD K+K LQSD EFRSF +FL + I R CP+ QNG VERKH VVE GL L+A ASLP++FW AF +A F
Subjt: PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
Query: ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
+ NR+P++VL +SP+ F+ V D LRVFGC C+P +RP LT P+V P + S
Subjt: ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
Query: PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
+ S T +P I +PT P S ++ SP++ +S S +S ++ PSS + +TR G+ + K S
Subjt: PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
Query: QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
++SEP T+K+AL P W QAM E AL+ N+TW LV PP NLIG KW++KLK
Subjt: QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| TrEMBL top hits | E-value | % identity | Alignment |
|---|---|---|---|
| A0A2K3NEN7 Copia-like polyprotein (Fragment) | 5.3e-96 | 31.24 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
L +++SVKLD N+ LW+ +VL ++ G K DGY+L T P + V A+ + + + + A + ++ ++ +TS+++W +
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
Query: EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
+ GA +K+RI ++ N +KG MKM EYL MK S+ L N L+ T N +V+ + V + A+++A + +
Subjt: EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
Query: TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
SGL ++++ HS ++ S G RG G +S G G Y + + G S ++ASP D
Subjt: TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
Query: PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
+W DS A+NHVT ++NGK L V N KL + G S + + L D+L+VP I +NL+S+++L ADNN FVEF D
Subjt: PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
Query: YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
++ Q LL LS ++ + + V +S WH+ LGH + KV VL+ CN ++ SF + CQ+ K H LPF S S +
Subjt: YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
Query: PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
PL LIH D+WGP+P+ S G++YY+ F+D++SR T+I+PLK KSD F Q+K L EN+F+KK+K +Q D E+++ I + I R CP+T QQ
Subjt: PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
Query: NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
NG ERKH VVE+GLTL+AQA +PL++WW+AFS+AV++ NRL + V SP+ F+ D + L+ FGCAC+PCL+P +
Subjt: NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
Query: PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
K STT + S + T N S S + + ++GS + + S D++ TR+K+
Subjt: PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
Query: GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
G+ KPK ++ + + EP ++KEAL P+W++AM E AL N TW+LV N+I +KWIFK K
Subjt: GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| A0A2Z6MBG6 Integrase catalytic domain-containing protein | 5.3e-96 | 30.41 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
L + +SVKLD N+ LW+ +VL ++ G K+DGY+L T P E + ++ + ++ Q D +++ T + ++ +TS+++W
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
Query: LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
+ + GA ++++I ++ + +KG MKM +YL MK NL + L L P + ++ LN N +V+ T + V + A+++
Subjt: LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
Query: VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
+ + + L +++ S + K+S W G R G S + ++ GL +I + +N+ G++ G + +LAS
Subjt: VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
Query: LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
+ D W DS A+NHVT +++GK L V N KL + G+S + S + L DIL+VP+I +NL+S+++L ADNN VEF +C
Subjt: LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
Query: GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
+ K + V + LL L + + S VS K WH+ LGH + KV + VL SC ++ SF + CQY K H LPF S S
Subjt: GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
Query: SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
++ PLEL+H D+WGP+P+ ++ G++YY+ FVD++SR T+IYPLK KS+ + F Q+K L EN+F+K++K +Q D E++ + + I R CP+
Subjt: SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
Query: TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
T QQNG ERKH + E GLTL+AQA +PL +WW+AFS+AV++ NRLP+QV SP+ + D L+ FGCAC+PCL+P
Subjt: TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
Query: --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
K NS+ + ++ L T+S TT + P + + N+ P+ +P + + ++S +E +++ PS
Subjt: --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
Query: S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
E +LD + TR+KSG+ KPK ++ + + EP KEAL+ PLW++AM E AL N+TW LV N
Subjt: S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
Query: LIGNKWIFKLK
++ +KW+FK K
Subjt: LIGNKWIFKLK
|
|
| A0A2Z6P4D5 Integrase catalytic domain-containing protein | 1.3e-94 | 30.15 | |
Query: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
L +++SVKLD N+ LW+ +VL+++ G K+DGY+L T P + V A+ ++V+ F ++ Q L L + ++I +TS+
Subjt: LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
Query: EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
++W + + GA +K+RI ++ N +KG MKM EYL MK S+ L N L+ T N +V+ + V + A+++A
Subjt: EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
Query: -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
+ SGL ++++ + + +F+ S W S RGM +G ++ + +R Y + + G S
Subjt: -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
Query: YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
++ASP D +W DS A NHVT ++NGK L V N KL + G S + + L D+L+VP I +NL+S+++L ADNN VEF
Subjt: YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
Query: HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
D ++ Q LL LS+ ++ +S WH+ LGH + KV + VL+ CN ++ SF + CQ+ K H LP
Subjt: HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
Query: FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
F S S + PL LIH D+WGP+P+ S G++YY+ F+D++SR T+I+PLK KSD F Q+K L EN+F+KK+K +Q D E+++ I + I
Subjt: FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
Query: RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
R CP+T QQNG ERKH V E+GLTL+AQA +PL++WW+AFS+AV++ NRLP+ V SP+ F+ D + L+ FGCAC+PCL+P
Subjt: RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
Query: --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
K NS+ T+ + S+++ + +TT + + + T N S S +
Subjt: --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
Query: IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
+ S ++ S EA +S+ S ++ D S TR+K G+ KPK ++ + + EP+++KEAL P+W++AM
Subjt: IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
Query: EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
E AL N TW+LV N+I +KWIFK K
Subjt: EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| A0A438KQV8 Retrovirus-related Pol polyprotein from transposon RE1 | 3.8e-94 | 30.68 | |
Query: SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
SSP S+ +I S HP L+ L VKLD N++LW+ + ++ + ++ + P S ++ A + R D
Subjt: SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
Query: AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
+S + + A + +S W ALE+++ ++S+ARI +R LQ+ KKG++ M++Y+ +K A+++L + P + LL L
Subjt: AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
Query: -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
+ N +V I I E V ++ +E+ S + S ++ + S + ++G RG + + + + G GR
Subjt: -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
Query: -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
Y + ++N +N G+ +AS L+D W DS A++H+T NL + Y G K+T+ N L++
Subjt: -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
Query: NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
N G+ + S+ + LK + HVP I NLIS+A+ +DNNA +EF + + Q L L+ +AF A + +S C
Subjt: NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
Query: SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
++WH LGHAS + +++SCN S N+ + CQ +KSHRLP SLS + PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ Y
Subjt: SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
Query: PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
PL+ K VF ++K VEN+FD K+K LQSD EFRSF +FL + I R CP+ QNG VERKH VVE GL L+A ASLP++FW AF +A F
Subjt: PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
Query: ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
+ NR+P++VL +SP+ F+ V D LRVFGC C+P +RP LT P+V P + S
Subjt: ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
Query: PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
+ S T +P I +PT P S ++ SP++ +S S +S ++ PSS + +TR G+ + K S
Subjt: PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
Query: QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
++SEP T+K+AL P W QAM E AL+ N+TW LV PP NLIG KW++KLK
Subjt: QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| A0A803PM38 Uncharacterized protein | 1.4e-101 | 32.99 | |
Query: ASTIVSLSFGHPLSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVN
A IV FG L+ ++KLD NF LWR MV AI+ G ++DGY+ T+ +P E + +++ + F + Q L L+ +
Subjt: ASTIVSLSFGHPLSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVN
Query: I-------KTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQ-------ASENLYENLFLIRPTTLLAALNLNHMVIPTRILGVAI
I +S +W ALEE++GA SKA+++ R +Q +KGA+ M +YL +Q A E EN + + +L+ L++ + +P +L A
Subjt: I-------KTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQ-------ASENLYENLFLIRPTTLLAALNLNHMVIPTRILGVAI
Query: GAEVVAVIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGRYICAPNNNQGG--NNGGSS---------------TYLA
G+ + + L + S + LHS F S + S +L+ + G + NNN+GG NN GS+ T
Subjt: GAEVVAVIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGRYICAPNNNQGG--NNGGSS---------------TYLA
Query: SPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVG-NSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHP
+ + A+NH+TS+ + +K +YNGK+K+TV+N ++L ++++G SL SA SP++LK+ILHVP I +NL+SI++L +DNN VEF
Subjt: SPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVG-NSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHP
Query: LLSC-----DGQ-----RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSI-----------WHQCLGHASVKVFNSVLRSCNQSAYV
L GQ + K+ P T + +SN +F+ V S+ V+K + WH+ LGH S++V ++VL N +
Subjt: LLSC-----DGQ-----RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSI-----------WHQCLGHASVKVFNSVLRSCNQSAYV
Query: NEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSD
N SF D CQ KSH LPF + + PLEL+H D+WGPSP+ S +RYYI F+D++SR T+IYPLK KS+ F Q+K LVEN+F+ +VK +Q+D
Subjt: NEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSD
Query: LEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFG
E++ F F I +HPCPHT QNG ERKH +VEMGLTL+AQA +P K+WWDAF +AV++ NRLPT VL +P+E F+ D FL+VFG
Subjt: LEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFG
Query: CACFPCLR----------PTKLTN-------------SNTTVPNVCSLVIV----LPTKS----STTASSPQSVSVP-----------------------
+CFPCLR TK N S+T + VI P KS + +P SV VP
Subjt: CACFPCLR----------PTKLTN-------------SNTTVPNVCSLVIV----LPTKS----STTASSPQSVSVP-----------------------
Query: ---TQSPNIASPTLSPVSRPLSPI------------------------AQSPALCLSSG------SEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLS
T + +PT S V LS + L S S + H+L + S+ TRAK+G+FKPK ++L+
Subjt: ---TQSPNIASPTLSPVSRPLSPI------------------------AQSPALCLSSG------SEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLS
Query: TCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
SEP++I+EAL W AM +E AL N TW LV P+ ++I NKW++K K
Subjt: TCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
|
|
| SwissProt top hits | E-value | % identity | Alignment |
|---|---|---|---|
| P04146 Copia protein | 2.6e-23 | 25.51 | |
Query: ILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSC
++ + ++ DS A++H+ +D + K+ V+ + +Y +V I L+D+L NL+S+ RL + +EF
Subjt: ILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSC
Query: DGQRYKESPSVSSQPLLTAASNS--NNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRS--CNQSAYVNEVDSFYDVCQ---YSKSHRL
K ++S L+ ++ NN+ + + AY + + K + +WH+ GH S + R + + +N ++ ++C+ K RL
Subjt: DGQRYKESPSVSSQPLLTAASNS--NNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRS--CNQSAYVNEVDSFYDVCQ---YSKSHRL
Query: PFS--RSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRS--FSSFLIA
PF + + K PL ++H D+ GP + Y++ FVD+++ Y +K KSD F +F + A E F+ KV L D RE+ S F +
Subjt: PFS--RSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRS--FSSFLIA
Query: SRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSS--PWECAFRIVLDISFLRVFGCACF
I PHT Q NG+ ER + E T+++ A L FW +A +A ++ NR+P++ L SS P+E + LRVFG +
Subjt: SRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSS--PWECAFRIVLDISFLRVFGCACF
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 8.9e-32 | 25.29 | |
Query: YLASPEILSDPKWLADSSATNHVT--SDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFV
+L+ PE +W+ D++A++H T D + D+ + + N S + +G+ + ++ + ++LKD+ HVP ++ NLIS L D
Subjt: YLASPEILSDPKWLADSSATNHVT--SDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFV
Query: EFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHR
+ + R + V ++ + N + ++ ++S +WH+ +GH S K + + S D C + K HR
Subjt: EFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHR
Query: LPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIAS
+ F S L+L++ D+ GP + S G +Y+++F+D+ SR ++Y LK K F+VF ++ ALVE + +K+K L+SD E+ R F + +
Subjt: LPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIAS
Query: RIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACF---PCLRPTKL
I P T Q NG+ ER + +VE +++ A LP FW +A +A ++ NR P+ L P + S L+VFGC F P + TKL
Subjt: RIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACF---PCLRPTKL
Query: TNSN--------------------TTVPNVCSLVIVLPTKSSTTASS----------PQSVSVPTQS--PNIASPTLSPVS----RPLSPIAQSPALCLS
+ + + S +V TA+ P V++P+ S P A T VS +P I Q L
Subjt: TNSN--------------------TTVPNVCSLVIVLPTKSSTTASS----------PQSVSVPTQS--PNIASPTLSPVS----RPLSPIAQSPALCLS
Query: SGSEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQ---AMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKL
G E E R++ + + + S S ++ EP ++KE L+ P Q AM E +L N T+ LV P + KW+FKL
Subjt: SGSEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQ---AMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKL
Query: K
K
Subjt: K
|
|
| Q07163 Transposon TyH3 Gag-Pol polyprotein | 2.7e-12 | 25.08 | |
Query: ILHVPHIQRNLISIARLIA-DNNAFVEFHPLLSCDGQ------RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVK
+LH P+I +L+S+ L A D A + L DG +Y + VS + LL +N+S+ ++ + ES++ K H+ L HA+ +
Subjt: ILHVPHIQRNLISIARLIA-DNNAFVEFHPLLSCDGQ------RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVK
Query: VFNSVLRSCNQSAYVNEVDSFYD-----------VCQYSKSHRLPFSR-SLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPL--KL
L++ N Y NE D + + + +K + SR +S P + +H D++GP Y+ISF DE ++ ++YPL +
Subjt: VFNSVLRSCNQSAYVNEVDSFYD-----------VCQYSKSHRLPFSR-SLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPL--KL
Query: KSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFIT
+ VF A ++N+F V +Q D E+ R+ FL + I + + +G+ ER + +++ T + + LP W+ A + +
Subjt: KSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFIT
Query: NRLPTQVLDGSSPWECAFRIVLDISFLRVFG
N L + S + A LDIS L FG
Subjt: NRLPTQVLDGSSPWECAFRIVLDISFLRVFG
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.5e-63 | 30.35 | |
Query: LASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFH
L SP S WL DS AT+H+TSD NL++ Y G + V++ S + + + G++ + S + P+ L +IL+VP+I +NLIS+ RL N VEF
Subjt: LASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFH
Query: PLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVE--SSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDV--CQYSKSH
P ++ + PLL + L +A + V +S S K + S WH LGH + + NSV+ + + S +N F C +KS+
Subjt: PLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVE--SSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDV--CQYSKSH
Query: RLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASR
++PFS+S +S PLE I+ D+W SP+ S YRYY+ FVD ++R T++YPLK KS F +K L+EN+F ++ T SD EF + +
Subjt: RLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASR
Query: IDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP-------
I PHT + NG+ ERKH +VE GLTL++ AS+P +W AF+ AV++ NRLPT +L SP++ F + LRVFGCAC+P LRP
Subjt: IDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP-------
Query: ----------TKLTNS--------------------------------------------------NTTVPNVCSLVIVLPTKSST----TASSPQSVSV
LT S +TT+P + VLP S + A+ P S S
Subjt: ----------TKLTNS--------------------------------------------------NTTVPNVCSLVIVLPTKSST----TASSPQSVSV
Query: PTQSPNIASPTL----------------------SPVSRPL---------------SPIAQSP---ALCLSSGSEASHSLPSSESSLDKS----------
P ++ ++S L P ++P +P +SP A LS+ +++S S PS +S S
Subjt: PTQSPNIASPTL----------------------SPVSRPL---------------SPIAQSP---ALCLSSGSEASHSLPSSESSLDKS----------
Query: -------------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPP-NTNLIGNKW
TRAK+G+ KP S + ++ SEPRT +AL WR AM +E A N TW LV PPP + ++G +W
Subjt: -------------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPP-NTNLIGNKW
Query: IFKLK
IF K
Subjt: IFKLK
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.0e-64 | 31.56 | |
Query: WLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYK
WL DS AT+H+TSD NL+ Y G + +++ S + + + G++ +P+S+ + + L +L+VP+I +NLIS+ RL N VEF P ++
Subjt: WLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYK
Query: ESPSVSSQPLLTAASNSNNLSLS-SLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYV----NEVDSFYDVCQYSKSHRLPFSRSLSS
+ PLL + + + A + +S K + S WH LGH S+ + NSV+ N S V +++ S D C +KSH++PFS S +
Subjt: ESPSVSSQPLLTAASNSNNLSLS-SLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYV----NEVDSFYDVCQYSKSHRLPFSRSLSS
Query: SKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHT
S PLE I+ D+W SP+ S YRYY+ FVD ++R T++YPLK KS F +K+LVEN+F ++ TL SD EF +L I PHT
Subjt: SKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHT
Query: RQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLR------------------
+ NG+ ERKH +VEMGLTL++ AS+P +W AFS AV++ NRLPT +L SP++ F + L+VFGCAC+P LR
Subjt: RQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLR------------------
Query: -------------------------------PTKLTN------------------SNTTVP-------------------------------------NV
P TN S+TT+P N+
Subjt: -------------------------------PTKLTN------------------SNTTVP-------------------------------------NV
Query: CSLVIVLPTKSSTTASS---PQSVSVP--TQSPNIASPTL--------SPVS------RPLSPIAQSPALCLSSGSEASHSLPSSESSL-----------
S I P+ S TA S PQ + P TQ+ N SP L SP S P SPI+ SP + S S + + PSS S+
Subjt: CSLVIVLPTKSSTTASS---PQSVSVP--TQSPNIASPTL--------SPVS------RPLSPIAQSPALCLSSGSEASHSLPSSESSL-----------
Query: --------------DKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLV-SPPPNTNLIGNKWIFKLK
+TRAK G+ KP S+ ++ ++ SEPRT +A+ WRQAM +E A N TW LV PPP+ ++G +WIF K
Subjt: --------------DKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLV-SPPPNTNLIGNKWIFKLK
|
|