| GenBank top hits | e value | %identity | Alignment |
|---|
| GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] | 3.9e-170 | 35.28 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
+SW + Q+QLL FE R+EQL + + ++ AN + + N S R R KN + SRS+ S
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
Query: NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
+ +G + +A + + + D WY +SGASNH+T +++ G + G+G +L I GSS +KS L L +++VP I+KNL+S+S+L
Subjt: NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
Query: DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
DN+++VEF ++CC VKDK T KV+L+G LKDGLY++ PS + C Q
Subjt: DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
Query: -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
+D +++WI+PLK KS+ V F QFK+L EN ++ +IKVI+CD GGEYKP+ +A + GIQ +++
Subjt: -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
Query: CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
CPYTS QNGR ERKHRH+ E L LLA A+MPL YWWEAF T+VYLIN +P++ + ++++ +++PDY L+ FG ACYPCL+ Y HK +HTT+C
Subjt: CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
Query: VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
VF+GYS++HK ++IN P+++ + G + + + P+ TN +N +T N
Subjt: VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
Query: SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
+ T+ + ++ +TS H + TR KSG+ KPK + G + +M EP + +AL P W+ AM++EF L N+TW LV
Subjt: SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
Query: PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
P + ++V +KW+FK K G++ KARLVA+GF QT GID+ ET++PVIK T +IIL++A +W VRQLD++N FLNG L E +FM QP+ F D
Subjt: PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
Query: PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
KP++IC+L KA+YGLKQ PRA Q S F K F L + +L + K L ++ LGI
Subjt: PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
Query: QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
++ R + G +L Q+ Y+ DLL + +++ PTPM T + L DP+++R I L Y+ T PDIAF+VN LSQ+ +P HW +KRILR
Subjt: QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
Query: YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
YL GT N L ++P+ + +TG+SD DWA DRKS+ G CVFLG +LISWSS+KQ+VV+R STESEYR LA +A ++A
Subjt: YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
|
|
| GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] | 1.7e-173 | 35.4 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
+SW + Q+QLL FE RL+Q ++ SAN A+ + N F + +++ R K M T + +G
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
Query: ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
+K + SA I +P H +D WY +SGA+NH+T +++ G + G+G +L I GS + + NL L +++VPQI+KNL+S+S
Subjt: ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
Query: RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
+LT DN+++VEF +CC+VKDK T + LL+G LKDGLY++ S+K + C Q
Subjt: RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
Query: -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
+D +++WIFPLK KSD + F QFK+L EN ++ KIK+I+CD GGEYK + ++ + GIQ +
Subjt: -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
Query: LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
++CPYTS QNGR ERKHRHV E L LLA AKMPL+YWWEAF T+VYLIN +P+ + ++++ K +PDYN+L+ FG ACYPCL+ Y HK FHTT
Subjt: LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
Query: KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
+CVF+GYS++HK INS P + TTS SSD +E
Subjt: KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
Query: LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
+ + + S ++ + +++T I + +++ H M TR K G+ KPK A+T D EP S +AL P W+ AM
Subjt: LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
Query: KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
+E+ L N TW LVP +++ +KWIFK K + G+I KARLVA+GF QT G+DF ET++PV+K T +IILT+A ++W VRQLD++N FLNG
Subjt: KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
Query: PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
L E +FM QP+ + D KP++IC+L KA+YGLKQ PRA R T F K +F T FL + + I ++FT++L
Subjt: PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
Query: ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
L ++ LG+++ R G +L Q Y++D+L + ++E+ + PTPM A +++ N P+LYR I +L Y+ TRPDIAFAVN LSQ+
Subjt: ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
Query: QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
P HW +KRILRYL GT N L ++P+ ++ + G+ D DWA DRKS GG CVFLG +L+SW+S+KQ+VV+R STESEYR+LA + +++
Subjt: QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
|
|
| KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 1.8e-175 | 36.8 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
++W E Q+QLL +E RLEQ+ + ++ PS+N+++ Q N R ++ Y R KN
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
Query: MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
+ + + + S + QN N +A + +P + D WY +SGASNH+T D + + ++ G +T G+G L I G S + + K+L LK +++VP
Subjt: MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
Query: QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
+I+KNL+SIS+LT DN + VEF D C VKDK T ++LLEG +KDGLY++P +I+++ C
Subjt: QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
Query: Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
+ +D +++WI+PLK KSD F QF++LVEN ++ +IK ++CD GGE
Subjt: Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
Query: YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
+K + + + GIQ++ +CPYTSAQNGR ERKHRHVVE+ L LLA AKMPL YWWEAF T+V+LIN +PT+ I K + L + PDY +++ FG ACY
Subjt: YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
Query: PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
PCL+ Y HK FHTTKCVF+GYS +HK +NS S DG R + P L +P SPT NV+ E+ + N+S+
Subjt: PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
Query: ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
T + I+ +T S + H M TR K G+ KPKK + G + EP + +AL+ P W++AM EF L
Subjt: ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
Query: TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
N+TW LVP +++ KW+FK K A G I KARLVA+GF QTLG+D+ ET++PVIK +T +IIL++A ++W +RQ+D++N FLNG L E +F
Subjt: TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
Query: MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
M QP+ F D +P +IC+L KA+YGLKQ PR+ + R+ F+ MS FL + +I S SF +L +F + +
Subjt: MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
Query: STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
++ LG++ R + G +L Q YV DLL + +LEH+ S PTPM S++ ++++ N P+LYR I L Y+ TRPDIA++VN LSQ+ QAP H
Subjt: STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
Query: WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
W S+KR+ RYL GT N L ++P+ + +TG+SD DWA DRKSV GYCVFLG SLI+WSSKKQ+VV+R STES
Subjt: WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
|
|
| PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense] | 3.0e-170 | 36.5 | Show/hide |
Query: SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
S SG NQ + + SA I +P + +D WY +SGASNH+T + + G + G+G +L I GS + +KNL L +++VP+I+KNL+S
Subjt: SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
Query: ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
+S+LT DN++IVEF CC+VKDK T K LL+G LK+GLY++ ++ S S C Q
Subjt: ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
Query: ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
+D +++WI+PLK KS+ + F QFK+LVEN ++ +IK+++CD GGEYK + +A
Subjt: ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
Query: SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
+ GIQ +++CPYTS QNGR ERKHRHV E L +LA A+MPL YWWEAF TSVYLIN +P+ +T++ K++PDY+ L+ FG ACYPCL+ Y
Subjt: SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
Query: HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
HK FHTT+CVF+GYS++HK + + N P + +DG +
Subjt: HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
Query: ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
HESS L S N ++ + + +S I +++ +T+ H M TR K G+ KPK+ + +A T EP + +AL P
Subjt: ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
Query: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
W+ AM EF L N TW LVP +++ +KW+FK K A G+I KARLVA+GF QT G+D+ ET++PV+K T +IIL++A ++W VRQLD++
Subjt: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
Query: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
N FLNG L E +FM QP+ + D KP++IC+L KA+YGLKQ PRA Q S F T + L + + + +T + F
Subjt: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
Query: --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
+ T L ++ LGI++ R++ G +L Q+ Y+ DLL + +E + PTPM T + + +P++YR I +L Y+ TRPDIAFAVN
Subjt: --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
Query: YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
LSQ+ +P HW +KRILRYL GT N L ++P+ + + G+SD DWA DRKS+ G CVFLG SLISWSS+KQ+VV+R STESEYR LA +A +
Subjt: YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
|
|
| RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 2.6e-166 | 35.07 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
MS + S LL E+RL + ++SV SANLA+ Q +N + ++ + +R TN RS SS
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
Query: -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
+G NP+ AM+ +P + D AW+ ++GA++HL+ + LS Y GN+ + G+G L I + G+++ SS K L+ +
Subjt: -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
Query: MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
+HVP I+ NLIS+S+ DN+ EF VKD+ T+K+LL+G+L+ GLYR P
Subjt: MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
Query: -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
S+ AS P G + +D ++SWI+PL K A++VF +FKSLVEN ++
Subjt: -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
Query: SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
S+I+ +R D GGE+K + GI+ Q +CPYT QNGR ERK RH++ET LALLA A +P ++W AFHT+++LIN +PT+ + + F +L + P
Subjt: SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
Query: DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
+Y+ +IFG CYP +R Y +K + +++CVF+GYSSNHK + + + T S+ D S ++ TP L SP S+
Subjt: DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
Query: TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
T + +S+ + P +I + ST+ P T+ HPMVTR K+G++K K +F + +EPT++ A+K +W AM++EF+ L
Subjt: TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
Query: KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
+N TW LVPP SN +++G KW++KLK G + YKARLVAQGF+QTLG+D+ ET++PV+K T +IIL +A +++WSV QLDV N FL+G L E +FM
Subjt: KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
Query: PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
QP F + + P ++C+L KALYGLKQ PRA + T + + + + F+ H LI L SF +RL
Subjt: PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
Query: LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
L ++ + LGI++ RS FHLSQ Y QDLLSR + K A TP +++ F+D +LYRST+ +L Y+ LTRPDI+FAVN QF P
Subjt: LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
Query: QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
HWL++KRILRYL GT + G+ +Q + S+ + GY+D DWA CP DR+S GGY +FLG +L+SWSS KQ+VV+R S ESEYR LA ++
Subjt: QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 8.8e-176 | 36.8 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
++W E Q+QLL +E RLEQ+ + ++ PS+N+++ Q N R ++ Y R KN
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
Query: MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
+ + + + S + QN N +A + +P + D WY +SGASNH+T D + + ++ G +T G+G L I G S + + K+L LK +++VP
Subjt: MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
Query: QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
+I+KNL+SIS+LT DN + VEF D C VKDK T ++LLEG +KDGLY++P +I+++ C
Subjt: QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
Query: Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
+ +D +++WI+PLK KSD F QF++LVEN ++ +IK ++CD GGE
Subjt: Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
Query: YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
+K + + + GIQ++ +CPYTSAQNGR ERKHRHVVE+ L LLA AKMPL YWWEAF T+V+LIN +PT+ I K + L + PDY +++ FG ACY
Subjt: YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
Query: PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
PCL+ Y HK FHTTKCVF+GYS +HK +NS S DG R + P L +P SPT NV+ E+ + N+S+
Subjt: PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
Query: ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
T + I+ +T S + H M TR K G+ KPKK + G + EP + +AL+ P W++AM EF L
Subjt: ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
Query: TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
N+TW LVP +++ KW+FK K A G I KARLVA+GF QTLG+D+ ET++PVIK +T +IIL++A ++W +RQ+D++N FLNG L E +F
Subjt: TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
Query: MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
M QP+ F D +P +IC+L KA+YGLKQ PR+ + R+ F+ MS FL + +I S SF +L +F + +
Subjt: MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
Query: STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
++ LG++ R + G +L Q YV DLL + +LEH+ S PTPM S++ ++++ N P+LYR I L Y+ TRPDIA++VN LSQ+ QAP H
Subjt: STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
Query: WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
W S+KR+ RYL GT N L ++P+ + +TG+SD DWA DRKSV GYCVFLG SLI+WSSKKQ+VV+R STES
Subjt: WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
|
|
| A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment) | 1.5e-170 | 36.5 | Show/hide |
Query: SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
S SG NQ + + SA I +P + +D WY +SGASNH+T + + G + G+G +L I GS + +KNL L +++VP+I+KNL+S
Subjt: SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
Query: ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
+S+LT DN++IVEF CC+VKDK T K LL+G LK+GLY++ ++ S S C Q
Subjt: ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
Query: ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
+D +++WI+PLK KS+ + F QFK+LVEN ++ +IK+++CD GGEYK + +A
Subjt: ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
Query: SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
+ GIQ +++CPYTS QNGR ERKHRHV E L +LA A+MPL YWWEAF TSVYLIN +P+ +T++ K++PDY+ L+ FG ACYPCL+ Y
Subjt: SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
Query: HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
HK FHTT+CVF+GYS++HK + + N P + +DG +
Subjt: HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
Query: ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
HESS L S N ++ + + +S I +++ +T+ H M TR K G+ KPK+ + +A T EP + +AL P
Subjt: ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
Query: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
W+ AM EF L N TW LVP +++ +KW+FK K A G+I KARLVA+GF QT G+D+ ET++PV+K T +IIL++A ++W VRQLD++
Subjt: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
Query: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
N FLNG L E +FM QP+ + D KP++IC+L KA+YGLKQ PRA Q S F T + L + + + +T + F
Subjt: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
Query: --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
+ T L ++ LGI++ R++ G +L Q+ Y+ DLL + +E + PTPM T + + +P++YR I +L Y+ TRPDIAFAVN
Subjt: --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
Query: YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
LSQ+ +P HW +KRILRYL GT N L ++P+ + + G+SD DWA DRKS+ G CVFLG SLISWSS+KQ+VV+R STESEYR LA +A +
Subjt: YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
|
|
| A0A2Z6MBG6 Integrase catalytic domain-containing protein | 1.9e-170 | 35.28 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
+SW + Q+QLL FE R+EQL + + ++ AN + + N S R R KN + SRS+ S
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
Query: NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
+ +G + +A + + + D WY +SGASNH+T +++ G + G+G +L I GSS +KS L L +++VP I+KNL+S+S+L
Subjt: NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
Query: DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
DN+++VEF ++CC VKDK T KV+L+G LKDGLY++ PS + C Q
Subjt: DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
Query: -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
+D +++WI+PLK KS+ V F QFK+L EN ++ +IKVI+CD GGEYKP+ +A + GIQ +++
Subjt: -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
Query: CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
CPYTS QNGR ERKHRH+ E L LLA A+MPL YWWEAF T+VYLIN +P++ + ++++ +++PDY L+ FG ACYPCL+ Y HK +HTT+C
Subjt: CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
Query: VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
VF+GYS++HK ++IN P+++ + G + + + P+ TN +N +T N
Subjt: VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
Query: SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
+ T+ + ++ +TS H + TR KSG+ KPK + G + +M EP + +AL P W+ AM++EF L N+TW LV
Subjt: SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
Query: PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
P + ++V +KW+FK K G++ KARLVA+GF QT GID+ ET++PVIK T +IIL++A +W VRQLD++N FLNG L E +FM QP+ F D
Subjt: PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
Query: PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
KP++IC+L KA+YGLKQ PRA Q S F K F L + +L + K L ++ LGI
Subjt: PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
Query: QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
++ R + G +L Q+ Y+ DLL + +++ PTPM T + L DP+++R I L Y+ T PDIAF+VN LSQ+ +P HW +KRILR
Subjt: QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
Query: YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
YL GT N L ++P+ + +TG+SD DWA DRKS+ G CVFLG +LISWSS+KQ+VV+R STESEYR LA +A ++A
Subjt: YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
|
|
| A0A2Z6P4D5 Integrase catalytic domain-containing protein | 8.2e-174 | 35.4 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
+SW + Q+QLL FE RL+Q ++ SAN A+ + N F + +++ R K M T + +G
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
Query: ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
+K + SA I +P H +D WY +SGA+NH+T +++ G + G+G +L I GS + + NL L +++VPQI+KNL+S+S
Subjt: ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
Query: RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
+LT DN+++VEF +CC+VKDK T + LL+G LKDGLY++ S+K + C Q
Subjt: RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
Query: -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
+D +++WIFPLK KSD + F QFK+L EN ++ KIK+I+CD GGEYK + ++ + GIQ +
Subjt: -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
Query: LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
++CPYTS QNGR ERKHRHV E L LLA AKMPL+YWWEAF T+VYLIN +P+ + ++++ K +PDYN+L+ FG ACYPCL+ Y HK FHTT
Subjt: LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
Query: KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
+CVF+GYS++HK INS P + TTS SSD +E
Subjt: KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
Query: LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
+ + + S ++ + +++T I + +++ H M TR K G+ KPK A+T D EP S +AL P W+ AM
Subjt: LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
Query: KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
+E+ L N TW LVP +++ +KWIFK K + G+I KARLVA+GF QT G+DF ET++PV+K T +IILT+A ++W VRQLD++N FLNG
Subjt: KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
Query: PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
L E +FM QP+ + D KP++IC+L KA+YGLKQ PRA R T F K +F T FL + + I ++FT++L
Subjt: PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
Query: ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
L ++ LG+++ R G +L Q Y++D+L + ++E+ + PTPM A +++ N P+LYR I +L Y+ TRPDIAFAVN LSQ+
Subjt: ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
Query: QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
P HW +KRILRYL GT N L ++P+ ++ + G+ D DWA DRKS GG CVFLG +L+SW+S+KQ+VV+R STESEYR+LA + +++
Subjt: QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
|
|
| A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.3e-166 | 35.07 | Show/hide |
Query: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
MS + S LL E+RL + ++SV SANLA+ Q +N + ++ + +R TN RS SS
Subjt: MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
Query: -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
+G NP+ AM+ +P + D AW+ ++GA++HL+ + LS Y GN+ + G+G L I + G+++ SS K L+ +
Subjt: -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
Query: MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
+HVP I+ NLIS+S+ DN+ EF VKD+ T+K+LL+G+L+ GLYR P
Subjt: MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
Query: -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
S+ AS P G + +D ++SWI+PL K A++VF +FKSLVEN ++
Subjt: -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
Query: SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
S+I+ +R D GGE+K + GI+ Q +CPYT QNGR ERK RH++ET LALLA A +P ++W AFHT+++LIN +PT+ + + F +L + P
Subjt: SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
Query: DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
+Y+ +IFG CYP +R Y +K + +++CVF+GYSSNHK + + + T S+ D S ++ TP L SP S+
Subjt: DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
Query: TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
T + +S+ + P +I + ST+ P T+ HPMVTR K+G++K K +F + +EPT++ A+K +W AM++EF+ L
Subjt: TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
Query: KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
+N TW LVPP SN +++G KW++KLK G + YKARLVAQGF+QTLG+D+ ET++PV+K T +IIL +A +++WSV QLDV N FL+G L E +FM
Subjt: KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
Query: PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
QP F + + P ++C+L KALYGLKQ PRA + T + + + + F+ H LI L SF +RL
Subjt: PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
Query: LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
L ++ + LGI++ RS FHLSQ Y QDLLSR + K A TP +++ F+D +LYRST+ +L Y+ LTRPDI+FAVN QF P
Subjt: LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
Query: QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
HWL++KRILRYL GT + G+ +Q + S+ + GY+D DWA CP DR+S GGY +FLG +L+SWSS KQ+VV+R S ESEYR LA ++
Subjt: QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04146 Copia protein | 7.2e-50 | 32.42 | Show/hide |
Query: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
SW+ A+ E N TW + N ++V ++W+F +K + G + YKARLVA+GF+Q ID+ ET+ PV + +F+ IL+L ++ V Q+DV
Subjt: SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
Query: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPR----ACQRQTPPCSFF-------------ARTKKYFFLCMSMTCFLRVPPHTYLIS
FLNG L E I+M P+ +C+L KA+YGLKQ R ++ C F + ++ + + + ++
Subjt: NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPR----ACQRQTPPCSFF-------------ARTKKYFFLCMSMTCFLRVPPHTYLIS
Query: LFKSFTSR--LLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSI-TASSKVLFNDPSLYRSTIRSLHYVLL-TRPDIA
FK + + +++E +GI+I +LSQ+ YV+ +LS+ ++E+ + TP+ + +S N P RS I L Y++L TRPD+
Subjt: LFKSFTSR--LLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSI-TASSKVLFNDPSLYRSTIRSLHYVLL-TRPDIA
Query: FAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASI--SLTGYSDVDWAGCPIDRKSVGGYCV-FLGSSLISWSSKKQQVVARFSTESEYRT
AVN LS+++ W +LKR+LRYL GT ++ L + + + GY D DWAG IDRKS GY +LI W++K+Q VA STE+EY
Subjt: FAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASI--SLTGYSDVDWAGCPIDRKSVGGYCV-FLGSSLISWSSKKQQVVARFSTESEYRT
Query: L
L
Subjt: L
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.5e-81 | 31.25 | Show/hide |
Query: EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEY--KPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPL
+D + W++ LK K VF +F +LVE K+K +R D GGEY + S GI+ + P T NG ER +R +VE ++L AK+P
Subjt: EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEY--KPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPL
Query: QYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNH------------KVAASSINSPTS
+W EA T+ YLIN P+ + ++ V ++ Y+ L++FG + + + Q K D + C+FIGY + + + S
Subjt: QYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNH------------KVAASSINSPTS
Query: TTTSSSDGRSHESSLATPRLLDYPSPTNVSIN-EETMDAINSSATRPSVINSSTTRPSTSI----HPMVTRDK----SGVTKPKKFFGCYSQAQTSMDWS
+++D + P + PS +N + E T D ++ +P + + + HP ++ +P+ Y + +
Subjt: TTTSSSDGRSHESSLATPRLLDYPSPTNVSIN-EETMDAINSSATRPSVINSSTTRPSTSI----HPMVTRDK----SGVTKPKKFFGCYSQAQTSMDWS
Query: CNEPTSYIDALKVP---SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQII
EP S + L P +AM+EE +L KN T+ LV + KW+FKLK+D +V YKARLV +GF Q GIDF E ++PV+K + + I
Subjt: CNEPTSYIDALKVP---SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQII
Query: LTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA--------------CQRQTPPCSFFAR--TKKYFFLCM
L+LA + D V QLDV FL+G L E I+M QP+ FE K H +C+L K+LYGLKQ PR + + PC +F R + L +
Subjt: LTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA--------------CQRQTPPCSFFAR--TKKYFFLCM
Query: SMTCFLRVPPHTYLISLFKSFTSRL--LFEISEQSTILGIQIT--RSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPS----
+ L V LI+ K S+ + ++ ILG++I R+S+ LSQ Y++ +L R ++++ K TP+ + + S K+
Subjt: SMTCFLRVPPHTYLISLFKSFTSRL--LFEISEQSTILGIQIT--RSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPS----
Query: ----LYRSTIRSLHYVLL-TRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSL
Y S + SL Y ++ TRPDIA AV +S+F + P + HW ++K ILRYL GT L + I L GY+D D AG +RKS GY
Subjt: ----LYRSTIRSLHYVLL-TRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSL
Query: ISWSSKKQQVVARFSTESEY
ISW SK Q+ VA +TE+EY
Subjt: ISWSSKKQQVVARFSTESEY
|
|
| P92519 Uncharacterized mitochondrial protein AtMg00810 | 6.1e-33 | 41.85 | Show/hide |
Query: LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
LGIQI G LSQ Y + +L+ + K S P P+ +SS++ + + DPS +RS + +L Y+ LTRPDI++AVN + Q P A + L
Subjt: LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
Query: KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
KR+LRY+ GT GL + + +++ + D DWAGC R+S G+C FLG ++ISWS+K+Q V+R STE+EYR LA A +L
Subjt: KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 5.8e-132 | 33.24 | Show/hide |
Query: PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
P A + W L+SGA++H+TSD +NLSL YTG + + DG+ +PI + GS+ + + + L L +++VP I KNLIS+ RL N V VE
Subjt: PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
Query: FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
F + VKD T LL+G KD LY PS K SC
Subjt: FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
Query: -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
P L D +Y+W++PLK KS F FK+L+EN + ++I D GGE+ + SQ GI +
Subjt: -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
Query: PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
P+T NG ERKHRH+VET L LL+HA +P YW AF +VYLIN +PT + + F L P+Y+ LR+FG ACYP LR Y HK D + +CV
Subjt: PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
Query: FIGYS-------------------------------SNHKVAASSINS---------------PTST---------------TTSSSDGRSHESSLATPR
F+GYS SN+ S + PT T T SS +S +
Subjt: FIGYS-------------------------------SNHKVAASSINS---------------PTST---------------TTSSSDGRSHESSLATPR
Query: LLDY-------------------PSPT--------------NVSINEETMDA-----------INSSATRPSVINS------STTRPSTSIHP-------
LD P PT N S N T ++ SS++ PS S S T PS IHP
Subjt: LLDY-------------------PSPT--------------NVSINEETMDA-----------INSSATRPSVINS------STTRPSTSIHP-------
Query: -------------MVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKLKRD
M TR K+G+ KP K+ S A S EP + I ALK W+ AM E N TWDLV PP S+ +VG +WIF K +
Subjt: -------------MVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKLKRD
Query: AHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQV
+ G++ YKARLVA+G++Q G+D+ ET++PVIK + +I+L +A W +RQLDV+N FL G L + ++M QP F D +P+Y+C+LRKALYGLKQ
Subjt: AHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQV
Query: PRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPP-----HTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYV
PRA S F R K ++ + + L H L +L + F+ + + E LGI+ R G HLSQ Y+
Subjt: PRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPP-----HTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYV
Query: QDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTAS
DLL+R ++ K TPM S ++ S DP+ YR + SL Y+ TRPDI++AVN LSQF P + H +LKRILRYL GT N G+ L+ +
Subjt: QDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTAS
Query: ISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
+SL YSD DWAG D S GY V+LG ISWSSKKQ+ V R STE+EYR++A+ + ++
Subjt: ISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 7.0e-130 | 31.89 | Show/hide |
Query: PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
P A + W L+SGA++H+TSD +NLS YTG + + DG+ +PI + GS+ + +S ++L L +++VP I KNLIS+ RL N V VE
Subjt: PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
Query: FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
F + VKD T LL+G KD LY PS K SC
Subjt: FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
Query: -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
P L D +Y+W++PLK KS F FKSLVEN + ++I + D GGE+ + SQ GI +
Subjt: -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
Query: PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
P+T NG ERKHRH+VE L LL+HA +P YW AF +VYLIN +PT + + F L + P+Y L++FG ACYP LR Y HK + + +C
Subjt: PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
Query: FIGYS---------------------------------------------------------------------------------------------SN
F+GYS S+
Subjt: FIGYS---------------------------------------------------------------------------------------------SN
Query: HKVAASSINSPTST--TTSSSDG-------RSHESSLATPRLLDYPSPTNVSINEETMDA--------------INSSATRPSVINSSTT----------
+ +SSI+SP+S+ T S +G ++S + +L+ P+P + S N ++ ++S + P+ +SS+T
Subjt: HKVAASSINSPTST--TTSSSDG-------RSHESSLATPRLLDYPSPTNVSINEETMDA--------------INSSATRPSVINSSTT----------
Query: ---------RPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKL
+ + H M TR K G+ KP + YS A TS+ + +EP + I A+K W++AM E N TWDLV PP + +VG +WIF
Subjt: ---------RPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKL
Query: KRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGL
K ++ G++ YKARLVA+G++Q G+D+ ET++PVIK + +I+L +A W +RQLDV+N FL G L + ++M QP F D +P Y+CRLRKA+YGL
Subjt: KRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGL
Query: KQVPRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTI---LGIQITRSSKGFHLSQATY
KQ PRA S F R + ++ + + L T L+ S+ F + E + LGI+ R +G HLSQ Y
Subjt: KQVPRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTI---LGIQITRSSKGFHLSQATY
Query: VQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTA
DLL+R ++ K TPM S +T S DP+ YR + SL Y+ TRPD+++AVN LSQ+ P HW +LKR+LRYL GT + G+ L+
Subjt: VQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTA
Query: SISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
++SL YSD DWAG D S GY V+LG ISWSSKKQ+ V R STE+EYR++A+ + +L
Subjt: SISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|
| AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.7e-67 | 36.8 | Show/hide |
Query: EPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAP
EP++Y +A + W AM +E + TW++ N +G KW++K+K ++ G I YKARLVA+G++Q GIDF ET++PV K + ++IL ++
Subjt: EPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAP
Query: TWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFE----DPKKPHYICRLRKALYGLKQVPR-------------ACQRQTPPCSFFARTKKYFFLCM----
+++++ QLD+SN FLNG L+E I+M P + D P+ +C L+K++YGLKQ R + ++F + FLC+
Subjt: TWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFE----DPKKPHYICRLRKALYGLKQVPR-------------ACQRQTPPCSFFARTKKYFFLCM----
Query: --SMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTI
+ C L S KS L ++ LG++I RS+ G ++ Q Y DLL L K + PM S + +A S F D YR I
Subjt: --SMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTI
Query: RSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVV
L Y+ +TR DI+FAVN LSQF++AP+ AH ++ +IL Y+ GT GL A + L +SD + C R+S GYC+FLG+SLISW SKKQQVV
Subjt: RSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVV
Query: ARFSTESEYRTLA
++ S E+EYR L+
Subjt: ARFSTESEYRTLA
|
|
| ATMG00240.1 Gag-Pol-related retrotransposon family protein | 5.0e-14 | 43.59 | Show/hide |
Query: YVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYC
Y+ +TRPD+ FAVN LSQF+ A + A ++ ++L Y+ GT GL T+ + L ++D DWA CP R+SV G+C
Subjt: YVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYC
|
|
| ATMG00810.1 DNA/RNA polymerases superfamily protein | 4.4e-34 | 41.85 | Show/hide |
Query: LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
LGIQI G LSQ Y + +L+ + K S P P+ +SS++ + + DPS +RS + +L Y+ LTRPDI++AVN + Q P A + L
Subjt: LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
Query: KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
KR+LRY+ GT GL + + +++ + D DWAGC R+S G+C FLG ++ISWS+K+Q V+R STE+EYR LA A +L
Subjt: KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
|
|
| ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 7.7e-23 | 46.21 | Show/hide |
Query: MVTRDKSGVTK--PKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLV
M+TR K+G+ K PK YS T+ EP S I ALK P W +AM+EE L++N+TW LVPP N +++G KW+FK K + G + KARLV
Subjt: MVTRDKSGVTK--PKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLV
Query: AQGFSQTLGIDFHETYTPVIKPMTFQIILTLA
A+GF Q GI F ETY+PV++ T + IL +A
Subjt: AQGFSQTLGIDFHETYTPVIKPMTFQIILTLA
|
|