| GenBank top hits | e value | %identity | Alignment |
|---|
| KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | 9.1e-274 | 50.42 | Show/hide |
Query: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
M +RG+ PAA E ++ A E LSPRT++ L +VE S+ ++R + + ++ E+ +L R E P QN R RGRR E FR
Subjt: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
Query: NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
R FQ R + +PQ + D+ + WQ +E+ ++SS E+ + NDD+ R ++ +NE ++ E +YKMKIDLP YDGK+NIE F
Subjt: NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
Query: LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
LDW+K+TENFF YM + KKVHLVALKL+ GAS WWDQ+ +NRQ+ GK IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R +YIEEFH L
Subjt: LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
Query: ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI R+K+ ++RP WE ++++ KT S ++ QE +
Subjt: ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
Query: RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
+K + +N Y RP G +RCGQ H SN CPQRKTIA+A++ S S E ++ETE+IEAD+ + +SC++QR R
Subjt: RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
Query: CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
CTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H PYKIGW++KGGE +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Subjt: CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
Query: ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
ENTYE WM +KV+LLP+ ++ + + ++ LF T +L ++P+I +EP LPPLRDI H+I
Subjt: ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
Query: DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
+L+SGAS P+L HY MSP EYK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+
Subjt: DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
Query: SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
S YHQIRIRPGDEWKT FKTNEGLFEW M+K F + L ELY+N KK +F
Subjt: SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
Query: KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
EIAFL F+I++ + M+ KK+EAI W P ++ ++QAFLGLASFYRKFI+N +S+AAP+TD F
Subjt: KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
Query: SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
FEV VD C TGIG VL+QQ HPIEYFSE+L+ SRQSWSTYEQELYALVRALKQWEHYLLS+EF+LLTDHFSLK
Subjt: SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
|
|
| TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | 3.7e-291 | 59.62 | Show/hide |
Query: PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
P R E +N++ G GRRAR N++N N QRRRP Q D++ QE+ E WQ Q+ DSS GDEQGN+WN + + R + + E RR YH
Subjt: PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
Query: DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
DYKMKIDLP Y+GK++IE+FLDWIK+TENFF YM P+RKKVHLVALKL+ GAS W P +Y Q + Y+
Subjt: DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
Query: QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+GGLR DIKEKVKL FR LSEAIS AETVEEM+ VR KN RR AWET ++ +Y KT
Subjt: QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
Query: NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
++QPSTS KGK ++ QE +K E + +QNNY+RP LGK FRCG+ HLSNNC QRKTIA+AE E MS +E E+ETELIEADD +R+SC+
Subjt: NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
Query: IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
+QR T + + ++ K +N K HP PYKIGWV+KGGE ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+
Subjt: IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
Query: LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
LH+GRENTYE QWMG+KV+LLP+ +KN E +R ++QLF T +L EFP +K+EP+GLPPLRDI
Subjt: LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
Query: QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
QH IDL+ ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSK
Subjt: QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
Query: IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
IDL++GYHQI+IRPGDEWKT FKTNEGLFE ++ +K LTE ELYIN KK ++ +EI FL F+IK+G I MEPKKIEAI + P P SIK
Subjt: IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
Query: EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
E+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Subjt: EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
|
|
| XP_011648447.2 uncharacterized protein LOC105434464 [Cucumis sativus] | 1.7e-256 | 75.39 | Show/hide |
Query: MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDL VYDGKQNIEAFLDWIKSTENFFNYMD PE KKVHLVALKLR
Subjt: MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
Query: EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
AETVEEMIAVRSK
Subjt: EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
Query: NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
NLKRRPAW+TTSTRMNNYADKTNDQPSTSTKGKGKEVENQEV VERKNEQ FKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
Subjt: NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
Query: KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
K AEDE ELIEADD ERVSCVIQR ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPT YKIGWVRK GE TVSEI
Subjt: KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
Query: CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEK-QLFTTQLLYEFPRIKEEPEGLPPLR
CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQ MGRKVVLLPITRKNKEGLR E + QLLYEFPRIKEEPEGLPPLR
Subjt: CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEK-QLFTTQLLYEFPRIKEEPEGLPPLR
Query: DIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
DIQHHIDLI GASLPNLAHYRMSPQEYKTLHDHIEELLKKGHI+PSLSPCAVPALLT KK GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
Subjt: DIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
Query: SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
SKIDLKSGYHQIRIRPGDEWKTTFKT EGLFEWM
Subjt: SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
|
|
| XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus] | 8.4e-296 | 84.24 | Show/hide |
Query: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNP
MAGRR NPA GENR QE AEEIT LSP+T+TVRLLAVEESLGDL N FDRL+ESVELL+RR+E+PQPPPRNEINFQN+QRFGE RGRRAR RN+NNP
Subjt: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNP
Query: RGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFN
RG QRRRPGYAI QQ DEDFQEDQE WQE QEDDSSSGDEQGNMWNFND+ RAGRNNQR E RRGEYHDYKMKIDLP+YDGK+NIEAFLDWIKSTENFFN
Subjt: RGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFN
Query: YMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQ
YMD PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ +RSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSV +YIEEFH LSARTNLSENEQHQ
Subjt: YMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQ
Query: IARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQ
+ARFVGGLR DIKEKV+LQPFRFLSEAISFAETVEEMIA+RSKNL RR AWET ST+ KTNDQPSTSTK KGKE++NQEVAVERK EQTFK S Q
Subjt: IARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQ
Query: NNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIID
N+YSRP LGK FRCGQT HLS+NCPQRKTIAIAEEG Q+SEDS EAE+ETELIEADD ERVSCVIQR RCTINGRVCDVIID
Subjt: NNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIID
Query: NDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKV
+ SS+NFVAKKLVTVLNLKAEAHP PYKIGWVRKGGE TVSEICTVPLSI NAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKV
Subjt: NDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKV
Query: VLLPITRKNKEGLRGEKQLFTT
VLLPIT+K EGLRGEKQLF T
Subjt: VLLPITRKNKEGLRGEKQLFTT
|
|
| XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus] | 1.3e-299 | 79.67 | Show/hide |
Query: MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
MWN NDDLRAGRNN+R EVRRGEYHDYKMKIDLP+YDGK+NIEAFLDWIKSTENFFNYMD PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ IRSW
Subjt: MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
Query: EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSV DYIEEFH LSARTNLSENEQHQ+ARFVG ETVEEMIA+RSK
Subjt: EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
Query: NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
NL RR AWETTST+ KTNDQPSTSTKGKGKEV+NQEVAVERK EQTFK S QN+YSRP LGK FRCGQT HLSNNCPQRKTIAIAEEG Q SEDS
Subjt: NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
Query: KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
EAE+ETELIEADD ERVSC IQR RCTINGRVCDVIID+ SS+NFVAKKLV VLNLKAEAHPTPYKIGWVRKGGE TVSEI
Subjt: KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
Query: CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTT---------------------
CTVPLSI NAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKVVLLPIT+K EGLRGEKQLF T
Subjt: CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTT---------------------
Query: ---------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWR
QLL+EFP IKEEP+GLPPLRDIQHHIDLI GASLPNLAHYRMSPQEYK LHDHIEELLKKGHI+PSLSPCAVPALLTPKK GSWR
Subjt: ---------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWR
Query: MCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
MCVDSRAINRITVKYRF IPRISDLLDQLGKASIFSKIDLKSGYHQIR+RPGDEWKT FKTNEGLFEWM
Subjt: MCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A5A7T256 Reverse transcriptase | 8.9e-251 | 63.96 | Show/hide |
Query: YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYAD
Y+QYQNCRQG + V +YIEEFH LSAR NLSENEQHQIARF+GGLR DIKEKVKL FR LSEAIS AETVEEM+ VR KN RR AWET ++ +Y
Subjt: YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYAD
Query: KTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMS-EDSKEAEDETELIEADDEERV
KT++QPSTS KGK ++ QE +K E + +QNNY+RP LGK FRCG+ HLSNNC QRKTIA+AE E MS D +E E+ETELIEADD +R+
Subjt: KTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMS-EDSKEAEDETELIEADDEERV
Query: SCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD
SC++QR T + + ++ K +N K HP PYKIGWV+KGGE ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+D
Subjt: SCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD
Query: TQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPL
TQ+LH+GRENTYE QWMG+KV+LLP+ +KN E +R ++QLF T +L EFP +K+EP+GLPPL
Subjt: TQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPL
Query: RDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASI
RDIQH IDL+ ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA I
Subjt: RDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASI
Query: FSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPA
FSKIDL++GYHQI+IRPGDEWKT FKTNEGLFE ++ +K LTE ELYIN KK ++ +EI FL F+IK+G I MEPKKIEAIH+ P P
Subjt: FSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPA
Query: SIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFV
SIKE+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEH+LLS +F
Subjt: SIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFV
Query: LL
++
Subjt: LL
|
|
| A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.3e-222 | 46.89 | Show/hide |
Query: KNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRR-KEYPQPPPRNEINFQNNQRFGEARGRR-ARENFRNVNNPRGFQ
K P+ G QE EEI LS RT+TVRLLAVE+SLGDL DR+++ ++ L+RR E P P + R RG R R N+RN N R Q
Subjt: KNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRR-KEYPQPPPRNEINFQNNQRFGEARGRR-ARENFRNVNNPRGFQ
Query: RRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDI
RRRP Q D++ QE+ E WQ I
Subjt: RRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDI
Query: PERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARF
KKVHLVALKL+ GAS WWDQLE+NRQ+
Subjt: PERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARF
Query: VGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYS
+ EEM+ VR KN +R WET ++ + KT++QPSTS KGK ++ QE K E + +QNNY+
Subjt: VGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYS
Query: RPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS
RP LGK FRCG+ +HLSNNCPQRKTIA+AE E MSE KE ++E ELIEAD+ +R+SC++QR RCTI+G+VCDVIID+ S
Subjt: RPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS
Query: SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLL
S+NFVA+KLV LNLK + HP PYKIGWV+K GE ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLL RPW+ D ++ L
Subjt: SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLL
Query: PITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY
+ K++ G + +L EFP +K+EP+GLPPL DIQH IDL+ GASLP+L HYRMSP+EY+ LHD+IE LLKKGHI+PSLSPC VPALLTPKK
Subjt: PITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY
Query: GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------AFTTS---------KK
SWRMCVDSRAINRITVKY F IP++ DLLDQLGKA++FSKIDL+S YHQIRIRP DEWKTTFK NEGLFEW+ G FT+ +K
Subjt: GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------AFTTS---------KK
Query: NVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------
L E ELYIN KK F+ +EI FL F+IK+G I MEPKK+EAI +WP P SIKE+QAFLGLASFY++FIRNF+S+ PLTDY
Subjt: NVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------
Query: ------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
F+SPFEV VDAC GIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLL KEFVLLT+HFSLK Q
Subjt: ------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
|
|
| A0A5B7BER3 Uncharacterized protein | 1.9e-213 | 44.08 | Show/hide |
Query: SSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQR
+++ D +F D GR + R +Y+MKIDLP ++G +IE+FLDWI E FF+ M+I + K+V LVA KL+ GAS WWDQ++ NR+R
Subjt: SSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQR
Query: CGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETV
GKQ +R+W+KM++LL+ RFLP +YEQ LY QYQNCRQG RSV +Y +EF+ LS+R NL+E E Q+AR+VGGLR I++++ L+ L+EA S A V
Subjt: CGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETV
Query: EEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKG-----KEVENQEVAVERKNEQTFKTSSQ---NNYSRPLLGKFFRCGQTEHLSNNCPQ
E R+P S R +Y D + +Q + + +G +++ ++ A KN+ T SQ N Y+RP+ GK FRC Q H SN CP
Subjt: EEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKG-----KEVENQEVAVERKNEQTFKTSSQ---NNYSRPLLGKFFRCGQTEHLSNNCPQ
Query: RK---TIAIAEEGRQMSEDSKEAE--DE---TELIEADDEERVSCVIQ------------------RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNL
R+ + + E+ E+ +EAE DE E+ E D+ E VSCV+Q R RCTIN +VCDVIID+ SS+N V+K LV L L
Subjt: RK---TIAIAEEGRQMSEDSKEAE--DE---TELIEADDEERVSCVIQ------------------RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNL
Query: KAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLP--------------
K E HP PYKIGW++KG E V+EIC VP SI YKD++ CD+++MD CH+LLGRPWQ+D + HKG++NTY W +KVVL+P
Subjt: KAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLP--------------
Query: ----ITRKNKEGLRGEKQL---------------------FTTQLLYEFPRI--KEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIE
+T + + K+ LL EF I E P+ LPP+RDIQHHIDL+ GASLPNL HYRMSP+E + L +E
Subjt: ----ITRKNKEGLRGEKQL---------------------FTTQLLYEFPRI--KEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIE
Query: ELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM-
+L+ KG IQ S+SPCAVPALLTPKK GSWRMCVDSRAIN+ITVKYRF IPR++D+LD L + IFSKIDL+SGYHQIRIRPGDEWKT FKT EGL+EW+
Subjt: ELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM-
Query: ---------------------------------------HKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTW
+R ++ + L E++LYIN KK F+ + FL F+I I ++ +K+ AI W
Subjt: ---------------------------------------HKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTW
Query: PIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD---------------------------------YFSSPFEVEVDACCTGIGVVLAQQGHPIEYFS
P P ++ +I++F GLA+FYR+FIRNF+S+ AP+TD F F+V+ DA TGIG VL+Q+G P+E+FS
Subjt: PIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD---------------------------------YFSSPFEVEVDACCTGIGVVLAQQGHPIEYFS
Query: EKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
EKLN +RQ W+TYE EL+A+VRALK WEHYL+ +EFV+ +DH +LK
Subjt: EKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
|
|
| A0A5D3DGR0 Reverse transcriptase | 4.4e-274 | 50.42 | Show/hide |
Query: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
M +RG+ PAA E ++ A E LSPRT++ L +VE S+ ++R + + ++ E+ +L R E P QN R RGRR E FR
Subjt: MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
Query: NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
R FQ R + +PQ + D+ + WQ +E+ ++SS E+ + NDD+ R ++ +NE ++ E +YKMKIDLP YDGK+NIE F
Subjt: NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
Query: LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
LDW+K+TENFF YM + KKVHLVALKL+ GAS WWDQ+ +NRQ+ GK IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R +YIEEFH L
Subjt: LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
Query: ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI R+K+ ++RP WE ++++ KT S ++ QE +
Subjt: ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
Query: RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
+K + +N Y RP G +RCGQ H SN CPQRKTIA+A++ S S E ++ETE+IEAD+ + +SC++QR R
Subjt: RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
Query: CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
CTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H PYKIGW++KGGE +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Subjt: CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
Query: ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
ENTYE WM +KV+LLP+ ++ + + ++ LF T +L ++P+I +EP LPPLRDI H+I
Subjt: ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
Query: DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
+L+SGAS P+L HY MSP EYK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+
Subjt: DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
Query: SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
S YHQIRIRPGDEWKT FKTNEGLFEW M+K F + L ELY+N KK +F
Subjt: SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
Query: KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
EIAFL F+I++ + M+ KK+EAI W P ++ ++QAFLGLASFYRKFI+N +S+AAP+TD F
Subjt: KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
Query: SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
FEV VD C TGIG VL+QQ HPIEYFSE+L+ SRQSWSTYEQELYALVRALKQWEHYLLS+EF+LLTDHFSLK
Subjt: SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
|
|
| A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X1 | 1.8e-291 | 59.62 | Show/hide |
Query: PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
P R E +N++ G GRRAR N++N N QRRRP Q D++ QE+ E WQ Q+ DSS GDEQGN+WN + + R + + E RR YH
Subjt: PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
Query: DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
DYKMKIDLP Y+GK++IE+FLDWIK+TENFF YM P+RKKVHLVALKL+ GAS W P +Y Q + Y+
Subjt: DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
Query: QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+GGLR DIKEKVKL FR LSEAIS AETVEEM+ VR KN RR AWET ++ +Y KT
Subjt: QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
Query: NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
++QPSTS KGK ++ QE +K E + +QNNY+RP LGK FRCG+ HLSNNC QRKTIA+AE E MS +E E+ETELIEADD +R+SC+
Subjt: NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
Query: IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
+QR T + + ++ K +N K HP PYKIGWV+KGGE ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+
Subjt: IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
Query: LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
LH+GRENTYE QWMG+KV+LLP+ +KN E +R ++QLF T +L EFP +K+EP+GLPPLRDI
Subjt: LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
Query: QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
QH IDL+ ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSK
Subjt: QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
Query: IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
IDL++GYHQI+IRPGDEWKT FKTNEGLFE ++ +K LTE ELYIN KK ++ +EI FL F+IK+G I MEPKKIEAI + P P SIK
Subjt: IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
Query: EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
E+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Subjt: EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 2.0e-42 | 30.62 | Show/hide |
Query: EGLRGEKQLFTTQLLYEFPRIK-EEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEY-KTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGS---
E L E++ LL ++ I+ E + L +H I+ + +LP + Y PQ Y + + I+++L +G I+ S SP P + PKK +
Subjt: EGLRGEKQLFTTQLLYEFPRIK-EEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEY-KTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGS---
Query: --WRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG----------------------
+R+ +D R +N ITV R IP + ++L +LG+ + F+ IDL G+HQI + P KT F T G +E++ G
Subjt: --WRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG----------------------
Query: ----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNF
F+TS L + L + K F+K+E FL V+ I P+KIEAI +PIP KEI+AFLGL +YRKFI NF
Subjt: ----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNF
Query: NSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRAL
+A P+T +Y F+ F + DA +G VL+Q GHP+ Y S LN ++ST E+EL A+V A
Subjt: NSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRAL
Query: KQWEHYLLSKEFVLLTDH
K + HYLL + F + +DH
Subjt: KQWEHYLLSKEFVLLTDH
|
|
| P10401 Retrovirus-related Pol polyprotein from transposon gypsy | 1.0e-38 | 27.36 | Show/hide |
Query: LHDHIEELLKKGHIQPSLSPCAVPALLTPKK----YG--SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTT
+++ +++LLK G I+PS SP P + KK +G + R+ +D R +N T+ R+ +P I +L LGKA F+ +DLKSGYHQI + D KT+
Subjt: LHDHIEELLKKGHIQPSLSPCAVPALLTPKK----YG--SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTT
Query: FKTNEGLFEW---------------------MHKQRG-----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSIS
F N G +E+ + +Q G F+ ++ + + L + + ++ +K+ F K + +L F++ +
Subjt: FKTNEGLFEW---------------------MHKQRG-----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSIS
Query: MEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY----------------------------------------------FSSPFE
+P+K++AI +P P + ++++FLGLAS+YR FI++F ++A P+TD F PF+
Subjt: MEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY----------------------------------------------FSSPFE
Query: VEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLL-SKEFVLLTDH----FSLKQRQQGGRCPKQKRLPTHIVVFENH
+ DA +GIG VL+Q+G PI S L Q+++T E+EL A+V AL + +++L S+E + TDH F++ R K KR ++I + H
Subjt: VEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLL-SKEFVLLTDH----FSLKQRQQGGRCPKQKRLPTHIVVFENH
Query: SIQAFTRPIRRRY
+ + F +P + +
Subjt: SIQAFTRPIRRRY
|
|
| P20825 Retrovirus-related Pol polyprotein from transposon 297 | 2.5e-40 | 27.29 | Show/hide |
Query: HKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKG
H +E T++L+ + K +N E GEK FT + + + + H+ + S Y ++ + + ++E+L +G
Subjt: HKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKG
Query: HIQPSLSPCAVPALLTPKK-----YGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHK
I+ S SP P + PKK +R+ +D R +N IT+ R+ IP + ++L +LGK F+ IDL G+HQI + KT F T G +E++
Subjt: HIQPSLSPCAVPALLTPKK-----YGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHK
Query: QRG--------------------------------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWP
G F+TS L + L + K F+K+E FL ++ I P K++AI ++P
Subjt: QRG--------------------------------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWP
Query: IPASIKEIQAFLGLASFYRKFIRNFNSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYF
IP KEI+AFLGL +YRKFI N+ +A P+T +Y F F + DA +G VL+Q GHPI +
Subjt: IPASIKEIQAFLGLASFYRKFIRNFNSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYF
Query: SEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
S LN ++S E+EL A+V A K + HYLL ++F++ +DH L+
Subjt: SEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
|
|
| Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 2.7e-42 | 29.27 | Show/hide |
Query: IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
++H I++ GA LP L Y ++ + + ++ +++LL I PS SPC+ P +L PKK G++R+CVD R +N+ T+ F +PRI +LL ++G A IF+
Subjt: IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
Query: KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
+DL SGYHQI + P D +KT F T G +E+ G F+ S + + L L + KK
Subjt: KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
Query: MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
F E FL + I I+ K AI +P P ++K+ Q FLG+ ++YR+FI N + +A P+ + +SP
Subjt: MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
Query: ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
+ + DA GIG VL + + + YFS+ L +++++ E EL +++AL + + L K F L TDH SL Q
Subjt: ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
|
|
| Q99315 Transposon Ty3-G Gag-Pol polyprotein | 2.7e-42 | 29.27 | Show/hide |
Query: IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
++H I++ GA LP L Y ++ + + ++ +++LL I PS SPC+ P +L PKK G++R+CVD R +N+ T+ F +PRI +LL ++G A IF+
Subjt: IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
Query: KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
+DL SGYHQI + P D +KT F T G +E+ G F+ S + + L L + KK
Subjt: KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
Query: MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
F E FL + I I+ K AI +P P ++K+ Q FLG+ ++YR+FI N + +A P+ + +SP
Subjt: MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
Query: ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
+ + DA GIG VL + + + YFS+ L +++++ E EL +++AL + + L K F L TDH SL Q
Subjt: ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
|
|