| GenBank top hits | E-value | % identity | Alignment |
|---|---|---|---|
| DAD45748.1 TPA_asm: hypothetical protein HUJ06_003978 [Nelumbo nucifera] | 4.9e-105 | 44.17 | |
Query: MNGG---NNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGK-----
MNGG +NICVDKL +NYSYWKLCMEA+LQGQDLWDL++ D+ IP D +N E ++KWK+KCGK LFALRT I KEYIEHV D+ + +
Subjt: MNGG---NNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGK-----
Query: --------------HLKGCLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRC
L G + +Q I NLL +EAL+KQM N+ +VE A++ + K N S K NDS+ +++EGQSKGN K C+RC
Subjt: --------------HLKGCLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRC
Query: GKPGHIK----RDCRAKVS--------------------------------------------------------------------------------Y
GKPGHI+ R+CR ++ +
Subjt: GKPGHIK----RDCRAKVS--------------------------------------------------------------------------------Y
Query: CNE---------------EGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDA
C + EG NV +++ G+SL+DVYHVPGLKKNL SVSQI D GRYVLFGPN+V+I+ N+K EAD+L TGKRK+SLYVLSA+DA
Subjt: CNE---------------EGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDA
Query: YVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFV
YV++TGQN SAALWHARLGH+GYQLLQ+IS +KLLDG+P+FK+ HHD+VY MGPT+TPSYSG +YVM+FV
Subjt: YVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFV
Query: DDFSRFTWVYFLKAKSETFSKFV
DDF RFTWVYFL+ KSE FSKF+
Subjt: DDFSRFTWVYFLKAKSETFSKFV
|
|
| KAA8537014.1 hypothetical protein F0562_029492 [Nyssa sinensis] | 2.7e-103 | 56.95 | |
Query: DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
DLWDL+ GDD IP DT N E ++KWK+KCGK LFALRT I +EYIEHVRDV K G G + GCS + L E
Subjt: DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
Query: NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
+ HY A V ++++S V +EG NVK+D N GVSL+DVYHVPGLKKNL
Subjt: NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
Query: VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
SVSQIAD+GRYVLF PNDVKI+ NIK EADV+ TGKRKDSLYVLSASDAYVE+TGQ+ S LWHARLGHVGYQLLQ+IS KKLLDGVPLFKEIH DVV
Subjt: VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
Query: YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
LGCQ+GKSHRLPFPNS NR L +VHSDLMGPTRTPSY G YVMV VD FSRFTWV+FL+ KSETFSKF+
Subjt: YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis] | 9.6e-109 | 44.96 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
MNGG+++ +DKL +NYSYWKLCMEA+LQGQDLWDL+SGDD IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
Query: ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
+ +++ C + L+ L++ R QG SI + L +EALMKQM +N
Subjt: ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
KQ +VE A+Y KDK K NS K SS D+K SK EGQS+GN + C+RCGK GH+KRDCR KV
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
Query: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
+Y N +EG NVK D N GVSL+DVY
Subjt: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
Query: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
HVPGLKKNL SVSQIAD GRYVLFGP+DVKI+ NIK EADVL TGKRKDSLYVLSASDAYVE+ GQN S LWHARLGHVGYQLL +IS KKLLDGVPL
Subjt: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
Query: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
FKEIH DVV GCQ+GKSHR PFPNS NR AL +
Subjt: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
|
|
| KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis] | 2.7e-103 | 42.02 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
MNGG+++ VDKL +NYSY KLCMEA+LQGQ+LWDL+SGDD I DT +NVE +RKWK+K GK LFALRTSI +EYI+HVRD + K L+
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
Query: -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
LK G + +++L+ F +EALMKQ+ NN
Subjt: -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
KQ +VE A+Y KDK K NS K SS DSK SK +GQS+GN K +RCGK GH+KRDC KV
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
Query: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
+Y N +EG NVK D NV+GVSL+DVY
Subjt: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
Query: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
HVP LKKNL SVSQI D GRYVLFGP+DVKI+ NIK EADVL TGKRKDSLYVLSASDAYVE+TGQN S LWHARLGHVGYQ LQ+IS KKLLDGVPL
Subjt: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
Query: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
FKEIH DVV GCQ+GKSH LPF NS N+ AL + V+FL+ KSETFSKF+
Subjt: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| RWR74934.1 Integrase, catalytic core [Cinnamomum micranthum f. kanehirae] | 7.3e-117 | 44.79 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
MNGG+N+ +DKL +NY+YWKLCMEA+LQGQDLWDL+SGD+ IP DTS+N + RKWK+KCGK LFALRTSI ++YI VRDV+
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
Query: -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
+++ C + L L++ R QG SI + L +EAL+KQM N+
Subjt: -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
K+ VE A+Y KD+G N K+ S+D++ S EG+ +GN KGCFRCG+ GHIKRDC A+V + C + G RV + + NVA
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
Query: -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
GVSL +VYHV GLKK
Subjt: -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
Query: NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
NL SVSQI D RYVLFGP +V+I+ NIK EADVL TG+RK+SLYVLSASDAYVE+T QN+SA LWH+RLGHVGYQLLQ+IS KKLL+G+PLFKEIHHD
Subjt: NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
Query: VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
VV GCQ+ KSHRLPFP S NR + L +VHSDLMGPT+T SYS RYVM+ VDDFSRFTWVYFL+ KSE FSKFV
Subjt: VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| TrEMBL top hits | E-value | % identity | Alignment |
|---|---|---|---|
| A0A443N8T5 Integrase, catalytic core | 3.5e-117 | 44.79 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
MNGG+N+ +DKL +NY+YWKLCMEA+LQGQDLWDL+SGD+ IP DTS+N + RKWK+KCGK LFALRTSI ++YI VRDV+
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
Query: -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
+++ C + L L++ R QG SI + L +EAL+KQM N+
Subjt: -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
K+ VE A+Y KD+G N K+ S+D++ S EG+ +GN KGCFRCG+ GHIKRDC A+V + C + G RV + + NVA
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
Query: -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
GVSL +VYHV GLKK
Subjt: -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
Query: NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
NL SVSQI D RYVLFGP +V+I+ NIK EADVL TG+RK+SLYVLSASDAYVE+T QN+SA LWH+RLGHVGYQLLQ+IS KKLL+G+PLFKEIHHD
Subjt: NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
Query: VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
VV GCQ+ KSHRLPFP S NR + L +VHSDLMGPT+T SYS RYVM+ VDDFSRFTWVYFL+ KSE FSKFV
Subjt: VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| A0A5J5B3A5 Integrase catalytic domain-containing protein | 1.3e-103 | 56.95 | |
Query: DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
DLWDL+ GDD IP DT N E ++KWK+KCGK LFALRT I +EYIEHVRDV K G G + GCS + L E
Subjt: DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
Query: NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
+ HY A V ++++S V +EG NVK+D N GVSL+DVYHVPGLKKNL
Subjt: NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
Query: VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
SVSQIAD+GRYVLF PNDVKI+ NIK EADV+ TGKRKDSLYVLSASDAYVE+TGQ+ S LWHARLGHVGYQLLQ+IS KKLLDGVPLFKEIH DVV
Subjt: VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
Query: YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
LGCQ+GKSHRLPFPNS NR L +VHSDLMGPTRTPSY G YVMV VD FSRFTWV+FL+ KSETFSKF+
Subjt: YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| A0A5J5B552 Uncharacterized protein | 1.1e-91 | 52.94 | |
Query: YSVKMNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKG
+ VKMN G+++ VDKL +NYSYWKLC+EA+LQGQDLWDL+SGDD IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD L + C H G
Subjt: YSVKMNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKG
Query: CLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCN
A++ ++D+ ++++S V
Subjt: CLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCN
Query: EEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHAR
+E NVK D N GVSL+DVYHVPGLKKNL SVSQIAD RYVLFGP+DVKI+ NIK EADVL TGKRKDSLYVLS SDAYVE+TGQN S LWHAR
Subjt: EEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHAR
Query: LGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVAL
LGHVGYQLLQ+IS KKLLD LFKEIH +VV GCQ+GKSHRLPFPNSNNR AL
Subjt: LGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVAL
|
|
| A0A5J5BCB3 Uncharacterized protein | 4.6e-109 | 44.96 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
MNGG+++ +DKL +NYSYWKLCMEA+LQGQDLWDL+SGDD IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
Query: ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
+ +++ C + L+ L++ R QG SI + L +EALMKQM +N
Subjt: ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
KQ +VE A+Y KDK K NS K SS D+K SK EGQS+GN + C+RCGK GH+KRDCR KV
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
Query: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
+Y N +EG NVK D N GVSL+DVY
Subjt: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
Query: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
HVPGLKKNL SVSQIAD GRYVLFGP+DVKI+ NIK EADVL TGKRKDSLYVLSASDAYVE+ GQN S LWHARLGHVGYQLL +IS KKLLDGVPL
Subjt: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
Query: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
FKEIH DVV GCQ+GKSHR PFPNS NR AL +
Subjt: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
|
|
| A0A5J5C3K7 Uncharacterized protein | 1.3e-103 | 42.02 | |
Query: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
MNGG+++ VDKL +NYSY KLCMEA+LQGQ+LWDL+SGDD I DT +NVE +RKWK+K GK LFALRTSI +EYI+HVRD + K L+
Subjt: MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
Query: -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
LK G + +++L+ F +EALMKQ+ NN
Subjt: -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
Query: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
KQ +VE A+Y KDK K NS K SS DSK SK +GQS+GN K +RCGK GH+KRDC KV
Subjt: KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
Query: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
+Y N +EG NVK D NV+GVSL+DVY
Subjt: ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
Query: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
HVP LKKNL SVSQI D GRYVLFGP+DVKI+ NIK EADVL TGKRKDSLYVLSASDAYVE+TGQN S LWHARLGHVGYQ LQ+IS KKLLDGVPL
Subjt: HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
Query: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
FKEIH DVV GCQ+GKSH LPF NS N+ AL + V+FL+ KSETFSKF+
Subjt: FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| SwissProt top hits | E-value | % identity | Alignment |
|---|---|---|---|
| P04146 Copia protein | 1.2e-13 | 27.49 | |
Query: YCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALW
Y + G V +++D ++LEDV NL+SV ++ + G + F + V I N V+ +++ V++ AY ++ LW
Subjt: YCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALW
Query: HARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDV-VYLGCQFGKSHRLPFPNSNNRVAV--ALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYF
H R GH+ L I K + L + + C GK RLPF ++ + L VVHSD+ GP + Y ++FVD F+ + Y
Subjt: HARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDV-VYLGCQFGKSHRLPFPNSNNRVAV--ALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYF
Query: LKAKSETFSKF
+K KS+ FS F
Subjt: LKAKSETFSKF
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.2e-18 | 26.87 | |
Query: SIKRSSNDSKDSKVEGQSKGNFK----GCFRCGKPGHIKRDC----RAK----------------------VSYCNEE----------------------
S +RSSN+ S G+SK K C+ C +PGH KRDC + K V + NEE
Subjt: SIKRSSNDSKDSKVEGQSKGNFK----GCFRCGKPGHIKRDC----RAK----------------------VSYCNEE----------------------
Query: ----------------GRVNVKDDA-PNVAGVS-------------LEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRK
G V + + + +AG+ L+DV HVP L+ NL+S + G F ++ + V+ G +
Subjt: ----------------GRVNVKDDA-PNVAGVS-------------LEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRK
Query: DSLYVLSASDAYVEQTGQND--SAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRT
+LY +A E D S LWH R+GH+ + LQ ++ K L+ D C FGK HR+ F S+ R L +V+SD+ GP
Subjt: DSLYVLSASDAYVEQTGQND--SAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRT
Query: PSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKF
S G +Y + F+DD SR WVY LK K + F F
Subjt: PSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKF
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 3.8e-15 | 31.63 | |
Query: VSLEDVYHVPGLKKNLVSVSQIAD-YGRYVLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
++L ++ +VP + KNL+SV ++ + G V F P + +K V LL GK KD LY AS V S A WHARLGH +L
Subjt: VSLEDVYHVPGLKKNLVSVSQIAD-YGRYVLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
Query: RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
+ L + H + C KS+++PF S L ++SD+ + S+ RY ++FVD F+R+TW+Y LK KS+ F+
Subjt: RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.8e-17 | 33.16 | |
Query: VSLEDVYHVPGLKKNLVSVSQIADYGRY-VLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
+ L V +VP + KNL+SV ++ + R V F P + +K V LL GK KD LY AS V S A WH+RLGH +L
Subjt: VSLEDVYHVPGLKKNLVSVSQIADYGRY-VLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
Query: RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
+ L P+ H + C KSH++PF NS + L ++SD+ + S RY ++FVD F+R+TW+Y LK KS+ F+
Subjt: RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
|
|