| GenBank top hits | e value | %identity | Alignment |
|---|
| KAE8673149.1 hypothetical protein F3Y22_tig00111810pilonHSYRG00151 [Hibiscus syriacus] | 8.8e-229 | 49.83 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+H L+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------EPDSKSTSNAMKKED---------
AVQGW QPSL++ EN L GQEA+ KQM V+LK +EEAL++ + +G +P +T+ + KE+
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------EPDSKSTSNAMKKED---------
Query: ----------LTLHA-EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVS
LT+ E + Y+NDWIVDSG SNHMTGDK+KLQN EY G RVVVTA+NS+LPITH+GKT++ PR N+ QV+L++V++VPG+KK L+SV+
Subjt: ----------LTLHA-EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVS
Query: QLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQY
QLTSSG++V+FG DVKVY+D+K+S P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH +LGHVSY+KL +M+ KSMLKGLPQLD+R D VC GCQY
Subjt: QLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQY
Query: GKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------------------
GKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF++ E
Subjt: GKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------------------
Query: --------------------------------------------------------------------------------------------------DH
DH
Subjt: --------------------------------------------------------------------------------------------------DH
Query: LRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQ
LRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F + L+++M + V ++ D E+ NG++ EQ TQ
Subjt: LRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQ
Query: SPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
+PW++GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+ QW ++EE+ AL++ + WD+
Subjt: SPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| KAE8684576.1 hypothetical protein F3Y22_tig00111127pilonHSYRG00074 [Hibiscus syriacus] | 1.2e-230 | 49.08 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
AVQGW QPSL++ ENLL GQEA+ KQM V+LK +EEAL++ Q KG P S S
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
Query: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
A ++E+L L E + Y+NDWIVDSGCSNHMTGDK+KLQN EY G RVVVTA+NS+LPITH
Subjt: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++ P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE-------------------------------------------------------------------------------------------
FKEF++ E
Subjt: FKEFKEQVE-------------------------------------------------------------------------------------------
Query: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F +
Subjt: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
Query: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
L+++M + V ++ D E+ NG++ EQ TQ+PWQ+GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+
Subjt: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
Query: QWRKVVEEEVGALRRKRRWDL
+W ++EE+ AL++ + WD+
Subjt: QWRKVVEEEVGALRRKRRWDL
|
|
| KAE8705435.1 hypothetical protein F3Y22_tig00110429pilonHSYRG01243 [Hibiscus syriacus] | 8.0e-230 | 48.97 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
AVQGW QPSL++ ENLL GQEA+ KQM V+LK +EEAL++ Q KG P S S
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
Query: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
A ++E+L L E + Y+NDWIVDSGCSNHMTGDK+KLQN EY G RVVVTA+NS+LPITH
Subjt: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++ P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE-------------------------------------------------------------------------------------------
FKEF++ E
Subjt: FKEFKEQVE-------------------------------------------------------------------------------------------
Query: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
DHLRSKFDKKA+ CIFVGYD+ RKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F +
Subjt: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
Query: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
L+++M + V ++ D E+ NG++ EQ TQ+PWQ+GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+
Subjt: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
Query: QWRKVVEEEVGALRRKRRWDL
+W ++EE+ AL++ + WD+
Subjt: QWRKVVEEEVGALRRKRRWDL
|
|
| KAE8715296.1 hypothetical protein F3Y22_tig00110183pilonHSYRG00102 [Hibiscus syriacus] | 6.1e-230 | 48.97 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
AVQGW QPSL++ ENLL GQEA+ KQM V+LK +EEAL++ Q KG P S S
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
Query: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
A ++E+L L E + Y+NDWIVDSGCSNHMTGDK+KLQN EY G RVVVTA+NS+LPITH
Subjt: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++ P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE-------------------------------------------------------------------------------------------
FKEF++ E
Subjt: FKEFKEQVE-------------------------------------------------------------------------------------------
Query: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F +
Subjt: --------------------------------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEG
Query: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
L+ ++ + V ++ D E+ NG++ EQ TQ+PWQ+GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+
Subjt: LKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNI
Query: QWRKVVEEEVGALRRKRRWDL
+W ++EE+ AL++ + WD+
Subjt: QWRKVVEEEVGALRRKRRWDL
|
|
| KAE8733549.1 hypothetical protein F3Y22_tig00001120pilonHSYRG00173 [Hibiscus syriacus] | 1.2e-233 | 50.62 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
AVQGW QPSL++ ENLL GQEA+ KQM V+LK +EEAL++ Q KG P S S
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
Query: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
A ++E+L L E + Y+NDWIVDSGCSNHMTGDK+KLQN EY G RVVVTA+NS+LPITH
Subjt: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++ P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE-------------------------------------------------------------------------------------------
FKEF++ E
Subjt: FKEFKEQVE-------------------------------------------------------------------------------------------
Query: ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F + L+ +M + V ++ D E+ NG++
Subjt: ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
Query: EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
EQ TQ+PWQ+GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+ +W ++EE+ AL++ + WD+
Subjt: EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2N9EQ78 Uncharacterized protein | 8.9e-235 | 51.03 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P EDA + L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------------------------------
A+QGW QPSL++ ENLL QEA+ KQM V+LK +EEAL++ + +G
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG----------------------------------------------------
Query: -------------EPDSKSTSN--------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
E ++ ++S+ AM++E+L L E++ Y+NDWI+DSGCSNHMTGDK KLQN EYKG RVVVTA+NS+LPI H
Subjt: -------------EPDSKSTSN--------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT SG++V+FGP DVKVY+DLK+S P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP+EESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE----------------------------------------------------------------------------DHLRSKFDKKAINCI
FKEF+E E +HLRSKFDKKA+ CI
Subjt: FKEFKEQVE----------------------------------------------------------------------------DHLRSKFDKKAINCI
Query: FVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---
FVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E +D R F + L+++M + V ++ D + NG++ EQ Q+PWQ+GV+ Q
Subjt: FVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPWQSGVHGQ---
Query: ------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
+ QLRRSTR R+PNPKY N + A + EP +EE S++ +W K +EEE+ AL++ + WDL
Subjt: ------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| A0A2N9HNS8 Integrase catalytic domain-containing protein | 2.0e-234 | 50.79 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P EDA + L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG---------------------------------EPDSKSTSN---------A
A+QGW QPSL++ ENLL QEA+ KQM V+LK +EEAL++ + +G E SK S A
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG---------------------------------EPDSKSTSN---------A
Query: MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
M++++L L E++ Y+NDWIVDSGCSNHMTGDK KLQN EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT
Subjt: MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
Query: SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
SG++V+FGP DVKVY+DLK+S P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL IM+ KSMLKGLPQLD+R D VCAGCQYGKA
Subjt: SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
Query: HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
HQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E E
Subjt: HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
Query: -----------------------------------------------------------------------------------------------DHLRS
+HLRS
Subjt: -----------------------------------------------------------------------------------------------DHLRS
Query: KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
KFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E +D R F + L+++M + V ++ D ++NG++ EQ Q+PW
Subjt: KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
Query: QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
Q+GV+ Q + QLRRSTR R+PNPKY NA + A + EP +EE S++ +W K +EEE+ AL++ + WDL
Subjt: QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| A0A2N9HY47 Integrase catalytic domain-containing protein | 1.7e-233 | 50.57 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P EDA + L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGE------PDSKSTSN------------------------------------A
A+QGW QPSL++ ENLL QEA+ KQM V+LK +EEAL++ + +G +SK + A
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGE------PDSKSTSN------------------------------------A
Query: MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
M++E+L L E++ Y+NDWIVDSGCSNHMTGDK KLQN EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++VPGMKKNL+SV+QLT
Subjt: MKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLT
Query: SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
SG++V+FGP DVKVY+DLK+S P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKA
Subjt: SSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKA
Query: HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
HQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E E
Subjt: HQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE----------------------------
Query: -----------------------------------------------------------------------------------------------DHLRS
+HLRS
Subjt: -----------------------------------------------------------------------------------------------DHLRS
Query: KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
KFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E +D R F + L+++M + V ++ D + NG++ EQ Q+PW
Subjt: KFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-EQLRTQSPW
Query: QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
Q+GV+ Q + QLRRSTR R+PNPKY NA + A + EP +EE S++ +W K +EEE+ AL++ + WDL
Subjt: QSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| A0A2N9IHF5 Uncharacterized protein | 3.2e-232 | 49.83 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TC+ESYLQGQDLWEVVGG+EVT P EDA + L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSK+ND RLQ LENELLS+AQR+MTI QYF+KVK + REIS+LDPTA I +SR++RIIIHGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG-------------------------------------------EPDSKSTSN
A+QGW QPSL++ ENLL QEA+ KQM V+LK +EEAL++ + +G E ++ ++S+
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKG-------------------------------------------EPDSKSTSN
Query: --------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFY
AM++E+L L E++ Y+NDWIVDSGCSNHMTGDK KLQN EYKG RVVVTA+NS+LPI H+GKT++ PR NS QV L++V++
Subjt: --------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFY
Query: VPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLD
VPGMKKNL+SV+QLT SG++V+FGP DVKVY+DLK+S P+M+G+R++S+YVMSAE+AYV++TRKNETTDLWH RLGHVSY+KL IM+ KSMLKGLPQLD
Subjt: VPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLD
Query: IREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------
+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQPSI GMRYMVTFIDD SRYVWVFFMKEKS+TF+KFKEF+E E
Subjt: IREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE-------------
Query: ----------------------------------------------------------------------------------------------------
Subjt: ----------------------------------------------------------------------------------------------------
Query: ----------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEE
+HLRSKFDKKA+ CIFVGYD+QRKGW+C DP +GRCYTSR+V+FDEA+SWW+ + E +D R F + L+++M + V ++ D +
Subjt: ----------DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEE
Query: NNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
NG++ EQ Q+PWQ+GV+ Q + QLRRSTR R+PNPKY NA + A + EP +EE S++ +W K +EEE+ AL++ + WDL
Subjt: NNGEE-EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATL---AILEEPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| A0A6A3D2P3 Uncharacterized protein | 5.8e-234 | 50.62 | Show/hide |
Query: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
MGDLQVVGGIKKLN +NY TW+TCMESYLQGQDLWEVVGG EVT P EDA L+KWKIKAGKAMF
Subjt: MGDLQVVGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPP--EDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFV
Query: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
+KTTI+EEMLE+IR A+TPK AWDTF +LFSKRND +LQ LENELLS+AQR+M + QYF+KVK++ REISELDPTA I ++R++RII+HGL+PEYR F+
Subjt: IKTTIDEEMLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
AVQGW QPSL++ ENLL GQEA+ KQM V+LK +EEAL++ Q KG P S S
Subjt: AVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEALFSG------------------------QRKGEPDSKSTSN-------------------
Query: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
A ++E+L L E + Y+NDWIVDSGCSNHMTGDK+KLQN EY G RVVVTA+NS+LPITH
Subjt: ------------------------------------AMKKEDLTLHA---EEVSYENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITH
Query: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
+GKT++ PR N+ QV+L++V++VPGMKKNL+SV+QLTSSG++V+FGP DVKVY+D+K++ P M+GRR++SIYVMSAE+AYV++TRKNET+DLWH RLGH
Subjt: VGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAYVNKTRKNETTDLWHARLGH
Query: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
VSY+KL +M+ KSMLKGLPQLD+R D VCAGCQYGKAHQLP++ESKFKAK+PLELVHSDVFGPVKQ SISGMRYMVTFIDD SRYVWVFFMKEKS+TF+K
Subjt: VSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTK
Query: FKEFKEQVE-------------------------------------------------------------------------------------------
FKEF++ E
Subjt: FKEFKEQVE-------------------------------------------------------------------------------------------
Query: ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
DHLRSKFDKKA+ CIFVGYD+QRKGW+C DP++GRCYTSRNV+FDEA+SWW+ + E P D R F + L+ +M + V ++ D E+ NG++
Subjt: ----DHLRSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEATSWWAPKSEKAPTDERSFKEGLKEEMSQVQQVPIEEKEDPPEENNGEE-
Query: EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
EQ TQ+PWQ+GV+ Q + QLRRSTR R+PNPKY NA AI+E EP +EE SK+ +W ++EE+ AL++ + WD+
Subjt: EQLRTQSPWQSGVHGQ---------------EPQLRRSTRQRKPNPKYVNATLAILE---EPTIYEETSKNIQWRKVVEEEVGALRRKRRWDL
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04146 Copia protein | 3.6e-23 | 29.01 | Show/hide |
Query: WIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKV
+++DSG S+H+ D+ ++ E + A + + K I+ N ++ LE+V + NL+SV +L +G + F + V + +
Subjt: WIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKV
Query: SGMPLMKGRRM-DSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIRE--DMVCAGCQYGKAHQLPFEESKFKA--KQPL
+G+ ++K M +++ V++ +A +N KN LWH R GH+S KL + K+M L+ E +C C GK +LPF++ K K K+PL
Subjt: SGMPLMKGRRM-DSIYVMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIRE--DMVCAGCQYGKAHQLPFEESKFKA--KQPL
Query: ELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
+VHSDV GP+ ++ Y V F+D + Y + +K KS+ F+ F++F + E H K
Subjt: ELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.3e-22 | 23.36 | Show/hide |
Query: GQRKGEPDSKSTSNAMKKED---LTLHAEEVSY-----ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVEL
G+ G+ + +T+ ++ D L ++ EE E++W+VD+ S+H T + L + V N S I +G I V L
Subjt: GQRKGEPDSKSTSNAMKKED---LTLHAEEVSY-----ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVEL
Query: ENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAY--VNKTRKNETTDLWHARLGHVSYNKLKIMISKSML
++V +VP ++ NL+S L G F + L + + KG ++Y +AE +N + + DLWH R+GH+S L+I+ KS++
Subjt: ENVFYVPGMKKNLVSVSQLTSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIYVMSAEAAY--VNKTRKNETTDLWHARLGHVSYNKLKIMISKSML
Query: KGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE------
++ C C +GK H++ F+ S + L+LV+SDV GP++ S+ G +Y VTFIDD SR +WV+ +K K + F F++F VE
Subjt: KGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVFGPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVE------
Query: ----------------------------------------------------------------------------------------------------
Subjt: ----------------------------------------------------------------------------------------------------
Query: --DHL---------------RSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEA-TSWWAPKSEK-------------APTDERSFKEGL
HL R+K D K+I CIF+GY ++ G+R DPV + SR+V+F E+ A SEK + ++ + E
Subjt: --DHL---------------RSKFDKKAINCIFVGYDNQRKGWRCIDPVTGRCYTSRNVIFDEA-TSWWAPKSEK-------------APTDERSFKEGL
Query: KEEMSQVQQVPIEEKEDPPEENNGEEEQLRTQSPWQSGVHGQEPQLRRSTRQRKPNPKYVNATLAIL---EEPTIYEET----SKNIQWRKVVEEEVGAL
+E+S+ + P E E + + G EE + P Q Q LRRS R R + +Y + ++ EP +E KN Q K ++EE+ +L
Subjt: KEEMSQVQQVPIEEKEDPPEENNGEEEQLRTQSPWQSGVHGQEPQLRRSTRQRKPNPKYVNATLAIL---EEPTIYEET----SKNIQWRKVVEEEVGAL
Query: RRKRRWDL
++ + L
Subjt: RRKRRWDL
|
|
| P93293 Uncharacterized mitochondrial protein AtMg00300 | 1.2e-10 | 34.86 | Show/hide |
Query: LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
++KG R DS+Y++ S E N +T K+ET LWH+RL H+S +++++ K L ++ C C YGK H++ F + K PL+ VHS
Subjt: LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
Query: DVFGPVKQP
D++G P
Subjt: DVFGPVKQP
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.3e-33 | 23.17 | Show/hide |
Query: VGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEE
+ + KL + NY WS + + G +L + G+ PP AT+G A V+ P T +WK + + I
Subjt: VGGIKKLNTQNYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEE
Query: MLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIVAVQGWSVQ
+ + A T W+T +++ + + L +L + TI+ Y + T + +++ L + R ++ L EY+ I +
Subjt: MLEYIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIVAVQGWSVQ
Query: PSLIDLENLLVGQEALGKQMSRVTLKSNKEEALF-----------SGQRKGEPDSKSTSNAMK---KEDLTLH----------------------AEEVS
P+L ++ L+ E+ +S T+ A+ +G R D+++ +N K + H A+ S
Subjt: PSLIDLENLLVGQEALGKQMSRVTLKSNKEEALF-----------SGQRKGEPDSKSTSNAMK---KEDLTLH----------------------AEEVS
Query: Y---------------------------------ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVF
N+W++DSG ++H+T D L Y G V+ A+ S +PI+H G T + + S+ + L N+
Subjt: Y---------------------------------ENDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVF
Query: YVPGMKKNLVSVSQL-TSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIY----VMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLK
YVP + KNL+SV +L ++G V F P +V +DL +G+PL++G+ D +Y S + T WHARLGH + + L +IS L
Subjt: YVPGMKKNLVSVSQL-TSSGNFVVFGPNDVKVYQDLKVSGMPLMKGRRMDSIY----VMSAEAAYVNKTRKNETTDLWHARLGHVSYNKLKIMISKSMLK
Query: GLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
L + + C+ C K++++PF +S + +PLE ++SDV+ P+ S RY V F+D +RY W++ +K+KS+ F FK +E+ +++
Subjt: GLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.9e-32 | 34.21 | Show/hide |
Query: NDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNF-VVFGPNDVKVYQD
N+W++DSG ++H+T D L Y G V+ A+ S +PITH G + ++S+ ++L V YVP + KNL+SV +L ++ V F P +V +D
Subjt: NDWIVDSGCSNHMTGDKKKLQNTFEYKGSRVVVTANNSKLPITHVGKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSGNF-VVFGPNDVKVYQD
Query: LKVSGMPLMKGRRMDSIY---VMSAEAAYVNKTRKNETT-DLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMV-CAGCQYGKAHQLPFEESKFKAK
L +G+PL++G+ D +Y + S++A + + ++ T WH+RLGH S L +IS LP L+ ++ C+ C K+H++PF S +
Subjt: LKVSGMPLMKGRRMDSIY---VMSAEAAYVNKTRKNETT-DLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMV-CAGCQYGKAHQLPFEESKFKAK
Query: QPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
+PLE ++SDV+ P+ SI RY V F+D +RY W++ +K+KS+ F FK VE+ +++
Subjt: QPLELVHSDVF-GPVKQPSISGMRYMVTFIDDLSRYVWVFFMKEKSETFTKFKEFKEQVEDHLRSK
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|
| AT3G20980.1 Gag-Pol-related retrotransposon family protein | 8.5e-04 | 22.02 | Show/hide |
Query: NYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEEMLEYIRGAET
NY+ W+ M++ L + LW++V Y + SK P T L ++ SA +K KA+ ++++ + + + +
Subjt: NYKTWSTCMESYLQGQDLWEVVGGTEVTPPEDATVGFYALKLVDSKCPLGPTGSSLGALRVFIPSATALKKWKIKAGKAMFVIKTTIDEEMLEYIRGAET
Query: PKAAWDTF------ASLFSKRNDARLQFLENELLSVAQREMTI-----------NQYFNK-VKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
K WD A L ++ F N L + +++T+ + NK +K + ++ + M+R + +G+ P+
Subjt: PKAAWDTF------ASLFSKRNDARLQFLENELLSVAQREMTI-----------NQYFNK-VKTLYREISELDPTATISKSRMRRIIIHGLKPEYRSFIV
Query: AVQGWSVQPSL---IDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGEPDS--KSTSNAMKKEDLTLHAEEVS------------YENDWIVDSGC
S P L I E++ +E + K M + E FS PDS + T +A+ +DL EV +EN W++ S
Subjt: AVQGWSVQPSL---IDLENLLVGQEALGKQMSRVTLKSNKEEALFSGQRKGEPDS--KSTSNAMKKEDLTLHAEEVS------------YENDWIVDSGC
Query: SNHMTGDKKKLQNTFEYKGSRV-VVTANNSKLPITHV-GKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSG
SNHMT K + +V ++ + S+ + V G + +N ++NV YVPG++ N +SVSQL +G
Subjt: SNHMTGDKKKLQNTFEYKGSRV-VVTANNSKLPITHV-GKTMIMPRSNSKQVELENVFYVPGMKKNLVSVSQLTSSG
|
|
| AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 4.1e-06 | 23.6 | Show/hide |
Query: PSATALKKWKIKAGKAMFVIKTTIDEEMLE-YIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISK
P+ K+WK + G I TI + +L+ I+ T + W + +LF +AR ENEL + ++++++Y K+K+L ++ +D + IS
Subjt: PSATALKKWKIKAGKAMFVIKTTIDEEMLE-YIRGAETPKAAWDTFASLFSKRNDARLQFLENELLSVAQREMTINQYFNKVKTLYREISELDPTATISK
Query: SRMRRIIIHGLKPEYRSFIVAVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEAL
+ +++GL +Y + ++ S PS + ++L+ +E+ S+ +L +L
Subjt: SRMRRIIIHGLKPEYRSFIVAVQGWSVQPSLIDLENLLVGQEALGKQMSRVTLKSNKEEAL
|
|
| ATMG00300.1 Gag-Pol-related retrotransposon family protein | 8.5e-12 | 34.86 | Show/hide |
Query: LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
++KG R DS+Y++ S E N +T K+ET LWH+RL H+S +++++ K L ++ C C YGK H++ F + K PL+ VHS
Subjt: LMKGRRMDSIYVM--SAEAAYVN--KTRKNETTDLWHARLGHVSYNKLKIMISKSMLKGLPQLDIREDMVCAGCQYGKAHQLPFEESKFKAKQPLELVHS
Query: DVFGPVKQP
D++G P
Subjt: DVFGPVKQP
|
|