| GenBank top hits | e value | %identity | Alignment |
|---|
| KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 1.2e-127 | 41.83 | Show/hide |
Query: MASSSSSL-SESVSASFLQPNTS---IFLLSNICNLVPIRLDSTNYLFWKFQVESMLRAHSLFDIVDGTIPCPPKFLCDAEGNKLTTVNTAYTQWIAQDH
MA+++ +L + S +++ PN S IFLLSNICNL+ RLDS+NY+ WKFQ+ S+L+AHSL +DGT PCP KF+ D G +N Y W QD
Subjt: MASSSSSL-SESVSASFLQPNTS---IFLLSNICNLVPIRLDSTNYLFWKFQVESMLRAHSLFDIVDGTIPCPPKFLCDAEGNKLTTVNTAYTQWIAQDH
Query: TLITLINVTLSKQAFSFVVRCKSSKEVWEALSKHFSSLTRSHIHKLKSALHIVSKSLAESIDDYLIRIKETVDKLETVSVTVDDEDILLYTLNGRPAKFN
L+TL+N TLS+ A S V+ +S+E W AL + FS+ TRS+I +LKSALH +SK +SID Y+ +IK+ D L +VSV ++DEDIL+Y LNG P ++N
Subjt: TLITLINVTLSKQAFSFVVRCKSSKEVWEALSKHFSSLTRSHIHKLKSALHIVSKSLAESIDDYLIRIKETVDKLETVSVTVDDEDILLYTLNGRPAKFN
Query: SFRTSIRTRKDSVTLDELHSLLKSEAKFIEQQNKIVANPLFNPTAMYA-------NLGRGSTSSGFRGRGRSNQGRGFSPGNSNPVQGRGSSGNFSPNPA
+F+TSIRT+ +++TL+E++++LK E + IE +K +P F P AM A + RG + S F GRGR +GR + G GR S NF +
Subjt: SFRTSIRTRKDSVTLDELHSLLKSEAKFIEQQNKIVANPLFNPTAMYA-------NLGRGSTSSGFRGRGRSNQGRGFSPGNSNPVQGRGSSGNFSPNPA
Query: SSNSVNAGCGSNGNTPNNQGSSNSGQGRVICQICNRPGHGALDCFNRLNLSYQGRYPPSKLVSMAVANDPSSTTST--WLADSGCNIHVTHNSSNLALNS
N + +NQ S+NS V+CQICN+ GH ALDC++R++ SYQG+ P +L +M+ + S S W D+G H+T + +NL
Subjt: SSNSVNAGCGSNGNTPNNQGSSNSGQGRVICQICNRPGHGALDCFNRLNLSYQGRYPPSKLVSMAVANDPSSTTST--WLADSGCNIHVTHNSSNLALNS
Query: NYNGEEAITLANGQAFPVAQAGFGTLSTSQNDLHLSNLFCVPDLTTNLLLVSQCCIDNNCIFVFDAEWFSIQDKPSGRVLYMSKSRDGLYPISAAVKTLS
Y G++ IT+ANGQA ++ +G ++ + + L+N+ CVP + TNLL V Q C DN+C F+FD+E F IQDK + ++L+ S GLYP+ + T
Subjt: NYNGEEAITLANGQAFPVAQAGFGTLSTSQNDLHLSNLFCVPDLTTNLLLVSQCCIDNNCIFVFDAEWFSIQDKPSGRVLYMSKSRDGLYPISAAVKTLS
Query: STTSL--------LNASFLNHVPV----------CATTSRTSTTDLWHYRLGHPSTVVLHKLLSTYSIA---HDAPLQNKDYISCLQGKMTKLPFPLSIS
S SL N NH P+ A + +T LWH RLGHPST L +LS+ SI APL CL GKMTKLPFPLS +
Subjt: STTSL--------LNASFLNHVPV----------CATTSRTSTTDLWHYRLGHPSTVVLHKLLSTYSIA---HDAPLQNKDYISCLQGKMTKLPFPLSIS
Query: ESHAPLELIHSDFWGPSPSLSVSCFKYY----------------------------GITHQRSCPYTPEQNGVVEHKHRSIVDIALSLMFHASVLLEF
ES APL+L+HSD WGP+P S F YY GI H+RSCP+TP+QNG+ E KHR IV+ L+L+ AS+ L++
Subjt: ESHAPLELIHSDFWGPSPSLSVSCFKYY----------------------------GITHQRSCPYTPEQNGVVEHKHRSIVDIALSLMFHASVLLEF
|
|
| XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera] | 1.9e-117 | 49.71 | Show/hide |
Query: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
+R I S+ + R + + D H I+E + P++ E ++ D +Y DS W+SPVQ V KKGG+T+V N++NELIPTRTVTGWR+
Subjt: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
Query: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
AYYCFLDGYSGYNQI+I+PEDQEKTTF G R+ G C++A
Subjt: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
Query: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
F + S VL+RCE+T LV+NWEKCHFMV+EGIVLGH+IS GLEVDRAKIE+IE+L PP +VKG++SFLGH GFYRRFIKDFSKISKPLCN
Subjt: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
Query: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
LL D VFDF+ DC AF LK L+SAPI+ S + A GQ +H IYYASRVLN AQ+NY T EKELLA+VFAF KFR Y
Subjt: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
Query: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
LVGSKV V+TDH+ I+YL KKDAKPRL RWVLLLQEFDLEI DK+G ENV+ADHL RL+ S + I +SFPDEQL AV VN+L
Subjt: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
Query: ----MDWRQKKKFKHDI
+ + QKKKF D+
Subjt: ----MDWRQKKKFKHDI
|
|
| XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera] | 1.9e-117 | 49.71 | Show/hide |
Query: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
+R I S+ + R + + D H I+E + P++ E ++ D +Y DS W+SPVQ V KKGG+T+V N++NELIPTRTVTGWR+
Subjt: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
Query: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
AYYCFLDGYSGYNQI+I+PEDQEKTTF G R+ G C++A
Subjt: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
Query: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
F + S VL+RCE+T LV+NWEKCHFMV+EGIVLGH+IS GLEVDRAKIE+IE+L PP +VKG++SFLGH GFYRRFIKDFSKISKPLCN
Subjt: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
Query: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
LL D VFDF+ DC AF LK L+SAPI+ S + A GQ +H IYYASRVLN AQ+NY T EKELLA+VFAF KFR Y
Subjt: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
Query: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
LVGSKV V+TDH+ I+YL KKDAKPRL RWVLLLQEFDLEI DK+G ENV+ADHL RL+ S + I +SFPDEQL AV VN+L
Subjt: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
Query: ----MDWRQKKKFKHDI
+ + QKKKF D+
Subjt: ----MDWRQKKKFKHDI
|
|
| XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera] | 2.5e-117 | 49.52 | Show/hide |
Query: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
+R I S+ + R + + D H I+E + P++ E ++ D +Y DS W+SPVQ V KKGG+T+V N++NELIPTRTVTGWR+
Subjt: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
Query: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
AYYCFLDGYSGYNQI+I+PEDQEKTTF G R+ G C++A
Subjt: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
Query: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
F + S VL+RCE+T LV+NWEKCHFMV+EGI+LGH+IS GLEVDRAKIE+IE+L PP +VKG++SFLGH GFYRRFIKDFSKISKPLCN
Subjt: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
Query: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
LL D VFDF+ DC AF LK L+SAPI+ S + A GQ +H IYYASRVLN AQ+NY T EKELLA+VFAF KFR Y
Subjt: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
Query: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
LVGSKV V+TDH+ I+YL KKDAKPRL RWVLLLQEFDLEI DK+G ENV+ADHL RL+ S + I +SFPDEQL AV VN+L
Subjt: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
Query: ----MDWRQKKKFKHDI
+ + QKKKF D+
Subjt: ----MDWRQKKKFKHDI
|
|
| XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera] | 1.9e-117 | 49.71 | Show/hide |
Query: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
+R I S+ + R + + D H I+E + P++ E ++ D +Y DS W+SPVQ V KKGG+T+V N++NELIPTRTVTGWR+
Subjt: IRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----
Query: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
AYYCFLDGYSGYNQI+I+PEDQEKTTF G R+ G C++A
Subjt: ------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA-----------------
Query: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
F + S VL+RCE+T LV+NWEKCHFMV+EGIVLGH+IS GLEVDRAKIE+IE+L PP +VKG++SFLGH GFYRRFIKDFSKISKPLCN
Subjt: ------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCN
Query: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
LL D VFDF+ DC AF LK L+SAPI+ S + A GQ +H IYYASRVLN AQ+NY T EKELLA+VFAF KFR Y
Subjt: LLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPY
Query: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
LVGSKV V+TDH+ I+YL KKDAKPRL RWVLLLQEFDLEI DK+G ENV+ADHL RL+ S + I +SFPDEQL AV VN+L
Subjt: LVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVE--------VNHLC---
Query: ----MDWRQKKKFKHDI
+ + QKKKF D+
Subjt: ----MDWRQKKKFKHDI
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2G9FWY3 Reverse transcriptase | 1.4e-113 | 53.72 | Show/hide |
Query: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
+Y DS+WVSPVQCV KKGG+T+V N NELIPTRTVTGWR+ + +YCFLDGYSGYNQI IAPEDQEK T
Subjt: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
Query: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
F G R+ G C++A F + S VLKRCEDT L++NWEKCHFMV+EGIVLGH++S G+E
Subjt: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
Query: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
VD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKDFSKISKPLCNLL D F+F+ CR AF LK L+SAPI+ S V A
Subjt: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
Query: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
GQ IYYAS+ LN+AQ+NY T EKELLA+VFAF KFR YLVG+KV V+TDHA IRYL KKDAKPRL RWVLLLQEFDLEI D+KG+EN IA
Subjt: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
Query: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
DHL RL+ + + + I D+FPDEQL A+
Subjt: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
|
|
| A0A2G9HYA0 Reverse transcriptase | 1.4e-113 | 53.72 | Show/hide |
Query: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
+Y DS+WVSPVQCV KKGG+T+V N NELIPTRTVTGWR+ + +YCFLDGYSGYNQI IAPEDQEKTT
Subjt: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
Query: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
F G R+ G C++A F + S VLKRCEDT L++NWEKCHFMV+EGIVLGH++S G+E
Subjt: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
Query: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
VD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKDFSKISKPLCNLL D F+F+ C AF LK L+SAPI+ S V A
Subjt: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
Query: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
GQ IYYAS+ LN+AQ+NY T EKELLA+VFAF KFR YLVG+KV V+TDHA IRYL KKDAKPRL RWVLLLQEFDLEI D+KG+EN IA
Subjt: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
Query: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
DHL RL+ + + + I D+FPDEQL A+
Subjt: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
|
|
| A0A2G9HYD8 Reverse transcriptase | 4.7e-114 | 53.72 | Show/hide |
Query: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
+Y DS+WVSPVQCV KKGG+T+V N NELIPTRTVTGWR+ + +YCFLDGYSGYNQI IAPEDQEKTT
Subjt: VYLGEDSNWVSPVQCVHKKGGVTMVSNKDNELIPTRTVTGWRI----------------------------GWQAYYCFLDGYSGYNQITIAPEDQEKTT
Query: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
F G R+ G C++A F + S VLKRCEDT LV+NWEKCHFMV+EGIVLGH++S G+E
Subjt: FIALTG-----RLLLG---------ECLLA-----------------------FAMLQQHFSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLE
Query: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
VD+AK+E IE+L PP SVKG++SFLGHAGFYRRFIKDFSKISKPLCNLL D F F+ C AF+ LK L+SAPI+ S + A
Subjt: VDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPILC-------------CSRCYVWA
Query: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
GQ IYYAS+ LN+AQ+NY T EKELLA+VFAF KFR YLVG+KV V+TDHA IRYL KKDAKPRL RWVLLLQEFDLEI D+KG+EN IA
Subjt: KAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIA
Query: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
DHL RL+ + + + I D+FPDEQL A+
Subjt: DHLPRLDPSSSLLKQSTIFDSFPDEQLFAV
|
|
| A0A5J5A1U7 Integrase catalytic domain-containing protein | 5.7e-128 | 41.83 | Show/hide |
Query: MASSSSSL-SESVSASFLQPNTS---IFLLSNICNLVPIRLDSTNYLFWKFQVESMLRAHSLFDIVDGTIPCPPKFLCDAEGNKLTTVNTAYTQWIAQDH
MA+++ +L + S +++ PN S IFLLSNICNL+ RLDS+NY+ WKFQ+ S+L+AHSL +DGT PCP KF+ D G +N Y W QD
Subjt: MASSSSSL-SESVSASFLQPNTS---IFLLSNICNLVPIRLDSTNYLFWKFQVESMLRAHSLFDIVDGTIPCPPKFLCDAEGNKLTTVNTAYTQWIAQDH
Query: TLITLINVTLSKQAFSFVVRCKSSKEVWEALSKHFSSLTRSHIHKLKSALHIVSKSLAESIDDYLIRIKETVDKLETVSVTVDDEDILLYTLNGRPAKFN
L+TL+N TLS+ A S V+ +S+E W AL + FS+ TRS+I +LKSALH +SK +SID Y+ +IK+ D L +VSV ++DEDIL+Y LNG P ++N
Subjt: TLITLINVTLSKQAFSFVVRCKSSKEVWEALSKHFSSLTRSHIHKLKSALHIVSKSLAESIDDYLIRIKETVDKLETVSVTVDDEDILLYTLNGRPAKFN
Query: SFRTSIRTRKDSVTLDELHSLLKSEAKFIEQQNKIVANPLFNPTAMYA-------NLGRGSTSSGFRGRGRSNQGRGFSPGNSNPVQGRGSSGNFSPNPA
+F+TSIRT+ +++TL+E++++LK E + IE +K +P F P AM A + RG + S F GRGR +GR + G GR S NF +
Subjt: SFRTSIRTRKDSVTLDELHSLLKSEAKFIEQQNKIVANPLFNPTAMYA-------NLGRGSTSSGFRGRGRSNQGRGFSPGNSNPVQGRGSSGNFSPNPA
Query: SSNSVNAGCGSNGNTPNNQGSSNSGQGRVICQICNRPGHGALDCFNRLNLSYQGRYPPSKLVSMAVANDPSSTTST--WLADSGCNIHVTHNSSNLALNS
N + +NQ S+NS V+CQICN+ GH ALDC++R++ SYQG+ P +L +M+ + S S W D+G H+T + +NL
Subjt: SSNSVNAGCGSNGNTPNNQGSSNSGQGRVICQICNRPGHGALDCFNRLNLSYQGRYPPSKLVSMAVANDPSSTTST--WLADSGCNIHVTHNSSNLALNS
Query: NYNGEEAITLANGQAFPVAQAGFGTLSTSQNDLHLSNLFCVPDLTTNLLLVSQCCIDNNCIFVFDAEWFSIQDKPSGRVLYMSKSRDGLYPISAAVKTLS
Y G++ IT+ANGQA ++ +G ++ + + L+N+ CVP + TNLL V Q C DN+C F+FD+E F IQDK + ++L+ S GLYP+ + T
Subjt: NYNGEEAITLANGQAFPVAQAGFGTLSTSQNDLHLSNLFCVPDLTTNLLLVSQCCIDNNCIFVFDAEWFSIQDKPSGRVLYMSKSRDGLYPISAAVKTLS
Query: STTSL--------LNASFLNHVPV----------CATTSRTSTTDLWHYRLGHPSTVVLHKLLSTYSIA---HDAPLQNKDYISCLQGKMTKLPFPLSIS
S SL N NH P+ A + +T LWH RLGHPST L +LS+ SI APL CL GKMTKLPFPLS +
Subjt: STTSL--------LNASFLNHVPV----------CATTSRTSTTDLWHYRLGHPSTVVLHKLLSTYSIA---HDAPLQNKDYISCLQGKMTKLPFPLSIS
Query: ESHAPLELIHSDFWGPSPSLSVSCFKYY----------------------------GITHQRSCPYTPEQNGVVEHKHRSIVDIALSLMFHASVLLEF
ES APL+L+HSD WGP+P S F YY GI H+RSCP+TP+QNG+ E KHR IV+ L+L+ AS+ L++
Subjt: ESHAPLELIHSDFWGPSPSLSVSCFKYY----------------------------GITHQRSCPYTPEQNGVVEHKHRSIVDIALSLMFHASVLLEF
|
|
| A0A6J1E110 uncharacterized protein LOC111025424 | 1.6e-114 | 41.06 | Show/hide |
Query: PPQLLMGGQGSFAP-QNSESSLEAMMKEYMAHTDATIQSNQASMRALELHVGQLANELKARPQGKLPQILNILKGSNKD---AGASGSVLDVEPPYVPLP
PPQ + P QN+ S+LE MKEYMA TDA IQS ASMR E +G LAN LK RPQG + K K+ A S L + P +P
Subjt: PPQLLMGGQGSFAP-QNSESSLEAMMKEYMAHTDATIQSNQASMRALELHVGQLANELKARPQGKLPQILNILKGSNKD---AGASGSVLDVEPPYVPLP
Query: PYVPPLPFPQRQSLRIRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGE---------------------------
+ + +P+++E PTL+ KPL HLKY YLG+
Subjt: PYVPPLPFPQRQSLRIRWILESIIVERAIQDSADKHSKIMEWKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGE---------------------------
Query: ---------DSNWVSPVQCVHK------------------------KGGVTMVSNKDNELIPTRTVTGWRI----------------------------G
D +S C+HK K G+T+ +N+ NELI TRTV+GWR+
Subjt: ---------DSNWVSPVQCVHK------------------------KGGVTMVSNKDNELIPTRTVTGWRI----------------------------G
Query: WQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA----------------FAMLQQHFSG-------VLKRCEDTQ
+ +YCFLDGYSGYNQITIAPEDQ KTTF G R+ G C++A F++ F VL+RCE T
Subjt: WQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTG-----RLLLG---------ECLLA----------------FAMLQQHFSG-------VLKRCEDTQ
Query: LVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAAL
LV+NWEKCHFMV+EGIVLGH+ISK G+EVD AKI++I +L PP +VKGI+SFLGH GFYRRFIKDF+KISKPLC LL D F F DC K+FE LK AL
Subjt: LVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAAL
Query: MSAPIL------------CCSRCY-VWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAK
SAPI+ C + Y + A GQ +HP+YYAS+ L AQ+NY T EKELLA+VFAF KFR YL+G+KV VFTDH+ ++YL +KKDAK
Subjt: MSAPIL------------CCSRCY-VWAKAGQ----FIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAK
Query: PRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIF-DSFPDEQLFAVE
PRL RW+LLLQEFD+E+ D+KG+EN +ADHL RL+ S L T+ + F DEQL V+
Subjt: PRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIF-DSFPDEQLFAVE
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 3.8e-36 | 33.95 | Show/hide |
Query: LGECLLAFAMLQQHFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISK
L + ++ L +H V ++ L + +KC F+ +E LGH ++ +G++ + KIE I++ P K I++FLG G+YR+FI +F+ I+K
Subjt: LGECLLAFAMLQQHFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISK
Query: PLCNLLCTDHVFD-FNADCRKAFETLKAALMSAPIL-------------CCSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRP
P+ L + D N + AF+ LK + PIL S + A Q HP+ Y SR LNE ++NY T+EKELLA+V+A FR
Subjt: PLCNLLCTDHVFD-FNADCRKAFETLKAALMSAPIL-------------CCSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRP
Query: YLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQST
YL+G + +DH + +L KD +L RW + L EFD +I KG EN +AD L R+ + L + T
Subjt: YLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQST
|
|
| P10394 Retrovirus-related Pol polyprotein from transposon 412 | 1.1e-30 | 28.85 | Show/hide |
Query: ITIAPEDQEKTTFIALTG-----RLLLGECLLAFAMLQQH----FSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPN
+ IAP ++ IA +G L + L+ ++H + V +C + L ++ EKC F + E LGH+ + G+ D K +VI+ P+
Subjt: ITIAPEDQEKTTFIALTG-----RLLLGECLLAFAMLQQH----FSGVLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPN
Query: SVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPIL------------------CCSRCYVWAKAGQFIHPIYY
+ F+ +YRRFIK+F+ S+ + L + F++ +C+KAF LK+ L++ +L C G + P+ Y
Subjt: SVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFDFNADCRKAFETLKAALMSAPIL------------------CCSRCYVWAKAGQFIHPIYY
Query: ASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRL------DPS
ASR + + N T E+EL A+ +A + FRPY+ G TV TDH + YL S + +L R L L+E++ + KG +N +AD L R+ D +
Subjt: ASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRL------DPS
Query: SSLLKQSTIFDS
++LK +T F S
Subjt: SSLLKQSTIFDS
|
|
| P10401 Retrovirus-related Pol polyprotein from transposon gypsy | 1.2e-29 | 27.75 | Show/hide |
Query: IGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGRLLLGECLLAFAMLQ---------------------------------------QHFSGVLKRC
+G ++ LD SGY+QI +A D+EKT+F G+ C L F + +H VLK
Subjt: IGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGRLLLGECLLAFAMLQ---------------------------------------QHFSGVLKRC
Query: EDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL------CTDHV-----FDF
D + ++ EK F + LG +SK+G + D K++ I+ P+ V ++SFLG A +YR FIKDF+ I++P+ ++L + H+ +F
Subjt: EDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL------CTDHV-----FDF
Query: NADCRKAFETLKAALMSAPILC--------------CSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSK-VTVFTDH
N R AF+ L+ L S ++ S + A Q PI SR L + + NY T E+ELLA+V+A K + +L GS+ + +FTDH
Subjt: NADCRKAFETLKAALMSAPILC--------------CSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSK-VTVFTDH
Query: ATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPR
+ + + ++ ++ RW + + + ++ K G EN +AD L R
Subjt: ATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPR
|
|
| P20825 Retrovirus-related Pol polyprotein from transposon 297 | 9.4e-35 | 28.82 | Show/hide |
Query: DNELIPTRTVTGWRIGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGR----------------------------------LLLGECLLAFAMLQQ
D IP ++G Y+ +D G++QI + E KT F +G + L + ++ L +
Subjt: DNELIPTRTVTGWRIGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGR----------------------------------LLLGECLLAFAMLQQ
Query: HFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFD
H + V + D L + +KC F+ KE LGH ++ +G++ + K++ I P K I++FLG G+YR+FI +++ I+KP+ + L D
Subjt: HFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLLCTDHVFD
Query: F-NADCRKAFETLKAALMSAPIL-------------CCSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDH
+ +AFE LKA ++ PIL S + A Q HPI + SR LN+ ++NY +EKELLA+V+A FR YL+G + + +DH
Subjt: F-NADCRKAFETLKAALMSAPIL-------------CCSRCYVWAKAGQFIHPIYYASRVLNEAQVNYKTVEKELLAMVFAFVKFRPYLVGSKVTVFTDH
Query: ATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRL
+R+L + K+ +L RW + L E+ +I+ KG EN +AD L R+
Subjt: ATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRL
|
|
| Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 1.6e-34 | 26.9 | Show/hide |
Query: VSNKDNELIPTRTVTGWRIGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGR----------------------------------LLLGECLLAFA
V+ D IP T +G Y+ LD SG++QI + D KT F L G+ + + + ++
Subjt: VSNKDNELIPTRTVTGWRIGWQAYYCFLDGYSGYNQITIAPEDQEKTTFIALTGR----------------------------------LLLGECLLAFA
Query: MLQQHFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL---
H+ VL L +N EK HF+ + LG+ ++ +G++ D K+ I + PP SVK ++ FLG +YR+FI+D++K++KPL NL
Subjt: MLQQHFSG---VLKRCEDTQLVINWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLGHAGFYRRFIKDFSKISKPLCNLL---
Query: --------CTDHVFDFNADCRKAFETLKAALMSAPIL---CCSRCY-------VWAKAGQFI-------HPIYYASRVLNEAQVNYKTVEKELLAMVFAF
+ + ++F LK+ L S+ IL C ++ + WA PI Y SR LN+ + NY T+EKE+LA++++
Subjt: --------CTDHVFDFNADCRKAFETLKAALMSAPIL---CCSRCY-------VWAKAGQFI-------HPIYYASRVLNEAQVNYKTVEKELLAMVFAF
Query: VKFRPYLVGS-KVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVEVNH
R YL G+ + V+TDH + + ++ +L RW ++E++ E+ K G NV+AD L R+ P + L ST D+ P++ + ++ H
Subjt: VKFRPYLVGS-KVTVFTDHATIRYLKSKKDAKPRLNRWVLLLQEFDLEINDKKGSENVIADHLPRLDPSSSLLKQSTIFDSFPDEQLFAVEVNH
|
|