| GenBank top hits | E-value | % identity | Alignment |
|---|---|---|---|
| KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis] | 2.3e-93 | 43.41 | |
Query: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
T +++ NPS S L+NICNL++ LDS+NY+ W+FQIS + K+H L Y+DGT P DE+ +Y+ W +DQAL+TL+NAT
Subjt: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
Query: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
LSQTALS+VIG TS++ W LE+ FS+STR+N++ LK+ L +IS K +SID+Y++++K+ + LA+VS +I+ ED +IY +NGLP YN FKTS+RT+
Subjt: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
Query: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
+ ++T E++ ++K EE ++ K + M T+ S +RG S+ GRG GR + GR F +P+ S+ +P+ P Q +
Subjt: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
Query: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
R +N V QIC + GH+ALDCY++M++SYQG+ P +L AMSA+ + S + SP N W +DTG H+T+DLANL Y G++NIT+ N
Subjt: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
Query: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
GQ+L ISH G + + +F L+N+ VP ++TNLLSVHQ C DN+C FIFDS F IQDK+T ++LF GPS +GLYPL S PSP
Subjt: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
|
|
| KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis] | 6.1e-94 | 43.41 | |
Query: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
T +++ NPS S L+NICNL++ LDS+NY+ W+FQIS + K+H L Y+DGT P DE+ +Y+ W +DQAL+TL+NAT
Subjt: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
Query: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
LSQTALS+VIG TS++ W LE+ FS+STR+N++ LK+ L +IS K +SID+Y++++K+ + LA+VS +I+ ED +IY +NGLP YN FKTS+RT+
Subjt: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
Query: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
+ ++T E++ ++K EE ++ K + M T+ S +RG S+ GRG GR + GR F +P+ S+ +P+ P Q +
Subjt: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
Query: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
R +N V QIC + GH+ALDCY++M++SYQG+ P +L AMSA+ + S + SP N W +DTG H+T+DLANL Y G++NIT+ N
Subjt: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
Query: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
GQ+L ISH G + + +F L+N+ VP ++TNLLSVHQ C DN+C FIFDS F IQDK+T ++LF GPS +GLYPL S PSP
Subjt: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
|
|
| KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 1.7e-107 | 41.8 | |
Query: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
T +++ NPS S L+NICNL++ LDS+NY+ W+FQIS + K+H L Y+DGT P DE+ +Y+ W +DQAL+TL+NAT
Subjt: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
Query: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
LSQTALS+VIG TS++ W LE+ FS+STR+N++ LK+ L +IS K +SID+Y++++K+ + LA+VS +I+ ED +IY +NGLP YN FKTS+RT+
Subjt: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
Query: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
+ ++T E++ ++K EE ++ K + M T+ S +RG S+ GRG GR + GR F +P+ S+ +P+ P Q +
Subjt: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
Query: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
R +N V QIC + GH+ALDCY++M++SYQG+ P +L AMSA+ + S + SP N W +DTG H+T+DLANL Y G++NIT+ N
Subjt: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
Query: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
GQ+L ISH G + + +F L+N+ VP ++TNLLSVHQ C DN+C FIFDS F IQDK+T ++LF GPS +GLYPL + K +P+ Q L
Subjt: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
Query: ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
TA +G + ST +WHDRLGHP + L S+L+S+ I R +C+HCL GK++K PF
Subjt: ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
|
|
| KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis] | 1.8e-93 | 43.41 | |
Query: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
T +++ NPS S L+NICNL++ LDS+NY+ W+FQIS + K+H L Y+DGT P DE+ +Y+ W +DQAL+TL+NAT
Subjt: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
Query: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
LSQTALS+VIG TS++ W LE+ FS+STR+N++ LK+ L +IS K +SID+Y++++K + LA+VS +I+ ED +IY +NGLP YN FKTS+RT+
Subjt: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
Query: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
+ ++T E++ ++K EE ++ K + M T+ S +RG S+ GRG GR + GR F +P+ S+ +P+ P Q +
Subjt: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
Query: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
R +N V QIC + GH+ALDCY++M++SYQG+ P +L AMSA+ + S + SP N W +DTG H+T+DLANL Y G++NIT+ N
Subjt: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
Query: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
GQ+L ISH G + + +F L+N+ VP ++TNLLSVHQ C DN+C FIFDS F IQDK+T ++LF GPS +GLYPL S PSP
Subjt: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
|
|
| XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | 2.3e-85 | 46.45 | |
Query: MTTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE--------------------KSVDYEAWYERDQALI
MT+ N+ + + + L+NICNLVS+ LDST++ILW+FQ++ + K+HKLF ++DG+ AP + + +E W +DQAL+
Subjt: MTTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE--------------------KSVDYEAWYERDQALI
Query: TLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFK
TLINATLS AL+YV+ TS+QVWE LEKH+SS++RTNV+ LK++LQSI KK+ ESIDAYV+R+KEI +K A VS I+ E +IY +NGL + YN
Subjt: TLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFK
Query: TSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNW----RSHGRGRGGGRSDGNRNNGRGGRGF--LFPNPSNAPSHSQF
TS+RTRA S++F ELH+ MK+EESA+++Q K + + +S SQ+R + +SH RGRG +NNGRG F F N S F
Subjt: TSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNW----RSHGRGRGGGRSDGNRNNGRGGRGF--LFPNPSNAPSHSQF
Query: PSPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPA---YNG
+ Q D NR QIC + GH ALDCYN+MN+ +QGRHPP +LAAM A ++S L S WL+D+ CN H+T+DL+NL I+ YNG
Subjt: PSPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPA---YNG
Query: EENITVGNGQSLPISHFGPGQL
EENI+VG+GQS PI+HFG GQ+
Subjt: EENITVGNGQSLPISHFGPGQL
|
|
| TrEMBL top hits | E-value | % identity | Alignment |
|---|---|---|---|
| A0A2N9EZ90 Uncharacterized protein | 2.4e-96 | 38.45 | |
Query: TTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDEKSVD------------YEAWYERDQALITLINATLSQ
TT + N+ + L+NI V+V LD +N++ W+FQI+ + +++ L +YV+G + P + + Y W RD+AL++LI+ATLS
Subjt: TTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDEKSVD------------YEAWYERDQALITLINATLSQ
Query: TALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHS
+A S VIG ++ +W L K ++S +R+N++ LK +L + KK+ ++I Y++R+KE ++KLAAV T++D ED + + GLPS Y F +++ T+ S
Subjt: TALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHS
Query: LTF-ELHILMKTEESAL-DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFL---------------------FP-NPS
++F ELH+LM ++E L Q E S ++ T S + SRG + + GRG R GN G GF FP +P
Subjt: LTF-ELHILMKTEESAL-DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFL---------------------FP-NPS
Query: N----APSHSQFP-----SPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLT
N +P+ SQ P S P F NR QICQ+ GH ALDCYN+MNYSYQGRHPPAKLAAM+++ S S +N W+SDTG H T
Subjt: N----APSHSQFP-----SPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLT
Query: SDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY
DLANL S YN + ++VGNGQ LPISH G QL + F L N+ RVP +++NLLSV++ C DN+C F FDS F IQD+ +GK L+ G S +GLY
Subjt: SDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY
Query: PL---------VAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPFSPIILSFLFSFRV
PL ++S SP+ Q +++S+T+WH R GHP +L +L++ F P D C+HC GK+++ PFS S F ++
Subjt: PL---------VAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPFSPIILSFLFSFRV
|
|
| A0A2N9G7E3 Integrase catalytic domain-containing protein | 2.3e-99 | 41.94 | |
Query: LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
++SNP+ + L+NI NLVSV LD TNY+LW+FQI+ K++KL VDG+ P+ + D+ W +DQALI++I ATLS +AL+ VI
Subjt: LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
Query: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
G ++++ VW+ LEK F+S +R+NV+ LK +L SI KK+ ESI+ Y++++KE +KL AV I+AE+ + ++GLP+ + F +++RTR S++F ELH
Subjt: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
Query: ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
+LM EE +L++ T+ + + HLAM + GN + S + GGR N GRGGR F N + ++ P P ++ + S+R T
Subjt: ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
Query: FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
QIC + GH ALDCY++M++++QG+HPP KLAAM+ S++ SS SN W+SDTG H T DLANL + YNG + +TVGNGQ LPI+H G
Subjt: FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
Query: PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
QL L RVP++ TNLLSV + C DNNCCF FD+S F+IQD +GKVL+ G + GLYP+ ++P P +A
Subjt: PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
Query: VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
K S++ WH RLGHP IL SV L +S I S S+ CKHC GK+S+ PFS
Subjt: VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
|
|
| A0A2N9HPA0 Uncharacterized protein | 1.3e-97 | 41.39 | |
Query: LTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERL
L+NI NLVSV LD +NY+LW++QI+ + K++ + +VDGT + P E ++ Y+ W RDQ L+TLIN+TLS TALS V+G T+ VW L
Subjt: LTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERL
Query: EKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFE-LHILMKTEESALDQ
EK ++SS+R+N++ LK EL +I K+S +SI+++++++K+ ++L AV ID E+ + + GLP Y+ F T++RTR + +FE +H+L+ EE +L
Subjt: EKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFE-LHILMKTEESALDQ
Query: QTKIYETSNISHLAMTTSVDS---QSRGNWRSHGR---GRGGGRSDGNRNNGRGGRGFLFPNPS-NAPSHSQF-----PSPPQFDGRLSNRVTFQICQEY
Q+ I + +H+AM + + S+GN R GR RG GR+ N N+GRGG N S NA S F SP Q + R QIC +
Subjt: QTKIYETSNISHLAMTTSVDS---QSRGNWRSHGR---GRGGGRSDGNRNNGRGGRGFLFPNPS-NAPSHSQF-----PSPPQFDGRLSNRVTFQICQEY
Query: GHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP
GH ALDCY++M+YSYQG+ PP+KLAAM+A++ N+ SD + W+SDTG H T DL+ + Y G + TVGNGQ++PI+H G QL
Subjt: GHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP
Query: NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSIL
+ F L + RVP +++NLLSV++ C DNNCCF+FD++ F I+D TGK+L+ GPS N LYP+ S P T S+ VWHDRLGHP +
Subjt: NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSIL
Query: NSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPF
+ ++S + S S+ C HC+ GK++ PF
Subjt: NSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPF
|
|
| A0A2N9IB37 Uncharacterized protein | 5.2e-99 | 41.76 | |
Query: LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
++SNP+ + L+NI NLVSV LD TNY+LW+FQI+ K++KL VDG+ P+ + D+ W +DQALI++I ATLS +AL+ VI
Subjt: LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
Query: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
G ++++ VW+ LEK F+S +R+NV+ LK +L SI KK+ ESI+ Y++++KE +KL A+ I+AE+ + ++GLP+ + F +++RTR S++F ELH
Subjt: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
Query: ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
+LM EE +L++ T+ + + HLAM + GN + S + GGR N GRGGR F N + ++ P P ++ + S+R T
Subjt: ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
Query: FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
QIC + GH ALDCY++M++++QG+HPP KLAAM+ S++ SS SN W+SDTG H T DLANL + YNG + +TVGNGQ LPI+H G
Subjt: FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
Query: PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
QL L RVP++ TNLLSV + C DNNCCF FD+S F+IQD +GKVL+ G + GLYP+ ++P P +A
Subjt: PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
Query: VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
K S++ WH RLGHP IL SV L +S I S S+ CKHC GK+S+ PFS
Subjt: VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
|
|
| A0A5J5A1U7 Integrase catalytic domain-containing protein | 8.0e-108 | 41.8 | |
Query: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
T +++ NPS S L+NICNL++ LDS+NY+ W+FQIS + K+H L Y+DGT P DE+ +Y+ W +DQAL+TL+NAT
Subjt: TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
Query: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
LSQTALS+VIG TS++ W LE+ FS+STR+N++ LK+ L +IS K +SID+Y++++K+ + LA+VS +I+ ED +IY +NGLP YN FKTS+RT+
Subjt: LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
Query: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
+ ++T E++ ++K EE ++ K + M T+ S +RG S+ GRG GR + GR F +P+ S+ +P+ P Q +
Subjt: AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
Query: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
R +N V QIC + GH+ALDCY++M++SYQG+ P +L AMSA+ + S + SP N W +DTG H+T+DLANL Y G++NIT+ N
Subjt: GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
Query: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
GQ+L ISH G + + +F L+N+ VP ++TNLLSVHQ C DN+C FIFDS F IQDK+T ++LF GPS +GLYPL + K +P+ Q L
Subjt: GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
Query: ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
TA +G + ST +WHDRLGHP + L S+L+S+ I R +C+HCL GK++K PF
Subjt: ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
|
|
| SwissProt top hits | E-value | % identity | Alignment |
|---|---|---|---|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-11 | 24.15 | |
Query: WRFQISPLRKSHKLFKYVDGTTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGES
W+ ++ L L K +D +K PD ++ E W + D+ + I LS ++ +I T++ +W RLE + S T TN + LK +L ++ G +
Subjt: WRFQISPLRKSHKLFKYVDGTTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGES
Query: IDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNWRSH
+++ ++ +LA + I+ ED+ I +N LPS+Y+ T++ L + I +K SAL K+ + A+ T
Subjt: IDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNWRSH
Query: GRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYN--KMNYSYQGRHPPAKLAAMSASTSH-----SS
GRGR RS + N GR G N S + R+ N C + GH DC N K G+ AAM + + +
Subjt: GRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYN--KMNYSYQGRHPPAKLAAMSASTSH-----SS
Query: PGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP---NASFTLSNLFRVPDISTNLLSVHQLCIDNNCC
++ S +S W+ DT + H T + +L + +GN I+ G G + + + L ++ VPD+ NL+S L D
Subjt: PGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP---NASFTLSNLFRVPDISTNLLSVHQLCIDNNCC
Query: FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSR-SDIGVCKHCLDGKLSKQP
+ F + + + S V+ G + LY A+ Q L A + S +WH R+GH L + S I ++ + + C +CL GK +
Subjt: FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSR-SDIGVCKHCLDGKLSKQP
Query: F
F
Subjt: F
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.9e-45 | 28.86 | |
Query: NNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVI
N S LN N ++N+ L STNY++W Q+ L ++L ++DG+T P + DY W +D+ + + + +S + V
Subjt: NNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVI
Query: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHI
T+ Q+WE L K +++ + +V L+T+L+ +K + ++ID Y++ + ++LA + +D ++Q+ + LP Y + + T
Subjt: GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHI
Query: LMKTEESALDQQTKIYETSNISHLAMT-TSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEY
L + E L+ ++KI S+ + + +T +V ++ ++ G R D NRNN + + + + P+++Q S P QIC
Subjt: LMKTEESALDQQTKIYETSNISHLAMT-TSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEY
Query: GHNALDCYNKMNY--SYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLS
GH+A C ++ S + PP S T L SP SN WL D+G H+TSD NL + Y G +++ V +G ++PISH G LS
Subjt: GHNALDCYNKMNY--SYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLS
Query: LPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLS
+ L N+ VP+I NL+SV++LC N F +SF ++D +TG L G + + LY S P V+L A KA+ + WH RLGHP S
Subjt: LPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLS
Query: ILNSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPFS
ILNSV+++ + V C CL K +K PFS
Subjt: ILNSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPFS
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.8e-41 | 27.7 | |
Query: TNICNLVS---VHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWER
TNI N+ L STNY++W Q+ L ++L ++DG+T P + DY W +D+ + + I +S + V T+ Q+WE
Subjt: TNICNLVS---VHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWER
Query: LEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYN--VFKTSLRTRAHSLTFELHILMKTEESAL
L K +++ + +V T+L+ I++ ++LA + +D ++Q+ + LP Y + + + + SLT E+H E +
Subjt: LEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYN--VFKTSLRTRAHSLTFELHILMKTEESAL
Query: DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYNK
++++K+ ++ + +T +V + N + RG R+ N NN PS++ S S P + GR QIC GH+A C
Subjt: DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYNK
Query: MNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLF
+Q + + + S T L SP ++N WL D+G H+TSD NL Y G +++ + +G ++PI+H G L + S L+ +
Subjt: MNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLF
Query: RVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSF
VP+I NL+SV++LC N F +SF ++D +TG L G + + LY P+ S V++ A KA+ + WH RLGHP L+ILNSV+++
Subjt: RVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSF
Query: IPVSRSD--IGVCKHCLDGKLSKQPFS
+PV + C C K K PFS
Subjt: IPVSRSD--IGVCKHCLDGKLSKQPFS
|
|