| GenBank top hits | e value | %identity | Alignment |
|---|
| XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume] | 1.3e-145 | 39.67 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRL---QVEALT
WER+KD LRKCP H P W+Q+Q FYNGL + +T++D AAGG+ +AKT +A L+E + +YQW SE+ + KP EV A+ L Q+ LT
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRL---QVEALT
Query: KILNKL----------IGSEIPSLPKVKVE---------------------------------------------------PKPEEIVQEKQNPLEEML-
K ++ L G E+ + P E P P QEK+ LE+ L
Subjt: KILNKL----------IGSEIPSLPKVKVE---------------------------------------------------PKPEEIVQEKQNPLEEML-
Query: ------GNFINETRNLSVKQDIVIGKME--------AINRSNDNIHEGDP--------------------------------------------------
F+ ET+ Q I +E I+ N + P
Subjt: ------GNFINETRNLSVKQDIVIGKME--------AINRSNDNIHEGDP--------------------------------------------------
Query: ----TRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKC
T + QEEE + P+P S + + QIP+PQR RK ++D +F+KFL++ R+L INIPFA+ALEQMP YAKFMK+ILS KRKF HE + +T +C
Subjt: ----TRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKC
Query: SAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEE
SAI+Q+ LP K KD GSF +PC +G+ F ++ALCDLG+SINL+P V +K+G+GE+K TT+ LQ+ DRS+TYP G++EDVLV VD IFPADF+VLDMEE
Subjt: SAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEE
Query: DPEGPIILGRPFLATARALIEVHKGTLTLRVEDDES---VSPADHYEISMPDVFGDELID------VNKEN--------------ALIDRFLGREEKLSL
D + +ILGRPFL T+R LI+V +G LTLRV ++++ V A + D F ELID KEN + D + E L L
Subjt: DPEGPIILGRPFLATARALIEVHKGTLTLRVEDDES---VSPADHYEISMPDVFGDELID------VNKEN--------------ALIDRFLGREEKLSL
Query: DMC-----------------------ARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-------------------------------------
D + + P+L LK LP HLRYA+LG S T PVII+A+L
Subjt: DMC-----------------------ARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-------------------------------------
Query: -----------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFI
IIY ISDS WVSP QVVPKKGGM VVKNE +E + TRTVTGWRVCIDYRKLN ATRKDHFPLPFI
Subjt: -----------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFI
Query: DQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
DQML RLAG+ YYCFLDGYSGYNQ+ IAPEDQEKT FTCP+GT+A+RRMPFG
Subjt: DQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| XP_017970395.1 PREDICTED: uncharacterized protein LOC108660654 [Theobroma cacao] | 1.9e-147 | 47.36 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
WER+K+ L++CP H PDWLQ+Q FYNGL KT +D AAGG+ ++K DA L+EE+ L +YQW SE+ S E + +S L +QV AL+K +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
Query: NKLIGSEIPSLPKVKVEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVK---QDIVIGKMEAINRSNDNIHEGD----PTRKLIQEE---ETAQPV
+ L V QN + EM G+ + + N S K Q I + + I N+ + E + ++ ++E E + V
Subjt: NKLIGSEIPSLPKVKVEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVK---QDIVIGKMEAINRSNDNIHEGD----PTRKLIQEE---ETAQPV
Query: PDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSF
K+ ++ P+PQR +K++L+ +F KF+++ ++LHINIPFA+ALEQMP Y KF+K ILS K K ET+++T +CSAI+Q LPLKLKD GSF
Subjt: PDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSF
Query: TLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARA
T+P +G+ F KAL DLG SINL+P+ ++ KLGLGE K T + L+L DRS YP+G++EDVLV VDKFIFP DF++LDMEED + PIILGRPFLATARA
Subjt: TLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARA
Query: LIEVHKGTLTLRVEDDE-----------SVSPADHYEISM-----PDVF--------------------GDELIDVNKENALIDRFLGREEKLSLDMCA-
LI+V KG LTLRV+D + V+ D + +S+ DVF DE I+ + RF + SLD+
Subjt: LIEVHKGTLTLRVEDDE-----------SVSPADHYEISM-----PDVF--------------------GDELIDVNKENALIDRFLGREEKLSLDMCA-
Query: -------RVNVPS-LELKQLPAHLR--YAFLGESSTFPVIISADLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKA
+ PS LELK LPAHLR + E +I D IIY ISDS WVSPVQ V KKG MIVV N+ +E I TRTVTGWRVC+DY KLNKA
Subjt: -------RVNVPS-LELKQLPAHLR--YAFLGESSTFPVIISADLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKA
Query: TRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
TRKDHF PFIDQML RLAG YYCFL+GYS YNQ+AIAP+DQEKT FTCPYGT+AFRRMPFG
Subjt: TRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] | 1.7e-161 | 41.88 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
WERYKD +R CPQHG PDWLQ+Q FYNGL +TI+D A+GG+ ++KT + A +L+EE+ +YQW +E+ + E E +AL QV +L+ +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
Query: NKLIGSEIP--------SLPKVKVEPKPEEIVQ------------------------------------------------EKQNPLEEMLGNFINETRN
+ L IP S V + +E VQ EK+ LE+ + +F+ ET+
Subjt: NKLIGSEIP--------SLPKVKVEPKPEEIVQ------------------------------------------------EKQNPLEEMLGNFINETRN
Query: LSVKQDIVIGKMEA-----------------------------------------------------INRSNDNIHEGDPT--------RKLIQEE----
K D + +E I RS E PT K+ +EE
Subjt: LSVKQDIVIGKMEA-----------------------------------------------------INRSNDNIHEGDPT--------RKLIQEE----
Query: ---ETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLP
ET P D+ P++ +PYPQRF+K++LD +FSKFLDI +++HINIPFADALEQMP+YAKF+K+I+S KR+ E ETV ++ +CSAIIQK LP
Subjt: ---ETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLP
Query: LKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILG
KLKDPGSFTLPC +G+ F K LCDLGASINL+P VYRKLGLGE+K TTI LQL DRS+ YP+G++EDVLV VDKFIFPADFVVLDMEED E P+ILG
Subjt: LKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILG
Query: RPFLATARALIEVHKGTLTLRVEDDESV---------------------------------SPADHYE-------------------ISMPDVFGD----
RPFLAT RAL++V KG LTLRV +E PADH E +P V +
Subjt: RPFLATARALIEVHKGTLTLRVEDDESV---------------------------------SPADHYE-------------------ISMPDVFGD----
Query: --------ELIDVNKENALIDRFLGREEK-LSLDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR---------------------------
++ +KE ++ + + + +S + + P ELKQLP HLRYAFLG+S TFPVI++A L
Subjt: --------ELIDVNKENALIDRFLGREEK-LSLDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR---------------------------
Query: ---------------------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKAT
IIYAISDS WVSPVQVVPKKGGM VVKN+ +E I TRTVTGWRVC+DYRKLNKAT
Subjt: ---------------------------------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKAT
Query: RKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
RKDHFPLPFIDQML RLAGY YYCFLDGYSGYNQ+AIAPEDQEKT FTCPYGT+AFRRMPFG
Subjt: RKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa] | 2.8e-151 | 45.51 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSAL----------R
WER+KD LRKCP HG WLQ+ FYNGL + +T++D AAGG+F+ K+ +A L+EE+ +T+ QW +E+ S +K EV A+ +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSAL----------R
Query: LQVEALTKILNKLIGSEIPSLPKVKVEPKPE-EIVQEKQNPLEEML-----GNFINETRNLSVKQDIVIGKMEAINRSNDNIHEGDPTRKLIQEEETAQP
Q++ T+ + G + L + E + + +Q + L + GN + T ++ K++ +AI + ++G P++ + +E QP
Subjt: LQVEALTKILNKLIGSEIPSLPKVKVEPKPE-EIVQEKQNPLEEML-----GNFINETRNLSVKQDIVIGKMEAINRSNDNIHEGDPTRKLIQEEETAQP
Query: VP----DKSSDSI-------PLVLN---QIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAI
P K++D + P+ ++ +IPYPQR RK D +FSKFL++ R+LHINIPFA+ALEQMP Y KFMKEILS KRK E+ ETV +T +CSAI
Subjt: VP----DKSSDSI-------PLVLN---QIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAI
Query: IQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPE
+QK LP KLKDPGSFT+PC +G + ALCDLGASINL+P V+++L LGE K TT+ LQL DRS+ +P+GV+EDVLV VDKFIFPADF+VLDMEED
Subjt: IQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPE
Query: GPIILGRPFLATARALIEVHKG--------TLTLRVEDDESVSPADHYEISM-PDVFGDELIDVNKENALIDRFLGREEKLSLDMCARVNVPSLELKQLP
PIILGRPFLAT +ALI++ + LTL E+ + YE +M D +G +E +I + + P LELK LP
Subjt: GPIILGRPFLATARALIEVHKG--------TLTLRVEDDESVSPADHYEISM-PDVFGDELIDVNKENALIDRFLGREEKLSLDMCARVNVPSLELKQLP
Query: AHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DLRIIYAI
HLRY +LGE+ T PVI+++ D + Y I
Subjt: AHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DLRIIYAI
Query: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
SDS WVSPVQVVPKKGGM VVKNEK+E I TRTVTGWR+CIDYRKLNKATRKDHFPLPFIDQML +LAG YYCFLDGYSGY+Q+AIAPEDQEKT FTCP
Subjt: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
Query: YGTYAFRRMPFG
YGT+AFRRMPFG
Subjt: YGTYAFRRMPFG
|
|
| XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera] | 8.1e-143 | 38.59 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKP--CENEEVSALRLQVEALTK
WER+KD RKCP HG PDWL +Q FYNGL + +D AAGG+ ++K+ ++A L+EE+ +YQW +E+ + K+ P + + ++ L +V++L K
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKP--CENEEVSALRLQVEALTK
Query: ILNKL-----IGSEI-----------------------------------------------------------------------PSLPKVK-------
+ +KL + S + PS P+ K
Subjt: ILNKL-----IGSEI-----------------------------------------------------------------------PSLPKVK-------
Query: -------------VEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVKQDI---------------VIGKMEAINRSNDNIHEGDPTRKLIQEEETA
+E K +++ +N +E LG N NL K ++ +G++ D + + +K+ +E E
Subjt: -------------VEPKPEEIVQEKQNPLEEMLGNFINETR-----NLSVKQDI---------------VIGKMEAINRSNDNIHEGDPTRKLIQEEETA
Query: QPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDP
P P V IP+PQR ++ ++D +F KFL + RQLHINIPFADAL Q+P Y KF+KEI+S KRK E+ ET+ +T +CSAIIQ LP KL+DP
Subjt: QPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDP
Query: GSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLAT
GSF++PC +G + +ALCDLGAS++L+P V RKLGL ELK TTI LQL DRS+ YP G+LE+VL+ V KFI P DF+VL+MEED E PIILGRPFLAT
Subjt: GSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLAT
Query: ARALIEVHKGTLTLRVEDDE---SVSPADHYEISMPDVFGDELIDVNK-------------ENALIDRFLGREEKLSLDMCA------------------
A A+I+V G LTL+V ++E ++ A Y VF +++D + E L+ +++ L + A
Subjt: ARALIEVHKGTLTLRVEDDE---SVSPADHYEISMPDVFGDELIDVNK-------------ENALIDRFLGREEKLSLDMCA------------------
Query: ------------RVNVPSLELKQLPAHLRYAFLGESSTFPVIIS--------------------------ADLR--------------------------
V P LELK LP+HL YAFLGE++T PVI+S +DLR
Subjt: ------------RVNVPSLELKQLPAHLRYAFLGESSTFPVIIS--------------------------ADLR--------------------------
Query: --------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCF
IIY ISDS W+SPVQVVPKKGGM VV NE +E I TRTVTGWRVCIDYRKLN TRKDHFPLPF+DQ+L RLAGY YYCF
Subjt: --------------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCF
Query: LDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
LDGYSGYNQ++I+PEDQEKT FTCPYGT+AFRRMPFG
Subjt: LDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2G9HYA0 Reverse transcriptase | 6.2e-141 | 40.46 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
W R++ LR CP H P +Q+ FY+GL K LD G SFL+ T + L+ L Y+ +SE+ E ++V+AL +++ L + +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
Query: NKLIGSEIPSLPKV---------------------------KVEPKP-------------------------------------EEIVQEKQNPLEEMLG
+++ P K + P ++ +QEK+ LEE L
Subjt: NKLIGSEIPSLPKV---------------------------KVEPKP-------------------------------------EEIVQEKQNPLEEMLG
Query: NFINETRNLSVKQDIVIGKM-EAIN----------------------------RSNDNIHE--GDPTR---KLIQEEETAQPVPDKSSDSIPLVLNQIPY
F+ T + IG++ AIN R+ + E +PT+ K + EE + V S P L Q P+
Subjt: NFINETRNLSVKQDIVIGKM-EAIN----------------------------RSNDNIHE--GDPTR---KLIQEEETAQPVPDKSSDSIPLVLNQIPY
Query: PQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCD
PQR +K++L+ +F KFL++ ++LHINIPFA+ALEQMP Y KFMK+ILS KR+ ++ETV +T +CSAIIQ LP KLKDPGSFT+PC +G+ F +ALCD
Subjt: PQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCD
Query: LGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVED--
LGASINL+P+ +YR LGLGE K T+I LQL DRS+TYPKGV+ED+LV VDKFIFPADFVVLDME D E PIILGRPFLAT R LI+V KG LT+RV+D
Subjt: LGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVED--
Query: -----------------------------DESVS--PADHYEISMPDVFGDELIDVNKENALI------DRFLGREEKLSLDMCARVNV--------PSL
+ES++ P D E ++ D+ +E N+E+ + +FL SL+ V P+L
Subjt: -----------------------------DESVS--PADHYEISMPDVFGDELIDVNKENALI------DRFLGREEKLSLDMCARVNV--------PSL
Query: ELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DL
ELK LP+HL YA+LGES T PVIIS+ D
Subjt: ELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------------DL
Query: RIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEK
IIY ISDS WVSPVQ VPKKGG+ VV N +E I TRTVTGWRVC+DYRKLNKATRKDHFPLPFIDQML RLAG +YCFLDGYSGYNQ+AIAPEDQEK
Subjt: RIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEK
Query: TKFTCPYGTYAFRRMPFG
T FTCPYGT+AFRRMPFG
Subjt: TKFTCPYGTYAFRRMPFG
|
|
| A0A2G9HYD8 Reverse transcriptase | 5.3e-140 | 41.3 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
W R++ LR CP H P +Q+ FY+GL K LD G SFL+ T + L+ L Y+ +SE+ + E ++V+AL +++ L + +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
Query: NKL-------IGSEIPSLPKVKVEPKP-----------------------------------------EEIVQEKQNPLEEMLGNFINETRNLSVKQDIV
+ S+ V KP ++ +QEK+ LEE L F+ +
Subjt: NKL-------IGSEIPSLPKVKVEPKP-----------------------------------------EEIVQEKQNPLEEMLGNFINETRNLSVKQDIV
Query: IGKM-EAINR----SNDNIHEGDPT---------------RKLIQEEETAQPVPDKSSDSI----------PLVLN-----QIPYPQRFRKKRLDAEFSK
IG++ AIN S + E +P RKL +E +P K + I PL ++ Q P+PQ+ +K++L+ +F K
Subjt: IGKM-EAINR----SNDNIHEGDPT---------------RKLIQEEETAQPVPDKSSDSI----------PLVLN-----QIPYPQRFRKKRLDAEFSK
Query: FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
FL++ ++LHINIPFA+ALEQMP Y KFMK+ILS KR+ ++ET +T +C+AIIQ LP KLKDPGSFT+PC +G+ F +ALCDLGASINL+P+ +YR
Subjt: FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
Query: LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---SVSPADHYEISM
LGLGE K T+I LQL DRS+TYPKGV+ED+LV VDKFIFPADFVVLDME D E PIILGRPFLAT R LI+V KG LT+RV+D + +V A +
Subjt: LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---SVSPADHYEISM
Query: PDVFGDELID-------------VNKENALIDRF-LGREEKL------------------SLDMCARVNV--------PSLELKQLPAHLRYAFLGESST
+ F L D + E AL+D G EE L SL+ V P+LELK LP HL Y +LGES T
Subjt: PDVFGDELID-------------VNKENALIDRF-LGREEKL------------------SLDMCARVNV--------PSLELKQLPAHLRYAFLGESST
Query: FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
PVIIS+ D IIY ISDS WVSPVQ VP
Subjt: FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
Query: KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
KKGG+ VV N +E I TRTVTGWRVC+DYRKLNKATRKDHFPLPFIDQML RLAG +YCFLDGYSGYNQ+AIAPEDQEKT FTCPYGT+AFRRMPFG
Subjt: KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| A0A2P5F9V5 Uncharacterized protein | 1.6e-141 | 53.21 | Show/hide |
Query: DPTRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSA
+ T++++ E +P D+ P + IPYPQRF+K++LD FSKFLD+ ++LHINIPFADALEQM Y KFMKEILSNKRK E +ETV +T +CSA
Subjt: DPTRKLIQEEETAQPVPDKSSDSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSA
Query: IIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDP
I+QK LP KLKDPGSFT+PC +G+ F KALCDLGASINL+P VY+KLGLGE K TT+ LQL DRS+ YP+G +EDVLV VD+FIFPADF+VLDMEED
Subjt: IIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDP
Query: EGPIILGRPFLATARALIEVHKGTLTLRVEDD------------ESVSPADHYEISMPDVFGDELIDVNKENALIDRF--------------LGREEKLS
+ IILGRPFLAT R LI+V KG L LRV+++ V I + + + K + R LG
Subjt: EGPIILGRPFLATARALIEVHKGTLTLRVEDD------------ESVSPADHYEISMPDVFGDELIDVNKENALIDRF--------------LGREEKLS
Query: LDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-----------------------------------------------------------
L + V P LELK L AHLRYAFLGESST PVIISA L
Subjt: LDMCARVNVPSLELKQLPAHLRYAFLGESSTFPVIISADLR-----------------------------------------------------------
Query: -------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGY
IIY ISDS WVSPVQVVPKKGGM VV+N+ +E I TRTVTGWRVCIDYRKLNKATRKDHFPLPF+DQML RLAGYP+YCFLDGYSGY
Subjt: -------------IIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGY
Query: NQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
NQ+AIAPEDQEKT FTCPYGT+A+RR+PFG
Subjt: NQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| A0A6P6GDJ4 Reverse transcriptase | 5.3e-140 | 39.8 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQP-ESIIKEKPCENEEVSALRLQVEA-LTK
WER+K+ LR+CP HG+P W+Q+Q FYNGL K ++D AAGGS + K +A L+EE+ +YQ+ SE+ + K E+ +++ L Q+ K
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQP-ESIIKEKPCENEEVSALRLQVEA-LTK
Query: ILNKLIG-----------------SEIPSLPKV--------------------KVEPKPEEIVQ--------EKQNPLEEMLGNFI----NETRNLSVKQ
LN G ++I ++ + + E P+E VQ + P E G I +E +
Subjt: ILNKLIG-----------------SEIPSLPKV--------------------KVEPKPEEIVQ--------EKQNPLEEMLGNFI----NETRNLSVKQ
Query: DIV--------IGKMEAINRSNDNIHEG--DPTRKLIQ---EEETAQPVPDKSSDSIPLVLNQI-------------------PYPQRFRKKRLDAEFSK
+V + EA R ++ + + P K I E + P K + P + I P+P RFR +LD +F K
Subjt: DIV--------IGKMEAINRSNDNIHEG--DPTRKLIQ---EEETAQPVPDKSSDSIPLVLNQI-------------------PYPQRFRKKRLDAEFSK
Query: FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
FLD+ +QLH+NIPF DALEQMP Y KF+KEILSNKR++EN+E V ++ + SA + +P KLKDPGSF +PC +G + ALCDLGASINL+P+ +YRK
Subjt: FLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQKHLPLKLKDPGSFTLPCIVGSLFIKKALCDLGASINLIPFIVYRK
Query: LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---------------
LG+G++K TT+ LQ+ DRS+ P+G+LEDVLV V+KFIFPADFV+LDMEED PIILGRPFLAT RALI+V + +TLRV ++E
Subjt: LGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALIEVHKGTLTLRVEDDE---------------
Query: ------SVSPADHY-----------EISMPDVFGDELIDVNKENALID-------RFLGREEKLSLDMCARV----NVPSLELKQLPAHLRYAFLGESST
V D + + ++FG + + +E +ID ++G+ E L ++ + P ++LK LP+HL+Y FLG ++T
Subjt: ------SVSPADHY-----------EISMPDVFGDELIDVNKENALID-------RFLGREEKLSLDMCARV----NVPSLELKQLPAHLRYAFLGESST
Query: FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
PVIIS+ D IIY ISDS WVSP+QVVP
Subjt: FPVIISA------------------------------------------------------------------------DLRIIYAISDSPWVSPVQVVP
Query: KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
KKGGM V +N+K E I TR VTGWRVCIDYRKLNKATRKDHFPLPFIDQML RLAG YYCFLDGYSGYNQ+AIAP+DQEKT FTCPYGT+A+RRMPFG
Subjt: KKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| A0A6P8DKJ2 uncharacterized protein LOC116204231 | 2.7e-136 | 37.65 | Show/hide |
Query: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
WER+K+ +RKCP HG PD L I+ FY L ++++D AAGG+ + K +A L+EE+ +++ WQ+E+ +S + + + ++ L Q+ ALT +
Subjt: WERYKDFLRKCPQHGYPDWLQIQFFYNGLKRDFKTILDTAAGGSFLAKTVKDARTLVEELPLTSYQWQSEQPESIIKEKPCENEEVSALRLQVEALTKIL
Query: NKLIGS-----------EIPSLPKVKVE---------PKPEEI-----------------------------------------------------VQEK
+KL + E+ S P +E P E++ Q+
Subjt: NKLIGS-----------EIPSLPKVKVE---------PKPEEI-----------------------------------------------------VQEK
Query: QNPLEEMLGNFINETRNLSVKQDIVI----GKMEAINRSNDNIHEGD-------------------------PTRKLIQEEETAQ--------PVPDKSS
Q+ +EE++ +++ +T + Q I G++ I++ N G RK +EE+ + P + S
Subjt: QNPLEEMLGNFINETRNLSVKQDIVI----GKMEAINRSNDNIHEGD-------------------------PTRKLIQEEETAQ--------PVPDKSS
Query: DSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQK---HLPLKLKDPGSFTL
+ + +P+P R ++++LDA+F+KFLD+ ++L INIPFA+AL+QMP YA+FMK++L+ KRKF+ E V +T +CS I+QK +LP K +D GSFT+
Subjt: DSIPLVLNQIPYPQRFRKKRLDAEFSKFLDILRQLHINIPFADALEQMPHYAKFMKEILSNKRKFENHETVNMTAKCSAIIQK---HLPLKLKDPGSFTL
Query: PCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALI
PC +G+ + L D GASINL+P ++RKLGLGE K T I LQL DRS+ YPKG++E+VLV VDKFIFP DF+VL+MEED E P+ILGRPFLAT +ALI
Subjt: PCIVGSLFIKKALCDLGASINLIPFIVYRKLGLGELKSTTIVLQLPDRSMTYPKGVLEDVLVNVDKFIFPADFVVLDMEEDPEGPIILGRPFLATARALI
Query: EVHKGTLTLRVEDD-------ESVSPADHYEISMPDVFGDELIDVN-KENALIDRFLG------------REEKLSLDMCARVNV---------------
+V +G LTLRV ++ +++ D + DELI + +E A +D E+ S++ + +
Subjt: EVHKGTLTLRVEDD-------ESVSPADHYEISMPDVFGDELIDVN-KENALIDRFLG------------REEKLSLDMCARVNV---------------
Query: -----PSLELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------
P LELK LP+HL+YA+LG T P+IIS+
Subjt: -----PSLELKQLPAHLRYAFLGESSTFPVIISA------------------------------------------------------------------
Query: ------DLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVA
D IIY ISDS WVSPVQVVPKKGGM VVKNE ++ I TRTVTGWRVCIDYRKLN ATRKDHFPLPFIDQML +LAG+ YYCFLDGYSGYNQ+
Subjt: ------DLRIIYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVA
Query: IAPEDQEKTKFTCPYGTYAFRRMPFG
IAPEDQEKT FTCPYGT+AFRRMPFG
Subjt: IAPEDQEKTKFTCPYGTYAFRRMPFG
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.6e-13 | 37.93 | Show/hide |
Query: IYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTK
I S+SP+ SP+ VVPKK +D S + +R+ IDYRKLN+ T D P+P +D++L +L Y+ +D G++Q+ + PE KT
Subjt: IYAISDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTK
Query: FTCPYGTYAFRRMPFG
F+ +G Y + RMPFG
Subjt: FTCPYGTYAFRRMPFG
|
|
| P10401 Retrovirus-related Pol polyprotein from transposon gypsy | 1.8e-12 | 39.29 | Show/hide |
Query: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
S SP+ SP VV KKG K R+ ID+RKLN+ T D +P+P I +L L ++ LD SGY+Q+ +A D+EKT F+
Subjt: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
Query: YGTYAFRRMPFG
G Y F R+PFG
Subjt: YGTYAFRRMPFG
|
|
| P20825 Retrovirus-related Pol polyprotein from transposon 297 | 1.4e-12 | 38.39 | Show/hide |
Query: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
S+SP+ SP VVPKK D S + + +RV IDYRKLN+ T D +P+P +D++L +L Y+ +D G++Q+ + E KT F+
Subjt: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
Query: YGTYAFRRMPFG
G Y + RMPFG
Subjt: YGTYAFRRMPFG
|
|
| Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 1.1e-17 | 41.13 | Show/hide |
Query: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
S SP SPV +VPKK G +R+C+DYR LNKAT D FPLP ID +L R+ + LD +SGY+Q+ + P+D+ KT F P
Subjt: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
Query: YGTYAFRRMPFGPGENPQIGSPYM
G Y + MPFG P + YM
Subjt: YGTYAFRRMPFGPGENPQIGSPYM
|
|
| Q99315 Transposon Ty3-G Gag-Pol polyprotein | 1.1e-17 | 41.13 | Show/hide |
Query: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
S SP SPV +VPKK G +R+C+DYR LNKAT D FPLP ID +L R+ + LD +SGY+Q+ + P+D+ KT F P
Subjt: SDSPWVSPVQVVPKKGGMIVVKNEKDESISTRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLYRLAGYPYYCFLDGYSGYNQVAIAPEDQEKTKFTCP
Query: YGTYAFRRMPFGPGENPQIGSPYM
G Y + MPFG P + YM
Subjt: YGTYAFRRMPFGPGENPQIGSPYM
|
|