| GenBank top hits | e value | %identity | Alignment |
|---|
| AAO73527.1 gag-pol polyprotein [Glycine max] | 5.4e-272 | 47.79 | Show/hide |
Query: GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
G+ + + + R + + ++ T+ R+S +E DWY DS CSRHMTG +F ++ C + FGDG KGKIIG G +
Subjt: GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
Query: GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
GLP L V LV+GL ANLISI+QL D+G+ V+F C +EV + C K +EV +WH+R HL + K+
Subjt: GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
Query: VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
A+ G+ L C EC GKQVK H+ + +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KSETF+ + L +LQR
Subjt: VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
Query: EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
EK+ + RIR+DHG EFEN F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR AE +NTA +IHNRV L GT T YE+W
Subjt: EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
Query: KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
KGRKP+VK+FHIFGS C+I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL P R D E++V
Subjt: KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
Query: APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
LG + +D + +A +D+ E++I+ + D T I K HP IIGD + G+ TR +E ++V+N
Subjt: APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
Query: CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
C+ S +EP V AL+D+ WI AMQEEL QF+RN+VWELVP+P N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRL
Subjt: CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
Query: LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
LL C +FKL+QMDVKSAFLNGYL EEVYV QPKGF + H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG D+T+F+ + + +I
Subjt: LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
Query: -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
FV Q++ EFEM++VGELT+FLG +E + ++ +K+ THLK++KD G VD +LYRS+I
Subjt: -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
Query: GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
GSLLYLTASRPDI +A GV ARYQA+P+ SHL KRILKY++ +WFSKKQN V
Subjt: GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
Query: SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
SLSTAEAEYI AGSSCSQL+WMKQML EY + Q M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IR+LV+ +I L+HV + Q+ADIF K LD
Subjt: SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
Query: FEGLRANVGVC
FE LR +G+C
Subjt: FEGLRANVGVC
|
|
| AAO73529.1 gag-pol polyprotein [Glycine max] | 7.6e-274 | 48.67 | Show/hide |
Query: TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
T SSGR+ PK L + SL +S DWY DS CSRHMTG +F ++ C + FGDG KGKI G G + GLP L V LV+GL NL
Subjt: TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
Query: ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
ISI+QL D+G+ V+F C + + C K +EV +WH+R HL + K+ A+ G+ L C
Subjt: ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
Query: LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
EC GKQVK H+ + +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KS+TF+ + L +LQREK+ + RIR+DHG EFE
Subjt: LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
Query: NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
N F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR AE +NTA +IHNRV L GT T YE+WKGRKP VK+FHIFGS C+
Subjt: NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
Query: IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL P R D E++V + G + +
Subjt: IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
Query: DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
D + + +A +D+ E +I+ + D P I K HP IIGD + G+ TR +E ++V+N C+ S +EP V AL+D+
Subjt: DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
Query: HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
WI AMQEEL QF+RN+VWELVP+P N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRLLL C +FKL+QMDVK
Subjt: HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
Query: SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
SAFLNGYL EE YV QPKGFV+ H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG D+T+F+ + + +I F
Subjt: SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
Query: VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
V Q++ EFEM++VGELT+FLG ++++S ++ ++ +K+ THLK++KD G VD +LYRS+IGSLLYLTASRPDI +A G
Subjt: VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
Query: VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
V ARYQA+P+ SHL+ KRILKY++ +WFSKKQN VSLSTAEAEYI AGSSCSQ
Subjt: VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
Query: LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
L+WMKQML EY + Q M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IRELV+ +I LEHV + Q+ADIF K LD FE LR +G+C
Subjt: LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
|
|
| KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 0.0e+00 | 81.65 | Show/hide |
Query: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
S NSSDWYFDS CSRHMTGNADFFS+LSECK GS+VFGDGGKGKIIGKGTIN GLPFLLDVRL+QGLAANLISI+QL DQGYQVSFN DRCN
Subjt: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
Query: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
DAEVTLCNL KVEE LWHKRL HLSG+TISKVTKVDAIIGL L+F SLE C EC AGKQVKSVHKP+NISSTSHILELLHIDL
Subjt: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
Query: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
MGPMQTESLGRK YAVVCVDDF RYTWIKFILDK ETFKTCQTLFTQLQREKNTG+G+I+TDHGHEFENQHFAEFCDNEGIFHEFSAPLT QQNGV
Subjt: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
Query: NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
AE LNTA HIHNRVIL GTT TSYELWKGRKPNVKYFHIFGSTCFI+SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESIN
Subjt: NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
Query: VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
VIIDDL EGELES A TNETTYLPSHLG SR DMSTPSTSAI +THESEA +SASQHT E+T GATDS KCDLIPP
Subjt: VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
Query: THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
TH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+HWIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Subjt: THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Query: RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRA
Subjt: RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
Query: WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
WYERLSTYLLQQG ++GSADQTMFIYRQG +FLI + G
Subjt: WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
|
|
| KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 1.1e-11 | 50.44 | Show/hide |
Query: MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
MKLDELFGSLR FEIHL HT SRRKLGLALTS VAKLKNQFHK+MG+QRNNREDQ+LR + D S +
Subjt: MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
Query: TRTNSRSSGREPK
R G+E K
Subjt: TRTNSRSSGREPK
|
|
| KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 0.0e+00 | 64.07 | Show/hide |
Query: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
S NS DWYFDS CSRHMTGNADFFS+LSECKAGS+VF DGGKGKIIGKGTINRPGLPFLLDVRLVQGL+ANLIS +QL DQGY+V+F+ DRCN
Subjt: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
Query: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
DAEVTLCNL KVEE GLWHKRL HL G+TISKV K +AIIGL LSFSSLE C ECPAGKQVKSVHKP+NIS TSHILELLHIDL
Subjt: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
Query: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
M PMQTESLGRKRYAVVCVDDF RYTWIKFIL+K ETFKTCQTL TQLQREKNTG+GRIRT+HG EFEN+HFAEFCDNEGIFHEFSA LTPQ+NGVVE+R
Subjt: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
Query: NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
N+TL+EMAR AE LNTA HIHNRVIL TT TSYELWKGRKPNVKYFHIFG TCFI+SDRDHRRKWDSKSDRGIFLGYSAN+RAYRV
Subjt: NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
Query: YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
YNQR+K V+ESINVIIDDLGKEPNRNLDDEDEVFWNSLS K EGE EST PTNETTYLPSH S++ DMSTPSTS ++T+ESEA++SASQHT ERT
Subjt: YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
Query: GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
GATDS K DLIPP HIAKNHP SFIIGD+HSGIIT+KKERKDYAKMVANVCYT SLEPTT+SA L+D+H ILAMQEELLQFERNQVWELVPK PYANIIG
Subjt: GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
Query: TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
TKWIFKNKTDEEGRVI NKARLVAQGYSQIEG N +
Subjt: TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
Query: RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
Subjt: RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
Query: TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
AAWFSKKQNSVSLSTAEAEYI GSSCSQLLWMKQMLDEYGITQSSMIL
Subjt: TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
Query: YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
YCDNLSAISISKN VQHS+ +HI+IQHHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRPT
Subjt: YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
|
|
| TYK21888.1 hypothetical protein E5676_scaffold494G00240 [Cucumis melo var. makuwa] | 0.0e+00 | 64.9 | Show/hide |
Query: SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
S L ECK GS+VFG G KGK IGKGTINRP LPFLLDVRLVQGL+ANLISI+QL DQGYQ DAEV LCNL KVEEVGLWHKRL HLSG+TIS
Subjt: SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
Query: KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
KVTK DAIIGL LSFSSLE C EC A KQ +P+++SSTSH LELLHIDLMGPMQTESLGRK
Subjt: KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
Query: QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
REKNTG+ RI+TDHG EFEN++F EFCDNEGIFHEFSA L PQQNGVVERRNRTL+EMA+ E LNTA HIHNRVIL GTT TS
Subjt: QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
Query: YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
YELWKGRKPNVKYFHIF STCFI+SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+KTVMES+NVII DLGKEPNRNLDDEDE FW+SLS K V+ E
Subjt: YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
Query: LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
EST+ T ETTY P H S+R DMSTPSTS +T E EA++SASQHT ERT G+TDS K L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKM
Subjt: LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
Query: VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
V N+CYTSSLEPTTVS AL+++HWILAMQEELLQFERNQVWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIEGLDFGETF PVARLE
Subjt: VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
Query: AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
AIRLLLSY CFR F+ MD +G +L ++YV D ++ RK + Q+ C K
Subjt: AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
Query: GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
I + G+D A+ K THLKMTKD +KVDTNLYRSIIGSLLYLTASRPDIAF GV A
Subjt: GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
Query: RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
RY D R S + I A WFSKKQNSVSLSTAEAEYI AGSSCSQLLWMKQMLDEYGITQSSM LYCDNLSAISISKN
Subjt: RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
Query: LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
VQHSR +HIDI+HHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRP
Subjt: LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A5A7V046 Gag-pol polyprotein | 0.0e+00 | 81.65 | Show/hide |
Query: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
S NSSDWYFDS CSRHMTGNADFFS+LSECK GS+VFGDGGKGKIIGKGTIN GLPFLLDVRL+QGLAANLISI+QL DQGYQVSFN DRCN
Subjt: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
Query: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
DAEVTLCNL KVEE LWHKRL HLSG+TISKVTKVDAIIGL L+F SLE C EC AGKQVKSVHKP+NISSTSHILELLHIDL
Subjt: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
Query: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
MGPMQTESLGRK YAVVCVDDF RYTWIKFILDK ETFKTCQTLFTQLQREKNTG+G+I+TDHGHEFENQHFAEFCDNEGIFHEFSAPLT QQNGV
Subjt: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
Query: NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
AE LNTA HIHNRVIL GTT TSYELWKGRKPNVKYFHIFGSTCFI+SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESIN
Subjt: NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
Query: VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
VIIDDL EGELES A TNETTYLPSHLG SR DMSTPSTSAI +THESEA +SASQHT E+T GATDS KCDLIPP
Subjt: VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
Query: THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
TH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+HWIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Subjt: THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Query: RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRA
Subjt: RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
Query: WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
WYERLSTYLLQQG ++GSADQTMFIYRQG +FLI + G
Subjt: WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
|
|
| A0A5A7V046 Gag-pol polyprotein | 5.5e-12 | 50.44 | Show/hide |
Query: MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
MKLDELFGSLR FEIHL HT SRRKLGLALTS VAKLKNQFHK+MG+QRNNREDQ+LR + D S +
Subjt: MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
Query: TRTNSRSSGREPK
R G+E K
Subjt: TRTNSRSSGREPK
|
|
| A0A5A7V046 Gag-pol polyprotein | 0.0e+00 | 64.07 | Show/hide |
Query: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
S NS DWYFDS CSRHMTGNADFFS+LSECKAGS+VF DGGKGKIIGKGTINRPGLPFLLDVRLVQGL+ANLIS +QL DQGY+V+F+ DRCN
Subjt: SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
Query: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
DAEVTLCNL KVEE GLWHKRL HL G+TISKV K +AIIGL LSFSSLE C ECPAGKQVKSVHKP+NIS TSHILELLHIDL
Subjt: ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
Query: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
M PMQTESLGRKRYAVVCVDDF RYTWIKFIL+K ETFKTCQTL TQLQREKNTG+GRIRT+HG EFEN+HFAEFCDNEGIFHEFSA LTPQ+NGVVE+R
Subjt: MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
Query: NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
N+TL+EMAR AE LNTA HIHNRVIL TT TSYELWKGRKPNVKYFHIFG TCFI+SDRDHRRKWDSKSDRGIFLGYSAN+RAYRV
Subjt: NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
Query: YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
YNQR+K V+ESINVIIDDLGKEPNRNLDDEDEVFWNSLS K EGE EST PTNETTYLPSH S++ DMSTPSTS ++T+ESEA++SASQHT ERT
Subjt: YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
Query: GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
GATDS K DLIPP HIAKNHP SFIIGD+HSGIIT+KKERKDYAKMVANVCYT SLEPTT+SA L+D+H ILAMQEELLQFERNQVWELVPK PYANIIG
Subjt: GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
Query: TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
TKWIFKNKTDEEGRVI NKARLVAQGYSQIEG N +
Subjt: TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
Query: RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
Subjt: RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
Query: TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
AAWFSKKQNSVSLSTAEAEYI GSSCSQLLWMKQMLDEYGITQSSMIL
Subjt: TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
Query: YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
YCDNLSAISISKN VQHS+ +HI+IQHHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRPT
Subjt: YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
|
|
| A0A5D3DEL4 Integrase catalytic domain-containing protein | 0.0e+00 | 64.9 | Show/hide |
Query: SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
S L ECK GS+VFG G KGK IGKGTINRP LPFLLDVRLVQGL+ANLISI+QL DQGYQ DAEV LCNL KVEEVGLWHKRL HLSG+TIS
Subjt: SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
Query: KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
KVTK DAIIGL LSFSSLE C EC A KQ +P+++SSTSH LELLHIDLMGPMQTESLGRK
Subjt: KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
Query: QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
REKNTG+ RI+TDHG EFEN++F EFCDNEGIFHEFSA L PQQNGVVERRNRTL+EMA+ E LNTA HIHNRVIL GTT TS
Subjt: QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
Query: YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
YELWKGRKPNVKYFHIF STCFI+SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+KTVMES+NVII DLGKEPNRNLDDEDE FW+SLS K V+ E
Subjt: YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
Query: LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
EST+ T ETTY P H S+R DMSTPSTS +T E EA++SASQHT ERT G+TDS K L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKM
Subjt: LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
Query: VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
V N+CYTSSLEPTTVS AL+++HWILAMQEELLQFERNQVWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIEGLDFGETF PVARLE
Subjt: VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
Query: AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
AIRLLLSY CFR F+ MD +G +L ++YV D ++ RK + Q+ C K
Subjt: AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
Query: GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
I + G+D A+ K THLKMTKD +KVDTNLYRSIIGSLLYLTASRPDIAF GV A
Subjt: GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
Query: RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
RY D R S + I A WFSKKQNSVSLSTAEAEYI AGSSCSQLLWMKQMLDEYGITQSSM LYCDNLSAISISKN
Subjt: RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
Query: LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
VQHSR +HIDI+HHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRP
Subjt: LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
|
|
| Q84VH6 Gag-pol polyprotein | 3.7e-274 | 48.67 | Show/hide |
Query: TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
T SSGR+ PK L + SL +S DWY DS CSRHMTG +F ++ C + FGDG KGKI G G + GLP L V LV+GL NL
Subjt: TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
Query: ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
ISI+QL D+G+ V+F C + + C K +EV +WH+R HL + K+ A+ G+ L C
Subjt: ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
Query: LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
EC GKQVK H+ + +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KS+TF+ + L +LQREK+ + RIR+DHG EFE
Subjt: LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
Query: NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
N F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR AE +NTA +IHNRV L GT T YE+WKGRKP VK+FHIFGS C+
Subjt: NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
Query: IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL P R D E++V + G + +
Subjt: IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
Query: DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
D + + +A +D+ E +I+ + D P I K HP IIGD + G+ TR +E ++V+N C+ S +EP V AL+D+
Subjt: DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
Query: HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
WI AMQEEL QF+RN+VWELVP+P N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRLLL C +FKL+QMDVK
Subjt: HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
Query: SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
SAFLNGYL EE YV QPKGFV+ H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG D+T+F+ + + +I F
Subjt: SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
Query: VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
V Q++ EFEM++VGELT+FLG ++++S ++ ++ +K+ THLK++KD G VD +LYRS+IGSLLYLTASRPDI +A G
Subjt: VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
Query: VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
V ARYQA+P+ SHL+ KRILKY++ +WFSKKQN VSLSTAEAEYI AGSSCSQ
Subjt: VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
Query: LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
L+WMKQML EY + Q M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IRELV+ +I LEHV + Q+ADIF K LD FE LR +G+C
Subjt: LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
|
|
| Q84VH8 Gag-pol polyprotein | 2.6e-272 | 47.79 | Show/hide |
Query: GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
G+ + + + R + + ++ T+ R+S +E DWY DS CSRHMTG +F ++ C + FGDG KGKIIG G +
Subjt: GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
Query: GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
GLP L V LV+GL ANLISI+QL D+G+ V+F C +EV + C K +EV +WH+R HL + K+
Subjt: GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
Query: VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
A+ G+ L C EC GKQVK H+ + +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KSETF+ + L +LQR
Subjt: VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
Query: EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
EK+ + RIR+DHG EFEN F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR AE +NTA +IHNRV L GT T YE+W
Subjt: EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
Query: KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
KGRKP+VK+FHIFGS C+I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL P R D E++V
Subjt: KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
Query: APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
LG + +D + +A +D+ E++I+ + D T I K HP IIGD + G+ TR +E ++V+N
Subjt: APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
Query: CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
C+ S +EP V AL+D+ WI AMQEEL QF+RN+VWELVP+P N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRL
Subjt: CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
Query: LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
LL C +FKL+QMDVKSAFLNGYL EEVYV QPKGF + H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG D+T+F+ + + +I
Subjt: LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
Query: -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
FV Q++ EFEM++VGELT+FLG +E + ++ +K+ THLK++KD G VD +LYRS+I
Subjt: -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
Query: GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
GSLLYLTASRPDI +A GV ARYQA+P+ SHL KRILKY++ +WFSKKQN V
Subjt: GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
Query: SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
SLSTAEAEYI AGSSCSQL+WMKQML EY + Q M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IR+LV+ +I L+HV + Q+ADIF K LD
Subjt: SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
Query: FEGLRANVGVC
FE LR +G+C
Subjt: FEGLRANVGVC
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04146 Copia protein | 4.3e-86 | 26.69 | Show/hide |
Query: LLDVRLVQGLAANLISITQLYDQGYQVSF---------------------NNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSL
L DV + A NL+S+ +L + G + F NN + + N LWH+R H+S + ++ + + SL
Subjt: LLDVRLVQGLAANLISITQLYDQGYQVSF---------------------NNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSL
Query: LSFSSL--ERCLECPAGKQVKSVHKPINISSTSHI---LELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNT
L+ L E C C GKQ + K + +HI L ++H D+ GP+ +L K Y V+ VD F Y I KS+ F Q + + N
Subjt: LSFSSL--ERCLECPAGKQVKSVHKPINISSTSHI---LELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNT
Query: GVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHS--GTTITSYELWKG
V + D+G E+ + +FC +GI + + P TPQ NGV ER RT+ E AR E + TA ++ NR+ + ++ T YE+W
Subjt: GVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHS--GTTITSYELWKG
Query: RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNR-----------NLDDEDEVFWNSLSPK
+KP +K+ +FG+T ++ ++ + K+D KS + IF+GY N +++++ ++ + + +V++D+ +R + + E++ F N S K
Subjt: RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNR-----------NLDDEDEVFWNSLSPK
Query: PVEGEL-ESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQT----------------------------------DTHESEASISASQHTLERTTGATDSL
++ E + + +L S + S IQT D H +E+ S + + + A
Subjt: PVEGEL-ESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQT----------------------------------DTHESEASISASQHTLERTTGATDSL
Query: KCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAAL---------------SDK-HWILAMQEELLQFERNQVWELV
+ + PT KN II R+ ER ++ +SL ++A DK W A+ EL + N W +
Subjt: KCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAAL---------------SDK-HWILAMQEELLQFERNQVWELV
Query: PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVN
+P NI+ ++W+F K +E G IR KARLVA+G++Q +D+ ETFAPVAR+ + R +LS K+ QMDVK+AFLNG L EE+Y+ P+G
Subjt: PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVN
Query: RVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG-----------IDFLI-----------FVAQIKGEFEMNMVGELTFFL
+ D+V KL KA+YGLKQA R W+E L + S D+ ++I +G +D ++ F + +F M + E+ F+
Subjt: RVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG-----------IDFLI-----------FVAQIKGEFEMNMVGELTFFL
Query: GTLYISLEWIRSDLKELTHLKMTKD----TNGKKVDTNL------------------YRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHLHLAKR
G + I ++ + L + ++K N V T L RS+IG L+Y + +RPD+ A + +RY + + KR
Subjt: GTLYISLEWIRSDLKELTHLKMTKD----TNGKKVDTNL------------------YRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHLHLAKR
Query: ILKYISA-------------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQS
+L+Y+ W +K+QNSV+ S+ EAEY+ + + LW+K +L I
Subjt: ILKYISA-------------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQS
Query: SMI-LYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQ
+ I +Y DN ISI+ N H R +HIDI++HF RE V+ N+I LE++ + QLADIF KPL A F LR +G+ Q
Subjt: SMI-LYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQ
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-102 | 28.23 | Show/hide |
Query: SDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLAANLISITQLYDQGYQVSFNNDRC--------
S+W D+ S H T D F G++ G+ KI G G I G +L DVR V L NLIS L GY+ F N +
Subjt: SDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLAANLISITQLYDQGYQVSFNNDRC--------
Query: ------------NDAEVTLCNLLKVEE---VGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHID
+AE+ L ++ V LWHKR+ H+S + + K I S ++++ C C GKQ + V + +IL+L++ D
Subjt: ------------NDAEVTLCNLLKVEE---VGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHID
Query: LMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVER
+ GPM+ ES+G +Y V +DD R W+ + K + F+ Q ++RE + R+R+D+G E+ ++ F E+C + GI HE + P TPQ NGV ER
Subjt: LMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVER
Query: RNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYEL----WKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANS
NRT+ E R+ E + TA ++ NR S + ++E+ W ++ + + +FG F ++ R K D KS IF+GY
Subjt: RNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYEL----WKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANS
Query: RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHT
YR+++ K V+ S +V+ + R D E N + P V +PS +S + S ST+ ++ E +
Subjt: RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHT
Query: LERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHWIL-AMQEELLQFERNQVWELVPK
L+ + PT + H + + +R+ +Y V + EP ++ LS +K+ ++ AMQEE+ ++N ++LV
Subjt: LERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHWIL-AMQEELLQFERNQVWELVPK
Query: PPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRV
P + KW+FK K D + +++R KARLV +G+ Q +G+DF E F+PV ++ +IR +LS ++ Q+DVK+AFL+G L EE+Y+ QP+GF
Subjt: PPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRV
Query: HQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG----------IDFLIFVAQIKG-----------EFEMNMVGELTFFLG--
+ V KL K+LYGLKQAPR WY + +++ Q K +D ++ R +D ++ V + KG F+M +G LG
Subjt: HQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG----------IDFLIFVAQIKG-----------EFEMNMVGELTFFLG--
Query: --------TLYISLE-WIRSDLKELT-------------HLKMTKDTNGKKVDTN------LYRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHL
L++S E +I L+ HLK++K V+ Y S +GSL+Y + +RPDIA A GV +R+ +P H
Subjt: --------TLYISLE-WIRSDLKELT-------------HLKMTKDTNGKKVDTN------LYRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHL
Query: HLAKRILKYI---------------------------------------------SAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQ
K IL+Y+ + +W SK Q V+LST EAEYI A + +++W+K+ L E G+ Q
Subjt: HLAKRILKYI---------------------------------------------SAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQ
Query: SSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGV
++YCD+ SAI +SKN + H+R +HID+++H+IRE+V+ +++ + + AD+ K + FE + VG+
Subjt: SSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGV
|
|
| P92520 Uncharacterized mitochondrial protein AtMg00820 | 1.5e-19 | 44.72 | Show/hide |
Query: IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
++TR K K K + T EP +V AL D W AMQEEL RN+ W LVP P NI+G KW+FK K +G + R KARLVA+G+ Q
Subjt: IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
Query: EGLDFGETFAPVARLEAIRLLLS
EG+ F ET++PV R IR +L+
Subjt: EGLDFGETFAPVARLEAIRLLLS
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-89 | 28.66 | Show/hide |
Query: WHKRLRHLSGSTISKVTKVDAIIGLSLLSFS-SLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFI
WH RL H + S ++ V + LS+L+ S C +C K K I+ST LE ++ D+ S RY V+ VD F RYTW+ +
Subjt: WHKRLRHLSGSTISKVTKVDAIIGLSLLSFS-SLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFI
Query: LDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEE-----MARAE--------GLNTAGH
KS+ +T T L+ T +G +D+G EF E+ GI H S P TP+ NG+ ER++R + E ++ A A +
Subjt: LDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEE-----MARAE--------GLNTAGH
Query: IHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKE
+ NR+ + ++ G PN +FG C+ ++ K D KS + +FLGYS AY + ++ + S +V D+
Subjt: IHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKE
Query: PNRNLDDEDEVFWNSLSPKPVEGEL-----------------ESTAPTNETTYLPSHLGSSRSDM--STPSTSA--------------IQTDTHESE---
P + E W+ + P + +AP + S+L SS S S+P +A QT TH S+
Subjt: PNRNLDDEDEVFWNSLSPKPVEGEL-----------------ESTAPTNETTYLPSHLGSSRSDM--STPSTSA--------------IQTDTHESE---
Query: -----------------ASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGII------TRKKER--KDYAKMVANVCYTSSLEPTTV
+S + TT A+ S P I P + I+ + + + TR K K K V + EP T
Subjt: -----------------ASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGII------TRKKER--KDYAKMVANVCYTSSLEPTTV
Query: SAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRF
AL D+ W AM E+ N W+LVP PP + I+G +WIF K + +G + R KARLVA+GY+Q GLD+ ETF+PV + +IR++L R +
Subjt: SAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRF
Query: KLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----
+ Q+DV +AFL G L ++VY++QP GF+++ ++V KLRKALYGLKQAPRAWY L YLL G +D ++F+ ++G + L++V I
Subjt: KLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----
Query: ------------KGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASR
F + EL +FLG L++S DL T++ K++ + K D YR I+GSL YL +R
Subjt: ------------KGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASR
Query: PDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYI
PDI++A +++ P HL KRIL+Y++ +W SKKQ V S+ EAEY
Subjt: PDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYI
Query: VAGSSCSQLLWMKQMLDEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVG
++ S++ W+ +L E GI + ++YCDN+ A + N V HSR +HI I +HFIR V++ +R+ HV + QLAD KPL F+ + +G
Subjt: VAGSSCSQLLWMKQMLDEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVG
Query: VCQRP
V + P
Subjt: VCQRP
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.1e-89 | 26.75 | Show/hide |
Query: HRQKEDRRSRSYPATTRTNSRSSGREP-----KLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAG-SIVFGDGGKGKIIGKGTINRPGLPFLLD
H K + + +TT +S P L + N+++W DS + H+T + + S G ++ DG I G+ + P LD
Subjt: HRQKEDRRSRSYPATTRTNSRSSGREP-----KLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAG-SIVFGDGGKGKIIGKGTINRPGLPFLLD
Query: ---VRLVQGLAANLISITQLYD------QGYQVSFNNDRCNDAEVTLCNLLKVE----------------------EVGLWHKRLRHLSGSTISKVTKVD
V V + NLIS+ +L + + + SF N L K E WH RL H S + ++ V
Subjt: ---VRLVQGLAANLISITQLYD------QGYQVSFNNDRCNDAEVTLCNLLKVE----------------------EVGLWHKRLRHLSGSTISKVTKVD
Query: AIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREK
++ L+ L C +C K K I+S S LE ++ D+ S+ RY V+ VD F RYTW+ + KS+ T + ++
Subjt: AIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREK
Query: NTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYELWKG
T +G + +D+G EF ++ GI H S P TP+ NG+ ER++R + EM + A ++ NR+ + ++ G
Subjt: NTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYELWKG
Query: RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKEPNRNLDDEDEVFWNS------
+ PN + +FG C+ +R K + KS + F+GYS AY + + + S +V D+ G ++ + W S
Subjt: RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKEPNRNLDDEDEVFWNS------
Query: ----LSPKPVEGELESTAPTNETTYLP---SHLGSSR---SDMSTPSTSAIQTDTHESEASISASQHTLERTTGA------------------TDSLKCD
L P G T+P ++ P + + SS S +S+PS+S +H + T + + L
Subjt: ----LSPKPVEGELESTAPTNETTYLP---SHLGSSR---SDMSTPSTSAIQTDTHESEASISASQHTLERTTGA------------------TDSLKCD
Query: LIPPTHI--------AKNHPSSFIIG----------------------DIHSGIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEE
I HI N PSS + HS + TR K+ RK K ++ EP T A+ D W AM E
Subjt: LIPPTHI--------AKNHPSSFIIG----------------------DIHSGIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEE
Query: LLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYL
+ N W+LV P PP I+G +WIF K + +G + R KARLVA+GY+Q GLD+ ETF+PV + +IR++L R + + Q+DV +AFL G L
Subjt: LLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYL
Query: YEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----------------KGEF
+EVY++QP GFV++ D+V +LRKA+YGLKQAPRAWY L TYLL G +D ++F+ ++G I L++V I F
Subjt: YEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----------------KGEF
Query: EMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQAD
+ +L +FLG L++S DL T++ K+T + K D YR I+GSL YL +RPD+++A ++Y
Subjt: EMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQAD
Query: PRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQML
P H + KR+L+Y++ +W SKKQ V S+ EAEY ++ S+L W+ +L
Subjt: PRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQML
Query: DEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
E GI S ++YCDN+ A + N V HSR +HI + +HFIR V++ +R+ HV + QLAD KPL F+ +GV + P
Subjt: DEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|
| AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.1e-56 | 31.3 | Show/hide |
Query: VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR
VC + EP+T + A W AM +E+ E WE+ PP IG KW++K K + +G + R KARLVA+GY+Q EG+DF ETF+PV +L +++
Subjt: VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR
Query: LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF
L+L+ + F L Q+D+ +AFLNG L EE+Y+ P G+ R + + V L+K++YGLKQA R W+ + S L+ G + +D T F+ F
Subjt: LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF
Query: ---LIFV-----------------AQIKGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHLKMTKDTN--------------GKKVDTNL
L++V +Q+K F++ +G L +FLG + I DL + T L K ++ G VD
Subjt: ---LIFV-----------------AQIKGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHLKMTKDTN--------------GKKVDTNL
Query: YRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSK
YR +IG L+YL +R DI+FA +++ PR +H +IL YI +W SK
Subjt: YRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSK
Query: KQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSS-MILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRE
KQ VS S+AEAEY + +++W+ Q E + S +L+CDN +AI I+ N V H R +HI+ H +RE
Subjt: KQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSS-MILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRE
|
|
| ATMG00810.1 DNA/RNA polymerases superfamily protein | 3.8e-05 | 23.53 | Show/hide |
Query: TNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA-----------------------------------------
+ K D + +RSI+G+L YLT +RPDI++A + + +P + L KR+L+Y+
Subjt: TNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA-----------------------------------------
Query: -----AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLW
+W +K+Q +VS S+ E EY + ++L W
Subjt: -----AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLW
|
|
| ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 1.1e-20 | 44.72 | Show/hide |
Query: IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
++TR K K K + T EP +V AL D W AMQEEL RN+ W LVP P NI+G KW+FK K +G + R KARLVA+G+ Q
Subjt: IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
Query: EGLDFGETFAPVARLEAIRLLLS
EG+ F ET++PV R IR +L+
Subjt: EGLDFGETFAPVARLEAIRLLLS
|
|