| GenBank top hits | e value | %identity | Alignment |
|---|
| CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera] | 6.9e-157 | 40.04 | Show/hide |
Query: FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
+ ++ +++ D +H++KT++D RI+KFLVGLNVEFDEVR RI+ + LP++ + FS+VRREES+RNVM+GKK +++ S LVT K + K+
Subjt: FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
Query: HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
++P VWCD CNKP HTRE CWK+HGK NWK K ++ AN ++S EQ++ +L LLKSN T G SVSLA TGN ALSC S+PW
Subjt: HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
Query: IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
IIDSGA+DHMT+ S +F+SYSP V E + D+ S +TIG
Subjt: IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
Query: ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
ARMI+GLYYF++ S+K QGLSS+SSL V++ IM WH +LG P+F YLKHLFP LF+ +D FQCE C K R T++ K Y S PFYL H+DVW
Subjt: ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
Query: GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
GPSKV T +GK+W IE QFQTKI IL SDNGT++FN+ TF + KGI+HQ++C DTPQQNG+A+RKN
Subjt: GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
Query: RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
+HLLE+ARA+MF M++PKYL GDA+LTA+YLINRMPTK + ++P
Subjt: RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
Query: ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
+S MEN S G E L+ + E Y+R+
Subjt: ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
Query: RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
+ R ++Q + Q TP++ +S LS PS P +S DLD+PIA RKG+ CTK+LIA
Subjt: RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
Query: YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
Y+SY LSDNH+AFT+ I+ L +PRNIQEAL++ +WKLAV +EMNAL K+GTW+ VDLP +KK VGCKWVFTIK ADGS+ERYKARLVAKGFT
Subjt: YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| CAN79134.1 hypothetical protein VITISV_000843 [Vitis vinifera] | 5.8e-180 | 41.76 | Show/hide |
Query: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
ENSMVMTWL+NSM EDIN NYMCY T +ELW++V QMY DLGNQSQ+FEL LKLG++RQG ++VT+YF+SLK+IWQ+LD F TYEWKS D H++KT++
Subjt: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
Query: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
D RI+KFL GLNVEFDE K+ ++P WCD CNKP HTRE CW
Subjt: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
Query: KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
K+HGKP NWK K ++ N ++SP EQ++ L LLKSN T G SVSLA TGN ALSC S+PWI+D GA+DHMT+ S +F+SYSP
Subjt: KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
Query: VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
+ KSVL +DQ SG+TIG ARMIDGLYYF++ S+K QGLSS+SSL V++ IM WH RLGHP+F YL
Subjt: VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
Query: KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
KHLFP LF+ +D FQCE C K R T++PK Y S PFYL H+DVWGPSKV T +GK+W IE Q
Subjt: KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
Query: FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
FQTKI IL SDNG E+FN+ TF ++KGI+HQ++C DT +QNG+AE KN+HLLE+ARA+MF M++PKYL DA+LTA+YLINRMPTK + ++P
Subjt: FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
Query: ------------------------------------------------------------------SISSMEN---------------------------
+S MEN
Subjt: ------------------------------------------------------------------SISSMEN---------------------------
Query: -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
S E +T T + E L+ RN + R + ++DQ P +G PK N +++S
Subjt: -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
Query: PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
PS P +S DLD+PIA RKG+ CTK+ I+ Y+SY LSDN++AFT+ I+ L +PRNIQE L++ +WKLAV EEMNAL K+
Subjt: PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
Query: GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
GTW+++DLP +KK VGCKWVFTIK DGS+ERYKARLVAK
Subjt: GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
|
|
| GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum] | 1.5e-204 | 46.35 | Show/hide |
Query: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
++SVRMY+RG+ ENSMVMTWL+NSM E+I++NY+CY TAK+LWD+V+QMYSDL NQSQV+EL L+LG ++QG +SV
Subjt: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
Query: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
T+YF+ LKRIWQ+LDLF+ YEWKS D KHY KTVD R++KFL GLNVEFDEVRGRILG++ +P + +VF++VRREESRR VM+GKK V + V+ SA
Subjt: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
Query: LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
L K+ DK H++CD+C + H RE C+KLHG+P N K+ K + ++ ++AN SSP KEQ+D + KLL+SN + N P ++AQ
Subjt: LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
Query: TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
TG ALS N S+PWIIDSGA++HMT+ S LF SY EK
Subjt: TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
Query: --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
S + DQ+SG+ IG AR I+GLYY DE +KK L S S L V + +M WHRRLGHP+F YLK+LFP K I+ S CE C K HR +F
Subjt: --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
Query: PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
K Y S PFYL H+DVWGPSK+ T +GK+W IETQFQTKI IL SDNGTE+FN+ TFL KGIIH
Subjt: PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
Query: QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
Q+TCRDTPQQNG+AERKNRHLLE+ RA+M SM+VPKYL G+A+LTA YLINRMPT+ SC SS+
Subjt: QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
Query: ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
P+ ME S TGGET LTG R+ ELK Y R+ + + QSD+P GP NS + SP
Subjt: ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
Query: ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
S N LP N+ DLD+PIA RK CTK+ I+NYLSY +LS HKA+ S+I+NLF+PR +QEAL D NWKLAV EEM+AL K+
Subjt: ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
Query: GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
TW I D LP+ KKAVGCKWVFT+KC ADGS+ERYKARLVAKGFT
Subjt: GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina] | 4.4e-164 | 37.61 | Show/hide |
Query: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
+NSM+M+WL+NSM ++I Y+ TAK+LWD+VT+ YSDLGN +Q+++L ++ + +QG VT+Y++ LK +WQELD + EW+ D Y+K ++
Subjt: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
Query: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
R+++FL GL+ + DEVRGR+LGK LP+ +VFS VRREESR+NVM+G + ++ ++ E+ + + K+ +K VWCD+C+KP HTR+ CW
Subjt: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
Query: KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
KLHGKPPN K++K ++S +Q +N +S KEQ++Q+ + L +S NPS SLAQ GN AL + PWIIDSGATDHMT
Subjt: KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
Query: SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
S S LF SY P +K SVL + D SG+ IG AR +DGLYYF+
Subjt: SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
Query: EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
E + + Q ++ + +++ IM WH RLGHP+F YL+HLFP LFK + S+FQCE C KHHR++F + YK S+PF LIH+D+WGPS+V +G
Subjt: EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
Query: RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
+W I+TQFQ KI++ +DNG E+F + + GI+HQ++C DTPQQNGVAERKNRHLLE+AR+LM
Subjt: RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
Query: FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
F+ VPK G+A+LTA+YLINRMPT+ SP + S L
Subjt: FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
Query: -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
+T L G D PEL+ YTRRN ++R + + QD N
Subjt: -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
Query: PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
+ +P S + L +DLD+PIAQRKG+ CT + I+ Y+SYHRLS +AFT+ ++ + +P+++Q+AL+ W+ AV EM AL K+ TW++V L
Subjt: PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
Query: PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
PE+KK VGCKW+FT+K ADGS+ERYKARLVAKGFT
Subjt: PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina] | 4.4e-164 | 37.61 | Show/hide |
Query: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
+NSM+M+WL+NSM ++I Y+ TAK+LWD+VT+ YSDLGN +Q+++L ++ + +QG VT+Y++ LK +WQELD + EW+ D Y+K ++
Subjt: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
Query: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
R+++FL GL+ + DEVRGR+LGK LP+ +VFS VRREESR+NVM+G + ++ ++ E+ + + K+ +K VWCD+C+KP HTR+ CW
Subjt: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
Query: KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
KLHGKPPN K++K ++S +Q +N +S KEQ++Q+ + L +S NPS SLAQ GN AL + PWIIDSGATDHMT
Subjt: KLHGKPPNWKSSKQYERYS--------HQHASNANVVDSSPL-KEQIDQILKLL-KSNYTGNPS--VSLAQTGNYPQALSCL--NSSPWIIDSGATDHMT
Query: SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
S S LF SY P +K SVL + D SG+ IG AR +DGLYYF+
Subjt: SFSCLFDSYSPVYSKEK--------------------------SVLPM-------------------------------DQDSGETIGRARMIDGLYYFD
Query: EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
E + + Q ++ + +++ IM WH RLGHP+F YL+HLFP LFK + S+FQCE C KHHR++F + YK S+PF LIH+D+WGPS+V +G
Subjt: EVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGK
Query: RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
+W I+TQFQ KI++ +DNG E+F + + GI+HQ++C DTPQQNGVAERKNRHLLE+AR+LM
Subjt: RW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALM
Query: FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
F+ VPK G+A+LTA+YLINRMPT+ SP + S L
Subjt: FSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGGETL-----------------------
Query: -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
+T L G D PEL+ YTRRN ++R + + QD N
Subjt: -----------------QTDLTGRD------------------------------------------PELKFYTRRNRTQRGRNQTVELTQDQSDTPVNG
Query: PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
+ +P S + L +DLD+PIAQRKG+ CT + I+ Y+SYHRLS +AFT+ ++ + +P+++Q+AL+ W+ AV EM AL K+ TW++V L
Subjt: PKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDL
Query: PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
PE+KK VGCKW+FT+K ADGS+ERYKARLVAKGFT
Subjt: PEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2N9GQ49 Uncharacterized protein | 6.0e-159 | 45.01 | Show/hide |
Query: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
++SVRMYIRG+ ENSMVMTWL+NSM EDI+SNYMCY TA+ELW++V QMYSDLGNQSQ+FEL LKLG+MRQG +SV
Subjt: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
Query: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALV
T+YF+SLKR+WQ+LDLF TYEWKS D +H++K V+D RI+KFL GLN+E DEVRGR++G+ +P + DVFS+VRREESRRNVM+GKK +V+SSALV
Subjt: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALV
Query: -TESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPLKEQIDQILKLLKSN-YTGNPSVSLAQTG
++ + KA +T DKP VWCD+CNKP HTRETCWK+HGKP NWKSSK +R + +S KEQ++ +L LLKSN +G PSVS+AQTG
Subjt: -TESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPLKEQIDQILKLLKSN-YTGNPSVSLAQTG
Query: NYPQALS-CLNSS-PWIIDSGATDHMTSFSCLFDSYSPVYSKE--------------------------KSVLPM---------DQDSGETIGRARMIDG
N P ALS CLNSS PWIIDSGA+DHMTS F+SYSP E KSVL + DQ SG TIG ARMI+G
Subjt: NYPQALS-CLNSS-PWIIDSGATDHMTSFSCLFDSYSPVYSKE--------------------------KSVLPM---------DQDSGETIGRARMIDG
Query: LYYFDEVSTSHKKIQGLSSVSSLPVQETIM---------FWHRRLGHPNFV-----YLKHLFPGLFKGIDCSVFQCEDCKHHRSTFLPKSYKPSSPFYLI
LYYFD+ +S KK QG SS+SS+ V+E IM FW + PN + P +F I+ S+ +D R +FLP + ++
Subjt: LYYFDEVSTSHKKIQGLSSVSSLPVQETIM---------FWHRRLGHPNFV-----YLKHLFPGLFKGIDCSVFQCEDCKHHRSTFLPKSYKPSSPFYLI
Query: HTDVWGP-SKVLTKNGKRWIETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLT
D P S++L KR E I P QN +E N L I+
Subjt: HTDVWGP-SKVLTKNGKRWIETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLT
Query: AAYLINRMPTKSCRSSSPSISSMENSSTGGETLQTDLTGRDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGPKNSGISLSPSSHNTLPNVSDLDIPI
N P +S P +SS S + P+ PKN SDLDIPI
Subjt: AAYLINRMPTKSCRSSSPSISSMENSSTGGETLQTDLTGRDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGPKNSGISLSPSSHNTLPNVSDLDIPI
Query: AQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITN-LFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIER
A RKG CTKY IA Y+SY RLS+NH+AF S I++ + +PRNIQEAL+D NWKLAV+EEMNAL K+GTW++VDLP DKK VGCKWVF++K ADGSIER
Subjt: AQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITN-LFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIER
Query: YKARLVAKGFT
YKARLVAKGFT
Subjt: YKARLVAKGFT
|
|
| A0A2Z6NTX3 Integrase catalytic domain-containing protein | 7.3e-205 | 46.35 | Show/hide |
Query: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
++SVRMY+RG+ ENSMVMTWL+NSM E+I++NY+CY TAK+LWD+V+QMYSDL NQSQV+EL L+LG ++QG +SV
Subjt: TKSVRMYIRGQ-------------------------ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSV
Query: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
T+YF+ LKRIWQ+LDLF+ YEWKS D KHY KTVD R++KFL GLNVEFDEVRGRILG++ +P + +VF++VRREESRR VM+GKK V + V+ SA
Subjt: TQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDS---VDSSA
Query: LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
L K+ DK H++CD+C + H RE C+KLHG+P N K+ K + ++ ++AN SSP KEQ+D + KLL+SN + N P ++AQ
Subjt: LVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYTGN-PSVSLAQ
Query: TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
TG ALS N S+PWIIDSGA++HMT+ S LF SY EK
Subjt: TGNYPQALSCLN-SSPWIIDSGATDHMTSFSCLFDSYSPVYSKEK-------------------------------------------------------
Query: --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
S + DQ+SG+ IG AR I+GLYY DE +KK L S S L V + +M WHRRLGHP+F YLK+LFP K I+ S CE C K HR +F
Subjt: --SVLPMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVS-SLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFL
Query: PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
K Y S PFYL H+DVWGPSK+ T +GK+W IETQFQTKI IL SDNGTE+FN+ TFL KGIIH
Subjt: PKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIH
Query: QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
Q+TCRDTPQQNG+AERKNRHLLE+ RA+M SM+VPKYL G+A+LTA YLINRMPT+ SC SS+
Subjt: QATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTK------------------------------------SCRSSS--
Query: ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
P+ ME S TGGET LTG R+ ELK Y R+ + + QSD+P GP NS + SP
Subjt: ------------------PSISSME------------NSSTGGETLQTDLTG-RDPELKFYTRRNRTQRGRNQTVELTQDQSDTPVNGP-KNSGISLSPS
Query: ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
S N LP N+ DLD+PIA RK CTK+ I+NYLSY +LS HKA+ S+I+NLF+PR +QEAL D NWKLAV EEM+AL K+
Subjt: ----SHNTLP---------------NVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
Query: GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
TW I D LP+ KKAVGCKWVFT+KC ADGS+ERYKARLVAKGFT
Subjt: GTWDIVD-LPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| A5B9Y8 Integrase catalytic domain-containing protein | 3.3e-157 | 40.04 | Show/hide |
Query: FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
+ ++ +++ D +H++KT++D RI+KFLVGLNVEFDEVR RI+ + LP++ + FS+VRREES+RNVM+GKK +++ S LVT K + K+
Subjt: FETYEWKSTNDQKHYRKTVDDGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVD-SVDSSALVTESTAM-KASDQSNKT
Query: HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
++P VWCD CNKP HTRE CWK+HGK NWK K ++ AN ++S EQ++ +L LLKSN T G SVSLA TGN ALSC S+PW
Subjt: HDKPHVWCDHCNKPCHTRETCWKLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPW
Query: IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
IIDSGA+DHMT+ S +F+SYSP V E + D+ S +TIG
Subjt: IIDSGATDHMTSFSCLFDSYSP---------------------------------------------------------VYSKEKSVLPMDQDSGETIGR
Query: ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
ARMI+GLYYF++ S+K QGLSS+SSL V++ IM WH +LG P+F YLKHLFP LF+ +D FQCE C K R T++ K Y S PFYL H+DVW
Subjt: ARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVW
Query: GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
GPSKV T +GK+W IE QFQTKI IL SDNGT++FN+ TF + KGI+HQ++C DTPQQNG+A+RKN
Subjt: GPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKN
Query: RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
+HLLE+ARA+MF M++PKYL GDA+LTA+YLINRMPTK + ++P
Subjt: RHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-------------------------------------------------------
Query: ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
+S MEN S G E L+ + E Y+R+
Subjt: ----------------SISSMEN------------------------------------------------SSTG---GETLQTDLTGRDPELKFYTRRN
Query: RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
+ R ++Q + Q TP++ +S LS PS P +S DLD+PIA RKG+ CTK+LIA
Subjt: RTQRGRNQTVELTQDQS-----------------DTPVNGPKNSGISLS-PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIAN
Query: YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
Y+SY LSDNH+AFT+ I+ L +PRNIQEAL++ +WKLAV +EMNAL K+GTW+ VDLP +KK VGCKWVFTIK ADGS+ERYKARLVAKGFT
Subjt: YLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| A5BNN1 Integrase catalytic domain-containing protein | 2.8e-180 | 41.76 | Show/hide |
Query: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
ENSMVMTWL+NSM EDIN NYMCY T +ELW++V QMY DLGNQSQ+FEL LKLG++RQG ++VT+YF+SLK+IWQ+LD F TYEWKS D H++KT++
Subjt: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
Query: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
D RI+KFL GLNVEFDE K+ ++P WCD CNKP HTRE CW
Subjt: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKKAVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRETCW
Query: KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
K+HGKP NWK K ++ N ++SP EQ++ L LLKSN T G SVSLA TGN ALSC S+PWI+D GA+DHMT+ S +F+SYSP
Subjt: KLHGKPPNWKSSKQYERYSHQHASNANVVDSSPL-KEQIDQILKLLKSNYT-GNPSVSLAQTGNYPQALSC-LNSSPWIIDSGATDHMTSFSCLFDSYSP
Query: VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
+ KSVL +DQ SG+TIG ARMIDGLYYF++ S+K QGLSS+SSL V++ IM WH RLGHP+F YL
Subjt: VYSKE--------------------------KSVL---PMDQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYL
Query: KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
KHLFP LF+ +D FQCE C K R T++PK Y S PFYL H+DVWGPSKV T +GK+W IE Q
Subjt: KHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQ
Query: FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
FQTKI IL SDNG E+FN+ TF ++KGI+HQ++C DT +QNG+AE KN+HLLE+ARA+MF M++PKYL DA+LTA+YLINRMPTK + ++P
Subjt: FQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP-----
Query: ------------------------------------------------------------------SISSMEN---------------------------
+S MEN
Subjt: ------------------------------------------------------------------SISSMEN---------------------------
Query: -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
S E +T T + E L+ RN + R + ++DQ P +G PK N +++S
Subjt: -----SSTGGETLQTDLTGRDPE--------LKFYTRRNRTQ---RGRNQTVELTQDQSDTPVNG-PK---NSGISLS----------------------
Query: PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
PS P +S DLD+PIA RKG+ CTK+ I+ Y+SY LSDN++AFT+ I+ L +PRNIQE L++ +WKLAV EEMNAL K+
Subjt: PSSHNTLPNVS-----------------DLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNAL-KH
Query: GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
GTW+++DLP +KK VGCKWVFTIK DGS+ERYKARLVAK
Subjt: GTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAK
|
|
| A5BR93 Integrase catalytic domain-containing protein | 1.8e-150 | 38.62 | Show/hide |
Query: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
ENSM+M+WLINSM DI N++ + TAK++WD+ + YS N S++F++ L D RQG SVTQY+++L R WQ+LDLFET+ WK ++D YR+ V+
Subjt: ENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETYEWKSTNDQKHYRKTVD
Query: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKK--AVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRET
R++KF +GLN E D+VRGRI+G LP+L + FS+VRREESR+ VM+G K ++D+SAL S D+ + D+P WCD+C KP H +ET
Subjt: DGRIYKFLVGLNVEFDEVRGRILGKSILPNLNDVFSKVRREESRRNVMIGKK--AVDSVDSSALVTESTAMKASDQSNKTHDKPHVWCDHCNKPCHTRET
Query: CWKLHGKPPNWKSSKQYERYSHQH----ASNANVVDSSPL-KEQIDQILKLLKSNYTGNPSVSLAQTGNYPQALSCLNSSPWIIDSGATDHMTSFSCLFD
CWKLHGKP +WK +++R H + + +V + SP KEQ++ + KLL +G+ + +A T N PWI+D+GA+DHMT + +
Subjt: CWKLHGKPPNWKSSKQYERYSHQH----ASNANVVDSSPL-KEQIDQILKLLKSNYTGNPSVSLAQTGNYPQALSCLNSSPWIIDSGATDHMTSFSCLFD
Query: SYSPVY----------SKEK----------------SVLPM-------------------------------DQDSGETIGRARMIDGLY------YFDE
+Y P SK K SVL + D SG+ IG A + GLY + ++
Subjt: SYSPVY----------SKEK----------------SVLPM-------------------------------DQDSGETIGRARMIDGLY------YFDE
Query: VS-----TSHKKIQGLSSVSSLPVQE--TIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKV
VS S + +SVS+ V + I+ H RLGHP+FVYL LFP LF + + + CE C KH R+ + YKPS+ F L+H+DVWGPS++
Subjt: VS-----TSHKKIQGLSSVSSLPVQE--TIMFWHRRLGHPNFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKV
Query: LTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLE
+G RW ++ QF +KI++L SDN E+F +T+L + GIIH ++C DTPQQNGVAERKNRHLLE
Subjt: LTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLE
Query: IARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSPS---ISSMENSSTGGETLQTDLTG--------RDPELKFYTRRNR--------TQRGRNQ
+AR LMFS +VP Y G+A+LTA YLINRMP++ SP + ++ L + G KF R N+ TQ+GR +
Subjt: IARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSPS---ISSMENSSTGGETLQTDLTG--------RDPELKFYTRRNR--------TQRGRNQ
Query: TVELTQDQSDT------PVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWK
EL T + I + +P++ D +PIA RKG +CT + I NY++Y LS +++AF + + + +P IQEAL S WK
Subjt: TVELTQDQSDT------PVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWK
Query: LAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
AV +E++AL K+GTW I DLP K+ VGCKW+FTIK ADGS+ER+KARLVA+GFT
Subjt: LAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| SwissProt top hits | e value | %identity | Alignment |
|---|
| P04146 Copia protein | 1.1e-16 | 29.63 | Show/hide |
Query: WHRRLGHPN-----FVYLKHLF--PGLFKGIDCSVFQCEDCKHHRSTFLP-KSYKPSS----PFYLIHTDVWGPSKVLTKNGKRWI--------------
WH R GH + + K++F L ++ S CE C + + LP K K + P +++H+DV GP +T + K +
Subjt: WHRRLGHPN-----FVYLKHLF--PGLFKGIDCSVFQCEDCKHHRSTFLP-KSYKPSS----PFYLIHTDVWGPSKVLTKNGKRWI--------------
Query: -------------------ETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTA
E F K+ L+ DNG E+ + F KGI + T TPQ NGV+ER R + E AR ++ + K G+AVLTA
Subjt: -------------------ETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTA
Query: AYLINRMPTKSCRSSS
YLINR+P+++ SS
Subjt: AYLINRMPTKSCRSSS
|
|
| P04146 Copia protein | 2.9e-09 | 48.53 | Show/hide |
Query: IQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
IQ + S+W+ A+ E+NA K + TW I PE+K V +WVF++K N G+ RYKARLVA+GFT
Subjt: IQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.1e-23 | 24.68 | Show/hide |
Query: TIMFWHRRLGHPN----FVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW----------------
++ WH+R+GH + + K KG +V C+ C K HR +F S + + L+++DV GP ++ + G ++
Subjt: TIMFWHRRLGHPN----FVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKSYKPSSPFYLIHTDVWGPSKVLTKNGKRW----------------
Query: -----------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAA
+E + K++ L SDNG E+ + + GI H+ T TPQ NGVAER NR ++E R+++ +PK G+AV TA
Subjt: -----------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAA
Query: YLINRMPTK------------------------SCRSSSPSISSMENSSTGGETLQTDLTGRDPE---------LKFYTRRNRTQRGRNQTVELTQDQSD
YLINR P+ CR+ + + + + +++ G E +K R+R R V D S+
Subjt: YLINRMPTK------------------------SCRSSSPSISSMENSSTGGETLQTDLTGRDPE---------LKFYTRRNRTQRGRNQTVELTQDQSD
Query: TPVNGPKNSGISLSPSSHN------TLPNVSD--------------LDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSK---ITNLFLPRNIQEA
NG + +++ +S+N T VS+ LD + + + Q + S ++ + +++ I++ P +++E
Subjt: TPVNGPKNSGISLSPSSHN------TLPNVSD--------------LDIPIAQRKGSCQCTKYLIANYLSYHRLSDNHKAFTSK---ITNLFLPRNIQEA
Query: LN---DSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
L+ + A+ EEM +L K+GT+ +V+LP+ K+ + CKWVF +K + D + RYKARLV KGF
Subjt: LN---DSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
|
|
| P92520 Uncharacterized mitochondrial protein AtMg00820 | 2.4e-11 | 47.14 | Show/hide |
Query: PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
P+++ AL D W A+ EE++AL ++ TW +V P ++ +GCKWVF K ++DG+++R KARLVAKGF
Subjt: PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
|
|
| Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.7e-17 | 22.71 | Show/hide |
Query: DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGI---DCSVFQCEDC---KHHRSTFLPKSYK
D ++G + + + D LY + S+ Q +S +S + T WH RLGHP L + + C DC K ++ F +
Subjt: DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHPNFVYLKHLFPGLFKGI---DCSVFQCEDC---KHHRSTFLPKSYK
Query: PSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCR
+ P I++DVW S +L+ + R+ +E +FQT+I +SDNG EF + GI H +
Subjt: PSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQATCR
Query: DTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPT--------------------------------------------------
TP+ NG++ERK+RH++E L+ +PK A A YLINR+PT
Subjt: DTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPT--------------------------------------------------
Query: -----------------------------------------------------------------------KSCRS--------SSPSISSMENSSTGGE
SC SSPS + NS
Subjt: -----------------------------------------------------------------------KSCRS--------SSPSISSMENSSTGGE
Query: TLQTDLTGRDPELKFYT--RRNRTQRGRNQTVELTQDQS--DTPVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTK---------------Y
L + + P T R+N Q T TQ S +T N P N S S +T P S P S T
Subjt: TLQTDLTGRDPELKFYT--RRNRTQRGRNQTVELTQDQS--DTPVNGPKNSGISLSPSSHNTLPNVSDLDIPIAQRKGSCQCTK---------------Y
Query: LIANY----LSYHRLSDNHKAFTSKITNLFL----------PRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNADGSI
++ N L+ H + KA K + PR +AL D W+ A+ E+NA + + TWD+V P VGC+W+FT K N+DGS+
Subjt: LIANY----LSYHRLSDNHKAFTSKITNLFL----------PRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNADGSI
Query: ERYKARLVAKGF
RYKARLVAKG+
Subjt: ERYKARLVAKGF
|
|
| Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.7e-20 | 23.54 | Show/hide |
Query: DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHP-----NFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKS
D ++G + + + D LY + S+ Q +S +S + T WH RLGHP N V H P L + C DC K H+ F +
Subjt: DQDSGETIGRARMIDGLYYFDEVSTSHKKIQGLSSVSSLPVQETIMFWHRRLGHP-----NFVYLKHLFPGLFKGIDCSVFQCEDC---KHHRSTFLPKS
Query: YKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQAT
S P I++DVW S +L+ + R+ +E +FQT+I L+SDNG EF +L GI H +
Subjt: YKPSSPFYLIHTDVWGPSKVLTKNGKRW---------------------------------IETQFQTKIRILHSDNGTEFFNEPQTTFLHDKGIIHQAT
Query: CRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGG
TP+ NG++ERK+RH++E+ L+ VPK A A YLINR+PT + SP + +E+ S
Subjt: CRDTPQQNGVAERKNRHLLEIARALMFSMHVPKYLLGDAVLTAAYLINRMPTKSCRSSSP----------------------------SISSMENSSTGG
Query: ETLQTDLTG------RDPELKFYTRR-----------NRTQRGRNQTVELTQDQS----------DTPV-------------------------------
+ LT P + YT R + T G + + E D + TP+
Subjt: ETLQTDLTG------RDPELKFYTRR-----------NRTQRGRNQTVELTQDQS----------DTPV-------------------------------
Query: ----------------------NGPK-------------NSGISLSPSSHNTLPNVSDLDIPIAQR--------------------KGSCQCTKYL----
NGP+ NS I +P+ ++ PN + + P+ Q S T L
Subjt: ----------------------NGPK-------------NSGISLSPSSHNTLPNVSDLDIPIAQR--------------------KGSCQCTKYL----
Query: -------------IANYLSYHRLSD-----NHK-AFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNA
+ + R D N K ++ + + PR +A+ D W+ A+ E+NA + + TWD+V P VGC+W+FT K N+
Subjt: -------------IANYLSYHRLSD-----NHK-AFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNA-LKHGTWDIVDLPEDK-KAVGCKWVFTIKCNA
Query: DGSIERYKARLVAKGF
DGS+ RYKARLVAKG+
Subjt: DGSIERYKARLVAKGF
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|
| AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 4.5e-13 | 31.13 | Show/hide |
Query: QENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETY-EWKSTNDQKHYRKT
Q N+MVM WL+NSM + + + M TA ++W+ + +++ + ++++L +L +RQGG+SV +YF L ++W EL + E K K
Subjt: QENSMVMTWLINSMVEDINSNYMCYTTAKELWDSVTQMYSDLGNQSQVFELNLKLGDMRQGGNSVTQYFHSLKRIWQELDLFETY-EWKSTNDQKHYRKT
Query: VDDGR----IYKFLVG--LNVEFDEVRGRILGKSILPNLNDVFSKVRREES
++ R Y+FL+G LN F+ V +I+ + P+L++ F+ V+ ES
Subjt: VDDGR----IYKFLVG--LNVEFDEVRGRILGKSILPNLNDVFSKVRREES
|
|
| AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.7e-17 | 45.36 | Show/hide |
Query: IANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
I+ +LSY ++S + +F I P EA W A+ +E+ A++ TW+I LP +KK +GCKWV+ IK N+DG+IERYKARLVAKG+T
Subjt: IANYLSYHRLSDNHKAFTSKITNLFLPRNIQEALNDSNWKLAVIEEMNALK-HGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGFT
|
|
| ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 1.7e-12 | 47.14 | Show/hide |
Query: PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
P+++ AL D W A+ EE++AL ++ TW +V P ++ +GCKWVF K ++DG+++R KARLVAKGF
Subjt: PRNIQEALNDSNWKLAVIEEMNAL-KHGTWDIVDLPEDKKAVGCKWVFTIKCNADGSIERYKARLVAKGF
|
|