; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026645 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026645
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag/pol protein
Genome locationchr03:4279628..4287054
RNA-Seq ExpressionIVF0026645
SyntenyIVF0026645
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.093.12Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.065.42Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLE         
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------

Query:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
                                            DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW G+                               
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------

Query:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
                TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+E     T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
        TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV R
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR

Query:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        GDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.065.42Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLE         
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------

Query:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
                                            DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW G+                               
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------

Query:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
                TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+E     T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
        TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV R
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR

Query:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        GDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.091.58Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENY TWKSKLN ILVI DLSFVLMEECP FPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKK EKE KYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS V
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.082.91Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEG              
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
                  ATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKL                                               
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
                                                               GHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0065.42Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLE         
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------

Query:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
                                            DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW G+                               
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------

Query:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
                TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
        TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV R
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR

Query:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        GDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

A0A5A7T2V9 Gag/pol protein0.0e+0091.58Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENY TWKSKLN ILVI DLSFVLMEECP FPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKK EKE KYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS V
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5A7TZD0 Gag/pol protein0.0e+0093.12Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5A7UYE8 Gag/pol protein0.0e+0082.91Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKE               
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
                 GATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAK                                                
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------
                                                              LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE            
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE------------

Query:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
                                         DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  ---------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG                                   
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRG-----------------------------------

Query:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
            +TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  ----QTRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
        LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK
Subjt:  LNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK

Query:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  IASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5D3CPJ6 Gag/pol protein0.0e+0065.42Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLE         
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLE---------

Query:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
                                            DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  ------------------------------------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW G+                               
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQ-------------------------------

Query:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
                TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  --------TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
        TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV R
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR

Query:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        GDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-11928.15Show/hide
Query:  KGKAKVAIKRKCFHCNVDEHWKTNCPKY--LVKKKEKEGKYDL-------LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLK
        KG +K   K KC HC  + H K +C  Y  ++  K KE +  +       +      V N    D   ++LDSGA++H+   + + S +    +    LK
Subjt:  KGKAKVAIKRKCFHCNVDEHWKTNCPKY--LVKKKEKEGKYDL-------LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLK

Query:  VGT---GDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRT
        +     G+ I A   G  +L   ++ + LE++    +   NL+SV  L E   SI F  +   I KNG+ +  ++ + NN+          V+N + + +
Subjt:  VGT---GDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHIN------LDRIGRLVKNGLLNKLKDVSLPPCESCLE--------------------------------------
         N ++K       NN  LWH R GHI+      + R        LLN L ++S   CE CL                                       
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHIN------LDRIGRLVKNGLLNKLKDVSLPPCESCLE--------------------------------------

Query:  ---------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRS
                 D ++ Y   YL+++KS+    F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+
Subjt:  ---------DDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRG-------------------QTRGGLFFDPQENRVFV-----------STNATFL--
        M+S A+L  SFWG AV TA +++N +PS+++   S+TP+E+W                     + + G F D     +FV           + N  F+  
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRG-------------------QTRGGLFFDPQENRVFV-----------STNATFL--

Query:  -----EEDHMRNH----------KPRSKLVLSEATDESTRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQ
             +E +M N           K   +       ++S +++    P  S   D       S  S++   P  S +++                     +
Subjt:  -----EEDHMRNH----------KPRSKLVLSEATDESTRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQ

Query:  PNRYL-------------------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV--------
         N+Y                            +ET   + + G+++P                     +SY +                NDV        
Subjt:  PNRYL-------------------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV--------

Query:  ---DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI
           DK  W +A++ E+ +   N+ W +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++
Subjt:  ---DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI

Query:  WQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGN
         QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   
Subjt:  WQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGN

Query:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGS
        D+  + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S       ++    P  S +G 
Subjt:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGS

Query:  LMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQG
        LMY MLCTRPD+  AV I+SRY S    + W  +K VL+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F + +   + W + +Q 
Subjt:  LMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKT
         +A S+ EAEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK 
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKT

Query:  LTAKVFEGHLESLGL
        L A  F    + LGL
Subjt:  LTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-17833.72Show/hide
Query:  MFGQPSIQIKQ--EANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVED------KGKAKVAIK---RKCFHCNVDEHWKTNCPKYLVKKKEKEGK--
        + G+ +I++K    A + + K R  P   G   I    EG+G+    +  +      +GK+K   K   R C++CN   H+K +CP     K E  G+  
Subjt:  MFGQPSIQIKQ--EANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVED------KGKAKVAIK---RKCFHCNVDEHWKTNCPKYLVKKKEKEGK--

Query:  ----------YDLLVL-----ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLEN
                   D +VL     E C+ +   ++ W++D+ A++H        C  +       ++ ++  +   G GD+     VG          + L++
Subjt:  ----------YDLLVL-----ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLEN

Query:  LYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIG
        +  VP ++ NL+S   L    Y   F+  +  + K  + I        LY       +  LN            +  IS +    LWH R+GH++   + 
Subjt:  LYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIG

Query:  RLVKNGLLNKLKDVSLPPCESCL---------------------------------------------EDDYSRYGYLYLMEHKSEALEKFKEYKTEVEN
         L K  L++  K  ++ PC+ CL                                              DD SR  ++Y+++ K +  + F+++   VE 
Subjt:  RLVKNGLLNKLKDVSLPPCESCL---------------------------------------------EDDYSRYGYLYLMEHKSEALEKFKEYKTEVEN

Query:  LLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW
           +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA +++N  PS  ++ E P  +W
Subjt:  LLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW

Query:  RG-----------------------------------------QTRGGLFFDPQENRVFVSTNATFLEEDHM----RNHKPRSKLVLSEATDESTRVVDE
                                                   +  G   +DP + +V  S +  F E +       + K ++ ++ +  T  ST   + 
Subjt:  RG-----------------------------------------QTRGGLFFDPQENRVFVSTNATFLEEDHM----RNHKPRSKLVLSEATDESTRVVDE

Query:  VGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLE
            S  DE +  G+                    HP+Q        RRS R   +  RY   +   V+I DD   +P S K+ ++  +K+Q +KAM  E
Subjt:  VGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLE

Query:  MESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE
        MES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LE
Subjt:  MESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE

Query:  ESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQF
        E I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F
Subjt:  ESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQF

Query:  QMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAV
         MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+  P T +E  +M ++PY+SAVGSLMYAM+CTRPDI +AV
Subjt:  QMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAV

Query:  GIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKE
        G+VSR+  NPG +HW AVK +L+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE
Subjt:  GIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKE

Query:  AVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
         +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE   E +G+
Subjt:  AVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W3.2e-3633.76Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST
            RPDI Y V ++SR+   P   H  + + VL+YL  TR   L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   +
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-10326.79Show/hide
Query:  NAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF
        N W+LDSGAT+H+ S     S  +     +  + V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F + +  
Subjt:  NAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF

Query:  IYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCL--------
            GV +   K ++ LY     E     +  +   A+  +K    S       WH RLGH     +  ++ N  L+ L        C  CL        
Subjt:  IYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCL--------

Query:  ------------------------------------EDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ
                                             D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI 
Subjt:  ------------------------------------EDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGQT----------------------------
           S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AV+++N +P+  +  E+PF+   G +                            
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGQT----------------------------

Query:  -------------RGGLFFDPQENRVFVSTNATFLEE-----------DHMRNHKPRSKLVLSEATDESTR-----------------------------
                        L    Q +R+++S +  F E              ++  +  S  V S  T   TR                             
Subjt:  -------------RGGLFFDPQENRVFVSTNATFLEE-----------DHMRNHKPRSKLVLSEATDESTR-----------------------------

Query:  ----------------------VVDEVGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPL
                                 + GP      T T  Q+H S                 QSL  P +S      P      + T    P   +  P 
Subjt:  ----------------------VVDEVGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPL

Query:  SYKQAMND--------------------------------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDS
           Q +N+                                            +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S
Subjt:  SYKQAMND--------------------------------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDS

Query:  AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS
         G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA 
Subjt:  AGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQAS

Query:  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK
        R+W +     + + GF  +V +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  
Subjt:  RSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK

Query:  LLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLV
        LL R +M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K +L+YL  T ++ + +
Subjt:  LLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLV

Query:  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANS
             L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   L +L +   +  P  +YCDN GA    
Subjt:  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANS

Query:  KEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
          P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F+     +G+
Subjt:  KEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-10727.35Show/hide
Query:  KCFHCNVDEHWKTNCPKY-----LVKKKEKEGKYDLLVLETCLVEN---DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDA
        +C  C+V  H    CP+         +++    +        L  N   + N W+LDSGAT+H+ S      SF Q       + +  G  I     G A
Subjt:  KCFHCNVDEHWKTNCPKY-----LVKKKEKEGKYDLLVLETCLVEN---DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDA

Query:  KLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNN
         L   ++ + L  +  VP I +NL+SV  L   +  S+ F    +F  K+   GV +   K ++ LY      ++AV    MF  A+  +K    S    
Subjt:  KLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNN

Query:  TYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESC--------------------------------------------LEDDYSRYGYLYLMEH
           WH RLGH +L  +  ++ N  L  L     L  C  C                                              D ++RY +LY ++ 
Subjt:  TYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESC--------------------------------------------LEDDYSRYGYLYLMEH

Query:  KSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHIL
        KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AV+++
Subjt:  KSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHIL

Query:  NNVPSKSVS-ETPFELWRGQ-----------------------------------------TRGGLFFDPQENRVFVSTNATFLEE--------------
        N +P+  +  ++PF+   GQ                                             L       R++ S +  F E               
Subjt:  NNVPSKSVS-ETPFELWRGQ-----------------------------------------TRGGLFFDPQENRVFVSTNATFLEE--------------

Query:  --------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------LRMP
                 +  +H   P + LVL                  S +   +T+V     PSS +   ++S  + PS +                    L  P
Subjt:  --------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------LRMP

Query:  RRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY----------K
          +    + PN+   L ++ +  P                                                     DG+  P    SY          +
Subjt:  RRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY----------K

Query:  QAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFY
         A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A   
Subjt:  QAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFY

Query:  DYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI
         + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+ 
Subjt:  DYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI

Query:  GNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAV
        GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LL R +M  +K    P      L+     K P   E      Y   V
Subjt:  GNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAV

Query:  GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC
        GSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K VL+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  
Subjt:  GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTL
        +  S+ EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTL

Query:  TAKVFEGHLESLGL
        +   F+     +G+
Subjt:  TAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.0e-8336.16Show/hide
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI +AV  +S++   P L H  AV  +L Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    
Subjt:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein9.5e-0437.5Show/hide
Query:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG
        TRPD+ +AV  +S++ S        AV  VL Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0444Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW
        NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW

ATMG00810.1 DNA/RNA polymerases superfamily protein1.7e-2133.76Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG
         D R     S VG+L Y  L TRPDI YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG

Query:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW
          + W + +Q  ++ S+ E EY A    A E  W
Subjt:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.2e-1536.09Show/hide
Query:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        VAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAATTATTGCATTGCTTAAAAAAGATCAATTAACCGGTGAAAATTATGCGACGTGGAAATCGAAACTGAATATGATTTTAGTTATCGTTGATCTAAGCTT
CGTCTTAATGGAGGAATGTCCTCCTTTCCCCACTAAACATGCATCCCAAAGTGTTAGGGATGCATATGACCGTTGGACAAAGGCCAATGACAAGGCTCGCCTCCACATCT
TGGCTAGTATGTCTGACATACTAAGCAAGAAACATGAGATCATGGTCACTGCACGTCAGATCATGGACTCTCTTAGAGAGATGTTTGGACAACCGTCCATTCAGATCAAA
CAAGAGGCAAATGTTGCCCATTCTAAGAGGAGGTTTGTACCTTCACCTTCTGGATCTGAGAAAATTCAAAAGAGGAAAGAAGGGAAAGGGAAAGGTCCTACTATTGCTGT
TGAGGACAAAGGGAAGGCTAAGGTAGCCATCAAGAGGAAATGTTTTCACTGCAATGTTGATGAGCATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAA
AGGAAGGTAAATATGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGGATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAA
GAAACTAGTTCCTTCAAGCAGCTTGAGGACAGTGAGATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAA
TAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATG
AAGCGTTCATTTATAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTT
AGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGT
AAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGT
CTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGA
TTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGT
TCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACAC
CTTTCGAGCTATGGAGAGGACAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGA
AATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGG
TCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATG
ATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTG
TGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGT
GGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATT
ATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAA
GTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGA
ACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAG
CTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCA
ACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACC
TCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCA
GTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTG
ATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCAT
CAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTG
TTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTAT
CATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGA
GGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGATTCAGAGTCGTCAACGAGCATCAACCAAATAATGTAAAACCTATTGGTTGTAAATGGATCAACAAAA
GAAAACGAGACCAAGCTGGAAAAGTACAGACTTTCAAAGCTCAACTTGTGGCAAGAGGTTATACACAGAAAGAGGGAATAGATTATGAAGAAACTTTCTCTCCTGTTGCC
ATGATAAAGTCAATTAGAATACTATTATCCATCATCATTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTTAACGGTAATCTTGAAGAGAGTAT
TCATATGGTCCACCCAGAGGGGTTTATACAAAAGGGTCAAGAAAGAAAAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTGGACACACTGACT
CCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTTATCACTTTGAACGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCTTGTATTGTTGAC
TCCACTATGAAAGTTGAATATGTAGCTGTCTGTAAAGCAGCGAAAGAAGCAGTATGGCTTAAAAATGGTGCAGTTGCAAATTCACGAGAACCTAGAAGTCATAAACGTGG
AAAGTACATTGAACGAAACTACCATCTTATCAGGGAAATCATAAATCGAAGAGACATTACAGTAACAAAAATCTCCTCCGAGCAAAACATGGCTGATCTGTTTACAAAAG
CTCTCACAGCTAAAGTGTTTGAGAGCCACCTACACGGTTTAGGTCTACGCAGTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAATTATTGCATTGCTTAAAAAAGATCAATTAACCGGTGAAAATTATGCGACGTGGAAATCGAAACTGAATATGATTTTAGTTATCGTTGATCTAAGCTT
CGTCTTAATGGAGGAATGTCCTCCTTTCCCCACTAAACATGCATCCCAAAGTGTTAGGGATGCATATGACCGTTGGACAAAGGCCAATGACAAGGCTCGCCTCCACATCT
TGGCTAGTATGTCTGACATACTAAGCAAGAAACATGAGATCATGGTCACTGCACGTCAGATCATGGACTCTCTTAGAGAGATGTTTGGACAACCGTCCATTCAGATCAAA
CAAGAGGCAAATGTTGCCCATTCTAAGAGGAGGTTTGTACCTTCACCTTCTGGATCTGAGAAAATTCAAAAGAGGAAAGAAGGGAAAGGGAAAGGTCCTACTATTGCTGT
TGAGGACAAAGGGAAGGCTAAGGTAGCCATCAAGAGGAAATGTTTTCACTGCAATGTTGATGAGCATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAA
AGGAAGGTAAATATGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGGATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAA
GAAACTAGTTCCTTCAAGCAGCTTGAGGACAGTGAGATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAA
TAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATG
AAGCGTTCATTTATAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTT
AGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGT
AAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGT
CTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGA
TTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGT
TCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACAC
CTTTCGAGCTATGGAGAGGACAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGA
AATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGG
TCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATG
ATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTG
TGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGT
GGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATT
ATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAA
GTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGA
ACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAG
CTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCA
ACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACC
TCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCA
GTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTG
ATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCAT
CAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTG
TTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTAT
CATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGA
GGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGATTCAGAGTCGTCAACGAGCATCAACCAAATAATGTAAAACCTATTGGTTGTAAATGGATCAACAAAA
GAAAACGAGACCAAGCTGGAAAAGTACAGACTTTCAAAGCTCAACTTGTGGCAAGAGGTTATACACAGAAAGAGGGAATAGATTATGAAGAAACTTTCTCTCCTGTTGCC
ATGATAAAGTCAATTAGAATACTATTATCCATCATCATTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTTAACGGTAATCTTGAAGAGAGTAT
TCATATGGTCCACCCAGAGGGGTTTATACAAAAGGGTCAAGAAAGAAAAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTGGACACACTGACT
CCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTTATCACTTTGAACGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCTTGTATTGTTGAC
TCCACTATGAAAGTTGAATATGTAGCTGTCTGTAAAGCAGCGAAAGAAGCAGTATGGCTTAAAAATGGTGCAGTTGCAAATTCACGAGAACCTAGAAGTCATAAACGTGG
AAAGTACATTGAACGAAACTACCATCTTATCAGGGAAATCATAAATCGAAGAGACATTACAGTAACAAAAATCTCCTCCGAGCAAAACATGGCTGATCTGTTTACAAAAG
CTCTCACAGCTAAAGTGTTTGAGAGCCACCTACACGGTTTAGGTCTACGCAGTTTGTAA
Protein sequenceShow/hide protein sequence
MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIK
QEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQ
ETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMF
RTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLR
FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGQTRGGLFFDPQENRVFVSTNATFLEEDHMR
NHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQK
VCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQA
TYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDL
ILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKY
HLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIRFRVVNEHQPNNVKPIGCKWINKRKRDQAGKVQTFKAQLVARGYTQKEGIDYEETFSPVA
MIKSIRILLSIIIFYDYEIWQMDVKTTFFNGNLEESIHMVHPEGFIQKGQERKTKDYMLVYGSKDLILTGHTDSDFQTDKDARKSTSGSVITLNGGAVVWRSIKQSCIVD
STMKVEYVAVCKAAKEAVWLKNGAVANSREPRSHKRGKYIERNYHLIREIINRRDITVTKISSEQNMADLFTKALTAKVFESHLHGLGLRSL