; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0004028 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0004028
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag/pol protein
Genome locationchr10:21322661..21327993
RNA-Seq ExpressionIVF0004028
SyntenyIVF0004028
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.069.5Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M++SI+ LL  ++L G+NY+ WKS LN ILV+ DL FVL EECP  P  +A+++VR+AYDRW KANDKAR++ILASM+D+L+KKH+ + TA+ IMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEA---------------------------------------------------------------------------------------
        MFGQPS  ++ EA                                                                                       
Subjt:  MFGQPSIQIKQEA---------------------------------------------------------------------------------------

Query:  -----NVAHSKRRFVPSPS-----GSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEK---EGKYDLLVLETCLVEN
             NVA +KR+F+   S     G  K Q +K+GKGK P  +   K     A K KCFHCN D HWK NCPKYL +KK +   +GKYDLLV+ETCLVE 
Subjt:  -----NVAHSKRRFVPSPS-----GSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEK---EGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        D + WILDSGATNH+C S QETSS+K+L++ E+TLKVGTG+V+SA AVGD  LFF+++++ L+++  VP +KRNL+S++C++EH+Y+I+F +NE FI   
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        G+ ICSA  ENNLY LRP  A  VLN EMFRT  TQNK+Q++S N   YLWHLRLGHINL+RI RLVK+G+LN+L+D SLPPCESCLEGKMTKR FTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
         RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDD+SRYG++YL+ HKSE+ EKFKEYK EVEN + K IK LRSDRGGEYMD +FQDY+IE GIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        AP TPQQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ETA+HILNNVPSKSV ETP+ELW+GRK SL +FRIWGCPAHVLV NPKKLEPRS+LC FV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDE---STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
        GYPKE+RGGLF+ PQEN+VFVSTNATFLEEDH RNH+PRSK+VL E    ATD+   ST+VVD+            S QSH SQ LR+PRRSGRVV QPN
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDE---STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RYLGL ETQ++IPDDGVEDPL+YKQAMNDVD+DQW+KAM+LEMESMYFNSVW LVDLP  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESI+M QPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYKKI    VAFL+LYVDDILLIGNDV YLTDVK WL  QFQMKDLGEAQY+LGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKG LPFRHG+HLS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQEVEDMR IPY+SAVGSLMYAMLCTRPDICY+VGIVSRYQSNPG DHWTAVK ILKYLRRTR+YMLVYGAKDLILTGYTDSDFQ+DKD+RKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNG
        TSGSVFTLNG
Subjt:  TSGSVFTLNG

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.099.73Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVI DLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.098.37Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENY TWKSKLN ILVIADLSFVLMEECP FPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKK EKE KYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS V
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.088.4Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVI DLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEG              
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
                  ATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKL                                               
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
                                                               GHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.068.76Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFT
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFT
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFT

Query:  GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
        GKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLC

Query:  QFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
         FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+E     T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  QFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSE----ATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNG
        TSGSVFTLNG
Subjt:  TSGSVFTLNG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0068.76Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M+S+ + +L  D+L G NYA+WK+ +N +L+I DL FVL+EECP  P  +A+++VR+ Y+RW KAN+KAR +ILAS+S++L+KKHE M+TAR+IMDSL+E
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQ S QIK                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL
           EANVA S R+F          +PS SG++K +K+K G+G    +A     K   A K  CFHCN + HWK NCPKYL  KKK K+GKYDLLVLETCL
Subjt:  ---EANVAHSKRRF----------VPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLV-KKKEKEGKYDLLVLETCL

Query:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI
        VEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG V+SA AVG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI
Subjt:  VENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI

Query:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFT
        YKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFT
Subjt:  YKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFT

Query:  GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS
        GKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLC

Query:  QFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN
         FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P 
Subjt:  QFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPN

Query:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY
        RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDY
Subjt:  RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC

Query:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS
        VYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LS
Subjt:  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS
        KEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKS
Subjt:  KEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKS

Query:  TSGSVFTLNG
        TSGSVFTLNG
Subjt:  TSGSVFTLNG

A0A5A7T2V9 Gag/pol protein0.0e+0098.37Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENY TWKSKLN ILVIADLSFVLMEECP FPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKK EKE KYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS V
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

A0A5A7TZD0 Gag/pol protein0.0e+0099.73Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVI DLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

A0A5A7UYE8 Gag/pol protein0.0e+0088.4Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        MSSSIIALLKKDQLTGENYATWKSKLNMILVI DLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN
        MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKE               
Subjt:  MFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
                 GATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAK                                                
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
                                                              LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
        YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
        GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTE

Query:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
        TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
Subjt:  TQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
        AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
        GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNG
        LNG
Subjt:  LNG

E2GK51 Gag/pol protein (Fragment)0.0e+0069.6Show/hide
Query:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE
        M++SI+ LL  ++L G+NY+ WKS LN ILV+ DL FVL EECP  P  +A+++VR+AYDRW KANDKAR++ILASM+D+L+KKH+ + TA+ IMDSLRE
Subjt:  MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLRE

Query:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------
        MFGQPS  ++                                                                                          
Subjt:  MFGQPSIQIKQ-----------------------------------------------------------------------------------------

Query:  ---EANVAHSKRRFVPSPS-----GSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEK---EGKYDLLVLETCLVEN
           EANVA +KR+F+   S     G  K Q +K+GKGK P  +   K     A K KCFHCN D HWK NCPKYL +KK +   +GKYDLLV+ETCLVE 
Subjt:  ---EANVAHSKRRFVPSPS-----GSEKIQKRKEGKGKGPTIAVEDKGKAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEK---EGKYDLLVLETCLVEN

Query:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN
        D + WILDSGATNH+C S QETSS+K+L++ E+TLKVGTG+V+SA AVGD  LFF+++++ L+++  VP +KRNL+S++C++EH+Y+I+F +NE FI   
Subjt:  DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKN

Query:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG
        G+ ICSA  ENNLY LRP  A  VLN EMFRT  TQNK+Q++S  +N YLWHLRLGHINL+RI RLVK+G+LN+L+D SLPPCESCLEGKMTKR FTGKG
Subjt:  GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKG

Query:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS
         RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDD+SRYG++YL+ HKSE+ EKFKEYK EVEN + K IK LRSDRGGEYMD +FQDY+IE GIQSQLS
Subjt:  YRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLS

Query:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV
        AP TPQQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ETA+HILNNVPSKSV ETP+ELW+GRK SL +FRIWGCPAHVLV NPKKLEPRS+LC FV
Subjt:  APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFV

Query:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT
        GYPKE+RGGLF+ PQEN+VFVSTNATFLEEDH RNH+PRSK+VL E    +T   D+   S++ VD+   S QSH SQ LR+PRRSGRVV QPNRYLGL 
Subjt:  GYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSR-VDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLT

Query:  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSP
        ETQ++IPDDGVEDPL+YKQAMNDVD+DQW+KAM+LEMESMYFNSVW LVDLP  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSP
Subjt:  ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSP

Query:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN
        VAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESI+M QPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKI 
Subjt:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN

Query:  KGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK
           VAFL+LYVDDILLIGNDV YLTDVK WL  QFQMKDLGEAQY+LGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKG LPFRHG+HLSKEQ PK
Subjt:  KGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPK

Query:  TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF
        TPQEVEDMR IPY+SAVGSLMYAMLCTRPDICY+VGIVSRYQSNPG DHWTAVK ILKYLRRTR+YMLVYGAKDLILTGYTDSDFQ+DKD+RKSTSGSVF
Subjt:  TPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF

Query:  TLNG
        TLNG
Subjt:  TLNG

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-12229.23Show/hide
Query:  KGKAKVAIKRKCFHCNVDEHWKTNCPKY--LVKKKEKEGKYDL-------LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLK
        KG +K   K KC HC  + H K +C  Y  ++  K KE +  +       +      V N    D   ++LDSGA++H+   + + S +    +    LK
Subjt:  KGKAKVAIKRKCFHCNVDEHWKTNCPKY--LVKKKEKEGKYDL-------LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLK

Query:  VGT---GDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRT
        +     G+ I A   G  +L  +++ + LE++    +   NL+SV  L E   SI F  +   I KNG+ +  ++ + NN+          V+N + + +
Subjt:  VGT---GDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHIN------LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARG
         N ++K       NN  LWH R GHI+      + R        LLN L ++S   CE CL GK  + PF     +   K PL ++HSD+CGP+      
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHIN------LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARG

Query:  GFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRS
           YF+ F+D ++ Y   YL+++KS+    F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+
Subjt:  GFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRS

Query:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        M+S A+L  SFWG AV TA +++N +PS+++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D    +  
Subjt:  MMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLVLSEATDE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------
        V+ +    E + + +   + + V  + + E         S +++    P  S   D       S  S++   P  S +++                    
Subjt:  VSTNATFLEEDHMRNHKPRSKLVLSEATDE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------

Query:  SQPNRYL-------------------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV------
         + N+Y                            +ET   + + G+++P                     +SY +                NDV      
Subjt:  SQPNRYL-------------------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV------

Query:  -----DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDY
             DK  W +A++ E+ +   N+ W +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ 
Subjt:  -----DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDY

Query:  EIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLI
        ++ QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++ 
Subjt:  EIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLI

Query:  GNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAV
          D+  + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S       ++    P  S +
Subjt:  GNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAV

Query:  GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
        G LMY MLCTRPD+  AV I+SRY S    + W  +K +L+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F +
Subjt:  GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-18436.11Show/hide
Query:  MFGQPSIQIKQ--EANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVED------KGKAKVAIK---RKCFHCNVDEHWKTNCPKYLVKKKEKEGK--
        + G+ +I++K    A + + K R  P   G   I    EG+G+    +  +      +GK+K   K   R C++CN   H+K +CP     K E  G+  
Subjt:  MFGQPSIQIKQ--EANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVED------KGKAKVAIK---RKCFHCNVDEHWKTNCPKYLVKKKEKEGK--

Query:  ----------YDLLVL-----ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLEN
                   D +VL     E C+ +   ++ W++D+ A++H        C  +       ++ ++  +   G GD+     VG          + L++
Subjt:  ----------YDLLVL-----ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLEN

Query:  LYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIG
        +  VP ++ NL+S   L    Y   F+  +  + K  + I        LY       +  LN            +  IS +    LWH R+GH++   + 
Subjt:  LYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIG

Query:  RLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVEN
         L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++   VE 
Subjt:  RLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVEN

Query:  LLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW
           +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA +++N  PS  ++ E P  +W
Subjt:  LLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELW

Query:  RGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHM----RNHKPRSKLVLSEATDESTRVVDE
          ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +V  S +  F E +       + K ++ ++ +  T  ST   + 
Subjt:  RGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHM----RNHKPRSKLVLSEATDESTRVVDE

Query:  VGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLE
            S  DE +  G+                    HP+Q        RRS R   +  RY   +   V+I DD   +P S K+ ++  +K+Q +KAM  E
Subjt:  VGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLE

Query:  MESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE
        MES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LE
Subjt:  MESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLE

Query:  ESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQF
        E I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F
Subjt:  ESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQF

Query:  QMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAV
         MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+  P T +E  +M ++PY+SAVGSLMYAM+CTRPDI +AV
Subjt:  QMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAV

Query:  GIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG
        G+VSR+  NPG +HW AVK IL+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +G
Subjt:  GIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG

P25600 Putative transposon Ty5-1 protein YCL074W1.3e-3032.62Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG
            RPDI Y V ++SR+   P   H  + + +L+YL  TR   L Y     L LT Y D+      D   ST G V  L G
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-10928.63Show/hide
Query:  NAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF
        N W+LDSGAT+H+ S     S  +     +  + V  G  I     G   L  +++ + L N+  VP I +NL+SV  L          +  +F + +  
Subjt:  NAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF

Query:  IYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRP
            GV +   K ++ LY     E     +  +   A+  +K    S       WH RLGH     +  ++ N  L+ L        C  CL  K  K P
Subjt:  IYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRP

Query:  FTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGI
        F+     +  PLE I+SD+     + +   + Y++ F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI
Subjt:  FTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGI

Query:  QSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEP
            S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ 
Subjt:  QSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEP

Query:  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------DHMRNHKPRSKLVLSEATDESTR----------------------------
        +SR C F+GY       L    Q +R+++S +  F E              ++  +  S  V S  T   TR                            
Subjt:  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------DHMRNHKPRSKLVLSEATDESTR----------------------------

Query:  -----------------------VVDEVGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP
                                  + GP      T T  Q+H S                 QSL  P +S      P      + T    P   +  P
Subjt:  -----------------------VVDEVGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP

Query:  LSYKQAMND--------------------------------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRD
            Q +N+                                            +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +
Subjt:  LSYKQAMND--------------------------------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRD

Query:  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQA
        S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA
Subjt:  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQA

Query:  SRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYID
         R+W +     + + GF  +V +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI 
Subjt:  SRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYID

Query:  KLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-ML
         LL R +M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K IL+YL  T ++ + 
Subjt:  KLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-ML

Query:  VYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
        +     L L  Y+D+D+  DKD   ST+G +  L
Subjt:  VYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-11228.79Show/hide
Query:  KCFHCNVDEHWKTNCPKY-----LVKKKEKEGKYDLLVLETCLVEN---DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDA
        +C  C+V  H    CP+         +++    +        L  N   + N W+LDSGAT+H+ S      SF Q       + +  G  I     G A
Subjt:  KCFHCNVDEHWKTNCPKY-----LVKKKEKEGKYDLLVLETCLVEN---DQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDA

Query:  KLFFENKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNN
         L   ++ + L  +  VP I +NL+SV  L   +  S+ F    +F  K+   GV +   K ++ LY      ++AV    MF  A+  +K    S    
Subjt:  KLFFENKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNN

Query:  TYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME
           WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ F+D ++RY +LY ++
Subjt:  TYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME

Query:  HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHI
         KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AV++
Subjt:  HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHI

Query:  LNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-------------
        +N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  KLE +S+ C F+GY       L       R++ S +  F E              
Subjt:  LNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-------------

Query:  ---------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------LRM
                  +  +H   P + LVL                  S +   +T+V     PSS +   ++S  + PS +                    L  
Subjt:  ---------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------LRM

Query:  PRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY----------
        P  +    + PN+   L ++ +  P                                                     DG+  P    SY          
Subjt:  PRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY----------

Query:  KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF
        + A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A  
Subjt:  KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF

Query:  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILL
          + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+
Subjt:  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILL

Query:  IGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASA
         GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LL R +M  +K    P      L+     K P   E      Y   
Subjt:  IGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASA

Query:  VGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
        VGSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L
Subjt:  VGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-6636.5Show/hide
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGEL
         +   Y   +G LMY  + TR DI +AV  +S++   P L H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L   L
Subjt:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGEL

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.4e-0840Show/hide
Query:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.7e-1039.02Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-1634.72Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSG
         D R     S VG+L Y  L TRPDI YAV IV +    P L  +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G
Subjt:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.6e-1536.09Show/hide
Query:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        VAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGGACGGCTCCCGCGGCTCGGCTCGGCGGCACTTGTACGGACGACTCGGTTGTACGGACAGCTCGGTTCGGCTCGGTTGTACGGACGGCTCGGCTCGGCGGTACG
GACGGCTCGATTCGCTCGGCTGTACGGACGGCTCGGTCGGTCGGCTCGGTTCGACCCGTTTTTTCATATGGTGCCGGTTCAAGTGGTTTTGGGCCGCATGTCTTCCTCAA
TTATTGCATTGCTTAAAAAAGATCAATTAACCGGTGAAAATTATGCGACGTGGAAATCGAAACTGAATATGATTCTAGTTATCGCTGATCTAAGCTTCGTCTTAATGGAG
GAATGTCCTCCTTTCCCCACTAAACATGCATCCCAAAGTGTTAGGGATGCATATGACCGTTGGACAAAGGCCAATGACAAGGCTCGCCTCCACATCTTGGCTAGTATGTC
TGACATACTAAGCAAGAAACATGAGATCATGGTCACTGCACGTCAGATCATGGACTCTCTTAGAGAGATGTTTGGACAACCGTCCATTCAGATCAAACAAGAGGCAAATG
TTGCCCATTCTAAGAGGAGGTTTGTACCTTCACCTTCTGGATCTGAGAAAATTCAAAAGAGGAAAGAAGGGAAAGGGAAAGGTCCTACTATTGCTGTTGAGGACAAAGGG
AAGGCTAAGGTAGCCATCAAGAGGAAATGTTTTCACTGCAATGTTGATGAGCATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAAAGGAAGGTAAATA
TGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGGATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAAGAAACTAGTTCCT
TCAAGCAGCTTGAGGACAGTGAGATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGAAAATAAATTCATGTTT
TTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTA
TAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATA
CTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTT
CTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGA
ACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAA
TGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTAC
ATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTT
GTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTT
GGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTG
ATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACT
CAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGA
CCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCC
ATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAAC
CCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATCCTATGGTT
TTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGT
TACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAAC
GCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGG
AACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGT
TATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGT
GTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGAGC
TGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTACGGACGGCTCCCGCGGCTCGGCTCGGCGGCACTTGTACGGACGACTCGGTTGTACGGACAGCTCGGTTCGGCTCGGTTGTACGGACGGCTCGGCTCGGCGGTACG
GACGGCTCGATTCGCTCGGCTGTACGGACGGCTCGGTCGGTCGGCTCGGTTCGACCCGTTTTTTCATATGGTGCCGGTTCAAGTGGTTTTGGGCCGCATGTCTTCCTCAA
TTATTGCATTGCTTAAAAAAGATCAATTAACCGGTGAAAATTATGCGACGTGGAAATCGAAACTGAATATGATTCTAGTTATCGCTGATCTAAGCTTCGTCTTAATGGAG
GAATGTCCTCCTTTCCCCACTAAACATGCATCCCAAAGTGTTAGGGATGCATATGACCGTTGGACAAAGGCCAATGACAAGGCTCGCCTCCACATCTTGGCTAGTATGTC
TGACATACTAAGCAAGAAACATGAGATCATGGTCACTGCACGTCAGATCATGGACTCTCTTAGAGAGATGTTTGGACAACCGTCCATTCAGATCAAACAAGAGGCAAATG
TTGCCCATTCTAAGAGGAGGTTTGTACCTTCACCTTCTGGATCTGAGAAAATTCAAAAGAGGAAAGAAGGGAAAGGGAAAGGTCCTACTATTGCTGTTGAGGACAAAGGG
AAGGCTAAGGTAGCCATCAAGAGGAAATGTTTTCACTGCAATGTTGATGAGCATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAAAGGAAGGTAAATA
TGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGGATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAAGAAACTAGTTCCT
TCAAGCAGCTTGAGGACAGTGAGATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGAAAATAAATTCATGTTT
TTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTA
TAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATA
CTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTT
CTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGA
ACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAA
TGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTAC
ATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTT
GTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTT
GGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTG
ATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACT
CAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGA
CCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCC
ATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAAC
CCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATCCTATGGTT
TTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGT
TACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAAC
GCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGG
AACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGT
TATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGT
GTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGAGC
TGTAG
Protein sequenceShow/hide protein sequence
MYGRLPRLGSAALVRTTRLYGQLGSARLYGRLGSAVRTARFARLYGRLGRSARFDPFFHMVPVQVVLGRMSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLME
ECPPFPTKHASQSVRDAYDRWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEANVAHSKRRFVPSPSGSEKIQKRKEGKGKGPTIAVEDKG
KAKVAIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFENKFMF
LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGL
LNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEY
MDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE
PRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTET
QVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLS
IATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVG
YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDIC
YAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGEL