; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0096681 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0096681
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:11192118..11194493
RNA-Seq ExpressionCmc04g0096681
SyntenyCmc04g0096681
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0099.62Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0094.52Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQ+GV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNV SKSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        +EA D+STRVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN 
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0098.1Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0082.58Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        M+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        +ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  S----EATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY
        +    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMY
Subjt:  S----EATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY

Query:  FNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFM
        FNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M
Subjt:  FNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFM

Query:  SQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG
         QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLG
Subjt:  SQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG

Query:  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQ+VLGIQI RDRKNK LALSQA+YIDK++V+Y MQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK

Query:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0099.62Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0082.58Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        M+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        +ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  S----EATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY
        +    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W+KAM+LE+ESMY
Subjt:  S----EATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY

Query:  FNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFM
        FNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+I+M
Subjt:  FNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFM

Query:  SQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG
         QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLG
Subjt:  SQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG

Query:  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQ+VLGIQI RDRKNK LALSQA+YIDK++V+Y MQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK

Query:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Subjt:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

A0A5A7T2V9 Gag/pol protein0.0e+0098.1Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5A7TZD0 Gag/pol protein0.0e+0099.62Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5A7UYE8 Gag/pol protein0.0e+0099.62Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQ+GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        SEATD+STRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKLLVRY MQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5D3CZY3 Gag/pol protein0.0e+0094.52Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        MEHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQ+GV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL
        HILNNV SKSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL
Subjt:  HILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVL

Query:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV
        +EA D+STRVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +
Subjt:  SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSV

Query:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE
        WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPE
Subjt:  WELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPE

Query:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
        GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
Subjt:  GFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN 
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
        GLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-11830.77Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        +++KS+    F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ +GVSER  RT+ +  R+M+S A+L  SFWG AV TA 
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS
        +++N +PS+++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D    +  V+ +    E + + +   + 
Subjt:  HILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRS

Query:  KLVL----SEATDK-----STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL-------------
        + V      E+ +K     S +++    P  S   D       S  S++   P  S +++                     + N+Y              
Subjt:  KLVL----SEATDK-----STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL-------------

Query:  ------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMES
                      +ET   + + G+++P                     +SY +                NDV           DK  W +A++ E+ +
Subjt:  ------------GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMES

Query:  MYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI
           N+ W +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E I
Subjt:  MYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI

Query:  FMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQM
        +M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   D+  + + K +L  +F+M
Subjt:  FMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQM

Query:  KDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGI
         DL E ++ +GI+I  + +   + LSQ+ Y+ K+L ++ M+N      P    ++     S       ++    P  S +G LMY MLCTRPD+  AV I
Subjt:  KDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGI

Query:  VSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEAA
        +SRY S    + W  +K VL+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  EA 
Subjt:  VSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEAA

Query:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
        +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L A  F    + LGL
Subjt:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-17941.72Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        ++ K +  + F+++   VE    +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ +GV+ER NRT+++ VRSM+  A+LP SFWG AV+TA 
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHM----RNHK
        +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +V  S +  F E +       + K
Subjt:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHM----RNHK

Query:  PRSKLVLSEATDKSTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLS
         ++ ++ +  T  ST   +     S  DE +  G+                    HP+Q        RRS R   +  RY   +   V+I DD   +P S
Subjt:  PRSKLVLSEATDKSTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLS

Query:  YKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF
         K+ ++  +K+Q +KAM  EMES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A  
Subjt:  YKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATF

Query:  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDIL
         D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L
Subjt:  YDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDIL

Query:  LIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        ++G D G +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R+ M+N+K    P    + LSK+  P T +E  +M ++PY+S
Subjt:  LIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

Query:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQG
        AVGSLMYAM+CTRPDI +AVG+VSR+  NPG +HW AVK +L+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q 
Subjt:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKT
        C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK 
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKT

Query:  LTAKVFEGHLESLGL
        +    FE   E +G+
Subjt:  LTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W4.0e-3633.76Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST
            RPDI Y V ++SR+   P   H  + + VL+YL  TR   L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   +
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-10529.57Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI    S P TP+ +G+SER++R +++   +++S+A +P ++W YA   AV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------
        +++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ +SR C F+GY       L    Q +R+++S +  F E            
Subjt:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------

Query:  DHMRNHKPRSKLVLSEATDKSTR---------------------------------------------------VVDEVGPSSRVDETTTSGQSHPS---
          ++  +  S  V S  T   TR                                                      + GP      T T  Q+H S   
Subjt:  DHMRNHKPRSKLVLSEATDKSTR---------------------------------------------------VVDEVGPSSRVDETTTSGQSHPS---

Query:  --------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND----------------------------------------
                      QSL  P +S      P      + T    P   +  P    Q +N+                                        
Subjt:  --------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND----------------------------------------

Query:  ----VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYD
            +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    
Subjt:  ----VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYD

Query:  YEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIG
        + I Q+DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+ G
Subjt:  YEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIG

Query:  NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVG
        ND   L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  LL R  M  +K    P      LS     K     E      Y   VG
Subjt:  NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVG

Query:  SLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCI
        SL Y +  TRPDI YAV  +S++   P  +H  A+K +L+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +
Subjt:  SLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCI

Query:  ADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT
          S+ EAEY +    + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+
Subjt:  ADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT

Query:  AKVFEGHLESLGL
           F+     +G+
Subjt:  AKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-10729.77Show/hide
Query:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV
        ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ +G+SER++R +++M  +++S+A +P ++W YA   AV
Subjt:  MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV

Query:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------
        +++N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  KLE +S+ C F+GY       L       R++ S +  F E            
Subjt:  HILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------

Query:  -----------DHMRNHK--PRSKLVL------------------SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------L
                    +  +H   P + LVL                  S +   +T+V     PSS +   ++S  + PS +                    L
Subjt:  -----------DHMRNHK--PRSKLVL------------------SEATDKSTRVVDEVGPSSRVDETTTSGQSHPSQS--------------------L

Query:  RMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY--------
          P  +    + PN+   L ++ +  P                                                     DG+  P    SY        
Subjt:  RMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---LSY--------

Query:  --KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
          + A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A
Subjt:  --KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA

Query:  TFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDI
            + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDI
Subjt:  TFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDI

Query:  LLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA
        L+ GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LL R  M  +K    P      L+     K P   E      Y 
Subjt:  LLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA

Query:  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK
          VGSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K VL+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S K
Subjt:  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK

Query:  QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT
        Q  +  S+ EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  T
Subjt:  QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT

Query:  KTLTAKVFEGHLESLGL
        K L+   F+     +G+
Subjt:  KTLTAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.9e-8336.16Show/hide
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI +AV  +S++   P L H  AV  +L Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    
Subjt:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.3e-0437.5Show/hide
Query:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG
        TRPD+ +AV  +S++ S        AV  VL Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.5e-1039.02Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein2.8e-2133.76Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG
         D R     S VG+L Y  L TRPDI YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG

Query:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW
          + W + +Q  ++ S+ E EY A    A E  W
Subjt:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.8e-1536.09Show/hide
Query:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        VAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTA
CATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAGTGGTGTATCAGAGAGGAGAAATAGAACCT
TGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGT
GTTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTAGA
ACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCT
TGGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATAAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTT
GATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAAC
TCAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGT
CTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAG
ACCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTC
CATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAA
CCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCAATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGATTTGATACTGCAATCAAATCCTATGGT
TTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGG
TTACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAA
CGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTTGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAG
GAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTG
TTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTG
TGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGA
GCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATCGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTT
CCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGA
AACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACT
CTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTA
CATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAGTGGTGTATCAGAGAGGAGAAATAGAACCT
TGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGT
GTTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTAGA
ACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCT
TGGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATAAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTT
GATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAAC
TCAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGT
CTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAG
ACCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTC
CATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAA
CCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCAATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGATTTGATACTGCAATCAAATCCTATGGT
TTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGG
TTACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAA
CGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTTGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAG
GAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTG
TTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTG
TGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGA
GCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATCGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTT
CCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGA
AACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACT
CTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
Protein sequenceShow/hide protein sequence
MEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQSGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKS
VSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDKSTRVVDEVGPSSRV
DETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQ
TFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYG
FDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYLMQNSKKGLLPFRHGVHLSK
EQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGG
AVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKT
LTAKVFEGHLESLGLRDMYIR