CuGenDBv2

Cmc09g0248521 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0248521
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr09:14122027..14125038
RNA-Seq Expression: Cmc09g0248521
Synteny: Cmc09g0248521
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
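
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Locus` and `parse_locus` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Under the inclusive-coordinate assumption, both endpoints count.
        return self.end - self.start + 1


# seqid (anything up to the colon), then start..end as integers
_LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus (hypothetical helper)."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Locus(match["seqid"], start, end)


locus = parse_locus("CMiso1.1chr09:14122027..14125038")
print(locus.seqid, locus.start, locus.end, locus.length)
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth confirming against the database documentation before relying on the number.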


Homology
GenBank top hits | e value | % identity
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 100
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA

Query:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
        QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
Subjt:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY

Query:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
        AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Subjt:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA

Query:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
Subjt:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

Query:  YIR
        YIR
Subjt:  YIR

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 80.68
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW
        +R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W
Subjt:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW

Query:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA
        +KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Subjt:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA

Query:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP
        WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRP
Subjt:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
        DICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA

Query:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG
        CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLG
Subjt:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 98.6
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA

Query:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
        QFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
Subjt:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY

Query:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
        AVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Subjt:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA

Query:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        KEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
Subjt:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

Query:  YIR
        YIR
Subjt:  YIR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 80.68
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW
        +R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W
Subjt:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW

Query:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA
        +KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Subjt:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA

Query:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP
        WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRP
Subjt:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
        DICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA

Query:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG
        CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLG
Subjt:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 99.89
Query:  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE
        +LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE
Subjt:  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE

Query:  KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK
        KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK
Subjt:  KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK

Query:  SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
        SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
Subjt:  SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR

Query:  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG
        VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG
Subjt:  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG

Query:  VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ
        VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ
Subjt:  VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ

Query:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
        KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
Subjt:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD

Query:  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK
        RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK
Subjt:  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK

Query:  IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL
        IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL
Subjt:  IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL

Query:  PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
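
The % identity column can be related to the alignment blocks above. In BLAST-style protein alignments, the unlabeled middle line conventionally shows a letter at identical positions, `+` at conservative substitutions, and a space at mismatches or gaps. In this export the `Subjt:` lines appear to repeat the query, so the sketch below reads identities from the midline only. It is a rough illustration for a single block, not the database's own calculation; `midline_identity` is a hypothetical name.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, read from the BLAST midline.

    A position counts as identical when the midline shows the same letter
    as the query. '+' (conservative substitution) and ' ' (mismatch or gap)
    do not count. The denominator is the block length, gaps included.
    """
    if not query:
        raise ValueError("empty alignment block")
    # Trailing spaces in the midline are often lost in copy/paste; restore them.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identities / len(query)


# First ten columns of the KAA0035879.1 alignment above.
print(midline_identity("MTLKVGTGDV", "MT++VGTG V"))
```

The reported value for a hit is computed over the full alignment, so a single block will generally differ from the figure in the table.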

TrEMBL top hits | e value | % identity
A0A5A7SMH8 Gag/pol protein | 0.0e+00 | 80.68
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW
        +R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W
Subjt:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW

Query:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA
        +KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Subjt:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA

Query:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP
        WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRP
Subjt:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
        DICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA

Query:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG
        CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLG
Subjt:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

A0A5A7T2V9 Gag/pol protein | 0.0e+00 | 98.6
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA

Query:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
        QFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
Subjt:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY

Query:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
        AVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Subjt:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA

Query:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        KEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
Subjt:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

Query:  YIR
        YIR
Subjt:  YIR

A0A5A7TZD0 Gag/pol protein | 0.0e+00 | 100
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA

Query:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
        QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY
Subjt:  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICY

Query:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
        AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Subjt:  AVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA

Query:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
        KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM
Subjt:  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM

Query:  YIR
        YIR
Subjt:  YIR

A0A5A7UYE8 Gag/pol protein | 0.0e+00 | 99.89
Query:  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE
        +LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE
Subjt:  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE

Query:  KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK
        KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK
Subjt:  KFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSK

Query:  SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
        SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
Subjt:  SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR

Query:  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG
        VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG
Subjt:  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG

Query:  VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ
        VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ
Subjt:  VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQ

Query:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
        KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
Subjt:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD

Query:  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK
        RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK
Subjt:  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK

Query:  IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL
        IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL
Subjt:  IVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNL

Query:  PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5D3CPJ6 Gag/pol protein | 0.0e+00 | 80.68
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW
        +R HKPRSK+VL+    E T+ STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D  +EDPL++K+AM DVDKD+W
Subjt:  MRNHKPRSKLVLS----EATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQW

Query:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA
        +KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Subjt:  VKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKA

Query:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP
        WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMYAMLCTRP
Subjt:  WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
        DICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LVYG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA

Query:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG
        CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLG
Subjt:  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | e value: 6.8e-142 | % identity: 30.85
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-
        LE++    +   NL+SV  L E   SI F  +   I KNG+ +  ++ + NN+          V+N + + + N ++K       NN  LWH R GHI+ 
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-

Query:  -----LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
             + R        LLN L ++S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+  
Subjt:  -----LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPS
          F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA +++N +PS
Subjt:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPS

Query:  KSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEAT
        +++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D    +  V+ +    E + + +   + + V  + +
Subjt:  KSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEAT

Query:  DE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL---------------------
         E         S +++    P  S   D       S  S++   P  S +++                     + N+Y                      
Subjt:  DE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL---------------------

Query:  ----GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMESMYFNSVWE
              +ET   + + G+++P                     +SY +                NDV           DK  W +A++ E+ +   N+ W 
Subjt:  ----GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMESMYFNSVWE

Query:  LVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGF
        +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E I+M  P+G 
Subjt:  LVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGF

Query:  ITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY
                VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   D+  + + K +L  +F+M DL E ++
Subjt:  ITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP
         +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S       ++    P  S +G LMY MLCTRPD+  AV I+SRY S  
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP

Query:  GLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK
          + W  +K VL+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  EA +EA+WL+ 
Subjt:  GLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRK

Query:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
         L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L A  F    + LGL
Subjt:  FLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.7e-206 | % identity: 38.95
Query:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        T+K+G         +GD  +       + L+++  VP ++ NL+S   L    Y   F+  +  + K  + I        LY       +  LN      
Subjt:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
              +  IS +    LWH R+GH++   +  L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
        IDD SR  ++Y+++ K +  + F+++   VE    +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE
         SFWG AV+TA +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +V  S +  F E
Subjt:  SSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE

Query:  EDHM----RNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQV
         +       + K ++ ++ +  T  ST   +     S  DE +  G+                    HP+Q        RRS R   +  RY   +   V
Subjt:  EDHM----RNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQV

Query:  VIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAML
        +I DD   +P S K+ ++  +K+Q +KAM  EMES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  +
Subjt:  VIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAML

Query:  KSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGK
         SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++  
Subjt:  KSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGK

Query:  VAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ
           L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+  P T +
Subjt:  VAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ

Query:  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN
        E  +M ++PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG +HW AVK +L+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +
Subjt:  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN

Query:  GGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIA
        GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI+
Subjt:  GGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIA

Query:  SEHNIADPFTKTLTAKVFEGHLESLGL
        +  N AD  TK +    FE   E +G+
Subjt:  SEHNIADPFTKTLTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W | e value: 2.3e-36 | % identity: 33.76
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST
            RPDI Y V ++SR+   P   H  + + VL+YL  TR   L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   +
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 9.9e-125 | % identity: 28.81
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMF
        V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F + +      GV +   K ++ LY     E     +  + 
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMF

Query:  RTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF
          A+  +K    S       WH RLGH     +  ++ N  L+ L        C  CL  K  K PF+     +  PLE I+SD+     + +   + Y+
Subjt:  RTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI    S P TP+ NG+SER++R +++   +++S+A
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
         +P ++W YA   AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ +SR C F+GY       L    Q +R+++S +  
Subjt:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  FLEE-----------DHMRNHKPRSKLVLSEATDESTR---------------------------------------------------VVDEVGPSSRV
        F E              ++  +  S  V S  T   TR                                                      + GP    
Subjt:  FLEE-----------DHMRNHKPRSKLVLSEATDESTR---------------------------------------------------VVDEVGPSSRV

Query:  DETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND-------------------------
          T T  Q+H S                 QSL  P +S      P      + T    P   +  P    Q +N+                         
Subjt:  DETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND-------------------------

Query:  -------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM
                           +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV  
Subjt:  -------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGK
          SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++       
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGK

Query:  VAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ
        + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  LL R +M  +K    P      LS     K   
Subjt:  VAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ

Query:  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
          E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K +L+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L
Subjt:  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

Query:  NGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKI
            + W S KQ  +  S+ EAEY +    + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +
Subjt:  NGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKI

Query:  ASEHNIADPFTKTLTAKVFEGHLESLGL
        ++   +AD  TK L+   F+     +G+
Subjt:  ASEHNIADPFTKTLTAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.6e-127 | % identity: 29.56
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F    +F  K+   GV +   K ++ LY      ++AV    MF  
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFIS
        A+  +K    S       WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ 
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFIS

Query:  FIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQL
        F+D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +
Subjt:  FIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQL

Query:  PSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL
        P ++W YA   AV+++N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  KLE +S+ C F+GY       L       R++ S +  F 
Subjt:  PSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL

Query:  EE----------------------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------
        E                        +  +H   P + LVL                  S +   +T+V     PSS +   ++S  + PS +        
Subjt:  EE----------------------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------

Query:  ------------LRMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP-
                    L  P  +    + PN+   L ++ +  P                                                     DG+  P 
Subjt:  ------------LRMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP-

Query:  --LSY----------KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV
           SY          + A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV
Subjt:  --LSY----------KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK
            SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++     
Subjt:  AMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINK

Query:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT
          + ++++YVDDIL+ GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LL R +M  +K    P      L+     K 
Subjt:  GKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT

Query:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF
        P   E      Y   VGSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K VL+YL  T D+ + +     L L  Y+D+D+  D D   ST+G + 
Subjt:  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF

Query:  TLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT
         L    + W S KQ  +  S+ EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V 
Subjt:  TLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT

Query:  KIASEHNIADPFTKTLTAKVFEGHLESLGL
         +++   +AD  TK L+   F+     +G+
Subjt:  KIASEHNIADPFTKTLTAKVFEGHLESLGL

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 4.3e-83 | % identity: 36.16
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI +AV  +S++   P L H  AV  +L Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    
Subjt:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 1.2e-08 | % identity: 40
Query:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
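
As a rough illustration of how the %identity column relates to the alignment rows, the sketch below counts exact matches on the middle line of the ATMG00300.1 alignment and divides by the alignment length. It assumes the usual BLAST pairwise convention (the middle line repeats the residue for an exact match, shows `+` for a conservative substitution, and is blank otherwise) and that identity is taken over all alignment columns. The function name `percent_identity` is hypothetical and is not part of CuGenDB or BLAST.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline marks an exact match."""
    # Trailing blanks of the midline are often stripped on export; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # An exact match is a column where the midline repeats the query residue.
    matches = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * matches / len(query)


# Query and middle lines copied from the ATMG00300.1 alignment above.
query = "NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV"
midline = "+ T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V"

identity = percent_identity(query, midline)
print(f"{identity:.2f}")  # 40.00 (30 identical columns out of 75)
```

For this hit the result agrees with the 40 reported in the table. Alignments containing gaps would count the gap columns in the denominator under the same assumption.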

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 8.3e-10 | % identity: 39.02
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 1.2e-21 | % identity: 33.76
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG
         D R     S VG+L Y  L TRPDI YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNG

Query:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW
          + W + +Q  ++ S+ E EY A    A E  W
Subjt:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 2.2e-15 | % identity: 36.09
Query:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        VAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences
CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTA
TGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATC
ATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGG
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGG
AGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCA
AAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGA
AATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTT
GCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCT
TGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTG
GCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCT
ATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAA
GAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAG
GTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCC
TTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAG
CAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCC
AAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATC
TGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGT
CATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTA
TGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATC
ATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGG
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGG
AGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCA
AAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGA
AATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTT
GCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCT
TGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTG
GCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCT
ATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAA
GAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAG
GTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCC
TTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAG
CAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCC
AAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATC
TGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGT
CATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
Protein sequence
MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
SPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFEL
WRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ
SHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVA
KGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP
CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ
EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK
QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
HLESLGLRDMYIR