; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010709 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010709
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr1:4256038..4265309
RNA-Seq ExpressionLag0010709
SyntenyLag0010709
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0078.74Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        MSSSI++LLK ++LTGENYATWKS LN ILV+ DL FVL E+CPP P ++ASQ+V+DAYDRWTKANDK R++ILAS+S++L+K+HE MVTAR+IM+SL+E
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQPS                                                                                              
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL
        QI+ EANV HSKR+F    S         S+K QK+K  K K P+ A + KGKAK VA K KCFHCNVD HWK NCPKYL  KKKEKEGK+DLLVLETCL
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL

Query:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI
        +E+D+  WILDSGATNHVCSS QE SSF+QL D EMTL+VGTG+VISA+AVG AKLFF N+F+ LE+L IVPKIKRNL+S+SCLIE  Y+  F MNEAFI
Subjt:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI

Query:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF
        YKNGV ICSAKLENNLYVLRP E KA+LNHE+F+TA TQ+KRQ+ISP +NNT+LWHLRLGHIN+DRIGRLVKNGLL+KL+D SLPPCESCL+GKMTKRPF
Subjt:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ
        TGKGYRAKEPLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+LRSDRGGEYMD RFQDYMIEHGIQ
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV+ILNNVPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEPRS+L
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL

Query:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY
        CQF        GG F+DPQEN+V VSTNATFLEEDHMR+HKPRSK++L EAT+ES+RVVDEVGPS+RVDE T++S  S PSQ+L MPRRSGRVV QP+RY
Subjt:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY

Query:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE
        LGLTETQVVIP+DG+EDPLSY+QAMND DKD+WVKAM+LEMESMY N VWEL DLP GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEE
Subjt:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE

Query:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY
        TFSPVAMLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDE CVY
Subjt:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY

Query:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE
        KK+NK KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+L RYSMQNSKKGLLPFRHGVHLSKE
Subjt:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS
        QSPKTPQEVE+MR IPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGLDHW AVK +LKYLRRTRDYMLVYGAKDLILTGYTDSDFQ DKDSRKSTS
Subjt:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS

Query:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLRKFL DLEVVPNMN+PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
Subjt:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA
        VIVTKIASEHNIADPFTK LTAKVFEGHLE+
Subjt:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0076.61Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0076.53Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  K KA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0077.76Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        MSSSI++LLK ++LTGENY TWKS LNTILV+ DL FVL E+CP  P ++ASQ+V+DAYDRWTKANDK R++ILAS+S++L+K+HE MVTAR+IM+SL+E
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQPS                                                                                              
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL
        QI+ EANV HSKR+F    S         S+K QK+K  K K P+ A + KGKAK VA K KCFHCNVD HWK NCPKYL  KK EKE K+DLLVLETCL
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL

Query:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI
        +E+D+  WILDSGATNHVCSS QE SSF+QL D EMTL+VGTG+VISA+AVG AKLFF N+F+ LE+L IVPKIKRNL+S+SCLIE  Y+  F MNEAFI
Subjt:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI

Query:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF
        YKNGV ICSAKLENNLYVLRP E KA+LNHE+F+TA TQ+KRQ+ISP +NNT+LWHLRLGHIN+DRIGRLVK+GLL+KL+D SLPPCESCL+GKMTKRPF
Subjt:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ
        TGKGYRAKEPLELIHSDLCGPMNVKARG +EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+ RSDRGGEYMD  FQDYMIEHGIQ
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV+ILNNVPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEPRS+L
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL

Query:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY
        CQF        GG F+DP+EN+V VSTNATFLEEDHMR+HKPRSK++L EAT+ES+RVVDEVGPS+RVDE T++S  S PSQ+L MPRRSGRVV QP+RY
Subjt:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY

Query:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE
        LGLTETQVVIP+DG+EDPLSY+QAMND DKD+WVKAM+LEMESMY N VWEL DLP GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KEGVDYEE
Subjt:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE

Query:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY
        TFS VAMLKSIRILLSIA +YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDE CVY
Subjt:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY

Query:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE
        KK+NK KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+L RYSMQNSKKGLLPFRHGVHLSKE
Subjt:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS
        QSPKTPQEVE+MR IPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGLDHW AVK ILKYLRRTRDYMLVYGAKDLILTGYT+SDFQ DKDSRKSTS
Subjt:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS

Query:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
         SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWL+KFL DLEVVPNMN+PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
Subjt:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA
        VIVTKIASEHNIADPFTK LTAKVFEGHLE+
Subjt:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0076.68Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK N NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0076.61Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

A0A5A7T2V9 Gag/pol protein0.0e+0077.76Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        MSSSI++LLK ++LTGENY TWKS LNTILV+ DL FVL E+CP  P ++ASQ+V+DAYDRWTKANDK R++ILAS+S++L+K+HE MVTAR+IM+SL+E
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQPS                                                                                              
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL
        QI+ EANV HSKR+F    S         S+K QK+K  K K P+ A + KGKAK VA K KCFHCNVD HWK NCPKYL  KK EKE K+DLLVLETCL
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL

Query:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI
        +E+D+  WILDSGATNHVCSS QE SSF+QL D EMTL+VGTG+VISA+AVG AKLFF N+F+ LE+L IVPKIKRNL+S+SCLIE  Y+  F MNEAFI
Subjt:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI

Query:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF
        YKNGV ICSAKLENNLYVLRP E KA+LNHE+F+TA TQ+KRQ+ISP +NNT+LWHLRLGHIN+DRIGRLVK+GLL+KL+D SLPPCESCL+GKMTKRPF
Subjt:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ
        TGKGYRAKEPLELIHSDLCGPMNVKARG +EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+ RSDRGGEYMD  FQDYMIEHGIQ
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV+ILNNVPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEPRS+L
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL

Query:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY
        CQF        GG F+DP+EN+V VSTNATFLEEDHMR+HKPRSK++L EAT+ES+RVVDEVGPS+RVDE T++S  S PSQ+L MPRRSGRVV QP+RY
Subjt:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY

Query:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE
        LGLTETQVVIP+DG+EDPLSY+QAMND DKD+WVKAM+LEMESMY N VWEL DLP GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KEGVDYEE
Subjt:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE

Query:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY
        TFS VAMLKSIRILLSIA +YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDE CVY
Subjt:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY

Query:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE
        KK+NK KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+L RYSMQNSKKGLLPFRHGVHLSKE
Subjt:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS
        QSPKTPQEVE+MR IPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGLDHW AVK ILKYLRRTRDYMLVYGAKDLILTGYT+SDFQ DKDSRKSTS
Subjt:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS

Query:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
         SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWL+KFL DLEVVPNMN+PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
Subjt:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA
        VIVTKIASEHNIADPFTK LTAKVFEGHLE+
Subjt:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA

A0A5A7TZD0 Gag/pol protein0.0e+0078.74Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        MSSSI++LLK ++LTGENYATWKS LN ILV+ DL FVL E+CPP P ++ASQ+V+DAYDRWTKANDK R++ILAS+S++L+K+HE MVTAR+IM+SL+E
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQPS                                                                                              
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL
        QI+ EANV HSKR+F    S         S+K QK+K  K K P+ A + KGKAK VA K KCFHCNVD HWK NCPKYL  KKKEKEGK+DLLVLETCL
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCL

Query:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI
        +E+D+  WILDSGATNHVCSS QE SSF+QL D EMTL+VGTG+VISA+AVG AKLFF N+F+ LE+L IVPKIKRNL+S+SCLIE  Y+  F MNEAFI
Subjt:  IEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNEAFI

Query:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF
        YKNGV ICSAKLENNLYVLRP E KA+LNHE+F+TA TQ+KRQ+ISP +NNT+LWHLRLGHIN+DRIGRLVKNGLL+KL+D SLPPCESCL+GKMTKRPF
Subjt:  YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ
        TGKGYRAKEPLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+LRSDRGGEYMD RFQDYMIEHGIQ
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAV+ILNNVPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEPRS+L
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKL

Query:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY
        CQF        GG F+DPQEN+V VSTNATFLEEDHMR+HKPRSK++L EAT+ES+RVVDEVGPS+RVDE T++S  S PSQ+L MPRRSGRVV QP+RY
Subjt:  CQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRVVLQPDRY

Query:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE
        LGLTETQVVIP+DG+EDPLSY+QAMND DKD+WVKAM+LEMESMY N VWEL DLP GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEE
Subjt:  LGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEE

Query:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY
        TFSPVAMLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDE CVY
Subjt:  TFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY

Query:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE
        KK+NK KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+L RYSMQNSKKGLLPFRHGVHLSKE
Subjt:  KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS
        QSPKTPQEVE+MR IPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGLDHW AVK +LKYLRRTRDYMLVYGAKDLILTGYTDSDFQ DKDSRKSTS
Subjt:  QSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTS

Query:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLRKFL DLEVVPNMN+PITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
Subjt:  GSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA
        VIVTKIASEHNIADPFTK LTAKVFEGHLE+
Subjt:  VIVTKIASEHNIADPFTKALTAKVFEGHLEA

A0A5A7TZD7 Gag/pol protein0.0e+0076.68Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK N NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

A0A5D3CPJ6 Gag/pol protein0.0e+0076.53Show/hide
Query:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE
        M+S+ +++L  +KL G NYA+WK+ +NT+L++DDLRFVL E+CP  PA NA++ V++ Y+RW KAN+K R YILASLSEVLAK+HE M+TAREIM+SLQE
Subjt:  MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG
        MFGQ SYQ+ HDALKY+Y+  M EG SVREHVL+MMV FNVAE NGAVIDE SQV+FILESLP+SFLQFRSNA+MNKI + LT+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKG

Query:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE
        Q +GEANV  S RKF  GS+SGTKS+ +SS  KK +KKKG +  KA   AAK+  K KA   KG CFHCN +GHWKRNCPKYLAEKKK K+GK+DLLVLE
Subjt:  QIEGEANVVHSKRKFENGSSSGTKSVATSS--KKTQKKKGNK-RKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLE

Query:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE
        TCL+E+D+  WI+DSGATNHVCSSFQ  SS+RQL  GEMT+RVGTG V+SA AVG  +L  +  FLLLE++ +VP +KRNLIS+ CL+EQSY+ TF++N+
Subjt:  TCLIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMNE

Query:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK
         FIYKNGV ICSAKLENNLYVLR   +KALLN E+FKTA TQ+KR KISP   N HLWHLRLGHIN++RI RLVKNGLLS+LE++SLP CESCL+GKMTK
Subjt:  AFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH
        RPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKFKE+KAEVEN L K+IK  RSDRGGEYMD +FQ+Y++E 
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEH

Query:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR
        GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEPR
Subjt:  GIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPR

Query:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV
        SKLC F        GGYFYDP++NKV VSTNATFLEEDH+R+HKPRSKI+L     E TE S+RVV+E    TRV    SS+   +P Q+L  PRRSGRV
Subjt:  SKLCQF--------GGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG----EATEESSRVVDEVGPSTRVDEGTSSSDPSRPSQTLGMPRRSGRV

Query:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK
           P RY+ LTET  VI +  IEDPL++++AM D DKDEW+KAMNLE+ESMY N VW+L D P+GVKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ 
Subjt:  VLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQK

Query:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN
        EGVDYEETFSPVAMLKSIRILLSIA Y+DYEIWQMDVKTAFLNGNLEE I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQ 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQN

Query:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH
        VDE CVYK++  + VAFL+LYVDDILLIGND+G LTDIK+WLA QFQMKDLGEAQ+VLGIQI RDRKNK LALSQA+YIDK++ +YSMQNSK+GLLPFRH
Subjt:  VDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRH

Query:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK
        GV LSKEQ PKTPQ+VE MRHIPYASAVGSLMYAMLCTRPDICYAVG+VSRYQSNPGL HW AVK ILKYLRRTRDY LVYG+KDLILTGYTDSDFQ D+
Subjt:  GVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADK

Query:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE+AKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA
        EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLE+
Subjt:  EIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-15329.18Show/hide
Query:  GENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMFGQPSYQLHHDALK
        GE YA WK  +  +L   D+  V+    P            +  D W KA    +  I+  LS+         +TAR+I+ +L  ++ + S        K
Subjt:  GENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMFGQPSYQLHHDALK

Query:  YVYSCCMKEGTSVRE--HVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVHSKR
         + S  +    S+    H+ D ++   +A   GA I+E  +++ +L +LP  +     + ++  I       L+E  L  + +KN+  ++ E  +   K 
Subjt:  YVYSCCMKEGTSVRE--HVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVHSKR

Query:  KFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKY---LAEKKKEKEGKFDLL-----------VLETC
           + S     ++  ++  T K    K +        KG +K    K KC HC  +GH K++C  Y   L  K KE E +               V  T 
Subjt:  KFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKY---LAEKKKEKEGKFDLL-----------VLETC

Query:  LIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGT---GEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMN
        ++++  F  +LDSGA++H+     + S +    +    L++     GE I A   G  +L   +  + LE +L   +   NL+S+  L E   +  FD +
Subjt:  LIEHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGT---GEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYT-TFDMN

Query:  EAFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHIN------IDRIGRLVKNGLLSKLEDDSLPPCESC
           I KNG+++               +N  +LN+      +  S   K     NN  LWH R GHI+      I R        LL+ LE  S   CE C
Subjt:  EAFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTHLWHLRLGHIN------IDRIGRLVKNGLLSKLEDDSLPPCESC

Query:  LKGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQ
        L GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+ +KS+    F++F A+ E      +  L  D G EY+  
Subjt:  LKGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQ

Query:  RFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSV---SETPFELWKGRKPSLHHFRIWGCPAH
          + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+++   S+TP+E+W  +KP L H R++G   +
Subjt:  RFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSV---SETPFELWKGRKPSLHHFRIWGCPAH

Query:  VLVTNPK-KLEPRSKLCQFGGY------FYD-------------------------------------------PQENKVIVST----------NATFLE
        V + N + K + +S    F GY       +D                                           P +++ I+ T          N  FL+
Subjt:  VLVTNPK-KLEPRSKLCQFGGY------FYD-------------------------------------------PQENKVIVST----------NATFLE

Query:  EDHMRDHK----PRSKII----------------LGEATEESSRVVDEVGPSTRVD-----EGTSSSDPSRPSQT------LGMP------------RRS
        +    ++K       KII                L ++ E +   ++E     R D     +G+ + + SR S+T      +G+             RRS
Subjt:  EDHMRDHK----PRSKII----------------LGEATEESSRVVDEVGPSTRVD-----EGTSSSDPSRPSQT------LGMP------------RRS

Query:  GRVVLQPDRYLGLTE---TQVVIPNDGI--EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARL
         R+  +P       +    +VV+    I  + P S+ +    +DK  W +A+N E+ +   N  W +   P     +  +W++  K +  G    +KARL
Subjt:  GRVVLQPDRYLGLTE---TQVVIPNDGI--EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARL

Query:  VAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAI
        VA+G+TQK  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+
Subjt:  VAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAI

Query:  KSFGFDQNVDESCVY--KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQN
        K   F  +  + C+Y   K N  +  +++LYVDD+++   D+  + + K++L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+LS+++M+N
Subjt:  KSFGFDQNVDESCVY--KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQN

Query:  SKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYG---AKDLI
              P    ++     S       +   + P  S +G LMY MLCTRPD+  AV ++SRY S    + W  +K +L+YL+ T D  L++    A +  
Subjt:  SKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYG---AKDLI

Query:  LTGYTDSDFQADKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSH
        + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  E+ +EA+WL+  LT + +   +  PI +Y DN G ++ +  P  H
Subjt:  LTGYTDSDFQADKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSH

Query:  KRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVF
        KR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L A  F
Subjt:  KRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-22034.55Show/hide
Query:  KLTGEN-YATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMFGQPSYQLHH
        K  G+N ++TW+  +  +L+   L  VL  D        A        + W   +++    I   LS+ +        TAR I   L+ ++   +     
Subjt:  KLTGEN-YATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMFGQPSYQLHH

Query:  DALKYVYSCCMKEGTSVREHVLDMMVQFNVAEAN-GAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVH
           K +Y+  M EGT+   H L++        AN G  I+E  +   +L SLP S+    +  +  K T  L  + + L L + + K K + +G+A +  
Subjt:  DALKYVYSCCMKEGTSVREHVLDMMVQFNVAEAN-GAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVH

Query:  SK-RKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCP---KYLAEKKKEKEG--------KFDLLVL--
         + R ++  S++  +S                      A+ K K ++ +    C++CN  GH+KR+CP   K   E   +K            D +VL  
Subjt:  SK-RKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCP---KYLAEKKKEKEG--------KFDLLVL--

Query:  ---ETCL-IEHDEFTWILDSGATNHV-------CSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLI
           E C+ +   E  W++D+ A++H        C     +    ++ +   +   G G++     VG          L+L+ +  VP ++ NLIS   L 
Subjt:  ---ETCL-IEHDEFTWILDSGATNHV-------CSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLI

Query:  EQSYTTFDMNEAFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKT-AETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLP
           Y ++  N+ +    G L+ +               K +    L++T AE        +    +  LWH R+GH++   +  L K  L+S  +  ++ 
Subjt:  EQSYTTFDMNEAFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKT-AETQSKRQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLP

Query:  PCESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEY
        PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F++F A VE   G+ +K LRSD GGEY
Subjt:  PCESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEY

Query:  MDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCP-
          + F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N  PS  ++ E P  +W  ++ S  H +++GC  
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCP-

Query:  -AHVLVTNPKKLEPRSKLCQFGGY--------FYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG-----------------------EATEESSRVV
         AHV      KL+ +S  C F GY         +DP + KVI S +  F E +         K+  G                       E +E+  +  
Subjt:  -AHVLVTNPKKLEPRSKLCQFGGY--------FYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILG-----------------------EATEESSRVV

Query:  DEVGPSTRVDEGTSSSDPSRPSQTLGMP-RRSGRVVLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNG
        + +    ++DEG    +     +    P RRS R  ++  RY     T+ V+ +D   +P S ++ ++  +K++ +KAM  EMES+  N  ++L +LP G
Subjt:  DEVGPSTRVDEGTSSSDPSRPSQTLGMP-RRSGRVVLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNG

Query:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQ
         +P+ CKW++K K+D   K+  +KARLV KG+ QK+G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++ 
Subjt:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQ

Query:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY-KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR
         VCKLN+S+YGLKQA R W ++FD+ +KS  + +   + CVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++I+R
Subjt:  KVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVY-KKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR

Query:  DRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAV
        +R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+  P T +E  NM  +PY+SAVGSLMYAM+CTRPDI +AVGVVSR+  NPG +HW AV
Subjt:  DRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAV

Query:  KNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMN
        K IL+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E+ KE +WL++FL +L +     
Subjt:  KNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMN

Query:  VPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFE
        V   +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE
Subjt:  VPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFE

P25600 Putative transposon Ty5-1 protein YCL074W9.2e-3733.65Show/hide
Query:  MDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGY

Query:  LTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHI-PYASAVGSLMY
           +K+ L   + MKDLG+    LG+  I    N  + LS   YI K  S   +   K    P  +   L +  SP       +++ I PY S VG L++
Subjt:  LTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHI-PYASAVGSLMY

Query:  AMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADS
             RPDI Y V ++SR+   P   H  + + +L+YL  TR   L Y     L LT Y D+   A  D   ST G V  L G  V W S K +G I   
Subjt:  AMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADS

Query:  TMEAEYVAACESAKE
        + EAEY+ A E+  E
Subjt:  TMEAEYVAACESAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-14127.71Show/hide
Query:  VSLLKNEKLTGENYATWKSNLNTILVVDDLRFVL--TEDCPPAP-ARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMF
        V++    KLT  NY  W   ++ +    +L   L  +   PPA    +A+  V   Y RW + +  I   +L ++S  +        TA +I  +L++++
Subjt:  VSLLKNEKLTGENYATWKSNLNTILVVDDLRFVL--TEDCPPAP-ARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMF

Query:  GQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQS---LLKNK
          PSY  H   L+       K   ++ +++  ++ +F+     G  +D   QV  +LE+LP+ +               LT +   L  ++S    + + 
Subjt:  GQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQS---LLKNK

Query:  GQIEGEANVV-HSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPK---YLAE-KKKEKEGKFDLL
          I   AN V H      N +++G ++    ++         +++ +    +  ++K     GKC  C V GH  + C +   +L+    ++    F   
Subjt:  GQIEGEANVV-HSKRKFENGSSSGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPK---YLAE-KKKEKEGKFDLL

Query:  VLETCLI---EHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYTT
             L     +    W+LDSGAT+H+ S F  N S  Q   G   + V  G  I     G+  L  ++R L L ++L VP I +NLIS+  L   +  +
Subjt:  VLETCLI---EHDEFTWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYTT

Query:  FDMNEAFI----YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTH-LWHLRLGHINIDRIGRLVKNGLLSKLE-DDSLPP
         +   A         GV +   K ++ LY               +  A +Q      SPSS  TH  WH RLGH     +  ++ N  LS L        
Subjt:  FDMNEAFI----YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTH-LWHLRLGHINIDRIGRLVKNGLLSKLE-DDSLPP

Query:  CESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYM
        C  CL  K  K PF+     +  PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  FK  +EN     I    SD GGE++
Subjt:  CESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYM

Query:  DQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCPAH
             +Y  +HGI    S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+PF+   G  P+    R++GC  +
Subjt:  DQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCPAH

Query:  VLVT--NPKKLEPRSKLCQFGGYFYDP--------QENKVIVSTNATFLEE-----------DHMRDHKPRSKIILGEATEESSRVVDEVGP--------
          +   N  KL+ +S+ C F GY            Q +++ +S +  F E              +++ +  S  +    T   +R      P        
Subjt:  VLVT--NPKKLEPRSKLCQFGGYFYDP--------QENKVIVSTNATFLEE-----------DHMRDHKPRSKIILGEATEESSRVVDEVGP--------

Query:  -----------------STRVDEGTSSSDPSRP------------------------------------------SQTLGMPRRSGRVVLQPDRYLGLTE
                         S+ +D   SSS PS P                                          +Q+L  P +S      P      + 
Subjt:  -----------------STRVDEGTSSSDPSRP------------------------------------------SQTLGMPRRSGRVVLQPDRYLGLTE

Query:  TQVVIPNDGIEDPLSYRQAMNDED-------------------------------------------KDE-WVKAMNLEMESMYSNFVWELA-DLPNGVK
        T    P+  I  P    Q +N+ +                                           KDE W  AM  E+ +   N  W+L    P+ V 
Subjt:  TQVVIPNDGIEDPLSYRQAMNDED-------------------------------------------KDE-WVKAMNLEMESMYSNFVWELA-DLPNGVK

Query:  PIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKV
         +GC+WI+ +K ++ G +  +KARLVAKGY Q+ G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L ++++MSQP GFI + +   V
Subjt:  PIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKV

Query:  CKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRK
        CKL +++YGLKQA R+W +     + + GF  +V ++ ++     + + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+    R 
Subjt:  CKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRK

Query:  NKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNI
           L LSQ  YI  +L+R +M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K I
Subjt:  NKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNI

Query:  LKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVP
        L+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +   ++ E  W+   LT+L +   +  P
Subjt:  LKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVP

Query:  ITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHL-EAGVT
          +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F+    + GVT
Subjt:  ITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHL-EAGVT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-14427.75Show/hide
Query:  VSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQA---VKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMF
        V++    KLT  NY  W   ++ +    +L   L    P  PA   + A   V   Y RW + +  I   IL ++S  +        TA +I  +L++++
Subjt:  VSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQA---VKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMF

Query:  GQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQS---LLKNK
          PSY  H   L+++                    +F+     G  +D   QV  +LE+LP  +              +LT +   L   +S    L + 
Subjt:  GQPSYQLHHDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQS---LLKNK

Query:  GQIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRK-----------APSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPK----YLAEKK
          +   ANVV  +               T++ + Q  +G+ R             PS++       +     G+C  C+V GH  + CP+         +
Subjt:  GQIEGEANVVHSKRKFENGSSSGTKSVATSSKKTQKKKGNKRK-----------APSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPK----YLAEKK

Query:  KEKEGKFDLLVLETCLIEHDEF---TWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISI
        ++    F        L  +  +    W+LDSGAT+H+ S F  N SF Q   G   + +  G  I     G+A L   +R L L  +L VP I +NLIS+
Subjt:  KEKEGKFDLLVLETCLIEHDEF---TWILDSGATNHVCSSFQENSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISI

Query:  SCLIEQSYTTFDMNEAFI----YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTH-LWHLRLGHINIDRIGRLVKNGLLS
          L   +  + +   A         GV +   K ++ LY               +  A +Q+     SP S  TH  WH RLGH ++  +  ++ N  L 
Subjt:  SCLIEQSYTTFDMNEAFI----YKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSKRQKISPSSNNTH-LWHLRLGHINIDRIGRLVKNGLLS

Query:  KLE-DDSLPPCESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKV
         L     L  C  C   K  K PF+     + +PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  FK+ VEN     I  
Subjt:  KLE-DDSLPPCESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKV

Query:  LRSDRGGEYMDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLH
        L SD GGE++  R  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++PF+   G+ P+  
Subjt:  LRSDRGGEYMDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLH

Query:  HFRIWGCPAHVLVT--NPKKLEPRSKLCQFGGYFYDPQ--------ENKVIVSTNATFLE---------------EDHMRDHKPR---------------
          +++GC  +  +   N  KLE +SK C F GY               ++  S +  F E               ++   D  P                
Subjt:  HFRIWGCPAHVLVT--NPKKLEPRSKLCQFGGYFYDPQ--------ENKVIVSTNATFLE---------------EDHMRDHKPR---------------

Query:  SKIILGEATEESSRVVDEVGP--STRVDEGT---------SSSDPSRPSQT--------------------LGMPRRSGRVVLQPDRYLGLTETQVVIPN
        +   LG   + S R      P  +T+V             SSS+P+ PS                      L  P  +      P++   L ++ +  P+
Subjt:  SKIILGEATEESSRVVDEVGP--STRVDEGT---------SSSDPSRPSQT--------------------LGMPRRSGRVVLQPDRYLGLTETQVVIPN

Query:  ----------------------------------------------------DGIEDP---LSY----------RQAMNDEDKDEWVKAMNLEMESMYSN
                                                            DGI  P    SY          R A+     D W +AM  E+ +   N
Subjt:  ----------------------------------------------------DGIEDP---LSY----------RQAMNDEDKDEWVKAMNLEMESMYSN

Query:  FVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMS
          W+L    P  V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+ G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + ++MS
Subjt:  FVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMS

Query:  QPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGE
        QP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ ++ ++     R + ++++YVDDIL+ GND   L      L+ +F +K+  +
Subjt:  QPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGE

Query:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQ
          Y LGI+    R  + L LSQ  Y   +L+R +M  +K    P      L+     K P   E      Y   VGSL Y +  TRPD+ YAV  +S+Y 
Subjt:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQ

Query:  SNPGLDHWGAVKNILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRK
          P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +   ++ E  W+  
Subjt:  SNPGLDHWGAVKNILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRK

Query:  FLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFE
         LT+L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F+
Subjt:  FLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-8235.54Show/hide
Query:  EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   ++   W  AM+ E+ +M +   WE+  LP   KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL

Query:  SIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIA-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+ A QG       VC L +SIYGLKQASR W ++F   +  FGF Q+  +   + K+       +
Subjt:  SIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIA-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCVYKKVNKRKVAFL

Query:  ILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEN
        ++YVDDI++  N+   + ++K  L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S           + 
Subjt:  ILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEN

Query:  MRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQADKDSRKSTSGSVFTLNGGA
        +    Y   +G LMY  + TR DI +AV  +S++   P L H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    
Subjt:  MRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQADKDSRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        + W+S KQ  ++ S+ EAEY A   +  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQGCIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0837.8Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSK
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS +++   P E+W    P+  + R +GC A++   +  KL+PR+K
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSK

ATMG00750.1 GAG/POL/ENV polyprotein2.2e-1766.07Show/hide
Query:  ILHSGFFWPTLFKDAHCFYKQCDACQRRGNLRPRDEMPLTYILEVELFDVWGIDFM
        +L +GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  ILHSGFFWPTLFKDAHCFYKQCDACQRRGNLRPRDEMPLTYILEVELFDVWGIDFM

ATMG00810.1 DNA/RNA polymerases superfamily protein3.8e-2232.48Show/hide
Query:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L+   M + K    P    ++ S   + K P   
Subjt:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  ENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNG
        +      + S VG+L Y  L TRPDI YAV +V +    P L  +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  ENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNG

Query:  GAVVWRSIKQGCIADSTMEAEYVAACESAKEAVW
          + W + +Q  ++ S+ E EY A   +A E  W
Subjt:  GAVVWRSIKQGCIADSTMEAEYVAACESAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.6e-1437.86Show/hide
Query:  EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL
        ++P S   A+ D     W +AM  E++++  N  W L   P     +GCKW++K K  + G +   KARLVAKG+ Q+EG+ + ET+SPV    +IR +L
Subjt:  EDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAATTGTCTCTCTACTTAAAAATGAGAAATTAACAGGCGAAAATTATGCTACGTGGAAGTCGAACTTGAATACGATTCTTGTTGTCGATGATCTGCGGTT
CGTCTTAACAGAGGATTGTCCTCCAGCCCCTGCTCGTAATGCATCCCAGGCAGTTAAGGATGCTTATGACCGTTGGACAAAGGCCAATGATAAGATTCGCGTCTATATCT
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGGCATGAGGGCATGGTAACAGCGAGGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGCTGTTGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGCAGAGGCAAACGGGGC
GGTCATAGATGAGCGTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTCGAAGCAATGCAATGATGAATAAAATAACATTCAACCTGACTA
GCCTCCTTAATGAGCTACAACTCTATCAGTCTCTTCTTAAGAACAAGGGACAGATAGAAGGAGAGGCAAACGTTGTTCACTCCAAAAGAAAGTTCGAGAATGGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGAGGAAAGCTCCCAGTACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTGT
GGCAGACAAGGGCAAGTGTTTTCACTGCAATGTAGATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAAGAAGGAAAAGGAAGGTAAATTTGATTTAC
TGGTTTTAGAAACATGCCTGATAGAACATGATGAGTTTACCTGGATACTTGATTCAGGAGCAACTAATCATGTTTGCTCTTCTTTTCAGGAAAATAGTTCCTTTCGACAG
TTGAATGATGGGGAAATGACACTCAGGGTCGGAACTGGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAGTTATTTTTTAGAAATAGATTCTTACTTTTAGAAAG
TTTGTTAATCGTTCCCAAGATTAAGAGGAACTTGATTTCTATTTCATGTTTGATTGAACAATCGTATACTACTTTTGATATGAATGAAGCGTTCATTTATAAAAATGGTG
TACTTATTTGTTCAGCCAAATTGGAAAATAACTTGTATGTTTTAAGACCAACCGAAAACAAAGCACTTTTAAATCACGAATTATTTAAAACTGCTGAAACTCAAAGTAAA
AGGCAAAAGATTTCTCCATCAAGTAACAATACTCATCTTTGGCATCTTAGACTTGGTCATATAAACATCGATAGGATCGGTCGTTTAGTAAAGAATGGACTTCTAAGCAA
GTTAGAAGATGATTCACTACCACCTTGTGAATCTTGTCTCAAGGGAAAAATGACCAAAAGACCCTTTACTGGAAAAGGTTATAGAGCCAAAGAACCCTTAGAGTTAATAC
ATTCGGATCTATGTGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTATAGATGATTATTCAAGATATGGTTACTTATACCTAATGGGACAT
AAGTCTGAAGCCCTTGAAAAGTTTAAGGAGTTTAAGGCTGAAGTTGAAAACCTATTAGGTAAATCTATTAAAGTACTTCGATCTGATCGAGGAGGGGAGTATATGGATCA
AAGATTCCAGGATTATATGATAGAACATGGAATCCAATCTCAACTCTCAGCCCCTGGTACACCTCAGCAGAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACA
TGGTTCGATCAATGATGAGCTATGCTCAATTGCCTAGCTCGTTTTGGGGATATGCAGTAGAGACTGCAGTATACATCTTGAACAATGTTCCCTCGAAAAGTGTTTCTGAA
ACACCTTTCGAGTTATGGAAGGGGCGTAAACCTAGTTTGCATCACTTTCGCATATGGGGTTGTCCAGCACACGTGCTTGTGACGAATCCTAAGAAACTGGAACCTCGTTC
AAAATTGTGCCAATTTGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGCCACTTTCTTAGAGGAAGACCACATGAGAGATCATAAACCAC
GAAGCAAAATAATATTAGGAGAAGCTACTGAAGAATCATCAAGAGTTGTTGATGAAGTTGGACCTTCAACAAGGGTTGATGAAGGAACTAGTTCTTCAGATCCATCCCGT
CCATCTCAAACGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTGTTACAACCTGATCGCTACTTGGGTTTAACTGAAACTCAGGTTGTCATACCTAATGATGGCATTGA
GGATCCATTGTCCTATCGACAGGCAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGAACCTGGAAATGGAGTCAATGTACTCCAATTTTGTATGGGAACTTG
CAGATCTACCAAATGGGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGAGATGCAGCTGGAAAGGTACAGACCTTCAAAGCTAGACTAGTGGCAAAGGGT
TATACCCAAAAGGAGGGGGTTGACTATGAGGAAACCTTTTCTCCTGTAGCCATGTTAAAGTCCATAAGAATTCTTTTATCCATTGCCACATATTATGACTATGAAATATG
GCAAATGGACGTCAAGACTGCCTTTCTGAATGGTAATCTTGAAGAGAATATCTTTATGTCTCAACCCGAAGGGTTCATAGCCCAAGGTCAGGAACAAAAGGTTTGTAAGC
TAAATCGATCCATTTATGGATTGAAACAAGCATCCAGATCATGGAACATTAGATTTGATACTGCAATCAAATCTTTTGGCTTTGACCAAAACGTTGATGAGTCTTGTGTA
TACAAGAAAGTCAACAAAAGGAAAGTAGCTTTTCTAATACTTTATGTAGACGATATCCTACTCATTGGGAATGATGTAGGATACCTCACTGACATTAAGAAATGGCTAGC
AGCCCAATTCCAAATGAAAGATCTGGGAGAGGCTCAATATGTTCTGGGAATCCAAATCATTAGGGATCGTAAGAACAAAACGCTAGCTCTGTCTCAAGCAACGTATATTG
ACAAGATGTTGTCTCGATATTCGATGCAAAACTCCAAGAAGGGACTCCTACCCTTCAGACATGGGGTACACCTGTCTAAGGAACAATCTCCTAAGACACCTCAAGAAGTT
GAGAACATGAGACACATTCCCTACGCCTCAGCTGTGGGCAGCTTAATGTATGCCATGCTCTGCACTAGGCCTGACATATGTTATGCAGTGGGAGTAGTCAGTAGATATCA
GTCTAATCCAGGGTTGGACCACTGGGGAGCGGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAGGGACTACATGCTTGTGTATGGAGCTAAGGATTTGATTCTTACTG
GATACACTGATTCTGACTTTCAAGCTGATAAGGATTCTAGGAAATCCACATCAGGGTCAGTATTCACCCTCAATGGAGGAGCAGTAGTGTGGAGAAGCATCAAACAAGGA
TGCATAGCAGACTCAACCATGGAGGCTGAGTATGTAGCAGCTTGCGAATCAGCTAAGGAGGCAGTATGGCTCAGAAAGTTCTTGACTGATTTGGAAGTGGTTCCGAATAT
GAACGTGCCCATAACACTATACTGTGATAACAGTGGGGCTGTAGCCAACTCAAAAGAACCTCGAAGCCACAAACGGGGTAAACACATCGAGAGAAAGTATCACCTGATAC
GGGAGATTGTGCAAAGAGGAGATGTGATCGTCACTAAGATCGCCTCGGAGCACAACATCGCTGATCCATTTACAAAGGCTCTCACAGCTAAAGTGTTTGAGGGTCATCTA
GAAGCTGGTGTTACAGACACTCGTGGAGGACTAACTAGTCGATATTGGTCTAAGTTCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTGTCCATAAGGT
CCCACTGGTAGCTCGTAAAGGGCAAAAGCAGGGCAAATTTATCCATCCTATATATTATGCAAGCAAGAATTTAAATGAGGCACAAGTCAACTATACAACTACTGAAAAGG
AGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCCGGCCATATTTTGTTGGATGCAAAGTCACAGTGTTCACGGATCATGCAGCAATAAGGTATCTAATGGCTAAGAAA
GATGCAAAACCGAGGCTAATTCGTTGGGTTTTATTATTGCAGGAATTCGACTTGGAGATAAAGGATAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTGTCTCGTCT
TGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATTCTTTTCCGGATGAACATCTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATGTCCCTTGCTATGCTG
ATATTGCCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGACTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGGGATGAGCCATTTATG
TATAAGCAATGCTCTGACGGTATTATTCGTAGGTGTGTTTCAGGTGATGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCTGTATGGAGGTCATTTTAGCAGTCA
GAGGACAGCTATGAGGATTTTGCATAGTGGATTCTTCTGGCCTACCTTATTTAAGGATGCCCATTGCTTTTACAAGCAATGCGATGCTTGCCAAAGGAGAGGAAATTTGA
GACCTAGAGATGAAATGCCTCTTACTTATATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATC
TTATTGGCAGTTGATTACGTGTCCAAGCGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTTCTTCAATCGCACATCTTTGTGCGGTTTGG
GACACCTAGGGCTCTAGTGAGTGACGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTATTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACC
CACAAGTAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAATCTATTCTGGAGAAAGTAGTCCATCCATCTAGGAAGGATTGGTCTTTTAGGTTGGATGAGGCTCTT
TGGGCTTATAGGGCAGCCTATAAGACTCCTCTAGCTCTATTTATTCCAAACCCTTCAGTAGGTGCTTCTCGCTCCATTTTCTTGTTTCGAATCTTTGTTGGTTGCATTCT
TCTTTCGTTATTGTTATTTTCTAACGCCAGAGGAAAAAAAGCCAAGAAAAGGAGAAGCAACAAAGGGCTACGGGACAAGAAGAAGTACAGGAGGCAACAGAGGTTGTTGC
TACCGCAGTGGTTGAAGAAGACCCGCAAGAACCAGAAAACAAACCCAGAGGAGGCTGAGCAGAGAGTTGCGGATACAGAAGAAGTTCGAGAGGAGGTTCGAGAAACACAG
GATGAGGTTGTGAAAAAGAACATGCAGAGGTTGTGCCTGAAGGAGATAATGAGCCAGTGCATGAGGCTCGGGTGGAGGTGGTCATGCCGGAGGTGCCCAGACGTCGCCGC
ATTAAGCGGAAGGCTGGATGCATTCAACGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGCAGACCACGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTTTGTAAACG
CACAAGTGGTGCATGAATTTTATGCTAACATTGACAAAGAAGATGGTTTCCAGGTGATTGTCCGAGGAGTTGAGGTAGATTGGAGTCCTAGTGCTATCAACGCACTGTAC
AATCTTCAGAACTTCCCTCGTGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGAGGGAGTTAGGTATTGAAGGAGCACAGTGCAG
CTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAATTGTCTCTCTACTTAAAAATGAGAAATTAACAGGCGAAAATTATGCTACGTGGAAGTCGAACTTGAATACGATTCTTGTTGTCGATGATCTGCGGTT
CGTCTTAACAGAGGATTGTCCTCCAGCCCCTGCTCGTAATGCATCCCAGGCAGTTAAGGATGCTTATGACCGTTGGACAAAGGCCAATGATAAGATTCGCGTCTATATCT
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGGCATGAGGGCATGGTAACAGCGAGGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGCTGTTGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGCAGAGGCAAACGGGGC
GGTCATAGATGAGCGTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTCGAAGCAATGCAATGATGAATAAAATAACATTCAACCTGACTA
GCCTCCTTAATGAGCTACAACTCTATCAGTCTCTTCTTAAGAACAAGGGACAGATAGAAGGAGAGGCAAACGTTGTTCACTCCAAAAGAAAGTTCGAGAATGGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGAGGAAAGCTCCCAGTACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTGT
GGCAGACAAGGGCAAGTGTTTTCACTGCAATGTAGATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAAGAAGGAAAAGGAAGGTAAATTTGATTTAC
TGGTTTTAGAAACATGCCTGATAGAACATGATGAGTTTACCTGGATACTTGATTCAGGAGCAACTAATCATGTTTGCTCTTCTTTTCAGGAAAATAGTTCCTTTCGACAG
TTGAATGATGGGGAAATGACACTCAGGGTCGGAACTGGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAGTTATTTTTTAGAAATAGATTCTTACTTTTAGAAAG
TTTGTTAATCGTTCCCAAGATTAAGAGGAACTTGATTTCTATTTCATGTTTGATTGAACAATCGTATACTACTTTTGATATGAATGAAGCGTTCATTTATAAAAATGGTG
TACTTATTTGTTCAGCCAAATTGGAAAATAACTTGTATGTTTTAAGACCAACCGAAAACAAAGCACTTTTAAATCACGAATTATTTAAAACTGCTGAAACTCAAAGTAAA
AGGCAAAAGATTTCTCCATCAAGTAACAATACTCATCTTTGGCATCTTAGACTTGGTCATATAAACATCGATAGGATCGGTCGTTTAGTAAAGAATGGACTTCTAAGCAA
GTTAGAAGATGATTCACTACCACCTTGTGAATCTTGTCTCAAGGGAAAAATGACCAAAAGACCCTTTACTGGAAAAGGTTATAGAGCCAAAGAACCCTTAGAGTTAATAC
ATTCGGATCTATGTGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTATAGATGATTATTCAAGATATGGTTACTTATACCTAATGGGACAT
AAGTCTGAAGCCCTTGAAAAGTTTAAGGAGTTTAAGGCTGAAGTTGAAAACCTATTAGGTAAATCTATTAAAGTACTTCGATCTGATCGAGGAGGGGAGTATATGGATCA
AAGATTCCAGGATTATATGATAGAACATGGAATCCAATCTCAACTCTCAGCCCCTGGTACACCTCAGCAGAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACA
TGGTTCGATCAATGATGAGCTATGCTCAATTGCCTAGCTCGTTTTGGGGATATGCAGTAGAGACTGCAGTATACATCTTGAACAATGTTCCCTCGAAAAGTGTTTCTGAA
ACACCTTTCGAGTTATGGAAGGGGCGTAAACCTAGTTTGCATCACTTTCGCATATGGGGTTGTCCAGCACACGTGCTTGTGACGAATCCTAAGAAACTGGAACCTCGTTC
AAAATTGTGCCAATTTGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGCCACTTTCTTAGAGGAAGACCACATGAGAGATCATAAACCAC
GAAGCAAAATAATATTAGGAGAAGCTACTGAAGAATCATCAAGAGTTGTTGATGAAGTTGGACCTTCAACAAGGGTTGATGAAGGAACTAGTTCTTCAGATCCATCCCGT
CCATCTCAAACGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTGTTACAACCTGATCGCTACTTGGGTTTAACTGAAACTCAGGTTGTCATACCTAATGATGGCATTGA
GGATCCATTGTCCTATCGACAGGCAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGAACCTGGAAATGGAGTCAATGTACTCCAATTTTGTATGGGAACTTG
CAGATCTACCAAATGGGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGAGATGCAGCTGGAAAGGTACAGACCTTCAAAGCTAGACTAGTGGCAAAGGGT
TATACCCAAAAGGAGGGGGTTGACTATGAGGAAACCTTTTCTCCTGTAGCCATGTTAAAGTCCATAAGAATTCTTTTATCCATTGCCACATATTATGACTATGAAATATG
GCAAATGGACGTCAAGACTGCCTTTCTGAATGGTAATCTTGAAGAGAATATCTTTATGTCTCAACCCGAAGGGTTCATAGCCCAAGGTCAGGAACAAAAGGTTTGTAAGC
TAAATCGATCCATTTATGGATTGAAACAAGCATCCAGATCATGGAACATTAGATTTGATACTGCAATCAAATCTTTTGGCTTTGACCAAAACGTTGATGAGTCTTGTGTA
TACAAGAAAGTCAACAAAAGGAAAGTAGCTTTTCTAATACTTTATGTAGACGATATCCTACTCATTGGGAATGATGTAGGATACCTCACTGACATTAAGAAATGGCTAGC
AGCCCAATTCCAAATGAAAGATCTGGGAGAGGCTCAATATGTTCTGGGAATCCAAATCATTAGGGATCGTAAGAACAAAACGCTAGCTCTGTCTCAAGCAACGTATATTG
ACAAGATGTTGTCTCGATATTCGATGCAAAACTCCAAGAAGGGACTCCTACCCTTCAGACATGGGGTACACCTGTCTAAGGAACAATCTCCTAAGACACCTCAAGAAGTT
GAGAACATGAGACACATTCCCTACGCCTCAGCTGTGGGCAGCTTAATGTATGCCATGCTCTGCACTAGGCCTGACATATGTTATGCAGTGGGAGTAGTCAGTAGATATCA
GTCTAATCCAGGGTTGGACCACTGGGGAGCGGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAGGGACTACATGCTTGTGTATGGAGCTAAGGATTTGATTCTTACTG
GATACACTGATTCTGACTTTCAAGCTGATAAGGATTCTAGGAAATCCACATCAGGGTCAGTATTCACCCTCAATGGAGGAGCAGTAGTGTGGAGAAGCATCAAACAAGGA
TGCATAGCAGACTCAACCATGGAGGCTGAGTATGTAGCAGCTTGCGAATCAGCTAAGGAGGCAGTATGGCTCAGAAAGTTCTTGACTGATTTGGAAGTGGTTCCGAATAT
GAACGTGCCCATAACACTATACTGTGATAACAGTGGGGCTGTAGCCAACTCAAAAGAACCTCGAAGCCACAAACGGGGTAAACACATCGAGAGAAAGTATCACCTGATAC
GGGAGATTGTGCAAAGAGGAGATGTGATCGTCACTAAGATCGCCTCGGAGCACAACATCGCTGATCCATTTACAAAGGCTCTCACAGCTAAAGTGTTTGAGGGTCATCTA
GAAGCTGGTGTTACAGACACTCGTGGAGGACTAACTAGTCGATATTGGTCTAAGTTCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTGTCCATAAGGT
CCCACTGGTAGCTCGTAAAGGGCAAAAGCAGGGCAAATTTATCCATCCTATATATTATGCAAGCAAGAATTTAAATGAGGCACAAGTCAACTATACAACTACTGAAAAGG
AGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCCGGCCATATTTTGTTGGATGCAAAGTCACAGTGTTCACGGATCATGCAGCAATAAGGTATCTAATGGCTAAGAAA
GATGCAAAACCGAGGCTAATTCGTTGGGTTTTATTATTGCAGGAATTCGACTTGGAGATAAAGGATAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTGTCTCGTCT
TGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATTCTTTTCCGGATGAACATCTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATGTCCCTTGCTATGCTG
ATATTGCCAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGACTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAATTTTTCTATTGGGATGAGCCATTTATG
TATAAGCAATGCTCTGACGGTATTATTCGTAGGTGTGTTTCAGGTGATGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCTGTATGGAGGTCATTTTAGCAGTCA
GAGGACAGCTATGAGGATTTTGCATAGTGGATTCTTCTGGCCTACCTTATTTAAGGATGCCCATTGCTTTTACAAGCAATGCGATGCTTGCCAAAGGAGAGGAAATTTGA
GACCTAGAGATGAAATGCCTCTTACTTATATTTTAGAAGTTGAATTATTTGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATC
TTATTGGCAGTTGATTACGTGTCCAAGCGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTTCTTCAATCGCACATCTTTGTGCGGTTTGG
GACACCTAGGGCTCTAGTGAGTGACGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTATTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACC
CACAAGTAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAATCTATTCTGGAGAAAGTAGTCCATCCATCTAGGAAGGATTGGTCTTTTAGGTTGGATGAGGCTCTT
TGGGCTTATAGGGCAGCCTATAAGACTCCTCTAGCTCTATTTATTCCAAACCCTTCAGTAGGTGCTTCTCGCTCCATTTTCTTGTTTCGAATCTTTGTTGGTTGCATTCT
TCTTTCGTTATTGTTATTTTCTAACGCCAGAGGAAAAAAAGCCAAGAAAAGGAGAAGCAACAAAGGGCTACGGGACAAGAAGAAGTACAGGAGGCAACAGAGGTTGTTGC
TACCGCAGTGGTTGAAGAAGACCCGCAAGAACCAGAAAACAAACCCAGAGGAGGCTGAGCAGAGAGTTGCGGATACAGAAGAAGTTCGAGAGGAGGTTCGAGAAACACAG
GATGAGGTTGTGAAAAAGAACATGCAGAGGTTGTGCCTGAAGGAGATAATGAGCCAGTGCATGAGGCTCGGGTGGAGGTGGTCATGCCGGAGGTGCCCAGACGTCGCCGC
ATTAAGCGGAAGGCTGGATGCATTCAACGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGCAGACCACGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTTTGTAAACG
CACAAGTGGTGCATGAATTTTATGCTAACATTGACAAAGAAGATGGTTTCCAGGTGATTGTCCGAGGAGTTGAGGTAGATTGGAGTCCTAGTGCTATCAACGCACTGTAC
AATCTTCAGAACTTCCCTCGTGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGAGGGAGTTAGGTATTGAAGGAGCACAGTGCAG
CTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGA
Protein sequenceShow/hide protein sequence
MSSSIVSLLKNEKLTGENYATWKSNLNTILVVDDLRFVLTEDCPPAPARNASQAVKDAYDRWTKANDKIRVYILASLSEVLAKRHEGMVTAREIMNSLQEMFGQPSYQLH
HDALKYVYSCCMKEGTSVREHVLDMMVQFNVAEANGAVIDERSQVAFILESLPKSFLQFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQIEGEANVVHSKRKFENGSS
SGTKSVATSSKKTQKKKGNKRKAPSTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKKKEKEGKFDLLVLETCLIEHDEFTWILDSGATNHVCSSFQENSSFRQ
LNDGEMTLRVGTGEVISAKAVGAAKLFFRNRFLLLESLLIVPKIKRNLISISCLIEQSYTTFDMNEAFIYKNGVLICSAKLENNLYVLRPTENKALLNHELFKTAETQSK
RQKISPSSNNTHLWHLRLGHINIDRIGRLVKNGLLSKLEDDSLPPCESCLKGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGH
KSEALEKFKEFKAEVENLLGKSIKVLRSDRGGEYMDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNNVPSKSVSE
TPFELWKGRKPSLHHFRIWGCPAHVLVTNPKKLEPRSKLCQFGGYFYDPQENKVIVSTNATFLEEDHMRDHKPRSKIILGEATEESSRVVDEVGPSTRVDEGTSSSDPSR
PSQTLGMPRRSGRVVLQPDRYLGLTETQVVIPNDGIEDPLSYRQAMNDEDKDEWVKAMNLEMESMYSNFVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKG
YTQKEGVDYEETFSPVAMLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDESCV
YKKVNKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLSRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
ENMRHIPYASAVGSLMYAMLCTRPDICYAVGVVSRYQSNPGLDHWGAVKNILKYLRRTRDYMLVYGAKDLILTGYTDSDFQADKDSRKSTSGSVFTLNGGAVVWRSIKQG
CIADSTMEAEYVAACESAKEAVWLRKFLTDLEVVPNMNVPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHL
EAGVTDTRGGLTSRYWSKFRGHRKYVCSEKSATVHKVPLVARKGQKQGKFIHPIYYASKNLNEAQVNYTTTEKELLAVVFAFEKFRPYFVGCKVTVFTDHAAIRYLMAKK
DAKPRLIRWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEHLFAVEVKVVRDVPCYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFM
YKQCSDGIIRRCVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHSGFFWPTLFKDAHCFYKQCDACQRRGNLRPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFI
LLAVDYVSKRVEAIACHQSDAKTVARFLQSHIFVRFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQVNGQAEISNREIKSILEKVVHPSRKDWSFRLDEAL
WAYRAAYKTPLALFIPNPSVGASRSIFLFRIFVGCILLSLLLFSNARGKKAKKRRSNKGLRDKKKYRRQQRLLLPQWLKKTRKNQKTNPEEAEQRVADTEEVREEVRETQ
DEVVKKNMQRLCLKEIMSQCMRLGWRWSCRRCPDVAALSGRLDAFNGDLPQFLRTGIADHGWELFCAKPEFVNAQVVHEFYANIDKEDGFQVIVRGVEVDWSPSAINALY
NLQNFPRAAYNEMAVAPSNEQLSDAVRELGIEGAQCSYPRLRRGHSSRLI