; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G23600 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G23600
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr3:20563838..20568793
RNA-Seq ExpressionCSPI03G23600
SyntenyCSPI03G23600
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0074.92Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKT-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN
        K+VEANVA TKRKFIRGSSSK K  PSK  AQ+KKKGKGK           +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCL E + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKT-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA-------------------------------------------------------
         TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA A                                                       
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA-------------------------------------------------------

Query:  ----------------------------------------------------------------------LENNTLPPCESCLEGKMTKRSFTGKGLRAK
                                                                              LE+N+LPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  ----------------------------------------------------------------------LENNTLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFK+YKAEVEN++GKTIK LRSDRGGEYMD +F+DYLIE GIQSQL A ST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST

Query:  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGY
        PQQ  NGV+ERRNRTLLDMVRSMMS+AQ+PDSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GY
Subjt:  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGY

Query:  PKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVV
        PKESRGGLFY PQENK+FVSTNATFLEEDH  +HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+ YLGL+ETQ++
Subjt:  PKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVV

Query:  IPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK
        IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLK
Subjt:  IPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK

Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVA
        SIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRF+TAIKSYGFEQNVDEPCVYKK+VNS+VA
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVA

Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------YGIHLSKEQCPKTPQEV
        FL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+                 +GIHLSKEQCPKTPQEV
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------YGIHLSKEQCPKTPQEV

Query:  EDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGG
        EDMRNIPY+SAVGSLMYA+                                      T++YMLVYG KDLIL GYTDSDFQ+DKDARKSTSGSVFTLNGG
Subjt:  EDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGG

Query:  AVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
        AVVWRS+KQTCIADSTMEAEYVAACEAAKEAVWLRKFLTD EVV NMHLPITLY DNSGAVANS+EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Subjt:  AVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQ

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.41Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.41Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.49Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK N NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.16Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAK
        EIVHRGDV+VTQI+S  N+  PFTK +TAK
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0063.41Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

A0A5A7TWB9 Gag/pol protein0.0e+0063.16Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DYMLVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVF LNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAK
        EIVHRGDV+VTQI+S  N+  PFTK +TAK
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAK

A0A5A7TZD7 Gag/pol protein0.0e+0063.49Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK N NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

A0A5D3CPJ6 Gag/pol protein0.0e+0063.41Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+ +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKGKTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------
        CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA                                                  
Subjt:  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA--------------------------------------------------

Query:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS
                                                                                     S LE N+LP CESCLEGKMTKR 
Subjt:  -----------------------------------------------------------------------------SALENNTLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR
         SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE R
Subjt:  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHR

Query:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV
        SKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV
Subjt:  SKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRV

Query:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR
         + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Subjt:  VHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR

Query:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN
        EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q 
Subjt:  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQN

Query:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y
        VDEPCVYK+++N  VAFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDKI                 +
Subjt:  VDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------Y

Query:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK
        G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+                                      T+DY LVYG+KDLIL GYTDSDFQTD+
Subjt:  GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDK

Query:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR
        D+RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLR FL D EVV NM  PITLY DNSGAVANSREPRSHKRGKHIERKYHLIR
Subjt:  DARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIR

Query:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR
        EIVHRGDV+VTQI+S  N+  PFTK +TAKVFEGHL+SLGLR
Subjt:  EIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLR

E2GK51 Gag/pol protein (Fragment)0.0e+0074.92Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKT-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN
        K+VEANVA TKRKFIRGSSSK K  PSK  AQ+KKKGKGK           +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCL E + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKT-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA-------------------------------------------------------
         TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA A                                                       
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA-------------------------------------------------------

Query:  ----------------------------------------------------------------------LENNTLPPCESCLEGKMTKRSFTGKGLRAK
                                                                              LE+N+LPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  ----------------------------------------------------------------------LENNTLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFK+YKAEVEN++GKTIK LRSDRGGEYMD +F+DYLIE GIQSQL A ST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST

Query:  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGY
        PQQ  NGV+ERRNRTLLDMVRSMMS+AQ+PDSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GY
Subjt:  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGY

Query:  PKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVV
        PKESRGGLFY PQENK+FVSTNATFLEEDH  +HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+ YLGL+ETQ++
Subjt:  PKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVV

Query:  IPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK
        IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLK
Subjt:  IPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK

Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVA
        SIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRF+TAIKSYGFEQNVDEPCVYKK+VNS+VA
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVA

Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------YGIHLSKEQCPKTPQEV
        FL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+                 +GIHLSKEQCPKTPQEV
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-----------------YGIHLSKEQCPKTPQEV

Query:  EDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGG
        EDMRNIPY+SAVGSLMYA+                                      T++YMLVYG KDLIL GYTDSDFQ+DKDARKSTSGSVFTLNGG
Subjt:  EDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGG

Query:  AVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
        AVVWRS+KQTCIADSTMEAEYVAACEAAKEAVWLRKFLTD EVV NMHLPITLY DNSGAVANS+EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Subjt:  AVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-11225.63Show/hide
Query:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV
        +G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+  +SD       S  TA++I+++L  ++ +   + +    
Subjt:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV

Query:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKR
        K + + ++    S+    H+ D ++   +A   G  IEE++++S +L +LP  +    T A     E NLT     L   +N  + +  +++ +   T +
Subjt:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKR

Query:  KFIRG---SSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKY---LAEKKAEKETQ-------GKYDLLVVETCLGEYENCTWILD
        K +     +++ T       N   K K   K + + K KC+HCG+ GH  ++C  Y   L  K  E E Q       G   ++         +NC ++LD
Subjt:  KFIRG---SSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKY---LAEKKAEKETQ-------GKYDLLVVETCLGEYENCTWILD

Query:  SGATNHI---------------------------------------------------------------------------------------------
        SGA++H+                                                                                             
Subjt:  SGATNHI---------------------------------------------------------------------------------------------

Query:  ------CFSFQENS----------SWKK----LSEGENTLKVGTGEMVSASALENN---TLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLC
                +FQ  S           W +    +S+G+  L++    M S  +L NN   +   CE CL GK  +  F  K L+     K PL +VHSD+C
Subjt:  ------CFSFQENS----------SWKK----LSEGENTLKVGTGEMVSASALENN---TLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLC

Query:  GPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAER
        GP+         YF+ F+D ++ Y   YLI +KS     F+ + A+ E      +  L  D G EY+    R + ++ GI   L    TPQ   NGV+ER
Subjt:  GPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAER

Query:  RNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGG
          RT+ +  R+M+S A++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N + K + +S    F+GY  E  G 
Subjt:  RNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGG

Query:  LFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS---------------
          +D    K  V+ +    E + ++    + + V  + SK + +K  P+ S K++                  D      ++ P+               
Subjt:  LFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS---------------

Query:  -----QQLREPRRSGRVV------HQPDNYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------
             Q L++ + S +         + D++L               ET   + + GI++P                     ++Y +              
Subjt:  -----QQLREPRRSGRVV------HQPDNYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------

Query:  AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR
           DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R
Subjt:  AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR

Query:  ILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVY---KKVVNSIVA
         +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R W   FE A+K   F  +  + C+Y   K  +N  + 
Subjt:  ILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVY---KKVVNSIVA

Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIYGIHLSKEQC----PKTPQEV-------EDMRNI
        +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ KI     + E C       P ++       ++  N 
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIYGIHLSKEQC----PKTPQEV-------EDMRNI

Query:  PYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLI----LIGYTDSDFQTDKDARKSTSGSVFTL-NGGA
        P  S +G LMY +                                      T D  L++  K+L     +IGY DSD+   +  RKST+G +F + +   
Subjt:  PYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLI----LIGYTDSDFQTDKDARKSTSGSVFTL-NGGA

Query:  VVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQ
        + W + +Q  +A S+ EAEY+A  EA +EA+WL+  LT   +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE V    + +  I ++ 
Subjt:  VVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQ

Query:  NIVVPFTKVITAKVFEGHLQSLGL
         +   FTK + A  F      LGL
Subjt:  NIVVPFTKVITAKVFEGHLQSLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-16330.69Show/hide
Query:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH
        K NGDN ++ W+  +  +L+   L  VL  +  +     A        + W   +E+A   I   +SD +        TA+ I   L  ++     + + 
Subjt:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH

Query:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTK
           K +Y   M EGT+   H+             G  IEE ++   +L SLP S+    T     K    L  + + L   + +      Q +A +   +
Subjt:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTK

Query:  RKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLGEYENC-------
         +  + SS+       +  A+ K K + K+  +N   CY+C Q GH+ R+CP     +K + ET G+ +      +V     V   + E E C       
Subjt:  RKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLGEYENC-------

Query:  -TWILDSGATNH----------------------------------ICF-----------------------------------SFQENSSWKKLSEGEN
          W++D+ A++H                                  IC                                    S+  N  W +L++G  
Subjt:  -TWILDSGATNH----------------------------------ICF-----------------------------------SFQENSSWKKLSEGEN

Query:  TLKVGT--------------GEMVSA------------------------------SALENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCG
         +  G               GE+ +A                              S  +  T+ PC+ CL GK  + SF     R    L+LV+SD+CG
Subjt:  TLKVGT--------------GEMVSA------------------------------SALENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCG

Query:  PMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERR
        PM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE + G+ +K LRSD GGEY    F +Y   +GI+ +     TPQ   NGVAER 
Subjt:  PMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERR

Query:  NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLF
        NRT+++ VRSM+  A++P SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  +   KL+ +S  C FIGY  E  G   
Subjt:  NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLF

Query:  YDPQENKIFVSTNATFLEED-----HISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSG
        +DP + K+  S +  F E +      +S+      +       S  + P+S+    D+  + G+                    HP+Q   Q +  RRS 
Subjt:  YDPQENKIFVSTNATFLEED-----HISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSG

Query:  RVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYT
        R   +   Y       V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ 
Subjt:  RVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYT

Query:  QREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFE
        Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL KS+YGLKQA R W ++F++ +KS  + 
Subjt:  QREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFE

Query:  QNVDEPCVY-KKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIY--------------
        +   +PCVY K+   +    L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ  YI+++               
Subjt:  QNVDEPCVY-KKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIY--------------

Query:  ---GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQ
            + LSK+ CP T +E  +M  +PY+SAVGSLMYA+                                      T    L +G  D IL GYTD+D  
Subjt:  ---GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------------------------TKDYMLVYGTKDLILIGYTDSDFQ

Query:  TDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYH
         D D RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +  +    ++   +Y D+  A+  S+    H R KHI+ +YH
Subjt:  TDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYH

Query:  LIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFE
         IRE+V    + V +IS+ +N     TKV+    FE
Subjt:  LIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFE

Q12490 Transposon Ty1-BL Gag-Pol polyprotein1.4e-2127.82Show/hide
Query:  CESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSSSLEKFKQYKAEVENKLGKTIKILRSD
        C  CL GK TK R   G  L+ +    P + +H+D+ GP++   +    YFISF D+ +++  +Y +H   + S L+ F    A ++N+   ++ +++ D
Subjt:  CESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSSSLEKFKQYKAEVENKLGKTIKILRSD

Query:  RGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFR
        RG EY +     +L +NGI      ++T    ++GVAER NRTLLD  R+ +  + +P+  W  A+E +  + N++ S           K +K + +H  
Subjt:  RGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFR

Query:  I----------WGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLE
        +          +G P  V   NP    H   +  +  +P + S G + Y P   K   +TN   L+
Subjt:  I----------WGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-9223.21Show/hide
Query:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        N+SI+ +  S   KL   NY  W   ++ +    +L   L       PA+     A R + D Y RW + ++     +L ++S  +        TA +I 
Subjt:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQN
        ++LR ++  P +   H        K+  +GT ++ +++  ++  F+   + G P++   QV  +LE+LP+ + P     +       LT +   L   ++
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQN

Query:  LTMGKGKQV----------EANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNK------GKCYHCGQNGHWLRNCPK---YLAEKKAEKET
          +                  N  TT        +++   R +  N++  ++     H  N       GKC  CG  GH  + C +   +L+   +++  
Subjt:  LTMGKGKQV----------EANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNK------GKCYHCGQNGHWLRNCPK---YLAEKKAEKET

Query:  ------QGKYDLLVVETCLGE-YENCTWILDSGATNHICFSFQENSSWKKLSEGENTL------------------------------------------
              Q + +L      LG  Y +  W+LDSGAT+HI   F   S  +  + G++ +                                          
Subjt:  ------QGKYDLLVVETCLGE-YENCTWILDSGATNHICFSFQENSSWKKLSEGENTL------------------------------------------

Query:  -----------------------------------------KVGTGEMVSA--------------------------SALENNTLP---------PCESC
                                                  + + + VS                           S + N +L           C  C
Subjt:  -----------------------------------------KVGTGEMVSA--------------------------SALENNTLP---------PCESC

Query:  LEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRF
        L  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN+    I    SD GGE++ L  
Subjt:  LEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRF

Query:  RDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVL
         +Y  ++GI        TP+   NG++ER++R +++   +++S A +P ++W YA   A Y++N +P+  +  E+P++   G   +    R++GC  +  
Subjt:  RDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVL

Query:  VQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATF----------------------------------------LEEDHISD------
        ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F                                        L     SD      
Subjt:  VQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATF----------------------------------------LEEDHISD------

Query:  --HQPRSKLVLKEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR---------------
            P +     ++S S +D   SS+                     T+   Q+H S                 Q L  P +S                 
Subjt:  --HQPRSKLVLKEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR---------------

Query:  ------VVHQPDNYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDV
              ++H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  W LV   P+ V
Subjt:  ------VVHQPDNYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDV

Query:  KPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK
          +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GFI++D+   
Subjt:  KPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK

Query:  VCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNR
        VCKL+K++YGLKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  +  Y LGI+    R
Subjt:  VCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNR

Query:  KNKTLAMSQASYI-DKIYGIHLSKEQCPKTPQEVEDMRNI----------PYASAVGSLMY----------------------------AITKDYMLVYG
            L +SQ  YI D +   ++   +   TP       ++           Y   VGSL Y                            A+ +    + G
Subjt:  KNKTLAMSQASYI-DKIYGIHLSKEQCPKTPQEVEDMRNI----------PYASAVGSLMY----------------------------AITKDYMLVYG

Query:  TKD----------LILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYD
        T +          L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+  +   +  P  +Y D
Subjt:  TKD----------LILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYD

Query:  NSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGL
        N GA      P  H R KHI   YH IR  V  G + V  +S+   +    TK ++   F+     +G+
Subjt:  NSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-9523.59Show/hide
Query:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        +N++I+ +  S   KL   NY  W   ++ +    +L   L    P  PA   ++A       Y RW + ++     IL ++S  +        TA +I 
Subjt:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL
        ++LR ++  P +                       HV  L  +  F+   + G P++   QV  +LE+LP  + P     +      +LT     L+N  
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL

Query:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKARPSKPNA-QIKKKGKGKTHKQNK---GKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKY
         +   L   +   + ANV       T + +  RG +       ++ N+ Q    G    ++Q K   G+C  C   GH  + CP+    +    + Q   
Subjt:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKARPSKPNA-QIKKKGKGKTHKQNK---GKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKY

Query:  DLLVVE-----TCLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTL------------------------------------------------
             +          Y    W+LDSGAT+HI   F   S  +  + G++ +                                                
Subjt:  DLLVVE-----TCLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTL------------------------------------------------

Query:  -----------------------------------KVGTGEMVSA--------------------------SALENNTLP---------PCESCLEGKMT
                                            + + + VS                           S + N++LP          C  C   K  
Subjt:  -----------------------------------KVGTGEMVSA--------------------------SALENNTLP---------PCESCLEGKMT

Query:  KRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIE
        K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN+    I  L SD GGE++ L  RDYL +
Subjt:  KRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIE

Query:  NGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NP
        +GI        TP+   NG++ER++R +++M  +++S A +P ++W YA   A Y++N +P+  +  ++P++   G+  +    +++GC  +  ++  N 
Subjt:  NGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NP

Query:  KKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE---------------EDHISDHQPR---------SKLVL-------------------
         KLE +SK C F+GY       L       +++ S +  F E               ++  SD  P          + LVL                   
Subjt:  KKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE---------------EDHISDHQPR---------SKLVL-------------------

Query:  -----------KEISKSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDNYLGLIETQVVIP-------------------
                     +  S+I  PSSS       +  + + Q H +Q        L  P  +    + P+    L ++ +  P                   
Subjt:  -----------KEISKSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDNYLGLIETQVVIP-------------------

Query:  ---------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKP
                                          DGI                 +P T  QAMKD   D+W +AM  E+ +   N  W LV   P  V  
Subjt:  ---------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKP

Query:  IGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVC
        +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GF+++D+   VC
Subjt:  IGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVC

Query:  KLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKN
        +L+K+IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+ GND   L      L+ +F +K+  D  Y LGI+    R  
Subjt:  KLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKN

Query:  KTLAMSQASY-IDKIYGIHLSKEQCPKTPQEVEDMRNI----------PYASAVGSLMY----------------------------AITKDYMLVYGTK
        + L +SQ  Y +D +   ++   +   TP        +           Y   VGSL Y                            A+ +    + GT 
Subjt:  KTLAMSQASY-IDKIYGIHLSKEQCPKTPQEVEDMRNI----------PYASAVGSLMY----------------------------AITKDYMLVYGTK

Query:  D----------LILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNS
        D          L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+  +   +  P  +Y DN 
Subjt:  D----------LILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNS

Query:  GAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGL
        GA      P  H R KHI   YH IR  V  G + V  +S+   +    TK ++   F+   + +G+
Subjt:  GAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.8e-6832Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF Q+  +   + K+  ++   +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ--ASYIDKIYGIHLSKEQCPKTPQEVEDMRN-------IPYAS
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     +   +     +D+   +       P  P       +         Y  
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQ--ASYIDKIYGIHLSKEQCPKTPQEVEDMRN-------IPYAS

Query:  AVGSLMY----------------------------AITKDYMLVYGT----------KDLILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQT
         +G LMY                            A+ K    + GT           ++ L  ++D+ FQ+ KD R+ST+G    L    + W+S KQ 
Subjt:  AVGSLMY----------------------------AITKDYMLVYGT----------KDLILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQT

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIRE
         ++ S+ EAEY A   A  E +WL +F  + ++   +  P  L+ DN+ A+  +     H+R KHIE   H +RE
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIRE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0532.93Show/hide
Query:  NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +P +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-0725.66Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIY---GIHLSKEQCPKTPQEVEDMRNI-------P
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y ++I    G+   K      P ++    +         
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIY---GIHLSKEQCPKTPQEVEDMRNI-------P

Query:  YASAVGSLMY-AITK--------------------DY-----------------MLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSI
        + S VG+L Y  +T+                    D+                 + ++    L +  + DSD+      R+ST+G    L    + W + 
Subjt:  YASAVGSLMY-AITK--------------------DY-----------------MLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSI

Query:  KQTCIADSTMEAEYVAACEAAKEAVW
        +Q  ++ S+ E EY A    A E  W
Subjt:  KQTCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.3e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACAAAAGCTAGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAAGGAAAGACTCACAAACAGAACAAGGGTAAGTGTTATCATTGTGGCCAAAACGGGCA
CTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGGGGAATATGAAAATT
GTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGAACACTCTCAAGGTTGGAACA
GGAGAGATGGTCTCAGCTTCAGCATTAGAAAATAATACTTTACCTCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGC
CAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATG
GTCATATTTACCTAATACATCATAAGTCTAGTAGTCTTGAAAAGTTCAAACAATATAAGGCTGAAGTAGAAAACAAATTAGGTAAAACAATAAAAATACTTCGATCAGAT
CGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTTTGCATCTAGTACACCTCAACAGAACAGTAACGGTGTAGC
AGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTCAGATGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGA
ATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTA
CAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGT
GTCAACAAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCAT
CCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATAACTATTTG
GGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGA
CCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATG
CCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATT
AGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACC
AGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGAGACTGCGA
TCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTTTTAGTTTTATATGTAGATGATATTCTACTTATT
GGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAA
CCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATA
TGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATAACAAAAGACTACATGCTCGTGTATGGTACAAAGGATCTTATCCTTATTGGATACACTGAT
TCAGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGA
TTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTCGAAGTCGTTTCAAATATGCATCTACCAA
TCACTTTATACTATGACAACAGTGGTGCGGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTA
CATCGTGGTGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGTTGTTCCTTTTACAAAAGTCATCACGGCTAAAGTGTTTGAGGGTCACCTACAAAGTTTAGG
TCTACGATGTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACAAAAGCTAGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAAGGAAAGACTCACAAACAGAACAAGGGTAAGTGTTATCATTGTGGCCAAAACGGGCA
CTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGGGGAATATGAAAATT
GTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGAACACTCTCAAGGTTGGAACA
GGAGAGATGGTCTCAGCTTCAGCATTAGAAAATAATACTTTACCTCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGC
CAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATG
GTCATATTTACCTAATACATCATAAGTCTAGTAGTCTTGAAAAGTTCAAACAATATAAGGCTGAAGTAGAAAACAAATTAGGTAAAACAATAAAAATACTTCGATCAGAT
CGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTTTGCATCTAGTACACCTCAACAGAACAGTAACGGTGTAGC
AGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTCAGATGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGA
ATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTA
CAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGT
GTCAACAAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCAT
CCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATAACTATTTG
GGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGA
CCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATG
CCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATT
AGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACC
AGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGAGACTGCGA
TCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTTTTAGTTTTATATGTAGATGATATTCTACTTATT
GGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAA
CCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATA
TGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATAACAAAAGACTACATGCTCGTGTATGGTACAAAGGATCTTATCCTTATTGGATACACTGAT
TCAGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGA
TTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTCGAAGTCGTTTCAAATATGCATCTACCAA
TCACTTTATACTATGACAACAGTGGTGCGGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTA
CATCGTGGTGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGTTGTTCCTTTTACAAAAGTCATCACGGCTAAAGTGTTTGAGGGTCACCTACAAAGTTTAGG
TCTACGATGTTTGTAA
Protein sequenceShow/hide protein sequence
MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLR
HEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSS
KTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGT
GEMVSASALENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSD
RGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLV
QNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYL
GLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLI
GNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAITKDYMLVYGTKDLILIGYTD
SDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIV
HRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLRCL