; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr4:10323123..10326980
RNA-Seq ExpressionCSPI04G12000
SyntenyCSPI04G12000
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0087.26Show/hide
Query:  IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESL
        +KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SL
Subjt:  IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESL

Query:  PKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN
        PKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+
Subjt:  PKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN

Query:  GHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN
        GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE +  TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+
Subjt:  GHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN

Query:  VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRL
        VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RL
Subjt:  VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRL

Query:  VKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENEL
        VKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+
Subjt:  VKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENEL

Query:  GKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR
        GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGR
Subjt:  GKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR

Query:  KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTR
        K SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK  
Subjt:  KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTR

Query:  KSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD
         S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD
Subjt:  KSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD

Query:  HAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC
         AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGL+  
Subjt:  HAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC

Query:  SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID
        SRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYID
Subjt:  SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID

Query:  KMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLM
        K+LSRYKMQNSKKG LP+R+GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLR T++YML+
Subjt:  KMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLM

Query:  YGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS
        YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNG AVVWRS+KQ CI DSTMEAEYVA CEAAKE VW RKFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  YGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS

Query:  REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
        +EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Subjt:  REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0072.96Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAK
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAK
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

A0A5A7TWB9 Gag/pol protein0.0e+0072.96Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DYML+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVF LNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAK
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAK
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAK

A0A5A7TZD7 Gag/pol protein0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

A0A5D3CPJ6 Gag/pol protein0.0e+0073.06Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP
        KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLP

Query:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC
        +SF+ F++NA +NKI + LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C
Subjt:  KSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR
        +HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ETCLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  
Subjt:  YHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDR

Query:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI
        +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHI
Subjt:  YVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHI

Query:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY
        NLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEY
Subjt:  NLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEY

Query:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET
        KAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSET
Subjt:  KAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSET

Query:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A
        P +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         
Subjt:  PYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------A

Query:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND
        +++PS+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ 
Subjt:  IDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND

Query:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ
        VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQ
Subjt:  VKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQ

Query:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN
        K+CKL +SIYGL+  SRSWNIRFDTAIKSYGF+Q VDEPCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+
Subjt:  KVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRN

Query:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK
        RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK
Subjt:  RKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVK

Query:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL
         ILKYLR T+DY L+YG+KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNG AVVWRSIKQ CI DSTMEAEYVA CEAAKE VW R FL DLEVVPNM  
Subjt:  NILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR
        PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDV+VTQI+S  N+ADPFTK LTAKVFEGHL +LGLR
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLR

E2GK51 Gag/pol protein (Fragment)0.0e+0087.26Show/hide
Query:  IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESL
        +KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SL
Subjt:  IKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESL

Query:  PKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN
        PKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+
Subjt:  PKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN

Query:  GHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN
        GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE +  TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+
Subjt:  GHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKN

Query:  VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRL
        VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RL
Subjt:  VLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRL

Query:  VKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENEL
        VKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+
Subjt:  VKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENEL

Query:  GKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR
        GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGR
Subjt:  GKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGR

Query:  KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTR
        K SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK  
Subjt:  KGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTR

Query:  KSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD
         S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD
Subjt:  KSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD

Query:  HAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC
         AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGL+  
Subjt:  HAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQAC

Query:  SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID
        SRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYID
Subjt:  SRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID

Query:  KMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLM
        K+LSRYKMQNSKKG LP+R+GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLR T++YML+
Subjt:  KMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLM

Query:  YGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS
        YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNG AVVWRS+KQ CI DSTMEAEYVA CEAAKE VW RKFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  YGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANS

Query:  REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
        +EPRSHKRGKHIERKYHLIREIVHRGDVVVTQ
Subjt:  REPRSHKRGKHIERKYHLIREIVHRGDVVVTQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.5e-14627.82Show/hide
Query:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILES
        KA   A+  I+  +SD       S  TA++I+++L  ++ +   + +    K + + ++    S+    H+ D ++   +A   G  IEE++++S +L +
Subjt:  KANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILES

Query:  LPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKC
        LP  +       +T +  N  + F    LL++  + +N      K+V   +        + +  K +   +KP    K   K K              KC
Subjt:  LPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKC

Query:  YHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-YENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSA
        +HCG+ GH  ++C  Y           EK+ +  T      +V E       +NC ++LDSGA++H+     + S +    E    LK+     GE + A
Subjt:  YHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-YENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSA

Query:  SAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSN
        +  G ++L   D  + L++VL+  +   NL+S+  + E    I F+ +   +   G++V               + + +LN       +  +   K  +N
Subjt:  SAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSN

Query:  AYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR
          LWH R GHI+  ++  + +  + S        + S   CE CL GK  +  F  K L+     K PL +VHSD+CGP+         YF+ F+D ++ 
Subjt:  AYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSR

Query:  YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGY
        Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  SFWG 
Subjt:  YGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGY

Query:  ALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIR
        A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N + K + +S    F+GY  E  G   +D    K  V+ +    E + + 
Subjt:  ALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIR

Query:  DHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS--------------------QQLREPRRSGRVV------H
            + + V  + SK + +K  P+ S K++                  D      ++ P+                    Q L++ + S +         
Subjt:  DHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS--------------------QQLREPRRSGRVV------H

Query:  QPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------AMKDV-----------DRDQWIKAM
        + D +L               ET   + + GI++P                     ++Y +                 DV           D+  W +A+
Subjt:  QPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------AMKDV-----------DRDQWIKAM

Query:  DLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
        + E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG
Subjt:  DLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG

Query:  NLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW
         L+E IYM  P+G         VCKL K+IYGL+  +R W   F+ A+K   F  +  + C+Y   K  +N  I +++LYVDD+++   D+  + + K++
Subjt:  NLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKW

Query:  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP----YRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLC
        L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N      P      Y +  S E C           N P  S +G LMY MLC
Subjt:  LAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP----YRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLC

Query:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLI----LTGYTDSDFQTDKDARKSTSGSVFTL-NGRAVVWRSIKQICIVDST
        TRPD+  +V ++SRY S    + W  +K +L+YL+ T D  L++  K+L     + GY DSD+   +  RKST+G +F + +   + W + +Q  +  S+
Subjt:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLI----LTGYTDSDFQTDKDARKSTSGSVFTL-NGRAVVWRSIKQICIVDST

Query:  MEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVF
         EAEY+A  EA +E +W +  LT + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE V    + +  I ++  +AD FTK L A  F
Subjt:  MEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVF

Query:  EGHLHNLGL
              LGL
Subjt:  EGHLHNLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-20835.09Show/hide
Query:  NEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKS
        +E+A   I   +SD +        TA+ I   L  ++     + +    K +Y   M EGT+   H+             G  IEE ++   +L SLP S
Subjt:  NEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKS

Query:  FIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCG
        +    T     K    L       LLNE  R +    G+    E       R + R S++  ++G     A+ K K + K+  +N          CY+C 
Subjt:  FIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCG

Query:  QNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLVEYENC--------TWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMV
        Q GH+ R+CP     +K + ET G+ +      +V     V   + E E C         W++D+ A++H   +      + +   G+  T+K+G     
Subjt:  QNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLVEYENC--------TWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMV

Query:  SASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQK
          + +GD+ +       ++LK+V +VP ++ NLIS   +    Y  S+  N+ +   KG +++   +    LY+          N EI +  E    + +
Subjt:  SASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQK

Query:  VSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIY
        +S +  LWH R+GH++   +  L K  L+S  +  ++ PC+ CL GK  + SF     R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y
Subjt:  VSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIY

Query:  LIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETA
        ++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y   +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA
Subjt:  LIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETA

Query:  AYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRS
         Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  +   KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S
Subjt:  AYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRS

Query:  KLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPL
        + V   I        S  + P+S+    D+  + G+                    HP+Q   Q +  RRS R   +  RY       V+I DD   +P 
Subjt:  KLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPL

Query:  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAT
        + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A 
Subjt:  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAT

Query:  FYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDI
          D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL KS+YGL+   R W ++FD+ +KS  + +   +PCVY K+   +    L+LYVDD+
Subjt:  FYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDI

Query:  LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYA
        L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ  YI+++L R+ M+N+K    P    + LSK+ CP T +E  +M  +PY+
Subjt:  LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYA

Query:  SAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQ
        SAVGSLMYAM+CTRPDI ++VG+VSR+  NPG++HW AVK IL+YLR T    L +G  D IL GYTD+D   D D RKS++G +FT +G A+ W+S  Q
Subjt:  SAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQ

Query:  ICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK
         C+  ST EAEY+A  E  KE +W ++FL +L +    ++   +YCD+  A+  S+    H R KHI+ +YH IRE+V    + V +IS+ +N AD  TK
Subjt:  ICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTK

Query:  ALTAKVFE
         +    FE
Subjt:  ALTAKVFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-12826.52Show/hide
Query:  ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQT
        +L ++S  +        TA +I ++LR ++  P +   H        K+  +GT ++ +++  ++  F+   + G P++   QV  +LE+LP+ + P   
Subjt:  ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQT

Query:  NASLNKIEFNLT----TLLNELQRFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN
          +       LT     LLN   +   ++      + AN       TT      G+ +      +  N     +        N  +     GKC  CG  
Subjt:  NASLNKIEFNLT----TLLNELQRFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQN

Query:  GHWLRNCPK---YLAEKKAEKET------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFF
        GH  + C +   +L+   +++        Q + +L +       Y +  W+LDSGAT+HI   F   S  +  + G+  + V  G  +  S  G   L  
Subjt:  GHWLRNCPK---YLAEKKAEKET------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFF

Query:  RDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLW
        + R + L N+LYVP + +NLIS+      + +  + +  SF++ +      G+ +     +D LY+        + +++      + + +   SS    W
Subjt:  RDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLW

Query:  HLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN
        H RLGH   + +  ++ +  LS L   +    C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS 
Subjt:  HLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSN

Query:  SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNV
          E F  +K  +EN     I    SD GGE++ L   +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N +
Subjt:  SLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNV

Query:  PSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDH
        P+  +  E+P++   G   +    R++GC  +  ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++ 
Subjt:  PSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDH

Query:  QPRSKLVL-------------------------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS---------
        +  S  V                                       ++S S +D   SS+                     T+   Q+H S         
Subjt:  QPRSKLVL-------------------------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS---------

Query:  --------QQLREPRRSGR---------------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMK
                Q L  P +S                       ++H P     ++      P            GI                 +P T  QA+K
Subjt:  --------QQLREPRRSGR---------------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMK

Query:  DVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI
        D   ++W  AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I
Subjt:  DVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI

Query:  WQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDV
         Q+DV  AFL G L + +YMSQP GFI++D+   VCKL+K++YGL+   R+W +     + + GF  +V +  ++       I ++++YVDDIL+ GND 
Subjt:  WQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDV

Query:  GYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLM
          L +    L+ +F +KD  +  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS     K     E      Y   VGSL 
Subjt:  GYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLM

Query:  YAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDS
        Y +  TRPDI Y+V  +S++   P  +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +V S
Subjt:  YAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDS

Query:  TMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKV
        + EAEY +    + E  W    LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +S+   +AD  TK L+   
Subjt:  TMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKV

Query:  FEGHLHNLGL
        F+     +G+
Subjt:  FEGHLHNLGL

Q99337 Transposon Ty1-NL2 Gag-Pol polyprotein5.4e-3521.49Show/hide
Query:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC
        +LDSGA+  +  S     S    S  +I +       +  +A+GDL+  F+D       VL+ P +  +L+S++ +        F  N       G ++ 
Subjt:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC

Query:  SAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLW---HLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTK-RSF
          +   + Y +      ++L + I       N     S+  Y +   H  L H N   I   +K+  ++   ++ +         C  CL GK TK R  
Subjt:  SAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLW---HLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTK-RSF

Query:  TGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI
         G  L+ +    P + +H+D+ GP++   +    YFISF D+ +++  +Y +H   + + L+ F    A ++N+   ++ +++ DRG EY +     +L 
Subjt:  TGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI

Query:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRI----------WGCPAH
        +NGI    +  +  + +GV+ER NRTLLD  R+ +  S + +  W  A+E +  + N++ S           K +K + +H  +          +G P  
Subjt:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRI----------WGCPAH

Query:  VLVQNPKKLEH------------RSKLCFFIGYP----------------KESR------GGLFYDPQENKI------FVSTN-------ATFLEEDHIR
        V   NP    H            R+   + I  P                KESR        L +D   N++      F+++N         F  +  + 
Subjt:  VLVQNPKKLEH------------RSKLCFFIGYP----------------KESR------GGLFYDPQENKI------FVSTN-------ATFLEEDHIR

Query:  DHQPR---------------------SKLVLK-----------EISKSAI--DKPSSSTKVVDKTRKSG----------------------QSHPSQ---
          QPR                     SK V K            ISKS I   K  SST  +     +G                       SH S+   
Subjt:  DHQPR---------------------SKLVLK-----------EISKSAI--DKPSSSTKVVDKTRKSG----------------------QSHPSQ---

Query:  -------------------------------QLREPRRSGRVVHQ-----------------------------------------PD------------
                                       Q  E     R++H+                                         PD            
Subjt:  -------------------------------QLREPRRSGRVVHQ-----------------------------------------PD------------

Query:  --------------------------------RYLGLIETQVVIPDD--------GIEDP------------------------LTYKQAM---KDV-DR
                                        R L   ET++ +  D         +E P                        L Y +A+   KD+ ++
Subjt:  --------------------------------RYLGLIETQVVIPDD--------GIEDP------------------------LTYKQAM---KDV-DR

Query:  DQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCK------WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE
        +++I+A   E+  +     W   D+  D K I  K      +I+ RKRD      T KAR VA+G  Q          S      ++   LS+A   +Y 
Subjt:  DQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCK------WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE

Query:  IWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQ-NVDE----PCVYKKVVNSIIAFLVLYVDDIL
        I Q+D+ +A+L  +++E +Y+  P      D   K+ +LKKS+YGL+    +W   ++T IKSY  EQ +++E     CV+K   NS +  + L+VDD++
Subjt:  IWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQ-NVDE----PCVYKKVVNSIIAFLVLYVDDIL

Query:  LIGNDVGYLTDIKKWLAMQFQMK--DLGDA----QY-VLGIQIVRNRKNKTLAMSQASYIDKM--LSRYKMQNSKKGLLPYRYGIHLSKEQC----PKTP
        L   D+     I   L  Q+  K  +LG++    QY +LG++I   R        + S  +K+  L+ +     KK   P + G+++ +++      +  
Subjt:  LIGNDVGYLTDIKKWLAMQFQMK--DLGDA----QY-VLGIQIVRNRKNKTLAMSQASYIDKM--LSRYKMQNSKKGLLPYRYGIHLSKEQC----PKTP

Query:  QEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMY----GTK-DLILTGYTDSDFQTDKDARKSTSG
        ++V +M+ +     +G   Y     R D+ Y +  ++++   P R        +++++  T+D  L++     TK D  L   +D+ +  ++   KS  G
Subjt:  QEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMY----GTK-DLILTGYTDSDFQTDKDARKSTSG

Query:  SVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHK-RGKHIERKYHLIREIVHRGD
        ++F LNG+ +  +S K      ST EAE  A  EA          + +L   P +     L  D+   ++  +     K R +    K   +R+ V   +
Subjt:  SVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHK-RGKHIERKYHLIREIVHRGD

Query:  VVVTQISSQQNIADPFTKALTAKVFE
        + V  I +++NIAD  TK L  K F+
Subjt:  VVVTQISSQQNIADPFTKALTAKVFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-13027.17Show/hide
Query:  ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQ
        IL ++S  +        TA +I ++LR ++  P +                       HV  L  +  F+   + G P++   QV  +LE+LP  + P  
Subjt:  ILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQ

Query:  TNASLNKIEFNLT----TLLNELQRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKGAEKGKCYHC
           +      +LT     L+N   +   L   +   + ANV       T + +  RG +       ++ N+ Q    G     +Q K       G+C  C
Subjt:  TNASLNKIEFNLT----TLLNELQRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKGAEKGKCYHC

Query:  GQNGHWLRNCPKYLAEKKAEKETQGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKL
           GH  + CP+    +    + Q             L V +    Y    W+LDSGAT+HI   F  N S+ +   G   + +  G  +  +  G   L
Subjt:  GQNGHWLRNCPKYLAEKKAEKETQGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKL

Query:  FFRDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAY
            R + L  VLYVP + +NLIS+      + +  + +  SF++ +      G+ +     +D LY+        + +++      +   +   SS   
Subjt:  FFRDRYVILKNVLYVPQMKRNLISI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAY

Query:  LWHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHK
         WH RLGH +L  +  ++ +  L  L   + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  K
Subjt:  LWHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHK

Query:  SNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILN
        S   + F  +K+ VEN     I  L SD GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N
Subjt:  SNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILN

Query:  NVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------
         +P+  +  ++P++   G+  +    +++GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E                
Subjt:  NVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------

Query:  ------------------------------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREP
                                       H+ D  PR     S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P
Subjt:  ------------------------------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREP

Query:  RRSGRVVHQPDRYLGLIETQVVIP----------------------------------------------------DDGI----------------EDPL
          +    + P++   L ++ +  P                                                     DGI                 +P 
Subjt:  RRSGRVVHQPDRYLGLIETQVVIP----------------------------------------------------DDGI----------------EDPL

Query:  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        T  QAMKD   D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A
Subjt:  TYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA

Query:  TFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI
            + I Q+DV  AFL G L + +YMSQP GF+++D+   VC+L+K+IYGL+   R+W +   T + + GF  ++ +  ++       I ++++YVDDI
Subjt:  TFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDI

Query:  LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYA
        L+ GND   L      L+ +F +K+  D  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y 
Subjt:  LLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYA

Query:  SAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIK
          VGSL Y +  TRPD+ Y+V  +S+Y   P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S K
Subjt:  SAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIK

Query:  QICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFT
        Q  +V S+ EAEY +    + E  W    LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +S+   +AD  T
Subjt:  QICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFT

Query:  KALTAKVFEGHLHNLGL
        K L+   F+     +G+
Subjt:  KALTAKVFEGHLHNLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.8e-7934.3Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGL+  SR W ++F   +  +GF Q+  +   + K+  ++   +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +P    +  S      +  +  D
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGRA
         +   Y   +G LMY  + TR DI ++V  +S++   P   H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD R+ST+G    L    
Subjt:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGRA

Query:  VVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE
        + W+S KQ  +  S+ EAEY A   A  E +W  +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.3e-0434.72Show/hide
Query:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG
        TRPD+ ++V  +S++ S        AV  +L Y++ T    L Y  T DL L  + DSD+ +  D R+S +G
Subjt:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.0e-0431.71Show/hide
Query:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-1931.78Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQ
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQ

Query:  EVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTL
        +  D R+I     VG+L Y  L TRPDI Y+V +V +    P    +  +K +L+Y++ T  + + ++    L +  + DSD+      R+ST+G    L
Subjt:  EVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRSTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTL

Query:  NGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVW
            + W + +Q  +  S+ E EY A    A E  W
Subjt:  NGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.8e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACAGCCTCGAAAGAAACTACCCAGCTCAGAACAATGTCAAAGGTCTCAGAAGTTCGGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTCTTGCCAGCATGTC
TGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGACACGAGGCAGTCA
AATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGGTCCCATCGAAGAG
GTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAACCCTTCTGAATGA
ACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCTAAAACCAAAGCTG
GACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGGTGCAGAAAAAGGTAAGTGTTACCATTGTGGCCAA
AACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGTGGAATA
TGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGATCACTCTCAAGG
TTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACCTCAAATGAAAAGA
AATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTTCTGCTATACTTGA
AGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCT
ATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCT
TGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGC
TCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATA
AGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATC
CAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTC
AGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAA
GTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAA
TCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACT
AGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAG
AGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAA
CAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGT
AAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAG
TAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACA
GCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGG
ATTACAAGCATGTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATT
CCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAA
GATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATA
TAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTC
CCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGAT
CACTGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGTACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTT
CCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAAGAGCAGTAGTTTGGAGAAGCATAAAGCAAATTTGTATAGTTGACTCCACAA
TGGAAGCTGAATACGTAGCGACTTGTGAAGCAGCCAAAGAACCAGTATGGCAAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATAACTTTA
TATTGTGACAACAGTGGTGCAGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTACATCGTGG
TGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGCTGATCCTTTTACAAAAGCCCTCACGGCTAAAGTGTTTGAGGGTCACCTACACAATTTAGGTCTACGAT
GTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCACAGCCTCGAAAGAAACTACCCAGCTCAGAACAATGTCAAAGGTCTCAGAAGTTCGGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTCTTGCCAGCATGTC
TGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGACACGAGGCAGTCA
AATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGGTCCCATCGAAGAG
GTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAACCCTTCTGAATGA
ACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCTAAAACCAAAGCTG
GACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGGTGCAGAAAAAGGTAAGTGTTACCATTGTGGCCAA
AACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGTGGAATA
TGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGATCACTCTCAAGG
TTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACCTCAAATGAAAAGA
AATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTTCTGCTATACTTGA
AGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCT
ATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCT
TGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGC
TCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATA
AGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATC
CAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTC
AGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAA
GTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAA
TCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACT
AGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAG
AGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAA
CAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGT
AAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAG
TAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACA
GCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGG
ATTACAAGCATGTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATT
CCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAA
GATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATA
TAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTC
CCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGAT
CACTGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGTACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTT
CCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAAGAGCAGTAGTTTGGAGAAGCATAAAGCAAATTTGTATAGTTGACTCCACAA
TGGAAGCTGAATACGTAGCGACTTGTGAAGCAGCCAAAGAACCAGTATGGCAAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATAACTTTA
TATTGTGACAACAGTGGTGCAGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTACATCGTGG
TGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGCTGATCCTTTTACAAAAGCCCTCACGGCTAAAGTGTTTGAGGGTCACCTACACAATTTAGGTCTACGAT
GTTTGTAA
Protein sequenceShow/hide protein sequence
MPQPRKKLPSSEQCQRSQKFGIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEE
VNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQ
NGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKR
NLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCES
CLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKE
SRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYK
QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT
AFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLQACSRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMK
DLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRD
HWTAVKNILKYLRSTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGRAVVWRSIKQICIVDSTMEAEYVATCEAAKEPVWQRKFLTDLEVVPNMHLPITL
YCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIADPFTKALTAKVFEGHLHNLGLRCL