; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G11450 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G11450
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr2:11714985..11718399
RNA-Seq ExpressionCSPI02G11450
SyntenyCSPI02G11450
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0075.24Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEEC Q PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMD LR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI + NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDL---ETCLVEYGN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKK KGK P  +K KK  +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDL   ETCLVE   
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDL---ETCLVEYGN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVK+G+L +LED SLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL---------------------
        PQQNGVSERRNRTLLDMVRSMMS++Q+PDSFWGYALE A++IL NVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVL                     
Subjt:  PQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL---------------------

Query:  --------------------------------------------------------VVDKTRKSGQSHPSQQLREPRRSGRVVHKPDRYLGLIETQVVIP
                                                                VVDK   S QSH SQ+LR PRRSGRVVH+P+RYLGL+ETQ++IP
Subjt:  --------------------------------------------------------VVDKTRKSGQSHPSQQLREPRRSGRVVHKPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+D                              GYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTPHLHLM
        +LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+ IHLSKEQCPKTP    +
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTPHLHLM

Query:  EFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        E   N  + + +G   +  L   C R  +C+
Subjt:  EFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-1784.62Show/hide
Query:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS
        HLHLM+ +YNNSFQ TI M PFEALYGKCCRS VCWDEVGEQRLM PELVQS
Subjt:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-1784.62Show/hide
Query:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS
        HLHLM+ +YNNSFQ TI M PFEALYGKCCRS VCWDEVGEQRLM PELVQS
Subjt:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

A0A5A7TWB9 Gag/pol protein0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

A0A5D3CPJ6 Gag/pol protein0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

A0A5D3CSZ6 Gag/pol protein0.0e+0060.91Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EEC Q PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMD L+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I + +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK+ G+  K N        K    KG C+HC Q GHW RNCPKYLAEKK  K  QGKYD   LET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKL-NAQIKKKRKGKTPKQNKG------KKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD---LET

Query:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE  +  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVKNGLL  LE+ SLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ +P+SFWGYA++ AVYIL  VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL            
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL------------

Query:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK
                                                                      VV++    TR      S ++H  Q LREPRRSGRV + 
Subjt:  --------------------------------------------------------------VVDK----TR-----KSGQSHPSQQLREPRRSGRVVHK

Query:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV
        P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+                               GYTQ EGV
Subjt:  PDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN------------------------------DGYTQREGV

Query:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE
        DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDE
Subjt:  DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDE

Query:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH
        PCVYK+++N  +AFLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYIDK++ +Y MQNSK+GLLP+R+ + 
Subjt:  PCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIH

Query:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        LSKEQCPKTP    +E   +  + + +G   +  L   C R  +C+
Subjt:  LSKEQCPKTPHLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

A0A5D3CSZ6 Gag/pol protein4.3e-1784.62Show/hide
Query:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS
        HLHLM+ +YNNSFQ TI M PFEALYGKCCRS VCWDEVGEQRLM PELVQS
Subjt:  HLHLMEFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQS

E2GK51 Gag/pol protein (Fragment)0.0e+0075.24Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEEC Q PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMD LR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI + NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDL---ETCLVEYGN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKK KGK P  +K KK  +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDL   ETCLVE   
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDL---ETCLVEYGN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVK+G+L +LED SLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL---------------------
        PQQNGVSERRNRTLLDMVRSMMS++Q+PDSFWGYALE A++IL NVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVL                     
Subjt:  PQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVL---------------------

Query:  --------------------------------------------------------VVDKTRKSGQSHPSQQLREPRRSGRVVHKPDRYLGLIETQVVIP
                                                                VVDK   S QSH SQ+LR PRRSGRVVH+P+RYLGL+ETQ++IP
Subjt:  --------------------------------------------------------VVDKTRKSGQSHPSQQLREPRRSGRVVHKPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+D                              GYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VNS++AFL
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTPHLHLM
        +LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+ IHLSKEQCPKTP    +
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTPHLHLM

Query:  EFAYNNSFQTTIGMTPFEALYGKCCRSSVCW
        E   N  + + +G   +  L   C R  +C+
Subjt:  EFAYNNSFQTTIGMTPFEALYGKCCRSSVCW

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.8e-8425.02Show/hide
Query:  NGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRGMFGQPEWSLRHEAV
        +G+ YA WK  +  +L   D+  V+                 +  D W KA   A+  I+  +SD       S  TA++I++ L  ++ +   + +    
Subjt:  NGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRGMFGQPEWSLRHEAV

Query:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKR
        K + + ++    S+    H+ D ++   +A   G  I++++++S +L +LP  +    T A     E NLT     L   +N  + +  +++ +   T +
Subjt:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKR

Query:  KFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKY---LAEKKAEKE----TQGKYDLETCLVEYG------NCT
        K +           +     + K R  K  K  KG     K KC+HCG+ GH  ++C  Y   L  K  E E    T   + +   + E        NC 
Subjt:  KFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKY---LAEKKAEKE----TQGKYDLETCLVEYG------NCT

Query:  WILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKG
        ++LDSGA++H+     + S +    E    LK+     GE + A+  G ++L   D  + L++VL+  +   NL+S+  + E    I F+ +   +   G
Subjt:  WILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKG

Query:  ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHIN------LNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFT
        ++V               + + +LN       +  +   K  +N  LWH R GHI+      + R        LL  LE  S   CE CL GK  +  F 
Subjt:  ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHIN------LNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFT

Query:  GKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIEN
         K L+     K PL +VHSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R + ++ 
Subjt:  GKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIEN

Query:  GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHV--------
        GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  SFWG A+  A Y++  +PS+++   S+TPYE+W  +K  L+H R++G   +V        
Subjt:  GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHV--------

Query:  ---------------------------------LVVDKT------------------------------RKSGQ------------------SHPSQQLR
                                         +VVD+T                              RK  Q                  S  S+   
Subjt:  ---------------------------------LVVDKT------------------------------RKSGQ------------------SHPSQQLR

Query:  EPRRSGRVV---------------------------------HKPDRYLG------------LIETQVVIPDDGIEDP---------------------L
         P  S +++                                  K D +L               ET   + + GI++P                     +
Subjt:  EPRRSGRVV---------------------------------HKPDRYLG------------LIETQVVIPDDGIEDP---------------------L

Query:  TYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQRE
        +Y +                 DV           D+  W +A++ E+ +   N+ WT+  +P +                              G+TQ+ 
Subjt:  TYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNV
         +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R W   F+ A+K   F  + 
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNV

Query:  DEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP-
         + C+Y   K  +N  I +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N      P 
Subjt:  DEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP-

Query:  ---YRYEIHLSKEQC
             YE+  S E C
Subjt:  ---YRYEIHLSKEQC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-12229.05Show/hide
Query:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRGMFGQPEWSLRH
        K NGDN ++ W+  +  +L+   L  VL  + ++     A        + W   +E+A   I   +SD +        TA+ I   L  ++     + + 
Subjt:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRGMFGQPEWSLRH

Query:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN
           K +Y   M EGT+   H+             G  I++ ++   +L SLP S+    T     K    L       LLNE  R +    G+    E  
Subjt:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN

Query:  VATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQG-KYDLETCLVEYGN--------
             R + R S++  ++G     A+ K K + K+  +N          CY+C Q GH+ R+CP     +K + ET G K D  T  +   N        
Subjt:  VATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQG-KYDLETCLVEYGN--------

Query:  ------------CTWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRI
                      W++D+ A++H   +      + +   G+  T+K+G       + +GD+ +       ++LK+V +VP ++ NLIS   +    Y  
Subjt:  ------------CTWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRI

Query:  SFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEG
        S+  N+ +   KG +++   +    LY+          N EI +  E    + ++S +  LWH R+GH++   +  L K  L+   +  ++ PC+ CL G
Subjt:  SFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEG

Query:  KMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY
        K  + SF     R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y
Subjt:  KMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY

Query:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVVD
           +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++P SFWG A++ A Y++   PS  ++ E P  +W  ++ S  H +++GC   AHV    
Subjt:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVVD

Query:  KTRKSGQSHP-----------SQQLREP-----RRSGRVVHKPDRYL-----------GLIETQVVIP--------------------------------
        +T+   +S P             +L +P      RS  VV +                G+I   V IP                                
Subjt:  KTRKSGQSHP-----------SQQLREP-----RRSGRVVHKPDRYL-----------GLIETQVVIP--------------------------------

Query:  -DDGIE----------------------------------------DPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN-------------
         D+G+E                                        +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P              
Subjt:  -DDGIE----------------------------------------DPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPN-------------

Query:  -----------------DGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGL
                          G+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL KS+YGL
Subjt:  -----------------DGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGL

Query:  KQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA
        KQA R W ++FD+ +KS  + +   +PCVY K+   +    L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ 
Subjt:  KQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA

Query:  SYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKT
         YI+++L R+ M+N+K    P    + LSK+ CP T
Subjt:  SYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKT

Q12491 Transposon Ty2-B Gag-Pol polyprotein5.1e-2325.07Show/hide
Query:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC
        ++DSGA+  +  S   +         EI +     + +  +A+G+L   F++        L+ P +  +L+S+S +  Q     F  N       G ++ 
Subjt:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC

Query:  SAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQKVSSNAY-LWHLRLGHINLNRIGRLVKNGLLGRLEDKSLP-------PCESCLEGKMTKRSFTG
          +   + Y L      +++ + I + T    NK + V+   Y L H  LGH N   I + +K   +  L++  +         C  CL GK TK     
Subjt:  SAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQKVSSNAY-LWHLRLGHINLNRIGRLVKNGLLGRLEDKSLP-------PCESCLEGKMTKRSFTG

Query:  KGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI
        KG R K      P + +H+D+ GP++   +    YFISF D+ +R+  +Y +H   + + L  F    A ++N+    + +++ DRG EY +     +  
Subjt:  KGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI

Query:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYI
          GI +  +  +  + +GV+ER NRTLL+  R+++  S +P+  W  A+E +  I
Subjt:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-7023.05Show/hide
Query:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        N+SI+ +  S   KL   NY  W   ++ +    +L   L       PA+     A R + D Y RW + ++     +L ++S  +        TA +I 
Subjt:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DLLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ
        + LR ++  P +   H        K+  +GT ++ +++  ++  F+   + G P+    QV  +LE+LP+ + P     +       LT     LLN   
Subjt:  DLLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ

Query:  RFQNLTMGKGKQVEANVA----TTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKT---PKQNKGKKVVEKGKCYHCGQNGHWLRNCPK---YLAEKKAEK
        +   ++      + AN      TT        +   +      N   K  ++  T   P  N+ K  +  GKC  CG  GH  + C +   +L+   +++
Subjt:  RFQNLTMGKGKQVEANVA----TTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKT---PKQNKGKKVVEKGKCYHCGQNGHWLRNCPK---YLAEKKAEK

Query:  ET------QGKYDLETCLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI-
                Q + +L      Y +  W+LDSGAT+HI   F   S  +  + G+  + V  G  +  S  G   L  + R + L N+LYVP + +NLIS+ 
Subjt:  ET------QGKYDLETCLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI-

Query:  -----SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRL
             + +  + +  SF++ +      G+ +     +D LY+        + +++      + + +   SS    WH RLGH   + +  ++ N  L  L
Subjt:  -----SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRL

Query:  E-DKSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILR
                C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I    
Subjt:  E-DKSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILR

Query:  SDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHF
        SD GGE++ L   +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +P ++W YA  +AVY++  +P+  +  E+P++   G   +    
Subjt:  SDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHF

Query:  RIWGC-----------------------------------------------------------------------------------------------
        R++GC                                                                                               
Subjt:  RIWGC-----------------------------------------------------------------------------------------------

Query:  ----PAHVL--------------------------------------------VVDKTRKSGQSHPS-----------------QQLREPRRSGR-----
            P H                                                  T+   Q+H S                 Q L  P +S       
Subjt:  ----PAHVL--------------------------------------------VVDKTRKSGQSHPS-----------------QQLREPRRSGR-----

Query:  ----------------VVHKPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVW
                        ++H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  W
Subjt:  ----------------VVHKPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVW

Query:  TLVDQPND-------------------------------GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPE
         LV  P                                 GY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP 
Subjt:  TLVDQPND-------------------------------GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPE

Query:  GFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQY
        GFI++D+   VCKL+K++YGLKQA R+W +     + + GF  +V +  ++       I ++++YVDDIL+ GND   L +    L+ +F +KD  +  Y
Subjt:  GFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQY

Query:  VLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP
         LGI+    R    L +SQ  YI  +L+R  M  +K    P
Subjt:  VLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.0e-6922.46Show/hide
Query:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        +N++I+ +  S   KL   NY  W   ++ +    +L   L       PA   ++A       Y RW + ++     IL ++S  +        TA +I 
Subjt:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DLLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL
        + LR ++  P +                       HV  L  +  F+   + G P+    QV  +LE+LP  + P     +      +LT     L+N  
Subjt:  DLLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL

Query:  QRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEK------GKCYHCGQNGHWLRNCPKYLAEKKAEKETQ
         +   L   +   + ANV T +      + ++   G ++ N      R       + G +   +      G+C  C   GH  + CP+    +    + Q
Subjt:  QRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEK------GKCYHCGQNGHWLRNCPKYLAEKKAEKETQ

Query:  GKYDLE--------TCLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI--
                           Y    W+LDSGAT+HI   F  N S+ +   G   + +  G  +  +  G   L    R + L  VLYVP + +NLIS+  
Subjt:  GKYDLE--------TCLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI--

Query:  ----SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLE
            + +  + +  SF++ +      G+ +     +D LY+        + +++      +   +   SS    WH RLGH +L  +  ++ N  L  L 
Subjt:  ----SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKNGLLGRLE

Query:  -DKSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRS
            L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I  L S
Subjt:  -DKSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRS

Query:  DRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFR
        D GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +P ++W YA  +AVY++  +P+  +  ++P++   G+  +    +
Subjt:  DRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFR

Query:  IWGC-------------------------------------------------------------------------------PAHVLV-----------
        ++GC                                                                               P+H  +           
Subjt:  IWGC-------------------------------------------------------------------------------PAHVLV-----------

Query:  ------------------VDKTRKSGQSHPSQQLREPRRSGRVV-----------------------------------HKPDRYLGLIETQVVIP----
                          +  T+ S  + PS  +  P  S                                       + P++   L ++ +  P    
Subjt:  ------------------VDKTRKSGQSHPSQQLREPRRSGRVV-----------------------------------HKPDRYLGLIETQVVIP----

Query:  ------------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFN
                                                         DGI                 +P T  QAMKD   D+W +AM  E+ +   N
Subjt:  ------------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFN

Query:  SVWTLVDQP-------------------------------NDGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMS
          W LV  P                                 GY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMS
Subjt:  SVWTLVDQP-------------------------------NDGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMS

Query:  QPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGD
        QP GF+++D+   VC+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++       I ++++YVDDIL+ GND   L      L+ +F +K+  D
Subjt:  QPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGD

Query:  AQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTP
          Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P
Subjt:  AQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-3631.18Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P +                              GYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPND------------------------------GYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF Q+  +   + K+  ++   +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +P
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.1e-0431.88Show/hide
Query:  NRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHV
        NRT+++ VRSM+    +P +F   A   AV+I+   PS +++   P E+W     +  + R +GC A++
Subjt:  NRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.9e-0537.08Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLS
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K    P   +++ S
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCGTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTTATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATTGCTGAAGTAAATGGTGG
TCCCATCAAACAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACTTAATGCTCAAATAAAAAAGAAGAGAAAGGGAAAAACTCCCAAACAGAACAAGGGTAAGAAAGTTGTAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCATTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTGGAAACATGTTTAGTGG
AATATGGAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGATCACTCTC
AAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACCTCAAATGAA
AAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTTCTGCTATAC
TTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAAT
GCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAATGGGCTTCTAGGTCGGTTAGAAGATAAATCTTTACCTCCTTGTGA
ATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCA
AAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAA
TATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGG
AATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGA
TGCCAGATTCTTTTTGGGGATATGCTTTAGAAATAGCTGTTTATATTTTGATTAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAA
GGCAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCG
ACGTAGTGGGAGGGTTGTTCATAAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAA
TGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGGTTATACC
CAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGAT
GGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAA
AATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAA
AAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCA
ATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCTATGTCTCAAGCATCTTACATAGACAAAA
TGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTACCGTACAGATATGAAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCACTTGCATTTGATG
GAGTTTGCATATAATAATAGCTTTCAGACTACTATTGGCATGACACCATTTGAGGCTTTGTATGGTAAATGTTGTAGATCTTCTGTTTGTTGGGATGAGGTTGGTGAACA
AAGATTGATGGGTCCTGAGTTGGTTCAATCCACAAACGAGGCTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCGTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTTATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATTGCTGAAGTAAATGGTGG
TCCCATCAAACAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACTTAATGCTCAAATAAAAAAGAAGAGAAAGGGAAAAACTCCCAAACAGAACAAGGGTAAGAAAGTTGTAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCATTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTGGAAACATGTTTAGTGG
AATATGGAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGATCACTCTC
AAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACCTCAAATGAA
AAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTTCTGCTATAC
TTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAAT
GCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAATGGGCTTCTAGGTCGGTTAGAAGATAAATCTTTACCTCCTTGTGA
ATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCA
AAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAA
TATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGG
AATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGA
TGCCAGATTCTTTTTGGGGATATGCTTTAGAAATAGCTGTTTATATTTTGATTAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAA
GGCAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCG
ACGTAGTGGGAGGGTTGTTCATAAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAA
TGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGGTTATACC
CAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGAT
GGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAA
AATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAA
AAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCA
ATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCTATGTCTCAAGCATCTTACATAGACAAAA
TGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTACCGTACAGATATGAAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCACTTGCATTTGATG
GAGTTTGCATATAATAATAGCTTTCAGACTACTATTGGCATGACACCATTTGAGGCTTTGTATGGTAAATGTTGTAGATCTTCTGTTTGTTGGGATGAGGTTGGTGAACA
AAGATTGATGGGTCCTGAGTTGGTTCAATCCACAAACGAGGCTATTTAG
Protein sequenceShow/hide protein sequence
MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECRQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDLLRGMFGQPEWSLR
HEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIKQVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSS
KTKAGPSKLNAQIKKKRKGKTPKQNKGKKVVEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLETCLVEYGNCTWILDSGATNHICFSFQENSSWKKLSEGEITL
KVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSN
AYLWHLRLGHINLNRIGRLVKNGLLGRLEDKSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKE
YKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMPDSFWGYALEIAVYILINVPSKSVSETPYELWKGRK
GSLRHFRIWGCPAHVLVVDKTRKSGQSHPSQQLREPRRSGRVVHKPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDGYT
QREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK
KVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYEIHLSKEQCPKTPHLHLM
EFAYNNSFQTTIGMTPFEALYGKCCRSSVCWDEVGEQRLMGPELVQSTNEAI