; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G15820 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G15820
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr6:14164307..14167264
RNA-Seq ExpressionCSPI06G15820
SyntenyCSPI06G15820
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0086.28Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IP
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRS  IRFD +IKSY
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

A0A5A7TWB9 Gag/pol protein0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

A0A5D3CPJ6 Gag/pol protein0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

A0A5D3CSZ6 Gag/pol protein0.0e+0068.88Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE+T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSMMS++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVH

Query:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRS  IRFD +IKSY
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

E2GK51 Gag/pol protein (Fragment)0.0e+0086.28Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGEITLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IP
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRS  IRFD +IKSY
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSY

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-9325.96Show/hide
Query:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV
        +G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+  +SD       S  TA++I+++L  ++ +   + +    
Subjt:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV

Query:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANV
        K + + ++    S+    H+ D ++   +A   G  IEE++++S +L +LP  +       +T +  N  + F    LL++  + +N      K+V   +
Subjt:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANV

Query:  ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-
                + +  K +   +KP    K   K K              KC+HCG+ GH  ++C  Y           EK+ +  T      +V E      
Subjt:  ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-

Query:  YENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAF
         +NC ++LDSGA++H+     + S +    E    LK+     GE + A+  G ++L   D  + L++VL+  +   NL+S+  + E    I F+ +   
Subjt:  YENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAF

Query:  VFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTK
        +   G++V               + + +LN       +  +   K  +N  LWH R GHI+  ++  + +  + S        + S   CE CL GK  +
Subjt:  VFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTK

Query:  RSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY
          F  K L+     K PL +VHSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R +
Subjt:  RSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY

Query:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQN
         ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N
Subjt:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQN

Query:  PK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKT
         + K + +S    F+GY  E  G   +D    K  V+ +    E + +     + + V  + SK + +K  P+ S K++                  D  
Subjt:  PK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKT

Query:  RKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP----------------
            ++ P+                    Q L++ + S +         + D +L               ET   + + GI++P                
Subjt:  RKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP----------------

Query:  -----LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
             ++Y +                 DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+
Subjt:  -----LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIK
        G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R     F++++K
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIK

P0C2J3 Transposon Ty2-LR1 Gag-Pol polyprotein3.5e-2624.21Show/hide
Query:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC
        ++DSGA+  +  S   +         EI +     + +  +A+G+L   F++        L+ P +  +L+S+S +  Q     F  N       G ++ 
Subjt:  ILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVC

Query:  SAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQKVSSNAY-LWHLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTKRSFTG
          +   + Y L      +++ + I + T    NK + V+   Y L H  LGH N   I + +K   ++ L+++ +         C  CL GK TK     
Subjt:  SAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQKVSSNAY-LWHLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTKRSFTG

Query:  KGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI
        KG R K      P + +H+D+ GP++   +    YFISF D+ +R+  +Y +H   + + L  F    A ++N+    + +++ DRG EY +     +  
Subjt:  KGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLI

Query:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKG-SLRHFRIWGCPAHVLVQNPKKL
          GI +  +  +  + +GV+ER NRTLL+  R+++  S + +  W  A+E +  I N++ S    ++  +   G  G  +     +G P  V   NP   
Subjt:  ENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKG-SLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLEED
         H   +  +  +P + S G + Y P   K   +TN   L+++
Subjt:  EHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLEED

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-12431.51Show/hide
Query:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH
        K NGDN ++ W+  +  +L+   L  VL  +  +     A        + W   +E+A   I   +SD +        TA+ I   L  ++     + + 
Subjt:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH

Query:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN
           K +Y   M EGT+   H+             G  IEE ++   +L SLP S+    T     K    L       LLNE  R +    G+    E  
Subjt:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN

Query:  VATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLV
             R + R S++  ++G     A+ K K + K+  +N          CY+C Q GH+ R+CP     +K + ET G+ +      +V     V   + 
Subjt:  VATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLV

Query:  EYENC--------TWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRI
        E E C         W++D+ A++H   +      + +   G+  T+K+G       + +GD+ +       ++LK+V +VP ++ NLIS   +    Y  
Subjt:  EYENC--------TWILDSGATNHICFSFQENSSWKKLSEGEI-TLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRI

Query:  SFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEG
        S+  N+ +   KG +++   +    LY+          N EI +  E    + ++S +  LWH R+GH++   +  L K  L+S  +  ++ PC+ CL G
Subjt:  SFEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEG

Query:  KMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY
        K  + SF     R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y
Subjt:  KMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY

Query:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQN
           +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  + 
Subjt:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQN

Query:  PKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ----------
          KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S+ V   I        S  + P+S+    D+  + G+          
Subjt:  PKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ----------

Query:  ---------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCK
                  HP+Q   Q +  RRS R   +  RY       V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P+ CK
Subjt:  ---------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCK

Query:  WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKK
        W++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL K
Subjt:  WIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKK

Query:  SIYGLKQASRSKAIRFDRSIKS
        S+YGLKQA R   ++FD  +KS
Subjt:  SIYGLKQASRSKAIRFDRSIKS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-8224.82Show/hide
Query:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        N+SI+ +  S   KL   NY  W   ++ +    +L   L       PA+     A R + D Y RW + ++     +L ++S  +        TA +I 
Subjt:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ
        ++LR ++  P +   H        K+  +GT ++ +++  ++  F+   + G P++   QV  +LE+LP+ + P     +       LT     LLN   
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ

Query:  RFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPK---YLAEKKAEKET
        +   ++      + AN       TT      G+ +      +  N     +        N  +     GKC  CG  GH  + C +   +L+   +++  
Subjt:  RFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPK---YLAEKKAEKET

Query:  ------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI
              Q + +L +       Y +  W+LDSGAT+HI   F   S  +  + G+  + V  G  +  S  G   L  + R + L N+LYVP + +NLIS+
Subjt:  ------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI

Query:  ------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSP
              + +  + +  SF++ +      G+ +     +D LY+        + +++      + + +   SS    WH RLGH   + +  ++ +  LS 
Subjt:  ------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSP

Query:  LE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKIL
        L   +    C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I   
Subjt:  LE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKIL

Query:  RSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRH
         SD GGE++ L   +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N +P+  +  E+P++   G   +   
Subjt:  RSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRH

Query:  FRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL---------------
         R++GC  +  ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++ +  S  V                
Subjt:  FRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL---------------

Query:  ----------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR----
                               ++S S +D   SS+                     T+   Q+H S                 Q L  P +S      
Subjt:  ----------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR----

Query:  -----------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSV
                         ++H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  
Subjt:  -----------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSV

Query:  WTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQP
        W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP
Subjt:  WTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQP

Query:  EGFIEQDQEQKVCKLKKSIYGLKQASRS
         GFI++D+   VCKL+K++YGLKQA R+
Subjt:  EGFIEQDQEQKVCKLKKSIYGLKQASRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-8225.46Show/hide
Query:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        +N++I+ +  S   KL   NY  W   ++ +    +L   L    P  PA   ++A       Y RW + ++     IL ++S  +        TA +I 
Subjt:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL
        ++LR ++  P +                       HV  L  +  F+   + G P++   QV  +LE+LP  + P     +      +LT     L+N  
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL

Query:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKET
         +   L   +   + ANV       T + +  RG +       ++ N+ Q    G     +Q K       G+C  C   GH  + CP+    +    + 
Subjt:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKET

Query:  QGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLI
        Q             L V +    Y    W+LDSGAT+HI   F  N S+ +   G   + +  G  +  +  G   L    R + L  VLYVP + +NLI
Subjt:  QGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGEITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLI

Query:  SI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLL
        S+      + +  + +  SF++ +      G+ +     +D LY+        + +++      +   +   SS    WH RLGH +L  +  ++ +  L
Subjt:  SI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLL

Query:  SPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIK
          L   + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I 
Subjt:  SPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIK

Query:  ILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSL
         L SD GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N +P+  +  ++P++   G+  + 
Subjt:  ILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSL

Query:  RHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------
           +++GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E                                       
Subjt:  RHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------

Query:  -------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVI
                H+ D  PR     S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P  +    + P++   L ++ +  
Subjt:  -------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVI

Query:  P----------------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEME
        P                                                     DGI                 +P T  QAMKD   D+W +AM  E+ 
Subjt:  P----------------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEME

Query:  SMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEE
        +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L +
Subjt:  SMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEE

Query:  SIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRS
         +YMSQP GF+++D+   VC+L+K+IYGLKQA R+
Subjt:  SIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-3845.88Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSKAIRF
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR   ++F
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSKAIRF

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-0431.71Show/hide
Query:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.9e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGGTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACAT
GTTTAGTGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAG
ATCACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACC
TCAAATGAAAAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTT
CTGCTATACTTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTT
TCTTCTAATGCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACC
TCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAA
TGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAG
TTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAAT
AGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTT
TTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAA
GGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGG
TTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAAC
CTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCT
CAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCC
ATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATC
AACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACC
CAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGAT
GGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAA
AATCCATTTATGGATTAAAACAAGCTTCTAGATCCAAAGCCATAAGATTTGATCGTAGTATCAAATCTTATATTTCAGGATCTAGAAGCTTGTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTTACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGGTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACAT
GTTTAGTGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAG
ATCACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACC
TCAAATGAAAAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTT
CTGCTATACTTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTT
TCTTCTAATGCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACC
TCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAA
TGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAG
TTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAAT
AGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTT
TTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAA
GGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGG
TTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAAC
CTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCT
CAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCC
ATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATC
AACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACC
CAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGAT
GGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAA
AATCCATTTATGGATTAAAACAAGCTTCTAGATCCAAAGCCATAAGATTTGATCGTAGTATCAAATCTTATATTTCAGGATCTAGAAGCTTGTTTTAA
Protein sequenceShow/hide protein sequence
MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLR
HEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSS
KTKAGPSKPNAQIKKKGKGKTPKQNKGKKGAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGE
ITLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKV
SSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEK
FKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWK
GRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPS
QQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYT
QREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSKAIRFDRSIKSYISGSRSLF