; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G12820 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G12820
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr2:13021966..13024718
RNA-Seq ExpressionCSPI02G12820
SyntenyCSPI02G12820
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0084.34Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSM+S++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVHQPDRYLGLIET-----
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR P+RSGRVVHQP+RYLGL+ET     
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVHQPDRYLGLIET-----

Query:  -----------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
                   QAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  -----------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYE
        RILLSIATFY+YE
Subjt:  RILLSIATFYDYE

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

A0A5A7TWB9 Gag/pol protein0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

A0A5D3CPJ6 Gag/pol protein0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

A0A5D3CSZ6 Gag/pol protein0.0e+0066.74Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET

Query:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA
        CLVE ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA AVG L+L  +  +++L+NV  VP +KRNLIS+ C+LEQ Y ++F +N+ 
Subjt:  CLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEA

Query:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS
        F++  G+ +CSA LE+NLY LR   +  +LNTE+F+TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR 
Subjt:  FVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRS

Query:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI
        FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Subjt:  FTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGI

Query:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
         SQLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSK
Subjt:  QSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

Query:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH
        LC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S+++H  Q LREP+RSGRV +
Subjt:  LCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVH

Query:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG
         P RY+ L ET                +AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG
Subjt:  QPDRYLGLIET----------------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYE
        VDYEETFSPVAMLKSIRILLSIA ++DYE
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYE

E2GK51 Gag/pol protein (Fragment)0.0e+0084.34Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCLVE + 
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYEN

Query:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI
         TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA AVGDL LFF+DRY+ILK+VLYVP MKRNLISI+CILE +Y ISFE+NE F+  KGI
Subjt:  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGI

Query:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
         +CSAI E+NLYKLRPTRAN VLNTE+FRT ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  LVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSM+S++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVHQPDRYLGLIET-----
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR P+RSGRVVHQP+RYLGL+ET     
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLREPQRSGRVVHQPDRYLGLIET-----

Query:  -----------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
                   QAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  -----------QAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYE
        RILLSIATFY+YE
Subjt:  RILLSIATFYDYE

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.8e-7524.47Show/hide
Query:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV
        +G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+  +SD       S  TA++I+++L  ++ +   + +    
Subjt:  NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAV

Query:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANV
        K + + ++    S+    H+ D ++   +A   G  IEE++++S +L +LP  +       +T +  N  + F    LL++  + +N      K+V   +
Subjt:  KYIYTKRMKEGTSVRE--HVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSF----IPFQTNASLN-KIEFNLTTLLNELQRFQNLTMGKGKQVEANV

Query:  ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-
                + +  K +   +KP    K   K K              KC+HCG+ GH  ++C  Y           EK+ +  T      +V E      
Subjt:  ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLA---------EKKAEKETQGKYDLLVVETCLVE-

Query:  YENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAF
         +NC ++LDSGA++H+     + S +    E    LK+     GE + A+  G ++L   D  + L++VL+  +   NL+S+  + E    I F+ +   
Subjt:  YENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGT---GEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAF

Query:  VFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTK
        +   G++V               + + +LN       +  +   K  +N  LWH R GHI+  ++  + +  + S        + S   CE CL GK  +
Subjt:  VFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTK

Query:  RSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY
          F  K L+     K PL +VHSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R +
Subjt:  RSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDY

Query:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQN
         ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N
Subjt:  LIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQN

Query:  PK-KLEHRSKLCFFIGY-------------------------------------------PKESRGGLFYDPQENKIFVST----------NATFLEE--
         + K + +S    F+GY                                            KES    F  P +++  + T          N  FL++  
Subjt:  PK-KLEHRSKLCFFIGY-------------------------------------------PKESRGGLFYDPQENKIFVST----------NATFLEE--

Query:  -----------------------------DHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLRE---------------PQRSGRV
                                       ++D +  +K  L E  K   D   + +K      +S +S  ++ L+E                +RS R+
Subjt:  -----------------------------DHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPSQQLRE---------------PQRSGRV

Query:  VHQP-------DRYLGLIETQA---MKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
          +P       D  L  +   A     DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+
Subjt:  VHQP-------DRYLGLIETQA---MKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYD
        G+TQ+  +DYEETF+PVA + S R +LS+   Y+
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-10029.45Show/hide
Query:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH
        K NGDN ++ W+  +  +L+   L  VL  +  +     A        + W   +E+A   I   +SD +        TA+ I   L  ++     + + 
Subjt:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH

Query:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN
           K +Y   M EGT+   H+             G  IEE ++   +L SLP S+    T     K    L       LLNE  R +    G+    E  
Subjt:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQNLTMGKGKQVEAN

Query:  VATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLV
             R + R S++  ++G     A+ K K + K+  +N          CY+C Q GH+ R+CP     +K + ET G+ +      +V     V   + 
Subjt:  VATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLV

Query:  EYENC--------TWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRIS
        E E C         W++D+ A++H   +   +   + ++    T+K+G       + +GD+ +       ++LK+V +VP ++ NLIS   +    Y  S
Subjt:  EYENC--------TWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFR-DRYVILKNVLYVPQMKRNLISISCILEQMYRIS

Query:  FEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGK
        +  N+ +   KG +++   +    LY+          N EI +  E    + ++S +  LWH R+GH++   +  L K  L+S  +  ++ PC+ CL GK
Subjt:  FEINEAFVFYKG-ILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGK

Query:  MTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
          + SF     R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y 
Subjt:  MTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNP
          +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  +  
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNP

Query:  KKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSST--------------------KVVDKTRK
         KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S+ V   I  + +  PS+S                     +V+++  +
Subjt:  KKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSST--------------------KVVDKTRK

Query:  SSQ-----SHPSQ---QLREPQRSGRVVHQPDRYLG----LI----ETQAMKDV----DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKR
          +      HP+Q   Q +  +RS R   +  RY      LI    E +++K+V    +++Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K 
Subjt:  SSQ-----SHPSQ---QLREPQRSGRVVHQPDRYLG----LI----ETQAMKDV----DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKR

Query:  KRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE
        K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E
Subjt:  KRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE

Q12491 Transposon Ty2-B Gag-Pol polyprotein7.2e-2624.69Show/hide
Query:  VSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQK
        +  +A+G+L   F++        L+ P +  +L+S+S +  Q     F  N       G ++   +   + Y L      +++ + I + T    NK + 
Subjt:  VSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFR-TAETQNKRQK

Query:  VSSNAY-LWHLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTKRSFTGKGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFI
        V+   Y L H  LGH N   I + +K   ++ L+++ +         C  CL GK TK     KG R K      P + +H+D+ GP++   +    YFI
Subjt:  VSSNAY-LWHLRLGHINLNRIGRLVKSGLLSPLEDNSLP-------PCESCLEGKMTKRSFTGKGLRAK-----GPLELVHSDLCGPMNVKARGGYEYFI

Query:  SFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISF
        SF D+ +R+  +Y +H   + + L  F    A ++N+    + +++ DRG EY +     +    GI +  +  +  + +GV+ER NRTLL+  R+++  
Subjt:  SFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISF

Query:  SQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKG-SLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNAT
        S + +  W  A+E +  I N++ S    ++  +   G  G  +     +G P  V   NP    H   +  +  +P + S G + Y P   K   +TN  
Subjt:  SQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKG-SLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNAT

Query:  FLEED
         L+++
Subjt:  FLEED

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.3e-6322.71Show/hide
Query:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        N+SI+ +  S   KL   NY  W   ++ +    +L   L       PA+     A R + D Y RW + ++     +L ++S  +        TA +I 
Subjt:  NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ
        ++LR ++  P +   H        K+  +GT ++ +++  ++  F+   + G P++   QV  +LE+LP+ + P     +       LT     LLN   
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNELQ

Query:  RFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPK---YLAEKKAEKET
        +   ++      + AN       TT      G+ +      +  N     +        N  +     GKC  CG  GH  + C +   +L+   +++  
Subjt:  RFQNLTMGKGKQVEANV-----ATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPK---YLAEKKAEKET

Query:  ------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI
              Q + +L +       Y +  W+LDSGAT+HI   F   S  +  + G++ + V  G  +  S  G   L  + R + L N+LYVP + +NLIS+
Subjt:  ------QGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISI

Query:  ------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSP
              + +  + +  SF++ +      G+ +     +D LY+        + +++      + + +   SS    WH RLGH   + +  ++ +  LS 
Subjt:  ------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSP

Query:  LE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKIL
        L   +    C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I   
Subjt:  LE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKIL

Query:  RSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRH
         SD GGE++ L   +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N +P+  +  E+P++   G   +   
Subjt:  RSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRH

Query:  FRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL---------------
         R++GC  +  ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++ +  S  V                
Subjt:  FRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL---------------

Query:  ----------------------KEISKSAID-------------------------KPSSSTKVVDKTRKSSQSHP--------SQQLREPQRSGR----
                               ++S S +D                         +P+ +      ++ +SQ++P        +Q L  P +S      
Subjt:  ----------------------KEISKSAID-------------------------KPSSSTKVVDKTRKSSQSHP--------SQQLREPQRSGR----

Query:  -----------------VVHQPDRYLGLIET------------------------------------------QAMKDVDRDQWIKAMDLEMESMYFNSV
                         ++H P     ++                                            QA+KD   ++W  AM  E+ +   N  
Subjt:  -----------------VVHQPDRYLGLIET------------------------------------------QAMKDVDRDQWIKAMDLEMESMYFNSV

Query:  WTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A
Subjt:  WTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-6123.96Show/hide
Query:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM
        +N++I+ +  S   KL   NY  W   ++ +    +L   L    P  PA   ++A       Y RW + ++     IL ++S  +        TA +I 
Subjt:  MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM

Query:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL
        ++LR ++  P +                       HV  L  +  F+   + G P++   QV  +LE+LP  + P     +      +LT     L+N  
Subjt:  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLT----TLLNEL

Query:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKET
         +   L   +   + ANV       T + +  RG +       ++ N+ Q    G     +Q K       G+C  C   GH  + CP+    +    + 
Subjt:  QRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTKAGPSKPNA-QIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKET

Query:  QGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLI
        Q             L V +    Y    W+LDSGAT+HI   F  N S+ +   G + + +  G  +  +  G   L    R + L  VLYVP + +NLI
Subjt:  QGKYDL--------LVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLI

Query:  SI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLL
        S+      + +  + +  SF++ +      G+ +     +D LY+        + +++      +   +   SS    WH RLGH +L  +  ++ +  L
Subjt:  SI------SCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLL

Query:  SPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIK
          L   + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I 
Subjt:  SPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIK

Query:  ILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSL
         L SD GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N +P+  +  ++P++   G+  + 
Subjt:  ILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSL

Query:  RHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------
           +++GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E                                       
Subjt:  RHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------

Query:  -------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSS------------TKVVDKTRKS---------------SQSHPSQQLREPQRSGRV
                H+ D  PR     S L   ++S      S+I  PSSS            T    +T+ S               S + P+Q    PQ     
Subjt:  -------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSS------------TKVVDKTRKS---------------SQSHPSQQLREPQRSGRV

Query:  VHQPDRYLGLIE-------------------------------------------------------------------TQAMKDVDRDQWIKAMDLEME
         H P     + E                                                                    QAMKD   D+W +AM  E+ 
Subjt:  VHQPDRYLGLIE-------------------------------------------------------------------TQAMKDVDRDQWIKAMDLEME

Query:  SMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A
Subjt:  SMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-2148.35Show/hide
Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDY
        W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDY

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.6e-0431.71Show/hide
Query:  NRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.6e-1541.86Show/hide
Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGCTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACAT
GTTTAGTGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAG
AACACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACC
TCAAATGAAAAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTT
CTGCTATACTTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTT
TCTTCTAATGCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTAGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACC
TCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAA
TGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAG
TTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAAT
AGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATAAGTT
TTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAA
GGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGG
TTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAAC
CTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAAGTAGTTGATAAGACTAGGAAATCTAGTCAATCACATCCTTCT
CAACAGTTGAGAGAGCCTCAACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGCAATGAAAGATGTAGATCGTGACCAATGGAT
CAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAA
AACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATG
TTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAAATGGGCTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGCTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACAT
GTTTAGTGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAG
AACACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTTAGAGATAGATATGTCATACTTAAGAATGTCTTATATGTACC
TCAAATGAAAAGAAATTTAATATCTATCTCTTGTATTTTGGAACAAATGTATAGAATATCTTTTGAAATTAATGAAGCGTTCGTTTTCTATAAAGGTATTCTAGTTTGTT
CTGCTATACTTGAAGACAACTTATATAAGTTAAGACCAACTAGAGCAAATTTTGTCTTAAATACTGAAATATTCAGAACAGCTGAAACTCAGAATAAAAGACAAAAAGTT
TCTTCTAATGCCTATTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTAGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACC
TCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAA
TGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAG
TTCAAAGAATATAAGGCTGAAGTAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAAT
AGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATAAGTT
TTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAA
GGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGG
TTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAAC
CTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAAGTAGTTGATAAGACTAGGAAATCTAGTCAATCACATCCTTCT
CAACAGTTGAGAGAGCCTCAACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGCAATGAAAGATGTAGATCGTGACCAATGGAT
CAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAA
AACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATG
TTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAAATGGGCTTTCTTGA
Protein sequenceShow/hide protein sequence
MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLR
HEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSS
KTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLVEYENCTWILDSGATNHICFSFQENSSWKKLSEGE
NTLKVGTGEMVSASAVGDLKLFFRDRYVILKNVLYVPQMKRNLISISCILEQMYRISFEINEAFVFYKGILVCSAILEDNLYKLRPTRANFVLNTEIFRTAETQNKRQKV
SSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEK
FKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWK
GRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSSQSHPS
QQLREPQRSGRVVHQPDRYLGLIETQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM
LKSIRILLSIATFYDYENGLS