; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G15660 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G15660
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr2:15198349..15202126
RNA-Seq ExpressionCSPI02G15660
SyntenyCSPI02G15660
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0074.69Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKGREGNTRKIVL------------
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKK  +    K  L            
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKGREGNTRKIVL------------

Query:  -------------GKSFQKARSLSRLE--------------------------QERW-------------------------------------------
                       SFQ+  S  +L+                          Q+R+                                           
Subjt:  -------------GKSFQKARSLSRLE--------------------------QERW-------------------------------------------

Query:  ------------------------SQLQQTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
                                +++ +T ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  ------------------------SQLQQTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSM+S++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IP
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VN ++AFL
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL

Query:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN---------------------PNCSK-----PYGIHLSKEQCPKTPQEVED
        +LYVDD+LLIGNDV YLTD+KKWL  QFQ +         G   +R+R                       N  K      +GIHLSKEQCPKTPQEVED
Subjt:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN---------------------PNCSK-----PYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAV
        MRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNGGAV
Subjt:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAV

Query:  VWRSIKQTCIADSTMEAEYVAACEAAKEA
        VWRS+KQTCIADSTMEAEYVAACEAAKEA
Subjt:  VWRSIKQTCIADSTMEAEYVAACEAAKEA

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DYML+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVF LNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

A0A5A7TWB9 Gag/pol protein0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DYML+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVF LNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

A0A5D3CPJ6 Gag/pol protein0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

A0A5D3CSZ6 Gag/pol protein0.0e+0063.15Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQ  + ++H+A+KYIY  RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI + LTTLLNELQ F++L   KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---
        ++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G+  K N        K  A KG C+HC Q GHW RNCPKYLAE KK ++G    +VL    
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKP-NAQIKKKGKGKTPKQNKG------KKAAEKGKCYHCGQNGHWLRNCPKYLAE-KKGREGNTRKIVLG---

Query:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------
                            SFQ   S  +LE                                                     ++ +S          
Subjt:  -------------------KSFQKARSLSRLE-----------------------------------------------------QERWS----------

Query:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT
                                      ++ +TA TQNKR K+S   NA+LWHLRLGHINLNRI RLVK+GLLS LE+NSLP CESCLEGKMTKR FT
Subjt:  ------------------------------QLQQTAETQNKRQKVS--SNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFT

Query:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS
        GKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI S
Subjt:  GKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQS

Query:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC
        QLSAP TPQQNGVSERRNRTLLDMVRSM+S++ + +SFWGYA++TA YILN VPSKSVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC
Subjt:  QLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLC

Query:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
         F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++PS+ T+VV     S ++H  Q LREPRRSGRV + P
Subjt:  FFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
         RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL +SIYGLKQASRSWNIRFDTAIKSYGF+Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEP

Query:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL
        CVYK+++N  +AFLVLYVDD+LLIGND+G LTDIK+WLA QFQ +                               +  ++    N  +      +G+ L
Subjt:  CVYKKVVNFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS-----------------------------GRCTIRSRNPNCSK-----PYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG  HWTAVK ILKYLRRT+DY L+YG+KDLILTGYTDSDFQTD+D+RK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        STSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEA
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

E2GK51 Gag/pol protein (Fragment)0.0e+0074.69Show/hide
Query:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG
        MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYDRW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR 
Subjt:  MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRG

Query:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG
        MFGQP WSLRHEA+K+IYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEFNLTTLLNELQRFQNLT+ KG
Subjt:  MFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKG

Query:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKGREGNTRKIVL------------
        K+VEANVA TKRKFIRGSSSK K GPSK  AQ+KKKGKGK P  +K KK A+KGKC+HC Q+GHW RNCPKYLAEKK  +    K  L            
Subjt:  KQVEANVATTKRKFIRGSSSKTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKGREGNTRKIVL------------

Query:  -------------GKSFQKARSLSRLE--------------------------QERW-------------------------------------------
                       SFQ+  S  +L+                          Q+R+                                           
Subjt:  -------------GKSFQKARSLSRLE--------------------------QERW-------------------------------------------

Query:  ------------------------SQLQQTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK
                                +++ +T ETQNK+QKVSSNAYLWHLRLGHINLNRI RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK
Subjt:  ------------------------SQLQQTAETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAK

Query:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST
         PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPST
Subjt:  GPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPST

Query:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK
        PQQNGVSERRNRTLLDMVRSM+S++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPK
Subjt:  PQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPK

Query:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP
        ESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IP
Subjt:  ESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIP

Query:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI
        DDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSI
Subjt:  DDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI

Query:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL
        RILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKK+VN ++AFL
Subjt:  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL

Query:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN---------------------PNCSK-----PYGIHLSKEQCPKTPQEVED
        +LYVDD+LLIGNDV YLTD+KKWL  QFQ +         G   +R+R                       N  K      +GIHLSKEQCPKTPQEVED
Subjt:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN---------------------PNCSK-----PYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAV
        MRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARKSTSGSVFTLNGGAV
Subjt:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAV

Query:  VWRSIKQTCIADSTMEAEYVAACEAAKEA
        VWRS+KQTCIADSTMEAEYVAACEAAKEA
Subjt:  VWRSIKQTCIADSTMEAEYVAACEAAKEA

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.3e-10529.55Show/hide
Query:  KVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFI
        K  +N  LWH R GHI+  ++  + +  + S        + S   CE CL GK  +  F  K L+     K PL +VHSD+CGP+         YF+ F+
Subjt:  KVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLE-----DNSLPPCESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFI

Query:  DDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSD
        D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  
Subjt:  DDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSD

Query:  SFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE
        SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N + K + +S    F+GY  E  G   +D    K  V+ +    E
Subjt:  SFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE

Query:  EDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS--------------------QQLREPRRSGRVV--
         + +     + + V  + SK + +K  P+ S K++                  D      ++ P+                    Q L++ + S +    
Subjt:  EDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS--------------------QQLREPRRSGRVV--

Query:  ----HQPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------AMKDV-----------DRDQ
             + D +L               ET   + + GI++P                     ++Y +                 DV           D+  
Subjt:  ----HQPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ--------------AMKDV-----------DRDQ

Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT
        W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKT
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT

Query:  AFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY---KKVVNFIIAFLVLYVDDVLLIGNDVGYLT
        AFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +N  I +++LYVDDV++   D+  + 
Subjt:  AFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY---KKVVNFIIAFLVLYVDDVLLIGNDVGYLT

Query:  DIKKWLAMQFQ-------KRSGRCTIRSRNPNCSKPYGIHLSK-------EQC----PKTPQEV-------EDMRNIPYASAVGSLMYAMLCTRPDICYS
        + K++L  +F+       K      I  +          ++ K       E C       P ++       ++  N P  S +G LMY MLCTRPD+  +
Subjt:  DIKKWLAMQFQ-------KRSGRCTIRSRNPNCSKPYGIHLSK-------EQC----PKTPQEV-------EDMRNIPYASAVGSLMYAMLCTRPDICYS

Query:  VGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLI----LTGYTDSDFQTDKDARKSTSGSVFTL-NGGAVVWRSIKQTCIADSTMEAEYVAA
        V ++SRY S    + W  +K +L+YL+ T D  L++  K+L     + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A 
Subjt:  VGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLI----LTGYTDSDFQTDKDARKSTSGSVFTL-NGGAVVWRSIKQTCIADSTMEAEYVAA

Query:  CEAAKEA
         EA +EA
Subjt:  CEAAKEA

P0CV72 Secreted RxLR effector protein 1611.2e-2748.48Show/hide
Query:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY---GTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNG
        M+N+PY SAVG++MY M+ TRPD+  +VG++S++ S+P   HW A+K +L+YL+ T+ Y L +   GT  L+  GY+D+D+  D ++R+STSG +F LNG
Subjt:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY---GTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNG

Query:  GAVVWRSIKQTCIADSTMEAEYVAACEAAKEA
        G V WRS KQ  +A S+ E EY+A  EA +EA
Subjt:  GAVVWRSIKQTCIADSTMEAEYVAACEAAKEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-16433.04Show/hide
Query:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH
        K NGDN ++ W+  +  +L+   L  VL  +  +     A        + W   +E+A   I   +SD +        TA+ I   L  ++     + + 
Subjt:  KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRH

Query:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQN-------LTMGK
           K +Y   M EGT+   H+             G  IEE ++   +L SLP S+    T     K    L       LLNE  R +        +T G+
Subjt:  EAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNL-----TTLLNELQRFQN-------LTMGK

Query:  GKQVEANVATTKRKFIRGSS---SKTKA---------GPSKPNAQIKKKGKGKTPKQNKGKKAA--------------EKGKCYH-CGQNGHWLRNCP--
        G+  + +     R   RG S   SK++          G  K +    +KGKG+T  Q      A              E+ +C H  G    W+ +    
Subjt:  GKQVEANVATTKRKFIRGSS---SKTKA---------GPSKPNAQIKKKGKGKTPKQNKGKKAA--------------EKGKCYH-CGQNGHWLRNCP--

Query:  -----------KYLAEKKG--REGNT---------------------------------RKIVLGKSFQKARSLSRLEQERW---------------SQL
                   +Y+A   G  + GNT                                   ++ G +  +    S    ++W                 L
Subjt:  -----------KYLAEKKG--REGNT---------------------------------RKIVLGKSFQKARSLSRLEQERW---------------SQL

Query:  QQT------AETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGG
         +T       E    + ++S +  LWH R+GH++   +  L K  L+S  +  ++ PC+ CL GK  + SF     R    L+LV+SD+CGPM +++ GG
Subjt:  QQT------AETQNKRQKVSSNAYLWHLRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGG

Query:  YEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSM
         +YF++FIDD SR   +Y++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y   +GI+ + + P TPQ NGV+ER NRT+++ VRSM
Subjt:  YEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSM

Query:  ISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVS
        +  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  +   KL+ +S  C FIGY  E  G   +DP + K+  S
Subjt:  ISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVS

Query:  TNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYL
         +  F  E  +R     S+ V   I        S  + P+S+    D+  + G+                    HP+Q   Q +  RRS R   +  RY 
Subjt:  TNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYL

Query:  GLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEET
              V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E 
Subjt:  GLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEET

Query:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK
        FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL KS+YGLKQA R W ++FD+ +KS  + +   +PCVY 
Subjt:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK

Query:  KVV---NFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQF--------QKRSGRCTIRSR--------------------NPNCSKPYG------IHLS
        K     NFII  L+LYVDD+L++G D G +  +K  L+  F        Q+  G   +R R                    N   +KP        + LS
Subjt:  KVV---NFIIAFLVLYVDDVLLIGNDVGYLTDIKKWLAMQF--------QKRSGRCTIRSR--------------------NPNCSKPYG------IHLS

Query:  KEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS
        K+ CP T +E  +M  +PY+SAVGSLMYAM+CTRPDI ++VG+VSR+  NPG++HW AVK IL+YLR T    L +G  D IL GYTD+D   D D RKS
Subjt:  KEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKS

Query:  TSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKE
        ++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE
Subjt:  TSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-8927.2Show/hide
Query:  WHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKS
        WH RLGH   + +  ++ +  LS L   +    C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS
Subjt:  WHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKS

Query:  NSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNN
           E F  +K  +EN     I    SD GGE++ L   +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N 
Subjt:  NSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNN

Query:  VPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRD
        +P+  +  E+P++   G   +    R++GC  +  ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++
Subjt:  VPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRD

Query:  HQPRSKLVL-------------------------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS--------
         +  S  V                                       ++S S +D   SS+                     T+   Q+H S        
Subjt:  HQPRSKLVL-------------------------------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS--------

Query:  ---------QQLREPRRSGR---------------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAM
                 Q L  P +S                       ++H P     ++      P            GI                 +P T  QA+
Subjt:  ---------QQLREPRRSGR---------------------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAM

Query:  KDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE
        KD   ++W  AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + 
Subjt:  KDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE

Query:  IWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFLVLYVDDVLLIGND
        I Q+DV  AFL G L + +YMSQP GFI++D+   VCKL+K++YGLKQA R+W +     + + GF  +V +  ++       I ++++YVDD+L+ GND
Subjt:  IWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFLVLYVDDVLLIGND

Query:  VGYLTDIKKWLAMQFQKRSGRCTIRSRNPNCSK-PYGIHLSKEQ------------------CPKTPQEVEDMRN-------IPYASAVGSLMYAMLCTR
           L +    L+ +F  +              + P G+HLS+ +                   P  P     + +         Y   VGSL Y +  TR
Subjt:  VGYLTDIKKWLAMQFQKRSGRCTIRSRNPNCSK-PYGIHLSKEQ------------------CPKTPQEVEDMRN-------IPYASAVGSLMYAMLCTR

Query:  PDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYV
        PDI Y+V  +S++   P  +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY 
Subjt:  PDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYV

Query:  AACEAAKE
        +    + E
Subjt:  AACEAAKE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-9428.15Show/hide
Query:  WHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKS
        WH RLGH +L  +  ++ +  L  L   + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS
Subjt:  WHLRLGHINLNRIGRLVKSGLLSPLE-DNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKS

Query:  NSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNN
           + F  +K+ VEN     I  L SD GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N 
Subjt:  NSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNN

Query:  VPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE----------------
        +P+  +  ++P++   G+  +    +++GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E                 
Subjt:  VPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE----------------

Query:  -----------------------------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPR
                                      H+ D  PR     S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P 
Subjt:  -----------------------------DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPR

Query:  RSGRVVHQPDRYLGLIETQVVIP----------------------------------------------------DDGI----------------EDPLT
         +    + P++   L ++ +  P                                                     DGI                 +P T
Subjt:  RSGRVVHQPDRYLGLIETQVVIP----------------------------------------------------DDGI----------------EDPLT

Query:  YKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAT
          QAMKD   D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A 
Subjt:  YKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAT

Query:  FYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFLVLYVDDVL
           + I Q+DV  AFL G L + +YMSQP GF+++D+   VC+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++       I ++++YVDD+L
Subjt:  FYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFLVLYVDDVL

Query:  LIGNDVGYLTDIKKWLAMQFQKRSGRCTIRSRNPNCSK-PYGIHLSKEQCP---------------KTPQEVEDMRNI----------PYASAVGSLMYA
        + GND   L      L+ +F  +              + P G+HLS+ +                  TP        +           Y   VGSL Y 
Subjt:  LIGNDVGYLTDIKKWLAMQFQKRSGRCTIRSRNPNCSK-PYGIHLSKEQCP---------------KTPQEVEDMRNI----------PYASAVGSLMYA

Query:  MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTM
        +  TRPD+ Y+V  +S+Y   P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ 
Subjt:  MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTM

Query:  EAEYVAACEAAKE
        EAEY +    + E
Subjt:  EAEYVAACEAAKE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-6334.44Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF Q+  +   + K+   +   +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIAFL

Query:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN--PNCSKPYGIHLSKE---------QCPKTPQEVEDMRN-------IPYAS
        ++YVDD+++  N+   + ++K  L   F+ R         G    RS      C + Y + L  E           P  P       +         Y  
Subjt:  VLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRS--------GRCTIRSRN--PNCSKPYGIHLSKE---------QCPKTPQEVEDMRN-------IPYAS

Query:  AVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQ
         +G LMY  + TR DI ++V  +S++   P   H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD R+ST+G    L    + W+S KQ
Subjt:  AVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQ

Query:  TCIADSTMEAEYVAACEAAKE
          ++ S+ EAEY A   A  E
Subjt:  TCIADSTMEAEYVAACEAAKE

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.3e-0434.72Show/hide
Query:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG
        TRPD+ ++V  +S++ S        AV  +L Y++ T    L Y  T DL L  + DSD+ +  D R+S +G
Subjt:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-0431.71Show/hide
Query:  NRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.2e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACCTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGCTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAAGGCAGAGAAGGAAACACAAGGAAAATAGTTCTTGGAAAAAGCTTTCAGA
AGGCGAGATCACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCTATTTATGGCAC
TTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCTTGTCTTGAAGG
AAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGAT
ATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTA
GAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACT
CTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATAAGTTTTTCTCAGATGTCAGATTCTTTTT
GGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCAT
TTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGG
TTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAG
AAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGT
AGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAA
AGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTG
GTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAG
GAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAA
TGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAG
CTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTTCATTATAGCA
TTTTTAGTCTTATATGTAGATGATGTTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAAAAAGATCTGGGAGATG
CACAATACGTTCTCGGAATCCAAATTGTTCGAAACCATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCT
ATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCAC
TGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCA
AACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGG
AAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCATGGTGCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATT
TGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTC
TTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGA
CACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGG
TCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACCTGACAA
CCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCT
AAAACCAAAGCTGGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAGGGAAAGACTCCCAAACAGAACAAGGGTAAGAAAGCTGCAGAAAAAGGTAAGTGTTA
CCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAAGGCAGAGAAGGAAACACAAGGAAAATAGTTCTTGGAAAAAGCTTTCAGA
AGGCGAGATCACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAAACAGCTGAAACTCAGAATAAAAGACAAAAAGTTTCTTCTAATGCCTATTTATGGCAC
TTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCTTGTCTTGAAGG
AAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGAT
ATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTA
GAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACT
CTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATAAGTTTTTCTCAGATGTCAGATTCTTTTT
GGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCAT
TTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGG
TTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAG
AAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGT
AGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAA
AGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTG
GTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAG
GAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAA
TGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAG
CTTCTAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTTCATTATAGCA
TTTTTAGTCTTATATGTAGATGATGTTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAAAAAGATCTGGGAGATG
CACAATACGTTCTCGGAATCCAAATTGTTCGAAACCATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCT
ATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCAC
TGGACAGCCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCA
AACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGG
AAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCATGGTGCAGTTGA
Protein sequenceShow/hide protein sequence
MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLR
HEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSS
KTKAGPSKPNAQIKKKGKGKTPKQNKGKKAAEKGKCYHCGQNGHWLRNCPKYLAEKKGREGNTRKIVLGKSFQKARSLSRLEQERWSQLQQTAETQNKRQKVSSNAYLWH
LRLGHINLNRIGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMISFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRH
FRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRR
SGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYE
ETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKVVNFIIA
FLVLYVDDVLLIGNDVGYLTDIKKWLAMQFQKRSGRCTIRSRNPNCSKPYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDH
WTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAWCS