; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh00G000080 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh00G000080
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr00:1560296..1564305
RNA-Seq ExpressionCmoCh00G000080
SyntenyCmoCh00G000080
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0064.71Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARGGYEYFISFIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSV ETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDK---TTRVVDQAG---------------PSTRVVDGAD----TSETQ
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK   +T+VVD+A                 S RVV   +      ETQ
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDK---TTRVVDQAG---------------PSTRVVDGAD----TSETQ

Query:  VIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAM
        +IIPDDGVEDPLTYKQAMND DRDQWIKAMNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGVDYEETFSPVAM
Subjt:  VIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST
        LKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPE FI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST

Query:  VAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQ
        VAFL+LYVDDILLIGNDV  LTD+K WL TQFQMKDLGEAQ++L IQIVRNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQ
Subjt:  VAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQ

Query:  EVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLN
        EVEDM++IPY+SAVGSLMYAMLCTRPDICY +GIVSRYQSNPGR HWTAVKNILKYLRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTSGSVFTLN
Subjt:  EVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLN

Query:  GGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQ
        GGA++WRS++Q CIADSTMEAEYVAACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSG VANSKEPRSHKR KHIE KYHLIREIV RGDV+VTQ
Subjt:  GGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQ

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0071.13Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVV-------------------DGADT-------
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++     RVV                    G  T       
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVV-------------------DGADT-------

Query:  --SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEET
          +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEET
Subjt:  --SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEET

Query:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYK
        FSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYK
Subjt:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYK

Query:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ
        RI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKEQ
Subjt:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ

Query:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSG
         PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTSG
Subjt:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSG

Query:  SVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDV
        SVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV RGDV
Subjt:  SVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDV

Query:  IVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        IVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  IVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0061.02Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0061.02Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0061.09Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK N NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0061.02Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

A0A5A7SNP8 Gag/pol protein0.0e+0071.13Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVV-------------------DGADT-------
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++     RVV                    G  T       
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVV-------------------DGADT-------

Query:  --SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEET
          +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEET
Subjt:  --SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEET

Query:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYK
        FSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYK
Subjt:  FSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYK

Query:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ
        RI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKEQ
Subjt:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ

Query:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSG
         PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTSG
Subjt:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSG

Query:  SVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDV
        SVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV RGDV
Subjt:  SVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDV

Query:  IVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        IVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  IVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

A0A5A7TZD7 Gag/pol protein0.0e+0061.09Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK N NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

A0A5D3CPJ6 Gag/pol protein0.0e+0061.02Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  G+ T                      
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVD-GADT----------------------

Query:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY
             +ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDY
Subjt:  -----SETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDY

Query:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPE FI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS
        VYKRI+N +VAFLVLYVDDILLIGND+G+LTDIK WLATQFQMKDLGEAQFVL IQI R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LS
Subjt:  VYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLS

Query:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS
        KEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICY +GIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKS
Subjt:  KEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR
        TSGSVFTLNGGA++WRSI+QGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSG VANS+EPRSHKR KHIE KYHLIREIV R
Subjt:  TSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQR

Query:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM
        GDVIVTQIAS  N+ADPFTKPLTAKVFE HL SLGLR M
Subjt:  GDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM

E2GK51 Gag/pol protein (Fragment)0.0e+0064.71Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARGGYEYFISFIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSV ETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDK---TTRVVDQAG---------------PSTRVVDGAD----TSETQ
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK   +T+VVD+A                 S RVV   +      ETQ
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDK---TTRVVDQAG---------------PSTRVVDGAD----TSETQ

Query:  VIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAM
        +IIPDDGVEDPLTYKQAMND DRDQWIKAMNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ+EGVDYEETFSPVAM
Subjt:  VIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST
        LKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPE FI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST

Query:  VAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQ
        VAFL+LYVDDILLIGNDV  LTD+K WL TQFQMKDLGEAQ++L IQIVRNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKEQ PKTPQ
Subjt:  VAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQ

Query:  EVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLN
        EVEDM++IPY+SAVGSLMYAMLCTRPDICY +GIVSRYQSNPGR HWTAVKNILKYLRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTSGSVFTLN
Subjt:  EVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLN

Query:  GGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQ
        GGA++WRS++Q CIADSTMEAEYVAACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSG VANSKEPRSHKR KHIE KYHLIREIV RGDV+VTQ
Subjt:  GGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-11128.92Show/hide
Query:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG
        K  L +VH+D+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + +      +  L  D G EY+    + + ++ GI   L+ P 
Subjt:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG

Query:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV---SETPYELWKGRKGYPK-------------ETKGGLF---------
         PQ NGVSER  RT+ +  R+M+S A+L   FWG AV TATY++N +P++++   S+TPYE+W  +K Y K             + K G F         
Subjt:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV---SETPYELWKGRKGYPK-------------ETKGGLF---------

Query:  --YDPQENRVFVSTNATFL--------EEDHVRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGP--------------------------STRVV-
          Y+P   +++ + N  F+        E + V +   + + V  + SKE+ +K     + +++    P                          S +++ 
Subjt:  --YDPQENRVFVSTNATFL--------EEDHVRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGP--------------------------STRVV-

Query:  ------------------------------------------------DGADTSETQVIIPDDGVEDP---------------------LTYKQAMND--
                                                        + +  SET   + + G+++P                     ++Y +  N   
Subjt:  ------------------------------------------------DGADTSETQVIIPDDGVEDP---------------------LTYKQAMND--

Query:  -----------------------EDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSP
                               +D+  W +A+N E+ +   N+ W +  +P+    +  +W++  K ++ G    +KARLVA+G+TQ+  +DYEETF+P
Subjt:  -----------------------EDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSP

Query:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---K
        VA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+      +   VCKL ++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K
Subjt:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---K

Query:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ
          +N  + +++LYVDD+++   D+  + + K +L  +F+M DL E +  + I+I    +   + LSQ++Y+ K+L ++ M++      P    ++     
Subjt:  RIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQ

Query:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSRKS
        S       ++  + P  S +G LMY MLCTRPD+   + I+SRY S      W  +K +L+YL+ T D  L++    A +  + GY DSD+      RKS
Subjt:  SPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSRKS

Query:  TSGSVFTL-NGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQ
        T+G +F + +   I W + RQ  +A S+ EAEY+A  EA +E++WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ
Subjt:  TSGSVFTL-NGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQ

Query:  RGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL
           + +  I +E  +AD FTKPL A  F E    LGL
Subjt:  RGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-17039.13Show/hide
Query:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA
        R    L+LV++D+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F+++   V+   G+ +K LRSD GGEY    F++Y   HGIR + + 
Subjt:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA

Query:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYELWKGRK---------------------------------
        PG PQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA Y++N  P+  ++ E P  +W  ++                                 
Subjt:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYELWKGRK---------------------------------

Query:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSETQVIIP
          GY  E  G   +DP + +V  S +  F  E  VR     S+ V + I                ++  TD+ +   +Q G      +  D    +V  P
Subjt:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSETQVIIP

Query:  DDGVE------------------------------DPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF
          G E                              +P + K+ ++  +++Q +KAM  EMES+  N  ++LV+ P G +P+ CKW++K K+D   K+  +
Subjt:  DDGVE------------------------------DPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF

Query:  KARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF
        KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPE F     +  VCKL +S+YGLKQA R W +KF
Subjt:  KARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF

Query:  DTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYK
        D+ +KS  + +   +PCVY KR   +    L+LYVDD+L++G D G++  +K  L+  F MKDLG AQ +L ++IVR R ++ L LSQ  YI+++L R+ 
Subjt:  DTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYK

Query:  MQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLI
        M+++K    P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI + +G+VSR+  NPG+ HW AVK IL+YLR T    L +G  D I
Subjt:  MQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLI

Query:  LTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHK
        L GYTD+D   D+D+RKS++G +FT +GGAI W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   +YCD+   +  SK    H 
Subjt:  LTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHK

Query:  REKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFE
        R KHI+ +YH IRE+V    + V +I++  N AD  TK +    FE
Subjt:  REKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFE

P25600 Putative transposon Ty5-1 protein YCL074W1.7e-3432.48Show/hide
Query:  MDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI
        MDV TAFLN  ++E IY+ QP  F+ + +   V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +    ++ +YVDD+L+      I
Subjt:  MDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI

Query:  LTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYA
           +K  L   + MKDLG+    L + I     N  + LS   YI K     ++   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYA

Query:  MLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIR-QGCIADST
            RPDI Y + ++SR+   P   H  + + +L+YL  TR   L Y     L LT Y D+      D   ST G V  L G  + W S + +G I   +
Subjt:  MLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIR-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-9227.48Show/hide
Query:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY
        +C   L  K  K   +Q    +   LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  ++N     I T  SD GGE+
Subjt:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYEL------------------
        + L   +Y  +HGI    S P  P+ NG+SER++R +++   +++S A +P  +W YA   A Y++N +PT  +  E+P++                   
Subjt:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYEL------------------

Query:  --W-----------KGRK----GYPKETKGGLFYDPQENRVFVSTNATFLE---------------------------------------------EDH-
          W           K R+    GY       L    Q +R+++S +  F E                                             + H 
Subjt:  --W-----------KGRK----GYPKETKGGLFYDPQENRVFVSTNATFLE---------------------------------------------EDH-

Query:  -----------VRNHQPRSKLVLSEISK----------------EATDKTTRVVDQAGPSTRVVDGADTSETQ---------------------------
                    RN Q  S  + S  S                 + T + T+   Q   S        T+E+                            
Subjt:  -----------VRNHQPRSKLVLSEISK----------------EATDKTTRVVDQAGPSTRVVDGADTSETQ---------------------------

Query:  -----------------------------------------VIIPDD---------GVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPD
                                                 +I P+             +P T  QA+ DE   +W  AM  E+ +   N  W+LV  P 
Subjt:  -----------------------------------------VIIPDD---------GVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPD

Query:  G-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDH
          V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM+QP  FI++D 
Subjt:  G-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDH

Query:  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIV
           VCKL++++YGLKQA R+W ++    + + GF  +V +  ++      ++ ++++YVDDIL+ GND  +L +    L+ +F +KD  E  + L I+  
Subjt:  EQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIV

Query:  RNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTA
          R    L LSQ  YI  +L R  M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI Y +  +S++   P   H  A
Subjt:  RNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTA

Query:  VKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPN
        +K IL+YL  T ++ + +     L L  Y+D+D+  D D   ST+G +  L    I W S +Q  +  S+ EAEY +    + E  W+   LT+L +   
Subjt:  VKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPN

Query:  MHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL
        +  P  +YCDN G       P  H R KHI   YH IR  VQ G + V  +++   +AD  TKPL+   F+     +G+
Subjt:  MHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.0e-9527.44Show/hide
Query:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP
        + + LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  +K+ V+N     I TL SD GGE++ LR  DY+ +HGI    S P
Subjt:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP

Query:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYELWKGR-----------------------------------
          P+ NG+SER++R +++M  +++S A +P  +W YA   A Y++N +PT  +  ++P++   G+                                   
Subjt:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVS-ETPYELWKGR-----------------------------------

Query:  KGYPKETKGGLFYDPQENRVFVSTNATFLE-------------------EDHVRNHQ-----PRSKLVL-------------------------------
         GY       L       R++ S +  F E                    D   N       P + LVL                               
Subjt:  KGYPKETKGGLFYDPQENRVFVSTNATFLE-------------------EDHVRNHQ-----PRSKLVL-------------------------------

Query:  ----SEISKEATDKTTRVVDQA--------------------------------------------------GPSTRVVDGADTSETQVIIP--------
            S IS  ++ + T                                                         PST + +    S +    P        
Subjt:  ----SEISKEATDKTTRVVDQA--------------------------------------------------GPSTRVVDGADTSETQVIIP--------

Query:  ---------------------DDGV----------------EDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRD
                              DG+                 +P T  QAM D   D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +
Subjt:  ---------------------DDGV----------------EDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRD

Query:  QTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQA
          G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM+QP  F+++D    VC+L+++IYGLKQA
Subjt:  QTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQA

Query:  SRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYID
         R+W ++  T + + GF  ++ +  ++      ++ ++++YVDDIL+ GND  +L      L+ +F +K+  +  + L I+    R  + L LSQ  Y  
Subjt:  SRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYID

Query:  KMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-ML
         +L R  M  +K    P      L+     K P   E      Y   VGSL Y +  TRPD+ Y +  +S+Y   P   HW A+K +L+YL  T D+ + 
Subjt:  KMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-ML

Query:  MYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVAN
        +     L L  Y+D+D+  D D   ST+G +  L    I W S +Q  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN G    
Subjt:  MYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVAN

Query:  SKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL
           P  H R KHI   YH IR  VQ G + V  +++   +AD  TKPL+   F+     +G+
Subjt:  SKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.8e-7834.5Show/hide
Query:  EDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A   ++   W  AM+ E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P  +  +  +      VC LK+SIYGLKQASR W +KF   +  +GF Q+  +   + +I  +    +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPERFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL

Query:  VLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ L ++I R+     + + Q  Y   +L    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA
         K   Y   +G LMY  + TR DI + +  +S++   P  AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+  D+R+ST+G    L    
Subjt:  MKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA

Query:  IIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIRE
        I W+S +Q  ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+  +  +     H+R KHIE   H +RE
Subjt:  IIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGEVANSKEPRSHKREKHIEHKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.5e-0434.72Show/hide
Query:  TRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSG
        TRPD+ + +  +S++ S    A   AV  +L Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.9e-2032.48Show/hide
Query:  FLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+   +L  +   L++ F MKDLG   + L IQI  +     L LSQ  Y +++L    M D K    P    ++ S   + K P   
Subjt:  FLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLGEAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNG
        +      + S VG+L Y  L TRPDI Y + IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  EDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNG

Query:  GAIIWRSIRQGCIADSTMEAEYVAACEAAKESVW
          I W + RQ  ++ S+ E EY A    A E  W
Subjt:  GAIIWRSIRQGCIADSTMEAEYVAACEAAKESVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.3e-1541.86Show/hide
Query:  WIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  WIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCACTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATG
AATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTAT
TGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATG
AAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTA
AACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGAGGTTCATTGAACAGGATCATGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCA
AGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAG
CTTTCCTAGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGA
GAGGCTCAGTTTGTTCTTGAAATCCAAATTGTTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGATAAAATGTTGATTCGATATAAGATGCA
GGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCAT
CAGCGGTCGGTAGTCTTATGTATGCCATGCTATGTACCCGACCAGACATATGCTATGTTATGGGAATTGTCAGCAGATACCAGTCCAATCCGGGACGTGCTCATTGGACT
GCCGTTAAGAATATCCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGA
TGTAGATTCGAGAAAATCGACATCAGGATCTGTCTTCACTTTGAACGGAGGAGCAATAATATGGAGAAGCATAAGGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTG
AGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTAGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGAT
AACAGTGGAGAAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGAAAAACATATTGAGCACAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGAT
CGTCACACAGATAGCTTCAGAGATCAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGAGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCACTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATG
AATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTAT
TGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGGGTGGACTATG
AAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATAAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGACAGCTTTTCTA
AACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGAGGTTCATTGAACAGGATCATGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCA
AGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAG
CTTTCCTAGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTGGCGACACAATTCCAAATGAAAGATTTGGGA
GAGGCTCAGTTTGTTCTTGAAATCCAAATTGTTCGGAATCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGATAAAATGTTGATTCGATATAAGATGCA
GGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCAT
CAGCGGTCGGTAGTCTTATGTATGCCATGCTATGTACCCGACCAGACATATGCTATGTTATGGGAATTGTCAGCAGATACCAGTCCAATCCGGGACGTGCTCATTGGACT
GCCGTTAAGAATATCCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGA
TGTAGATTCGAGAAAATCGACATCAGGATCTGTCTTCACTTTGAACGGAGGAGCAATAATATGGAGAAGCATAAGGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTG
AGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTAGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGAT
AACAGTGGAGAAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGAAAAACATATTGAGCACAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGAT
CGTCACACAGATAGCTTCAGAGATCAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGAGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQ
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLY
LMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTK
SVSETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSETQVIIPDDGVEDPLTYKQAM
NDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL
NGNLEESIYMAQPERFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGILTDIKHWLATQFQMKDLG
EAQFVLEIQIVRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYVMGIVSRYQSNPGRAHWT
AVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIRQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCD
NSGEVANSKEPRSHKREKHIEHKYHLIREIVQRGDVIVTQIASEINIADPFTKPLTAKVFEEHLVSLGLRVM