; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G023510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G023510
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr04:17472419..17476429
RNA-Seq ExpressionCmoCh04G023510
SyntenyCmoCh04G023510
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0057.93Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP     NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARGGYEYFISFIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        A   PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSVLETPYELWKGRK                                 
Subjt:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK +        ST+VVD A+ S QSH SQELR+PRRSGRV+ QP+RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------
        LGL ETQ+IIPDDGVEDPLTYKQAMND DRDQWIKAMNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GK                       
Subjt:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------

Query:  --------------------------------------------------------------------------------------------NVDEPCVY
                                                                                                    NVDEPCVY
Subjt:  --------------------------------------------------------------------------------------------NVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE
        K+IVNS VAFL+LYVDDILLIGNDV  LTD+K WL TQFQMKDLGEAQ++LGI+I+RNRKNKTLA+SQASYIDK+L RYKMQ+SKKG LPFRHG+HLSKE
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS
        Q PKTPQEVEDM++IPY+SAVGSLMYAMLCTRPDICY+VGIVSRYQSNPGR HWTAVKNILKYLRRTR+YML+YG KDLIL GYTDSDFQ+D D+RKSTS
Subjt:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRS+KQ CIADSTMEAEYV ACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQ
        V+VTQ
Subjt:  VIVTQ

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.76Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        A   PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW G K                                 
Subjt:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++     RVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------
        + L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                       
Subjt:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------

Query:  --------------------------------------------------------------------------------------------NVDEPCVY
                                                                                                     VDEPCVY
Subjt:  --------------------------------------------------------------------------------------------NVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE
        KRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKE
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS
        Q PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRKSTS
Subjt:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        VIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  VIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------
         RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                    
Subjt:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------

Query:  -----------------------------------------------------------------------------------------------NVDEP
                                                                                                        VDEP
Subjt:  -----------------------------------------------------------------------------------------------NVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL
        CVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV L
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL

Query:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK
        SKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRK
Subjt:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------
         RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                    
Subjt:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------

Query:  -----------------------------------------------------------------------------------------------NVDEP
                                                                                                        VDEP
Subjt:  -----------------------------------------------------------------------------------------------NVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL
        CVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV L
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL

Query:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK
        SKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRK
Subjt:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.85Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK N NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------
         RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                    
Subjt:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------

Query:  -----------------------------------------------------------------------------------------------NVDEP
                                                                                                        VDEP
Subjt:  -----------------------------------------------------------------------------------------------NVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL
        CVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV L
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL

Query:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK
        SKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRK
Subjt:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0054.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------
         RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                    
Subjt:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------

Query:  -----------------------------------------------------------------------------------------------NVDEP
                                                                                                        VDEP
Subjt:  -----------------------------------------------------------------------------------------------NVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL
        CVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV L
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL

Query:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK
        SKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRK
Subjt:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7SNP8 Gag/pol protein0.0e+0063.76Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        A   PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW G K                                 
Subjt:  ATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++     RVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------
        + L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                       
Subjt:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------------

Query:  --------------------------------------------------------------------------------------------NVDEPCVY
                                                                                                     VDEPCVY
Subjt:  --------------------------------------------------------------------------------------------NVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE
        KRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV LSKE
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKE

Query:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS
        Q PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRKSTS
Subjt:  QSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        VIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  VIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7ULH1 Gag/pol protein0.0e+0064.07Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW K N+KAR YILAS+S+VLAKKH+ M TA+EIM SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYI N RM EG SVREHV++MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GE NVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG
                                     T++G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+  L 
Subjt:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG

Query:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK
        KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK
Subjt:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
                                    EEDH+R H PRSK+VL+E+SK  T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------NVDEPC
        + L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                  VD   
Subjt:  LGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK-----------------NVDEPC

Query:  VYK--------RIVNSTVAFL-----VLYVDDILLIGNDVR--------------ILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASY
         +         RI+ S  A+       + V    L GN                 +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASY
Subjt:  VYK--------RIVNSTVAFL-----VLYVDDILLIGNDVR--------------ILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASY

Query:  IDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM
        IDK++++Y MQ+SK+ LLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDYM
Subjt:  IDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYM

Query:  LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVA
        L+YG KDLIL GYTDSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTME EYV ACEAAKE VWLR FL DLEVVPNM  P+TLYCDN+G VA
Subjt:  LMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVA

Query:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTQI S HN+ADPFTKPLTAKVFEGHL SLG+R M
Subjt:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5D3BHG7 Gag/pol protein0.0e+0059.49Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT + + +LA++KLNG+NY +WK+ +N +L+IDDLKFVL EECP    +NA +T R+ Y+RW K N+K R YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ + H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQ                                             G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFH N+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG
                                     T +G+RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L 
Subjt:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG

Query:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK
        KTIKT RSDRGGEYMDL+FQ+Y++E  I SQLSA G PQQNGVSERRNRTLLDMVRSM+S+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK
Subjt:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK

Query:  ---------------------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTR
                                         GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TR
Subjt:  ---------------------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTR

Query:  VVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWI
        VV    +S ++H  Q LR PRRSGRV   P RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWI
Subjt:  VVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWI

Query:  YKRKRDQTGK------------------------------------------------------------------------------------------
        YKRKR   GK                                                                                          
Subjt:  YKRKRDQTGK------------------------------------------------------------------------------------------

Query:  -------------------------NVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALS
                                  VDEPCVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALS
Subjt:  -------------------------NVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALS

Query:  QASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRT
        QASYIDK++++Y MQ+SK+GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRT
Subjt:  QASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRT

Query:  RDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNS
        RDY L+YG KDLIL GYTDSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNS
Subjt:  RDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNS

Query:  GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        GAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5D3CPJ6 Gag/pol protein0.0e+0054.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP    +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSA G PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------
         RY+ L ET  +I D  +EDPLT+K+AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GK                    
Subjt:  DRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK--------------------

Query:  -----------------------------------------------------------------------------------------------NVDEP
                                                                                                        VDEP
Subjt:  -----------------------------------------------------------------------------------------------NVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL
        CVYKRI+N +VAFLVLYVDDILLIGND+ +LTDIK WLATQFQMKDLGEAQFVLGI+I R+RKNK LALSQASYIDK++++Y MQ+SK+GLLPFRHGV L
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHL

Query:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK
        SKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG KDLIL GYTDSDFQTD DSRK
Subjt:  SKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYV ACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-7123.74Show/hide
Query:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATG
        K  L +VH+D+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + +      +  L  D G EY+    + + ++ GI   L+   
Subjt:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATG

Query:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLE---TPYELWKGRKGYPK-------------ETKGGLF---------
         PQ NGVSER  RT+ +  R+M+S A+L   FWG AV TATY++N +P++++++   TPYE+W  +K Y K             + K G F         
Subjt:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLE---TPYELWKGRKGYPK-------------ETKGGLF---------

Query:  --YDPQENRVFVSTNATFL--------EEDHVRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGPS-TRVVDGADTSGQSHPSQELRMPRRSGRVI-
          Y+P   +++ + N  F+        E + V +   + + V  + SKE+ +K     + +++    P+ ++  D       S  S+    P  S ++I 
Subjt:  --YDPQENRVFVSTNATFL--------EEDHVRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGPS-TRVVDGADTSGQSHPSQELRMPRRSGRVI-

Query:  -------------------TQPDRYL-------------------------GLAETQVIIPDDGVEDP---------------------LTYKQAMND--
                            + ++Y                            +ET   + + G+++P                     ++Y +  N   
Subjt:  -------------------TQPDRYL-------------------------GLAETQVIIPDDGVEDP---------------------LTYKQAMND--

Query:  -----------------------EDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK---------------------------
                               +D+  W +A+N E+ +   N+ W +  +P+    +  +W++  K ++ G                            
Subjt:  -----------------------EDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK---------------------------

Query:  -------------------------------------------------NVDEPCVYKR------------------------IVNSTV-----------
                                                         N D  C   +                         VNS+V           
Subjt:  -------------------------------------------------NVDEPCVYKR------------------------IVNSTV-----------

Query:  ----AFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPK
             +++LYVDD+++   D+  + + K +L  +F+M DL E +  +GI I    +   + LSQ++Y+ K+L ++ M++      P    ++     S  
Subjt:  ----AFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPK

Query:  TPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLI----LIGYTDSDFQTDVDSRKSTS
             ++  + P  S +G LMY MLCTRPD+  AV I+SRY S      W  +K +L+YL+ T D  L++  K+L     +IGY DSD+      RKST+
Subjt:  TPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLI----LIGYTDSDFQTDVDSRKSTS

Query:  GSVFTL-NGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRG
        G +F + +   I W + +Q  +A S+ EAEY+   EA +E++WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ  
Subjt:  GSVFTL-NGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRG

Query:  DVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGL
         + +  I +E+ +AD FTKPL A  F      LGL
Subjt:  DVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGL

P0CV72 Secreted RxLR effector protein 1612.3e-3048.87Show/hide
Query:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGA
        MK++PY SAVG++MY M+ TRPD+  AVG++S++ S+P   HW A+K +L+YL+ T+ Y L +       L+GY+D+D+  DV+SR+STSG +F LNGG 
Subjt:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGA

Query:  IIWRSIKQGCIADSTMEAEYVVACEAAKESVWL
        + WRS KQ  +A S+ E EY+   EA +E+VWL
Subjt:  IIWRSIKQGCIADSTMEAEYVVACEAAKESVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-12433.29Show/hide
Query:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA
        R    L+LV++D+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F+++   V+   G+ +K LRSD GGEY    F++Y   HGIR + + 
Subjt:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA

Query:  TGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYELWKGRK---------------------------------
         G PQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA Y++N  P+  +  E P  +W  ++                                 
Subjt:  TGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYELWKGRK---------------------------------

Query:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSGQ--SHP
          GY  E  G   +DP + +V  S +  F  E  VR     S+ V + I                ++  TD+ +   +Q G      +  D   +   HP
Subjt:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSGQ--SHP

Query:  SQ--ELRMP-RRSGRVITQPDRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRD----
        +Q  E   P RRS R   +  RY   +   V+I DD   +P + K+ ++  +++Q +KAM  EMES+  N  ++LV+ P G +P+ CKW++K K+D    
Subjt:  SQ--ELRMP-RRSGRVITQPDRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRD----

Query:  -------------QTGKNVD--------------------------------------------------------------------------------
                     +  K +D                                                                                
Subjt:  -------------QTGKNVD--------------------------------------------------------------------------------

Query:  ------------------EPCVY-KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKML
                          +PCVY KR   +    L+LYVDD+L++G D  ++  +K  L+  F MKDLG AQ +LG++I+R R ++ L LSQ  YI+++L
Subjt:  ------------------EPCVY-KRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKML

Query:  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGV
         R+ M+++K    P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG+ HW AVK IL+YLR T    L +G 
Subjt:  IRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGV

Query:  KDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEP
         D IL GYTD+D   D+D+RKS++G +FT +GGAI W+S  Q C+A ST EAEY+ A E  KE +WL++FL +L +    ++   +YCD+  A+  SK  
Subjt:  KDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEP

Query:  RSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFE
          H R KHI+ +YH IRE+V    + V +I++  N AD  TK +    FE
Subjt:  RSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-4034.65Show/hide
Query:  TVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTP
        ++ ++++YVDDIL+ GND  +L +    L+ +F +KD  E  + LGIE    R    L LSQ  YI  +L R  M  +K    P      LS     K  
Subjt:  TVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTP

Query:  QEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFT
           E      Y   VGSL Y +  TRPDI YAV  +S++   P   H  A+K IL+YL  T ++ + +     L L  Y+D+D+  D D   ST+G +  
Subjt:  QEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFT

Query:  LNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQ
        L    I W S KQ  +  S+ EAEY      + E  W+   LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  
Subjt:  LNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQ

Query:  IASEHNIADPFTKPLTAKVFEGHLVSLGL
        +++   +AD  TKPL+   F+     +G+
Subjt:  IASEHNIADPFTKPLTAKVFEGHLVSLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-2030.94Show/hide
Query:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY
        +C   L  K  K   +Q    +   LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  ++N     I T  SD GGE+
Subjt:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE
        + L   +Y  +HGI    S    P+ NG+SER++R +++   +++S A +P  +W YA   A Y++N +PT  + LE+P++
Subjt:  MDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.0e-4234.04Show/hide
Query:  TVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTP
        ++ ++++YVDDIL+ GND  +L      L+ +F +K+  +  + LGIE    R  + L LSQ  Y   +L R  M  +K    P      L+     K P
Subjt:  TVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTP

Query:  QEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFT
           E      Y   VGSL Y +  TRPD+ YAV  +S+Y   P   HW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  
Subjt:  QEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFT

Query:  LNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQ
        L    I W S KQ  +  S+ EAEY      + E  W+   LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  
Subjt:  LNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQ

Query:  IASEHNIADPFTKPLTAKVFEGHLVSLGL
        +++   +AD  TKPL+   F+     +G+
Subjt:  IASEHNIADPFTKPLTAKVFEGHLVSLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.9e-2233.75Show/hide
Query:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAT
        + + LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  +K+ V+N     I TL SD GGE++ LR  DY+ +HGI    S  
Subjt:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAT

Query:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE
          P+ NG+SER++R +++M  +++S A +P  +W YA   A Y++N +PT  + L++P++
Subjt:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-3229.15Show/hide
Query:  ESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTG----KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLG
        +S+  N+V  L     G+K    +W  K      G    ++  +   + +I  +    +++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG
Subjt:  ESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTG----KNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLG

Query:  IEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRA
        +EI R+     + + Q  Y   +L    +   K   +P    V  S         +  D K   Y   +G LMY  + TR DI +AV  +S++   P  A
Subjt:  IEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRA

Query:  HWTAVKNILKYLRRTRDYMLMYGVK-DLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLE
        H  AV  IL Y++ T    L Y  + ++ L  ++D+ FQ+  D+R+ST+G    L    I W+S KQ  ++ S+ EAEY     A  E +WL +F  +L+
Subjt:  HWTAVKNILKYLRRTRDYMLMYGVK-DLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESVWLRKFLTDLE

Query:  VVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        +   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.9e-0436.11Show/hide
Query:  TRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSG
        TRPD+ +AV  +S++ S    A   AV  +L Y++ T    L Y    DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GVKDLILIGYTDSDFQTDVDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein4.5e-2132.48Show/hide
Query:  FLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+   +L  +   L++ F MKDLG   + LGI+I  +     L LSQ  Y +++L    M D K    P    ++ S   + K P   
Subjt:  FLVLYVDDILLIGNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNG
        +      + S VG+L Y  L TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L  
Subjt:  EDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNG

Query:  GAIIWRSIKQGCIADSTMEAEYVVACEAAKESVW
          I W + +Q  ++ S+ E EY      A E  W
Subjt:  GAIIWRSIKQGCIADSTMEAEYVVACEAAKESVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACACAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCACTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAA
CCCGATCGTTACTTGGGTTTAGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAGGATAGAGATCAGTG
GATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGA
GGAAACGAGACCAAACCGGTAAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATT
GGAAATGATGTGAGAATTCTGACTGACATTAAGCATTGGCTGGCAACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCGAAATTATTCGGAA
TCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAG
TTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACT
CGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATTCTCAAGTATCTTAGGAGAACGAG
GGACTATATGCTAATGTACGGTGTTAAGGATCTGATCCTTATAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCA
CTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGTTGCTTGTGAAGCAGCGAAAGAATCTGTA
TGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAG
CCATAAGCGAGGAAAACATATTGAACGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGCTGATC
CATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACACAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCACTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAA
CCCGATCGTTACTTGGGTTTAGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACAGGCAATGAATGATGAGGATAGAGATCAGTG
GATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGA
GGAAACGAGACCAAACCGGTAAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATT
GGAAATGATGTGAGAATTCTGACTGACATTAAGCATTGGCTGGCAACACAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCGAAATTATTCGGAA
TCGCAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTATTACCTTTCAGGCATGGAG
TTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATGTATGCCATGCTTTGTACT
CGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATTCTCAAGTATCTTAGGAGAACGAG
GGACTATATGCTAATGTACGGTGTTAAGGATCTGATCCTTATAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCA
CTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGTTGCTTGTGAAGCAGCGAAAGAATCTGTA
TGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAG
CCATAAGCGAGGAAAACATATTGAACGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGCTGATC
CATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNTNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQ
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYGYLY
LMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSATGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTK
SVLETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQ
PDRYLGLAETQVIIPDDGVEDPLTYKQAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKNVDEPCVYKRIVNSTVAFLVLYVDDILLI
GNDVRILTDIKHWLATQFQMKDLGEAQFVLGIEIIRNRKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCT
RPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGVKDLILIGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVVACEAAKESV
WLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM