; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G009470 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G009470
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr20:5025004..5029482
RNA-Seq ExpressionCmoCh20G009470
SyntenyCmoCh20G009470
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0057.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  N NRTVR+AYDRW+KANDKARVYILAS++DVLAKKHD              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                + E N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKFGPSTSESVLMKKKGKGKNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCF+CN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSKNKFGPSTSESVLMKKKGKGKNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARGGYEYFISFIDD+SRY ++YL+HHKSE+ EKF+EYK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSV ETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK +        ST+VVD  + S QSH SQELR+PRRSGRV+ QP+RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE
        L L ETQ+IIPDDGVEDPLTYK AMND DRDQWIKAMNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ EGVDYEE
Subjt:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE

Query:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY
        TFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNL+ESIYM + EGFI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVY
Subjt:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------
        K+IVNS VAFL+LYVDDILLIGNDV  +TD+K WL TQFQMKDLGEAQ++LGIQIVRNRKNK LA+SQASYIDK+                         
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------

Query:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS
                                                    YQSNPGR HWTAVKNILKYLRRTR+YML+YGAKDLIL  YTDSDFQ+D D+RKSTS
Subjt:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRS+KQ CIADSTMEAEYVAACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQ
        V+VTQ
Subjt:  VIVTQ

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0064.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKH----------DVMEE
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH          D ++E
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKH----------DVMEE

Query:  --------------------------------------------NEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                    NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  --------------------------------------------NEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++     RVV  V +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE
        + L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEE
Subjt:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE

Query:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY
        TFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVY
Subjt:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------
        KRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                         
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------

Query:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS
                                                    YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRKSTS
Subjt:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        VIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  VIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0055Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                V E N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF+CN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  V +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD
         RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------
        CVYKRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                      
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------

Query:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK
                                                       YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRK
Subjt:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0055Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                V E N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF+CN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  V +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD
         RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------
        CVYKRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                      
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------

Query:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK
                                                       YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRK
Subjt:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0063.49Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMEENEAVIDEKSQ
        M++SI+ LL  ++L G+NY TWKS LN ILVI DL FVL EECPP P  + +++VRDAYDRW KANDKAR++ILAS+SD+L+KKH++M     ++D    
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMEENEAVIDEKSQ

Query:  VSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSKNKFGPSTSESVLMKKKGKGKN-KIPTNRKNKVQK
              SL + F Q                    +Q  Q          EANVA SK+    S       PS SE +  +K+GKGK   I    K K + 
Subjt:  VSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSKNKFGPSTSESVLMKKKGKGKN-KIPTNRKNKVQK

Query:  ADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------------------------------------------------------------
        A K KCF+CN + HWK NCPKYL +KK ++                                                                      
Subjt:  ADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------------------------------------------------------------

Query:  ---------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY
                       T +GYRAKE LEL+H+DLCGPMNVKARGG+EYFISFIDDYSRY YLYLM HKSEALEKF+EYKTEVENLL K IK LRSDRGGEY
Subjt:  ---------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK--------------
        MDLRFQDYMIEHGI+SQLSAPG PQQNGVSERRNRTLLDMVRSMMS+AQLP  FWGYAVETA +ILN VP+KSVSETP+ELW+GRK              
Subjt:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK--------------

Query:  -------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS
                           GYPKET+GGLF+DPQENRVFVSTNATFLEEDH+RNH+PRSKLVLS    EATD++TRVVD+ GPS+R VD   TSGQSHPS
Subjt:  -------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS

Query:  QELRMPRRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF
        Q LRMPRRSGRV++QP+RYL L ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTF
Subjt:  QELRMPRRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF

Query:  KARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF
        KARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESI+M++ EGFI Q  EQ+VCKL RSIYGLKQASRSWNI+F
Subjt:  KARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF

Query:  DTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI------
        DTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG +TD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNK LALSQA+YIDK+      
Subjt:  DTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI------

Query:  ---------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLIL
                                                                       YQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLIL
Subjt:  ---------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLIL

Query:  IRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR
          YTDSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKR
Subjt:  IRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR

Query:  GKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        GKHIERKYHLIREIVQRGDVIVT+IASEHNI DPFTK LT KVFEGHL SLGLR M
Subjt:  GKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0055Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                V E N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF+CN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  V +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD
         RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------
        CVYKRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                      
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------

Query:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK
                                                       YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRK
Subjt:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

A0A5A7SNP8 Gag/pol protein0.0e+0064.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKH----------DVMEE
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH          D ++E
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKH----------DVMEE

Query:  --------------------------------------------NEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                    NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  --------------------------------------------NEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++     RVV  V +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE
        + L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEE
Subjt:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE

Query:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY
        TFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVY
Subjt:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------
        KRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                         
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------

Query:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS
                                                    YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRKSTS
Subjt:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        VIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  VIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

A0A5A7UYE8 Gag/pol protein0.0e+0063.49Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMEENEAVIDEKSQ
        M++SI+ LL  ++L G+NY TWKS LN ILVI DL FVL EECPP P  + +++VRDAYDRW KANDKAR++ILAS+SD+L+KKH++M     ++D    
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMEENEAVIDEKSQ

Query:  VSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSKNKFGPSTSESVLMKKKGKGKN-KIPTNRKNKVQK
              SL + F Q                    +Q  Q          EANVA SK+    S       PS SE +  +K+GKGK   I    K K + 
Subjt:  VSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSKNKFGPSTSESVLMKKKGKGKN-KIPTNRKNKVQK

Query:  ADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------------------------------------------------------------
        A K KCF+CN + HWK NCPKYL +KK ++                                                                      
Subjt:  ADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------------------------------------------------------------

Query:  ---------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY
                       T +GYRAKE LEL+H+DLCGPMNVKARGG+EYFISFIDDYSRY YLYLM HKSEALEKF+EYKTEVENLL K IK LRSDRGGEY
Subjt:  ---------------TQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK--------------
        MDLRFQDYMIEHGI+SQLSAPG PQQNGVSERRNRTLLDMVRSMMS+AQLP  FWGYAVETA +ILN VP+KSVSETP+ELW+GRK              
Subjt:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK--------------

Query:  -------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS
                           GYPKET+GGLF+DPQENRVFVSTNATFLEEDH+RNH+PRSKLVLS    EATD++TRVVD+ GPS+R VD   TSGQSHPS
Subjt:  -------------------GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS

Query:  QELRMPRRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF
        Q LRMPRRSGRV++QP+RYL L ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTF
Subjt:  QELRMPRRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTF

Query:  KARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF
        KARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESI+M++ EGFI Q  EQ+VCKL RSIYGLKQASRSWNI+F
Subjt:  KARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKF

Query:  DTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI------
        DTAIKSYGF QNVDEPCVYK+I    VAFLVLYVDDILLIGNDVG +TD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNK LALSQA+YIDK+      
Subjt:  DTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI------

Query:  ---------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLIL
                                                                       YQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLIL
Subjt:  ---------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLIL

Query:  IRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR
          YTDSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKR
Subjt:  IRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKR

Query:  GKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        GKHIERKYHLIREIVQRGDVIVT+IASEHNI DPFTK LT KVFEGHL SLGLR M
Subjt:  GKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

A0A5D3CPJ6 Gag/pol protein0.0e+0055Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +N  RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                V E N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  K  PS+S +   KKK  G+G K  +   +  K  KA KG CF+CN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSKNKFGPSTSESVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARGG+EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSVSETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDHIR H+PRSK+VL+E+SKE T+ +TRVV++    TRVV  V +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD
         RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVD
Subjt:  DRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP
        YEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNL+E+IYM + EGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIKSYGF Q VDEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEP

Query:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------
        CVYKRI+N +VAFLVLYVDDILLIGND+G++TDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK+LALSQASYIDKI                      
Subjt:  CVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI----------------------

Query:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK
                                                       YQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLIL  YTDSDFQTD DSRK
Subjt:  -----------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRK

Query:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        STSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV 
Subjt:  STSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Query:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM
        RGDVIVTQIAS HN+ DPFTK LT KVFEGHL SLGLR M
Subjt:  RGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM

E2GK51 Gag/pol protein (Fragment)0.0e+0057.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  N NRTVR+AYDRW+KANDKARVYILAS++DVLAKKHD              
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHD--------------

Query:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
                                                + E N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  ----------------------------------------VMEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKFGPSTSESVLMKKKGKGKNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCF+CN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSKNKFGPSTSESVLMKKKGKGKNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARGGYEYFISFIDD+SRY ++YL+HHKSE+ EKF+EYK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSV ETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK +        ST+VVD  + S QSH SQELR+PRRSGRV+ QP+RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE
        L L ETQ+IIPDDGVEDPLTYK AMND DRDQWIKAMNLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRDQ GKVQTFKARLVAKGYTQ EGVDYEE
Subjt:  LDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEE

Query:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY
        TFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNL+ESIYM + EGFI QD EQ+VCKL++SIYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVY
Subjt:  TFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY

Query:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------
        K+IVNS VAFL+LYVDDILLIGNDV  +TD+K WL TQFQMKDLGEAQ++LGIQIVRNRKNK LA+SQASYIDK+                         
Subjt:  KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-------------------------

Query:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS
                                                    YQSNPGR HWTAVKNILKYLRRTR+YML+YGAKDLIL  YTDSDFQ+D D+RKSTS
Subjt:  --------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTS

Query:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD
        GSVFTLNGGA++WRS+KQ CIADSTMEAEYVAACEAAKE+VWLRKFLTDLEVVPNMHLP+TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV RGD
Subjt:  GSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQ
        V+VTQ
Subjt:  VIVTQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.9e-9827.74Show/hide
Query:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG
        K  L +VH+D+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + E      +  L  D G EY+    + + ++ GI   L+ P 
Subjt:  KETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG

Query:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSV---SETPYELWKGRKGYPK-------------ETKGGLF---------
         PQ NGVSER  RT+ +  R+M+S A+L   FWG AV TATY++N +P++++   S+TPYE+W  +K Y K             + K G F         
Subjt:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSV---SETPYELWKGRKGYPK-------------ETKGGLF---------

Query:  --YDPQENRVFVSTNATFL--------EEDHIRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGPS-TRVVDGVDTSGQSHPSQELRMPRRSGRVI-
          Y+P   +++ + N  F+        E + + +   + + V  + SKE+ +K     + +++    P+ ++  D +     S  S+    P  S ++I 
Subjt:  --YDPQENRVFVSTNATFL--------EEDHIRNHQPRSKLVLSEISKEATDK-----TTRVVDQAGPS-TRVVDGVDTSGQSHPSQELRMPRRSGRVI-

Query:  --------------------------------TQPDRYLD------------LAETQVIIPDDGVEDP---------------------LTY--------
                                         + D +L+             +ET   + + G+++P                     ++Y        
Subjt:  --------------------------------TQPDRYLD------------LAETQVIIPDDGVEDP---------------------LTY--------

Query:  KLAMN-----------------DEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSP
        K+ +N                  +D+  W +A+N E+ +   N+ W +  +P+    +  +W++  K ++ G    +KARLVA+G+TQ   +DYEETF+P
Subjt:  KLAMN-----------------DEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSP

Query:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---K
        VA + S R +LS+   Y+ ++ QMDVKTAFLNG L E IYM   +G     +   VCKL ++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K
Subjt:  VAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---K

Query:  RIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI--------------------------
          +N  + +++LYVDD+++   D+  M + K +L  +F+M DL E +  +GI+I   +++KI  LSQ++Y+ KI                          
Subjt:  RIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI--------------------------

Query:  ------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILIRYTDSDFQTDVDSRKSTSGSVFT
                                            Y S      W  +K +L+YL+ T D  L++    A +  +I Y DSD+      RKST+G +F 
Subjt:  ------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILIRYTDSDFQTDVDSRKSTSGSVFT

Query:  L-NGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT
        + +   I W + +Q  +A S+ EAEY+A  EA +E++WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + + 
Subjt:  L-NGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT

Query:  QIASEHNIVDPFTKLLTTKVFEGHLVSLGL
         I +E+ + D FTK L    F      LGL
Subjt:  QIASEHNIVDPFTKLLTTKVFEGHLVSLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-14436Show/hide
Query:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA
        R    L+LV++D+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F+++   VE   G+ +K LRSD GGEY    F++Y   HGIR + + 
Subjt:  RAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA

Query:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYELWKGRK---------------------------------
        PG PQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA Y++N  P+  ++ E P  +W  ++                                 
Subjt:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYELWKGRK---------------------------------

Query:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGVDTSGQ--SHP
          GY  E  G   +DP + +V  S +  F  E  +R     S+ V + I                ++  TD+ +   +Q G      + +D   +   HP
Subjt:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGVDTSGQ--SHP

Query:  SQ--ELRMP-RRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK
        +Q  E   P RRS R   +  RY   +   V+I DD   +P + K  ++  +++Q +KAM  EMES+  N  ++LV+ P G +P+ CKW++K K+D   K
Subjt:  SQ--ELRMP-RRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGK

Query:  VQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSW
        +  +KARLV KG+ Q +G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+L+E IYM + EGF     +  VCKL +S+YGLKQA R W
Subjt:  VQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSW

Query:  NIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-
         +KFD+ +KS  + +   +PCVY KR   +    L+LYVDD+L++G D G++  +K  L+  F MKDLG AQ +LG++IVR R ++ L LSQ  YI+++ 
Subjt:  NIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-

Query:  --------------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA
                                                                            +  NPG+ HW AVK IL+YLR T    L +G 
Subjt:  --------------------------------------------------------------------YQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA

Query:  KDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEP
         D IL  YTD+D   D+D+RKS++G +FT +GGAI W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   +YCD+  A+  SK  
Subjt:  KDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEP

Query:  RSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFE
          H R KHI+ +YH IRE+V    + V +I++  N  D  TK++    FE
Subjt:  RSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFE

P25600 Putative transposon Ty5-1 protein YCL074W4.4e-2128.9Show/hide
Query:  MDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI
        MDV TAFLN  +DE IY+ +  GF+ + +   V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +    ++ +YVDD+L+      I
Subjt:  MDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGI

Query:  MTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-----------------------------------YQS---------NPGR-
           +K  L   + MKDLG+    LG+ I     N  + LS   YI K                                    YQS         N GR 
Subjt:  MTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKI-----------------------------------YQS---------NPGR-

Query:  ------------------AHWTAVKNILKYLRRTRDYMLMY-GAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIK-QGCIADSTMEAEYV
                           H  + + +L+YL  TR   L Y     L L  Y D+      D   ST G V  L G  + W S K +G I   + EAEY+
Subjt:  ------------------AHWTAVKNILKYLRRTRDYMLMY-GAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIK-QGCIADSTMEAEYV

Query:  AACEAAKE
         A E   E
Subjt:  AACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.8e-8225.9Show/hide
Query:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY
        +C   L  K  K   +Q    +   LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GGE+
Subjt:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYEL------------------
        + L   +Y  +HGI    S P  P+ NG+SER++R +++   +++S A +P  +W YA   A Y++N +PT  +  E+P++                   
Subjt:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYEL------------------

Query:  --W-----------KGRK----GYPKETKGGLFYDPQENRVFVSTNATFLE---------------------------------------------EDH-
          W           K R+    GY       L    Q +R+++S +  F E                                             + H 
Subjt:  --W-----------KGRK----GYPKETKGGLFYDPQENRVFVSTNATFLE---------------------------------------------EDH-

Query:  -----------IRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS-----------------QELRMPRRSGRVITQPDRYLDLA
                    RN Q  S  + S  S            +            T  Q+H S                 Q L  P +S      P      +
Subjt:  -----------IRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPS-----------------QELRMPRRSGRVITQPDRYLDLA

Query:  ETQVIIPDDGVEDP------------------------------------LTYKLAMNDEDR--------DQWIKAMNLEMESMYFNSVWELVDQPDG-V
         T    P   +  P                                    L   LA   E R        ++W  AM  E+ +   N  W+LV  P   V
Subjt:  ETQVIIPDDGVEDP------------------------------------LTYKLAMNDEDR--------DQWIKAMNLEMESMYFNSVWELVDQPDG-V

Query:  KPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQR
          +GC+WI+ +K +  G +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM++  GFI++D    
Subjt:  KPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQR

Query:  VCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNR
        VCKL++++YGLKQA R+W ++    + + GF  +V +  ++      ++ ++++YVDDIL+ GND  ++ +    L+ +F +KD  E  + LGI+  R  
Subjt:  VCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNR

Query:  KNKILALSQASYI-DKIYQSN-------------------------------------------------------------PGRAHWTAVKNILKYLRR
            L LSQ  YI D + ++N                                                             P   H  A+K IL+YL  
Subjt:  KNKILALSQASYI-DKIYQSN-------------------------------------------------------------PGRAHWTAVKNILKYLRR

Query:  TRDY-MLMYGAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCD
        T ++ + +     L L  Y+D+D+  D D   ST+G +  L    I W S KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCD
Subjt:  TRDY-MLMYGAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCD

Query:  NSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGL
        N GA      P  H R KHI   YH IR  VQ G + V  +++   + D  TK L+   F+     +G+
Subjt:  NSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.8e-8426.16Show/hide
Query:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP
        + + LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P
Subjt:  AKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP

Query:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYELWKGR-----------------------------------
          P+ NG+SER++R +++M  +++S A +P  +W YA   A Y++N +PT  +  ++P++   G+                                   
Subjt:  GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVS-ETPYELWKGR-----------------------------------

Query:  KGYPKETKGGLFYDPQENRVFVSTNATFLE-------------------EDHIRNHQ-----PRSKLVL--------------SEISKEATDKTTRVVDQ
         GY       L       R++ S +  F E                    D   N       P + LVL                 S  +   TT+V   
Subjt:  KGYPKETKGGLFYDPQENRVFVSTNATFLE-------------------EDHIRNHQ-----PRSKLVL--------------SEISKEATDKTTRVVDQ

Query:  AGPSTRVV------------DGVDTSGQSHPSQE-------LRMPRRSGRVITQPDRYLDLAETQVIIP-------------------------------
          PS+ +             +G   + Q H +Q        L  P  +      P++   L ++ +  P                               
Subjt:  AGPSTRVV------------DGVDTSGQSHPSQE-------LRMPRRSGRVITQPDRYLDLAETQVIIP-------------------------------

Query:  ---------------------DDGVEDP-----LTYKLAMNDEDR--------DQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTG
                              DG+  P         LA N E R        D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +  G
Subjt:  ---------------------DDGVEDP-----LTYKLAMNDEDR--------DQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTG

Query:  KVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRS
         +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM++  GF+++D    VC+L+++IYGLKQA R+
Subjt:  KVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRVCKLKRSIYGLKQASRS

Query:  WNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASY-IDKI
        W ++  T + + GF  ++ +  ++      ++ ++++YVDDIL+ GND  ++      L+ +F +K+  +  + LGI+  R  +   L LSQ  Y +D +
Subjt:  WNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASY-IDKI

Query:  YQSN-------------------------------------------------------------PGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILI
         ++N                                                             P   HW A+K +L+YL  T D+ + +     L L 
Subjt:  YQSN-------------------------------------------------------------PGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILI

Query:  RYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRG
         Y+D+D+  D D   ST+G +  L    I W S KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN GA      P  H R 
Subjt:  RYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRG

Query:  KHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGL
        KHI   YH IR  VQ G + V  +++   + D  TK L+   F+     +G+
Subjt:  KHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-6933.68Show/hide
Query:  EDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILL
        ++P TY  A   ++   W  AM+ E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KARLVAKGYTQ EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL
        +I+  Y++ + Q+D+  AFLNG+LDE IYM    G+  +  +      VC LK+SIYGLKQASR W +KF   +  +GF Q+  +   + +I  +    +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL

Query:  VLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRN--------RKNKI------------------------LALSQASYID-KIYQS
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R+        RK  +                         A S   ++D K Y+ 
Subjt:  VLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRN--------RKNKI------------------------LALSQASYID-KIYQS

Query:  NPGR---------------------------AHWTAVKNILKYLRRTRDYMLMYGAK-DLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQG
          GR                           AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+  D+R+ST+G    L    I W+S KQ 
Subjt:  NPGR---------------------------AHWTAVKNILKYLRRTRDYMLMYGAK-DLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQG

Query:  CIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
         ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  CIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein7.5e-0823.89Show/hide
Query:  FLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKIYQS-------------------------------
        +L+LYVDDILL G+   ++  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y ++I  +                               
Subjt:  FLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQASYIDKIYQS-------------------------------

Query:  ------------------------------NPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI
                                       P  A +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L    I W + 
Subjt:  ------------------------------NPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI

Query:  KQGCIADSTMEAEYVAACEAAKESVW
        +Q  ++ S+ E EY A    A E  W
Subjt:  KQGCIADSTMEAEYVAACEAAKESVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.3e-1537.86Show/hide
Query:  EDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILL
        ++P +   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAGTTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCCAACTCAAATGTAAACCGAACAGTTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGAAGAAAATGAAGCTGTCATTGATGAGAAGAGCCAAGTCAGTTTTATCATGATGTCTCTTCCGAAG
AGCTTCTTCCAGTTCCGCACCAATGTGGTAATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACA
AACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTTTGGACCTTCAACTTCTGAAAGTGTTTTGATGAAGAAGAAGGGAA
AAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCAATTGCAACGAAAACGGGCACTGGAAGAGAAACTGCCCAAAA
TACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGG
AGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTG
AGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAATACATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCT
CAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCT
GTTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCA
AAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACATAAGAAATCATCAACCTCGTAGC
AAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGTTGACACTTCTGGTCA
ATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGATTTGGCAGAAACTCAAGTCATCATACCTGATGATG
GCGTTGAGGATCCATTGACCTATAAACTGGCAATGAATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGG
GAACTTGTAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGC
AAAGGGTTATACCCAAAGTGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATTAGAATACTCTTGTCTATTGCCACATTTTATGATTATG
AAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGATGAGAGTATCTATATGGCTAAACTAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTT
TGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACC
TTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTATGACTGACATTAAGCATT
GGCTGGCGACACAATTTCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTGTTCGGAATCGCAAGAACAAAATACTAGCATTGTCTCAAGCATCG
TACATCGACAAAATATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGC
TAAGGATCTGATCCTTATAAGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATAT
GGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACGGAT
TTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGA
GCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGTTGATCCATTTACAAAGCTCCTCACGACTA
AAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAGTTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCCAACTCAAATGTAAACCGAACAGTTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGAAGAAAATGAAGCTGTCATTGATGAGAAGAGCCAAGTCAGTTTTATCATGATGTCTCTTCCGAAG
AGCTTCTTCCAGTTCCGCACCAATGTGGTAATGAACAAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACA
AACAGGAGAAGCAAATGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTTTGGACCTTCAACTTCTGAAAGTGTTTTGATGAAGAAGAAGGGAA
AAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGGAAAATGTTTCAATTGCAACGAAAACGGGCACTGGAAGAGAAACTGCCCAAAA
TACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGG
AGGGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTG
AGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGAATACATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCT
CAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCT
GTTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCA
AAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAACGAACGCTACATTCTTAGAGGAAGACCACATAAGAAATCATCAACCTCGTAGC
AAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACCAGAGTTGTTGATGGAGTTGACACTTCTGGTCA
ATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGATTTGGCAGAAACTCAAGTCATCATACCTGATGATG
GCGTTGAGGATCCATTGACCTATAAACTGGCAATGAATGATGAGGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGG
GAACTTGTAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGC
AAAGGGTTATACCCAAAGTGAGGGGGTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATTAGAATACTCTTGTCTATTGCCACATTTTATGATTATG
AAATTTGGCAAATGGATGTTAAGACAGCTTTTCTAAACGGCAATCTTGATGAGAGTATCTATATGGCTAAACTAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTT
TGCAAGCTTAAAAGATCCATTTATGGGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACC
TTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTATGACTGACATTAAGCATT
GGCTGGCGACACAATTTCAAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTGTTCGGAATCGCAAGAACAAAATACTAGCATTGTCTCAAGCATCG
TACATCGACAAAATATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGC
TAAGGATCTGATCCTTATAAGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATAT
GGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACGGAT
TTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGA
GCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAGAGCACAACATTGTTGATCCATTTACAAAGCTCCTCACGACTA
AAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNVNRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMEENEAVIDEKSQVSFIMMSLPK
SFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSKNKFGPSTSESVLMKKKGKGKNKIPTNRKNKVQKADKGKCFNCNENGHWKRNCPK
YLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGGYEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDLFWGYAVETATYILNMVPTKSVSETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEEDHIRNHQPRS
KLVLSEISKEATDKTTRVVDQAGPSTRVVDGVDTSGQSHPSQELRMPRRSGRVITQPDRYLDLAETQVIIPDDGVEDPLTYKLAMNDEDRDQWIKAMNLEMESMYFNSVW
ELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQSEGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIYMAKLEGFIEQDHEQRV
CKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYVDDILLIGNDVGIMTDIKHWLATQFQMKDLGEAQFVLGIQIVRNRKNKILALSQAS
YIDKIYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILIRYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTD
LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIVDPFTKLLTTKVFEGHLVSLGLRVM