; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0011461 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0011461
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:6809343..6811458
RNA-Seq ExpressionCmc01g0011461
SyntenyCmc01g0011461
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU46010.1 hypothetical protein TSUD_401320 [Trifolium subterraneum]1.1e-19249.6Show/hide
Query:  GLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHW---SSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDI
        GL ANLISISQLCDQG  VNF  T C+VTD+  ++ M G    DNCY W      S + C + K D+  LWH++LG+++L S+ K I  EAI G+P L I
Subjt:  GLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHW---SSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDI

Query:  NGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSD
             CGDCQ+GKQTKT H+ L+   T RVLELLH+DL+GPMQ  SLG K+Y  VVV D+ ++TW+ F+K KSDT  +   LC+ LQR K + I+RI SD
Subjt:  NGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSD

Query:  HGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHI
        HG EF +   + F  SEGI HEF++PIT QQNGVVERKNRTLQE A+ M+H K L  +FW EA+NTAC+IHNRVT RSGTT TLYELWK RKP VKYFH+
Subjt:  HGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHI

Query:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVVIRQLES---------------------LPEGKRKLITS--------
        FGS CYIL DRE  RK D K D+GIFLGYS NSR Y+V+N ++  +ME+IN+V+    E                      L E    + TS        
Subjt:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVVIRQLES---------------------LPEGKRKLITS--------

Query:  -------------------------------------------------VEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDE
                                                         V++AL DEYWINAMQEEL QFK N V  LV +P+ VNVI TKW++KNK+DE
Subjt:  -------------------------------------------------VEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDE

Query:  SGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQAL
        +GNVT+NKARLVAQGYA++EGVDFDETFA VA LE+IRLLL ++C  KF+L+QMDVKSAFL+GYLN EVYV QPKGFVD   P +VYKL KALYGLKQA 
Subjt:  SGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQAL

Query:  RVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGF---------------------------------------------------
        R WYERLT +L  +GY +G  +K LF+      LI+AQIYVDDIIFGG                                                    
Subjt:  RVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGF---------------------------------------------------

Query:  ------------PQTTTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAA
                        TH K+TKD     +D  LY+SMIGSLLYLTA+
Subjt:  ------------PQTTTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAA

KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.6e-20858.13Show/hide
Query:  CVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLT-KTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYT
        C +T  NN V   G         W  K+S  C++   T Q  +   KL +ISL SLDKVIRNEA+VGIPSLDINGKFFCGDCQVGKQTKTSHR LKECY 
Subjt:  CVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLT-KTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYT

Query:  IRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPI
        IRVLELLHLDL+GPMQTESL  KKYVLVVV DY+ FTWVRFLK KSDT+KLCISLC+NLQR KG+KII++ SDHG EFD+E+LNNF +++GIHHEF API
Subjt:  IRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPI

Query:  TLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFL
        T QQNGVVERKNRTLQEMA+VMIH  NLPLNF  EAVNT CHI  +                         H    TCYIL DREYHRKWDVK DQGIFL
Subjt:  TLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFL

Query:  GYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEG-------------------------------KRKLITSV
        GYS NSR Y+VFNIKSGTVME IN+VV      + Q                  L+ +P+G                               K  L +S+
Subjt:  GYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEG-------------------------------KRKLITSV

Query:  --------------EKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEA
                      + ALKDEYWIN MQEELLQFK NN+ TLV KPD  N+I TKWIFKNKTDES +V +N+ARLVAQGYA+V+GVDF++TFA VARLEA
Subjt:  --------------EKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEA

Query:  IRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIV
        IRLLLSISCFRKFKL+QMDVKSAFL+GYLN EVYVAQ K FVDS+FPQYVYK NKALYGLKQA R WYE+LTMYL ERGYSRGE +K LFINRTST LIV
Subjt:  IRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIV

Query:  AQIYVDDIIFGGFPQT---------------------------------------------------------------TTHAKITKDIVDTTIDHKLYR
        AQIYVDDIIFGGFP+T                                                                THAKI KD VD  +DHKLYR
Subjt:  AQIYVDDIIFGGFPQT---------------------------------------------------------------TTHAKITKDIVDTTIDHKLYR

Query:  SMIGSLLYLTAADLISPMLLEYVLGI
        SMIGSLLYLTA    S   + YV+GI
Subjt:  SMIGSLLYLTAADLISPMLLEYVLGI

KAA0042877.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.3e-21658.16Show/hide
Query:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN
        +DGLKANLIS+SQLCDQGYSVNFNNTGCVVTD NNQVFMSG+R+ADNCYHW+S  SNICHLTK  QTWLWHRKLGYISL SLDKVI NEA+VGIPSLDIN
Subjt:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN

Query:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH
        GKFFCGDC+VGKQTKTSH+ LKECYTIRVL+LLHLDL+G MQTE                                              +KII+I SDH
Subjt:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH

Query:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF
        G EFD+E+LNNF ++EGIHHEFAAPIT QQNGVVERKNRTLQEMA+VMIH KNLPLNFW EAVNTACHIHNRVTTRS T VTL                 
Subjt:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF

Query:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQLESLPEGKRKLITSVEKALKDEYWINAMQEELL-----
                  EYHRKWDVK DQGIFLGYSQNSR Y+VFNIKS TVMETIN++V      + Q  ++ + +  +   V     DE      Q +       
Subjt:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQLESLPEGKRKLITSVEKALKDEYWINAMQEELL-----

Query:  ---QFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYL
           +FKCNNV TLV KPD  N+I TKWIFKNKTDESG+V +NKARLVAQGYA+VEGVD DETFA VAR EAI LL SI+CFRKFKL+QMDVKSAFL+GYL
Subjt:  ---QFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYL

Query:  NVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQ--------------
        N EVYVAQP+ FVD +FPQYVYKLNKALYGLKQA R WY+ LTMYLGERGYSRGET+K LFINRTSTDLIVAQIYVDDIIFGGFP+              
Subjt:  NVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQ--------------

Query:  ------------------------------------------------TTTHAKITKD------------------------------------------
                                                        TTTHAKITKD                                          
Subjt:  ------------------------------------------------TTTHAKITKD------------------------------------------

Query:  ---IVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI
           IV T +DHK YRSMIGSLLYLTA    S   + YV+GI
Subjt:  ---IVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.5e-21862.83Show/hide
Query:  VTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRV
        +TD NNQV MSGRR++DNCYHWSS  SNICHLTK DQTWLWHRKLG+ISL SLDKVIRNEA+VGIPSLDIN KFFCGDCQVGKQTK+SH  LKECYTIRV
Subjt:  VTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRV

Query:  LELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQ
        LELLHLDL+GPMQTESLG KKYVLVVV DY +FTWV FLKGKSDT KLCISLCLNLQ  KG+KIIRI SDH  EFD+E+LNNF + EGIHHE AAPIT Q
Subjt:  LELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQ

Query:  QNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYS
        QNGVVERKNRTLQEMA+VMIH KNLPLNFW EAVNTACHIHNRVTTRSGTTVTLYELWKGRKPN+                                   
Subjt:  QNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYS

Query:  QNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDG
          +R Y+VFNIKSGTVMETIN+VV      I Q                  L+ +P+      TS+E ALKDEY IN MQEELLQFK NNV TLV KP+G
Subjt:  QNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDG

Query:  VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQ
         N                                VEGVDFDETFA VARLEAIRLLLSISCFRKFKL+QM VKSAFL+GYLN EVYV QPKGFVD +FPQ
Subjt:  VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQ

Query:  YVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT--------------------------------
        YVYKLNKALY LKQA + WYERLTMYLGER YSRGET+K LFINRTST+LIVAQIYVDDIIFGGFP+T                                
Subjt:  YVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT--------------------------------

Query:  -------------------------------TTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI
                                        THAKITKD+V T +DHKLYRSMIGSLLYLT     S   + YV+GI
Subjt:  -------------------------------TTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa]4.3e-20061.57Show/hide
Query:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN
        +DGLK NLIS+SQLCDQGYSVNFNNT CV TD NNQVF+SGRR+A+NC HWSS  SNICHLTK DQTWLWHRKLG+ISL SLDKVIRN+A+VGIPSLDIN
Subjt:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN

Query:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH
        GKFFCGDC+VGKQTK SHR LKECYTIRVLELLHLDLIGPM+TESLGRKKYVLVVV DY +FT VRFLKGKSDTVKLCISL LNLQR KG+KIIRI SDH
Subjt:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH

Query:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF
        G EFD+E+LNNF ++EGIHHEF APIT QQNGVVERKNRT                                VTTRSGTT+ LYELWKGRKPNVKYFHIF
Subjt:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF

Query:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRK----------
        GSTCYIL DR YHRKWDVK DQ IFLGYSQNSR Y+VFNIKS TVMETIN+VV      + Q                  L+ +P+G  +          
Subjt:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRK----------

Query:  ----------LI------------------TSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYA
                  L+                  TSVE ALKDEY INAMQEELLQFK NNV TLV KPD  NVI TKWIFKNKTDESGN+             
Subjt:  ----------LI------------------TSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYA

Query:  KVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYS
                    L+A      +  ++ CF K+ L                                +YVYKLNKALYGLKQA   WYERLTMYLGERGYS
Subjt:  KVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYS

Query:  RGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT
        RGET+K +F+NRT+ DLIVAQIYVDDIIFGGFP+T
Subjt:  RGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT

TrEMBL top hitse value%identityAlignment
A0A2Z6P936 Integrase catalytic domain-containing protein5.5e-19349.6Show/hide
Query:  GLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHW---SSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDI
        GL ANLISISQLCDQG  VNF  T C+VTD+  ++ M G    DNCY W      S + C + K D+  LWH++LG+++L S+ K I  EAI G+P L I
Subjt:  GLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHW---SSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDI

Query:  NGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSD
             CGDCQ+GKQTKT H+ L+   T RVLELLH+DL+GPMQ  SLG K+Y  VVV D+ ++TW+ F+K KSDT  +   LC+ LQR K + I+RI SD
Subjt:  NGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSD

Query:  HGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHI
        HG EF +   + F  SEGI HEF++PIT QQNGVVERKNRTLQE A+ M+H K L  +FW EA+NTAC+IHNRVT RSGTT TLYELWK RKP VKYFH+
Subjt:  HGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHI

Query:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVVIRQLES---------------------LPEGKRKLITS--------
        FGS CYIL DRE  RK D K D+GIFLGYS NSR Y+V+N ++  +ME+IN+V+    E                      L E    + TS        
Subjt:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVVIRQLES---------------------LPEGKRKLITS--------

Query:  -------------------------------------------------VEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDE
                                                         V++AL DEYWINAMQEEL QFK N V  LV +P+ VNVI TKW++KNK+DE
Subjt:  -------------------------------------------------VEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDE

Query:  SGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQAL
        +GNVT+NKARLVAQGYA++EGVDFDETFA VA LE+IRLLL ++C  KF+L+QMDVKSAFL+GYLN EVYV QPKGFVD   P +VYKL KALYGLKQA 
Subjt:  SGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQAL

Query:  RVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGF---------------------------------------------------
        R WYERLT +L  +GY +G  +K LF+      LI+AQIYVDDIIFGG                                                    
Subjt:  RVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGF---------------------------------------------------

Query:  ------------PQTTTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAA
                        TH K+TKD     +D  LY+SMIGSLLYLTA+
Subjt:  ------------PQTTTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAA

A0A5D3C1P5 Gag-pol polyprotein2.1e-21658.16Show/hide
Query:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN
        +DGLKANLIS+SQLCDQGYSVNFNNTGCVVTD NNQVFMSG+R+ADNCYHW+S  SNICHLTK  QTWLWHRKLGYISL SLDKVI NEA+VGIPSLDIN
Subjt:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN

Query:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH
        GKFFCGDC+VGKQTKTSH+ LKECYTIRVL+LLHLDL+G MQTE                                              +KII+I SDH
Subjt:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH

Query:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF
        G EFD+E+LNNF ++EGIHHEFAAPIT QQNGVVERKNRTLQEMA+VMIH KNLPLNFW EAVNTACHIHNRVTTRS T VTL                 
Subjt:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF

Query:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQLESLPEGKRKLITSVEKALKDEYWINAMQEELL-----
                  EYHRKWDVK DQGIFLGYSQNSR Y+VFNIKS TVMETIN++V      + Q  ++ + +  +   V     DE      Q +       
Subjt:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQLESLPEGKRKLITSVEKALKDEYWINAMQEELL-----

Query:  ---QFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYL
           +FKCNNV TLV KPD  N+I TKWIFKNKTDESG+V +NKARLVAQGYA+VEGVD DETFA VAR EAI LL SI+CFRKFKL+QMDVKSAFL+GYL
Subjt:  ---QFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYL

Query:  NVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQ--------------
        N EVYVAQP+ FVD +FPQYVYKLNKALYGLKQA R WY+ LTMYLGERGYSRGET+K LFINRTSTDLIVAQIYVDDIIFGGFP+              
Subjt:  NVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQ--------------

Query:  ------------------------------------------------TTTHAKITKD------------------------------------------
                                                        TTTHAKITKD                                          
Subjt:  ------------------------------------------------TTTHAKITKD------------------------------------------

Query:  ---IVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI
           IV T +DHK YRSMIGSLLYLTA    S   + YV+GI
Subjt:  ---IVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI

A0A5D3CS19 Gag-pol polyprotein1.7e-21862.83Show/hide
Query:  VTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRV
        +TD NNQV MSGRR++DNCYHWSS  SNICHLTK DQTWLWHRKLG+ISL SLDKVIRNEA+VGIPSLDIN KFFCGDCQVGKQTK+SH  LKECYTIRV
Subjt:  VTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRV

Query:  LELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQ
        LELLHLDL+GPMQTESLG KKYVLVVV DY +FTWV FLKGKSDT KLCISLCLNLQ  KG+KIIRI SDH  EFD+E+LNNF + EGIHHE AAPIT Q
Subjt:  LELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQ

Query:  QNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYS
        QNGVVERKNRTLQEMA+VMIH KNLPLNFW EAVNTACHIHNRVTTRSGTTVTLYELWKGRKPN+                                   
Subjt:  QNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYS

Query:  QNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDG
          +R Y+VFNIKSGTVMETIN+VV      I Q                  L+ +P+      TS+E ALKDEY IN MQEELLQFK NNV TLV KP+G
Subjt:  QNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDG

Query:  VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQ
         N                                VEGVDFDETFA VARLEAIRLLLSISCFRKFKL+QM VKSAFL+GYLN EVYV QPKGFVD +FPQ
Subjt:  VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQ

Query:  YVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT--------------------------------
        YVYKLNKALY LKQA + WYERLTMYLGER YSRGET+K LFINRTST+LIVAQIYVDDIIFGGFP+T                                
Subjt:  YVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT--------------------------------

Query:  -------------------------------TTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI
                                        THAKITKD+V T +DHKLYRSMIGSLLYLT     S   + YV+GI
Subjt:  -------------------------------TTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGI

A0A5D3DQT9 Gag/pol polyprotein2.1e-20061.57Show/hide
Query:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN
        +DGLK NLIS+SQLCDQGYSVNFNNT CV TD NNQVF+SGRR+A+NC HWSS  SNICHLTK DQTWLWHRKLG+ISL SLDKVIRN+A+VGIPSLDIN
Subjt:  MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDIN

Query:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH
        GKFFCGDC+VGKQTK SHR LKECYTIRVLELLHLDLIGPM+TESLGRKKYVLVVV DY +FT VRFLKGKSDTVKLCISL LNLQR KG+KIIRI SDH
Subjt:  GKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDH

Query:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF
        G EFD+E+LNNF ++EGIHHEF APIT QQNGVVERKNRT                                VTTRSGTT+ LYELWKGRKPNVKYFHIF
Subjt:  GNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIF

Query:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRK----------
        GSTCYIL DR YHRKWDVK DQ IFLGYSQNSR Y+VFNIKS TVMETIN+VV      + Q                  L+ +P+G  +          
Subjt:  GSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEGKRK----------

Query:  ----------LI------------------TSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYA
                  L+                  TSVE ALKDEY INAMQEELLQFK NNV TLV KPD  NVI TKWIFKNKTDESGN+             
Subjt:  ----------LI------------------TSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYA

Query:  KVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYS
                    L+A      +  ++ CF K+ L                                +YVYKLNKALYGLKQA   WYERLTMYLGERGYS
Subjt:  KVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYS

Query:  RGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT
        RGET+K +F+NRT+ DLIVAQIYVDDIIFGGFP+T
Subjt:  RGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQT

A0A5D3DSN1 Gag-pol polyprotein2.7e-20858.13Show/hide
Query:  CVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLT-KTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYT
        C +T  NN V   G         W  K+S  C++   T Q  +   KL +ISL SLDKVIRNEA+VGIPSLDINGKFFCGDCQVGKQTKTSHR LKECY 
Subjt:  CVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLT-KTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYT

Query:  IRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPI
        IRVLELLHLDL+GPMQTESL  KKYVLVVV DY+ FTWVRFLK KSDT+KLCISLC+NLQR KG+KII++ SDHG EFD+E+LNNF +++GIHHEF API
Subjt:  IRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPI

Query:  TLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFL
        T QQNGVVERKNRTLQEMA+VMIH  NLPLNF  EAVNT CHI  +                         H    TCYIL DREYHRKWDVK DQGIFL
Subjt:  TLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFL

Query:  GYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEG-------------------------------KRKLITSV
        GYS NSR Y+VFNIKSGTVME IN+VV      + Q                  L+ +P+G                               K  L +S+
Subjt:  GYSQNSREYKVFNIKSGTVMETINIVV------IRQ------------------LESLPEG-------------------------------KRKLITSV

Query:  --------------EKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEA
                      + ALKDEYWIN MQEELLQFK NN+ TLV KPD  N+I TKWIFKNKTDES +V +N+ARLVAQGYA+V+GVDF++TFA VARLEA
Subjt:  --------------EKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEA

Query:  IRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIV
        IRLLLSISCFRKFKL+QMDVKSAFL+GYLN EVYVAQ K FVDS+FPQYVYK NKALYGLKQA R WYE+LTMYL ERGYSRGE +K LFINRTST LIV
Subjt:  IRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIV

Query:  AQIYVDDIIFGGFPQT---------------------------------------------------------------TTHAKITKDIVDTTIDHKLYR
        AQIYVDDIIFGGFP+T                                                                THAKI KD VD  +DHKLYR
Subjt:  AQIYVDDIIFGGFPQT---------------------------------------------------------------TTHAKITKDIVDTTIDHKLYR

Query:  SMIGSLLYLTAADLISPMLLEYVLGI
        SMIGSLLYLTA    S   + YV+GI
Subjt:  SMIGSLLYLTAADLISPMLLEYVLGI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-5324.36Show/hide
Query:  NLISISQLCDQGYSVNFNNTGCVVTDNNNQVFM-SGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVG---IPSLDINGK
        NL+S+ +L + G S+ F+ +G  ++ N   V   SG        ++ + S N  H    +   LWH + G+IS   L ++ R         + +L+++ +
Subjt:  NLISISQLCDQGYSVNFNNTGCVVTDNNNQVFM-SGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVG---IPSLDINGK

Query:  FFCGDCQVGKQTKTSHRSLKE-CYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHG
          C  C  GKQ +   + LK+  +  R L ++H D+ GP+   +L  K Y ++ V  +  +     +K KSD   +        +     K++ ++ D+G
Subjt:  FFCGDCQVGKQTKTSHRSLKE-CYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHG

Query:  NEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRS--GTTVTLYELWKGRKPNVKYFHI
         E+ S E+  F   +GI +    P T Q NGV ER  RT+ E A+ M+    L  +FW EAV TA ++ NR+ +R+   ++ T YE+W  +KP +K+  +
Subjt:  NEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRS--GTTVTLYELWKGRKPNVKYFHI

Query:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREY-----KVFNIKSGTVMETINIVVIRQL-------------------------------------
        FG+T Y+   +    K+D K  + IF+GY  N  +      + F +    V++  N+V  R +                                     
Subjt:  FGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREY-----KVFNIKSGTVMETINIVVIRQL-------------------------------------

Query:  --------------ESLPEGKRKLI----------------------------TSVEKALKDEY------------------------------------
                      ++ P   RK+I                               +K  +D++                                    
Subjt:  --------------ESLPEGKRKLI----------------------------TSVEKALKDEY------------------------------------

Query:  ------------------------------------------------------WINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNV
                                                              W  A+  EL   K NN  T+  +P+  N++ ++W+F  K +E GN 
Subjt:  ------------------------------------------------------WINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNV

Query:  TKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWY
         + KARLVA+G+ +   +D++ETFA VAR+ + R +LS+      K++QMDVK+AFL+G L  E+Y+  P+G   S     V KLNKA+YGLKQA R W+
Subjt:  TKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWY

Query:  ERLTMYLGERGYSRGETNKRLFI--NRTSTDLIVAQIYVDDII
        E     L E  +     ++ ++I       + I   +YVDD++
Subjt:  ERLTMYLGERGYSRGETNKRLFI--NRTSTDLIVAQIYVDDII

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein1.2e-1423.97Show/hide
Query:  HRKLGYISLISLDKVIR-NEAIVGIPSLDINGKFFCGDCQVGKQTKTSH--RSLKECYTIRVL-ELLHLDLIGPMQTESLGRKKYVLVVVGD--YYKFTW
        H+++G+  +  ++  I+ N     +  +    +F+C  C++ K TK +H   S+    T         +D+ GP+ + +   K+Y+L++V +   Y  T 
Subjt:  HRKLGYISLISLDKVIR-NEAIVGIPSLDINGKFFCGDCQVGKQTKTSH--RSLKECYTIRVL-ELLHLDLIGPMQTESLGRKKYVLVVVGD--YYKFTW

Query:  VRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVN
          F K     +         ++    +K+  I+SD G EF ++++  ++ S+GIHH   +      NG  ER  RT+   A  ++   NL + FW  AV 
Subjt:  VRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVN

Query:  TACHIHNRVTTRSGTTVTLYELWKGRKP---NVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV
        +A +I N +  +S   + L  +   R+P    +  F  FG    I +    H+K        I L    NS  YK F      ++ + N  +
Subjt:  TACHIHNRVTTRSGTTVTLYELWKGRKP---NVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-6428.48Show/hide
Query:  LWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFL
        LWH+++G++S   L  + +   I       +     C  C  GKQ + S ++  E   + +L+L++ D+ GPM+ ES+G  KY +  + D  +  WV  L
Subjt:  LWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFL

Query:  KGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACH
        K K    ++       ++R  G+K+ R+ SD+G E+ S E   +  S GI HE   P T Q NGV ER NRT+ E  + M+ +  LP +FW EAV TAC+
Subjt:  KGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELNNFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACH

Query:  IHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFN----------------------------I
        + NR  +          +W  ++ +  +  +FG   +    +E   K D K    IF+GY      Y++++                            +
Subjt:  IHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRKWDVKYDQGIFLGYSQNSREYKVFN----------------------------I

Query:  KSGTVMETINI-------------------------VVIRQLESLPEG-----------------KRKLITSVE------------------KALKD---
        K+G +   + I                          VI Q E L EG                 +R     VE                  ++LK+   
Subjt:  KSGTVMETINI-------------------------VVIRQLESLPEG-----------------KRKLITSVE------------------KALKD---

Query:  ----EYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLY
               + AMQEE+   + N    LV  P G   +  KW+FK K D    + + KARLV +G+ + +G+DFDE F+ V ++ +IR +LS++     ++ 
Subjt:  ----EYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLY

Query:  QMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTS-TDLIVAQIYVDDIIFGG
        Q+DVK+AFL G L  E+Y+ QP+GF  +     V KLNK+LYGLKQA R WY +   ++  + Y +  ++  ++  R S  + I+  +YVDD++  G
Subjt:  QMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTS-TDLIVAQIYVDDIIFGG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-3740.98Show/hide
Query:  KALKDEYWINAMQEELLQFKCNNVLTLVHKPDG-VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFK
        +ALKDE W NAM  E+     N+   LV  P   V ++  +WIF  K +  G++ + KARLVA+GY +  G+D+ ETF+ V +  +IR++L ++  R + 
Subjt:  KALKDEYWINAMQEELLQFKCNNVLTLVHKPDG-VNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFK

Query:  LYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFP
        + Q+DV +AFL G L  +VY++QP GF+D   P YV KL KALYGLKQA R WY  L  YL   G+    ++  LF+ +    ++   +YVDDI+  G  
Subjt:  LYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFP

Query:  QTTTH
         T  H
Subjt:  QTTTH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.7e-3540.4Show/hide
Query:  KALKDEYWINAMQEELLQFKCNNVLTLV-HKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFK
        +A+KD+ W  AM  E+     N+   LV   P  V ++  +WIF  K +  G++ + KARLVA+GY +  G+D+ ETF+ V +  +IR++L ++  R + 
Subjt:  KALKDEYWINAMQEELLQFKCNNVLTLV-HKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFK

Query:  LYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGG
        + Q+DV +AFL G L  EVY++QP GFVD   P YV +L KA+YGLKQA R WY  L  YL   G+    ++  LF+ +    +I   +YVDDI+  G
Subjt:  LYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-3135.27Show/hide
Query:  KRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLS
        K K  ++  +A +   W  AM +E+   +  +   +   P     I  KW++K K +  G + + KARLVA+GY + EG+DF ETF+ V +L +++L+L+
Subjt:  KRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLS

Query:  ISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFV----DSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQ
        IS    F L+Q+D+ +AFL+G L+ E+Y+  P G+     DS  P  V  L K++YGLKQA R W+ + ++ L   G+ +  ++   F+  T+T  +   
Subjt:  ISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFV----DSKFPQYVYKLNKALYGLKQALRVWYERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQ

Query:  IYVDDII
        +YVDDII
Subjt:  IYVDDII

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1342.71Show/hide
Query:  SVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSIS
        SV  ALKD  W  AMQEEL     N    LV  P   N++  KW+FK K    G + + KARLVA+G+ + EG+ F ET++ V R   IR +L+++
Subjt:  SVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTDESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGACTGAAGGCAAACTTGATTAGTATAAGTCAACTATGTGACCAAGGATACAGTGTAAACTTTAACAACACTGGTTGTGTAGTTACTGACAATAATAAT
CAAGTGTTTATGAGTGGCAGACGACAAGCAGATAACTGTTATCATTGGAGCTCCAAGAGTTCAAACATATGTCACTTAACTAAAACTGACCAAACTTGGTTGTGG
CATAGGAAATTGGGGTACATTAGCTTGATAAGCTTAGATAAAGTTATCAGAAACGAGGCAATTGTAGGCATTCCTTCTTTAGATATCAATGGTAAATTCTTTTGT
GGTGACTGTCAAGTTGGGAAGCAAACTAAAACTTCTCACAGAAGCTTAAAGGAATGTTATACAATAAGAGTCCTTGAACTTCTACATCTTGATCTTATAGGTCCC
ATGCAAACTGAAAGTTTGGGTCGAAAGAAGTATGTGTTGGTTGTTGTAGGTGACTACTACAAATTTACTTGGGTTCGGTTCTTAAAAGGAAAGTCAGATACTGTT
AAATTATGTATCAGTCTCTGTTTGAATTTGCAACGTTTAAAAGGGAAAAAGATAATTAGGATCCATAGTGATCATGGGAATGAGTTTGATAGTGAAGAACTTAAT
AATTTCTATAAGTCGGAAGGAATCCACCATGAATTTGCTGCTCCCATAACTCTTCAGCAAAATGGAGTAGTTGAACGGAAGAACAGAACTTTACAAGAAATGGCT
CAAGTTATGATACATGTCAAAAACTTACCTTTGAATTTTTGGCCTGAAGCTGTAAACACAGCATGTCATATTCACAATAGGGTCACTACTCGTTCTGGTACGACA
GTCACATTATATGAACTATGGAAAGGAAGGAAGCCAAATGTTAAGTATTTTCATATTTTTGGAAGTACCTGTTACATTTTAGATGATAGAGAGTATCATCGAAAG
TGGGATGTGAAATATGATCAAGGAATTTTCCTTGGATACTCTCAGAATAGTCGAGAGTACAAAGTCTTTAATATTAAATCAGGAACAGTTATGGAAACAATCAAT
ATTGTGGTGATCCGTCAGTTGGAGTCACTACCAGAAGGAAAGAGAAAGTTAATTACATCTGTTGAAAAAGCTCTCAAAGATGAATACTGGATAAATGCTATGCAA
GAAGAATTACTACAATTTAAGTGTAACAATGTGTTGACTCTGGTTCATAAGCCTGATGGGGTGAATGTTATAGTAACTAAGTGGATTTTTAAAAATAAAACTGAT
GAATCAGGCAATGTAACAAAGAACAAAGCTCGTTTGGTGGCTCAAGGTTATGCTAAGGTTGAAGGTGTTGATTTTGATGAAACCTTTGCACTTGTGGCTAGACTT
GAAGCTATTCGCCTTTTGCTCAGTATATCCTGTTTTCGAAAATTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTCTTGGATGGATACTTGAATGTAGAAGTT
TATGTAGCACAACCTAAAGGGTTTGTTGATTCAAAATTTCCTCAGTATGTATACAAACTTAATAAAGCTCTATATGGATTGAAGCAAGCACTTAGGGTTTGGTAT
GAACGTCTAACAATGTATCTAGGTGAAAGAGGATACTCCAGGGGAGAAACTAACAAGAGACTGTTTATTAATAGAACCAGCACTGATCTCATCGTAGCTCAAATT
TATGTGGATGATATTATTTTTGGTGGATTTCCTCAAACTACGACACATGCTAAAATTACCAAGGATATTGTTGATACTACAATAGATCATAAATTGTACAGAAGC
ATGATTGGGAGTCTCTTATATTTAACAGCAGCAGACTTGATATCGCCTATGCTGTTGGAATATGTGCTCGGTATCAGTCAGATCCTCGTATCTCTCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGACTGAAGGCAAACTTGATTAGTATAAGTCAACTATGTGACCAAGGATACAGTGTAAACTTTAACAACACTGGTTGTGTAGTTACTGACAATAATAAT
CAAGTGTTTATGAGTGGCAGACGACAAGCAGATAACTGTTATCATTGGAGCTCCAAGAGTTCAAACATATGTCACTTAACTAAAACTGACCAAACTTGGTTGTGG
CATAGGAAATTGGGGTACATTAGCTTGATAAGCTTAGATAAAGTTATCAGAAACGAGGCAATTGTAGGCATTCCTTCTTTAGATATCAATGGTAAATTCTTTTGT
GGTGACTGTCAAGTTGGGAAGCAAACTAAAACTTCTCACAGAAGCTTAAAGGAATGTTATACAATAAGAGTCCTTGAACTTCTACATCTTGATCTTATAGGTCCC
ATGCAAACTGAAAGTTTGGGTCGAAAGAAGTATGTGTTGGTTGTTGTAGGTGACTACTACAAATTTACTTGGGTTCGGTTCTTAAAAGGAAAGTCAGATACTGTT
AAATTATGTATCAGTCTCTGTTTGAATTTGCAACGTTTAAAAGGGAAAAAGATAATTAGGATCCATAGTGATCATGGGAATGAGTTTGATAGTGAAGAACTTAAT
AATTTCTATAAGTCGGAAGGAATCCACCATGAATTTGCTGCTCCCATAACTCTTCAGCAAAATGGAGTAGTTGAACGGAAGAACAGAACTTTACAAGAAATGGCT
CAAGTTATGATACATGTCAAAAACTTACCTTTGAATTTTTGGCCTGAAGCTGTAAACACAGCATGTCATATTCACAATAGGGTCACTACTCGTTCTGGTACGACA
GTCACATTATATGAACTATGGAAAGGAAGGAAGCCAAATGTTAAGTATTTTCATATTTTTGGAAGTACCTGTTACATTTTAGATGATAGAGAGTATCATCGAAAG
TGGGATGTGAAATATGATCAAGGAATTTTCCTTGGATACTCTCAGAATAGTCGAGAGTACAAAGTCTTTAATATTAAATCAGGAACAGTTATGGAAACAATCAAT
ATTGTGGTGATCCGTCAGTTGGAGTCACTACCAGAAGGAAAGAGAAAGTTAATTACATCTGTTGAAAAAGCTCTCAAAGATGAATACTGGATAAATGCTATGCAA
GAAGAATTACTACAATTTAAGTGTAACAATGTGTTGACTCTGGTTCATAAGCCTGATGGGGTGAATGTTATAGTAACTAAGTGGATTTTTAAAAATAAAACTGAT
GAATCAGGCAATGTAACAAAGAACAAAGCTCGTTTGGTGGCTCAAGGTTATGCTAAGGTTGAAGGTGTTGATTTTGATGAAACCTTTGCACTTGTGGCTAGACTT
GAAGCTATTCGCCTTTTGCTCAGTATATCCTGTTTTCGAAAATTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTCTTGGATGGATACTTGAATGTAGAAGTT
TATGTAGCACAACCTAAAGGGTTTGTTGATTCAAAATTTCCTCAGTATGTATACAAACTTAATAAAGCTCTATATGGATTGAAGCAAGCACTTAGGGTTTGGTAT
GAACGTCTAACAATGTATCTAGGTGAAAGAGGATACTCCAGGGGAGAAACTAACAAGAGACTGTTTATTAATAGAACCAGCACTGATCTCATCGTAGCTCAAATT
TATGTGGATGATATTATTTTTGGTGGATTTCCTCAAACTACGACACATGCTAAAATTACCAAGGATATTGTTGATACTACAATAGATCATAAATTGTACAGAAGC
ATGATTGGGAGTCTCTTATATTTAACAGCAGCAGACTTGATATCGCCTATGCTGTTGGAATATGTGCTCGGTATCAGTCAGATCCTCGTATCTCTCACTTGA
Protein sequenceShow/hide protein sequence
MDGLKANLISISQLCDQGYSVNFNNTGCVVTDNNNQVFMSGRRQADNCYHWSSKSSNICHLTKTDQTWLWHRKLGYISLISLDKVIRNEAIVGIPSLDINGKFFC
GDCQVGKQTKTSHRSLKECYTIRVLELLHLDLIGPMQTESLGRKKYVLVVVGDYYKFTWVRFLKGKSDTVKLCISLCLNLQRLKGKKIIRIHSDHGNEFDSEELN
NFYKSEGIHHEFAAPITLQQNGVVERKNRTLQEMAQVMIHVKNLPLNFWPEAVNTACHIHNRVTTRSGTTVTLYELWKGRKPNVKYFHIFGSTCYILDDREYHRK
WDVKYDQGIFLGYSQNSREYKVFNIKSGTVMETINIVVIRQLESLPEGKRKLITSVEKALKDEYWINAMQEELLQFKCNNVLTLVHKPDGVNVIVTKWIFKNKTD
ESGNVTKNKARLVAQGYAKVEGVDFDETFALVARLEAIRLLLSISCFRKFKLYQMDVKSAFLDGYLNVEVYVAQPKGFVDSKFPQYVYKLNKALYGLKQALRVWY
ERLTMYLGERGYSRGETNKRLFINRTSTDLIVAQIYVDDIIFGGFPQTTTHAKITKDIVDTTIDHKLYRSMIGSLLYLTAADLISPMLLEYVLGISQILVSLT