; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009842 (gene) of Snake gourd v1 genome

Gene IDTan0009842
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG03:5978916..5981614
RNA-Seq ExpressionTan0009842
SyntenyTan0009842
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]9.6e-29767.26Show/hide
Query:  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLE
        +LGHINL++I RLV +GLL++L+++SLP CESCLEGKMTKRPF+ KGYRAKEP ELIHSDLCG MNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LE
Subjt:  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLE

Query:  KFKEFKTEVENLL------------------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKL
        KFKE+KTEVENLL                                    VE AV+ILN  PSKSV ETPFELW GRK SL HFRIWGC THVLV+NPKKL
Subjt:  KFKEFKTEVENLL------------------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKL

Query:  ESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR
        +SR +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV 
Subjt:  ESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR

Query:  QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEE
        Q  RY+GL +T  V PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFN +WELVD P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E 
Subjt:  QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEE

Query:  VDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVD
        VDY+ETFSPVAM+KSIRILL+IA +YDYE+W+MDVKTAF+N NL+E+I+M +P+GFI +GQEQKVC+L RSIYGLKQAS+SWNIRFD AIKSYGF+QNVD
Subjt:  VDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVD

Query:  EPCVYKKIVNKTIAFLILYVDDILLIGNE-----------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH
        EPC+YKKI    +AFL+LYVDDIL IGN+                 +QCPKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDH
Subjt:  EPCVYKKIVNKTIAFLILYVDDILLIGNE-----------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH

Query:  WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVV
        WT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA KEAVWLRKF+ DLEVV
Subjt:  WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVV

Query:  PNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ
        PNMNL ITL+ DNS AVANS+EPRSHKRGKHIERKYHLIREI+ RGD  VT+IASEHN+ADPF K LTAKVF+GHL+SLGL+
Subjt:  PNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-29760.78Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR ++S   + +LWHLRLGHINL++I RLV +GLL++L++ SLP CESCLEGKMTKRPF+GKGYRAKEP ELIHSDLCGPMNVKARGG+EYF+SFIDD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+KTEVENLL                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            VE AV+ILN VPSKSV ETPFELW GRK SL HFRIWGC  HVLV+NPKKLE R +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQE
        H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV Q  RY+GL +T VV PDD  EDPL+Y QAM  VDK++W+KAMD E
Subjt:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQE

Query:  MESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLD
        MESMYFNS+WELVD P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL+IA +YDYE+W+MDVKTAF+N NL+
Subjt:  MESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLD

Query:  ETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------------
        E+I+M QP+GFI +GQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGF+QNVDEPCVYKKI    +AFL+LYVDDILLIGN+                
Subjt:  ETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------------

Query:  -----------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVG
                                                                   +Q PKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVG
Subjt:  -----------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEA
        IVSRYQSN GLDHWT VK +LKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA KEA
Subjt:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEA

Query:  VWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ
        VWLRKF+ DLEVVPNMNL ITL+CDNSGAVANS+EPRSHKRGKHIERKYHLIREI+ RGDV VT+IASEHN+ADPF K LTAKVF+GHLESLGL+
Subjt:  VWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

TrEMBL top hitse value%identityAlignment
A0A5A7SIN2 Gag/pol protein4.6e-29767.26Show/hide
Query:  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLE
        +LGHINL++I RLV +GLL++L+++SLP CESCLEGKMTKRPF+ KGYRAKEP ELIHSDLCG MNVKARGG+EYF+SFIDDY RYGY+YLM  KSE LE
Subjt:  RLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMYRKSETLE

Query:  KFKEFKTEVENLL------------------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKL
        KFKE+KTEVENLL                                    VE AV+ILN  PSKSV ETPFELW GRK SL HFRIWGC THVLV+NPKKL
Subjt:  KFKEFKTEVENLL------------------------------------VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKL

Query:  ESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR
        +SR +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV 
Subjt:  ESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVR

Query:  QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEE
        Q  RY+GL +T  V PDD  EDPL+Y QAM  VDK++W+KAMD EMESMYFN +WELVD P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E 
Subjt:  QLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEE

Query:  VDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVD
        VDY+ETFSPVAM+KSIRILL+IA +YDYE+W+MDVKTAF+N NL+E+I+M +P+GFI +GQEQKVC+L RSIYGLKQAS+SWNIRFD AIKSYGF+QNVD
Subjt:  VDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVD

Query:  EPCVYKKIVNKTIAFLILYVDDILLIGNE-----------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH
        EPC+YKKI    +AFL+LYVDDIL IGN+                 +QCPKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVGIVSRYQSN GLDH
Subjt:  EPCVYKKIVNKTIAFLILYVDDILLIGNE-----------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVGIVSRYQSNQGLDH

Query:  WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVV
        WT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA KEAVWLRKF+ DLEVV
Subjt:  WTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVV

Query:  PNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ
        PNMNL ITL+ DNS AVANS+EPRSHKRGKHIERKYHLIREI+ RGD  VT+IASEHN+ADPF K LTAKVF+GHL+SLGL+
Subjt:  PNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ

A0A5A7SMH8 Gag/pol protein1.1e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

A0A5A7TZD0 Gag/pol protein9.4e-29860.78Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR ++S   + +LWHLRLGHINL++I RLV +GLL++L++ SLP CESCLEGKMTKRPF+GKGYRAKEP ELIHSDLCGPMNVKARGG+EYF+SFIDD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+KTEVENLL                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            VE AV+ILN VPSKSV ETPFELW GRK SL HFRIWGC  HVLV+NPKKLE R +LC FVGYPKETRG +FFDP++NRV VSTNATFLEE+H+R+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQE
        H PRSK+VL+E    S RV D    S+ V + ++S +   SQ LRMPRRS RVV Q  RY+GL +T VV PDD  EDPL+Y QAM  VDK++W+KAMD E
Subjt:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVR-SQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQE

Query:  MESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLD
        MESMYFNS+WELVD P+GVKPIGCKWIYKRKRD   KVQTFKARLVAK +TQ E VDY+ETFSPVAM+KSIRILL+IA +YDYE+W+MDVKTAF+N NL+
Subjt:  MESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLD

Query:  ETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------------
        E+I+M QP+GFI +GQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGF+QNVDEPCVYKKI    +AFL+LYVDDILLIGN+                
Subjt:  ETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE----------------

Query:  -----------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVG
                                                                   +Q PKTPQ+VEDMRRIPYASAV SLMYAMLCTRPDICYAVG
Subjt:  -----------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEA
        IVSRYQSN GLDHWT VK +LKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA KEA
Subjt:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEAEKEA

Query:  VWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ
        VWLRKF+ DLEVVPNMNL ITL+CDNSGAVANS+EPRSHKRGKHIERKYHLIREI+ RGDV VT+IASEHN+ADPF K LTAKVF+GHLESLGL+
Subjt:  VWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQ

A0A5A7TZD7 Gag/pol protein1.1e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

A0A5D3CPJ6 Gag/pol protein1.1e-30161.2Show/hide
Query:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD
        + KR+K+S KE+AHLWHLRLGHINLN+IERLV +GLLSELEENSLPVCESCLEGKMTKRPF+GKG+RAKEP EL+HSDLCGPMNVKARGG+EYF++F DD
Subjt:  RTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDD

Query:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------
        Y RYGY+YLM  KSE LEKFKE+K EVEN L                                                                     
Subjt:  YLRYGYIYLMYRKSETLEKFKEFKTEVENLL---------------------------------------------------------------------

Query:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD
            V+ AVYILN VPSKSV ETP +LW+GRK SLRHFRIWGC  HVL +NPKKLE R KLCLFVGYPK TRG  F+DPKDN+V VSTNATFLEE+HIR+
Subjt:  ----VEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRD

Query:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA
        H PRSKIVLNE+       S RV +  S  T VV   SS++  + Q LR PRRS RV     RYM L +TL V  D D EDPLT+ +AM  VDK+EWIKA
Subjt:  HLPRSKIVLNEMDSS----SARVADGASTSTSVVDPSSSSKV-RSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKA

Query:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN
        M+ E+ESMYFNS+W+LVDQPDGVKPIGCKWIYKRKR  D KVQTFKARLVAK +TQVE VDY+ETFSPVAM+KSIRILL+IA Y+DYE+W+MDVKTAF+N
Subjt:  MDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMN

Query:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------
         NL+ETIYM QP+GFI  GQEQK+C+L RSIYGLKQASRSWNIRFD AIKSYGF+Q VDEPCVYK+I+NK++AFL+LYVDDILLIGN+            
Subjt:  DNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE------------

Query:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC
                                                                       +QCPKTPQ VE+MR IPYASAV SLMYAMLCTRPDIC
Subjt:  ---------------------------------------------------------------DQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDIC

Query:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA
        YAVGIVSRYQSN GL HWT VK ILKYLRR R+Y LVY   DLILT                                    GCIA+STMEAEYV ACEA
Subjt:  YAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILT------------------------------------GCIANSTMEAEYVVACEA

Query:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV
         KEAVWLR F++DLEVVPNM+  ITL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+HRGDV VTQIAS HNVADPF K LTAKVF+GHLESLGL+ 
Subjt:  EKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQV

Query:  LP
        +P
Subjt:  LP

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-7324.43Show/hide
Query:  IKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELE-----ENSLPVCESCLEGKMTKRPFSGKGYRA--KEPFELIHSDLCGPMNVKARGGYEYFVSF
        I    K +  LWH R GHI+  K+  +    + S+       E S  +CE CL GK  + PF     +   K P  ++HSD+CGP+         YFV F
Subjt:  IKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELE-----ENSLPVCESCLEGKMTKRPFSGKGYRA--KEPFELIHSDLCGPMNVKARGGYEYFVSF

Query:  IDDYLRYGYIYLMYRKSETLEKFKEFKTEVE---NL----------------------------------------------------------------
        +D +  Y   YL+  KS+    F++F  + E   NL                                                                
Subjt:  IDDYLRYGYIYLMYRKSETLEKFKEFKTEVE---NL----------------------------------------------------------------

Query:  ------LVEIAVYILNTVPSKSVCE---TPFELWHGRKISLRHFRIWGCLTHVLVSNPK-KLESRLKLCLFVGY--------------------------
               V  A Y++N +PS+++ +   TP+E+WH +K  L+H R++G   +V + N + K + +    +FVGY                          
Subjt:  ------LVEIAVYILNTVPSKSVCE---TPFELWHGRKISLRHFRIWGCLTHVLVSNPK-KLESRLKLCLFVGY--------------------------

Query:  -----------------PKETRGDMFFDPKDNRVLVST----------NATFLEEN-------------------------------HIRDHLPRSKIVL
                          KE+    F  P D+R ++ T          N  FL+++                                ++D    +K  L
Subjt:  -----------------PKETRGDMFFDPKDNRVLVST----------NATFLEEN-------------------------------HIRDHLPRSKIVL

Query:  NEMDSSSARVADGASTSTSVVDPSSSSKVRSQE-LR---------------MPRRSERV-----VRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVD
        NE  S   +  D  + S    +P+ S +  + E L+               + RRSER+     +   E    L K ++       + P ++D+   + D
Subjt:  NEMDSSSARVADGASTSTSVVDPSSSSKVRSQE-LR---------------MPRRSERV-----VRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVD

Query:  KNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMD
        K+ W +A++ E+ +   N+ W +  +P+    +  +W++  K +       +KARLVA+ FTQ  ++DY+ETF+PVA + S R +L++ + Y+ +V +MD
Subjt:  KNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMD

Query:  VKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVY---KKIVNKTIAFLILYVDDILL------
        VKTAF+N  L E IYM  P+G         VC+L ++IYGLKQA+R W   F++A+K   F  +  + C+Y   K  +N+ I +++LYVDD+++      
Subjt:  VKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVY---KKIVNKTIAFLILYVDDILL------

Query:  ------------------------IG-------------------------NEDQC----PKTPQKV-------EDMRRIPYASAVRSLMYAMLCTRPDI
                                IG                         N + C       P K+       ++    P  S +  LMY MLCTRPD+
Subjt:  ------------------------IG-------------------------NEDQC----PKTPQKV-------EDMRRIPYASAVRSLMYAMLCTRPDI

Query:  CYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG-----------------------------------DLIL-----TGCIANSTMEAEYV
          AV I+SRY S    + W  +K +L+YL+   +  L++ +                                    +LI         +A S+ EAEY+
Subjt:  CYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG-----------------------------------DLIL-----TGCIANSTMEAEYV

Query:  VACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLES
           EA +EA+WL+  +  + +   +   I ++ DN G ++ +  P  HKR KHI+ KYH  RE +    + +  I +E+ +AD F K L A  F    + 
Subjt:  VACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLES

Query:  LGL
        LGL
Subjt:  LGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.1e-10928.59Show/hide
Query:  SAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMY
        S  LWH R+GH++   ++ L    L+S  +  ++  C+ CL GK  +  F     R     +L++SD+CGPM +++ GG +YFV+FIDD  R  ++Y++ 
Subjt:  SAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYLMY

Query:  RKSETLEKFKEF---------------------------------------------------------KTEVENL----------------LVEIAVYI
         K +  + F++F                                                         +T VE +                 V+ A Y+
Subjt:  RKSETLEKFKEF---------------------------------------------------------KTEVENL----------------LVEIAVYI

Query:  LNTVPSKSVC-ETPFELWHGRKISLRHFRIWGC--LTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIV
        +N  PS  +  E P  +W  +++S  H +++GC    HV      KL+ +   C+F+GY  E  G   +DP   +V+ S +  F  E+ +R     S+ V
Subjt:  LNTVPSKSVC-ETPFELWHGRKISLRHFRIWGC--LTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATFLEENHIRDHLPRSKIV

Query:  LN-----------------EMDSSSARVADGASTSTSVVDPSSSSKVRSQELRMP----------RRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYD
         N                   +S++  V++       V++         +E+  P          RRSER   +  RY      L+     D  +P +  
Subjt:  LN-----------------EMDSSSARVADGASTSTSVVDPSSSSKVRSQELRMP----------RRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYD

Query:  QAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYD
        + ++  +KN+ +KAM +EMES+  N  ++LV+ P G +P+ CKW++K K+D D K+  +KARLV K F Q + +D+ E FSPV  + SIR +L++A   D
Subjt:  QAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYD

Query:  YEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVY-KKIVNKTIAFLILYVDDILLI
         EV ++DVKTAF++ +L+E IYM+QP+GF   G++  VC+L +S+YGLKQA R W ++FD  +KS  + +   +PCVY K+        L+LYVDD+L++
Subjt:  YEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVY-KKIVNKTIAFLILYVDDILLI

Query:  G---------------------------------------------------------------------------NEDQCPKTPQKVEDMRRIPYASAV
        G                                                                           ++  CP T ++  +M ++PY+SAV
Subjt:  G---------------------------------------------------------------------------NEDQCPKTPQKVEDMRRIPYASAV

Query:  RSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILTG------------------------------------CI
         SLMYAM+CTRPDI +AVG+VSR+  N G +HW  VK IL+YLR      L +   D IL G                                    C+
Subjt:  RSLMYAMLCTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILTG------------------------------------CI

Query:  ANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALT
        A ST EAEY+ A E  KE +WL++F+ +L +         ++CD+  A+  S+    H R KHI+ +YH IRE++    + V +I++  N AD   K + 
Subjt:  ANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALT

Query:  AKVFKGHLESLGL
           F+   E +G+
Subjt:  AKVFKGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W7.3e-1324.28Show/hide
Query:  MDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE---
        MDV TAF+N  +DE IY+ QP GF+ +     V  L   +YGLKQA   WN   +  +K  GF ++  E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNE---

Query:  -DQCPKTPQKVEDMRRI----------------------------------------------------------------PYASAVRSLMYAMLCTRPD
         D+  +   K+  M+ +                                                                PY S V  L++     RPD
Subjt:  -DQCPKTPQKVEDMRRI----------------------------------------------------------------PYASAVRSLMYAMLCTRPD

Query:  ICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG
        I Y V ++SR+       H    + +L+YL   R+  L Y  G
Subjt:  ICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-4926.72Show/hide
Query:  AMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDG-VKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYD
        A+  +    W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K + D  +  +KARLVAK + Q   +DY ETFSPV    SIRI+L +AV   
Subjt:  AMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDG-VKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYD

Query:  YEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIG
        + + ++DV  AF+   L + +YM QP GFI K +   VC+L++++YGLKQA R+W +     + + GF  +V +  ++     K+I ++++YVDDIL+ G
Subjt:  YEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIG

Query:  NE-----DQCPKTPQKVE-------------DMRRIP-------------------------------------------------YASAVRSLMYAMLC
        N+     +      Q+               + +R+P                                                 Y   V SL Y +  
Subjt:  NE-----DQCPKTPQKVE-------------DMRRIP-------------------------------------------------YASAVRSLMYAMLC

Query:  TRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVY------------------DRGDLILTG-------------------CIANSTMEAE
        TRPDI YAV  +S++      +H   +K IL+YL    N+ +                    D+ D + T                     +  S+ EAE
Subjt:  TRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVY------------------DRGDLILTG-------------------CIANSTMEAE

Query:  YVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHL
        Y        E  W+   + +L +   +     ++CDN GA      P  H R KHI   YH IR  +  G + V  +++   +AD   K L+   F+   
Subjt:  YVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHL

Query:  ESLGLQVLP
          +G+  +P
Subjt:  ESLGLQVLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-5026.04Show/hide
Query:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDP--------SSSSKVRSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEW
        H+P     ++E +S S+     +STST  + P          +++       M  R++  +R+  +    A +L         +P T  QAM     + W
Subjt:  HLPRSKIVLNEMDSSSARVADGASTSTSVVDP--------SSSSKVRSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEW

Query:  IKAMDQEMESMYFNSIWELV-DQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKT
         +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K + D  +  +KARLVAK + Q   +DY ETFSPV    SIRI+L +AV   + + ++DV  
Subjt:  IKAMDQEMESMYFNSIWELV-DQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKT

Query:  AFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCPK---
        AF+   L + +YM QP GF+ K +   VCRL+++IYGLKQA R+W +     + + GF  ++ +  ++     ++I ++++YVDDIL+ GN+    K   
Subjt:  AFMNDNLDETIYMDQPKGFIAKGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCPK---

Query:  -------TPQKVEDM--------RRIP-------------------------------------------------YASAVRSLMYAMLCTRPDICYAVG
               + ++ ED+        +R+P                                                 Y   V SL Y +  TRPD+ YAV 
Subjt:  -------TPQKVEDM--------RRIP-------------------------------------------------YASAVRSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG------------------DLILTG-------------------CIANSTMEAEYVVACEAEKE
         +S+Y      DHW  +K +L+YL    ++ +   +G                  D + T                     +  S+ EAEY        E
Subjt:  IVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRG------------------DLILTG-------------------CIANSTMEAEYVVACEAEKE

Query:  AVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQVLP
          W+   + +L +   ++    ++CDN GA      P  H R KHI   YH IR  +  G + V  +++   +AD   K L+   F+     +G+  +P
Subjt:  AVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQVLP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.8e-5028.84Show/hide
Query:  EDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILL
        ++P TY++A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K + D  ++ +KARLVAK +TQ E +D+ ETFSPV  + S++++L
Subjt:  EDPLTYDQAMAKVDKNEWIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILL

Query:  AIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQE----QKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFL
        AI+  Y++ + ++D+  AF+N +LDE IYM  P G+ A+  +      VC L++SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  AIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIAKGQE----QKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFL

Query:  ILYVDDILLIGNEDQCP-------KTPQKVEDMRRIPY--------------------------------------------------------ASAVRS
        ++YVDDI++  N D          K+  K+ D+  + Y                                                        A A R 
Subjt:  ILYVDDILLIGNEDQCP-------KTPQKVEDMRRIPY--------------------------------------------------------ASAVRS

Query:  LMYAML---CTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDR---------GDLILTGC---------------------------
        L+  ++    TR DI +AV  +S++     L H   V  IL Y++      L Y            D     C                           
Subjt:  LMYAML---CTRPDICYAVGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDR---------GDLILTGC---------------------------

Query:  -IANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIRE
         ++ S+ EAEY     A  E +WL +F  +L++   ++    LFCDN+ A+  +     H+R KHIE   H +RE
Subjt:  -IANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKHIERKYHLIRE

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.9e-0938.16Show/hide
Query:  KESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNV
        K+   LWH RL H++   +E LV  G L   + +SL  CE C+ GK  +  FS   +  K P + +HSDL G  +V
Subjt:  KESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.7e-1340.7Show/hide
Query:  WIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIA
        W +AM +E++++  N  W LV  P     +GCKW++K K   D  +   KARLVAK F Q E + + ET+SPV    +IR +L +A
Subjt:  WIKAMDQEMESMYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAACTAAGAGAATAAAAGTTTCTCTAAAAGAAAGTGCCCATCTTTGGCATCTAAGGTTAGGCCACATAAATCTCAATAAGATTGAGAGACTAGTAATGAGTGGACT
TCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGGTATAGAGCCAAGGAGCCCTTTG
AGCTTATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCACGAGGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTTGAGGTATGGGTACATTTACCTA
ATGTATAGGAAGTCTGAAACACTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGTGGAGATTGCGGTATACATTTTGAACACAGTTCCGTCGAAAAG
TGTTTGTGAAACACCTTTTGAACTTTGGCATGGTCGTAAAATCAGTTTACGTCATTTCAGAATTTGGGGATGCCTGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGG
AATCCCGTTTAAAGTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGGGGTGATATGTTTTTCGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAATGCCACTTTC
CTTGAGGAAAACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTGAATGAAATGGACAGTTCATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGT
TGTTGATCCTAGCTCGTCTAGTAAAGTCCGTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGAGAGGGTTGTGAGACAGCTTGAACGTTACATGGGTTTGGCTAAAACCC
TGGTCGTCACCCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGCAAAGGTTGACAAAAATGAATGGATTAAAGCTATGGATCAGGAAATGGAGTCA
ATGTACTTCAATTCCATCTGGGAGCTTGTGGACCAACCAGATGGGGTTAAACCTATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAAC
CTTCAAAGCACGACTAGTGGCAAAGAGTTTTACCCAGGTGGAAGAAGTTGACTATAAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATTAGAATCCTTCTAGCTA
TTGCCGTATATTATGACTATGAGGTATGGAAAATGGATGTCAAGACCGCTTTTATGAATGACAACCTTGACGAAACCATCTACATGGACCAACCCAAGGGGTTCATTGCC
AAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTT
TAATCAAAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTTAACAAAACTATCGCATTTCTAATTTTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGATCAAT
GTCCTAAGACACCTCAAAAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTGTAAGGAGCCTCATGTATGCCATGCTGTGTACTAGGCCTGACATCTGTTATGCA
GTTGGGATTGTCAGTAGGTATCAATCCAATCAAGGATTAGATCACTGGACAATCGTAAAGGCAATCCTCAAGTATCTTAGGAGAATGAGGAACTACAACCTTGTGTATGA
CAGAGGGGATTTGATCCTTACGGGATGCATCGCTAATTCCACGATGGAAGCCGAGTATGTTGTGGCTTGTGAAGCTGAAAAGGAAGCTGTTTGGCTAAGGAAATTCATGA
TGGATTTGGAAGTTGTTCCAAATATGAACTTATCGATCACGTTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCAC
ATAGAGCGTAAGTATCACCTGATACGGGAGATTATGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTATAAAGGCCCTCAC
GGCTAAGGTGTTTAAGGGTCACCTAGAGAGTCTAGGTCTTCAAGTGCTTCCTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAACTAAGAGAATAAAAGTTTCTCTAAAAGAAAGTGCCCATCTTTGGCATCTAAGGTTAGGCCACATAAATCTCAATAAGATTGAGAGACTAGTAATGAGTGGACT
TCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGGTATAGAGCCAAGGAGCCCTTTG
AGCTTATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCACGAGGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTTGAGGTATGGGTACATTTACCTA
ATGTATAGGAAGTCTGAAACACTTGAAAAGTTCAAGGAGTTCAAGACTGAGGTTGAGAACCTGTTAGTGGAGATTGCGGTATACATTTTGAACACAGTTCCGTCGAAAAG
TGTTTGTGAAACACCTTTTGAACTTTGGCATGGTCGTAAAATCAGTTTACGTCATTTCAGAATTTGGGGATGCCTGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGG
AATCCCGTTTAAAGTTGTGCCTCTTTGTAGGTTACCCAAAAGAGACTAGGGGTGATATGTTTTTCGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAATGCCACTTTC
CTTGAGGAAAACCACATCAGGGATCACTTACCAAGGAGTAAGATTGTGTTGAATGAAATGGACAGTTCATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGT
TGTTGATCCTAGCTCGTCTAGTAAAGTCCGTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGAGAGGGTTGTGAGACAGCTTGAACGTTACATGGGTTTGGCTAAAACCC
TGGTCGTCACCCCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGCAAAGGTTGACAAAAATGAATGGATTAAAGCTATGGATCAGGAAATGGAGTCA
ATGTACTTCAATTCCATCTGGGAGCTTGTGGACCAACCAGATGGGGTTAAACCTATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAAC
CTTCAAAGCACGACTAGTGGCAAAGAGTTTTACCCAGGTGGAAGAAGTTGACTATAAGGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATTAGAATCCTTCTAGCTA
TTGCCGTATATTATGACTATGAGGTATGGAAAATGGATGTCAAGACCGCTTTTATGAATGACAACCTTGACGAAACCATCTACATGGACCAACCCAAGGGGTTCATTGCC
AAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTT
TAATCAAAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTTAACAAAACTATCGCATTTCTAATTTTGTATGTGGATGATATCCTTCTCATTGGGAATGAGGATCAAT
GTCCTAAGACACCTCAAAAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTGTAAGGAGCCTCATGTATGCCATGCTGTGTACTAGGCCTGACATCTGTTATGCA
GTTGGGATTGTCAGTAGGTATCAATCCAATCAAGGATTAGATCACTGGACAATCGTAAAGGCAATCCTCAAGTATCTTAGGAGAATGAGGAACTACAACCTTGTGTATGA
CAGAGGGGATTTGATCCTTACGGGATGCATCGCTAATTCCACGATGGAAGCCGAGTATGTTGTGGCTTGTGAAGCTGAAAAGGAAGCTGTTTGGCTAAGGAAATTCATGA
TGGATTTGGAAGTTGTTCCAAATATGAACTTATCGATCACGTTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCAC
ATAGAGCGTAAGTATCACCTGATACGGGAGATTATGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTATAAAGGCCCTCAC
GGCTAAGGTGTTTAAGGGTCACCTAGAGAGTCTAGGTCTTCAAGTGCTTCCTGACTAG
Protein sequenceShow/hide protein sequence
MRTKRIKVSLKESAHLWHLRLGHINLNKIERLVMSGLLSELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPFELIHSDLCGPMNVKARGGYEYFVSFIDDYLRYGYIYL
MYRKSETLEKFKEFKTEVENLLVEIAVYILNTVPSKSVCETPFELWHGRKISLRHFRIWGCLTHVLVSNPKKLESRLKLCLFVGYPKETRGDMFFDPKDNRVLVSTNATF
LEENHIRDHLPRSKIVLNEMDSSSARVADGASTSTSVVDPSSSSKVRSQELRMPRRSERVVRQLERYMGLAKTLVVTPDDDCEDPLTYDQAMAKVDKNEWIKAMDQEMES
MYFNSIWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFKARLVAKSFTQVEEVDYKETFSPVAMVKSIRILLAIAVYYDYEVWKMDVKTAFMNDNLDETIYMDQPKGFIA
KGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFNQNVDEPCVYKKIVNKTIAFLILYVDDILLIGNEDQCPKTPQKVEDMRRIPYASAVRSLMYAMLCTRPDICYA
VGIVSRYQSNQGLDHWTIVKAILKYLRRMRNYNLVYDRGDLILTGCIANSTMEAEYVVACEAEKEAVWLRKFMMDLEVVPNMNLSITLFCDNSGAVANSREPRSHKRGKH
IERKYHLIREIMHRGDVTVTQIASEHNVADPFIKALTAKVFKGHLESLGLQVLPD