; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0010531 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0010531
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:5532789..5534474
RNA-Seq ExpressionCmc01g0010531
SyntenyCmc01g0010531
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]1.1e-25984.29Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPVKIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQAV FSISS+GGL+FERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.1e-26590.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV        DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE+AEIAVSVG VT+QL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL

Query:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTLRQRIIDAQ ND YLVEKRGLAEAGQAV FSISS+GGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
Subjt:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSRCLVCQQVKAPRQKPAGLL
        +FVSRCLVCQQVKAPRQKPAGLL
Subjt:  EFVSRCLVCQQVKAPRQKPAGLL

KAA0048687.1 pol protein [Cucumis melo var. makuwa]2.4e-25983.96Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQAV FS+SS+GGL+FERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

KAA0053368.1 pol protein [Cucumis melo var. makuwa]2.4e-25984.14Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV AP+LTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQ KVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQ TLRQRII AQSND YLVEKRGLAEAGQA GFSISS+GGL FERRLCVPSDS +KT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

KAA0062245.1 pol protein [Cucumis melo var. makuwa]3.1e-26289.48Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPST+SEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDA KKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL
        GCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAV        DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL

Query:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQA  FS+SS+GGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
Subjt:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSRCLVCQQVKAPRQKPAGLL
        EFVS+CLVCQQVKAP QKPAGLL
Subjt:  EFVSRCLVCQQVKAPRQKPAGLL

TrEMBL top hitse value%identityAlignment
A0A5A7THE6 Reverse transcriptase5.3e-26084.29Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPVKIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQAV FSISS+GGL+FERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL

A0A5A7TV57 Ty3-gypsy retrotransposon protein2.5e-26590.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV        DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE+AEIAVSVG VT+QL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL

Query:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTLRQRIIDAQ ND YLVEKRGLAEAGQAV FSISS+GGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
Subjt:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSRCLVCQQVKAPRQKPAGLL
        +FVSRCLVCQQVKAPRQKPAGLL
Subjt:  EFVSRCLVCQQVKAPRQKPAGLL

A0A5A7U330 Reverse transcriptase1.2e-25983.96Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQAV FS+SS+GGL+FERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

A0A5A7UE75 Reverse transcriptase1.2e-25984.14Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPSTVSEVR FLGLA                        APFVWSKACEDSFQNLKQKLV AP+LTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA
        GCVLMQQ KVVAYASRQLKSHEQNYPTHDLELAAV                                              DYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV----------------------------------------------DYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QLAQLTVQ TLRQRII AQSND YLVEKRGLAEAGQA GFSISS+GGL FERRLCVPSDS +KT
Subjt:  LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

A0A5A7V8L8 Pol protein1.5e-26289.48Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVT WTRPST+SEVR FLGLA                        APFVWSKACEDSFQNLKQKLV APVLTVPDGSGSFVIYSDA KKGL
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLGLA------------------------APFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL
        GCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAV        DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE+AEIAVSVGAVT+QL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------DYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQL

Query:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTLRQRIIDAQSND YLVEKRGLAEAGQA  FS+SS+GGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
Subjt:  AQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSRCLVCQQVKAPRQKPAGLL
        EFVS+CLVCQQVKAP QKPAGLL
Subjt:  EFVSRCLVCQQVKAPRQKPAGLL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.8e-6728.15Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK +  N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
         AFR   G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF    V F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R+FLG              L  P          + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A+                                                  D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ L+    L    + V  +I    GL+   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--

Query:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G L
Subjt:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

P0CT35 Transposon Tf2-2 polyprotein7.8e-6728.15Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK +  N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
         AFR   G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF    V F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R+FLG              L  P          + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A+                                                  D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ L+    L    + V  +I    GL+   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--

Query:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G L
Subjt:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

P0CT36 Transposon Tf2-3 polyprotein7.8e-6728.15Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK +  N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
         AFR   G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF    V F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R+FLG              L  P          + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A+                                                  D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ L+    L    + V  +I    GL+   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--

Query:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G L
Subjt:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

P0CT37 Transposon Tf2-4 polyprotein7.8e-6728.15Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK +  N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
         AFR   G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF    V F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R+FLG              L  P          + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A+                                                  D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ L+    L    + V  +I    GL+   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--

Query:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G L
Subjt:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

P0CT41 Transposon Tf2-12 polyprotein7.8e-6728.15Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK +  N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG
         AFR   G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF    V F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAG

Query:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R+FLG              L  P          + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPVKIEAVTSWTRPSTVSEVRRFLG--------------LAAP----------FVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A+                                                  D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------------------DYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ L+    L    + V  +I    GL+   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERR--

Query:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G L
Subjt:  LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.9e-1231.54Show/hide
Query:  HLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLG--HVVSKAGVSVDPVKIEAVTSWTRPSTVSEVRRFLGLAAPF-----------------------VWS
        HL +VLQ    ++ YA   KC F    +++LG  H++S  GVS DP K+EA+  W  P   +E+R FLGL   +                        W+
Subjt:  HLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLG--HVVSKAGVSVDPVKIEAVTSWTRPSTVSEVRRFLGLAAPF-----------------------VWS

Query:  KACEDSFQNLKQKLVNAPVLTVPDGSGSFV
        +    +F+ LK  +   PVL +PD    FV
Subjt:  KACEDSFQNLKQKLVNAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTCGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAATCGTAAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCTGGCAGTGTTTATGAACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCATGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGTTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGATTTCTGGGTTTAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTAACGCACCGGTTCTTAC
TGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGT
TGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCT
CTTAGTAGAAAGGTATCACATTCGGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCAGGCTGAGATTGCAGTGTCGGTGGGGGCAGTCACTATACA
GTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAAAGTAACGATTCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGG
TTGGGTTCTCCATATCCTCTAATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCA
TTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTG
TCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTCGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAATCGTAAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCTGGCAGTGTTTATGAACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCATGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGTTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGATTTCTGGGTTTAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTAACGCACCGGTTCTTAC
TGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGT
TGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCT
CTTAGTAGAAAGGTATCACATTCGGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCAGGCTGAGATTGCAGTGTCGGTGGGGGCAGTCACTATACA
GTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAAAGTAACGATTCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGG
TTGGGTTCTCCATATCCTCTAATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCA
TTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTG
TCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTATAA
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVIVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHY
EFIVMSFGLTNALAVFMNLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKHVSFLGHVVSKAGVSVDPVKIEAVTSWTRPSTV
SEVRRFLGLAAPFVWSKACEDSFQNLKQKLVNAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVDYDCEILYHPGKANVVADA
LSRKVSHSAALITRQAPLHRDLEQAEIAVSVGAVTIQLAQLTVQPTLRQRIIDAQSNDSYLVEKRGLAEAGQAVGFSISSNGGLVFERRLCVPSDSAVKTELLSEAHSSP
FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLL