CuGenDBv2

Cmc07g0193301 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0193301
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr07:14729085..14730430
RNA-Seq Expression: Cmc07g0193301
Synteny: Cmc07g0193301
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR041373 - Reverse transcriptase, RNase H-like domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
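The genome location in this record has the shape `seqid:start..end`. A minimal sketch of splitting such a string into its parts is given here. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CMiso1.1chr07:14729085..14730430")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1346 bases on CMiso1.1chr07.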


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032248.1 pol protein [Cucumis melo var. makuwa]; e value: 5.6e-195; % identity: 83.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN YPLPRIDDLFDQLQGATVFSKIDL SGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+YEF+VMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVL+TLR+NKLYAKFSKCEFWL+KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY
        VSVDPAKIEAVTNWPR                                               FVIYSDASKKGLGCVLMQQGKVV YASRQLK HEQNY
Subjt:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY

Query:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA
        PTHDLELA VVFALKIWRHYLY EKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALS+KVAHSAA ITKQ PLLRDFERA
Subjt:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA

Query:  EIAVSVGEVTSQLAQLSVQPTLR
        EIAVSVGEVTSQLAQLSVQPTLR
Subjt:  EIAVSVGEVTSQLAQLSVQPTLR

KAA0032541.1 pol protein [Cucumis melo var. makuwa]; e value: 4.7e-194; % identity: 79.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+Y+F+VMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFWL+KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPR-------------------------RL--------------------------------------------RFVIYSDASKKGL
        VSVDPAKIEAVTNWPR                         R+                                             FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPR-------------------------RL--------------------------------------------RFVIYSDASKKGL

Query:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA
        GCVLMQQGKVV YASRQLK HEQNYPTHDLELA VVFALKIWRHYLYSEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADA
Subjt:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA

Query:  LSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTLR
        LS+KVAHSAA ITKQ PLLRDFERAEIAVSVGEVT+QLAQLSVQPTLR
Subjt:  LSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTLR

KAA0045284.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 2.5e-203; % identity: 93.33
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRLR-------------------------FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH
        VSVDPAKIEAVTNWPRRLR                         FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH
Subjt:  VSVDPAKIEAVTNWPRRLR-------------------------FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH

Query:  YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEV
        YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGE+
Subjt:  YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEV

KAA0065652.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 2.5e-195; % identity: 83.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+YEF+VMSFGLTNAPAVFMDLMN VFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFWL+KV FLGHVVS EG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY
        VSVDPAKIEAVTNWPR                                               FVIYSDASKKGLGCVLMQQGKVV YASRQLK HEQNY
Subjt:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY

Query:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA
        PTHDLELA VVFALKIWRHYLY EKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALS+KVAHSAA ITKQ PLLRDFERA
Subjt:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA

Query:  EIAVSVGEVTSQLAQLSVQPTLR
        EIAVSVGEVT+QLAQLSVQP LR
Subjt:  EIAVSVGEVTSQLAQLSVQPTLR

KAA0066451.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.6e-194; % identity: 86.53
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRP+VSPWGAPVLF KK DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RDSDIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
         AFRSRYG+Y+FIVMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFW++KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL----------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY
        VSVDPAKIEA+TNWPR                        +FVIYSDAS+KGLGCVLMQQGKVV YA RQLKSHEQNYPTHDLELA VVFALKIWRHYLY
Subjt:  VSVDPAKIEAVTNWPRRL----------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY

Query:  SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTL
         EKIQI+TDHKS KYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVAD LS+KVAHSAA ITKQ PLL DFER EIAVSVGEVTSQLAQLSVQPTL
Subjt:  SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTL

Query:  R
        R
Subjt:  R

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SNR3 Reverse transcriptase; e value: 2.7e-195; % identity: 83.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN YPLPRIDDLFDQLQGATVFSKIDL SGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+YEF+VMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVL+TLR+NKLYAKFSKCEFWL+KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY
        VSVDPAKIEAVTNWPR                                               FVIYSDASKKGLGCVLMQQGKVV YASRQLK HEQNY
Subjt:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY

Query:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA
        PTHDLELA VVFALKIWRHYLY EKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALS+KVAHSAA ITKQ PLLRDFERA
Subjt:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA

Query:  EIAVSVGEVTSQLAQLSVQPTLR
        EIAVSVGEVTSQLAQLSVQPTLR
Subjt:  EIAVSVGEVTSQLAQLSVQPTLR

A0A5A7SSL3 Reverse transcriptase; e value: 2.3e-194; % identity: 79.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+Y+F+VMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFWL+KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPR-------------------------RL--------------------------------------------RFVIYSDASKKGL
        VSVDPAKIEAVTNWPR                         R+                                             FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPR-------------------------RL--------------------------------------------RFVIYSDASKKGL

Query:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA
        GCVLMQQGKVV YASRQLK HEQNYPTHDLELA VVFALKIWRHYLYSEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADA
Subjt:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA

Query:  LSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTLR
        LS+KVAHSAA ITKQ PLLRDFERAEIAVSVGEVT+QLAQLSVQPTLR
Subjt:  LSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTLR

A0A5A7TQ85 Reverse transcriptase; e value: 1.2e-203; % identity: 93.33
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRLR-------------------------FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH
        VSVDPAKIEAVTNWPRRLR                         FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH
Subjt:  VSVDPAKIEAVTNWPRRLR-------------------------FVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRH

Query:  YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEV
        YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGE+
Subjt:  YLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEV

A0A5A7VBI7 Reverse transcriptase; e value: 1.2e-195; % identity: 83.69
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RD DIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
        TAFRSRYG+YEF+VMSFGLTNAPAVFMDLMN VFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFWL+KV FLGHVVS EG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY
        VSVDPAKIEAVTNWPR                                               FVIYSDASKKGLGCVLMQQGKVV YASRQLK HEQNY
Subjt:  VSVDPAKIEAVTNWPRRL--------------------------------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNY

Query:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA
        PTHDLELA VVFALKIWRHYLY EKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVADALS+KVAHSAA ITKQ PLLRDFERA
Subjt:  PTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERA

Query:  EIAVSVGEVTSQLAQLSVQPTLR
        EIAVSVGEVT+QLAQLSVQP LR
Subjt:  EIAVSVGEVTSQLAQLSVQPTLR

A0A5A7VLF2 Reverse transcriptase; e value: 7.9e-195; % identity: 86.53
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK
        MAP E KELKVQLQELLDKGFIRP+VSPWGAPVLF KK DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLR+RDSDIPK
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG
         AFRSRYG+Y+FIVMSFGLTNAPAVFMDLMN VFKDFLDSF+IVFIDDILIYSKTEAEHEEHLHQVLETLR+NKLYAKFSKCEFW++KVTFLGHVVSSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRRL----------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY
        VSVDPAKIEA+TNWPR                        +FVIYSDAS+KGLGCVLMQQGKVV YA RQLKSHEQNYPTHDLELA VVFALKIWRHYLY
Subjt:  VSVDPAKIEAVTNWPRRL----------------------RFVIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY

Query:  SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTL
         EKIQI+TDHKS KYFFTQKELNMRQRRWLELVKDYDCEILYH GKANVVAD LS+KVAHSAA ITKQ PLL DFER EIAVSVGEVTSQLAQLSVQPTL
Subjt:  SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTL

Query:  R
        R
Subjt:  R

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value: 1.3e-61; % identity: 33.25
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    + KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKT

Query:  AFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGV
        AF +++G+YE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L  +  KCEF  ++ TFLGHV++ +G+
Subjt:  AFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGV

Query:  SVDPAKIEAVTNWP-----------------------------------------------------RRL-----------------RFVIYSDASKKGL
          +P KIEA+  +P                                                     ++L                 +F + +DAS   L
Subjt:  SVDPAKIEAVTNWP-----------------------------------------------------RRL-----------------RFVIYSDASKKGL

Query:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA
        G VL Q G  + Y SR L  HE NY T + EL  +V+A K +RHYL     +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADA
Subjt:  GCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA

Query:  LSK
        LS+
Subjt:  LSK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy; e value: 8.5e-53; % identity: 30.34
Query:  QLQELLDKGFIRPSVSPWGAPVLFVKKK------DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTAFRS
        ++++LL  G IRPS SP+ +P   V KK      + + RL ID+R+LN+ T+ +RYP+P I  +   L  A  F+ +DL+SGYHQ+ + + D  KT+F  
Subjt:  QLQELLDKGFIRPSVSPWGAPVLFVKKK------DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTAFRS

Query:  RYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVSVDP
          G YEF  + FGL NA ++F   ++ V ++ +     V++DD++I+S+ E++H  H+  VL+ L    +     K  F+ + V +LG +VS +G   DP
Subjt:  RYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVSVDP

Query:  AKIEAVTNWP---------------------------------------------------------------RRLR------------------FVIYS
         K++A+  +P                                                               +RLR                  F + +
Subjt:  AKIEAVTNWP---------------------------------------------------------------RRLR------------------FVIYS

Query:  DASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY-SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLG
        DAS  G+G VL Q+G+ +   SR LK  EQNY T++ EL  +V+AL   +++LY S +I IFTDH+ L +    +  N + +RW   +  ++ ++ Y  G
Subjt:  DASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY-SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLG

Query:  KANVVADALSKK
        K N VADALS++
Subjt:  KANVVADALSKK

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value: 1.6e-59; % identity: 32.27
Query:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRD
        +A T   E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +
Subjt:  MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRD

Query:  SDIPKTAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHV
          I KTAF ++ G+YE++ M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L  +  KCEF  K+  FLGH+
Subjt:  SDIPKTAFRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHV

Query:  VSSEGVSVDPAKIEAVTNWP--------------------------------------------RRL--------------------------RFVIYSD
        V+ +G+  +P K++A+ ++P                                            ++L                          +FV+ +D
Subjt:  VSSEGVSVDPAKIEAVTNWP--------------------------------------------RRL--------------------------RFVIYSD

Query:  ASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKA
        AS   LG VL Q G  + + SR L  HE NY   + EL  +V+A K +RHYL   +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK 
Subjt:  ASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKA

Query:  NVVADALSK
        N VADALS+
Subjt:  NVVADALSK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value: 1.2e-54; % identity: 29.98
Query:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTA
        E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +++SDIPKTA
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTA

Query:  FRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVS
        F +  G YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+++S+    H ++L  VL +L    L     K  F   +V FLG++V+++G+ 
Subjt:  FRSRYGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVS

Query:  VDPAKIEAVTNWP-----RRLR---------------------------------------------------------------------------FVI
         DP K+ A++  P     + L+                                                                           F +
Subjt:  VDPAKIEAVTNWP-----RRLR---------------------------------------------------------------------------FVI

Query:  YSDASKKGLGCVLMQ----QGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY-SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE
         +DAS   +G VL Q    + + + Y SR L   E+NY T + E+  ++++L   R YLY +  I+++TDH+ L +    +  N + +RW   +++Y+CE
Subjt:  YSDASKKGLGCVLMQ----QGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLY-SEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE

Query:  ILYHLGKANVVADALSK
        ++Y  GK+NVVADALS+
Subjt:  ILYHLGKANVVADALSK

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value: 1.4e-44; % identity: 43.54
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTAFRSR

Query:  YGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVSVDPA
         G YE+ VM FGL NAP+ F   M   F+D    F+ V++DDILI+S++  EH +HL  VLE L++  L  K  KC+F  ++  FLG+ +  + ++    
Subjt:  YGYYEFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVSVDPA

Query:  KIEAVTNWP
        K  A+ ++P
Subjt:  KIEAVTNWP

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value: 5.0e-08; % identity: 44.64
Query:  HLHQVLETLRSNKLYAKFSKCEFWLKKVTFLG--HVVSSEGVSVDPAKIEAVTNWP
        HL  VL+    ++ YA   KC F   ++ +LG  H++S EGVS DPAK+EA+  WP
Subjt:  HLHQVLETLRSNKLYAKFSKCEFWLKKVTFLG--HVVSSEGVSVDPAKIEAVTNWP
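The % identity values in these tables can be related back to the alignment midline. The sketch below assumes the usual BLAST-style convention, which this page does not state: a letter in the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. It also assumes identity is counted over the full alignment length, gap columns included. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline shows an identical residue."""
    # Pad in case trailing spaces were stripped from the midline.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Query and midline rows of the ATMG00860.1 alignment above.
query = "HLHQVLETLRSNKLYAKFSKCEFWLKKVTFLG--HVVSSEGVSVDPAKIEAVTNWP"
midline = "HL  VL+    ++ YA   KC F   ++ +LG  H++S EGVS DPAK+EA+  WP"
print(round(percent_identity(query, midline), 2))
```

For this alignment, 25 of 56 columns are identical, which gives 44.64 and agrees with the value listed for ATMG00860.1.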


Sequences
CDS sequence
ATGGCTCCAACCGAGCCGAAGGAGCTGAAGGTACAACTGCAGGAGTTGCTGGACAAGGGTTTCATCCGACCTAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAA
GAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCTAAGATCGACCTGCGATCAGGCTATCACCAATTGAGGGTCAGGGACAGTGATATTCCTAAGACGGCCTTCCGTTCAAGATACGGATATTAC
GAGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCTGCGGTATTCATGGATTTGATGAACATGGTGTTTAAGGATTTCTTGGACTCGTTCATCATAGTTTTCATTGA
CGACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCACCAGGTTTTGGAGACCCTTCGATCCAATAAGCTGTATGCCAAGTTCTCCAAGTGTGAGT
TCTGGCTGAAGAAGGTGACTTTCCTCGGCCACGTGGTTTCCAGTGAGGGAGTTTCTGTAGATCCAGCAAAGATCGAAGCGGTTACCAATTGGCCTCGACGTCTACGCTTT
GTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTGTTTATGCCTCCCGTCAGTTGAAGAGTCATGAGCAGAACTACCC
TACCCACGACCTAGAGTTGGCAACAGTGGTTTTTGCACTGAAGATATGGAGGCACTACCTGTACAGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTGAAGTACT
TTTTCACCCAAAAGGAGCTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTGAAAGACTACGACTGCGAGATTCTGTATCACCTAGGTAAGGCAAATGTAGTAGCTGAC
GCGCTGAGCAAGAAGGTTGCTCATTCAGCAGCGTTTATCACCAAGCAAGCTCCCTTACTCAGAGATTTTGAGAGAGCCGAGATTGCAGTCTCAGTAGGAGAGGTTACCTC
ACAGTTGGCTCAGTTGTCAGTACAGCCGACCCTGAGATAG
mRNA sequence
ATGGCTCCAACCGAGCCGAAGGAGCTGAAGGTACAACTGCAGGAGTTGCTGGACAAGGGTTTCATCCGACCTAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAA
GAAGAAGGATGGGTCAATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCCCTTGCCCAGGATTGATGACTTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCTAAGATCGACCTGCGATCAGGCTATCACCAATTGAGGGTCAGGGACAGTGATATTCCTAAGACGGCCTTCCGTTCAAGATACGGATATTAC
GAGTTCATTGTGATGTCTTTTGGGTTGACTAATGCTCCTGCGGTATTCATGGATTTGATGAACATGGTGTTTAAGGATTTCTTGGACTCGTTCATCATAGTTTTCATTGA
CGACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCATTTGCACCAGGTTTTGGAGACCCTTCGATCCAATAAGCTGTATGCCAAGTTCTCCAAGTGTGAGT
TCTGGCTGAAGAAGGTGACTTTCCTCGGCCACGTGGTTTCCAGTGAGGGAGTTTCTGTAGATCCAGCAAAGATCGAAGCGGTTACCAATTGGCCTCGACGTCTACGCTTT
GTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTGTTTATGCCTCCCGTCAGTTGAAGAGTCATGAGCAGAACTACCC
TACCCACGACCTAGAGTTGGCAACAGTGGTTTTTGCACTGAAGATATGGAGGCACTACCTGTACAGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTGAAGTACT
TTTTCACCCAAAAGGAGCTGAACATGAGGCAGAGGAGGTGGCTTGAGTTAGTGAAAGACTACGACTGCGAGATTCTGTATCACCTAGGTAAGGCAAATGTAGTAGCTGAC
GCGCTGAGCAAGAAGGTTGCTCATTCAGCAGCGTTTATCACCAAGCAAGCTCCCTTACTCAGAGATTTTGAGAGAGCCGAGATTGCAGTCTCAGTAGGAGAGGTTACCTC
ACAGTTGGCTCAGTTGTCAGTACAGCCGACCCTGAGATAG
Protein sequence
MAPTEPKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRVRDSDIPKTAFRSRYGYY
EFIVMSFGLTNAPAVFMDLMNMVFKDFLDSFIIVFIDDILIYSKTEAEHEEHLHQVLETLRSNKLYAKFSKCEFWLKKVTFLGHVVSSEGVSVDPAKIEAVTNWPRRLRF
VIYSDASKKGLGCVLMQQGKVVVYASRQLKSHEQNYPTHDLELATVVFALKIWRHYLYSEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVAD
ALSKKVAHSAAFITKQAPLLRDFERAEIAVSVGEVTSQLAQLSVQPTLR
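The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that relationship is given here, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than the database's own tooling. Only the first 60 bases of the CDS are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the CDS sequence listed above.
cds_start = "ATGGCTCCAACCGAGCCGAAGGAGCTGAAGGTACAACTGCAGGAGTTGCTGGACAAGGGT"
print(translate(cds_start))
```

The result, MAPTEPKELKVQLQELLDKG, matches the first 20 residues of the protein sequence above.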