; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227531 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227531
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:20676456..20678039
RNA-Seq ExpressionCmc08g0227531
SyntenyCmc08g0227531
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]1.2e-26488.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTREVDVSLSLEPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK E EHEEHLRMVLQTL+DNKLY KF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

KAA0048687.1 pol protein [Cucumis melo var. makuwa]1.2e-26488.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

KAA0057672.1 pol protein [Cucumis melo var. makuwa]1.3e-26388.43Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSG HQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

KAA0058464.1 pol protein [Cucumis melo var. makuwa]2.2e-26387.86Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTREVDVSLS EPVVRDY +VFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVI+FI+DILIYSK EAEHEEHLR+VLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSK  VSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG+GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.5e-26488.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

TrEMBL top hitse value%identityAlignment
A0A5A7T190 Reverse transcriptase5.6e-26588.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTREVDVSLSLEPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK E EHEEHLRMVLQTL+DNKLY KF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

A0A5A7U330 Reverse transcriptase5.6e-26588.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

A0A5A7UP94 Pol protein6.2e-26488.43Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSG HQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

A0A5A7UTH9 Reverse transcriptase1.1e-26387.86Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTREVDVSLS EPVVRDY +VFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVI+FI+DILIYSK EAEHEEHLR+VLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSK  VSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG+GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

A0A5D3BPI1 Reverse transcriptase7.4e-26588.61Show/hide
Query:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS
        +RASKLLSQGTW ILASVVDTRE DVSLS EPVVRDY DVFPEELPGLPPH+E+EFAIELEPGTVPIS+AP RMAPA+LKELKVQLQELL KGFIRPSVS
Subjt:  MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVS

Query:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM
        PWGAP+LFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPILFVKKKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFM

Query:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREFLDTFVIVFI+DILIYSK EAEHEEHLRMVLQTL+DNKLY KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA
        GLA                        APFVWSKACEDSFQ LKQKLVTAP+LT+PDG GSFVIYS+ASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP 
Subjt:  GLA------------------------APFVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPA

Query:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        HDLELAAVVF LK+WRHYLYGEKI+IFTDHKSLKYFFTQKELNMRQRRWLELVKDYD EILYHPGKANVVADALSRKVSHSAALITRQAPLHRD E+AEI
Subjt:  HDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

Query:  LVSVGAVTMQLAQLTVQPTLRQRIIDA
         VSVGAVTMQLAQLTVQPTLRQRIIDA
Subjt:  LVSVGAVTMQLAQLTVQPTLRQRIIDA

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.5e-7335.98Show/hide
Query:  KELKVQLQELLHKGFIRPSVSPWGAPILFV-KKKDGS----MRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKT
        +E++ Q+Q++L++G IR S SP+ +PI  V KK+D S     R+ I+YR+LN++TV +R+P+P +D++  +L     F+ I+L  G+HQ+ +    V KT
Subjt:  KELKVQLQELLHKGFIRPSVSPWGAPILFV-KKKDGS----MRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKT

Query:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGV
        AF +++GHYE++ M FGL NAP  F   MN + R  L+   +V+++DI+++S    EH + L +V + L    L  +  KCEF  ++ +FLGHV++  G+
Subjt:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGV

Query:  SVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-------------------------VWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGL
          +P KIEA+  +P P+   E+++FLGL   +                           +   + +F+KLK  +   PIL +PD    F + ++AS   L
Subjt:  SVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-------------------------VWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGL

Query:  GCVLMQQGKVVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADA
        G VL Q G  ++Y S  L  HE NY   + EL A+V+  K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D++I Y  GK N VADA
Subjt:  GCVLMQQGKVVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P0CT34 Transposon Tf2-1 polyprotein3.5e-7033.33Show/hide
Query:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR
        L  + +++ D+      E+LP   P K +EF +EL  E   +PI   P  + P K++ +  ++ + L  G IR S +    P++FV KK+G++R+ ++Y+
Subjt:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR

Query:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL
         LNK    N YPLP I+ L  ++QG+T+F+K++L+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ +++DIL
Subjt:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL

Query:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----
        I+SK E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG              L  P    
Subjt:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----

Query:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY
              + W+     + + +KQ LV+ P+L   D     ++ ++AS   +G VL Q+        V Y S ++   + NY   D E+ A++  LK WRHY
Subjt:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY

Query:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        L    E  KI TDH++L    T +    N R  RW   ++D+++EI Y PG AN +ADALSR       ++    P+ +D E   I
Subjt:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

P0CT35 Transposon Tf2-2 polyprotein3.5e-7033.33Show/hide
Query:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR
        L  + +++ D+      E+LP   P K +EF +EL  E   +PI   P  + P K++ +  ++ + L  G IR S +    P++FV KK+G++R+ ++Y+
Subjt:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR

Query:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL
         LNK    N YPLP I+ L  ++QG+T+F+K++L+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ +++DIL
Subjt:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL

Query:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----
        I+SK E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG              L  P    
Subjt:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----

Query:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY
              + W+     + + +KQ LV+ P+L   D     ++ ++AS   +G VL Q+        V Y S ++   + NY   D E+ A++  LK WRHY
Subjt:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY

Query:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        L    E  KI TDH++L    T +    N R  RW   ++D+++EI Y PG AN +ADALSR       ++    P+ +D E   I
Subjt:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

P0CT41 Transposon Tf2-12 polyprotein3.5e-7033.33Show/hide
Query:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR
        L  + +++ D+      E+LP   P K +EF +EL  E   +PI   P  + P K++ +  ++ + L  G IR S +    P++FV KK+G++R+ ++Y+
Subjt:  LEPVVRDYSDVF----PEELPGLPPHKEIEFAIEL--EPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKDGSMRLCINYR

Query:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL
         LNK    N YPLP I+ L  ++QG+T+F+K++L+S YH +R++ GD  K AFR   G +E++VM +G++ AP  F   +N +  E  ++ V+ +++DIL
Subjt:  ELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDIL

Query:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----
        I+SK E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG              L  P    
Subjt:  IYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLG--------------LAAP----

Query:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY
              + W+     + + +KQ LV+ P+L   D     ++ ++AS   +G VL Q+        V Y S ++   + NY   D E+ A++  LK WRHY
Subjt:  ------FVWSKACEDSFQKLKQKLVTAPILTIPDGFGSFVIYSNASKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHY

Query:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI
        L    E  KI TDH++L    T +    N R  RW   ++D+++EI Y PG AN +ADALSR       ++    P+ +D E   I
Subjt:  LYG--EKIKIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDYEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFEKAEI

P20825 Retrovirus-related Pol polyprotein from transposon 2973.0e-7435.73Show/hide
Query:  PISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKD-----GSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSG
        PI      +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ I+YR+LN++T+ +RYP+P +D++  +L     F+ I+L  G
Subjt:  PISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVKKKD-----GSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSG

Query:  YHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLK
        +HQ+ + +  + KTAF ++ GHYE++ M FGL NAP  F   MN + R  L+   +V+++DI+I+S    EH   +++V   L D  L  +  KCEF  K
Subjt:  YHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFINDILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-----------------------VWSKACE--DSFQKLKQKLVTAPILTIPDGF
        + +FLGH+V+  G+  +P K++A+ S+P P+   E+R+FLGL   +                       + ++  E  ++F+KLK  ++  PIL +PD  
Subjt:  QVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-----------------------VWSKACE--DSFQKLKQKLVTAPILTIPDGF

Query:  GSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYE
          FV+ ++AS   LG VL Q G  +++ S  L  HE NY A + EL A+V+  K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y ++
Subjt:  GSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYE

Query:  ILYHPGKANVVADALSR
        I Y  GK N VADALSR
Subjt:  ILYHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.3e-1533.87Show/hide
Query:  HLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-----------------------VWS
        HL MVLQ  + ++ Y    KC F   Q+++LG  H++S  GVS DPAK+EA+  WP P   +E+R FLGL   +                        W+
Subjt:  HLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPF-----------------------VWS

Query:  KACEDSFQKLKQKLVTAPILTIPD
        +    +F+ LK  + T P+L +PD
Subjt:  KACEDSFQKLKQKLVTAPILTIPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCCAGCAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGTTGACGTATCCCTGTCATTGGAACCAGTGGTAAGGGACTA
CTCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAAAGAGATCGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAAAGCCCCAAACAGAA
TGGCCCCAGCAAAGTTGAAAGAACTGAAAGTGCAGTTGCAGGAATTGCTTCATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAATTTTATTTGTTAAG
AAGAAAGATGGATCGATGCGCCTATGCATTAACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACA
GGGAGCTACAGTATTCTCTAAGATTAATCTTCGGTCGGGATATCATCAGTTGAGAATTAAGGATGGTGATGTACCGAAGACAGCTTTTCGTTCTAGATACGGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTAAT
GATATCTTGATATATTCCAAGATGGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTCAGGATAATAAATTGTATACAAAGTTCTCAAAATGCGAGTT
TTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCA
GTGAGGTTCGTAGCTTTCTGGGTTTAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAAACTTAAACAGAAGCTAGTTACTGCACCGATTCTTACC
ATACCTGATGGTTTCGGCAGTTTTGTGATTTACAGTAATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTTGTCAGTT
GAAGAGTCATGAGCAAAATTACCCTGCACACGATTTAGAGTTGGCAGCAGTGGTTTTTGTATTGAAGGTATGGAGGCATTACTTGTATGGTGAAAAGATAAAGATCTTCA
CGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTATGATTATGAGATATTGTATCATCCA
GGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATTTTGAGAAGGCTGAGATTTT
AGTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAACCGACTTTGAGGCAAAGGATCATTGATGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCCAGCAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGTTGACGTATCCCTGTCATTGGAACCAGTGGTAAGGGACTA
CTCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAAAGAGATCGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAAAGCCCCAAACAGAA
TGGCCCCAGCAAAGTTGAAAGAACTGAAAGTGCAGTTGCAGGAATTGCTTCATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAATTTTATTTGTTAAG
AAGAAAGATGGATCGATGCGCCTATGCATTAACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACA
GGGAGCTACAGTATTCTCTAAGATTAATCTTCGGTCGGGATATCATCAGTTGAGAATTAAGGATGGTGATGTACCGAAGACAGCTTTTCGTTCTAGATACGGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTAAT
GATATCTTGATATATTCCAAGATGGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTCAGGATAATAAATTGTATACAAAGTTCTCAAAATGCGAGTT
TTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCA
GTGAGGTTCGTAGCTTTCTGGGTTTAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAAACTTAAACAGAAGCTAGTTACTGCACCGATTCTTACC
ATACCTGATGGTTTCGGCAGTTTTGTGATTTACAGTAATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTTGTCAGTT
GAAGAGTCATGAGCAAAATTACCCTGCACACGATTTAGAGTTGGCAGCAGTGGTTTTTGTATTGAAGGTATGGAGGCATTACTTGTATGGTGAAAAGATAAAGATCTTCA
CGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTATGATTATGAGATATTGTATCATCCA
GGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATTTTGAGAAGGCTGAGATTTT
AGTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAACCGACTTTGAGGCAAAGGATCATTGATGCTTAG
Protein sequenceShow/hide protein sequence
MRASKLLSQGTWSILASVVDTREVDVSLSLEPVVRDYSDVFPEELPGLPPHKEIEFAIELEPGTVPISKAPNRMAPAKLKELKVQLQELLHKGFIRPSVSPWGAPILFVK
KKDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIN
DILIYSKMEAEHEEHLRMVLQTLQDNKLYTKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAAPFVWSKACEDSFQKLKQKLVTAPILT
IPDGFGSFVIYSNASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPAHDLELAAVVFVLKVWRHYLYGEKIKIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDYEILYHP
GKANVVADALSRKVSHSAALITRQAPLHRDFEKAEILVSVGAVTMQLAQLTVQPTLRQRIIDA