; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0225811 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0225811
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:17143262..17144932
RNA-Seq ExpressionCmc08g0225811
SyntenyCmc08g0225811
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048687.1 pol protein [Cucumis melo var. makuwa]1.2e-30294.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAA+HASIDCS KEV FNPPS  SFKFKG GSRSLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

KAA0057668.1 pol protein [Cucumis melo var. makuwa]2.0e-30294.59Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPS  SFKFKGEGSRSLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLF KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLK VSFLGHVV KAGVSVDPAKIE  T W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+AP+LTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQ PLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

KAA0062719.1 pol protein [Cucumis melo var. makuwa]1.2e-30294.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVIL  DWLAANHASIDCS KEV FNPPS  SFKFKGEGSRSLPQVISA+RASKLLSQGTW ILASVVDTR+ DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNALAVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VTSW RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVL+QQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

KAA0067821.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-30799.44Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIE VTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVS
        RWLELVKDYDCEILYHLGKANVVADALSRK++
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVS

TYK01613.1 pol protein [Cucumis melo var. makuwa]7.0e-30394.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GS+SLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQ LKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

TrEMBL top hitse value%identityAlignment
A0A5A7U330 Reverse transcriptase5.8e-30394.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAA+HASIDCS KEV FNPPS  SFKFKG GSRSLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

A0A5A7URA1 Reverse transcriptase9.8e-30394.59Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPS  SFKFKGEGSRSLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLF KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLK VSFLGHVV KAGVSVDPAKIE  T W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+AP+LTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQ PLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

A0A5A7VAL8 Pol protein5.8e-30394.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVIL  DWLAANHASIDCS KEV FNPPS  SFKFKGEGSRSLPQVISA+RASKLLSQGTW ILASVVDTR+ DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNALAVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VTSW RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVL+QQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

A0A5A7VQE0 Reverse transcriptase1.7e-30799.44Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIE VTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVS
        RWLELVKDYDCEILYHLGKANVVADALSRK++
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVS

A0A5D3BPI1 Reverse transcriptase3.4e-30394.95Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GS+SLPQVISA+RASKLLSQGTW ILASVVDTRE DVS SSEPVVRDYPDVF EELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPG

Query:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPI RAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK
        GATVFSKIDLRSGYHQLRIKD DV KTAFRSRYGHYEFIVMSFGLTNA AVFM+LMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQT RDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNK

Query:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL
        LYAKFSKCEFWLKQVSFLGHVV KAGVSVDPAKIE VT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKACEDSFQ LKQKL
Subjt:  LYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKL

Query:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        V+APVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY+SRQLKSHEQNYPTHDLELAAVVFA KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
        RWLELVKDYDCEILYH GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV
Subjt:  RWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.5e-8640.2Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    V KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKT

Query:  AFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGV
        AF +++GHYE++ M FGL NA A F   MN + R  L    +V++DDI+++S +  EH + L +V +      L  +  KCEF  ++ +FLGHV+   G+
Subjt:  AFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGV

Query:  SVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDS-FQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGL
          +P KIE +  +P P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +   P+L VPD +  F + +DAS   L
Subjt:  SVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDS-FQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA
        G VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADA
Subjt:  GCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.0e-8638.85Show/hide
Query:  PIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
        PI+   Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLK
        +HQ+ + +  + KTAF ++ GHYE++ M FGL NA A F   MN + R  L    +V++DDI+I+S +  EH   +++V     D  L  +  KCEF  K
Subjt:  YHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLK

Query:  QVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSK-ACEDSFQNLKQKLVSAPVLTVPDGS
        + +FLGH+V   G+  +P K++ + S+P P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K       K    ++F+ LK  ++  P+L +PD  
Subjt:  QVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSK-ACEDSFQNLKQKLVSAPVLTVPDGS

Query:  GSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE
          FV+ +DAS   LG VL Q G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y  +
Subjt:  GSFVIYSDASKKGLGCVLMQQGKVVAYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE

Query:  ILYHLGKANVVADALSR
        I Y  GK N VADALSR
Subjt:  ILYHLGKANVVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-8035.63Show/hide
Query:  ASKLLSQGTWSILASVVDTREVDVSSSSEP---------VVRDYPDVFLEELPGLPP---HREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELL
        AS L   G +S + S + + E + +  S           + + Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LL
Subjt:  ASKLLSQGTWSILASVVDTREVDVSSSSEP---------VVRDYPDVFLEELPGLPP---HREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELL

Query:  DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSF
        D  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM F
Subjt:  DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSF

Query:  GLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRP
        GL NA + F   M   FR+    FV V++DDILI+S++  EH +HL  VL+  ++  L  K  KC+F  ++  FLG+ +    ++    K   +  +P P
Subjt:  GLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRP

Query:  STVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------VV
         TV + + FLG+  YYRRF+ N S+IA P+       +   W++  + + + LK  L ++PVL   +   ++ + +DASK G+G VL +         VV
Subjt:  STVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------VV

Query:  AYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKV
         Y S+ L+S ++NYP  +LEL  ++ A   +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR +
Subjt:  AYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.4e-8235.84Show/hide
Query:  DYPDVFLEELPGLPPHREVEFAIELEPGT---VPIFRAPYRMAPVELK-ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELN
        ++P +F   L G+     VE A++ E  T    PI+   Y   PV ++ E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN
Subjt:  DYPDVFLEELPGLPPHREVEFAIELEPGT---VPIFRAPYRMAPVELK-ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELN

Query:  KVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYS
         VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+ D+ KTAF +  G YEF+ + FGL NA A+F  +++ + RE +G    V+IDDI+++S
Subjt:  KVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYS

Query:  KTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-
        +    H ++LR+VL +     L     K  F   QV FLG++V   G+  DP K+  ++  P P++V E++ FLG+  YYR+F+++++++A PLT LTR 
Subjt:  KTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-

Query:  ----------KAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QGKVVAYSSRQLKSHEQNYPTHDLELAAVVFA
                     P    +    SF +LK  L S+ +L  P  +  F + +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ A++++
Subjt:  ----------KAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QGKVVAYSSRQLKSHEQNYPTHDLELAAVVFA

Query:  FKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSR
            R YLYG   I+++TDH+ L +    +  N + +RW   +++Y+CE++Y  GK+NVVADALSR
Subjt:  FKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-8035.83Show/hide
Query:  ASKLLSQGTWSILASVVDTREVDVSSSSEP---------VVRDYPDVFLEELPGLPP---HREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELL
        AS L   G +S + S + + E + +  S           + + Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LL
Subjt:  ASKLLSQGTWSILASVVDTREVDVSSSSEP---------VVRDYPDVFLEELPGLPP---HREVEFAIELEPGTVPIFRAPYRMAPVELKELKVQLQELL

Query:  DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSF
        D  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +  G YE+ VM F
Subjt:  DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVLKTAFRSRYGHYEFIVMSF

Query:  GLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRP
        GL NA + F   M   FR+    FV V++DDILI+S++  EH +HL  VL+  ++  L  K  KC+F  ++  FLG+ +    ++    K   +  +P P
Subjt:  GLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVDPAKIEVVTSWPRP

Query:  STVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------VV
         TV + + FLG+  YYRRF+ N S+IA P+       +   W++  + +   LK  L ++PVL   +   ++ + +DASK G+G VL +         VV
Subjt:  STVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------VV

Query:  AYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKV
         Y S+ L+S ++NYP  +LEL  ++ A   +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR V
Subjt:  AYSSRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKV

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.7e-2643.51Show/hide
Query:  HLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLG--HVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVW
        HL MVLQ +  ++ YA   KC F   Q+++LG  H++   GVS DPAK+E +  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLG--HVVFKAGVSVDPAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVW

Query:  SKACEDSFQNLKQKLVSAPVLTVPDGSGSFV
        ++    +F+ LK  + + PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVSAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCCATAAGGAGGTAGCATTTAATCCTCCCTCGATGGTCAGTTT
TAAATTTAAGGGAGAAGGGTCAAGGTCGTTACCTCAGGTAATCTCAGCCATGAGGGCCAGCAAACTGCTCAGTCAAGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATA
CTAGAGAGGTTGATGTATCCTCGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCTTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGGTTGAGTTTGCC
ATAGAGTTGGAGCCGGGCACGGTTCCTATATTCAGAGCCCCGTACAGAATGGCCCCAGTAGAGTTGAAGGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGCTT
CATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCAATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTA
AGAACAGATATCCCTTGCCCAGGATCGACGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAG
GATGGTGATGTACTGAAGACGGCCTTTCGTTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGATTGACGAATGCTCTGGCAGTGTTTATGAACTTGATGAA
CAGAGTGTTTAGGGAGTTCCTAGGCACTTTTGTGATCGTGTTTATTGATGACATCTTGATATATTCCAAGACGGAGGCCGAGCATGAGGAACATTTACGTATGGTTTTGC
AAACCTTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTAAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTTTAAGGCTGGAGTTTCTGTGGAT
CCAGCTAAGATAGAGGTAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGTTTTCTAGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTC
CCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAATCTTAAACAGAAGCTAGTTTCTGCAC
CAGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGCTTGGGTTGTGTATTGATGCAGCAAGGCAAGGTAGTCGCTTATTCT
TCTCGTCAGTTGAAGAGTCACGAGCAGAATTACCCCACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTCAAGATATGGAGGCATTACTTGTATGGTGAAAAGAT
ACAGATTTTCACGGATCATAAGAGCTTGAAATATTTCTTTACTCAAAAGGAATTGAATATGAGACAACGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATAC
TGTATCATCTAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGG
GCTGAGATTGCAGTTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCCATAAGGAGGTAGCATTTAATCCTCCCTCGATGGTCAGTTT
TAAATTTAAGGGAGAAGGGTCAAGGTCGTTACCTCAGGTAATCTCAGCCATGAGGGCCAGCAAACTGCTCAGTCAAGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATA
CTAGAGAGGTTGATGTATCCTCGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCTTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGGTTGAGTTTGCC
ATAGAGTTGGAGCCGGGCACGGTTCCTATATTCAGAGCCCCGTACAGAATGGCCCCAGTAGAGTTGAAGGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGCTT
CATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCAATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTA
AGAACAGATATCCCTTGCCCAGGATCGACGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAG
GATGGTGATGTACTGAAGACGGCCTTTCGTTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGATTGACGAATGCTCTGGCAGTGTTTATGAACTTGATGAA
CAGAGTGTTTAGGGAGTTCCTAGGCACTTTTGTGATCGTGTTTATTGATGACATCTTGATATATTCCAAGACGGAGGCCGAGCATGAGGAACATTTACGTATGGTTTTGC
AAACCTTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTAAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTTTAAGGCTGGAGTTTCTGTGGAT
CCAGCTAAGATAGAGGTAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGTTTTCTAGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTC
CCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAAGCAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAATCTTAAACAGAAGCTAGTTTCTGCAC
CAGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGCTTGGGTTGTGTATTGATGCAGCAAGGCAAGGTAGTCGCTTATTCT
TCTCGTCAGTTGAAGAGTCACGAGCAGAATTACCCCACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTCAAGATATGGAGGCATTACTTGTATGGTGAAAAGAT
ACAGATTTTCACGGATCATAAGAGCTTGAAATATTTCTTTACTCAAAAGGAATTGAATATGAGACAACGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATAC
TGTATCATCTAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGG
GCTGAGATTGCAGTTCAGTAG
Protein sequenceShow/hide protein sequence
MLDFDVILGMDWLAANHASIDCSHKEVAFNPPSMVSFKFKGEGSRSLPQVISAMRASKLLSQGTWSILASVVDTREVDVSSSSEPVVRDYPDVFLEELPGLPPHREVEFA
IELEPGTVPIFRAPYRMAPVELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
DGDVLKTAFRSRYGHYEFIVMSFGLTNALAVFMNLMNRVFREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTFRDNKLYAKFSKCEFWLKQVSFLGHVVFKAGVSVD
PAKIEVVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKAAPFVWSKACEDSFQNLKQKLVSAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYS
SRQLKSHEQNYPTHDLELAAVVFAFKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHLGKANVVADALSRKVSHSAALITRQAPLHRDLER
AEIAVQ