CuGenDBv2

Cmc01g0025741 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0025741
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr01:26654301..26655654
RNA-Seq Expression: Cmc01g0025741
Synteny: Cmc01g0025741
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
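
The Genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of splitting it apart is shown below; the helper names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'CMiso1.1chr01:26654301..26655654'."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr01:26654301..26655654")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene region spans 1,354 bases.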


Homology
GenBank top hits (e value, %identity, alignment)
KAA0031895.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.5e-189; identity: 86.65%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K + CQIEIAGHVIEVTL+V DMLDFDVILGMDWLA NHASIDCSRKEVTFNPPSMA+FKFK G S+SLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV------------------------------------LLGHVVSKA
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                       LGHVVSKA
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV------------------------------------LLGHVVSKA

Query:  GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV NFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.6e-186; identity: 78.6%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR  D
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLS EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

KAA0037291.1 pol protein [Cucumis melo var. makuwa] (e value: 6.0e-186; identity: 78.38%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KG PFVWSKACEDSFQNLKQKLVTAPVLTVPDGS SFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

KAA0043391.1 pol protein [Cucumis melo var. makuwa] (e value: 4.6e-186; identity: 78.6%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

TYK01613.1 pol protein [Cucumis melo var. makuwa] (e value: 5.5e-187; identity: 78.83%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

TrEMBL top hits (e value, %identity, alignment)
A0A5A7SRV9 Reverse transcriptase (e value: 7.4e-190; identity: 86.65%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K + CQIEIAGHVIEVTL+V DMLDFDVILGMDWLA NHASIDCSRKEVTFNPPSMA+FKFK G S+SLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV------------------------------------LLGHVVSKA
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                       LGHVVSKA
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV------------------------------------LLGHVVSKA

Query:  GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV NFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

A0A5A7T190 Reverse transcriptase (e value: 7.7e-187; identity: 78.6%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR  D
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLS EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

A0A5A7T6R9 Reverse transcriptase (e value: 2.9e-186; identity: 78.38%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KG PFVWSKACEDSFQNLKQKLVTAPVLTVPDGS SFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

A0A5A7TP96 Reverse transcriptase (e value: 2.2e-186; identity: 78.6%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

A0A5D3BPI1 Reverse transcriptase (e value: 2.6e-187; identity: 78.83%)
Query:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD
        K +ACQIEIAGHVIEVTLIV DMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFK GGSKSLPQVISAIRASKLLSQGTWGILASVVDTR AD
Subjt:  KGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGAD

Query:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
        VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELK+QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------
        NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS                                               
Subjt:  NKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSV---------------------------------------------

Query:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
                                               LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
Subjt:  --------------------------------------LLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Query:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
        KGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDA+
Subjt:  KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

SwissProt top hits (e value, %identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 5.2e-31; identity: 28.29%)
Query:  YRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  KDEDVPKTAF-----------------------------------------------------------------------------------RSSVLLG
          E V KTAF                                                                                   + +  LG
Subjt:  KDEDVPKTAF-----------------------------------------------------------------------------------RSSVLLG

Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIY
        HV++  G+  +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +   P+L VPD +  F + 
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIY

Query:  SDAA
        +DA+
Subjt:  SDAA

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 1.2e-32; identity: 25.63%)
Query:  RDYPDVFPEELPGLPPHREVEFAIELEPGTV--------------PISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDG------
        +++P++F  +L  +       FA+E EP TV              P+    YR   ++++E++ Q+Q+L+    + PSVS + +P+L V KK        
Subjt:  RDYPDVFPEELPGLPPHREVEFAIELEPGTV--------------PISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDG------

Query:  SMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSS-----------------------------------
          RL IDYR++NK  + +++PLPRIDD+ DQL  A  FS +DL SG+HQ+ + +     T+F +S                                   
Subjt:  SMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSS-----------------------------------

Query:  ------------------------------------------------VLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFS
                                                          LGH  +  G+  D  K + +  +  P      R F+    YYRRF++NF+
Subjt:  ------------------------------------------------VLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFS

Query:  RIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
          +  +T+L +K  PF W+  C+ +F +LK +L+   +L  PD S  F I +DA+
Subjt:  RIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.4e-31; identity: 26.77%)
Query:  PISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
        PI    Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDEDVPKTAF-----------------------------------------------------------------------------------R
        +HQ+ + +E + KTAF                                                                                   +
Subjt:  YHQLRIKDEDVPKTAF-----------------------------------------------------------------------------------R

Query:  SSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGS
         +  LGH+V+  G+  +P K++A+  +  P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K       K    ++F+ LK  ++  P+L +PD  
Subjt:  SSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGS

Query:  GSFVIYSDAA
          FV+ +DA+
Subjt:  GSFVIYSDAA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.2e-30; identity: 26.65%)
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF---------------------------------------------------------
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF                                                         
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF---------------------------------------------------------

Query:  ------------------------RSSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA
                                  +  LG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+       +   W++ 
Subjt:  ------------------------RSSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA

Query:  CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
         + + + LK  L  +PVL   +   ++ + +DA+
Subjt:  CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 1.5e-30; identity: 26.65%)
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF---------------------------------------------------------
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF                                                         
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF---------------------------------------------------------

Query:  ------------------------RSSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA
                                  +  LG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+       +   W++ 
Subjt:  ------------------------RSSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA

Query:  CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA
         + +   LK  L  +PVL   +   ++ + +DA+
Subjt:  CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAA

Arabidopsis top hits (e value, %identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 6.5e-21; identity: 47.42%)
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T PVL +PD    FV
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
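
The %identity column can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes the usual BLAST convention of identities divided by alignment length; the function name `percent_identity` is hypothetical, and whether the site computes the column exactly this way is an assumption. The input is the Arabidopsis hit ATMG00860.1.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline are identical positions; '+' and spaces are not.
    Assumption: the denominator is the full alignment length, gaps included.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identities / alignment_length, 2)


query = (
    "HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV"
)
midline = (
    "H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T PVL +PD    FV"
)

# The query row has no gap characters here, so its length is the alignment length.
print(percent_identity(midline, len(query)))  # 46 identities over 97 columns
```

For this hit, the recomputed value agrees with the 47.42 reported in the table.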


Sequences
CDS sequence
ATGTATGTTGTCGAAGGAAAAGGTGAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCCGTGATATGCTGGACTTTGATGTAATCCTGGG
TATGGATTGGTTGGCCGCTAACCACGCCAGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGAAGGAGGGTCAAAGT
CGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGGGGCGGATGTATCCCTGTCG
TCAGAACCGGTGGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCTCACAGGGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCC
TATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAACTGAAGGAACTGAAGATACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGG
GTGCGCCAGTCTTATTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATT
GACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATT
TCGTTCCAGTGTCCTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGG
TTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTCTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCAGC
AGGGTAA
mRNA sequence
ATGTATGTTGTCGAAGGAAAAGGTGAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCCGTGATATGCTGGACTTTGATGTAATCCTGGG
TATGGATTGGTTGGCCGCTAACCACGCCAGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGAAGGAGGGTCAAAGT
CGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGGGGCGGATGTATCCCTGTCG
TCAGAACCGGTGGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCTCACAGGGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCC
TATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAACTGAAGGAACTGAAGATACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGG
GTGCGCCAGTCTTATTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATT
GACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATT
TCGTTCCAGTGTCCTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGG
TTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTCTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCAGC
AGGGTAA
Protein sequence
MYVVEGKGEACQIEIAGHVIEVTLIVRDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKEGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRGADVSLS
SEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKIQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRI
DDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSSVLLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDAAG
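
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code; the `translate` helper is hypothetical, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First twelve codons of the CDS, then its final five codons (ending in TAA).
print(translate("ATGTATGTTGTCGAAGGAAAAGGTGAAGCATGCCAG"))  # MYVVEGKGEACQ
print(translate("GATGCAGCAGGGTAA"))                       # DAAG
```

Both fragments match the start (MYVVEGKGEACQ) and end (DAAG) of the protein sequence listed above.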