CuGenDBv2

Cmc04g0102991 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0102991
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr04:19870411..19871488
RNA-Seq Expression: Cmc04g0102991
Synteny: Cmc04g0102991
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
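The genome location field in this record has the form `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are illustrative and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# seqid, a colon, then start and end separated by two dots.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr04:19870411..19871488")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span is 1078 bp, slightly longer than the 1038-nt CDS listed under Sequences.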


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
AAO45752.1 pol protein [Cucumis melo subsp. melo] | 5.1e-179 | 92.49
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa] | 6.1e-180 | 93.06
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

KAA0048687.1 pol protein [Cucumis melo var. makuwa] | 2.1e-180 | 93.35
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

KAA0051357.1 pol protein [Cucumis melo var. makuwa] | 3.9e-179 | 92.77
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

TYK01613.1 pol protein [Cucumis melo var. makuwa] | 1.0e-179 | 93.06
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A5A7T190 Reverse transcriptase | 2.9e-180 | 93.06
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

A0A5A7U330 Reverse transcriptase | 1.0e-180 | 93.35
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

A0A5A7UAA8 Reverse transcriptase | 1.9e-179 | 92.77
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

A0A5A7URA1 Reverse transcriptase | 2.5e-179 | 92.49
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLF KKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEA TGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQNLKQKLVTAP+LTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

A0A5D3BPI1 Reverse transcriptase | 5.0e-180 | 93.06
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL                 GHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGW

Query:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQL+RKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

SwissProt top hits | e value | % identity (alignment follows each hit)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 5.5e-67 | 38.42
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +  E V KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT

Query:  AFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL------CP--------SGHVVSKGWV
        AF +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L      C          GHV++   +
Subjt:  AFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL------CP--------SGHVVSKGWV

Query:  SVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
          +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +   P+L VPD +  F + +DAS   L
Subjt:  SVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRYRYS
        G VL Q G  ++Y SR L  HE    T + EL A+V+A K +RHYL+ + +  S
Subjt:  GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRYRYS

P0CT41 Transposon Tf2-12 polyprotein | 3.2e-59 | 34.38
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPSG---------------HVVSKG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L  +                H+  KG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPSG---------------HVVSKG

Query:  WVSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKG
        +       I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   
Subjt:  WVSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKG

Query:  LGCVLMQQGK-----VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL
        +G VL Q+        V Y S ++   +      D E+ A++ +LK WRHYL
Subjt:  LGCVLMQQGK-----VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 9.4e-67 | 36.86
Query:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTA
        E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +E + KTA
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL------CP--------SGHVVSKGWVS
        F ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L      C          GH+V+   + 
Subjt:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKL------CP--------SGHVVSKGWVS

Query:  VDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLG
         +P K++A+  +  P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K       K    ++F+ LK  ++  P+L +PD    FV+ +DAS   LG
Subjt:  VDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLG

Query:  CVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY
         VL Q G  +++ SR L  HE      + EL A+V+A K +RHYL+ +++
Subjt:  CVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 8.5e-60 | 36.75
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGWVSVDPA
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VL+ L++  L                 G+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGWVSVDPA

Query:  KIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        K  A+  +  P T+ + + FLG+  YYRRF+ N S+IA P+       +   W++  + + + LK  L  +PVL   +   ++ + +DASK G+G VL +
Subjt:  KIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY
                 VV Y S+ L+S ++  P  +LEL  ++ AL  +R+ L  K +
Subjt:  QGK------VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 1.1e-59 | 36.75
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGWVSVDPA
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VL+ L++  L                 G+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPS--------------GHVVSKGWVSVDPA

Query:  KIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ
        K  A+  +  P T+ + + FLG+  YYRRF+ N S+IA P+       +   W++  + +   LK  L  +PVL   +   ++ + +DASK G+G VL +
Subjt:  KIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY
                 VV Y S+ L+S ++  P  +LEL  ++ AL  +R+ L  K +
Subjt:  QGK------VVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYLMVKRY

Arabidopsis top hits | e value | % identity (alignment follows each hit)
ATMG00860.1 DNA/RNA polymerases superfamily protein | 1.4e-20 | 41.22
Query:  HLRMVLQT------LRDNKLCPSG----------HVVSKGWVSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVW
        HL MVLQ         + K C  G          H++S   VS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQT------LRDNKLCPSG----------HVVSKGWVSVDPAKIEAVTGWTRPSTISEVRSFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
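The % identity column above is reported per hit over the whole alignment. As a rough sketch of how such a figure relates to the alignment text, the function below counts identical positions in a single block. It assumes that the unlabelled middle line of each block is the usual BLAST match line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `block_identity` is illustrative only, and a value for one block will not in general equal the per-hit value in the table.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, block length) for one alignment block.

    A position counts as identical when the match line repeats the
    query residue at that column; '+' and spaces are not identities.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# Final block of the AAO45752.1 alignment, copied from the listing above.
query = "GCVLMQQGKVVAYASRQLKSHEQQLPTHDLELAAVVFALKIWRHYL"
midline = "GCVLMQQGKVVAYASRQLKSHEQ  PTHDLELAAVVFALKIWRHYL"

identical, length = block_identity(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

Summing the identical counts and block lengths over every block of a hit, gap columns included, would give a whole-alignment figure comparable to the table column.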


Sequences
CDS sequence
ATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTT
GTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTGACCGTAAAGAACAGATATCCTTTGCCCAGGATTGACGACCTATTC
GACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCC
AGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTAATGAACAGAGTGTTTAGGGAGTTCCTAGATACT
TTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACAGAGGCTGAACATGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTG
TGTCCTTCTGGTCACGTGGTTTCTAAGGGCTGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAATCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGATTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGTCCAGAAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGAT
GCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCCCGTCAGTTGAAGAGTCATGAGCAGCAACTACCTACACATGAT
CTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTAATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGA
mRNA sequence
ATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTT
GTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTGACCGTAAAGAACAGATATCCTTTGCCCAGGATTGACGACCTATTC
GACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCC
AGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTAATGAACAGAGTGTTTAGGGAGTTCCTAGATACT
TTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACAGAGGCTGAACATGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTG
TGTCCTTCTGGTCACGTGGTTTCTAAGGGCTGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAATCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGATTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGTCCAGAAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGAT
GCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCCCGTCAGTTGAAGAGTCATGAGCAGCAACTACCTACACATGAT
CTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTAATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGA
Protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRS
RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLCPSGHVVSKGWVSVDPAKIEAVTGWTRPSTISEVR
SFLGLAGYYRRFVENFSRIATPLTQLSRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQQLPTHD
LELAAVVFALKIWRHYLMVKRYRYSRIIRA
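The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are illustrative, and only the first line of the CDS is used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the CDS sequence listed above (105 nt, 35 codons).
first_cds_line = (
    "ATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGA"
    "TTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTT"
)
print(translate(first_cds_line))
```

Passing the full CDS, with its lines joined, should reproduce the 345-residue protein, since the function stops at the terminal TGA codon.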