; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0070871 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0070871
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:16827061..16828154
RNA-Seq ExpressionCmc03g0070871
SyntenyCmc03g0070871
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO45752.1 pol protein [Cucumis melo subsp. melo]2.4e-17687.91Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHY+FIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSG+FVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

KAA0040689.1 pol protein [Cucumis melo var. makuwa]1.8e-17688.19Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPS+SPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDPV IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVL VPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

KAA0043391.1 pol protein [Cucumis melo var. makuwa]1.4e-17688.19Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSH+QNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

KAA0048687.1 pol protein [Cucumis melo var. makuwa]4.7e-17788.46Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

TYK01613.1 pol protein [Cucumis melo var. makuwa]4.7e-17788.46Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

TrEMBL top hitse value%identityAlignment
A0A5A7THE6 Reverse transcriptase8.7e-17788.19Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPS+SPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDPV IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVL VPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

A0A5A7TP96 Reverse transcriptase6.7e-17788.19Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSH+QNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

A0A5A7U330 Reverse transcriptase2.3e-17788.46Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

A0A5A7UC07 Reverse transcriptase1.1e-17687.91Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIK+GDV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQ YPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

A0A5D3BPI1 Reverse transcriptase2.3e-17788.46Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        MAPA LKELKVQLQELLDKGFIRPSVSPWGA VLFVKKKDG+MRLCIDYRELNKVTVKNRYPLPRI+DLFDQLQGATVFSKIDLRSGYH+LRIKD DV K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
        VSVDP  IEAVT W RPSTVSEVRSFLGLA                       G+ FVWSKACEDSFQ+LKQKLVTAPVLTVPDGSGSFVIYSDASKK L
Subjt:  VSVDPV-IEAVTSWPRPSTVSEVRSFLGLA-----------------------GSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        GCVLMQQGKVVAYASRQLKSHEQNYPTH+LELA VVFALKIWRHYLYGEKIQIFTDHKSLKYFF
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.1e-6335.71Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGA-----MRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKT
        +E++ Q+Q++L++G IR S SP+ + +  V KK  A      R+ IDYR+LN++TV +R+P+P ++++  +L     F+ IDL  G+H++ +    V KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGA-----MRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKT

Query:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV
        AF +++GHYE++ M FGL NAP  F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLGHV++  G+
Subjt:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV

Query:  SVDP-VIEAVTSWPRPSTVSEVRSFLGLAG--SSFVWSKA----------------------CEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
          +P  IEA+  +P P+   E+++FLGL G    F+ + A                       + +F+ LK  +   P+L VPD +  F + +DAS  +L
Subjt:  SVDP-VIEAVTSWPRPSTVSEVRSFLGLAG--SSFVWSKA----------------------CEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF
        G VL Q G  ++Y SR L  HE NY T   EL  +V+A K +RHYL G   +I +DH+ L + +
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFF

P0CT41 Transposon Tf2-12 polyprotein6.7e-5733.79Show/hide
Query:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK
        + P  ++ +  ++ + L  G IR S +     V+FV KK+G +R+ +DY+ LNK    N YPLP I  L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ AP  F   +N +  E  ++ ++ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VS-VDPVIEAVTSWPRPSTVSEVRSFLGLAG--SSFV---------------------WSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL
         +     I+ V  W +P    E+R FLG       F+                     W+     + +++KQ LV+ PVL   D S   ++ +DAS  ++
Subjt:  VS-VDPVIEAVTSWPRPSTVSEVRSFLGLAG--SSFV---------------------WSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYG--EKIQIFTDHKSL
        G VL Q+        V Y S ++   + NY   + E+  ++ +LK WRHYL    E  +I TDH++L
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYG--EKIQIFTDHKSL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.3e-6535.99Show/hide
Query:  ELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGA-----MRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTA
        E++ Q+QE+L++G IR S SP+ +    V KK  A      R+ IDYR+LN++T+ +RYP+P ++++  +L     F+ IDL  G+H++ + +  + KTA
Subjt:  ELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGA-----MRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTA

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS
        F ++ GHYE++ M FGL NAP  F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K+ +FLGH+V+  G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS

Query:  VDPV-IEAVTSWPRPSTVSEVRSFLGLAG--SSFVWSKA----------------------CEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLG
         +P+ ++A+ S+P P+   E+R+FLGL G    F+ + A                        ++F+ LK  ++  P+L +PD    FV+ +DAS  +LG
Subjt:  VDPV-IEAVTSWPRPSTVSEVRSFLGLAG--SSFVWSKA----------------------CEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLG

Query:  CVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFFN
         VL Q G  +++ SR L  HE NY     EL  +V+A K +RHYL G +  I +DH+ L++  N
Subjt:  CVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSLKYFFN

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-5835.75Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTAFRSR
        +E+   +Q+LLD  FI PS SP  + V+ V KKDG  RLC+DYR LNK T+ + +PLPRI++L  ++  A +F+ +DL SGYH++ ++  D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTAFRSR

Query:  YGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS-VDP
         G YE+ VM FGL NAP+ F   M   FR+    F+ V++DDILI+S++  EH +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +  
Subjt:  YGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS-VDP

Query:  VIEAVTSWPRPSTVSEVRSFLGLAG---------------------SSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLGCVLMQQG
           A+  +P P TV + + FLG+                           W++  + + + LK  L  +PVL   +   ++ + +DASK  +G VL +  
Subjt:  VIEAVTSWPRPSTVSEVRSFLGLAG---------------------SSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLGCVLMQQG

Query:  K------VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSL
               VV Y S+ L+S ++NYP   LEL  ++ AL  +R+ L+G+   + TDH SL
Subjt:  K------VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-5835.75Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTAFRSR
        +E+   +Q+LLD  FI PS SP  + V+ V KKDG  RLC+DYR LNK T+ + +PLPRI++L  ++  A +F+ +DL SGYH++ ++  D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTAFRSR

Query:  YGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS-VDP
         G YE+ VM FGL NAP+ F   M   FR+    F+ V++DDILI+S++  EH +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +  
Subjt:  YGHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVS-VDP

Query:  VIEAVTSWPRPSTVSEVRSFLGLAG---------------------SSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLGCVLMQQG
           A+  +P P TV + + FLG+                           W++  + +   LK  L  +PVL   +   ++ + +DASK  +G VL +  
Subjt:  VIEAVTSWPRPSTVSEVRSFLGLAG---------------------SSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLGCVLMQQG

Query:  K------VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSL
               VV Y S+ L+S ++NYP   LEL  ++ AL  +R+ L+G+   + TDH SL
Subjt:  K------VVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIFTDHKSL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCAGCAGTATTGAAAGAACTGAAAGTGCAGTTACAGGAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCTAGTTTTATTTGTTAA
GAAGAAGGATGGAGCGATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCAACGACCTGTTTGACCAGTTAC
AGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATGAGCTAAGGATCAAGGATGGTGATGTACTGAAGACGGCCTTTCGTTCCAGATACGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTCTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCTAAATGTGAGT
TTTGGTTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCCAAGGCTGGAGTTTCTGTGGATCCAGTTATAGAGGCAGTCACCAGTTGGCCCCGACCCTCCACAGTTAGT
GAGGTTCGTAGCTTTCTGGGTTTAGCAGGCAGCTCCTTCGTTTGGAGCAAGGCATGTGAAGATAGTTTCCAGAGCCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTAC
TGTACCTGATGGTTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAGAGTTTGGGTTGTGTATTGATGCAACAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGT
TGAAGAGTCATGAGCAGAATTACCCTACACACAATTTAGAGTTGGCAGTAGTGGTTTTTGCACTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAGATCTTC
ACGGATCATAAGAGTTTGAAATACTTCTTTAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCAGCAGTATTGAAAGAACTGAAAGTGCAGTTACAGGAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCTAGTTTTATTTGTTAA
GAAGAAGGATGGAGCGATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCAACGACCTGTTTGACCAGTTAC
AGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATGAGCTAAGGATCAAGGATGGTGATGTACTGAAGACGGCCTTTCGTTCCAGATACGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTCTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCTAAATGTGAGT
TTTGGTTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCCAAGGCTGGAGTTTCTGTGGATCCAGTTATAGAGGCAGTCACCAGTTGGCCCCGACCCTCCACAGTTAGT
GAGGTTCGTAGCTTTCTGGGTTTAGCAGGCAGCTCCTTCGTTTGGAGCAAGGCATGTGAAGATAGTTTCCAGAGCCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTAC
TGTACCTGATGGTTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAGAGTTTGGGTTGTGTATTGATGCAACAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGT
TGAAGAGTCATGAGCAGAATTACCCTACACACAATTTAGAGTTGGCAGTAGTGGTTTTTGCACTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAGATCTTC
ACGGATCATAAGAGTTTGAAATACTTCTTTAATTAG
Protein sequenceShow/hide protein sequence
MAPAVLKELKVQLQELLDKGFIRPSVSPWGALVLFVKKKDGAMRLCIDYRELNKVTVKNRYPLPRINDLFDQLQGATVFSKIDLRSGYHELRIKDGDVLKTAFRSRYGHY
EFIVMSFGLTNAPTVFMDLMNRVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPVIEAVTSWPRPSTVS
EVRSFLGLAGSSFVWSKACEDSFQSLKQKLVTAPVLTVPDGSGSFVIYSDASKKSLGCVLMQQGKVVAYASRQLKSHEQNYPTHNLELAVVVFALKIWRHYLYGEKIQIF
TDHKSLKYFFN