CuGenDBv2

Cmc04g0100721 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0100721
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr04:16847854..16848933
RNA-Seq Expression: Cmc04g0100721
Synteny: Cmc04g0100721
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
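The genome location above uses a `seqid:start..end` layout. As a rough sketch, it can be split into its parts and turned into a span length as shown here. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something the record states.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CMiso1.1chr04:16847854..16848933")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, span)  # CMiso1.1chr04 1080
```

Under that assumption the gene spans 1080 bp, which appears to match the length of the CDS listed under Sequences.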


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0040542.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | e-value: 9.4e-176 | % identity: 91.23
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTN P VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLT+PDGSR+FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIW
        GCVLMQQGKVV YASRQLK HEQNY THDLEL  +VFALKIW
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIW

KAA0058812.1 pol protein [Cucumis melo var. makuwa] | e-value: 2.7e-175 | % identity: 87.71
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T F SRYGHYEF+VMSF LTNAP VFMDLMNRVFKDF+D+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK HEQNY THDLEL T+VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

KAA0063098.1 pol protein [Cucumis melo var. makuwa] | e-value: 3.6e-175 | % identity: 87.71
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MAL ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGS +FVI SDASKK L
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK+HEQNY THDLEL  +VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

KAA0063793.1 pol protein [Cucumis melo var. makuwa] | e-value: 4.7e-175 | % identity: 87.99
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK HEQNY THDLEL  +VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

TYK05193.1 pol protein [Cucumis melo var. makuwa] | e-value: 2.1e-175 | % identity: 88.27
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW A VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDP KIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLKSHEQNY THDLEL  +VFALKIW      RRY++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TAT0 DNA/RNA polymerases superfamily protein | e-value: 4.6e-176 | % identity: 91.23
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTN P VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLT+PDGSR+FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIW
        GCVLMQQGKVV YASRQLK HEQNY THDLEL  +VFALKIW
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIW

A0A5A7USG7 Reverse transcriptase | e-value: 1.3e-175 | % identity: 87.71
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T F SRYGHYEF+VMSF LTNAP VFMDLMNRVFKDF+D+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK HEQNY THDLEL T+VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

A0A5A7V646 Reverse transcriptase | e-value: 1.7e-175 | % identity: 87.71
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MAL ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGS +FVI SDASKK L
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK+HEQNY THDLEL  +VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

A0A5A7V6R2 Reverse transcriptase | e-value: 2.3e-175 | % identity: 87.99
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW APVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDPAKIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLK HEQNY THDLEL  +VFALKIW      R Y++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

A0A5D3BZN1 Reverse transcriptase | e-value: 1.0e-175 | % identity: 88.27
Query:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK
        MA  ELKELKVQLQELLDKGFIRPSVSPW A VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPK

Query:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG
        T FRSRYGHYEF+VMSF LTNAP VFMDLMNRVFKDFLD+FVIVFIDDILIYSK EAE+EEHLH VLETLRANKLYAKFSKCEFWL+KVTFL HVVSS+G
Subjt:  TTFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKG

Query:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
        V VDP KIEAVT+WPRPSTVSEIRSFL LAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGS +FVI SDASKKGL
Subjt:  VFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR
        GCVLMQQGKVV YASRQLKSHEQNY THDLEL  +VFALKIW      RRY++   I+
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFSLTIR

SwissProt top hits (e-value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e-value: 1.8e-68 | % identity: 37.85
Query:  KELKVQLQELLDKGFIRPSVSPWRAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    + KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWRAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKT

Query:  TFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGV
         F +++GHYE++ M F L NAP  F   MN + +  L+   +V++DDI+++S    E+ + L  V E L    L  +  KCEF  ++ TFL HV++  G+
Subjt:  TFRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGV

Query:  FVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL
          +P KIEA+  +P P+   EI++FL L GYYR+F+ +F+ IA P+T+  +K       +P  +S+F++LK  +   P+L VPD ++ F + +DAS   L
Subjt:  FVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGL

Query:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFS
        G VL Q G  + Y SR L  HE NY T + EL  +V+A K +    + R +  S
Subjt:  GCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFS

P10401 Retrovirus-related Pol polyprotein from transposon gypsy | e-value: 2.6e-59 | % identity: 36.13
Query:  QLQELLDKGFIRPSVSPWRAPVLFVKKK------DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRS
        ++++LL  G IRPS SP+ +P   V KK      + + RL ID+R+LN+ T+ +RYP+P I  +   L  A  F+ +DL+SGYHQ+ + ++D  KT+F  
Subjt:  QLQELLDKGFIRPSVSPWRAPVLFVKKK------DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRS

Query:  RYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDP
          G YEF  + F L NA  +F   ++ V ++ +     V++DD++I+S+ E+++  H+  VL+ L    +     K  F+ + V +L  +VS  G   DP
Subjt:  RYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDP

Query:  AKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPV-LTVPDGSRSFVIDS
         K++A+  +P P  V ++RSFL LA YYR F++DF+ IA P+T + +           K  P  ++    ++FQ L+  L S  V L  PD  + F + +
Subjt:  AKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPV-LTVPDGSRSFVIDS

Query:  DASKKGLGCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFAL
        DAS  G+G VL Q+G+ +   SR LK  EQNY T++ EL  +V+AL
Subjt:  DASKKGLGCVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFAL

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e-value: 4.8e-66 | % identity: 36.26
Query:  ELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTT
        E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +  I KT 
Subjt:  ELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTT

Query:  FRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVF
        F ++ GHYE++ M F L NAP  F   MN + +  L+   +V++DDI+I+S    E+   +  V   L    L  +  KCEF  K+  FL H+V+  G+ 
Subjt:  FRSRYGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVF

Query:  VDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLG
         +P K++A+ S+P P+   EIR+FL L GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  P+L +PD  + FV+ +DAS   LG
Subjt:  VDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLG

Query:  CVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFS
         VL Q G  + + SR L  HE NY   + EL  +V+A K +    + R+++ +
Subjt:  CVLMQQGKVVPYASRQLKSHEQNYRTHDLELTTMVFALKIWGTTCMVRRYIFS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e-value: 5.2e-60 | % identity: 38.35
Query:  KELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KT F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRSR

Query:  YGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDPA
         G YE+ VM F L NAP  F   M   F+D    FV V++DDILI+S+   E+ +HL  VLE L+   L  K  KC+F  ++  FL + +  + +     
Subjt:  YGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDPA

Query:  KIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLGCVLMQ
        K  A+  +P P TV + + FL +  YYRRF+ + S IA P+       +   W+   + + ++LK  L ++PVL   +   ++ + +DASK G+G VL +
Subjt:  KIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLGCVLMQ

Query:  QGK------VVPYASRQLKSHEQNYRTHDLELTTMVFAL
                 VV Y S+ L+S ++NY   +LEL  ++ AL
Subjt:  QGK------VVPYASRQLKSHEQNYRTHDLELTTMVFAL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e-value: 6.8e-60 | % identity: 38.35
Query:  KELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KT F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRSR

Query:  YGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDPA
         G YE+ VM F L NAP  F   M   F+D    FV V++DDILI+S+   E+ +HL  VLE L+   L  K  KC+F  ++  FL + +  + +     
Subjt:  YGHYEFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDPA

Query:  KIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLGCVLMQ
        K  A+  +P P TV + + FL +  YYRRF+ + S IA P+       +   W+   + +  +LK  L ++PVL   +   ++ + +DASK G+G VL +
Subjt:  KIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLGCVLMQ

Query:  QGK------VVPYASRQLKSHEQNYRTHDLELTTMVFAL
                 VV Y S+ L+S ++NY   +LEL  ++ AL
Subjt:  QGK------VVPYASRQLKSHEQNYRTHDLELTTMVFAL

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | e-value: 3.0e-23 | % identity: 39.69
Query:  HLHPVLETLRANKLYAKFSKCEFWLKKVTFLD--HVVSSKGVFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC F   ++ +L   H++S +GV  DPAK+EA+  WP P   +E+R FL L GYYRRFV+++  I  PLT+L +K +   W
Subjt:  HLHPVLETLRANKLYAKFSKCEFWLKKVTFLD--HVVSSKGVFVDPAKIEAVTSWPRPSTVSEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVW

Query:  SPACESSFQELKQKLVSAPVLTVPDGSRSFV
        +     +F+ LK  + + PVL +PD    FV
Subjt:  SPACESSFQELKQKLVSAPVLTVPDGSRSFV
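
In the alignments above, the unlabeled middle row follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below counts those symbols for one fragment of a middle row. The helper name `midline_stats` is hypothetical, and the example string is copied from the start of the second block of the first GenBank hit (query `TTFRSRYGHYEFIVMSF`). It covers only that fragment, so it does not reproduce the page's reported % identity, which is computed over the full alignment.

```python
def midline_stats(midline: str) -> dict:
    """Summarise a BLAST-style alignment middle row.

    Letters count as identities, '+' as conservative substitutions,
    and every column (including spaces) counts toward the length.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": 100.0 * identical / length if length else 0.0,
    }


# Middle row for the query fragment TTFRSRYGHYEFIVMSF (17 columns).
stats = midline_stats("T FRSRYGHYEF+VMSF")
print(stats["identical"], stats["positives"], stats["length"])  # 15 16 17
```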


Sequences
CDS sequence
ATGGCTCTAGTTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGAGTTACTAGACAAGGGTTTTATTCGACCCAGTGTGTCACCTTGGAGAGCACCAGTGTTGTTTGTGAA
GAAAAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCTAGGATTGATGATTTGTTCGATCAGTTGC
AAGGAGCCACCGTCTTTTCTAAGATCGACCTGCGATCAGGCTACCACCAATTGAGGATCAGGGATAATGATATTCCTAAGACCACTTTCCGTTCAAGATACGGACATTAC
GAGTTCATTGTGATGTCTTTTGAGTTGACTAATGCTCCTGTGGTATTCATGGACTTGATGAACAGGGTATTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGA
TGATATTTTGATTTACTCTAAGATTGAGGCTGAGAATGAGGAGCACTTGCACCCGGTTTTGGAGACTCTTCGAGCTAATAAGCTGTACGCCAAGTTCTCCAAGTGTGAGT
TCTGGCTGAAGAAGGTTACTTTTCTCGACCACGTGGTTTCCAGTAAGGGAGTTTTTGTGGACCCAGCAAAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTT
AGCGAGATTCGTAGTTTTCTAAGTTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCTTGTATAGCCAGTCCCTTGACCCAGTTGACCAGGAAGGGGACTCCTTT
TGTTTGGAGCCCAGCTTGTGAGAGTAGCTTCCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTTCTGACAGTACCAGATGGATCTAGAAGTTTCGTGATCGACAGTG
ATGCCTCAAAGAAAGGACTAGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTCCTTATGCCTCCCGTCAGTTGAAGAGTCATGAGCAGAACTACCGTACCCATGACCTA
GAGTTGACAACAATGGTTTTTGCATTGAAGATATGGGGCACTACCTGTATGGTGAGAAGATACATATTTTCACTGACCATAAGAGCCTGA
mRNA sequence
ATGGCTCTAGTTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGAGTTACTAGACAAGGGTTTTATTCGACCCAGTGTGTCACCTTGGAGAGCACCAGTGTTGTTTGTGAA
GAAAAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGCTGAACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCTAGGATTGATGATTTGTTCGATCAGTTGC
AAGGAGCCACCGTCTTTTCTAAGATCGACCTGCGATCAGGCTACCACCAATTGAGGATCAGGGATAATGATATTCCTAAGACCACTTTCCGTTCAAGATACGGACATTAC
GAGTTCATTGTGATGTCTTTTGAGTTGACTAATGCTCCTGTGGTATTCATGGACTTGATGAACAGGGTATTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGA
TGATATTTTGATTTACTCTAAGATTGAGGCTGAGAATGAGGAGCACTTGCACCCGGTTTTGGAGACTCTTCGAGCTAATAAGCTGTACGCCAAGTTCTCCAAGTGTGAGT
TCTGGCTGAAGAAGGTTACTTTTCTCGACCACGTGGTTTCCAGTAAGGGAGTTTTTGTGGACCCAGCAAAGATCGAAGCGGTTACCAGTTGGCCTCGACCGTCTACAGTT
AGCGAGATTCGTAGTTTTCTAAGTTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCTTGTATAGCCAGTCCCTTGACCCAGTTGACCAGGAAGGGGACTCCTTT
TGTTTGGAGCCCAGCTTGTGAGAGTAGCTTCCAGGAGCTCAAGCAGAAGCTTGTGTCTGCACCAGTTCTGACAGTACCAGATGGATCTAGAAGTTTCGTGATCGACAGTG
ATGCCTCAAAGAAAGGACTAGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTCCTTATGCCTCCCGTCAGTTGAAGAGTCATGAGCAGAACTACCGTACCCATGACCTA
GAGTTGACAACAATGGTTTTTGCATTGAAGATATGGGGCACTACCTGTATGGTGAGAAGATACATATTTTCACTGACCATAAGAGCCTGA
Protein sequence
MALVELKELKVQLQELLDKGFIRPSVSPWRAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDNDIPKTTFRSRYGHY
EFIVMSFELTNAPVVFMDLMNRVFKDFLDTFVIVFIDDILIYSKIEAENEEHLHPVLETLRANKLYAKFSKCEFWLKKVTFLDHVVSSKGVFVDPAKIEAVTSWPRPSTV
SEIRSFLSLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSRSFVIDSDASKKGLGCVLMQQGKVVPYASRQLKSHEQNYRTHDL
ELTTMVFALKIWGTTCMVRRYIFSLTIRA
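
The protein sequence can be checked against the CDS by translating codons with the standard genetic code. The sketch below is illustrative only. The names `CODON_TABLE` and `translate` are hypothetical, the standard nuclear code is assumed, and only the first 30 nucleotides and the final 90-nucleotide line of the CDS are translated, not the whole sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 90 nt of the CDS listed above (both in frame).
cds_start = "ATGGCTCTAGTTGAGCTAAAGGAGCTGAAG"
cds_end = (
    "GAGTTGACAACAATGGTTTTTGCATTGAAGATATGGGGCACTACC"
    "TGTATGGTGAGAAGATACATATTTTCACTGACCATAAGAGCCTGA"
)
print(translate(cds_start))  # MALVELKELK
print(translate(cds_end))    # ELTTMVFALKIWGTTCMVRRYIFSLTIRA*
```

Both fragments agree with the corresponding ends of the protein sequence, with the final `TGA` read as the stop codon.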