; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0165941 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0165941
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:18013112..18014134
RNA-Seq ExpressionCmc06g0165941
SyntenyCmc06g0165941
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040871.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.5e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTREVDVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPY+M  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.5e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTREVDVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

KAA0048687.1 pol protein [Cucumis melo var. makuwa]3.5e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTRE DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

KAA0062719.1 pol protein [Cucumis melo var. makuwa]4.6e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTR+ DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+TSW RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

TYK01613.1 pol protein [Cucumis melo var. makuwa]3.5e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTRE DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

TrEMBL top hitse value%identityAlignment
A0A5A7TGS7 Reverse transcriptase1.7e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTREVDVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPY+M  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

A0A5A7TV57 Ty3-gypsy retrotransposon protein1.7e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTREVDVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

A0A5A7U330 Reverse transcriptase1.7e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTRE DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

A0A5A7VAL8 Pol protein2.2e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTR+ DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+TSW RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

A0A5D3BPI1 Reverse transcriptase1.7e-16187.65Show/hide
Query:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS
        +RASKLLS+GTW ILASVVDTRE DVSLSSEPVVRDYPDVF EELPGLPP+RE+EFAIELEPGTVPISRAPYRM  AELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGATVFSKIDLRS YHQLRIKD DVPK AF SRYGHYEFIVM FGLTNA AVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFM

Query:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL
        DLMN VFREFLDTFVI+FIDDILIYSKTEAEHEEHLRMVL+TLR NK              VSFLGHVVSKA VSVDPAKIEA+T W RPSTVSEVRSFL
Subjt:  DLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKTLRANK--------------VSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF
        GLAGYYRRFVENFSRIATPL QLTRKGAPFVWSK  EDSF
Subjt:  GLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.6e-4632.76Show/hide
Query:  EELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDL
        E+LP   P + +EF +EL      +    Y +   +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP+I+ L
Subjt:  EELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDL

Query:  FDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKT
          ++QG+T+F+K+DL+S YH +R++  D  K+AF    G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VL+ 
Subjt:  FDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKT

Query:  LR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKT
        L+               ++V F+G+ +S+   +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ T
Subjt:  LR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKT

P0CT41 Transposon Tf2-12 polyprotein8.6e-4632.76Show/hide
Query:  EELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDL
        E+LP   P + +EF +EL      +    Y +   +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP+I+ L
Subjt:  EELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNKYPLPMIDDL

Query:  FDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKT
          ++QG+T+F+K+DL+S YH +R++  D  K+AF    G +E++VM +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VL+ 
Subjt:  FDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRMVLKT

Query:  LR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKT
        L+               ++V F+G+ +S+   +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ T
Subjt:  LR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.2e-4837.98Show/hide
Query:  YPDVFLEELPGLPP---YREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNK
        Y ++   +LP  P       ++  IE++PG       PY +T    +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFLEELPGLPP---YREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNK

Query:  YPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHE
        +PLP ID+L  ++  A +F+ +DL S YHQ+ ++  D  K AF +  G YE+ VM FGL NA + F   M   FR+    FV +++DDILI+S++  EH 
Subjt:  YPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHE

Query:  EHLRMVLKTLR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  VL+ L+              + +  FLG+ +    ++    K  AI  +P P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLRMVLKTLR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.6e-4735.23Show/hide
Query:  DYPDVFLEELPGLPPYREIEFAIELEPGT---VPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNK
        ++P +F   L G+     +E A++ E  T    PI    Y   +    E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN 
Subjt:  DYPDVFLEELPGLPPYREIEFAIELEPGT---VPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNK

Query:  VTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSK
        VT+ + YP+P I+     L  A  F+ +DL S +HQ+ +K+SD+PK AF +  G YEF+ + FGL NA A+F  +++ + RE +     ++IDDI+++S+
Subjt:  VTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSK

Query:  TEAEHEEHLRMVLKTL-RAN-------------KVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTR
            H ++LR+VL +L +AN             +V FLG++V+   +  DP K+ AI+  P P++V E++ FLG+  YYR+F+++++++A PL  LTR
Subjt:  TEAEHEEHLRMVLKTL-RAN-------------KVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.2e-4837.98Show/hide
Query:  YPDVFLEELPGLPP---YREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNK
        Y ++   +LP  P       ++  IE++PG       PY +T    +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFLEELPGLPP---YREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNK

Query:  YPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHE
        +PLP ID+L  ++  A +F+ +DL S YHQ+ ++  D  K AF +  G YE+ VM FGL NA + F   M   FR+    FV +++DDILI+S++  EH 
Subjt:  YPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFIDDILIYSKTEAEHE

Query:  EHLRMVLKTLR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  VL+ L+              + +  FLG+ +    ++    K  AI  +P P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLRMVLKTLR--------------ANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.7e-1548.57Show/hide
Query:  KVSFLG--HVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGA
        ++++LG  H++S   VS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PL +L +K +
Subjt:  KVSFLG--HVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCCAGTAAACTGCTTAGTCGGGGTACTTGGAGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGTTGACGTGTCCTTGTCATCAGAACCAGTGGTGAGGGATTA
TCCGGATGTCTTTCTTGAAGAACTTCCAGGGTTACCTCCTTACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCTGGTACGGTTCCTATATCCAGAGCTCCATATAGAA
TGACCCTAGCAGAATTGAAAGAGCTGAAAGTGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGGCCGAGCGTGTCACCTTGGGGTGCACCAGTCTTATTTGTTAAA
AAGAAGGATGGATCGATGCGCTTATGTATTGACTACAGGGAGTTGAATAAGGTAACTGTTAAGAACAAATATCCCTTGCCCATGATCGACGATCTGTTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCAGAATATCATCAGCTAAGGATTAAGGATAGCGATGTACCAAAGATAGCCTTTTGTTCCAGATATGGACACTACG
AGTTTATTGTGATGTATTTTGGTTTGACGAATGCTCAGGCTGTGTTCATGGATTTGATGAACATAGTGTTTAGAGAGTTCCTAGATACTTTTGTGATAATGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTAAAAACCCTTCGAGCTAATAAAGTATCCTTTCTAGGCCACGTGGTTTCTAA
AGCTGTTGTTTCTGTGGATCCAGCTAAGATAGAGGCAATCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGAC
GGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTATTCAGTTGACCAGGAAGGGAGCTCCATTTGTTTGGAGCAAGACCTATGAGGACAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCCAGTAAACTGCTTAGTCGGGGTACTTGGAGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGTTGACGTGTCCTTGTCATCAGAACCAGTGGTGAGGGATTA
TCCGGATGTCTTTCTTGAAGAACTTCCAGGGTTACCTCCTTACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCTGGTACGGTTCCTATATCCAGAGCTCCATATAGAA
TGACCCTAGCAGAATTGAAAGAGCTGAAAGTGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGGCCGAGCGTGTCACCTTGGGGTGCACCAGTCTTATTTGTTAAA
AAGAAGGATGGATCGATGCGCTTATGTATTGACTACAGGGAGTTGAATAAGGTAACTGTTAAGAACAAATATCCCTTGCCCATGATCGACGATCTGTTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCAGAATATCATCAGCTAAGGATTAAGGATAGCGATGTACCAAAGATAGCCTTTTGTTCCAGATATGGACACTACG
AGTTTATTGTGATGTATTTTGGTTTGACGAATGCTCAGGCTGTGTTCATGGATTTGATGAACATAGTGTTTAGAGAGTTCCTAGATACTTTTGTGATAATGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTAAAAACCCTTCGAGCTAATAAAGTATCCTTTCTAGGCCACGTGGTTTCTAA
AGCTGTTGTTTCTGTGGATCCAGCTAAGATAGAGGCAATCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGAC
GGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTATTCAGTTGACCAGGAAGGGAGCTCCATTTGTTTGGAGCAAGACCTATGAGGACAGTTTTTAG
Protein sequenceShow/hide protein sequence
MRASKLLSRGTWSILASVVDTREVDVSLSSEPVVRDYPDVFLEELPGLPPYREIEFAIELEPGTVPISRAPYRMTLAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVK
KKDGSMRLCIDYRELNKVTVKNKYPLPMIDDLFDQLQGATVFSKIDLRSEYHQLRIKDSDVPKIAFCSRYGHYEFIVMYFGLTNAQAVFMDLMNIVFREFLDTFVIMFID
DILIYSKTEAEHEEHLRMVLKTLRANKVSFLGHVVSKAVVSVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLIQLTRKGAPFVWSKTYEDSF