CuGenDBv2

Cmc01g0020821 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0020821
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr01:19195725..19196759
RNA-Seq Expression: Cmc01g0020821
Synteny: Cmc01g0020821
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
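
The genome location in the record above has the form seqid:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the string can be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


# Location string copied from the record above.
seqid, start, end = parse_location("CMiso1.1chr01:19195725..19196759")

# Under the inclusive-coordinate assumption the span covers end - start + 1 bases.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the span is 1035 bp, somewhat longer than the CDS listed in the Sequences section of this record.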


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032431.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa] | e value: 3.1e-170 | % identity: 88.66
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW +LAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEPGTVPISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHLH+                       VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK
        GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LK
Subjt:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK

KAA0032794.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 3.5e-174 | % identity: 95.64
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        MRA KLLSQGTWSIL+S+VDTREVDV LSSEPVVRDY DVFPEELPGLPPHREIEF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL MVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

KAA0036553.1 pol protein [Cucumis melo var. makuwa] | e value: 2.4e-170 | % identity: 93.77
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEP TVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP IDDLFDQLQ ATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL +VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSE RSFLGLAGYYRRFVENFS IATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

KAA0040699.1 pol protein [Cucumis melo var. makuwa] | e value: 8.7e-173 | % identity: 95.02
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGL PHRE+EF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL MVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 3.1e-170 | % identity: 88.95
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL +                       VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK
        GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LK
Subjt:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SSA3 Reverse transcriptase | e value: 1.5e-170 | % identity: 88.66
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW +LAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEPGTVPISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHLH+                       VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK
        GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LK
Subjt:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK

A0A5A7T0Y9 Reverse transcriptase | e value: 1.1e-170 | % identity: 93.77
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEP TVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP IDDLFDQLQ ATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL +VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSE RSFLGLAGYYRRFVENFS IATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

A0A5A7THF3 Reverse transcriptase | e value: 4.2e-173 | % identity: 95.02
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGL PHRE+EF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL MVSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

A0A5A7TV57 Ty3-gypsy retrotransposon protein | e value: 1.5e-170 | % identity: 88.95
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDV LSSEPVVRDYPDVFPEELPGLPPHRE+EF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAPAVFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL +                       VSFLGHVVSKAGVSVDPAKIEAVT W RPSTVSEVRSFL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFL

Query:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK
        GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LK
Subjt:  GLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK

A0A5D3E456 Reverse transcriptase | e value: 1.7e-174 | % identity: 95.64
Query:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS
        MRA KLLSQGTWSIL+S+VDTREVDV LSSEPVVRDY DVFPEELPGLPPHREIEF IELEPGTVPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVS

Query:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM
        PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKI+LRSGYHQLRIKDGDVPKT FRSRYGHYEFIVMSFGLTNAP VFM
Subjt:  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFM

Query:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL
        DLMNRVFREF+DTFVIVFIDDILIYSKTEAEHEEHL MVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQL
Subjt:  DLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQL

Query:  TRKGAPFVWSKACEDSFQSLK
        TRKGAPFVWSKACEDSFQ+LK
Subjt:  TRKGAPFVWSKACEDSFQSLK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 3.7e-49 | % identity: 36.56
Query:  YRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRI
        Y    A  +E++ Q+Q +L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ I+L  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRI

Query:  KDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMV-----------------------SFLG
            V KT F +++GHYE++ M FGL NAPA F   MN + R  ++   +V++DDI+++S +  EH + L +V                       +FLG
Subjt:  KDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMV-----------------------SFLG

Query:  HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQSLK
        HV++  G+  +P KIEA+  +P P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK
Subjt:  HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQSLK

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value: 2.8e-49 | % identity: 35.71
Query:  PISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSG
        PI    Y +A     E++ Q+Q++L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ I+L  G
Subjt:  PISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSG

Query:  YHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMV---------------------
        +HQ+ + +  + KT F ++ GHYE++ M FGL NAPA F   MN + R  ++   +V++DDI+I+S +  EH   + +V                     
Subjt:  YHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHEEHLHMV---------------------

Query:  --SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK
          +FLGH+V+  G+  +P K++A+ S+P P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K
Subjt:  --SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 7.4e-50 | % identity: 37.98
Query:  YPDVFPEELPGLPP---HREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  ++  IE++PG       PY +     +E+   +QKLLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ ++L SGYHQ+ ++  D  KT F +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHE

Query:  EHLHMV-----------------------SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  V                        FLG+ +    ++    K  A+  +P P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLHMV-----------------------SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | e value: 5.3e-48 | % identity: 32.88
Query:  DYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTV
        ++P +F   L G+     ++  I       PI    Y        E++ Q+ +LL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN VT+
Subjt:  DYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEA
         + YP+P I+     L  A  F+ ++L SG+HQ+ +K+ D+PKT F +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI+++S+   
Subjt:  KNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEA

Query:  EHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR
         H ++L +                       V FLG++V+  G+  DP K+ A++  P P++V E++ FLG+  YYR+F+++++++A PLT LTR
Subjt:  EHEEHLHM-----------------------VSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 7.4e-50 | % identity: 37.98
Query:  YPDVFPEELPGLPP---HREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  ++  IE++PG       PY +     +E+   +QKLLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ ++L SGYHQ+ ++  D  KT F +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFIDDILIYSKTEAEHE

Query:  EHLHMV-----------------------SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  V                        FLG+ +    ++    K  A+  +P P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLHMV-----------------------SFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 3.4e-18 | % identity: 47.06
Query:  VSFLG--HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK
        +++LG  H++S  GVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F++LK
Subjt:  VSFLG--HVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK


Sequences
CDS sequence
ATGAGGGCCAGCAAACTGCTTAGTCAGGGTACTTGGAGTATCTTAGCGAGCTTGGTGGATACTAGAGAGGTTGATGTACCTCTGTCATCAGAACCAGTGGTAAGGGACTA
TCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTACCATAGAGCTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAA
TGGCCCCAGCAGAATTAAAAGAACTGAAAGTGCAATTACAGAAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAG
AAGAAGGATGGATCGATGCGCCTATGCATTGACTATAGGGAGTTAAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGACCTGTTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATTAATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACGACCTTTCGTTCCAGATACGGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAATAGAGTGTTTAGGGAGTTCGTAGACACTTTTGTGATCGTGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACATATGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCCAGCTAA
GATAGAGGCAGTCACCAGTTGGCCCCGACCCTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTAGAGAACTTTTCCCGTATAG
CTACTCCTCTTACTCAGTTGACTAGGAAGGGAGCTCCTTTCGTTTGGAGCAAGGCATGTGAAGACAGTTTCCAGAGCCTTAAATAG
mRNA sequence
ATGAGGGCCAGCAAACTGCTTAGTCAGGGTACTTGGAGTATCTTAGCGAGCTTGGTGGATACTAGAGAGGTTGATGTACCTCTGTCATCAGAACCAGTGGTAAGGGACTA
TCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTACCATAGAGCTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAA
TGGCCCCAGCAGAATTAAAAGAACTGAAAGTGCAATTACAGAAGTTGCTTGATAAGGGCTTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAG
AAGAAGGATGGATCGATGCGCCTATGCATTGACTATAGGGAGTTAAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGACCTGTTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATTAATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACGACCTTTCGTTCCAGATACGGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAATAGAGTGTTTAGGGAGTTCGTAGACACTTTTGTGATCGTGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACATATGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCCAGCTAA
GATAGAGGCAGTCACCAGTTGGCCCCGACCCTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTAGAGAACTTTTCCCGTATAG
CTACTCCTCTTACTCAGTTGACTAGGAAGGGAGCTCCTTTCGTTTGGAGCAAGGCATGTGAAGACAGTTTCCAGAGCCTTAAATAG
Protein sequence
MRASKLLSQGTWSILASLVDTREVDVPLSSEPVVRDYPDVFPEELPGLPPHREIEFTIELEPGTVPISRAPYRMAPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVK
KKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKINLRSGYHQLRIKDGDVPKTTFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFVDTFVIVFID
DILIYSKTEAEHEEHLHMVSFLGHVVSKAGVSVDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQSLK