; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0223041 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0223041
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:12250551..12251408
RNA-Seq ExpressionCmc08g0223041
SyntenyCmc08g0223041
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040542.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.4e-14289.05Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTN PAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

KAA0056047.1 reverse transcriptase [Cucumis melo var. makuwa]1.9e-14289.75Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        EIDFAIELEPGTAPISRAPYRMA  ELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDL+SGYHQLRIRDSDIPK AFRSRY HYEFIVM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLYVKF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWLKKV+FL HVVSSEG+S+DP KIEA T+WPRP TVSEI+SFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

KAA0060745.1 pol protein [Cucumis melo var. makuwa]1.9e-14289.4Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

KAA0063098.1 pol protein [Cucumis melo var. makuwa]2.2e-14390.11Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

KAA0063793.1 pol protein [Cucumis melo var. makuwa]1.9e-14289.4Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

TrEMBL top hitse value%identityAlignment
A0A5A7UN68 Reverse transcriptase9.0e-14389.75Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        EIDFAIELEPGTAPISRAPYRMA  ELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDL+SGYHQLRIRDSDIPK AFRSRY HYEFIVM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLYVKF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWLKKV+FL HVVSSEG+S+DP KIEA T+WPRP TVSEI+SFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

A0A5A7V4E4 Reverse transcriptase9.0e-14389.4Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

A0A5A7V646 Reverse transcriptase1.1e-14390.11Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

A0A5A7V6R2 Reverse transcriptase9.0e-14389.4Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDP KIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

A0A5D3BZN1 Reverse transcriptase2.6e-14289.4Show/hide
Query:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF
        E+DFAIELEPGTAPISRAPYRMA AELKELKVQLQELLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVT+KNRYPLP IDD FDQLQG T+F
Subjt:  EIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIF

Query:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF
        SKIDLRSGYHQLRIRD DIPK AFRSRY HYEF+VM F LTNAPAVFMDLMNR+FKDFLD+FVIVFIDDILIYSKTEAEHEEHLH VLETLRANKLY KF
Subjt:  SKIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKF

Query:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        SKCEFWL+KV+FL HVVSSEG+SVDPTKIEA T+WPRP TVSEIRSFLGLAG YRRFVEDFSRIASPLTQLTRKGT FVW+P+
Subjt:  SKCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.6e-5038.46Show/hide
Query:  YRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++T+ +R+P+P +D+   +L     F+ IDL  G+HQ+ +
Subjt:  YRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDLRSGYHQLRI

Query:  RDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEFWLKKVSFLA
            + K AF +++ HYE++ M F L NAPA F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L ++  KCEF  ++ +FL 
Subjt:  RDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEFWLKKVSFLA

Query:  HVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRK
        HV++ +GI  +P KIEA   +P P    EI++FLGL G YR+F+ +F+ IA P+T+  +K
Subjt:  HVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRK

P0CT41 Transposon Tf2-12 polyprotein3.6e-4833.69Show/hide
Query:  IDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFS
        ++F +EL      +    Y +   +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP+I+    ++QG+TIF+
Subjt:  IDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFS

Query:  KIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFS
        K+DL+S YH +R+R  D  K AFR     +E++VM + ++ APA F   +N +  +  ++ V+ ++DDILI+SK+E+EH +H+  VL+ L+   L +  +
Subjt:  KIDLRSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFS

Query:  KCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS
        KCEF   +V F+ + +S +G +     I+    W +P    E+R FLG     R+F+   S++  PL  L +K   + W P+
Subjt:  KCEFWLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPS

P20825 Retrovirus-related Pol polyprotein from transposon 2977.8e-5138.66Show/hide
Query:  APISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDLRS
        +PI    Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++TI +RYP+P +D+   +L     F+ IDL  
Subjt:  APISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDLRS

Query:  GYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEFWL
        G+HQ+ + +  I K AF ++  HYE++ M F L NAPA F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L ++  KCEF  
Subjt:  GYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEFWL

Query:  KKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGT
        K+ +FL H+V+ +GI  +P K++A  S+P P    EIR+FLGL G YR+F+ +++ IA P+T   +K T
Subjt:  KKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.5e-4941.06Show/hide
Query:  IELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDL
        IE++PG       PY +     +E+   +Q+LLD  FI  S SP  +PV+ V KKDG+ RLC+DYR LNK TI + +PLP ID+   ++    IF+ +DL
Subjt:  IELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDL

Query:  RSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEF
         SGYHQ+ +   D  K AF +    YE+ VM F L NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L VK  KC+F
Subjt:  RSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEF

Query:  WLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPL
          ++  FL + +  + I+    K  A   +P P TV + + FLG+   YRRF+ + S+IA P+
Subjt:  WLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPL

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.5e-4941.06Show/hide
Query:  IELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDL
        IE++PG       PY +     +E+   +Q+LLD  FI  S SP  +PV+ V KKDG+ RLC+DYR LNK TI + +PLP ID+   ++    IF+ +DL
Subjt:  IELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDL

Query:  RSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEF
         SGYHQ+ +   D  K AF +    YE+ VM F L NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L VK  KC+F
Subjt:  RSGYHQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEF

Query:  WLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPL
          ++  FL + +  + I+    K  A   +P P TV + + FLG+   YRRF+ + S+IA P+
Subjt:  WLKKVSFLAHVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.1e-1942.27Show/hide
Query:  HLHHVLETLRANKLYVKFSKCEFWLKKVSFLA--HVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTL
        HL  VL+    ++ Y    KC F   ++++L   H++S EG+S DP K+EA   WP P   +E+R FLGL G YRRFV+++ +I  PLT+L +K +L
Subjt:  HLHHVLETLRANKLYVKFSKCEFWLKKVSFLA--HVVSSEGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATAGACTTCGCAATTGAGTTAGAGCCAGGCACTGCTCCTATCTCGAGGGCCCCTTACAGAATGGCCCTAGCTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGA
GTTACTAGACAAGGGTTTTATTCGACTCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGC
TGAACAAGGTGACAATTAAGAATCGCTATCCCTTGCCCATGATTGATGATTTTTTCGATCAGTTGCAAGGAACCACCATCTTTTCTAAGATTGACCTACGATCAGGCTAC
CACCAACTAAGGATTAGAGATAGTGATATTCCTAAGGCCGCTTTCCGTTCAAGATACAGACATTACGAGTTCATTGTGATGTTTTTTAGGTTGACTAATGCTCCTGCGGT
ATTCATGGACTTGATGAACAGGATGTTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGC
ACTTGCACCATGTTTTGGAGACTCTTCGAGCTAATAAGCTGTATGTCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTATCATTTCTTGCACATGTGGTGTCCAGT
GAGGGAATTTCTGTAGACCCAACAAAGATTGAAGCCGATACTAGTTGGCCTCGACCGTTTACAGTCAGTGAGATCCGTAGCTTTCTGGGTCTAGCAGGTTGTTATAGGAG
GTTCGTGGAAGACTTTTCTCGTATTGCTAGTCCCTTGACTCAGTTGACCAGGAAGGGGACTCTATTTGTTTGGAATCCAAGCTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATAGACTTCGCAATTGAGTTAGAGCCAGGCACTGCTCCTATCTCGAGGGCCCCTTACAGAATGGCCCTAGCTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGA
GTTACTAGACAAGGGTTTTATTCGACTCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAAGGATGGGTCGATGCGCCTTTGCATTGACTACAGAGAGC
TGAACAAGGTGACAATTAAGAATCGCTATCCCTTGCCCATGATTGATGATTTTTTCGATCAGTTGCAAGGAACCACCATCTTTTCTAAGATTGACCTACGATCAGGCTAC
CACCAACTAAGGATTAGAGATAGTGATATTCCTAAGGCCGCTTTCCGTTCAAGATACAGACATTACGAGTTCATTGTGATGTTTTTTAGGTTGACTAATGCTCCTGCGGT
ATTCATGGACTTGATGAACAGGATGTTTAAGGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATTTTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGC
ACTTGCACCATGTTTTGGAGACTCTTCGAGCTAATAAGCTGTATGTCAAGTTCTCCAAGTGTGAGTTCTGGCTGAAGAAGGTATCATTTCTTGCACATGTGGTGTCCAGT
GAGGGAATTTCTGTAGACCCAACAAAGATTGAAGCCGATACTAGTTGGCCTCGACCGTTTACAGTCAGTGAGATCCGTAGCTTTCTGGGTCTAGCAGGTTGTTATAGGAG
GTTCGTGGAAGACTTTTCTCGTATTGCTAGTCCCTTGACTCAGTTGACCAGGAAGGGGACTCTATTTGTTTGGAATCCAAGCTTGTGA
Protein sequenceShow/hide protein sequence
MEIDFAIELEPGTAPISRAPYRMALAELKELKVQLQELLDKGFIRLSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTIKNRYPLPMIDDFFDQLQGTTIFSKIDLRSGY
HQLRIRDSDIPKAAFRSRYRHYEFIVMFFRLTNAPAVFMDLMNRMFKDFLDTFVIVFIDDILIYSKTEAEHEEHLHHVLETLRANKLYVKFSKCEFWLKKVSFLAHVVSS
EGISVDPTKIEADTSWPRPFTVSEIRSFLGLAGCYRRFVEDFSRIASPLTQLTRKGTLFVWNPSL