; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0065681 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0065681
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:7164857..7165660
RNA-Seq ExpressionCmc03g0065681
SyntenyCmc03g0065681
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040542.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.0e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTN P VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

KAA0058812.1 pol protein [Cucumis melo var. makuwa]5.1e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVTIKNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAF SRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDF+DSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

KAA0060745.1 pol protein [Cucumis melo var. makuwa]2.3e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL +IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

KAA0063793.1 pol protein [Cucumis melo var. makuwa]3.9e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACE SF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

TYK18480.1 pol protein [Cucumis melo var. makuwa]7.8e-14395.13Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEFIVMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDP KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

TrEMBL top hitse value%identityAlignment
A0A5A7TAT0 DNA/RNA polymerases superfamily protein1.4e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTN P VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

A0A5A7USG7 Reverse transcriptase2.5e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVTIKNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAF SRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDF+DSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

A0A5A7V4E4 Reverse transcriptase1.1e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL +IDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

A0A5A7V6R2 Reverse transcriptase1.9e-14294.38Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRD DIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEF+VMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACE SF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

A0A5D3D4M7 Pol protein3.8e-14395.13Show/hide
Query:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
        MAPAELKELKVQLQELLDKGFI+PSVSPWGAP+LFVKKKDGSMRLCI+Y ELNKVT+KNRYPL RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPK

Query:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG
        TAFRSRYG+YEFIVMSFGLTNAP VFMDLMNRVFKDFLDSFVIVFIDDILIYSK EAEHEEHLHQVLETLRANKLYAKFSKCEFWL+KVTFLGHV+SSEG
Subjt:  TAFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF
        VSVDP KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFV SPACESSF
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.1e-5337.45Show/hide
Query:  KELKVQLQELLDKGFIQPSVSPWGAPMLFV-KKKDGS----MRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKT
        +E++ Q+Q++L++G I+ S SP+ +P+  V KK+D S     R+ I+Y +LN++T+ +R+P+  +D++  +L     F+ IDL  G+HQ+ +    + KT
Subjt:  KELKVQLQELLDKGFIQPSVSPWGAPMLFV-KKKDGS----MRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKT

Query:  AFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGV
        AF +++G+YE++ M FGL NAP  F   MN + +  L+   +V++DDI+++S    EH + L  V E L    L  +  KCEF  ++ TFLGHV++ +G+
Subjt:  AFRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGV

Query:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VSSPACESSF
          +P KIEA+  +P P+   EI++FLGL GYYR+F+ +F+ IA P+T+  +K      ++P  +S+F
Subjt:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VSSPACESSF

P20825 Retrovirus-related Pol polyprotein from transposon 2973.9e-5237.94Show/hide
Query:  ELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKD-----GSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTA
        E++ Q+QE+L++G I+ S SP+ +P   V KK         R+ I+Y +LN++TI +RYP+  +D++  +L     F+ IDL  G+HQ+ + +  I KTA
Subjt:  ELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKD-----GSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTA

Query:  FRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVS
        F ++ G+YE++ M FGL NAP  F   MN + +  L+   +V++DDI+I+S    EH   +  V   L    L  +  KCEF  K+  FLGH+++ +G+ 
Subjt:  FRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGT
         +P K++A+ ++P P+   EIR+FLGL GYYR+F+ +++ IA P+T   +K T
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.3e-5243.57Show/hide
Query:  KELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +P++ V KKDG+ RLC++Y  LNK TI + +PL RID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF + 
Subjt:  KELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTAFRSR

Query:  YGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVSVDPA
         G YE+ VM FGL NAP+ F   M   F+D    FV V++DDILI+S+   EH +HL  VLE L+   L  K  KC+F  ++  FLG+ I  + ++    
Subjt:  YGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPL
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.1e-5137.6Show/hide
Query:  ELKVQLQELLDKGFIQPSVSPWGAPMLFVKKK-----DGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTA
        E++ Q+ ELL  G I+PS SP+ +P+  V KK     +   R+ +++  LN VTI + YP+  I+     L  A  F+ +DL SG+HQ+ +++SDIPKTA
Subjt:  ELKVQLQELLDKGFIQPSVSPWGAPMLFVKKK-----DGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTA

Query:  FRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVS
        F +  G YEF+ + FGL NAP +F  +++ + ++ +     V+IDDI+++S+    H ++L  VL +L    L     K  F   +V FLG++++++G+ 
Subjt:  FRSRYGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR
         DP K+ A++  P P++V E++ FLG+  YYR+F++D++++A PLT LTR
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.3e-5243.57Show/hide
Query:  KELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +P++ V KKDG+ RLC++Y  LNK TI + +PL RID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF + 
Subjt:  KELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTAFRSR

Query:  YGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVSVDPA
         G YE+ VM FGL NAP+ F   M   F+D    FV V++DDILI+S+   EH +HL  VLE L+   L  K  KC+F  ++  FLG+ I  + ++    
Subjt:  YGYYEFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPL
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.5e-2247.92Show/hide
Query:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLG--HVISSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGT
        HL  VL+    ++ YA   KC F   ++ +LG  H+IS EGVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLKKVTFLG--HVISSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGCTGAGCTAAAAGAGCTGAAGGTCCAGTTGCAGGAGTTGTTGGACAAGGGTTTCATCCAGCCCAGTGTGTCACCTTGGGGGGCCCCAATGTTGTTTGTGAA
GAAGAAGGATGGGTCGATGCGCCTTTGCATTAACTACGGAGAGCTGAACAAGGTGACAATTAAGAACCGCTATCCCTTGGCTAGGATTGATGACTTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTGACATTCCCAAGACGGCCTTTCGTTCGAGATACGGATATTAC
GAGTTCATTGTGATGTCTTTCGGCTTGACTAATGCCCCTACGGTGTTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTCTTCATTGA
TGACATCTTGATTTACTCCAAACCTGAGGCTGAGCATGAGGAGCACTTGCACCAGGTTTTGGAGACTCTTCGAGCCAACAAGCTGTATGCCAAGTTCTCCAAGTGTGAGT
TCTGGTTAAAAAAGGTGACGTTCCTCGGCCACGTGATTTCCAGTGAGGGAGTTTCTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCGACCGTCCACAGTT
AGTGAGATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTT
TGTGTCGAGCCCAGCATGCGAGAGTAGCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAGCTGAGCTAAAAGAGCTGAAGGTCCAGTTGCAGGAGTTGTTGGACAAGGGTTTCATCCAGCCCAGTGTGTCACCTTGGGGGGCCCCAATGTTGTTTGTGAA
GAAGAAGGATGGGTCGATGCGCCTTTGCATTAACTACGGAGAGCTGAACAAGGTGACAATTAAGAACCGCTATCCCTTGGCTAGGATTGATGACTTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTGCGATCAGGCTATCACCAGTTGAGGATCAGGGACAGTGACATTCCCAAGACGGCCTTTCGTTCGAGATACGGATATTAC
GAGTTCATTGTGATGTCTTTCGGCTTGACTAATGCCCCTACGGTGTTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTCTTCATTGA
TGACATCTTGATTTACTCCAAACCTGAGGCTGAGCATGAGGAGCACTTGCACCAGGTTTTGGAGACTCTTCGAGCCAACAAGCTGTATGCCAAGTTCTCCAAGTGTGAGT
TCTGGTTAAAAAAGGTGACGTTCCTCGGCCACGTGATTTCCAGTGAGGGAGTTTCTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCGACCGTCCACAGTT
AGTGAGATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTT
TGTGTCGAGCCCAGCATGCGAGAGTAGCTTCTAG
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIQPSVSPWGAPMLFVKKKDGSMRLCINYGELNKVTIKNRYPLARIDDLFDQLQGATVFSKIDLRSGYHQLRIRDSDIPKTAFRSRYGYY
EFIVMSFGLTNAPTVFMDLMNRVFKDFLDSFVIVFIDDILIYSKPEAEHEEHLHQVLETLRANKLYAKFSKCEFWLKKVTFLGHVISSEGVSVDPAKIEAVTNWPRPSTV
SEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVSSPACESSF