; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0110331 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0110331
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:29618896..29619750
RNA-Seq ExpressionCmc04g0110331
SyntenyCmc04g0110331
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7021913.1 unnamed protein product [Microthlaspi erraticum]8.1e-10662.68Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKIT+KYRFPIPR+ D+LD+L G+ +FSKIDLRSGY+QIRIRPGDEWK  FK+ +GL+EWLVM FGLSNAPSTFMR+MN++LHPF+  F++V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDIL++SKT D HL H+ ++ QVL  N+LYVNLKKC FC N++ FLGF++ +D + +DE KV  IK+W    +V +V++F GLA+F R+F+++FS+I 
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APIT+CLKKG FQWG +Q  SF L+KE L ++PVL L DF + F+V  DA G GIG V+SQ+   IA+FSEKLS++RQ WSTY+
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

KAA0051933.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.2e-12577.82Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAIN+I VKYRFPIPR+ DLLDQLGGA IFSKIDLRSGY QIRIRPGDEWK  FKTNEGLF+WLVM FGLSNAPSTFMRLMN+VLHPFLNKF+IV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDIL FS+T D+H  H+ QLF+ L  NELY+NLKKCIFC  EIAFLGFIIRK+H+LMDEKKVE IKNW   T+VK+VQAF+GLASFYRKFI NF +IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        API DCLKKG+F WG K+Q SF LLKE LS++P+L+L DFSQ FEVAVDACGTGIG  +SQ GH I +FSEKL  SRQ+WSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]2.9e-14390.49Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKITVKYRFPIPRV DLLDQLGGACIFSKIDLRS Y+QIRIRPGDEWK  FKTNEGLFEWLVM F LSNAPSTFMRLMNKVLHPFLNKFIIV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC NEIAFLGFIIRKDHVLMDEKKVE IKNWST TTV QVQAFLGLASFYRKFIQN SSIA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APITDCLKKGAF+WGPKQQ SFNLLKESL +  VLKL DF QAFEVAVD CGTGIG V+SQQ H I YFSE+LSKSRQSWSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

KAA0057262.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-15998.94Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC NEIAFLGFIIRKDHVLMDEKKVE IKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APITDCLKKGAFQWGPKQQHSFNLLKESLS+SPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

TYK06567.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]8.7e-11672.89Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MC+DSR INKITVKYRFP PR+ +LLDQLG A IFSKIDL+SGY QIRI+PGDEWK  FKTNEGLFEWLVM FGLSNAPSTFMRLMN+VLH FLNKF++V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFS+   +H  H+ Q+F+VL  NELY+NLKKC FC  EIAFLGFII+K+H+LMDEKKVE I+NW    ++K+VQAFLGLASFYRKFI NFS+IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        A ITDCLKKG F WG KQQ SF  LK+ LS+ PVLKL  F+Q FEV VDA  TG G V+SQ GH I YFSEKL++SRQ  STYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

TrEMBL top hitse value%identityAlignment
A0A5D3C402 DNA/RNA polymerases superfamily protein4.2e-11672.89Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MC+DSR INKITVKYRFP PR+ +LLDQLG A IFSKIDL+SGY QIRI+PGDEWK  FKTNEGLFEWLVM FGLSNAPSTFMRLMN+VLH FLNKF++V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFS+   +H  H+ Q+F+VL  NELY+NLKKC FC  EIAFLGFII+K+H+LMDEKKVE I+NW    ++K+VQAFLGLASFYRKFI NFS+IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        A ITDCLKKG F WG KQQ SF  LK+ LS+ PVLKL  F+Q FEV VDA  TG G V+SQ GH I YFSEKL++SRQ  STYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

A0A5D3CPI6 Putative gag-pol polyprotein5.8e-12677.82Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAIN+I VKYRFPIPR+ DLLDQLGGA IFSKIDLRSGY QIRIRPGDEWK  FKTNEGLF+WLVM FGLSNAPSTFMRLMN+VLHPFLNKF+IV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDIL FS+T D+H  H+ QLF+ L  NELY+NLKKCIFC  EIAFLGFIIRK+H+LMDEKKVE IKNW   T+VK+VQAF+GLASFYRKFI NF +IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        API DCLKKG+F WG K+Q SF LLKE LS++P+L+L DFSQ FEVAVDACGTGIG  +SQ GH I +FSEKL  SRQ+WSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

A0A5D3D575 Putative gag-pol polyprotein8.1e-16098.94Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC NEIAFLGFIIRKDHVLMDEKKVE IKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APITDCLKKGAFQWGPKQQHSFNLLKESLS+SPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

A0A5D3DGR0 Reverse transcriptase1.4e-14390.49Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKITVKYRFPIPRV DLLDQLGGACIFSKIDLRS Y+QIRIRPGDEWK  FKTNEGLFEWLVM F LSNAPSTFMRLMNKVLHPFLNKFIIV
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC NEIAFLGFIIRKDHVLMDEKKVE IKNWST TTV QVQAFLGLASFYRKFIQN SSIA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APITDCLKKGAF+WGPKQQ SFNLLKESL +  VLKL DF QAFEVAVD CGTGIG V+SQQ H I YFSE+LSKSRQSWSTYE
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

A0A6D2I9U6 Uncharacterized protein3.9e-10662.68Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        MCVDSRAINKIT+KYRFPIPR+ D+LD+L G+ +FSKIDLRSGY+QIRIRPGDEWK  FK+ +GL+EWLVM FGLSNAPSTFMR+MN++LHPF+  F++V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        YFDDIL++SKT D HL H+ ++ QVL  N+LYVNLKKC FC N++ FLGF++ +D + +DE KV  IK+W    +V +V++F GLA+F R+F+++FS+I 
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        APIT+CLKKG FQWG +Q  SF L+KE L ++PVL L DF + F+V  DA G GIG V+SQ+   IA+FSEKLS++RQ WSTY+
Subjt:  APITDCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.1e-6041.55Show/hide
Query:  VDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIVYF
        +D R +N+ITV  R PIP + ++L +LG    F+ IDL  G++QI + P    K  F T  G +E+L M FGL NAP+TF R MN +L P LNK  +VY 
Subjt:  VDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIVYF

Query:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAP
        DDI+VFS + D+HLQ +  +F+ L    L + L KC F   E  FLG ++  D +  + +K+E I+ +   T  K+++AFLGL  +YRKFI NF+ IA P
Subjt:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAP

Query:  ITDCLKKG--AFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        +T CLKK        P+   +F  LK  +S  P+LK+ DF++ F +  DA    +G V+SQ GH ++Y S  L++   ++ST E
Subjt:  ITDCLKKG--AFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.9e-4936.36Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        + +D R +N+ T+  R+P+P +  +L  LG A  F+ +DL+SGY+QI +   D  K +F  N G +E+  + FGL NA S F R ++ VL   + K   V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        Y DD+++FS+    H++HID + + L    + V+ +K  F    + +LGFI+ KD    D +KV+ I+ +     V +V++FLGLAS+YR FI++F++IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APITDCL------------KKGAFQWGPKQQHSFNLLKESLSSSPV-LKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
         PITD L            KK   ++   Q+++F  L+  L+S  V LK  DF + F++  DA  +GIG V+SQ+G  I   S  L +  Q+++T E
Subjt:  APITDCL------------KKGAFQWGPKQQHSFNLLKESLSSSPV-LKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-5339.44Show/hide
Query:  VDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIVYF
        +D R +N+IT+  R+PIP + ++L +LG    F+ IDL  G++QI +      K  F T  G +E+L M FGL NAP+TF R MN +L P LNK  +VY 
Subjt:  VDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIVYF

Query:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAP
        DDI++FS +  +HL  I  +F  L    L + L KC F   E  FLG I+  D +  +  KV+ I ++   T  K+++AFLGL  +YRKFI N++ IA P
Subjt:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAP

Query:  ITDCLKKGAFQWGPKQQH--SFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE
        +T CLKK       K ++  +F  LK  +   P+L+L DF + F +  DA    +G V+SQ GH I++ S  L+    ++S  E
Subjt:  ITDCLKKGAFQWGPKQQH--SFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.0e-5138.7Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        +CVD R +NK T+   FP+PR+ +LL ++G A IF+ +DL SGY+QI + P D +K  F T  G +E+ VM FGL NAPSTF R M         +F+ V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        Y DDIL+FS++ ++H +H+D + + L +  L V  KKC F   E  FLG+ I    +   + K   I+++ T  TVKQ Q FLG+ ++YR+FI N S IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APIT--DCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHS------IAYFSEKLSKSRQSWSTYE
         PI    C K    QW  KQ  +   LK +L +SPVL   +    + +  DA   GIG V+ +  +       + YFS+ L  +++++   E
Subjt:  APIT--DCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHS------IAYFSEKLSKSRQSWSTYE

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.0e-5238.7Show/hide
Query:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV
        +CVD R +NK T+   FP+PR+ +LL ++G A IF+ +DL SGY+QI + P D +K  F T  G +E+ VM FGL NAPSTF R M         +F+ V
Subjt:  MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIV

Query:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA
        Y DDIL+FS++ ++H +H+D + + L +  L V  KKC F   E  FLG+ I    +   + K   I+++ T  TVKQ Q FLG+ ++YR+FI N S IA
Subjt:  YFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIA

Query:  APIT--DCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHS------IAYFSEKLSKSRQSWSTYE
         PI    C K    QW  KQ  + + LK++L +SPVL   +    + +  DA   GIG V+ +  +       + YFS+ L  +++++   E
Subjt:  APIT--DCLKKGAFQWGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHS------IAYFSEKLSKSRQSWSTYE

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.9e-1832.59Show/hide
Query:  LQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLG--FIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAPITDCLKKGAFQ
        + H+  + Q+   ++ Y N KKC F   +IA+LG   II  + V  D  K+E +  W       +++ FLGL  +YR+F++N+  I  P+T+ LKK + +
Subjt:  LQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLG--FIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAPITDCLKKGAFQ

Query:  WGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAV
        W      +F  LK ++++ PVL L D    F   V
Subjt:  WGPKQQHSFNLLKESLSSSPVLKLLDFSQAFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGTGGATAGTAGAGCTATCAACAAAATCACAGTAAAATATAGATTTCCAATCCCAAGGGTCAGGGATTTGTTAGATCAATTGGGAGGTGCTTGTATCTTTTCGAA
GATTGATCTAAGAAGTGGCTATTATCAAATACGTATTAGACCTGGAGATGAATGGAAAATAACCTTCAAGACCAACGAAGGACTTTTTGAGTGGCTTGTAATGCAATTTG
GCCTTTCTAATGCTCCTAGCACTTTCATGAGGTTGATGAACAAGGTACTACATCCTTTCTTAAACAAGTTCATTATAGTTTACTTTGATGACATTCTTGTCTTTAGCAAA
ACCTATGATCAACACCTCCAACACATTGACCAGCTGTTCCAAGTACTTAATCACAATGAACTTTATGTAAATCTCAAGAAGTGCATTTTCTGCTGTAATGAAATAGCCTT
CTTGGGGTTCATAATCAGAAAAGATCATGTTCTAATGGATGAGAAGAAGGTAGAAACAATTAAAAACTGGTCAACTTCAACTACTGTCAAACAAGTGCAAGCATTCTTGG
GATTGGCTTCATTCTACAGGAAGTTTATCCAAAACTTCAGTTCCATTGCTGCACCTATAACAGATTGTTTAAAGAAAGGAGCATTCCAGTGGGGTCCTAAACAACAACAT
AGTTTCAACCTGCTAAAAGAAAGTTTGAGCAGCAGCCCAGTTCTTAAACTACTAGACTTTTCCCAAGCTTTTGAAGTAGCAGTAGACGCCTGTGGCACTGGTATTGGAGT
TGTCATTTCTCAACAAGGACATTCAATCGCATATTTCAGTGAGAAATTAAGCAAGTCTAGACAGTCATGGAGTACTTATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTGTGGATAGTAGAGCTATCAACAAAATCACAGTAAAATATAGATTTCCAATCCCAAGGGTCAGGGATTTGTTAGATCAATTGGGAGGTGCTTGTATCTTTTCGAA
GATTGATCTAAGAAGTGGCTATTATCAAATACGTATTAGACCTGGAGATGAATGGAAAATAACCTTCAAGACCAACGAAGGACTTTTTGAGTGGCTTGTAATGCAATTTG
GCCTTTCTAATGCTCCTAGCACTTTCATGAGGTTGATGAACAAGGTACTACATCCTTTCTTAAACAAGTTCATTATAGTTTACTTTGATGACATTCTTGTCTTTAGCAAA
ACCTATGATCAACACCTCCAACACATTGACCAGCTGTTCCAAGTACTTAATCACAATGAACTTTATGTAAATCTCAAGAAGTGCATTTTCTGCTGTAATGAAATAGCCTT
CTTGGGGTTCATAATCAGAAAAGATCATGTTCTAATGGATGAGAAGAAGGTAGAAACAATTAAAAACTGGTCAACTTCAACTACTGTCAAACAAGTGCAAGCATTCTTGG
GATTGGCTTCATTCTACAGGAAGTTTATCCAAAACTTCAGTTCCATTGCTGCACCTATAACAGATTGTTTAAAGAAAGGAGCATTCCAGTGGGGTCCTAAACAACAACAT
AGTTTCAACCTGCTAAAAGAAAGTTTGAGCAGCAGCCCAGTTCTTAAACTACTAGACTTTTCCCAAGCTTTTGAAGTAGCAGTAGACGCCTGTGGCACTGGTATTGGAGT
TGTCATTTCTCAACAAGGACATTCAATCGCATATTTCAGTGAGAAATTAAGCAAGTCTAGACAGTCATGGAGTACTTATGAGTAG
Protein sequenceShow/hide protein sequence
MCVDSRAINKITVKYRFPIPRVRDLLDQLGGACIFSKIDLRSGYYQIRIRPGDEWKITFKTNEGLFEWLVMQFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCCNEIAFLGFIIRKDHVLMDEKKVETIKNWSTSTTVKQVQAFLGLASFYRKFIQNFSSIAAPITDCLKKGAFQWGPKQQH
SFNLLKESLSSSPVLKLLDFSQAFEVAVDACGTGIGVVISQQGHSIAYFSEKLSKSRQSWSTYE