CuGenDBv2

Cmc03g0059921 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0059921
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:155093..156100
RNA-Seq Expression: Cmc03g0059921
Synteny: Cmc03g0059921
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
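
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch (not part of CuGenDB), the snippet below parses that string and reports the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'seqid:start..end' string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' into a Location, validating the layout."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr03:155093..156100")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: CMiso1.1chr03 155093 156100 1008
```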


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051933.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 2.6e-127; % identity: 77.63)
Query:  MCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVV
        MC+DSR IN+I VKYRFP PRIN+LLDQLG A IFSKIDL+SGY QIRIRPGDEWKT FKTNEGLF+WLVMPFGLSN PSTFMRLMNQVLH FLNKFV+V
Subjt:  MCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVV

Query:  YFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIA
        YFDDIL FSR L+EH+ HL Q+F+ LA            FCVEEIAFLGFII+KNHILMDEKKVEAI+NWPIP S+KEVQAF+GLASFY+KFI+NF TIA
Subjt:  YFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIA

Query:  ASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        A I DCLKKG FLWG K+QDSFE LK+KLSN P+L+LP F+QPFEV VDA GTG G+ LSQ+GHPIE+FSEKL  SRQ WSTYEQEMYALVRALK
Subjt:  ASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] (e value: 5.2e-136; % identity: 72.27)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSP EY IL  AIEELL KGHI+PS S C VPALLTPKKDG+WRMC+DSR INKITVKYRFP PR+++LLDQLG A IFSKIDL+S Y QIRIRPGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH
        T FKTNEGLFEWLVMPF LSN PSTFMRLMN+VLH FLNKF++VYFDDILVFS+  ++H  H+ Q+F+VL             FC  EIAFLGFII+K+H
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        +LMDEKKVEAI+NW  P ++ +VQAFLGLASFY+KFI N S+IAA ITDCLKKG F WG KQQDSF  LK+ L N  VLKLP F Q FEV VD  GTG G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ  HPIEYFSE+L++SRQ WSTYEQE+YALVRALK
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua] (e value: 9.9e-119; % identity: 61.36)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSPKE  IL++ +EELL KGHIQ S+SPCAVPALLTPKKDGSWRMC+DSR INKITV+YRFP PR+++LLDQL  A +FSKIDL+SGY QIRI+PGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH
        T FKT +GL+EWLVMPFGLSN PSTFMRLM QVL  F+ KFVVVYFDDILV+S+  +EH  HL +V K L             F   ++ FLG+I+  + 
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        I +DE KV+A+R+WP PK++ EV++F GLA+FY++F+ NFS+I A IT+C+KKG F W ++ ++SF+ +K++L+  PVL LP F   FE+  DA GTG G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ G P+ + SEKLN++RQ WSTYEQE+YA+V+A+K
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

TYK06567.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.7e-166; % identity: 91.46)
Query:  GHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGL
        GH  PS S   +PALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRI+PGDEWKT FKTNEGLFEWLVMPFGL
Subjt:  GHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGL

Query:  SNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS
        SN PSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCL+EHN HLYQVF+VLA            FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS
Subjt:  SNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS

Query:  IKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQ
        IKEVQAFLGLASFY+KFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDAS TGTGAVLSQSGHPIEYFSEKLNQ
Subjt:  IKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQ

Query:  SRQMWSTYEQEMYALVRALKTMGALPTI
        SRQM STYEQEMYALVRALKTMGALPTI
Subjt:  SRQMWSTYEQEMYALVRALKTMGALPTI

VVA31129.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis] (e value: 4.1e-117; % identity: 61.65)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSPKE  IL++ IEELL KG I+ SLSPCAVP LL PKKD +WRMC+DSR INKITVKYRFP PR+ ++LD L  + +FSKIDL+SGY QIRIRPGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH
        T FK+ +GLFEWLVMPFGLSN PSTFMRLMNQVL  F+  FVVVYFDDIL++S   EEH  HL QV  VL             FC  ++ FLGF++ +N 
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        I +D++K++AI +WP PK++ EV++F GLA+FY++F+ +FS+IAA IT+CLKKG F WG++Q+ SF  +K+KL   PVL LP F + FEV  DASG G G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ   P+ +FSEKL+ +RQ WSTY+QE YA+VRALK
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

TrEMBL top hits (e value, % identity, alignment)
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein (e value: 4.8e-119; % identity: 61.36)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSPKE  IL++ +EELL KGHIQ S+SPCAVPALLTPKKDGSWRMC+DSR INKITV+YRFP PR+++LLDQL  A +FSKIDL+SGY QIRI+PGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH
        T FKT +GL+EWLVMPFGLSN PSTFMRLM QVL  F+ KFVVVYFDDILV+S+  +EH  HL +V K L             F   ++ FLG+I+  + 
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        I +DE KV+A+R+WP PK++ EV++F GLA+FY++F+ NFS+I A IT+C+KKG F W ++ ++SF+ +K++L+  PVL LP F   FE+  DA GTG G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ G P+ + SEKLN++RQ WSTYEQE+YA+V+A+K
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

A0A5B7BER3 Uncharacterized protein (e value: 1.4e-118; % identity: 63.13)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSPKE  ILQQ +E+L+ KG IQ S+SPCAVPALLTPKKDGSWRMC+DSR INKITVKYRFP PR+N++LD L  + IFSKIDL+SGY QIRIRPGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPH------------LYQVFKVLAFCVEEIAFLGFIIKKNH
        T FKT EGL+EWLVMPFGLSN PSTFMR+MNQVL  F+ KFVVVYFDDIL++S+   EH  H            LY   K   F    + FLGFII    
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPH------------LYQVFKVLAFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        I +DE+KV AIR+WP PK++ ++++F GLA+FY++FI NFS+I A ITDC+KKG F W   Q+ SF  +K+KLS  PVL LP F + F+V  DAS TG G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ G P+E+FSEKLN++RQ W+TYE E++A+VRALK
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

A0A5D3C402 DNA/RNA polymerases superfamily protein (e value: 8.0e-167; % identity: 91.46)
Query:  GHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGL
        GH  PS S   +PALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRI+PGDEWKT FKTNEGLFEWLVMPFGL
Subjt:  GHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGL

Query:  SNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS
        SN PSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCL+EHN HLYQVF+VLA            FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS
Subjt:  SNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKS

Query:  IKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQ
        IKEVQAFLGLASFY+KFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDAS TGTGAVLSQSGHPIEYFSEKLNQ
Subjt:  IKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQ

Query:  SRQMWSTYEQEMYALVRALKTMGALPTI
        SRQM STYEQEMYALVRALKTMGALPTI
Subjt:  SRQMWSTYEQEMYALVRALKTMGALPTI

A0A5D3CPI6 Putative gag-pol polyprotein (e value: 1.3e-127; % identity: 77.63)
Query:  MCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVV
        MC+DSR IN+I VKYRFP PRIN+LLDQLG A IFSKIDL+SGY QIRIRPGDEWKT FKTNEGLF+WLVMPFGLSN PSTFMRLMNQVLH FLNKFV+V
Subjt:  MCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVV

Query:  YFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIA
        YFDDIL FSR L+EH+ HL Q+F+ LA            FCVEEIAFLGFII+KNHILMDEKKVEAI+NWPIP S+KEVQAF+GLASFY+KFI+NF TIA
Subjt:  YFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIA

Query:  ASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        A I DCLKKG FLWG K+QDSFE LK+KLSN P+L+LP F+QPFEV VDA GTG G+ LSQ+GHPIE+FSEKL  SRQ WSTYEQEMYALVRALK
Subjt:  ASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

A0A5D3DGR0 Reverse transcriptase (e value: 2.5e-136; % identity: 72.27)
Query:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK
        MSP EY IL  AIEELL KGHI+PS S C VPALLTPKKDG+WRMC+DSR INKITVKYRFP PR+++LLDQLG A IFSKIDL+S Y QIRIRPGDEWK
Subjt:  MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWK

Query:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH
        T FKTNEGLFEWLVMPF LSN PSTFMRLMN+VLH FLNKF++VYFDDILVFS+  ++H  H+ Q+F+VL             FC  EIAFLGFII+K+H
Subjt:  TVFKTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNH

Query:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG
        +LMDEKKVEAI+NW  P ++ +VQAFLGLASFY+KFI N S+IAA ITDCLKKG F WG KQQDSF  LK+ L N  VLKLP F Q FEV VD  GTG G
Subjt:  ILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTG

Query:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK
        AVLSQ  HPIEYFSE+L++SRQ WSTYEQE+YALVRALK
Subjt:  AVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 1.2e-66; % identity: 41.89)
Query:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGS-----WRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVF
        ++  I+++L +G I+ S SP   P  + PKK  +     +R+ ID R +N+ITV  R P P ++E+L +LG    F+ IDL  G+ QI + P    KT F
Subjt:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGS-----WRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVF

Query:  KTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILM
         T  G +E+L MPFGL N P+TF R MN +L   LNK  +VY DDI+VFS  L+EH   L  VF+ LA            F  +E  FLG ++  + I  
Subjt:  KTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILM

Query:  DEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFL--WGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGA
        + +K+EAI+ +PIP   KE++AFLGL  +Y+KFI NF+ IA  +T CLKK   +     +   +F+ LK  +S  P+LK+P FT+ F +  DAS    GA
Subjt:  DEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFL--WGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGA

Query:  VLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALKT
        VLSQ GHP+ Y S  LN+    +ST E+E+ A+V A KT
Subjt:  VLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALKT

P10401 Retrovirus-related Pol polyprotein from transposon gypsy (e value: 1.2e-55; % identity: 37.97)
Query:  IEELLIKGHIQPSLSPCAVPALLTPKK------DGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTN
        +++LL  G I+PS SP   P  +  KK      + + R+ ID R +N+ T+  R+P P I  +L  LG+A  F+ +DLKSGY QI +   D  KT F  N
Subjt:  IEELLIKGHIQPSLSPCAVPALLTPKK------DGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTN

Query:  EGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNHILMDEK
         G +E+  +PFGL N  S F R ++ VL   + K   VY DD+++FS    +H  H+  V K L             F  E + +LGFI+ K+    D +
Subjt:  EGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVL------------AFCVEEIAFLGFIIKKNHILMDEK

Query:  KVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCL------------KKGGFLWGKKQQDSFEALKKKLSNKPV-LKLPKFTQPFEVVVD
        KV+AI+ +P P  + +V++FLGLAS+Y+ FI +F+ IA  ITD L            KK    + + Q+++F+ L+  L+++ V LK P F +PF++  D
Subjt:  KVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCL------------KKGGFLWGKKQQDSFEALKKKLSNKPV-LKLPKFTQPFEVVVD

Query:  ASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRAL
        AS +G GAVLSQ G PI   S  L Q  Q ++T E+E+ A+V AL
Subjt:  ASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRAL

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.8e-62; % identity: 40.41)
Query:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKD-----GSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVF
        ++  ++E+L +G I+ S SP   P  + PKK        +R+ ID R +N+IT+  R+P P ++E+L +LG+   F+ IDL  G+ QI +      KT F
Subjt:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKD-----GSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVF

Query:  KTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILM
         T  G +E+L MPFGL N P+TF R MN +L   LNK  +VY DDI++FS  L EH   +  VF  LA            F  +E  FLG I+  + I  
Subjt:  KTNEGLFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLA------------FCVEEIAFLGFIIKKNHILM

Query:  DEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQ--DSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGA
        +  KV+AI ++PIP   KE++AFLGL  +Y+KFI N++ IA  +T CLKK   +  +K +  ++FE LK  +   P+L+LP F + F +  DAS    GA
Subjt:  DEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQ--DSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGA

Query:  VLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALKT
        VLSQ+GHPI + S  LN     +S  E+E+ A+V A KT
Subjt:  VLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALKT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.8e-59; % identity: 40.24)
Query:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEG
        + + +++LL    I PS SPC+ P +L PKKDG++R+C+D RT+NK T+   FP PRI+ LL ++G A IF+ +DL SGY QI + P D +KT F T  G
Subjt:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEG

Query:  LFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVF------------KVLAFCVEEIAFLGFIIKKNHILMDEKKV
         +E+ VMPFGL N PSTF R M         +FV VY DDIL+FS   EEH  HL  V             K   F  EE  FLG+ I    I   + K 
Subjt:  LFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVF------------KVLAFCVEEIAFLGFIIKKNHILMDEKKV

Query:  EAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASIT--DCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQS
         AIR++P PK++K+ Q FLG+ ++Y++FI N S IA  I    C K     W +KQ  + E LK  L N PVL        + +  DAS  G GAVL + 
Subjt:  EAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASIT--DCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQS

Query:  GHP------IEYFSEKLNQSRQMWSTYEQEMYALVRAL
         +       + YFS+ L  +++ +   E E+  +++AL
Subjt:  GHP------IEYFSEKLNQSRQMWSTYEQEMYALVRAL

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 4.1e-59; % identity: 39.94)
Query:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEG
        + + +++LL    I PS SPC+ P +L PKKDG++R+C+D RT+NK T+   FP PRI+ LL ++G A IF+ +DL SGY QI + P D +KT F T  G
Subjt:  LQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEG

Query:  LFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVF------------KVLAFCVEEIAFLGFIIKKNHILMDEKKV
         +E+ VMPFGL N PSTF R M         +FV VY DDIL+FS   EEH  HL  V             K   F  EE  FLG+ I    I   + K 
Subjt:  LFEWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVF------------KVLAFCVEEIAFLGFIIKKNHILMDEKKV

Query:  EAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASIT--DCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQS
         AIR++P PK++K+ Q FLG+ ++Y++FI N S IA  I    C K     W +KQ  + + LK  L N PVL        + +  DAS  G GAVL + 
Subjt:  EAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASIT--DCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQS

Query:  GHP------IEYFSEKLNQSRQMWSTYEQEMYALVRAL
         +       + YFS+ L  +++ +   E E+  +++AL
Subjt:  GHP------IEYFSEKLNQSRQMWSTYEQEMYALVRAL

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 6.4e-15; % identity: 36.89)
Query:  EEHNPHLYQVFKVLAFCVEEIAFLG--FIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSF
        E+H    Y   K  AF   +IA+LG   II    +  D  K+EA+  WP PK+  E++ FLGL  +Y++F+ N+  I   +T+ LKK    W +    +F
Subjt:  EEHNPHLYQVFKVLAFCVEEIAFLG--FIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASFYKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSF

Query:  EALKKKLSNKPVLKLPKFTQPF
        +ALK  ++  PVL LP    PF
Subjt:  EALKKKLSNKPVLKLPKFTQPF
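
The % identity values reported for the hits above relate to the middle line of each alignment block. The sketch below assumes that line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `alignment_stats` is hypothetical, and the example covers only the first nine columns of the first GenBank alignment, so it does not reproduce the reported whole-hit percentage.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one alignment block.

    Assumes BLAST-style midline symbols: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces in the midline are often lost in copies; pad to query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First nine alignment columns of the KAA0051933.1 hit, copied from the record.
identities, positives, length = alignment_stats("MCIDSRTIN", "MC+DSR IN")
print(identities, positives, length)
# prints: 7 8 9
print(f"{100 * identities / length:.2f}% identity over this fragment")
```

To score a whole hit, the counts from each block would be summed before dividing, since the alignment length includes gap columns.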


Sequences
CDS sequence
ATGAGCCCAAAAGAATACGCTATTTTACAGCAGGCAATTGAAGAATTGCTGATAAAAGGACATATCCAACCAAGCTTAAGTCCATGTGCAGTCCCTGCCTTACTAACACC
AAAGAAAGATGGAAGTTGGAGGATGTGCATTGATAGCAGGACAATCAACAAAATCACAGTAAAATACAGGTTCCCTACTCCAAGAATAAATGAACTACTTGACCAACTTG
GAGAAGCAGGGATTTTTTCTAAAATTGACCTAAAGAGTGGTTATGACCAGATTAGAATCAGGCCCGGAGATGAATGGAAGACAGTATTCAAAACAAATGAAGGCCTCTTT
GAATGGCTTGTGATGCCATTTGGGCTCTCCAACACTCCTAGCACTTTCATGAGACTCATGAACCAAGTACTTCATCATTTCCTTAATAAATTTGTCGTTGTTTATTTTGA
TGATATCTTGGTTTTTAGCAGATGCCTAGAAGAGCATAATCCACATCTATACCAAGTGTTCAAAGTGCTGGCCTTTTGTGTAGAAGAAATAGCATTCTTGGGATTTATTA
TCAAGAAGAATCACATCCTAATGGATGAAAAGAAAGTCGAGGCAATTAGAAACTGGCCAATACCGAAATCAATAAAAGAAGTACAAGCATTCCTAGGCTTGGCATCATTC
TACAAGAAGTTCATTTATAACTTCAGTACCATTGCCGCCTCAATTACAGACTGCCTAAAGAAGGGAGGCTTCCTATGGGGAAAGAAACAACAAGATAGCTTTGAAGCATT
GAAGAAAAAGCTTAGTAATAAACCAGTCCTAAAACTCCCCAAGTTTACACAGCCATTTGAAGTAGTTGTTGATGCCTCCGGGACTGGCACTGGAGCTGTCTTATCTCAAT
CAGGTCATCCCATTGAGTACTTCAGTGAAAAACTCAACCAATCAAGACAGATGTGGAGCACTTATGAGCAAGAAATGTATGCTCTCGTCCGAGCACTAAAAACAATGGGA
GCACTGCCTACTATCTAA
mRNA sequence
ATGAGCCCAAAAGAATACGCTATTTTACAGCAGGCAATTGAAGAATTGCTGATAAAAGGACATATCCAACCAAGCTTAAGTCCATGTGCAGTCCCTGCCTTACTAACACC
AAAGAAAGATGGAAGTTGGAGGATGTGCATTGATAGCAGGACAATCAACAAAATCACAGTAAAATACAGGTTCCCTACTCCAAGAATAAATGAACTACTTGACCAACTTG
GAGAAGCAGGGATTTTTTCTAAAATTGACCTAAAGAGTGGTTATGACCAGATTAGAATCAGGCCCGGAGATGAATGGAAGACAGTATTCAAAACAAATGAAGGCCTCTTT
GAATGGCTTGTGATGCCATTTGGGCTCTCCAACACTCCTAGCACTTTCATGAGACTCATGAACCAAGTACTTCATCATTTCCTTAATAAATTTGTCGTTGTTTATTTTGA
TGATATCTTGGTTTTTAGCAGATGCCTAGAAGAGCATAATCCACATCTATACCAAGTGTTCAAAGTGCTGGCCTTTTGTGTAGAAGAAATAGCATTCTTGGGATTTATTA
TCAAGAAGAATCACATCCTAATGGATGAAAAGAAAGTCGAGGCAATTAGAAACTGGCCAATACCGAAATCAATAAAAGAAGTACAAGCATTCCTAGGCTTGGCATCATTC
TACAAGAAGTTCATTTATAACTTCAGTACCATTGCCGCCTCAATTACAGACTGCCTAAAGAAGGGAGGCTTCCTATGGGGAAAGAAACAACAAGATAGCTTTGAAGCATT
GAAGAAAAAGCTTAGTAATAAACCAGTCCTAAAACTCCCCAAGTTTACACAGCCATTTGAAGTAGTTGTTGATGCCTCCGGGACTGGCACTGGAGCTGTCTTATCTCAAT
CAGGTCATCCCATTGAGTACTTCAGTGAAAAACTCAACCAATCAAGACAGATGTGGAGCACTTATGAGCAAGAAATGTATGCTCTCGTCCGAGCACTAAAAACAATGGGA
GCACTGCCTACTATCTAA
Protein sequence
MSPKEYAILQQAIEELLIKGHIQPSLSPCAVPALLTPKKDGSWRMCIDSRTINKITVKYRFPTPRINELLDQLGEAGIFSKIDLKSGYDQIRIRPGDEWKTVFKTNEGLF
EWLVMPFGLSNTPSTFMRLMNQVLHHFLNKFVVVYFDDILVFSRCLEEHNPHLYQVFKVLAFCVEEIAFLGFIIKKNHILMDEKKVEAIRNWPIPKSIKEVQAFLGLASF
YKKFIYNFSTIAASITDCLKKGGFLWGKKQQDSFEALKKKLSNKPVLKLPKFTQPFEVVVDASGTGTGAVLSQSGHPIEYFSEKLNQSRQMWSTYEQEMYALVRALKTMG
ALPTI
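
As a sanity check, the CDS sequence above can be translated and compared with the listed protein sequence. The sketch below is illustrative and not part of CuGenDB. The function name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1) applies to this gene. Only the first ten codons and the last six codons are copied from the record here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the CDS: should match the start of the protein sequence.
print(translate("ATGAGCCCAAAAGAATACGCTATTTTACAG"))  # prints: MSPKEYAILQ
# End of the CDS, including the TAA stop codon.
print(translate("GCACTGCCTACTATCTAA"))  # prints: ALPTI
```

Passing the full CDS (with its line breaks) to `translate` would give the complete protein for a residue-by-residue comparison.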