; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G01970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G01970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr7:1670078..1671278
RNA-Seq ExpressionCSPI07G01970
SyntenyCSPI07G01970
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]8.0e-15174.86Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP EYKILHD IEELLKKGHIKPS S C VPALLTPKKDG+WRMCV SRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+S YHQIR+RPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKTNEGLFEW+VMPF LSNAPSTFMRLMN+VLHPFLNKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY N KK +F   EIAFLGFII++  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        + M+ KK+EAI  W TP ++ ++QAFLGLASFYRKFI+N SS+AAP+TDCLKKG F+W P QQ+SF  +K+ L +  +LKLPDF   FEV VD C TGIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL QQ HPIEYFSE+LS SRQ+WSTYEQELYALVRALKQWEHYLLS+EF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

KAA0062943.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]7.3e-14471.75Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+EY++LHD+IE LLKKGHIKPSLSPC VPALLTPKKD SWRMCV SRAINRITVKY F IP++ DLLDQLGKA++FSKIDL+S YHQIR+RP DEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        T FK NEGLFEW+ MPFGLSNAPSTF                          S + ++HL HLRKLFQVL E ELY N KK  F+ KEI FLGF+IK+G 
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I MEPKK+EAI +WP P SIKE+QAFLGLASFY++FIRNFSS+  PLTD LKK NFKW  +QQ+SFE+IK++LTSSPIL+LPDF+SPFEVVVDAC  GIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFAKRT
         VL QQGHPIEYFSEKLS SRQ+WSTYEQELYALVRALKQWEHYLL KEF   T
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFAKRT

PKU83407.1 RNA-directed DNA polymerase [Dendrobium catenatum]3.5e-13063.71Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+EY IL + +E+LL K  I+ SLSPCAVPAL+ PKKDG WRMC+ SRAIN+ITVKYRF +PR+SDLLD+L  A+IFSK+DL+SGYHQIRVRPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKT +GL+EW VMPFGL NAPSTFMRLM +VL PFL K  V YFDDILV+STN  +H+ HL  +F+VL + +LY N  K  F    + FLGFI+  G 
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        + ++ +K++ I  WPTP S+ E+++F GLA+FYR+FIR FS L APLTDC+K G+FKWT  QQ+SFE IK+ L ++P+L LP+F  PF V  DA   GIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q+  PIE+FSEKLS +RQ WSTYEQELYA+VRALKQWEHYLL ++F
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]3.6e-13563.71Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+E  IL + +EELL+KGHI+ S+SPCAVPALLTPKKDGSWRMCV SRAIN+ITV+YRF IPR+ DLLDQL  A +FSKIDL+SGYHQIR++PGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKT +GL+EW+VMPFGLSNAPSTFMRLM QVL PF+ KF+VVYFDDILVYS    EHL HLRK+ + LTE EL+ N KK  F+  ++ FLG+I+    
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I ++  K++A+  WP+P ++ E+++F GLA+FYR+F+RNFSS+ AP+T+C+KKG FKWT   +ESF+ IK++LT++P+L LP+F + FE+  DAC TGIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q+G P+ + SEKL+ +RQ WSTYEQELYA+V+A+K+WEHYL+ +EF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

VVA31129.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis]3.2e-13163.14Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+E  IL + IEELL+KG I+ SLSPCAVP LL PKKD +WRMCV SRAIN+ITVKYRF IPR+ D+LD L  + +FSKIDL+SGYHQIR+RPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFK+ +GLFEW+VMPFGLSNAPSTFMRLMNQVL PF+  F+VVYFDDIL+YST  +EHL+HLR++  VL E +LY N KK  F   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I ++ +KI+AI  WP P ++ E+++F GLA+FYR+F+R+FSS+AAP+T+CLKKG F W   Q+ SF DIK+KL ++P+L LP+F   FEV  DA   G+G
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q   P+ +FSEKLS +RQ WSTY+QE YA+VRALKQWEHYL+ KEF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

TrEMBL top hitse value%identityAlignment
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein1.8e-13563.71Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+E  IL + +EELL+KGHI+ S+SPCAVPALLTPKKDGSWRMCV SRAIN+ITV+YRF IPR+ DLLDQL  A +FSKIDL+SGYHQIR++PGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKT +GL+EW+VMPFGLSNAPSTFMRLM QVL PF+ KF+VVYFDDILVYS    EHL HLRK+ + LTE EL+ N KK  F+  ++ FLG+I+    
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I ++  K++A+  WP+P ++ E+++F GLA+FYR+F+RNFSS+ AP+T+C+KKG FKWT   +ESF+ IK++LT++P+L LP+F + FE+  DAC TGIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q+G P+ + SEKL+ +RQ WSTYEQELYA+V+A+K+WEHYL+ +EF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.63.5e-14471.75Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+EY++LHD+IE LLKKGHIKPSLSPC VPALLTPKKD SWRMCV SRAINRITVKY F IP++ DLLDQLGKA++FSKIDL+S YHQIR+RP DEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        T FK NEGLFEW+ MPFGLSNAPSTF                          S + ++HL HLRKLFQVL E ELY N KK  F+ KEI FLGF+IK+G 
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I MEPKK+EAI +WP P SIKE+QAFLGLASFY++FIRNFSS+  PLTD LKK NFKW  +QQ+SFE+IK++LTSSPIL+LPDF+SPFEVVVDAC  GIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFAKRT
         VL QQGHPIEYFSEKLS SRQ+WSTYEQELYALVRALKQWEHYLL KEF   T
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFAKRT

A0A5B7BER3 Uncharacterized protein5.8e-13163.14Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+E +IL   +E+L+ KG I+ S+SPCAVPALLTPKKDGSWRMCV SRAIN+ITVKYRF IPR++D+LD L  + IFSKIDL+SGYHQIR+RPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKT EGL+EW+VMPFGLSNAPSTFMR+MNQVL PF+ KF+VVYFDDIL+YS +  EHL H+R++   L E++LY N KK  F+   + FLGFII    
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I ++ +K+ AI  WPTP ++ +I++F GLA+FYR+FIRNFSS+ AP+TDC+KKG F+W   Q+ SF  IK+KL+++P+L LP F   F+V  DA  TGIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q+G P+E+FSEKL+ +RQ W+TYE EL+A+VRALK WEHYL+ +EF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

A0A5D3DGR0 Reverse transcriptase3.9e-15174.86Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP EYKILHD IEELLKKGHIKPS S C VPALLTPKKDG+WRMCV SRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+S YHQIR+RPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFKTNEGLFEW+VMPF LSNAPSTFMRLMN+VLHPFLNKFI+VYFDDILV+S   D+HL H+ +LFQVL   ELY N KK +F   EIAFLGFII++  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        + M+ KK+EAI  W TP ++ ++QAFLGLASFYRKFI+N SS+AAP+TDCLKKG F+W P QQ+SF  +K+ L +  +LKLPDF   FEV VD C TGIG
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL QQ HPIEYFSE+LS SRQ+WSTYEQELYALVRALKQWEHYLLS+EF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

A0A5E4FUK3 PREDICTED: reverse mRNAase (Fragment)1.5e-13163.14Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        MSP+E  IL + IEELL+KG I+ SLSPCAVP LL PKKD +WRMCV SRAIN+ITVKYRF IPR+ D+LD L  + +FSKIDL+SGYHQIR+RPGDEWK
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
        TAFK+ +GLFEW+VMPFGLSNAPSTFMRLMNQVL PF+  F+VVYFDDIL+YST  +EHL+HLR++  VL E +LY N KK  F   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG
        I ++ +KI+AI  WP P ++ E+++F GLA+FYR+F+R+FSS+AAP+T+CLKKG F W   Q+ SF DIK+KL ++P+L LP+F   FEV  DA   G+G
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIG

Query:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
         VL Q   P+ +FSEKLS +RQ WSTY+QE YA+VRALKQWEHYL+ KEF
Subjt:  VVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.8e-7643.14Show/hide
Query:  PQEY-KILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGS-----WRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPG
        PQ Y + +   I+++L +G I+ S SP   P  + PKK  +     +R+ +  R +N ITV  R  IP + ++L +LG+ + F+ IDL  G+HQI + P 
Subjt:  PQEY-KILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGS-----WRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPG

Query:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFII
           KTAF T  G +E++ MPFGL NAP+TF R MN +L P LNK  +VY DDI+V+ST+ DEHL  L  +F+ L +A L     K  F+K+E  FLG ++
Subjt:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFII

Query:  KQGSISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFK---WTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVD
            I   P+KIEAI  +P P   KEI+AFLGL  +YRKFI NF+ +A P+T CLKK N K     P    +F+ +K  ++  PILK+PDF+  F +  D
Subjt:  KQGSISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFK---WTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVD

Query:  ACCTGIGVVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
        A    +G VL Q GHP+ Y S  L+     +ST E+EL A+V A K + HYLL + F
Subjt:  ACCTGIGVVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

P0CT34 Transposon Tf2-1 polyprotein7.0e-6536.54Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM V  + +N+      + +P I  LL ++  ++IF+K+DLKS YH IRVR GDE K
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
         AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S +  EH+ H++ + Q L  A L  N  K  F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI
         +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI

Query:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS
        G VL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL S
Subjt:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS

P0CT35 Transposon Tf2-2 polyprotein7.0e-6536.54Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM V  + +N+      + +P I  LL ++  ++IF+K+DLKS YH IRVR GDE K
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
         AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S +  EH+ H++ + Q L  A L  N  K  F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI
         +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI

Query:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS
        G VL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL S
Subjt:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS

P0CT41 Transposon Tf2-12 polyprotein7.0e-6536.54Show/hide
Query:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK
        + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM V  + +N+      + +P I  LL ++  ++IF+K+DLKS YH IRVR GDE K
Subjt:  MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWK

Query:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS
         AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S +  EH+ H++ + Q L  A L  N  K  F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGS

Query:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI
         +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +
Subjt:  ISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGI

Query:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS
        G VL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL S
Subjt:  GVVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLS

P20825 Retrovirus-related Pol polyprotein from transposon 2977.0e-7341.55Show/hide
Query:  LHDHIEELLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWKTAF
        + + ++E+L +G I+ S SP   P  + PKK        +R+ +  R +N IT+  R+ IP + ++L +LGK   F+ IDL  G+HQI +      KTAF
Subjt:  LHDHIEELLKKGHIKPSLSPCAVPALLTPKKD-----GSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWKTAF

Query:  KTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGSISM
         T  G +E++ MPFGL NAP+TF R MN +L P LNK  +VY DDI+++ST+  EHL  ++ +F  L +A L     K  F+KKE  FLG I+    I  
Subjt:  KTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGSISM

Query:  EPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQ--ESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIGV
         P K++AI ++P P   KEI+AFLGL  +YRKFI N++ +A P+T CLKK     T   +  E+FE +K  +   PIL+LPDF   F +  DA    +G 
Subjt:  EPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQ--ESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIGV

Query:  VLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF
        VL Q GHPI + S  L+     +S  E+EL A+V A K + HYLL ++F
Subjt:  VLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEF

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.0e-2438.76Show/hide
Query:  HLRKLFQVLTEAELYTNTKKSMFMKKEIAFLG--FIIKQGSISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N KK  F + +IA+LG   II    +S +P K+EA+  WP P +  E++ FLGL  +YR+F++N+  +  PLT+ LKK + KWT
Subjt:  HLRKLFQVLTEAELYTNTKKSMFMKKEIAFLG--FIIKQGSISMEPKKIEAIHTWPTPVSIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWT

Query:  PLQQESFEDIKKKLTSSPILKLPDFSSPF
         +   +F+ +K  +T+ P+L LPD   PF
Subjt:  PLQQESFEDIKKKLTSSPILKLPDFSSPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCCCAGGAGTACAAAATACTTCATGATCATATTGAGGAATTACTAAAGAAAGGGCACATCAAACCTAGCCTCAGCCCGTGTGCAGTACCAGCACTTCTCACGCC
AAAGAAAGATGGAAGCTGGAGAATGTGCGTTGGCAGCAGAGCCATCAACCGTATCACGGTAAAGTATAGATTTCTCATTCCAAGGATTAGTGACCTGCTAGATCAACTCG
GTAAAGCCAGTATCTTTTCGAAAATTGACTTGAAAAGTGGCTATCATCAAATACGGGTAAGACCTGGCGACGAATGGAAGACAGCCTTCAAGACAAACGAAGGATTATTT
GAATGGATGGTCATGCCATTCGGCCTTTCTAACGCACCCAGCACCTTCATGAGATTGATGAACCAGGTACTTCACCCATTTCTCAACAAATTCATAGTAGTTTACTTCGA
TGACATACTTGTTTACAGCACAAACAATGATGAGCATTTACTACACCTAAGGAAGCTGTTCCAAGTCTTAACAGAGGCAGAACTCTACACAAATACTAAGAAAAGCATGT
TTATGAAAAAAGAAATTGCATTCCTTGGCTTTATAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGTCTCCATT
AAAGAAATACAAGCCTTCCTCGGCCTGGCTTCATTTTACAGGAAATTCATCAGAAACTTCAGCTCTTTAGCCGCACCCCTAACTGACTGTCTAAAGAAAGGAAACTTCAA
ATGGACCCCATTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAACTAACATCCAGCCCTATCCTTAAATTACCTGACTTCTCTTCACCTTTTGAAGTAGTAGTTGATG
CATGCTGCACAGGGATTGGAGTTGTCCTAGTTCAACAAGGACATCCTATCGAATACTTCAGTGAAAAACTCAGCACCTCAAGACAGACCTGGAGCACATACGAACAAGAG
CTGTATGCCCTCGTCCGAGCACTAAAACAATGGGAACACTACCTACTCTCTAAAGAATTTGCAAAGAGAACAAGGTGGCCGATGCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCCCCAGGAGTACAAAATACTTCATGATCATATTGAGGAATTACTAAAGAAAGGGCACATCAAACCTAGCCTCAGCCCGTGTGCAGTACCAGCACTTCTCACGCC
AAAGAAAGATGGAAGCTGGAGAATGTGCGTTGGCAGCAGAGCCATCAACCGTATCACGGTAAAGTATAGATTTCTCATTCCAAGGATTAGTGACCTGCTAGATCAACTCG
GTAAAGCCAGTATCTTTTCGAAAATTGACTTGAAAAGTGGCTATCATCAAATACGGGTAAGACCTGGCGACGAATGGAAGACAGCCTTCAAGACAAACGAAGGATTATTT
GAATGGATGGTCATGCCATTCGGCCTTTCTAACGCACCCAGCACCTTCATGAGATTGATGAACCAGGTACTTCACCCATTTCTCAACAAATTCATAGTAGTTTACTTCGA
TGACATACTTGTTTACAGCACAAACAATGATGAGCATTTACTACACCTAAGGAAGCTGTTCCAAGTCTTAACAGAGGCAGAACTCTACACAAATACTAAGAAAAGCATGT
TTATGAAAAAAGAAATTGCATTCCTTGGCTTTATAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGTCTCCATT
AAAGAAATACAAGCCTTCCTCGGCCTGGCTTCATTTTACAGGAAATTCATCAGAAACTTCAGCTCTTTAGCCGCACCCCTAACTGACTGTCTAAAGAAAGGAAACTTCAA
ATGGACCCCATTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAACTAACATCCAGCCCTATCCTTAAATTACCTGACTTCTCTTCACCTTTTGAAGTAGTAGTTGATG
CATGCTGCACAGGGATTGGAGTTGTCCTAGTTCAACAAGGACATCCTATCGAATACTTCAGTGAAAAACTCAGCACCTCAAGACAGACCTGGAGCACATACGAACAAGAG
CTGTATGCCCTCGTCCGAGCACTAAAACAATGGGAACACTACCTACTCTCTAAAGAATTTGCAAAGAGAACAAGGTGGCCGATGCTCTAA
Protein sequenceShow/hide protein sequence
MSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVGSRAINRITVKYRFLIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLF
EWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYTNTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPVSI
KEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVVVDACCTGIGVVLVQQGHPIEYFSEKLSTSRQTWSTYEQE
LYALVRALKQWEHYLLSKEFAKRTRWPML