; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G15460 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G15460
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr4:12852424..12853603
RNA-Seq ExpressionCSPI04G15460
SyntenyCSPI04G15460
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7028195.1 unnamed protein product [Microthlaspi erraticum]9.5e-8956.85Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW
        IE+LLKKG I+ S+S CAVP LL PKK   WRMCVDSRAINKI +KY                         RSGYHQIRI PGDEWKTA K+ +GL+EW
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR
        LVMPFGLSNAPSTFMRLMNQ+L PF   FV+VYFDDIL++S++ E H  HL Q+L+VL +N+LY+NLKKC FC  ++      + +  I  DE+KV AIR
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR

Query:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        +WP P S  EV++F GLT+FYR+F+++FSTI APIT+CLK+G F W ++QD  F ++K+KL   PVL LPDF + F+V+ DAS  GIGA  S
Subjt:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

KAA0051933.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-10073.28Show/hide
Query:  MCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV
        MCVDSRAIN+I+VKY                         RSGY QIRI PGDEWKTA KTNEGLF+WLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV
Subjt:  MCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV

Query:  YFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIA
        YFDDIL FSR+L+ H++HL QL E LAKNELYINLKKCIFCVEEIA     IRKNHIL DEKKVEAI+NWPIPTS KEVQAF+GL SFYRKFI NF TIA
Subjt:  YFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIA

Query:  APITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFSIT
        API DCLK+G+FLW NK+   FE+LK+KLSNNP+L+LPDFSQPFEV VDA  TGIG+F S T
Subjt:  APITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFSIT

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.2e-10455.33Show/hide
Query:  MNKNKKFNSLFMTISGKKLIKECEADILGLV---------------------------------------------------------------------
        + KN+K  SLF+TISGKK ++E E +ILG+V                                                                     
Subjt:  MNKNKKFNSLFMTISGKKLIKECEADILGLV---------------------------------------------------------------------

Query:  -AIEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLF
         AIEELLKKGHI+PS S C VPALLTPKKDG+WRMCVDSRAINKI VKY                         RS YHQIRI PGDEWKTA KTNEGLF
Subjt:  -AIEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLF

Query:  EWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEA
        EWLVMPF LSNAPSTFMRLMN+VLHPFLNKF+IVYFDDILVFS++ + H  H+DQL +VL  NELY+NLKKCIFC  EIA     IRK+H+L DEKKVEA
Subjt:  EWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEA

Query:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        I+NW  PT+  +VQAF+GL SFYRKFIQN S+IAAPITDCLK+GAF W  KQ   F +LK+ L N  VLKLPDF Q FEV VD   TGIGA  S
Subjt:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

TYK06567.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]4.5e-10772.98Show/hide
Query:  GHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGL
        GH  PS S   +PALLTPKKDGSWRMC+DSR INKI VKYR                         SGY QIRI PGDEWKTA KTNEGLFEWLVMPFGL
Subjt:  GHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGL

Query:  SNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTS
        SNAPSTFMRLMNQVLH FLNKFV+VYFDDILVFSR L+ HN HL Q+ EVLAKNELYINLKKC FCVEEIA     I+KNHIL DEKKVEAIRNWPIP S
Subjt:  SNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTS

Query:  TKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
         KEVQAF+GL SFYRKFI NFSTIAA ITDCLK+G FLW  KQ   FE LKKKLSN PVLKLP F+QPFEV VDAS TG GA  S
Subjt:  TKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

XP_040245606.1 uncharacterized protein LOC109732219 isoform X1 [Aegilops tauschii subsp. strangulata]3.3e-8957.88Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW
        +EELL+KGHI+ S+S CAVPALL PKKDGSWRMC DSRA+NKI V+Y                         RSGYHQIRI PGDEWKTA KT EGLFEW
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR
        LVMPFGLSNAPSTFM LMNQVL PFL+ FV+VYFDDIL++S+  + H  H+ ++LEVL +NELY+NLKKC+F   ++      I  + I  D+ KVEAIR
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR

Query:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWEN--KQDFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
         WP P +  EV++F GL +FYR+F++NFSTI APIT+CLK+G F W    +  F  +K+KLS  PVL LPDF++ FE++ DAS  GIGA  S
Subjt:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWEN--KQDFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

TrEMBL top hitse value%identityAlignment
A0A5D3C402 DNA/RNA polymerases superfamily protein2.2e-10772.98Show/hide
Query:  GHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGL
        GH  PS S   +PALLTPKKDGSWRMC+DSR INKI VKYR                         SGY QIRI PGDEWKTA KTNEGLFEWLVMPFGL
Subjt:  GHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGL

Query:  SNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTS
        SNAPSTFMRLMNQVLH FLNKFV+VYFDDILVFSR L+ HN HL Q+ EVLAKNELYINLKKC FCVEEIA     I+KNHIL DEKKVEAIRNWPIP S
Subjt:  SNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTS

Query:  TKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
         KEVQAF+GL SFYRKFI NFSTIAA ITDCLK+G FLW  KQ   FE LKKKLSN PVLKLP F+QPFEV VDAS TG GA  S
Subjt:  TKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

A0A5D3CPI6 Putative gag-pol polyprotein6.8e-10173.28Show/hide
Query:  MCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV
        MCVDSRAIN+I+VKY                         RSGY QIRI PGDEWKTA KTNEGLF+WLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV
Subjt:  MCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIV

Query:  YFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIA
        YFDDIL FSR+L+ H++HL QL E LAKNELYINLKKCIFCVEEIA     IRKNHIL DEKKVEAI+NWPIPTS KEVQAF+GL SFYRKFI NF TIA
Subjt:  YFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIA

Query:  APITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFSIT
        API DCLK+G+FLW NK+   FE+LK+KLSNNP+L+LPDFSQPFEV VDA  TGIG+F S T
Subjt:  APITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFSIT

A0A5D3DGR0 Reverse transcriptase6.0e-10555.33Show/hide
Query:  MNKNKKFNSLFMTISGKKLIKECEADILGLV---------------------------------------------------------------------
        + KN+K  SLF+TISGKK ++E E +ILG+V                                                                     
Subjt:  MNKNKKFNSLFMTISGKKLIKECEADILGLV---------------------------------------------------------------------

Query:  -AIEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLF
         AIEELLKKGHI+PS S C VPALLTPKKDG+WRMCVDSRAINKI VKY                         RS YHQIRI PGDEWKTA KTNEGLF
Subjt:  -AIEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLF

Query:  EWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEA
        EWLVMPF LSNAPSTFMRLMN+VLHPFLNKF+IVYFDDILVFS++ + H  H+DQL +VL  NELY+NLKKCIFC  EIA     IRK+H+L DEKKVEA
Subjt:  EWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIA-----IRKNHILTDEKKVEA

Query:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        I+NW  PT+  +VQAF+GL SFYRKFIQN S+IAAPITDCLK+GAF W  KQ   F +LK+ L N  VLKLPDF Q FEV VD   TGIGA  S
Subjt:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

A0A6D2HLB5 Reverse transcriptase4.6e-8956.85Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW
        IE+LLKKG I+ S+S CAVP LL PKK   WRMCVDSRAINKI +KY                         RSGYHQIRI PGDEWKTA K+ +GL+EW
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR
        LVMPFGLSNAPSTFMRLMNQ+L PF   FV+VYFDDIL++S++ E H  HL Q+L+VL +N+LY+NLKKC FC  ++      + +  I  DE+KV AIR
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR

Query:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        +WP P S  EV++F GLT+FYR+F+++FSTI APIT+CLK+G F W ++QD  F ++K+KL   PVL LPDF + F+V+ DAS  GIGA  S
Subjt:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

A0A6D2IKM3 Reverse transcriptase4.6e-8956.85Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW
        IE+LLKKG I+ S+S CAVP LL PKK   WRMCVDSRAINKI +KY                         RSGYHQIRI PGDEWKTA K+ +GL+EW
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKY-------------------------RSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR
        LVMPFGLSNAPSTFMRLMNQ+L PF   FV+VYFDDIL++S++ E H  HL Q+L+VL +N+LY+NLKKC FC  ++      + +  I  DE+KV AIR
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKKVEAIR

Query:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        +WP P S  EV++F GLT+FYR+F+++FSTI APIT+CLK+G F W ++QD  F ++K+KL   PVL LPDF + F+V+ DAS  GIGA  S
Subjt:  NWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.8e-5139.33Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGS-----WRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNE
        I+++L +G I+ S S    P  + PKK  +     +R+ +D R +N+I V  R                          G+HQI + P    KTA  T  
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGS-----WRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNE

Query:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIAIRKNHILTDE------K
        G +E+L MPFGL NAP+TF R MN +L P LNK  +VY DDI+VFS SL+ H   L  + E LAK  L + L KC F  +E      H+LT +      +
Subjt:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIAIRKNHILTDE------K

Query:  KVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQDFEVLKKKL----SNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        K+EAI+ +PIPT  KE++AF+GLT +YRKFI NF+ IA P+T CLK+   +     +++   KKL    S +P+LK+PDF++ F +  DAS+  +GA  S
Subjt:  KVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQDFEVLKKKL----SNNPVLKLPDFSQPFEVKVDASETGIGAFFS

P20825 Retrovirus-related Pol polyprotein from transposon 2974.6e-4637.67Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKD-----GSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNE
        ++E+L +G I+ S S    P  + PKK        +R+ +D R +N+I +  R                          G+HQI +      KTA  T  
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKD-----GSWRMCVDSRAINKIIVKYR-------------------------SGYHQIRIMPGDEWKTAVKTNE

Query:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIAIRKNHILTDEK------
        G +E+L MPFGL NAP+TF R MN +L P LNK  +VY DDI++FS SL  H   +  +   LA   L + L KC F  +E A    HI+T +       
Subjt:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIAIRKNHILTDEK------

Query:  KVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENK----QDFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS
        KV+AI ++PIPT  KE++AF+GLT +YRKFI N++ IA P+T CLK+   +   K    + FE LK  +  +P+L+LPDF + F +  DAS   +GA  S
Subjt:  KVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENK----QDFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.1e-4740.55Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVK-------------------------YRSGYHQIRIMPGDEWKTAVKTNEGLFEW
        +++LL    I PS S C+ P +L PKKDG++R+CVD R +NK  +                            SGYHQI + P D +KTA  T  G +E+
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVK-------------------------YRSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEE-------IAIRKNHILTDEKKVEA
         VMPFGL NAPSTF R M         +FV VY DDIL+FS S E H  HLD +LE L    L +  KKC F  EE       I I+K  I   + K  A
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEE-------IAIRKNHILTDEKKVEA

Query:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGA
        IR++P P + K+ Q F+G+ ++YR+FI N S IA PI   + + +  W  KQD   E LK  L N+PVL   +    + +  DAS+ GIGA
Subjt:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGA

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.5e-4035.16Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKK-----DGSWRMCVDSRAINKIIV-------------------KY------RSGYHQIRIMPGDEWKTAVKTNE
        I+ELL+ G I+PS S    P  + PKK     +  +RM VD + +N + +                   KY       SG+HQI +   D  KTA  T  
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKK-----DGSWRMCVDSRAINKIIV-------------------KY------RSGYHQIRIMPGDEWKTAVKTNE

Query:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKK
        G +E+L +PFGL NAP+ F R+++ +L   + K   VY DDI+VFS   + H  +L  +L  L+K  L +NL+K  F   ++      +  + I  D KK
Subjt:  GLFEWLVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEI-----AIRKNHILTDEKK

Query:  VEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAF--------------LWENK-QDFEVLKKKLSNNPVLKLPDFSQPFEVKVDA
        V AI   P PTS KE++ F+G+TS+YRKFIQ+++ +A P+T+ L  G +              L E   Q F  LK  L ++ +L  P F++PF +  DA
Subjt:  VEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAF--------------LWENK-QDFEVLKKKLSNNPVLKLPDFSQPFEVKVDA

Query:  SETGIGAFFS
        S   IGA  S
Subjt:  SETGIGAFFS

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-4640.21Show/hide
Query:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVK-------------------------YRSGYHQIRIMPGDEWKTAVKTNEGLFEW
        +++LL    I PS S C+ P +L PKKDG++R+CVD R +NK  +                            SGYHQI + P D +KTA  T  G +E+
Subjt:  IEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVK-------------------------YRSGYHQIRIMPGDEWKTAVKTNEGLFEW

Query:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEE-------IAIRKNHILTDEKKVEA
         VMPFGL NAPSTF R M         +FV VY DDIL+FS S E H  HLD +LE L    L +  KKC F  EE       I I+K  I   + K  A
Subjt:  LVMPFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEE-------IAIRKNHILTDEKKVEA

Query:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGA
        IR++P P + K+ Q F+G+ ++YR+FI N S IA PI   + + +  W  KQD   + LK  L N+PVL   +    + +  DAS+ GIGA
Subjt:  IRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWENKQD--FEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGA

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.7e-1936.09Show/hide
Query:  HLDQLLEVLAKNELYINLKKCIFCVEEIA-IRKNHILT------DEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWE
        HL  +L++  +++ Y N KKC F   +IA +   HI++      D  K+EA+  WP P +T E++ F+GLT +YR+F++N+  I  P+T+ LK+ +  W 
Subjt:  HLDQLLEVLAKNELYINLKKCIFCVEEIA-IRKNHILT------DEKKVEAIRNWPIPTSTKEVQAFIGLTSFYRKFIQNFSTIAAPITDCLKEGAFLWE

Query:  NKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKV
              F+ LK  ++  PVL LPD   PF  +V
Subjt:  NKQ--DFEVLKKKLSNNPVLKLPDFSQPFEVKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGAATAAGAAATTCAACAGCCTCTTTATGACTATCAGTGGAAAGAAATTGATTAAAGAATGTGAAGCTGATATTTTGGGATTAGTTGCAATTGAAGAGTTGCT
GAAAAAAGGACATATTCAACCAAGCTTAAGCCAATGTGCAGTCCCTGCGCTGCTAACACCAAAGAAAGATGGAAGTTGGAGGATGTGCGTTGACAGCAGAGCAATCAATA
AAATTATAGTGAAGTACAGGAGTGGTTATCACCAGATCAGAATCATGCCTGGAGATGAGTGGAAAACGGCTGTCAAAACAAATGAAGGCCTCTTTGAATGGCTTGTGATG
CCATTTGGACTCTCCAACGCTCCAAGTACCTTTATGAGACTCATGAACCAAGTACTTCACCCTTTCCTTAATAAATTTGTCATTGTTTATTTTGATGATATCTTGGTTTT
CAGCAGATCTTTAGAAGGACATAATGTGCATCTAGATCAATTGTTGGAAGTGCTGGCTAAAAATGAACTATACATCAACCTCAAGAAATGCATCTTTTGTGTGGAAGAAA
TAGCCATCAGGAAGAATCATATACTAACGGATGAAAAGAAAGTTGAAGCCATTAGAAATTGGCCAATACCGACTTCAACAAAGGAAGTTCAAGCATTCATTGGCTTGACA
TCATTCTACAGAAAGTTTATCCAAAACTTCAGCACCATTGCTGCACCAATTACTGATTGTTTGAAAGAAGGAGCTTTCCTATGGGAAAATAAACAAGACTTTGAAGTATT
GAAGAAAAAGTTGAGTAATAATCCAGTCTTGAAACTCCCCGATTTTTCACAGCCATTTGAAGTTAAAGTAGATGCTTCCGAGACCGGCATTGGAGCTTTTTTCTCAATCA
CACCATCCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGAATAAGAAATTCAACAGCCTCTTTATGACTATCAGTGGAAAGAAATTGATTAAAGAATGTGAAGCTGATATTTTGGGATTAGTTGCAATTGAAGAGTTGCT
GAAAAAAGGACATATTCAACCAAGCTTAAGCCAATGTGCAGTCCCTGCGCTGCTAACACCAAAGAAAGATGGAAGTTGGAGGATGTGCGTTGACAGCAGAGCAATCAATA
AAATTATAGTGAAGTACAGGAGTGGTTATCACCAGATCAGAATCATGCCTGGAGATGAGTGGAAAACGGCTGTCAAAACAAATGAAGGCCTCTTTGAATGGCTTGTGATG
CCATTTGGACTCTCCAACGCTCCAAGTACCTTTATGAGACTCATGAACCAAGTACTTCACCCTTTCCTTAATAAATTTGTCATTGTTTATTTTGATGATATCTTGGTTTT
CAGCAGATCTTTAGAAGGACATAATGTGCATCTAGATCAATTGTTGGAAGTGCTGGCTAAAAATGAACTATACATCAACCTCAAGAAATGCATCTTTTGTGTGGAAGAAA
TAGCCATCAGGAAGAATCATATACTAACGGATGAAAAGAAAGTTGAAGCCATTAGAAATTGGCCAATACCGACTTCAACAAAGGAAGTTCAAGCATTCATTGGCTTGACA
TCATTCTACAGAAAGTTTATCCAAAACTTCAGCACCATTGCTGCACCAATTACTGATTGTTTGAAAGAAGGAGCTTTCCTATGGGAAAATAAACAAGACTTTGAAGTATT
GAAGAAAAAGTTGAGTAATAATCCAGTCTTGAAACTCCCCGATTTTTCACAGCCATTTGAAGTTAAAGTAGATGCTTCCGAGACCGGCATTGGAGCTTTTTTCTCAATCA
CACCATCCAATTGA
Protein sequenceShow/hide protein sequence
MNKNKKFNSLFMTISGKKLIKECEADILGLVAIEELLKKGHIQPSLSQCAVPALLTPKKDGSWRMCVDSRAINKIIVKYRSGYHQIRIMPGDEWKTAVKTNEGLFEWLVM
PFGLSNAPSTFMRLMNQVLHPFLNKFVIVYFDDILVFSRSLEGHNVHLDQLLEVLAKNELYINLKKCIFCVEEIAIRKNHILTDEKKVEAIRNWPIPTSTKEVQAFIGLT
SFYRKFIQNFSTIAAPITDCLKEGAFLWENKQDFEVLKKKLSNNPVLKLPDFSQPFEVKVDASETGIGAFFSITPSN