; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0164471 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0164471
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTransposon Ty1-DR1 Gag-Pol polyprotein
Genome locationCMiso1.1chr06:15045192..15046061
RNA-Seq ExpressionCmc06g0164471
SyntenyCmc06g0164471
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum]3.8e-6650.52Show/hide
Query:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        F ++YL K K DA+E FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIHETT PYSP  NG AERKNRTL+EL  A+L+E  A  ++WGE
Subjt:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
         I T  YVLNR+P   SK + +E+ K   P+L YLR W CLA+VR+ DPK  KL  +  T  F+GY  N+  YRF++LE+ ++IE  D  F E+++ F S
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQS-QDKEV-DFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS
        +NSGG   +     +  SLPS+   + ++KEV DFE R+SKRAR  KDF  +F  +NV +DP  L EAL S D+  W+ A+NDEM+S
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQS-QDKEV-DFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS

ABI34306.1 Polyprotein, putative [Solanum demissum]1.3e-6651.05Show/hide
Query:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        F ++YL K K DA+E FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIHETT PYSP  NG AERKNRTL+EL  A+L+E  A  ++WGE
Subjt:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
         I T  YVLNR+P   SK +P+E+ K   P+L YLR W CLA+VR+ DPK  KL  +  T  F+GY  N+  YRF++LE+ ++IE  D  F E+++ F S
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEV-DFEPRKSKRARTVKDFREDFETYNVEDPK-DLTEALFSVDANLWQVAINDEMDS
        +NSGG   + +     SS  ST    ++KEV DFE R+SKRAR  KDF  DF  +NV D +  L EAL S D+  W+ A+NDEM+S
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEV-DFEPRKSKRARTVKDFREDFETYNVEDPK-DLTEALFSVDANLWQVAINDEMDS

CAX68207.1 Copia-like retrotransposon [Helianthus annuus]1.3e-5844.7Show/hide
Query:  FLFIYLKIKRD-AYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        +L++YL   +D A+E FK +  E+E Q  KRIK L SDRG EY +  F+ F   +GIIHE TAPY+P+ NG AERKNRTL+E+   +L + G   + WGE
Subjt:  FLFIYLKIKRD-AYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
         + T  YV NRI      TSPYE+ K + PNL +LR W CLAY R+PDPK  KL  RA+  VFIGY  ++K+YR  D E+ +++E  DV+FFE+++S   
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVE------------------DPKDLTEALFSVDANLWQVAINDEM
         N        S G ++SS    R+  Q   +  EPRKS R R  K + +DF +Y VE                  DPK   EA+ S DA LW+ A+NDEM
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVE------------------DPKDLTEALFSVDANLWQVAINDEM

Query:  DS
        DS
Subjt:  DS

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]8.5e-12783.8Show/hide
Query:  FLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        + FIY LK K DAYEMFK FVTEIENQFNKRIKRL SDRGTEYDS+AFNEFYNSKGIIHETT PYSPEMNGK ERKNRTL EL VAILLE  AAPSWWGE
Subjt:  FLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
        IIKT+NYVLNRIPKS SKTSPYEVLKHK PNLSYLRTW CLAYVRIP+P+RRKLAS+AY  VFIGY EN+K YRFYDLEN+VIIE NDVDFFE ++ FKS
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDANLWQVAINDEMDS
        RNSGGL SQTS G S SSLPS RIQ+QDKEVD EPR+SKRARTVKDFREDFE YNVEDPKDLT+AL SVDANLWQ AIND +DS
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDANLWQVAINDEMDS

RZC09450.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]1.8e-9264.87Show/hide
Query:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN
        +K K +A +MFK FVTEIENQFNK+IK+L SDRGT+YDS  FNEFYN  GIIHETTAPYSPEMNGKAERKNRT  ELVVA +L   A   WWGEI+ T+ 
Subjt:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN

Query:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL
        YVLNRIPKS SKTSPYE+LK + PNLSYLRTW CLAYVRIPDPKR KLASRAY  VFIGY  N+K YRFYDL  +VIIE ND DF+E+++ FK R+SGG 
Subjt:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL

Query:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS
        +S     +SS +L   +        D EPR+ KRAR  KD+  D+  Y + EDP +L EAL  +DA+LWQ AINDEMDS
Subjt:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS

TrEMBL top hitse value%identityAlignment
A0A445KFK2 Retrovirus-related Pol polyprotein from transposon TNT 1-948.7e-9364.87Show/hide
Query:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN
        +K K +A +MFK FVTEIENQFNK+IK+L SDRGT+YDS  FNEFYN  GIIHETTAPYSPEMNGKAERKNRT  ELVVA +L   A   WWGEI+ T+ 
Subjt:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN

Query:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL
        YVLNRIPKS SKTSPYE+LK + PNLSYLRTW CLAYVRIPDPKR KLASRAY  VFIGY  N+K YRFYDL  +VIIE ND DF+E+++ FK R+SGG 
Subjt:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL

Query:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS
        +S     +SS +L   +        D EPR+ KRAR  KD+  D+  Y + EDP +L EAL  +DA+LWQ AINDEMDS
Subjt:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNV-EDPKDLTEALFSVDANLWQVAINDEMDS

A0A5D3DCJ1 Putative Polyprotein4.1e-12783.8Show/hide
Query:  FLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        + FIY LK K DAYEMFK FVTEIENQFNKRIKRL SDRGTEYDS+AFNEFYNSKGIIHETT PYSPEMNGK ERKNRTL EL VAILLE  AAPSWWGE
Subjt:  FLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
        IIKT+NYVLNRIPKS SKTSPYEVLKHK PNLSYLRTW CLAYVRIP+P+RRKLAS+AY  VFIGY EN+K YRFYDLEN+VIIE NDVDFFE ++ FKS
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDANLWQVAINDEMDS
        RNSGGL SQTS G S SSLPS RIQ+QDKEVD EPR+SKRARTVKDFREDFE YNVEDPKDLT+AL SVDANLWQ AIND +DS
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDANLWQVAINDEMDS

A0A7N2L531 Uncharacterized protein7.6e-7354.48Show/hide
Query:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN
        LK K DA+E F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIHETTAPYSP  NG AERKNRTLIEL  A+L+E GA   +WGE I T  
Subjt:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN

Query:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL
        +VLNR+P   S T+P+E+ K   PNL YLR W CLAYVR+ DPK  KL  RA T  F+GY  N+  YRF+DLEN++I E  D  F E ++ FK +NSGG 
Subjt:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL

Query:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVED-PKDLTEALFSVDANLWQVAINDEMDS
         +  S+  SS+S      Q+Q+   + EPR+SKRAR  KDF  D+  +N+E+ PK+L EAL S DA  W+ A+NDEM+S
Subjt:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVED-PKDLTEALFSVDANLWQVAINDEMDS

A0A7N2R9F3 Uncharacterized protein3.5e-7053.05Show/hide
Query:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN
        LK K DA+E F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIHETTAPYSP  NG  ERKNRTLIEL  A+L+E GA   +WGE I T  
Subjt:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN

Query:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL
        +VLNR+P   S T+P+E+ K   PNL YLR W CLAYVR+ DPK  KL  RA T  F+GY  N+  YRF+DLEN++I E  D  F E ++ FK +NSGG 
Subjt:  YVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGL

Query:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVED-PKDLTEALFSVDANLWQVAINDEMDS
         +   +  SS+S     +Q+Q+   + E R+SKRAR  KDF  D+  +N+E+ P++L EAL S DA  W+ A+NDEM+S
Subjt:  NSQTSEGLSSSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVED-PKDLTEALFSVDANLWQVAINDEMDS

Q0KIN7 Polyprotein, putative6.2e-6751.05Show/hide
Query:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE
        F ++YL K K DA+E FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIHETT PYSP  NG AERKNRTL+EL  A+L+E  A  ++WGE
Subjt:  FLFIYL-KIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGE

Query:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
         I T  YVLNR+P   SK +P+E+ K   P+L YLR W CLA+VR+ DPK  KL  +  T  F+GY  N+  YRF++LE+ ++IE  D  F E+++ F S
Subjt:  IIKTINYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEV-DFEPRKSKRARTVKDFREDFETYNVEDPK-DLTEALFSVDANLWQVAINDEMDS
        +NSGG   + +     SS  ST    ++KEV DFE R+SKRAR  KDF  DF  +NV D +  L EAL S D+  W+ A+NDEM+S
Subjt:  RNSGGLNSQTSEGLSSSSLPSTRIQSQDKEV-DFEPRKSKRARTVKDFREDFETYNVEDPK-DLTEALFSVDANLWQVAINDEMDS

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.1e-2329.5Show/hide
Query:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN
        +K K D + MF+ FV + E  FN ++  L  D G EY S    +F   KGI +  T P++P++NG +ER  RT+ E    ++       S+WGE + T  
Subjt:  LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTIN

Query:  YVLNRIPKSY---SKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNS
        Y++NRIP      S  +PYE+  +K P L +LR +    YV I + K+ K   +++  +F+GY  N   ++ +D  NE  I   DV   E        NS
Subjt:  YVLNRIPKSY---SKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNS

Query:  GGLNSQTSEGLSSSSLPSTRIQSQDKEV---DFEPRKSKRARTVKDFREDFETYNVEDPKD
          +  +T     S    +    +  +++   +F P +SK    ++  ++  E+ N   P D
Subjt:  GGLNSQTSEGLSSSSLPSTRIQSQDKEV---DFEPRKSKRARTVKDFREDFETYNVEDPKD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2733.48Show/hide
Query:  LFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEI
        L++Y LK K   +++F+ F   +E +  +++KRL SD G EY S  F E+ +S GI HE T P +P+ NG AER NRT++E V ++L       S+WGE 
Subjt:  LFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEI

Query:  IKTINYVLNRIPK-SYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS
        ++T  Y++NR P    +   P  V  +K  + S+L+ + C A+  +P  +R KL  ++   +FIGY +    YR +D   + +I   DV F E     + 
Subjt:  IKTINYVLNRIPK-SYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKS

Query:  RNSGGLNSQTSEGLSSS--SLPST
        R +  ++ +   G+  +  ++PST
Subjt:  RNSGGLNSQTSEGLSSS--SLPST

P47024 Transposon Ty4-J Gag-Pol polyprotein2.0e-0619.68Show/hide
Query:  IENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTINYVLNRIPKSYSKTSPYE
        +E QF+++++ + SDRGTE+ +    E++ SKGI H  T+      NG+AER  RT+I     +L +      +W   + +   + N +    +   P +
Subjt:  IENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTINYVLNRIPKSYSKTSPYE

Query:  VLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGLNSQTSEGLSSSSLPSTR
         +  +   +  +          I +   +KL       + +    N+  Y+F+      I+  ++     +    + RN+  +N   S   SS +     
Subjt:  VLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGLNSQTSEGLSSSSLPSTR

Query:  IQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDAN
            D E D     +     ++++ +D +     +     E L  +D+N
Subjt:  IQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDAN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1829.56Show/hide
Query:  TTFLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWW
        T + ++Y LK K    E F  F   +EN+F  RI    SD G E+  +A  E+++  GI H T+ P++PE NG +ERK+R ++E  + +L       ++W
Subjt:  TTFLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWW

Query:  GEIIKTINYVLNRIPKSYSK-TSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYS
                Y++NR+P    +  SP++ L   +PN   LR + C  Y  +    + KL  ++   VF+GY+     Y    L+   +     V F E+ + 
Subjt:  GEIIKTINYVLNRIPKSYSK-TSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYS

Query:  FKS
        F +
Subjt:  FKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-1928.12Show/hide
Query:  TTFLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWW
        T + ++Y LK K    + F  F + +EN+F  RI  L SD G E+  +   ++ +  GI H T+ P++PE NG +ERK+R ++E+ + +L       ++W
Subjt:  TTFLFIY-LKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWW

Query:  GEIIKTINYVLNRIPKSYSK-TSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYS
                Y++NR+P    +  SP++ L  + PN   L+ + C  Y  +    R KL  ++    F+GY+     Y    +    +     V F E  + 
Subjt:  GEIIKTINYVLNRIPKSYSK-TSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYS

Query:  FKSRNSGGLNSQTSEGLSSSSLPS
        F + N G   SQ     S+ + PS
Subjt:  FKSRNSGGLNSQTSEGLSSSSLPS

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0430.49Show/hide
Query:  NRTLIELVVAILLELGAAPSWWGEIIKTINYVLNRIPK-SYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASR
        NRT+IE V ++L E G   ++  +   T  +++N+ P  + +   P EV     P  SYLR + C+AY+   + K +  A +
Subjt:  NRTLIELVVAILLELGAAPSWWGEIIKTINYVLNRIPK-SYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTTCTGACTACATTTTTATTTATCTACTTAAAAATAAAAAGGGATGCCTATGAAATGTTCAAAGCCTTTGTAACTGAAATAGAGAACCAGTTTAATAAAAGAAT
TAAGAGACTTTGTAGTGATAGAGGAACTGAATATGATTCAATTGCTTTCAATGAGTTTTATAACTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAA
TGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAATTGAGTTAGTAGTTGCTATCTTACTTGAGTTAGGAGCAGCACCATCTTGGTGGGGTGAAATAATTAAAACTATT
AATTATGTTCTTAATAGGATTCCTAAATCATACAGTAAAACTTCACCATACGAAGTCCTTAAACATAAAACACCAAACTTGTCTTATCTTAGAACTTGGGTTTGTCTAGC
TTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGAGCCTATACACGTGTCTTCATAGGATACACTGAAAACAATAAAACCTATAGGTTCTATGACTTAG
AAAACGAAGTAATCATAGAATTGAATGACGTAGATTTTTTCGAGCATAGATATTCTTTTAAATCTAGAAATAGTGGGGGCCTAAATAGTCAAACTAGTGAGGGCTTAAGT
TCTAGTAGCCTACCTTCAACTAGGATCCAATCCCAAGATAAGGAAGTAGATTTTGAACCTAGAAAAAGCAAGAGAGCTAGAACAGTAAAAGATTTTAGAGAAGACTTCGA
AACGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCCTTATTCTCAGTAGATGCCAATTTATGGCAAGTAGCTATCAATGATGAAATGGACTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGTTCTGACTACATTTTTATTTATCTACTTAAAAATAAAAAGGGATGCCTATGAAATGTTCAAAGCCTTTGTAACTGAAATAGAGAACCAGTTTAATAAAAGAAT
TAAGAGACTTTGTAGTGATAGAGGAACTGAATATGATTCAATTGCTTTCAATGAGTTTTATAACTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAA
TGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAATTGAGTTAGTAGTTGCTATCTTACTTGAGTTAGGAGCAGCACCATCTTGGTGGGGTGAAATAATTAAAACTATT
AATTATGTTCTTAATAGGATTCCTAAATCATACAGTAAAACTTCACCATACGAAGTCCTTAAACATAAAACACCAAACTTGTCTTATCTTAGAACTTGGGTTTGTCTAGC
TTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGAGCCTATACACGTGTCTTCATAGGATACACTGAAAACAATAAAACCTATAGGTTCTATGACTTAG
AAAACGAAGTAATCATAGAATTGAATGACGTAGATTTTTTCGAGCATAGATATTCTTTTAAATCTAGAAATAGTGGGGGCCTAAATAGTCAAACTAGTGAGGGCTTAAGT
TCTAGTAGCCTACCTTCAACTAGGATCCAATCCCAAGATAAGGAAGTAGATTTTGAACCTAGAAAAAGCAAGAGAGCTAGAACAGTAAAAGATTTTAGAGAAGACTTCGA
AACGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCCTTATTCTCAGTAGATGCCAATTTATGGCAAGTAGCTATCAATGATGAAATGGACTCTTGA
Protein sequenceShow/hide protein sequence
MTVLTTFLFIYLKIKRDAYEMFKAFVTEIENQFNKRIKRLCSDRGTEYDSIAFNEFYNSKGIIHETTAPYSPEMNGKAERKNRTLIELVVAILLELGAAPSWWGEIIKTI
NYVLNRIPKSYSKTSPYEVLKHKTPNLSYLRTWVCLAYVRIPDPKRRKLASRAYTRVFIGYTENNKTYRFYDLENEVIIELNDVDFFEHRYSFKSRNSGGLNSQTSEGLS
SSSLPSTRIQSQDKEVDFEPRKSKRARTVKDFREDFETYNVEDPKDLTEALFSVDANLWQVAINDEMDS