; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr08:21084441..21085316
RNA-Seq ExpressionCmc08g0227761
SyntenyCmc08g0227761
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum]3.0e-7954.95Show/hide
Query:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN
        FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIH+TT  YS   NG AERKNRTL +L  A+L+ES A  ++WGE I T  YVLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN

Query:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS
        SK + +E+ K   P++ YLR WGCLA+VR+ DPK  KL  +V  C F+GY  NS AYRF++LE+ ++IES D  F +++FPF S+NSGG   +     + 
Subjt:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS

Query:  SSLPSIRIQT-QDKEV-DPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
         SLPS    T ++KEV D E RRSKRAR  KDFG +F ++NV +DP  L EALSS D+  W+EA+NDEM+SL SN+TW LVDLPPGCK IGCK
Subjt:  SSLPSIRIQT-QDKEV-DPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

ABI34306.1 Polyprotein, putative [Solanum demissum]1.3e-7954.95Show/hide
Query:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN
        FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIH+TT  YS   NG AERKNRTL +L  A+L+ES A  ++WGE I T  YVLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN

Query:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS
        SK +P+E+ K   P++ YLR WGCLA+VR+ DPK  KL  +V  C F+GY  NS AYRF++LE+ ++IES D  F +++FPF S+NSGG   +     + 
Subjt:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS

Query:  SSLPSIRIQT-QDKEV-DPEPRRSKRARTIKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
         +LPS    T ++KEV D E RRSKRAR  KDFG DF ++NV D +  L EALSS D+  W+EA+NDEM+SL SN+TW LVDLPPGCK IGCK
Subjt:  SSLPSIRIQT-QDKEV-DPEPRRSKRARTIKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]3.1e-14590.18Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS
        MFKVFVTEIENQFNKRIKRL SDRGTEYDSVAFNEFYNSKGIIH+TT  YS EMNGK ERKNRTLT+L VAILLES AAPSWWGEIIKTVNYVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS

Query:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS
        NSKTSPYEVLKHK PN++YLRTWGCLAYVRIP+P+RRKL S+ Y CVFIGY ENSKAYRFYDLENKVIIESNDVDFF+D+FPFKSRNSGGLYSQTSGGSS
Subjt:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS

Query:  SSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGC
         SSLPSIRIQTQDKEVDPEPRRSKRART+KDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIND +DSLESNRTWHLVDLPP C
Subjt:  SSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGC

RZC09450.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]1.6e-10969.62Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS
        MFK+FVTEIENQFNK+IK+L SDRGT+YDS  FNEFYN  GIIH+TTA YS EMNGKAERKNRT T+LVVA +L S A   WWGEI+ TV YVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS

Query:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS
         SKTSPYE+LK + PN++YLRTWGCLAYVRIPDPKR KL SR Y CVFIGY  NSKAYRFYDL  KVIIESND DF++++FPFK R+        SGG+S
Subjt:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS

Query:  SSSLPSIRIQT-QDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
        S+ LP+I  +     + D EPRR KRAR  KD+G D+  Y + EDP +L EALS +DA+LWQEAINDEMDSLES++TWHLVDLPPGCK IGCK
Subjt:  SSSLPSIRIQT-QDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

XP_023158131.2 uncharacterized protein LOC103653943 isoform X1 [Zea mays]2.3e-7149.66Show/hide
Query:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN
        FK++ TE+ENQ +K+IKRL SDRG EY S  F+E+    GIIH+TTA YS + NG AERKNRT+  L  A+L  SG    WWGE + TV YVLNR+P  N
Subjt:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN

Query:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENK-------VIIESNDVDFFKDRFPFKSRNSGGLYSQ
         + +PYE  K + P++++LRTWGCLA V +P PK+RKL  +   CVF+GY  NS AYRF  + ++       VI+ES DV FF+  FP + +        
Subjt:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENK-------VIIESNDVDFFKDRFPFKSRNSGGLYSQ

Query:  TSGGSSSSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
          G S + SLPS      D+  D E RRSKR RT K  G+D+ +Y V E+P+ LTEA +S DA  W+EA+  EMDS+ SN TW + DLP GCK +GCK
Subjt:  TSGGSSSSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

TrEMBL top hitse value%identityAlignment
A0A445KFK2 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-11069.62Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS
        MFK+FVTEIENQFNK+IK+L SDRGT+YDS  FNEFYN  GIIH+TTA YS EMNGKAERKNRT T+LVVA +L S A   WWGEI+ TV YVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS

Query:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS
         SKTSPYE+LK + PN++YLRTWGCLAYVRIPDPKR KL SR Y CVFIGY  NSKAYRFYDL  KVIIESND DF++++FPFK R+        SGG+S
Subjt:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS

Query:  SSSLPSIRIQT-QDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
        S+ LP+I  +     + D EPRR KRAR  KD+G D+  Y + EDP +L EALS +DA+LWQEAINDEMDSLES++TWHLVDLPPGCK IGCK
Subjt:  SSSLPSIRIQT-QDKEVDPEPRRSKRARTIKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

A0A5D3DCJ1 Putative Polyprotein1.5e-14590.18Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS
        MFKVFVTEIENQFNKRIKRL SDRGTEYDSVAFNEFYNSKGIIH+TT  YS EMNGK ERKNRTLT+L VAILLES AAPSWWGEIIKTVNYVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS

Query:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS
        NSKTSPYEVLKHK PN++YLRTWGCLAYVRIP+P+RRKL S+ Y CVFIGY ENSKAYRFYDLENKVIIESNDVDFF+D+FPFKSRNSGGLYSQTSGGSS
Subjt:  NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSS

Query:  SSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGC
         SSLPSIRIQTQDKEVDPEPRRSKRART+KDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIND +DSLESNRTWHLVDLPP C
Subjt:  SSSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGC

A0A7N2L531 Uncharacterized protein2.3e-8557.73Show/hide
Query:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN
        F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIH+TTA YS   NG AERKNRTL +L  A+L+ESGA   +WGE I T  +VLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN

Query:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS
        S T+P+E+ K   PN+ YLR W CLAYVR+ DPK  KL  R   C F+GY  NS AYRF+DLENK+I ES D  F +++FPFK +NSGG  +  S  SSS
Subjt:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS

Query:  SSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
        +S      Q Q+   + EPRRSKRAR  KDFG D+ ++N+E+ PK+L EAL+S DA  W+EA+NDEM+SL SNRTW LVDLPPGCK IGCK
Subjt:  SSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

A0A7N2N1S1 Integrase catalytic domain-containing protein2.6e-8158.48Show/hide
Query:  RIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTP
        +IKR+ SDRG EY+S AFN F  S GIIH+TTA YS   NG AERKNRTL +L  A+L+ESGA   +WGE I T  +VLNR+P   S T+P+E+ K   P
Subjt:  RIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKTP

Query:  NVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIRIQTQDKE
        N+ YLR WGCLAYVR+ DPK  KL  R   C F+GY  NS AYRF+DLENK+I ES D  F +++FPFK +NSGG  +  S  SSS+S      Q Q+  
Subjt:  NVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIRIQTQDKE

Query:  VDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
         + EPRRSKRAR  KDFG D+ ++N+E+ PK+L EAL+S DA  W+EA+NDEM+SL SNRTW LVDLPPGCK IGCK
Subjt:  VDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

A0A7N2R9F3 Uncharacterized protein1.6e-8356.7Show/hide
Query:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN
        F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIH+TTA YS   NG  ERKNRTL +L  A+L+ESGA   +WGE I T  +VLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSN

Query:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS
        S T+P+E+ K   PN+ YLR WGCLAYVR+ DPK  KL  R   C F+GY  NS AYRF+DLENK+I ES D  F +++FPFK +NSGG  +     SSS
Subjt:  SKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSS

Query:  SSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK
        +S     +Q Q+   + E RRSKRAR  KDFG D+ ++N+E+ P++L EAL+S DA  W+EA+NDEM+SL SNRTW LVDLPPGCK IGCK
Subjt:  SSLPSIRIQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK

SwissProt top hitse value%identityAlignment
A0A0B7P3V8 Transposon Ty4-P Gag-Pol polyprotein2.0e-0920.48Show/hide
Query:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE
        +E QF+++++ ++SDRGTE+ +    E++ SKGI H  T++     NG+AER  RT+      +L +S     +W   + +   + N +   ++   P +
Subjt:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE

Query:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR
         +  +   V  +          I +   +KL       + +    NS  Y+F+      I+ S++          + RN+  +Y      S +       
Subjt:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR

Query:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN
            D E D     +     ++++ +D +     +     E LS +D+N
Subjt:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN

P04146 Copia protein1.5e-2028.11Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS
        MF+ FV + E  FN ++  L+ D G EY S    +F   KGI +  T  ++ ++NG +ER  RT+T+    ++  +    S+WGE + T  Y++NRIP  
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKS

Query:  ---NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFK-DRFPFKSRNSGGLYSQTS
           +S  +PYE+  +K P + +LR +G   YV I + K+ K   + +  +F+GY  N   ++ +D  N+  I + DV   + +    ++     ++ + S
Subjt:  ---NSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFK-DRFPFKSRNSGGLYSQTS

Query:  GGSSSSSLPSIRIQTQDKEVDPE-PRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEM--DSLESNR
          S + + P+       K +  E P  SK    I+   +  E  N   P D  + + +   N  +E  N +   DS ESN+
Subjt:  GGSSSSSLPSIRIQTQDKEVDPE-PRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEM--DSLESNR

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein2.0e-0920.48Show/hide
Query:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE
        +E QF+++++ ++SDRGTE+ +    E++ SKGI H  T++     NG+AER  RT+      +L +S     +W   + +   + N +   ++   P +
Subjt:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE

Query:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR
         +  +   V  +          I +   +KL       + +    NS  Y+F+      I+ S++          + RN+  +Y      S +       
Subjt:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR

Query:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN
            D E D     +     ++++ +D +     +     E LS +D+N
Subjt:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-3330.51Show/hide
Query:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPK-
        +F+ F   +E +  +++KRL SD G EY S  F E+ +S GI H+ T   + + NG AER NRT+ + V ++L  +    S+WGE ++T  Y++NR P  
Subjt:  MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPK-

Query:  SNSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDR-------------------
          +   P  V  +K  + ++L+ +GC A+  +P  +R KL  +   C+FIGY +    YR +D   K +I S DV F +                     
Subjt:  SNSKTSPYEVLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDR-------------------

Query:  --FPFKSRNSGGLYSQTSGGSSSSSLPSIRIQ---------------TQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ
           P  S N     S T   S     P   I+               TQ +E     RRS+R R         E   + D   P+ L E LS  + N   
Subjt:  --FPFKSRNSGGLYSQTSGGSSSSSLPSIRIQ---------------TQDKEVDPEPRRSKRARTIKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ

Query:  EAINDEMDSLESNRTWHLVDLPPGCKGIGCK
        +A+ +EM+SL+ N T+ LV+LP G + + CK
Subjt:  EAINDEMDSLESNRTWHLVDLPPGCKGIGCK

P47024 Transposon Ty4-J Gag-Pol polyprotein1.3e-0820.88Show/hide
Query:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE
        +E QF+++++ ++SDRGTE+ +    E++ SKGI H  T++     NG+AER  RT+      +L +S     +W   + +   + N +   ++   P +
Subjt:  IENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYE

Query:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR
         +  +   V  +          I +   +KL       + +    NS  Y+F+      I+ S       D +   +    G    T   + S    S  
Subjt:  VLKHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIR

Query:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN
            D E D     +     ++++ +D +     +     E LS +D+N
Subjt:  IQTQDKEVDPEPRRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDAN

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.4e-0429.33Show/hide
Query:  NRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTS-PYEVLKHKTPNVTYLRTWGCLAYVRIPDPK
        NRT+ + V ++L E G   ++  +   T  +++N+ P +      P EV     P  +YLR +GC+AY+   + K
Subjt:  NRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTS-PYEVLKHKTPNVTYLRTWGCLAYVRIPDPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAATTTAACAAAAGAATTAAGAGACTTCATAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAATTTTA
TAACTCAAAAGGAATAATACATAAAACTACTGCGTCTTATTCTACTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTAAGTTAGTAGTTGCTATCTTAC
TTGAGTCAGGAGCAGCACCATCTTGGTGGGGTGAAATAATTAAGACTGTTAATTATGTTCTTAATAGAATTCCTAAATCTAACAGTAAAACTTCACCATACGAAGTCCTT
AAACATAAAACACCAAACGTGACTTATCTTAGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGTAAGTAGAGTCTATGGATGTGT
CTTCATAGGATACACTGAAAATAGTAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCAAGGACAGATTTCCTTTTA
AATCTAGAAATAGTGGGGGCCTATATAGTCAAACTAGTGGGGGCTCAAGTTCCAGTAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCT
AGAAGAAGCAAGAGAGCTAGAACAATAAAAGACTTCGGAGAAGACTTTGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTATCATCAGTAGATGCCAA
TTTATGGCAAGAAGCTATCAATGATGAAATGGACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGGTATAGGCTGCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAATTTAACAAAAGAATTAAGAGACTTCATAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAATTTTA
TAACTCAAAAGGAATAATACATAAAACTACTGCGTCTTATTCTACTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTAAGTTAGTAGTTGCTATCTTAC
TTGAGTCAGGAGCAGCACCATCTTGGTGGGGTGAAATAATTAAGACTGTTAATTATGTTCTTAATAGAATTCCTAAATCTAACAGTAAAACTTCACCATACGAAGTCCTT
AAACATAAAACACCAAACGTGACTTATCTTAGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGTAAGTAGAGTCTATGGATGTGT
CTTCATAGGATACACTGAAAATAGTAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCAAGGACAGATTTCCTTTTA
AATCTAGAAATAGTGGGGGCCTATATAGTCAAACTAGTGGGGGCTCAAGTTCCAGTAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCT
AGAAGAAGCAAGAGAGCTAGAACAATAAAAGACTTCGGAGAAGACTTTGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTATCATCAGTAGATGCCAA
TTTATGGCAAGAAGCTATCAATGATGAAATGGACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGGTATAGGCTGCAAATGA
Protein sequenceShow/hide protein sequence
MFKVFVTEIENQFNKRIKRLHSDRGTEYDSVAFNEFYNSKGIIHKTTASYSTEMNGKAERKNRTLTKLVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVL
KHKTPNVTYLRTWGCLAYVRIPDPKRRKLVSRVYGCVFIGYTENSKAYRFYDLENKVIIESNDVDFFKDRFPFKSRNSGGLYSQTSGGSSSSSLPSIRIQTQDKEVDPEP
RRSKRARTIKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMDSLESNRTWHLVDLPPGCKGIGCK