; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0052001 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0052001
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr02:19069272..19070147
RNA-Seq ExpressionCmc02g0052001
SyntenyCmc02g0052001
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum]1.2e-8055.63Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        FK ++ E+ENQF ++IKR+RSDRG EY+S  FN F  S GIIHETT PYSP  NG AERKNRTL EL  A+L+ES A  ++W E I +  YVLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS
        SK + +E+ K   P+L YLR WGCLA+VR+ DPK  KL  +   C F+GYA NS  YRF++LE+ ++IES D  F E+ FPF S+NSGG   +     + 
Subjt:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS

Query:  NSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
         SLPS    T ++KEV D E RR+KRA+  KDFG +F ++NV +DP  L EALSS D+  W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Subjt:  NSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

ABI34306.1 Polyprotein, putative [Solanum demissum]6.0e-8055.29Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        FK ++ E+ENQF ++IKR+RSDRG EY+S  FN F  S GIIHETT PYSP  NG AERKNRTL EL  A+L+ES A  ++W E I +  YVLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS
        SK + +E+ K   P+L YLR WGCLA+VR+ DPK  KL  +   C F+GYA NS  YRF++LE+ ++IES D  F E+ FPF S+NSGG   +     + 
Subjt:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS

Query:  NSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
         +LPS    T ++KEV D E RR+KRA+  KDFG DF ++NV D +  L EALSS D+  W+EA+NDEM SL SN+TW LVDLPPGCK IGCK
Subjt:  NSLPSIRIQT-QDKEV-DPEPRRNKRAKTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]3.1e-14591.23Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ERKNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS
        NSKTS YEVLKHK PNLSYLRTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED FPFKSRNSGGL SQTSGGSS
Subjt:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS

Query:  SNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC
         +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Subjt:  SNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC

RZC09450.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]7.8e-11271.67Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        MFK+FVTEIENQFNK+IK+LRSDRGT+YDS  FNEFY+  GIIHETTAPYSPEMNGKAERKNRT TELVVA +L S A   WW EI+ +V YVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS
         SKTS YE+LK + PNLSYLRTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL  KVIIESND DF+E+ FPFK R+        SGG+S
Subjt:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS

Query:  SNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
        SN LP+I  +     + D EPRR KRA+  KD+G D+  Y + EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Subjt:  SNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

XP_023158131.2 uncharacterized protein LOC103653943 isoform X1 [Zea mays]1.4e-7350.34Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        FK++ TE+ENQ +K+IKRLRSDRG EY S  F+E+    GIIHETTAPYSP+ NG AERKNRT+ +L  A+L  SG    WW E + +V YVLNR+P  N
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENK-------VIIESNDVDFFEDIFPFKSRNSGGLCSQ
         + + YE  K + P+LS+LRTWGCLA V +P PK+RKL  +  +CVF+GYA NS  YRF  + ++       VI+ES DV FFE IFP + +        
Subjt:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENK-------VIIESNDVDFFEDIFPFKSRNSGGLCSQ

Query:  TSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
          G S + SLPS      D+  D E RR+KR +T K  G+D+ +Y V E+P+ LTEA +S DA  W+EA+  EM+S+ SN TW + DLP GCK +GCK
Subjt:  TSGGSSSNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

TrEMBL top hitse value%identityAlignment
A0A445KFK2 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-11271.67Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        MFK+FVTEIENQFNK+IK+LRSDRGT+YDS  FNEFY+  GIIHETTAPYSPEMNGKAERKNRT TELVVA +L S A   WW EI+ +V YVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS
         SKTS YE+LK + PNLSYLRTWGCLAYVRIPDPKR KLASRAYECVFIGYA NSK YRFYDL  KVIIESND DF+E+ FPFK R+        SGG+S
Subjt:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS

Query:  SNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
        SN LP+I  +     + D EPRR KRA+  KD+G D+  Y + EDP +L EALS +DA+LWQEAINDEM+SLES++TWHLVDLPPGCK IGCK
Subjt:  SNSLPSIRIQT-QDKEVDPEPRRNKRAKTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

A0A5D3DCJ1 Putative Polyprotein1.5e-14591.23Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFY+SKGIIHETT PYSPEMNGK ERKNRTLTEL VAILLES AAPSWW EIIK+VNYVLNRIPKS
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS
        NSKTS YEVLKHK PNLSYLRTWGCLAYVRIP+P+RRKLAS+AYECVFIGYAENSK YRFYDLENKVIIESNDVDFFED FPFKSRNSGGL SQTSGGSS
Subjt:  NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS

Query:  SNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC
         +SLPSIRIQTQDKEVDPEPRR+KRA+TVKDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIND ++SLESNRTWHLVDLPP C
Subjt:  SNSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGC

A0A7N2L531 Uncharacterized protein1.6e-8658.76Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG AERKNRTL EL  A+L+ESGA   +W E I +  +VLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS
        S T+ +E+ K   PNL YLR W CLAYVR+ DPK  KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ FPFK +NSGG  +  S  SSS
Subjt:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS

Query:  NSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
         S      Q Q+   + EPRR+KRA+  KDFG D+ ++N+E+ PK+L EAL+S DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Subjt:  NSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

A0A7N2N1S1 Integrase catalytic domain-containing protein1.8e-8259.57Show/hide
Query:  RIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTP
        +IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG AERKNRTL EL  A+L+ESGA   +W E I +  +VLNR+P   S T+ +E+ K   P
Subjt:  RIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVLKHKTP

Query:  NLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKE
        NL YLR WGCLAYVR+ DPK  KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ FPFK +NSGG  +  S  SSS S      Q Q+  
Subjt:  NLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKE

Query:  VDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
         + EPRR+KRA+  KDFG D+ ++N+E+ PK+L EAL+S DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Subjt:  VDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

A0A7N2R9F3 Uncharacterized protein1.1e-8457.73Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        F+ F+ E+ENQF ++IKR+RSDRG EY+S AFN F  S GIIHETTAPYSP  NG  ERKNRTL EL  A+L+ESGA   +W E I +  +VLNR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS
        S T+ +E+ K   PNL YLR WGCLAYVR+ DPK  KL  RA  C F+GYA NS  YRF+DLENK+I ES D  F E+ FPFK +NSGG  +     SSS
Subjt:  SKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSS

Query:  NSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK
         S     +Q Q+   + E RR+KRA+  KDFG D+ ++N+E+ P++L EAL+S DA  W+EA+NDEM SL SNRTW LVDLPPGCK IGCK
Subjt:  NSLPSIRIQTQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-2027.76Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        MF+ FV + E  FN ++  L  D G EY S    +F   KGI +  T P++P++NG +ER  RT+TE    ++  +    S+W E + +  Y++NRIP  
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  ---NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFE-DIFPFKSRNSGGLCSQTS
           +S  + YE+  +K P L +LR +G   YV I + K+ K   ++++ +F+GY  N   ++ +D  N+  I + DV   E ++   ++     +  + S
Subjt:  ---NSKTSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFE-DIFPFKSRNSGGLCSQTS

Query:  GGSSSNSLPSIRIQTQDKEVDPE-PRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEM--NSLESNR
          S + + P+       K +  E P  +K    ++   +  E  N   P D  + + +   N  +E  N +   +S ESN+
Subjt:  GGSSSNSLPSIRIQTQDKEVDPE-PRRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEM--NSLESNR

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein3.1e-1027.54Show/hide
Query:  IENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIP-KSNSKTSLY
        +E QF+++++ + SDRGTE+ +    E++ SKGI H  T+      NG+AER  RT+      +L +S     +WE  + S   + N +  KS  K  L 
Subjt:  IENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIP-KSNSKTSLY

Query:  EVLKHK-TPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFY-DLENKVIIESN
         + +   T  L     +G      I +   +KL       + +    NS  Y+F+   +NK++   N
Subjt:  EVLKHK-TPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFY-DLENKVIIESN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-3631.72Show/hide
Query:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS
        +F+ F   +E +  +++KRLRSD G EY S  F E+ SS GI HE T P +P+ NG AER NRT+ E V ++L  +    S+W E +++  Y++NR P  
Subjt:  MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKS

Query:  NSKTSLYE-VLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFED--------------------
             + E V  +K  + S+L+ +GC A+  +P  +R KL  ++  C+FIGY +    YR +D   K +I S DV F E                     
Subjt:  NSKTSLYE-VLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFED--------------------

Query:  -IFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQ---------------TQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ
           P  S N     S T   S     P   I+               TQ +E     RR++R +         E   + D   P+ L E LS  + N   
Subjt:  -IFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQ---------------TQDKEVDPEPRRNKRAKTVKDFGEDFEMYNVED---PKDLTEALSSVDANLWQ

Query:  EAINDEMNSLESNRTWHLVDLPPGCKAIGCK
        +A+ +EM SL+ N T+ LV+LP G + + CK
Subjt:  EAINDEMNSLESNRTWHLVDLPPGCKAIGCK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-2132.43Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        F  F   +EN+F  RI    SD G E+  VA  E++S  GI H T+ P++PE NG +ERK+R + E  + +L  +    ++W        Y++NR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SK-TSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKS
         +  S ++ L   +PN   LR +GC  Y  +    + KL  ++ +CVF+GY+     Y    L+   +  S  V F E+ FPF +
Subjt:  SK-TSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.4e-2331.07Show/hide
Query:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN
        F +F + +EN+F  RI  L SD G E+  V   ++ S  GI H T+ P++PE NG +ERK+R + E+ + +L  +    ++W        Y++NR+P   
Subjt:  FKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSN

Query:  SK-TSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS
         +  S ++ L  + PN   L+ +GC  Y  +    R KL  ++ +C F+GY+     Y    +    +  S  V F E  FPF + N G   SQ     S
Subjt:  SK-TSLYEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSS

Query:  SNSLPS
        + + PS
Subjt:  SNSLPS

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.6e-0428.24Show/hide
Query:  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSL-YEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYE
        NRT+ E V ++L E G   ++  +   +  +++N+ P +     +  EV     P  SYLR +GC+AY+   + K +  A +  E
Subjt:  NRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSL-YEVLKHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATACGATTCAGTTGCTTTCAATGAGTTTTA
TAGCTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTAC
TTGAGTCAGGAGCCGCACCATCTTGGTGGGAAGAAATAATTAAGAGTGTTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACTATACGAAGTCCTT
AAACATAAAACACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGAAAATTAGCAAGTAGAGCCTATGAATGTGT
TTTCATAGGATATGCTGAAAATAGTAAAACCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCGAGGACATATTTCCTTTTA
AATCTAGAAATAGTGGGGGCCTATGTAGTCAAACTAGTGGGGGCTCAAGTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCT
AGAAGAAACAAGAGAGCTAAAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAA
TTTATGGCAAGAAGCTATCAATGATGAAATGAACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGCTATAGGCTGCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACTGAATACGATTCAGTTGCTTTCAATGAGTTTTA
TAGCTCAAAAGGAATAATACATGAAACTACTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTAC
TTGAGTCAGGAGCCGCACCATCTTGGTGGGAAGAAATAATTAAGAGTGTTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACTATACGAAGTCCTT
AAACATAAAACACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGAAAATTAGCAAGTAGAGCCTATGAATGTGT
TTTCATAGGATATGCTGAAAATAGTAAAACCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATCGAATGACGTAGATTTTTTCGAGGACATATTTCCTTTTA
AATCTAGAAATAGTGGGGGCCTATGTAGTCAAACTAGTGGGGGCTCAAGTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCT
AGAAGAAACAAGAGAGCTAAAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAA
TTTATGGCAAGAAGCTATCAATGATGAAATGAACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTGGATGTAAAGCTATAGGCTGCAAATGA
Protein sequenceShow/hide protein sequence
MFKVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYSSKGIIHETTAPYSPEMNGKAERKNRTLTELVVAILLESGAAPSWWEEIIKSVNYVLNRIPKSNSKTSLYEVL
KHKTPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIGYAENSKTYRFYDLENKVIIESNDVDFFEDIFPFKSRNSGGLCSQTSGGSSSNSLPSIRIQTQDKEVDPEP
RRNKRAKTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINDEMNSLESNRTWHLVDLPPGCKAIGCK