CuGenDBv2

ClCG07G006780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG07G006780
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CG_Chr07:13618498..13620533
RNA-Seq Expression: ClCG07G006780
Synteny: ClCG07G006780
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
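
The genome location listed in this record can be split into its parts programmatically. The following is a minimal sketch, assuming the `seqid:start..end` layout seen here and assuming 1-based inclusive coordinates (the page does not state its coordinate convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


# Location string copied from this record.
print(parse_location("CG_Chr07:13618498..13620533"))
```

Under the inclusive-coordinate assumption, the locus spans 2036 bp on CG_Chr07.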


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0039651.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]; e value: 7.5e-42; % identity: 41.1
Query:  TEDIGLMAKKIPTEIETSEERILEGD---------------------WFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRN
        TEDI LMA+K   E E  +E+I E +                     WFV+YKS+ GDSVYM NN +CEII  GS+LLKLS+NREVLLKGVRH PKL RN
Subjt:  TEDIGLMAKKIPTEIETSEERILEGD---------------------WFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRN

Query:  LISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSR--KENTFQEQY
        LISLGMLDDLGC I+ E+G +++ + G+ IL ++K E LY V NV +PKYALIS +E+  +     R L  ++ + +  +  Q    +R  K   F E  
Subjt:  LISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSR--KENTFQEQY

Query:  LTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIE
        +                 F   +  K  K E  ++             L+++  +    +  N +  SRH+T+AYTPQQNGV E MNRTL+E
Subjt:  LTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIE

KAA0045569.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa]; e value: 9.8e-42; % identity: 37.06
Query:  MYQSKTLDENLDEFKKLTNAFNQERK---VRLQAAILINSIHDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNY-PERGKKLE
        M ++K LDENLDEFKKLTNA NQ  +      +AAILIN IHD+YKE K + L+ +N            G   F ++ N + +   K + +  + G    
Subjt:  MYQSKTLDENLDEFKKLTNAFNQERK---VRLQAAILINSIHDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNY-PERGKKLE

Query:  E--KTTEDIGLMAKKIPTEIETSEER-ILEG----------DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLG
        E  K TE +    +K   EIET EE  +L+           +WFV+YKS+  DS+YM NN +CEII  G +LLKLS+NREVLLKGVRH PKL RNLISLG
Subjt:  E--KTTEDIGLMAKKIPTEIETSEER-ILEG----------DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLG

Query:  MLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEI
                 H  K  L+ + HG     A+                                                                       
Subjt:  MLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEI

Query:  FGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------
                            W               RT N LEFLSN+FN LCNEF IS+H  +AYTPQQN V ERMNRTL+E V               
Subjt:  FGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------

Query:  EALATATYTVNRVLCVSIEMKTLEKRWTG
        EALA ATYTVNR LCVSI+ KT E+RWTG
Subjt:  EALATATYTVNRVLCVSIEMKTLEKRWTG

KAA0054988.1 hypothetical protein E6C27_scaffold43052G001360 [Cucumis melo var. makuwa]; e value: 2.3e-43; % identity: 36.21
Query:  DEFKKLTNAFNQERK---VRLQAAILINSIHDSYKE----------------------SKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDK
        +EFKKLTNAFNQ  +      +AAILINSIHD+YKE                      S+ELELKTENK S+ AESL  KG N F R ++NK+QR ++DK
Subjt:  DEFKKLTNAFNQERK---VRLQAAILINSIHDSYKE----------------------SKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDK

Query:  ---------------NYPERGKKL---EEKTTEDIGLMAKKIPTEIETSEERILEGDWFVEYKSKVGDSVYMDNNHECEIIST---GSMLLKLSDNREVL
                       N P+RGK     E +     G          +  + R   G         VG+  +       E+++T    +M +K  +   VL
Subjt:  ---------------NYPERGKKL---EEKTTEDIGLMAKKIPTEIETSEERILEGDWFVEYKSKVGDSVYMDNNHECEIIST---GSMLLKLSDNREVL

Query:  LKG-------------VRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLAS
          G              RH PKL RNLISLGMLDDLGC I+ E+G +++ + GR IL ++K E LY V NV +PKYALIS +E+        + L  ++ 
Subjt:  LKG-------------VRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLAS

Query:  ESIALMASQSGKNSR--KENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMA
        + +  +       +R  K   F E        IFG   ++           K  K E  T+             L+++  +    +  + +  SRH+T+A
Subjt:  ESIALMASQSGKNSR--KENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMA

Query:  YTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
        YTPQQNGV ERMNRTL+E V               EALATATYTV R LCVSI+MKT E+RWTG
Subjt:  YTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG

KAA0056038.1 hypothetical protein E6C27_scaffold319G001970 [Cucumis melo var. makuwa]; e value: 3.5e-39; % identity: 42.86
Query:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR
        M NN +CEII   S+LLKLS+NREVLLK VRH PKL RNLISLG                                                        
Subjt:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR

Query:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF
                                          E +   YM IFGDQL++ LG  Q          +VETQTE+ IK+ RT NGLEFLSN+FN LCNEF
Subjt:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF

Query:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
         ISRH+T+AYTPQQNGV ERMNRTL+E V               EALATATYTVNR  CVSI+MKT E+RWTG
Subjt:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG

KAA0062924.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]; e value: 1.1e-40; % identity: 42.91
Query:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR
        M NN  CEII   S+LLKLS+NREVLLKGVRH PKL R+LISLGM+DDLGC I+ EKG +++ + GR IL ++K E LYIV NV +PKYALIS +E+   
Subjt:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR

Query:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCN
             R L  ++ + +  +  Q    +R+            ++ FG       G+ +  ++ K   +   T              L ++  +    +  +
Subjt:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCN

Query:  EFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
         +  SRH+T+AYT QQNGV ERMNRTL+E V               EALATATYTVNR  CVSI+MKT E+RWTG
Subjt:  EFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
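
The % identity figure attached to each hit can be approximated from an alignment block. The following is a minimal sketch, assuming the usual BLAST-style midline convention in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `midline_identity` is a hypothetical helper. It expects the rows with the `Query:` label and leading padding already removed. The database value is computed over the whole alignment, so a single wrapped block will generally not reproduce it.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumption: letters in the midline mark identical residues, while
    '+' and spaces do not count as identities.
    """
    if not query:
        raise ValueError("empty alignment row")
    # Trailing spaces are often trimmed from copied text, so pad the
    # midline back out to the width of the query row.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the KAA0039651.1 alignment in this record.
print(midline_identity("TEDIGLMAKK", "TEDI LMA+K"))
```

Only the query and midline rows are used, because the midline already encodes which columns are identical.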

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7TWF0 Putative retroelement pol polyprotein; e value: 4.8e-42; % identity: 37.06
Query:  MYQSKTLDENLDEFKKLTNAFNQERK---VRLQAAILINSIHDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNY-PERGKKLE
        M ++K LDENLDEFKKLTNA NQ  +      +AAILIN IHD+YKE K + L+ +N            G   F ++ N + +   K + +  + G    
Subjt:  MYQSKTLDENLDEFKKLTNAFNQERK---VRLQAAILINSIHDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNY-PERGKKLE

Query:  E--KTTEDIGLMAKKIPTEIETSEER-ILEG----------DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLG
        E  K TE +    +K   EIET EE  +L+           +WFV+YKS+  DS+YM NN +CEII  G +LLKLS+NREVLLKGVRH PKL RNLISLG
Subjt:  E--KTTEDIGLMAKKIPTEIETSEER-ILEG----------DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLG

Query:  MLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEI
                 H  K  L+ + HG     A+                                                                       
Subjt:  MLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEI

Query:  FGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------
                            W               RT N LEFLSN+FN LCNEF IS+H  +AYTPQQN V ERMNRTL+E V               
Subjt:  FGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------

Query:  EALATATYTVNRVLCVSIEMKTLEKRWTG
        EALA ATYTVNR LCVSI+ KT E+RWTG
Subjt:  EALATATYTVNRVLCVSIEMKTLEKRWTG

A0A5A7UJ23 Integrase catalytic domain-containing protein; e value: 1.1e-43; % identity: 36.21
Query:  DEFKKLTNAFNQERK---VRLQAAILINSIHDSYKE----------------------SKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDK
        +EFKKLTNAFNQ  +      +AAILINSIHD+YKE                      S+ELELKTENK S+ AESL  KG N F R ++NK+QR ++DK
Subjt:  DEFKKLTNAFNQERK---VRLQAAILINSIHDSYKE----------------------SKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDK

Query:  ---------------NYPERGKKL---EEKTTEDIGLMAKKIPTEIETSEERILEGDWFVEYKSKVGDSVYMDNNHECEIIST---GSMLLKLSDNREVL
                       N P+RGK     E +     G          +  + R   G         VG+  +       E+++T    +M +K  +   VL
Subjt:  ---------------NYPERGKKL---EEKTTEDIGLMAKKIPTEIETSEERILEGDWFVEYKSKVGDSVYMDNNHECEIIST---GSMLLKLSDNREVL

Query:  LKG-------------VRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLAS
          G              RH PKL RNLISLGMLDDLGC I+ E+G +++ + GR IL ++K E LY V NV +PKYALIS +E+        + L  ++ 
Subjt:  LKG-------------VRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLAS

Query:  ESIALMASQSGKNSR--KENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMA
        + +  +       +R  K   F E        IFG   ++           K  K E  T+             L+++  +    +  + +  SRH+T+A
Subjt:  ESIALMASQSGKNSR--KENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMA

Query:  YTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
        YTPQQNGV ERMNRTL+E V               EALATATYTV R LCVSI+MKT E+RWTG
Subjt:  YTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG

A0A5A7ULT2 Integrase catalytic domain-containing protein; e value: 1.7e-39; % identity: 42.86
Query:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR
        M NN +CEII   S+LLKLS+NREVLLK VRH PKL RNLISLG                                                        
Subjt:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR

Query:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF
                                          E +   YM IFGDQL++ LG  Q          +VETQTE+ IK+ RT NGLEFLSN+FN LCNEF
Subjt:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF

Query:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
         ISRH+T+AYTPQQNGV ERMNRTL+E V               EALATATYTVNR  CVSI+MKT E+RWTG
Subjt:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG

A0A5D3BAM8 Retrotransposon protein, putative, Ty1-copia subclass; e value: 3.6e-42; % identity: 41.1
Query:  TEDIGLMAKKIPTEIETSEERILEGD---------------------WFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRN
        TEDI LMA+K   E E  +E+I E +                     WFV+YKS+ GDSVYM NN +CEII  GS+LLKLS+NREVLLKGVRH PKL RN
Subjt:  TEDIGLMAKKIPTEIETSEERILEGD---------------------WFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRN

Query:  LISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSR--KENTFQEQY
        LISLGMLDDLGC I+ E+G +++ + G+ IL ++K E LY V NV +PKYALIS +E+  +     R L  ++ + +  +  Q    +R  K   F E  
Subjt:  LISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSR--KENTFQEQY

Query:  LTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIE
        +                 F   +  K  K E  ++             L+++  +    +  N +  SRH+T+AYTPQQNGV E MNRTL+E
Subjt:  LTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIE

A0A5D3D1S7 Retrotransposon protein, putative, Ty1-copia subclass; e value: 5.3e-41; % identity: 42.91
Query:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR
        M NN  CEII   S+LLKLS+NREVLLKGVRH PKL R+LISLGM+DDLGC I+ EKG +++ + GR IL ++K E LYIV NV +PKYALIS +E+   
Subjt:  MDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR

Query:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCN
             R L  ++ + +  +  Q    +R+            ++ FG       G+ +  ++ K   +   T              L ++  +    +  +
Subjt:  ANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEF--NSLCN

Query:  EFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG
         +  SRH+T+AYT QQNGV ERMNRTL+E V               EALATATYTVNR  CVSI+MKT E+RWTG
Subjt:  EFDISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRVLCVSIEMKTLEKRWTG

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein; e value: 5.6e-08; % identity: 22.76
Query:  KTLDENLDEFKKLTNAFNQERKVRLQAAILINSI---HDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNYPERGKKLEEKTTE
        K  +++ D  KK+ NA         +  +  N +      +K + + ++K  +    G E    K   H+KR  NNK++         E  K+++  T+ 
Subjt:  KTLDENLDEFKKLTNAFNQERKVRLQAAILINSI---HDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNYPERGKKLEEKTTE

Query:  DIGLMAKKI-PTEIETSEERILE---GDWFVEYKSKVGDSVYMDNNHECEIISTGSM-------LLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGC
         I  M K++  T +  +   +L+    D  +  +S   DSV +    +  +   G         +++L ++ E+ L+ V    +   NL+S+  L + G 
Subjt:  DIGLMAKKI-PTEIETSEERILE---GDWFVEYKSKVGDSVYMDNNHECEIISTGSM-------LLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGC

Query:  SIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR------ANSSSRSL-----KDLASESIALMASQ----------SGKNSR--
        SI  +K  + I K+G  ++  K    L  V  +N   Y++ +  + N R       + S   L     K++ S+   L   +          +GK +R  
Subjt:  SIHAEKGCLEILKHGRAILTAKKRERLYIVINVNRPKYALISYSERNIR------ANSSSRSL-----KDLASESIALMASQ----------SGKNSR--

Query:  ----KENTF----------------------QEQYLTMYMEIFGDQLKIYLGRFQT--FEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF
            K+ T                        + Y  ++++ F      YL ++++  F  F+ +  + E      + Y    NG E+LSNE    C + 
Subjt:  ----KENTF----------------------QEQYLTMYMEIFGDQLKIYLGRFQT--FEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEF

Query:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRV
         IS H T+ +TPQ NGV ERM RT+ E                 EA+ TATY +NR+
Subjt:  DISRHKTMAYTPQQNGVVERMNRTLIETV---------------EALATATYTVNRV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.4e-14; % identity: 26.33
Query:  DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIH-------AEKGCLEILKH-GRAILTAKKRER
        D F  Y +    +V M N    +I   G + +K +    ++LK VRH P L  NLIS   LD  G   +         KG L I K   R  L     E 
Subjt:  DWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIH-------AEKGCLEILKH-GRAILTAKKRER

Query:  LYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQS----------GKNSRK--------------------------ENTFQEQYLTMY
            +N  + + ++  + +R    + S + L+ LA +S+   A  +          GK  R                           E+    +Y   +
Subjt:  LYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQS----------GKNSRK--------------------------ENTFQEQYLTMY

Query:  MEIFGDQLKIYL--GRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV----------
        ++    +L +Y+   + Q F+ F+ +   VE +T + +K  R+ NG E+ S EF   C+   I   KT+  TPQ NGV ERMNRT++E V          
Subjt:  MEIFGDQLKIYL--GRFQTFEYFKIWKNEVETQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETV----------

Query:  -----EALATATYTVNRVLCVSIEMKTLEKRWTGEIIT
             EA+ TA Y +NR   V +  +  E+ WT + ++
Subjt:  -----EALATATYTVNRVLCVSIEMKTLEKRWTGEIIT

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTTTTCTCTTTTAGAATGTATCAAAGCAAGACTCTTGATGAAAATTTAGATGAGTTCAAGAAATTGACCAATGCCTTCAATCAGGAGCGGAAAGTGAGGCTT
CAGGCTGCTATTCTCATTAATTCGATCCATGATTCCTACAAAGAAAGCAAAGAGTTAGAGCTGAAAACAGAGAACAAGATCTCTAGTGGAGCAGAATCTCTCTGT
TCAAAGGGAAACAATCATTTCAAAAGAAGCCACAACAATAAAAGCCAAAGAATGAATAAAGACAAAAATTATCCTGAAAGGGGAAAAAAATTAGAAGAGAAGACA
ACAGAAGATATAGGCCTTATGGCAAAGAAGATACCAACAGAAATAGAGACTTCAGAAGAGAGGATCCTAGAAGGGGACTGGTTTGTTGAATACAAATCAAAAGTG
GGAGACTCAGTCTACATGGACAATAATCATGAGTGTGAGATTATTAGTACAGGCTCAATGTTATTGAAGCTCTCAGACAACAGGGAGGTTCTCCTTAAAGGAGTG
AGACATGCTCCAAAATTAAATAGAAACCTCATCTCTTTAGGTATGCTTGATGATTTAGGCTGCTCTATTCATGCTGAGAAGGGGTGCTTGGAAATATTGAAACAT
GGCAGGGCAATACTCACAGCAAAAAAGAGAGAACGGTTGTATATTGTGATAAATGTGAATAGACCGAAATATGCATTGATATCTTACTCTGAAAGGAATATAAGG
GCTAATTCAAGCTCGAGGTCACTAAAGGACTTAGCTTCTGAGAGCATTGCACTTATGGCAAGTCAAAGCGGCAAAAATTCTCGAAAGGAGAACACTTTTCAAGAG
CAATACTTGACTATGTACATGGAGATCTTTGGGGACCAGCTGAAAATCTATCTTGGGAGGTTCCAAACTTTTGAATACTTTAAAATCTGGAAAAACGAGGTTGAA
ACTCAAACTGAGAAGAATATTAAATACCCGAGAACTTATAACGGTCTAGAGTTCCTAAGTAATGAATTTAACTCTCTATGCAATGAGTTTGACATATCTAGACAC
AAAACAATGGCTTACACTCCCCAACAAAATGGGGTTGTTGAAAGGATGAACAGAACCTTGATAGAAACAGTCGAAGCATTAGCCACTGCCACCTACACAGTAAAT
AGAGTCCTATGTGTTTCTATTGAGATGAAGACCCTTGAAAAAAGATGGACTGGTGAAATCATCACAAATCCTAACCCTGGTAATGATCAATTAGCTGAAACTTTA
CATAGCTCTCAATCTCAAGAAGGAGTATACGGAGAGCTTCTTTCCATGACGCACAAAAGATGA
mRNA sequence
ATGCTTTTCTCTTTTAGAATGTATCAAAGCAAGACTCTTGATGAAAATTTAGATGAGTTCAAGAAATTGACCAATGCCTTCAATCAGGAGCGGAAAGTGAGGCTT
CAGGCTGCTATTCTCATTAATTCGATCCATGATTCCTACAAAGAAAGCAAAGAGTTAGAGCTGAAAACAGAGAACAAGATCTCTAGTGGAGCAGAATCTCTCTGT
TCAAAGGGAAACAATCATTTCAAAAGAAGCCACAACAATAAAAGCCAAAGAATGAATAAAGACAAAAATTATCCTGAAAGGGGAAAAAAATTAGAAGAGAAGACA
ACAGAAGATATAGGCCTTATGGCAAAGAAGATACCAACAGAAATAGAGACTTCAGAAGAGAGGATCCTAGAAGGGGACTGGTTTGTTGAATACAAATCAAAAGTG
GGAGACTCAGTCTACATGGACAATAATCATGAGTGTGAGATTATTAGTACAGGCTCAATGTTATTGAAGCTCTCAGACAACAGGGAGGTTCTCCTTAAAGGAGTG
AGACATGCTCCAAAATTAAATAGAAACCTCATCTCTTTAGGTATGCTTGATGATTTAGGCTGCTCTATTCATGCTGAGAAGGGGTGCTTGGAAATATTGAAACAT
GGCAGGGCAATACTCACAGCAAAAAAGAGAGAACGGTTGTATATTGTGATAAATGTGAATAGACCGAAATATGCATTGATATCTTACTCTGAAAGGAATATAAGG
GCTAATTCAAGCTCGAGGTCACTAAAGGACTTAGCTTCTGAGAGCATTGCACTTATGGCAAGTCAAAGCGGCAAAAATTCTCGAAAGGAGAACACTTTTCAAGAG
CAATACTTGACTATGTACATGGAGATCTTTGGGGACCAGCTGAAAATCTATCTTGGGAGGTTCCAAACTTTTGAATACTTTAAAATCTGGAAAAACGAGGTTGAA
ACTCAAACTGAGAAGAATATTAAATACCCGAGAACTTATAACGGTCTAGAGTTCCTAAGTAATGAATTTAACTCTCTATGCAATGAGTTTGACATATCTAGACAC
AAAACAATGGCTTACACTCCCCAACAAAATGGGGTTGTTGAAAGGATGAACAGAACCTTGATAGAAACAGTCGAAGCATTAGCCACTGCCACCTACACAGTAAAT
AGAGTCCTATGTGTTTCTATTGAGATGAAGACCCTTGAAAAAAGATGGACTGGTGAAATCATCACAAATCCTAACCCTGGTAATGATCAATTAGCTGAAACTTTA
CATAGCTCTCAATCTCAAGAAGGAGTATACGGAGAGCTTCTTTCCATGACGCACAAAAGATGA
Protein sequence
MLFSFRMYQSKTLDENLDEFKKLTNAFNQERKVRLQAAILINSIHDSYKESKELELKTENKISSGAESLCSKGNNHFKRSHNNKSQRMNKDKNYPERGKKLEEKT
TEDIGLMAKKIPTEIETSEERILEGDWFVEYKSKVGDSVYMDNNHECEIISTGSMLLKLSDNREVLLKGVRHAPKLNRNLISLGMLDDLGCSIHAEKGCLEILKH
GRAILTAKKRERLYIVINVNRPKYALISYSERNIRANSSSRSLKDLASESIALMASQSGKNSRKENTFQEQYLTMYMEIFGDQLKIYLGRFQTFEYFKIWKNEVE
TQTEKNIKYPRTYNGLEFLSNEFNSLCNEFDISRHKTMAYTPQQNGVVERMNRTLIETVEALATATYTVNRVLCVSIEMKTLEKRWTGEIITNPNPGNDQLAETL
HSSQSQEGVYGELLSMTHKR
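
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    can be pasted as is. Codons with ambiguous bases become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 36 bases of the CDS in this record.
print(translate("ATGCTTTTCTCTTTTAGAATGTATCAAAGCAAGACT"))  # MLFSFRMYQSKT
```

The printed residues match the start of the protein sequence listed in this record. Passing the full CDS and comparing the result with the full protein sequence extends the same check to the whole record.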