CuGenDBv2

Cmc02g0053371 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0053371
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr02:20802271..20803035
RNA-Seq Expression: Cmc02g0053371
Synteny: Cmc02g0053371
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
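
The genome location above is written as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and using the hypothetical helper names `parse_location` and `span_length`, the length of the annotated span could be computed like this:

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr02:20802271..20803035")
print(seqid, span_length(start, end))
```

Under that assumption the span is 765 bp, which equals 3 × (254 + 1): the 254-residue query protein shown in the alignments plus one stop codon. This would be consistent with a single-exon gene model.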


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa], e value 5.5e-138, 97.24% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE FI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa], e value 2.6e-135, 95.67% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE FI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVK+GLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL F
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
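
The % identity value can be related to the middle line of each alignment block. In BLAST-style output a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below assumes that convention applies to this page; the helper name `midline_identity` is hypothetical. It counts identities for the last block of the alignment above.

```python
def midline_identity(midline, aligned_length):
    """Return (identical_count, percent_identity) for one alignment block."""
    # Pad in case trailing spaces were trimmed from the middle line.
    midline = midline.ljust(aligned_length)
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, 100.0 * identical / aligned_length


query = "IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF"
midline = "IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL F"

identical, percent = midline_identity(midline, len(query))
print(f"{identical}/{len(query)} identical ({percent:.2f}%)")
```

This block alone covers only the final 54 residues. Applying the same count to all three blocks of this hit gives 243 identical positions out of 254, which is in line with the reported 95.67%.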

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.8e-104, 74.02% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG  ISA  VG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+VFI KNGV ICSAKLE+NLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINLNRI RLVKNGLL++L+++SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+F
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa], e value 2.8e-134, 94.49% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNE FISKNGVHICS KLEDNLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRIS NNNTYLWHLRLGHINL+RIGRLVKNGLLNKL+DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYS YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa], e value 7.2e-130, 93.7% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGD ISAR VGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNE FISKNG     AKLEDNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRL HINL+RIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein, e value 1.2e-135, 95.67% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE FI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVK+GLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL F
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

A0A5A7TYF5 Gag/pol protein, e value 8.7e-105, 74.02% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG  ISA  VG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+VFI KNGV ICSAKLE+NLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        A TQNKR +ISP  N +LWHLRLGHINLNRI RLVKNGLL++L+++SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
         DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+F
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

A0A5A7TZD0 Gag/pol protein, e value 2.7e-138, 97.24% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE FI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

A0A5A7VJG3 Gag/pol protein, e value 3.5e-130, 93.7% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGD ISAR VGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNE FISKNG     AKLEDNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRL HINL+RIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYSRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

A0A5D3BNE1 Gag/pol protein, e value 1.4e-134, 94.49% identity
Query:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGD ISAR VGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNE FISKNGVHICS KLEDNLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRIS NNNTYLWHLRLGHINL+RIGRLVKNGLLNKL+DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
        IDDYS YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein, e value 4.1e-19, 30.87% identity
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHIC-SAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-
        LE++    +   NL+SV  L E   SI F  + V ISKNG+ +  ++ + +N+          V+N + + + N ++K       NN  LWH R GHI+ 
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHIC-SAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-

Query:  -----LNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
             + R        LLN L + S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+  
Subjt:  -----LNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKTEVENLLSKKIKILRSDRGGEYM
          F+++  + E   + K+  L  D G EY+
Subjt:  EKFKEYKTEVENLLSKKIKILRSDRGGEYM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94, e value 3.8e-25, 30.4% identity
Query:  MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN
        + L+++  VP ++ NL+S   L    Y   F+  +  ++K  + I        LY       +  LN            +  IS +    LWH R+GH++
Subjt:  MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN

Query:  LNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK
           +  L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++ 
Subjt:  LNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK

Query:  TEVENLLSKKIKILRSDRGGEYMDLRF
          VE    +K+K LRSD GGEY    F
Subjt:  TEVENLLSKKIKILRSDRGGEYMDLRF

Q12491 Transposon Ty2-B Gag-Pol polyprotein, e value 2.8e-12, 28.07% identity
Query:  PKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTY-LWHLRLGHINLNRIGRLV
        P I  +L+S+S L     +  F+ N +  S +G  +       + Y L  ++   + +H    T N  NK +  S N   Y L H  LGH N   I + +
Subjt:  PKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTY-LWHLRLGHINLNRIGRLV

Query:  KNGLLNKLKDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE--ALE
        K   +  LK+  +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YFISF D+ +R+ ++Y +  + E   L 
Subjt:  KNGLLNKLKDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE--ALE

Query:  KFKEYKTEVENLLSKKIKILRSDRGGEY
         F      ++N  + ++ +++ DRG EY
Subjt:  KFKEYKTEVENLLSKKIKILRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1, e value 1.7e-17, 27.06% identity
Query:  VGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMF
        V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F + ++     GV +   K +D LY     E     +  + 
Subjt:  VGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMF

Query:  RTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLK-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF
          A+  +K    S       WH RLGH   + +  ++ N  L+ L        C  CL  K  K PF+     +  PLE I+SD+     + +   + Y+
Subjt:  RTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLK-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2, e value 9.1e-19, 30.04% identity
Query:  VGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINF--SMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTA
        +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F  +  +V     GV +   K +D LY      ++AV    MF  A
Subjt:  VGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINF--SMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTA

Query:  NTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLK-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        +  +K    S       WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ F
Subjt:  NTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLK-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLR
        +D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein, e value 1.8e-09, 40% identity
Query:  NNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV


Sequences
CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGCCATTTCAGCTCGTGTAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGTGTTCATTTCTAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAGACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGACGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCTAG
mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGCCATTTCAGCTCGTGTAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGTGTTCATTTCTAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAGACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGACGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCTAG
Protein sequence
MTLKVGTGDAISARVVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEVFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
SPNNNTYLWHLRLGHINLNRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRF
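
As a consistency check between the CDS and protein sequences, the CDS can be translated with the standard genetic code. The sketch below is an illustration only, not the database's own pipeline. The helper name `translate` is hypothetical, and it assumes the standard code (NCBI translation table 1) applies. For brevity it is run on the first ten codons and the last two codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS above.
print(translate("ATGACACTCAAGGTTGGAACGGGAGATGCC"))
# Last two codons of the CDS above.
print(translate("TTCTAG"))
```

The first call yields `MTLKVGTGDA`, matching the start of the protein sequence, and the second yields `F*`, matching its final residue followed by the stop codon.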