; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0246681 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0246681
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr09:10762519..10763316
RNA-Seq ExpressionCmc09g0246681
SyntenyCmc09g0246681
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-13994.32Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTLKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINL+RI RLVKNGLLN+L+D SLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]4.1e-13692.8Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTLKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINL+RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI RSDRGGEYMDL FQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-10774.23Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MT++VGTG VISA AVG   L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI KNGV ICSAKLENNLYVL+   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
          TQNKR +ISP  N +LWHLRLGHINLNRIERLVKNGLL+ELE++SLP CESCLEGKMTKRPFT KG+R KEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIE
         DDYSRYGY+YLM+HKSEALEKFKEYKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y++E
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIE

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-13592.42Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        M LKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE+NLYVLKPNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRIS NNNTYLWHLRLGHINL+RI RLVKNGLLN+LEDDSLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-12990.15Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTL VGTGDVISA AVG+ +LFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRL HINL+RI RLVKNGLLN+L+DDSLPPCESCLEGKMTKRPFT K YR KEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein2.0e-13692.8Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTLKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINL+RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI RSDRGGEYMDL FQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

A0A5A7TYF5 Gag/pol protein6.7e-10874.23Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MT++VGTG VISA AVG   L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI KNGV ICSAKLENNLYVL+   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
          TQNKR +ISP  N +LWHLRLGHINLNRIERLVKNGLL+ELE++SLP CESCLEGKMTKRPFT KG+R KEPLEL+HSDLCGPMNVKARGGFEYFI+F
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIE
         DDYSRYGY+YLM+HKSEALEKFKEYKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y++E
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIE

A0A5A7TZD0 Gag/pol protein4.3e-13994.32Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTLKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINL+RI RLVKNGLLN+L+D SLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

A0A5A7VJG3 Gag/pol protein6.2e-13090.15Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        MTL VGTGDVISA AVG+ +LFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVL+PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRISPNNNTYLWHLRL HINL+RI RLVKNGLLN+L+DDSLPPCESCLEGKMTKRPFT K YR KEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

A0A5D3BNE1 Gag/pol protein7.6e-13692.42Show/hide
Query:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT
        M LKVGTGDVISA AVG+A+LFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE+NLYVLKPNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRT

Query:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         NTQNKRQRIS NNNTYLWHLRLGHINL+RI RLVKNGLLN+LEDDSLPPCESCLEGKMTKRPFT KGYR KEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  VNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEH IQ
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.1e-2230.2Show/hide
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRISPNNNTYLWHLRLGHIN-
        LE++    +   NL+SV  L E   SI F  +   ISKNG+ +  ++ + NN+          V+N + + ++N ++K       NN  LWH R GHI+ 
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRISPNNNTYLWHLRLGHIN-

Query:  -----LNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPF--TRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
             + R        LLN LE  S   CE CL GK  + PF   +    +K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+  
Subjt:  -----LNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPF--TRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQF
          F+++ A+ E   + K+  L  D G EY+    + + ++  I +
Subjt:  EKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-2730.38Show/hide
Query:  MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRISPNNNTYLWHLRLGHIN
        + L+++  VP ++ NL+S   L    Y   F+  +  ++K  + I        LY       +  LN            +  IS +    LWH R+GH++
Subjt:  MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRISPNNNTYLWHLRLGHIN

Query:  LNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK
           ++ L K  L++  +  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++ 
Subjt:  LNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK

Query:  AEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ
        A VE    +K+K LRSD GGEY    F++Y   H I+
Subjt:  AEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQ

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.4e-1427.17Show/hide
Query:  ISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRI
        I  +A+GN    F N           P I  +L+S+S L     +  F+ N      +G  +       + Y L  ++   + +H    T+N  NK +  
Subjt:  ISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRI

Query:  SPNNNTY-LWHLRLGHINLNRIERLVKNGLLNELEDDSLP-------PCESCLEGKMTKRPFTRKGYRVK-----EPLELIHSDLCGPMNVKARGGFEYF
        S N   Y L H  LGH N   I++ +K   +  L++  +         C  CL GK TK     KG R+K     EP + +H+D+ GP++   +    YF
Subjt:  SPNNNTY-LWHLRLGHINLNRIERLVKNGLLNELEDDSLP-------PCESCLEGKMTKRPFTRKGYRVK-----EPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKILRSDRGGEY
        ISF D+ +R+ ++Y +  + E   L  F    A ++N  + ++ +++ DRG EY
Subjt:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKILRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-1625.94Show/hide
Query:  VGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMF
        V  G  I     G+  L   ++ + L N+  VP I +NL+SV  L          +  +F + +      GV +   K ++ LY      ++ V    +F
Subjt:  VGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMF

Query:  RTVNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELE-DDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYF
         + +++              WH RLGH   + +  ++ N  L+ L        C  CL  K  K PF++       PLE I+SD+     + +   + Y+
Subjt:  RTVNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELE-DDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEI
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +H I
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-1828.52Show/hide
Query:  VGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTV
        +  G  I     G+A L   ++ + L  +  VP I +NL+SV  L   +  S+ F      +     GV +   K ++ LY      ++AV    MF + 
Subjt:  VGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTV

Query:  NTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELE-DDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF
         ++              WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+       +PLE I+SD+     + +   + Y++ F
Subjt:  NTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELE-DDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEI
        +D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +H I
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEI

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein6.4e-1035.96Show/hide
Query:  TVNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNV
        +V T       +  + T LWH RL H++   +E LVK G L+  +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  TVNTQNKRQRISPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCAAGGTTGGAACAGGAGATGTCATTTCAGCTCATGCAGTGGGAAATGCTGAGTTATTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCTGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTT
CGGCTAAGCTTGAAAACAACTTGTATGTATTAAAACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGTTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATCGAGAGATTGGTAAAGAATGGACTTCTAAACGAGTTAGAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATAGAGTCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGACTATTCGAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAAGACTA
TATGATAGAACATGAAATCCAATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACACTCAAGGTTGGAACAGGAGATGTCATTTCAGCTCATGCAGTGGGAAATGCTGAGTTATTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCTGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTT
CGGCTAAGCTTGAAAACAACTTGTATGTATTAAAACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGTTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATCGAGAGATTGGTAAAGAATGGACTTCTAAACGAGTTAGAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATAGAGTCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGACTATTCGAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAAGACTA
TATGATAGAACATGAAATCCAATTCTAA
Protein sequenceShow/hide protein sequence
MTLKVGTGDVISAHAVGNAELFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLKPNEAKAVLNHEMFRTVNTQNKRQRI
SPNNNTYLWHLRLGHINLNRIERLVKNGLLNELEDDSLPPCESCLEGKMTKRPFTRKGYRVKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
EKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHEIQF