; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0165701 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0165701
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:17673767..17674654
RNA-Seq ExpressionCmc06g0165701
SyntenyCmc06g0165701
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-15294.85Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG+HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKL+D SLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-15093.81Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG+HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKL+D SLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-11673.2Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VG   VISA AVG  +L  +  F+ LEN+Y+VP +KRNL+ V CL+E  YS+ F++N+ FI KNG+ ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++LE++SLP  ESCLEGKMTKRPFTGKG+RAKEPLEL+HSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
         DDYS YGY+YLM HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-14892.76Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M LKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSI+FSMNEAFISKNG+HICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKA G FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMV
        IDDYS YGYLYL+ HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMV
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMV

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-14592.1Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VG  DVISARAVGD KLFF  KFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKNGLLNKL+DDSLPP ESCLEGKMTKRPFTGK YRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein9.3e-15193.81Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG+HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKL+D SLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

A0A5A7TYF5 Gag/pol protein1.1e-11673.2Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VG   VISA AVG  +L  +  F+ LEN+Y+VP +KRNL+ V CL+E  YS+ F++N+ FI KNG+ ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        A TQNKR +ISP  N +LWHLRLGHINL+RI RLVKNGLL++LE++SLP  ESCLEGKMTKRPFTGKG+RAKEPLEL+HSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
         DDYS YGY+YLM HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

A0A5A7TZD0 Gag/pol protein1.3e-15294.85Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG+HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKL+D SLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

A0A5A7VJG3 Gag/pol protein3.4e-14592.1Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VG  DVISARAVGD KLFF  KFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKNGLLNKL+DDSLPP ESCLEGKMTKRPFTGK YRAKEPLELIHSDL GPMNVKARG FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDDYS YGYLYLM HK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

A0A5D3BNE1 Gag/pol protein6.7e-14992.76Show/hide
Query:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M LKVG  DVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLV VSCLIEHMYSI+FSMNEAFISKNG+HICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
        ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPP ESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKA G FEYFI F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMV
        IDDYS YGYLYL+ HKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMV
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMV

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-2531.37Show/hide
Query:  LENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-
        LE++    +   NL+ V  L E   SI F  +   ISKNG+ +  ++ + NN+          V+N + + + N ++K       NN  LWH R GHI+ 
Subjt:  LENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-

Query:  -----LDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGYLYLMNHKSEAL
             + R        LLN LE  S    E CL GK  + PF     +   K PL ++HSD+ GP+      D  YF+ F+D ++ Y   YL+ +KS+  
Subjt:  -----LDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGYLYLMNHKSEAL

Query:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
          F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER   T+ +  R
Subjt:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.4e-3229.55Show/hide
Query:  TLKVGMEDVISARAVGDAKLFFE-NKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        T+K+G         +GD  +       + L+++  VP ++ NL+    L    Y   F+  +  ++K  + I        LY       +  LN      
Subjt:  TLKVGMEDVISARAVGDAKLFFE-NKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF
              +  IS +    LWH R+GH++   +  L K  L++  +  ++ P + CL GK  +  F     R    L+L++SD+ GPM +++ G  +YF+ F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFF

Query:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        IDD S   ++Y++  K +  + F+++   VE    +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER N T+++ VR
Subjt:  IDDYSSYGYLYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.5e-1425.34Show/hide
Query:  ISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
        I   A+G+    F+N           P I  +L+ +S L     +  F+ N      +G  +       + Y L  ++   + +H    T N  NK +  
Subjt:  ISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI

Query:  SPNNNTY-LWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSES-------CLEGKMTKRPFTGKGYRAK-----EPLELIHSDLYGPMNVKARGDFEYF
        S N   Y L H  LGH N   I + +K   +  L++  +  S +       CL GK TK     KG R K     EP + +H+D++GP++   +    YF
Subjt:  SPNNNTY-LWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSES-------CLEGKMTKRPFTGKGYRAK-----EPLELIHSDLYGPMNVKARGDFEYF

Query:  IFFIDDYSSYGYLYLMNHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR
        I F D+ + + ++Y ++ + E   L  F      ++N  + ++ +++ DRG EY +     +    GI +  +     + +GV+ER N TLL+  R
Subjt:  IFFIDDYSSYGYLYLMNHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-2026.52Show/hide
Query:  GDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIE------HMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
        G   L  +++ + L N+  VP I +NL+ V  L          +  +F + +      G+ +   K ++ LY     E     +  +   A+  +K    
Subjt:  GDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIE------HMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI

Query:  SPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGY
        S       WH RLGH     +  ++ N  L+ L           CL  K  K PF+     +  PLE I+SD++    + +  ++ Y++ F+D ++ Y +
Subjt:  SPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGY

Query:  LYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLD
        LY +  KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI    S P TP+ NG+SER++  +++
Subjt:  LYLMNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-2329.96Show/hide
Query:  GDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIE-HMYSINFSMNEAFIS--KNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPN
        G A L   ++ + L  +  VP I +NL+ V  L   +  S+ F      +     G+ +   K ++ LY      ++AV    MF  A+  +K    S  
Subjt:  GDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIE-HMYSINFSMNEAFIS--KNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPN

Query:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGYLYL
             WH RLGH +L  +  ++ N  L  L     L     C   K  K PF+     + +PLE I+SD++    + +  ++ Y++ F+D ++ Y +LY 
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGYLYL

Query:  MNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDM
        +  KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++  +++M
Subjt:  MNHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDM

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein5.1e-0837.33Show/hide
Query:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNV
        + T LWH RL H++   +  LVK G L+  +  SL   E C+ GK  +  F+   +  K PL+ +HSDL+G  +V
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCAAGGTTGGAATGGAAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTTCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCATTCATTTCTAAGAATGGTATACATATTTGTT
CGGCTAAGCTTGAAAATAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTC
ATTACCACCAAGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTATG
GTCCGATGAATGTAAAAGCTAGAGGGGATTTTGAATACTTCATCTTTTTTATAGATGATTATTCAAGTTATGGTTATTTATACTTAATGAACCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGTACCTTGTTAGACATGGTTCGTTTTGGGG
GTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACACTCAAGGTTGGAATGGAAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTTCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCATTCATTTCTAAGAATGGTATACATATTTGTT
CGGCTAAGCTTGAAAATAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTC
ATTACCACCAAGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTATG
GTCCGATGAATGTAAAAGCTAGAGGGGATTTTGAATACTTCATCTTTTTTATAGATGATTATTCAAGTTATGGTTATTTATACTTAATGAACCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGTACCTTGTTAGACATGGTTCGTTTTGGGG
GTATGTAG
Protein sequenceShow/hide protein sequence
MTLKVGMEDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISKNGIHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
SPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLEDDSLPPSESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGDFEYFIFFIDDYSSYGYLYLMNHKSEAL
EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNSTLLDMVRFGGM