CuGenDBv2

Cmc04g0108601 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0108601
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr04:28125744..28126469
RNA-Seq Expression: Cmc04g0108601
Synteny: Cmc04g0108601
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
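
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the span can be parsed and measured as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re

# Pattern for "seqid:start..end"; the seqid is everything before the colon.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr04:28125744..28126469")
print(seqid, start, end, span_length(start, end))
# CMiso1.1chr04 28125744 28126469 726
```

Under that assumption the locus spans 726 bp.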


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.8e-123 | % identity: 94.61
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTLKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRLGHINLDRIGRLVKN LLNKL+D SLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.8e-122 | % identity: 94.17
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTLKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRLGHINLDRIGRLVK+ LLNKL+D SLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-92 | % identity: 73.22
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MT++VGTG VISA A+G  +L     F+ LEN+Y+VP +KRNLISV CL+E  YS+ F++N+ FI KNGV ICSAKLENNLYVLRS  +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        A TQNKR +ISP  N +LW LRLGHINL+RI RLVKN LL++LE++SLP CES LEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKARG FEYFI+F
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIK
         DDYSRYGY+YLM+HKSEALEKFKEYKAEVEN LSK IK
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIK

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.4e-119 | % identity: 91.7
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        M LKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE+NLYVL+ NE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRIS NNN YLW LRLGHINLDRIGRLVKN LLNKLEDDSLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKA G FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.0e-114 | % identity: 90.04
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTL VGTGDVISARA+GD KLFFG KFMFLENLYIVPKIKRNL+ VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRL HINLDRIGRLVKN LLNKL+DDSLPPCES LEGKMTKRPFTGK YRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein | e value: 8.7e-123 | % identity: 94.17
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTLKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRLGHINLDRIGRLVK+ LLNKL+D SLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI

A0A5A7TYF5 Gag/pol protein | e value: 7.2e-93 | % identity: 73.22
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MT++VGTG VISA A+G  +L     F+ LEN+Y+VP +KRNLISV CL+E  YS+ F++N+ FI KNGV ICSAKLENNLYVLRS  +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        A TQNKR +ISP  N +LW LRLGHINL+RI RLVKN LL++LE++SLP CES LEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKARG FEYFI+F
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIK
         DDYSRYGY+YLM+HKSEALEKFKEYKAEVEN LSK IK
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIK

A0A5A7TZD0 Gag/pol protein | e value: 1.3e-123 | % identity: 94.61
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTLKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSINFSMNEAFI KNGVHICSAKLENNLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRLGHINLDRIGRLVKN LLNKL+D SLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

A0A5A7VJG3 Gag/pol protein | e value: 2.0e-114 | % identity: 90.04
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        MTL VGTGDVISARA+GD KLFFG KFMFLENLYIVPKIKRNL+ VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLYVLR NEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRISPNNN YLW LRL HINLDRIGRLVKN LLNKL+DDSLPPCES LEGKMTKRPFTGK YRAKEPLELIHSDLCG MNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

A0A5D3BNE1 Gag/pol protein | e value: 1.2e-119 | % identity: 91.7
Query:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        M LKVGTGDVISARA+GDAKLFFGNKFMFLENLYIVPKIKRNL+SVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE+NLYVL+ NE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        ANTQNKRQRIS NNN YLW LRLGHINLDRIGRLVKN LLNKLEDDSLPPCES LEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKA G FEYFISF
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKIL
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

SwissProt top hits (columns: hit, e value, % identity, alignment)
P04146 Copia protein | e value: 1.9e-13 | % identity: 29.86
Query:  LENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLENNLYVLRSNEAKAVLNHEMFRTANTQNKRQRISPNNNIYLWDLRLGHIN-
        LE++    +   NL+SV  L E   SI F  +   ISKNG+ +  ++ + NN+          V+N + + + N ++K       NN  LW  R GHI+ 
Subjt:  LENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLENNLYVLRSNEAKAVLNHEMFRTANTQNKRQRISPNNNIYLWDLRLGHIN-

Query:  -----LDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGLMNVKARGCFEYFISFIDDYSRYGYLYLMEHKSEAL
             + R        LLN LE  S   CE  L GK  + PF     +   K PL ++HSD+CG +         YF+ F+D ++ Y   YL+++KS+  
Subjt:  -----LDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGLMNVKARGCFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKAEVENLLSKKIKIL
          F+++ A+ E   + K+  L
Subjt:  EKFKEYKAEVENLLSKKIKIL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.9e-16 | % identity: 25.31
Query:  TLKVGTGDVISARALGDAKLFFG-NKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT
        T+K+G         +GD  +       + L+++  VP ++ NLIS   L    Y   F+  +  ++K  + I        LY   +   +  LN      
Subjt:  TLKVGTGDVISARALGDAKLFFG-NKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
                      ++ LW  R+GH++   +  L K  L++  +  ++ PC+  L GK  +  F     R    L+L++SD+CG M +++ G  +YF++F
Subjt:  ANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        IDD SR  ++Y+++ K +  + F+++ A VE    +K+K L
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e value: 9.3e-05 | % identity: 25.4
Query:  ISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRTANTQNKRQRI
        I   A+G+    F N           P I  +L+S+S L     +  F+ N      +G  +       + Y L  ++   + +H    T N  NK + +
Subjt:  ISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRTANTQNKRQRI

Query:  SPNNNIYLWDL---RLGHINLDRIGR-LVKNVLLNKLEDD------SLPPCESYLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGLMNVKARGCFE
            N Y + L    LGH N   I + L KN +    E D      S   C   L GK TK     KG R K     EP + +H+D+ G ++   +    
Subjt:  SPNNNIYLWDL---RLGHINLDRIGR-LVKNVLLNKLEDD------SLPPCESYLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGLMNVKARGCFE

Query:  YFISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKIL
        YFISF D+ +R+ ++Y +  + E   L  F    A ++N  + ++ ++
Subjt:  YFISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.5e-10 | % identity: 25.31
Query:  VGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMF
        V  G  I     G   L   ++ + L N+  VP I +NLISV  L          +  +F + +      GV +   K ++ LY     E     +  + 
Subjt:  VGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMF

Query:  RTANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLE-DDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYF
          A+  +K    S       W  RLGH     +  ++ N  L+ L        C   L  K  K PF+     +  PLE I+SD+     + +   + Y+
Subjt:  RTANTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLE-DDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKI
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.7e-11 | % identity: 27.39
Query:  VGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRTA
        +  G  I     G A L   ++ + L  +  VP I +NLISV  L   +  S+ F      +     GV +   K ++ LY      ++AV    MF  A
Subjt:  VGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRTA

Query:  NTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLE-DDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF
        +  +K    S       W  RLGH +L  +  ++ N  L  L     L  C      K  K PF+     + +PLE I+SD+     + +   + Y++ F
Subjt:  NTQNKRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLE-DDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
        +D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 2.5e-05 | % identity: 35.21
Query:  LWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNV
        LW  RL H++   +  LVK   L+  +  SL  CE  + GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  LWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNV
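
In the alignments above, the unlabeled middle line shows a letter where the two sequences are identical and `+` where the substitution is conservative. As a rough sketch, assuming % identity is the number of identical columns divided by the aligned length (the usual BLAST convention, not confirmed by this page), the figure for the ATMG00300.1 hit can be recomputed from the middle line alone. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline, aligned_length):
    """Percent identity from a BLAST-style middle line.

    Letters in the middle line mark identical columns; '+' and spaces
    mark conservative substitutions and mismatches respectively.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Query and middle line copied from the ATMG00300.1 alignment above.
QUERY = "LWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNV"
MIDLINE = "LW  RL H++   +  LVK   L+  +  SL  CE  + GK  +  F+   +  K PL+ +HSDL G  +V"

# The middle line must cover every aligned column.
assert len(QUERY) == len(MIDLINE)

print(f"{percent_identity(MIDLINE, len(QUERY)):.2f}")  # 35.21
```

Here 25 identical columns out of 71 give 35.21, which agrees with the value reported for this hit.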


Sequences
CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCATTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATA
GTTCCTAAAATTAAAAGGAACTTAATTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTA
CATATTTGTTCGGCTAAGCTTGAAAACAACTTGTATGTATTAAGATCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAAT
AAAAGGCAAAGAATTTCTCCAAATAACAATATCTATCTTTGGGATTTAAGATTAGGTCACATAAATCTCGATCGGATTGGGAGATTGGTAAAGAATGTACTTCTA
AACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTATCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTA
GAACTTATACATTCAGACCTCTGTGGTCTGATGAATGTAAAAGCTAGAGGGTGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTA
TACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTTGA
mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCATTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATA
GTTCCTAAAATTAAAAGGAACTTAATTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTA
CATATTTGTTCGGCTAAGCTTGAAAACAACTTGTATGTATTAAGATCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAAT
AAAAGGCAAAGAATTTCTCCAAATAACAATATCTATCTTTGGGATTTAAGATTAGGTCACATAAATCTCGATCGGATTGGGAGATTGGTAAAGAATGTACTTCTA
AACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTATCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTA
GAACTTATACATTCAGACCTCTGTGGTCTGATGAATGTAAAAGCTAGAGGGTGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTA
TACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTTGA
Protein sequence
MTLKVGTGDVISARALGDAKLFFGNKFMFLENLYIVPKIKRNLISVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRSNEAKAVLNHEMFRTANTQN
KRQRISPNNNIYLWDLRLGHINLDRIGRLVKNVLLNKLEDDSLPPCESYLEGKMTKRPFTGKGYRAKEPLELIHSDLCGLMNVKARGCFEYFISFIDDYSRYGYL
YLMEHKSEALEKFKEYKAEVENLLSKKIKIL
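
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates this on the first 30 and last 12 nucleotides of the CDS, assuming the standard nuclear genetic code (the record does not state which translation table was used). `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS above: should give the start of the protein.
print(translate("ATGACACTCAAGGTTGGAACGGGAGATGTC"))  # MTLKVGTGDV
# Last 12 nt of the CDS above, including the TGA stop codon.
print(translate("AAAATACTTTGA"))  # KIL
```

Passing the full CDS, with its line breaks, to the same function should yield the complete protein, since whitespace is removed before translation.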