CuGenDBv2

Cmc01g0013231 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0013231
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr01:9827958..9828674
RNA-Seq Expression: Cmc01g0013231
Synteny: Cmc01g0013231
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
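The genome location listed above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), the span can be parsed and its length computed as shown here. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return match.group("seqid"), start, end, end - start + 1


print(parse_location("CMiso1.1chr01:9827958..9828674"))
# ('CMiso1.1chr01', 9827958, 9828674, 717)
```

Under that assumption the gene spans 717 bases on CMiso1.1chr01.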


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 1.5e-105; % identity: 80.83)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQN---K
        +VSA+AVGDL L+F +RY+ILK++LYVP MKRNLISI+CILEH+Y ISF++NE FI  KGIQICSAI ENNLY+LRPTRAN VLNTEMFRT ETQN   K
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQN---K

Query:  VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVY
        VSSNAYL HLRL HINLNRI RLVKSG+ ++LEDN LPPCESCL+GKMTKRSFTGK LRAK+PLELVHSDLCGPMNVKA+GGYEYFISF+DD+SRYGHVY
Subjt:  VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVY

Query:  LIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLD
        L+HHKS  FEKFK YKAEVENE+GKTIK LRSDRGGEY+D
Subjt:  LIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLD

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.9e-90; % identity: 68.31)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY I+F +NEAFI+  G+ ICSA  ENNLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KL+D  LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+GG+EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HKS   EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.7e-88; % identity: 67.49)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY I+F +NEAFI+  G+ ICSA  ENNLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK GL +KL+D  LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+G +EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HKS   EKFK YK EVEN L K IKI RSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.4e-88; % identity: 67.49)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY ISF +NEAFI   G+ ICS   E+NLY L+P     VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KLED+ LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA GG+EYFISF+DDYS YG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YLI HKS   EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.6e-83; % identity: 65.02)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD+KL+FG +++ L+N+  VP++KRNL+ +SC++EHMY I+F +NEAFI   G ++     E+NLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KL+D+ LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+GG+EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HK    EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e value: 8.2e-89; % identity: 67.49)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY I+F +NEAFI+  G+ ICSA  ENNLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK GL +KL+D  LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+G +EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HKS   EKFK YK EVEN L K IKI RSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

A0A5A7TZD0 Gag/pol protein (e value: 4.3e-90; % identity: 68.31)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY I+F +NEAFI+  G+ ICSA  ENNLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KL+D  LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+GG+EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HKS   EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

A0A5A7VJG3 Gag/pol protein (e value: 1.8e-83; % identity: 65.02)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD+KL+FG +++ L+N+  VP++KRNL+ +SC++EHMY I+F +NEAFI   G ++     E+NLY LRP  A  VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KL+D+ LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA+GG+EYFISF+DDYSRYG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YL+ HK    EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

A0A5D3BNE1 Gag/pol protein (e value: 4.0e-88; % identity: 67.49)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---
        ++SA+AVGD KL+FGN+++ L+N+  VP++KRNL+S+SC++EHMY ISF +NEAFI   G+ ICS   E+NLY L+P     VLN EMFRTA TQNK   
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNK---

Query:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH
           ++N YL HLRL HINL+RIGRLVK+GL +KLED+ LPPCESCL+GKMTKR FTGK  RAK PLEL+HSDLCGPMNVKA GG+EYFISF+DDYS YG+
Subjt:  --VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGH

Query:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
        +YLI HKS   EKFK YK EVEN L K IKILRSDRGGEY+DL
Subjt:  VYLIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

E2GK51 Gag/pol protein (Fragment) (e value: 7.3e-106; % identity: 80.83)
Query:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQN---K
        +VSA+AVGDL L+F +RY+ILK++LYVP MKRNLISI+CILEH+Y ISF++NE FI  KGIQICSAI ENNLY+LRPTRAN VLNTEMFRT ETQN   K
Subjt:  MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQN---K

Query:  VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVY
        VSSNAYL HLRL HINLNRI RLVKSG+ ++LEDN LPPCESCL+GKMTKRSFTGK LRAK+PLELVHSDLCGPMNVKA+GGYEYFISF+DD+SRYGHVY
Subjt:  VSSNAYLGHLRLCHINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVY

Query:  LIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLD
        L+HHKS  FEKFK YKAEVENE+GKTIK LRSDRGGEY+D
Subjt:  LIHHKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLD

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.2e-12; % identity: 28.7)
Query:  IILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSNAYLGHLRLCHINLNRIG
        I L+++L+  +   NL+S+  + E    I F  +   I   G+ +   +  + +    P     V+N   F+      K  +N  L H R  HI+    G
Subjt:  IILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSNAYLGHLRLCHINLNRIG

Query:  RLVKSGLPSKLEDNPL--------PPCESCLKGKMTKRSFTGKCLRAKI----PLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIHHKSGYF
        +L++    +   D  L          CE CL GK  +  F  K L+ K     PL +VHSD+CGP+         YF+ FVD ++ Y   YLI +KS  F
Subjt:  RLVKSGLPSKLEDNPL--------PPCESCLKGKMTKRSFTGKCLRAKI----PLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIHHKSGYF

Query:  EKFKGYKAEVENELGKTIKILRSDRGGEYL
          F+ + A+ E      +  L  D G EYL
Subjt:  EKFKGYKAEVENELGKTIKILRSDRGGEYL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.1e-25; % identity: 33.94)
Query:  IILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKG-IQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSNAYLGHLRLCHINLNRI
        ++LK++ +VP ++ NLIS   +    Y+ S+  N+ +   KG + I   +    LY          LN         Q+++S +  L H R+ H++   +
Subjt:  IILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKG-IQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSNAYLGHLRLCHINLNRI

Query:  GRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIHHKSGYFEKFKGYKAEVE
          L K  L S  +   + PC+ CL GK  + SF     R    L+LV+SD+CGPM +++ GG +YF++F+DD SR   VY++  K   F+ F+ + A VE
Subjt:  GRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIHHKSGYFEKFKGYKAEVE

Query:  NELGKTIKILRSDRGGEY
         E G+ +K LRSD GGEY
Subjt:  NELGKTIKILRSDRGGEY

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein (e value: 1.2e-09; % identity: 24.6)
Query:  VSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFR-TAETQNKVSS
        +   A+G+L   F N        L+ P +  +L+S+S +        F  N       G  +   +   + Y L      +++ + + + T    NK  S
Subjt:  VSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFR-TAETQNKVSS

Query:  -NAY---LGHLRLCHINLNRIGRLVKSGLPSKLEDNPLP-------PCESCLKGKMTK-RSFTGKCLR---AKIPLELVHSDLCGPMNVKAQGGYEYFIS
         N Y   L H  L H N   I + +K    + L+++ +         C  CL GK TK R   G  L+   +  P + +H+D+ GP++   +    YFIS
Subjt:  -NAY---LGHLRLCHINLNRIGRLVKSGLPSKLEDNPLP-------PCESCLKGKMTK-RSFTGKCLR---AKIPLELVHSDLCGPMNVKAQGGYEYFIS

Query:  FVDDYSRYGHVYLIH--HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEY
        F D+ +R+  VY +H   +      F    A ++N+    + +++ DRG EY
Subjt:  FVDDYSRYGHVYLIH--HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.5e-12; % identity: 26.47)
Query:  GDLKLYFGNRYIILKNILYVPQMKRNLISI------SCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSN
        G   L   +R + L NILYVP + +NLIS+      + +    +  SF++ +      G+ +     ++ LYE     +  V    +F  A   +K + +
Subjt:  GDLKLYFGNRYIILKNILYVPQMKRNLISI------SCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSN

Query:  AYLGHLRLCHINLNRIGRLVKSGLPSKLE-DNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIH
        ++  H RL H   + +  ++ +   S L   +    C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ FVD ++RY  +Y + 
Subjt:  AYLGHLRLCHINLNRIGRLVKSGLPSKLE-DNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIH

Query:  HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
         KS   E F  +K  +EN     I    SD GGE++ L
Subjt:  HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.4e-13; % identity: 26.89)
Query:  GDLKLYFGNRYIILKNILYVPQMKRNLISI------SCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSN
        G   L   +R + L  +LYVP + +NLIS+      + +    +  SF++ +      G+ +     ++ LYE     +  V    MF  A   +K + +
Subjt:  GDLKLYFGNRYIILKNILYVPQMKRNLISI------SCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSN

Query:  AYLGHLRLCHINLNRIGRLVKS-GLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIH
        ++  H RL H +L  +  ++ +  LP     + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ FVD ++RY  +Y + 
Subjt:  AYLGHLRLCHINLNRIGRLVKS-GLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIH

Query:  HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL
         KS   + F  +K+ VEN     I  L SD GGE++ L
Subjt:  HKSGYFEKFKGYKAEVENELGKTIKILRSDRGGEYLDL

Arabidopsis top hits (e value, % identity, alignment)
No hits found
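
In the alignment blocks of the homology section, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. As a rough sketch, a percent identity can be estimated from that line alone. The helper `percent_identity` is hypothetical, and it assumes identity is counted over every alignment column (gap columns included); the values reported on this page may have been computed over the full alignment rather than only the portion displayed.

```python
def percent_identity(query_line: str, match_line: str) -> float:
    """Estimate % identity from a BLAST-style match (middle) line.

    Assumptions: letters in match_line mark identical columns, and the
    alignment length equals the length of the aligned query line (gap
    characters included). The match line may have lost trailing blanks
    during copying, so the query line supplies the denominator.
    """
    if not query_line:
        raise ValueError("empty query line")
    identical = sum(1 for ch in match_line if ch.isalpha())
    return 100.0 * identical / len(query_line)


# First ten columns of the alignment against ADJ18449.1
print(percent_identity("MVSAKAVGDL", "+VSA+AVGDL"))  # 80.0
```

To apply this to a whole hit, the query and match lines of each block would be concatenated before calling the function.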

Sequences
CDS sequence
ATGGTCTCAGCTAAAGCAGTGGGAGATTTAAAGTTGTATTTTGGAAATAGATATATCATACTTAAGAATATCTTGTATGTACCACAAATGAAAAGAAATTTAATATCTAT
TTCTTGTATTTTGGAACACATGTATAAGATATCTTTTAAAATTAATGAAGCGTTCATTTTCTATAAAGGTATCCAAATCTGTTCTGCTATACATGAAAACAACTTATATG
AGTTAAGACCAACACGAGCAAATTTTGTCTTAAATACTGAAATGTTTAGAACAGCTGAAACTCAGAATAAAGTTTCTTCTAATGCCTATTTAGGCCACTTGAGACTTTGT
CACATAAATCTCAATAGGATTGGGAGATTAGTTAAAAGTGGACTTCCAAGTAAGTTAGAAGATAACCCTTTACCTCCTTGTGAATCTTGTCTTAAAGGAAAAATGACTAA
GAGATCTTTTACTGGAAAATGTCTCAGAGCCAAAATTCCTTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAACGTCAAAGCTCAAGGAGGGTACGAATATTTCA
TTAGTTTTGTTGATGATTATTCGAGGTACGGTCATGTTTATCTAATTCATCACAAGTCTGGTTATTTTGAAAAATTCAAAGGATATAAGGCTGAAGTTGAGAATGAATTA
GGTAAGACAATAAAAATACTTCGATCAGATCGAGGTGGAGAATATTTGGACTTATGA
mRNA sequence
ATGGTCTCAGCTAAAGCAGTGGGAGATTTAAAGTTGTATTTTGGAAATAGATATATCATACTTAAGAATATCTTGTATGTACCACAAATGAAAAGAAATTTAATATCTAT
TTCTTGTATTTTGGAACACATGTATAAGATATCTTTTAAAATTAATGAAGCGTTCATTTTCTATAAAGGTATCCAAATCTGTTCTGCTATACATGAAAACAACTTATATG
AGTTAAGACCAACACGAGCAAATTTTGTCTTAAATACTGAAATGTTTAGAACAGCTGAAACTCAGAATAAAGTTTCTTCTAATGCCTATTTAGGCCACTTGAGACTTTGT
CACATAAATCTCAATAGGATTGGGAGATTAGTTAAAAGTGGACTTCCAAGTAAGTTAGAAGATAACCCTTTACCTCCTTGTGAATCTTGTCTTAAAGGAAAAATGACTAA
GAGATCTTTTACTGGAAAATGTCTCAGAGCCAAAATTCCTTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAACGTCAAAGCTCAAGGAGGGTACGAATATTTCA
TTAGTTTTGTTGATGATTATTCGAGGTACGGTCATGTTTATCTAATTCATCACAAGTCTGGTTATTTTGAAAAATTCAAAGGATATAAGGCTGAAGTTGAGAATGAATTA
GGTAAGACAATAAAAATACTTCGATCAGATCGAGGTGGAGAATATTTGGACTTATGA
Protein sequence
MVSAKAVGDLKLYFGNRYIILKNILYVPQMKRNLISISCILEHMYKISFKINEAFIFYKGIQICSAIHENNLYELRPTRANFVLNTEMFRTAETQNKVSSNAYLGHLRLC
HINLNRIGRLVKSGLPSKLEDNPLPPCESCLKGKMTKRSFTGKCLRAKIPLELVHSDLCGPMNVKAQGGYEYFISFVDDYSRYGHVYLIHHKSGYFEKFKGYKAEVENEL
GKTIKILRSDRGGEYLDL
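
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical. For brevity it translates only the first 30 and last 21 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGTCTCAGCTAAAGCAGTGGGAGATTTA"))  # MVSAKAVGDL
print(translate("GGAGAATATTTGGACTTATGA"))           # GEYLDL*
```

Both fragments agree with the start (MVSAKAVGDL) and end (GEYLDL) of the protein sequence above, with the final TGA codon read as the stop.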