; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0166761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0166761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:19414191..19414754
RNA-Seq ExpressionCmc06g0166761
SyntenyCmc06g0166761
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-8889.78Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVKN L N+L+D SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK R GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-8687.63Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVK+ L N+L+D SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK R  FEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI +SDRGGEYMDL FQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-8688.17Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRIS NNNTYLWHLRLGHINL+RIGRLVKN L N+LED SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK   GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

KAA0065386.1 gag/pol protein [Cucumis melo var. makuwa]3.6e-8687.63Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNN TYLWHLRLGHINL++IGRLVKN L N+LED SLPPCES LEGKMTKRPF  KGYRAKEPL LIH +L G MNVK REGFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEK KEY+ EVENLLS+KIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSALGTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-8688.17Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRL HINL+RIGRLVKN L N+L+D SLPPCESCLEGKMTKRPFT K YRAKEPL LIH +L G MNVK R GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein2.3e-8687.63Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVK+ L N+L+D SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK R  FEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI +SDRGGEYMDL FQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

A0A5A7TZD0 Gag/pol protein4.9e-8989.78Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRLGHINL+RIGRLVKN L N+L+D SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK R GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

A0A5A7VGC7 Gag/pol protein1.7e-8687.63Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNN TYLWHLRLGHINL++IGRLVKN L N+LED SLPPCES LEGKMTKRPF  KGYRAKEPL LIH +L G MNVK REGFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHKSEALEK KEY+ EVENLLS+KIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSALGTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

A0A5A7VJG3 Gag/pol protein6.0e-8788.17Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRISPNNNTYLWHLRL HINL+RIGRLVKN L N+L+D SLPPCESCLEGKMTKRPFT K YRAKEPL LIH +L G MNVK R GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

A0A5D3BNE1 Gag/pol protein1.0e-8688.17Show/hide
Query:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY
        MF+TANTQNKRQRIS NNNTYLWHLRLGHINL+RIGRLVKN L N+LED SLPPCESCLEGKMTKRPFT KGYRAKEPL LIH +L G MNVK   GFEY
Subjt:  MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEY

Query:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        FISFIDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKIL+SDRGGEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERR
Subjt:  FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.5e-2132.58Show/hide
Query:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSL--------PPCESCLEGKMTKRPF--TRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDD
        NN  LWH R GHI+    G+L++ +  N   D SL          CE CL GK  + PF   +     K PL ++H ++ G +     +   YF+ F+D 
Subjt:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSL--------PPCESCLEGKMTKRPF--TRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDD

Query:  YSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSER
        ++ Y   YL+++KS+    F+++ A+ E   + K+  L  D G EY+    + + ++ GI + L+   TPQ NGVSER
Subjt:  YSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSER

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-2635.98Show/hide
Query:  LWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS
        LWH R+GH++   +  L K  L +  +  ++ PC+ CL GK  +  F     R    L L++ ++ G M ++   G +YF++FIDD SR  ++Y+++ K 
Subjt:  LWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSER
        +  + F+++ A VE    +K+K L+SD GGEY    F++Y   HGI+ + +  GTPQ NGV+ER
Subjt:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSER

P93293 Uncharacterized mitochondrial protein AtMg003001.6e-0433.33Show/hide
Query:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNV
        + T LWH RL H++   +  LVK    +  +  SL  CE C+ GK  +  F+   +  K PL  +H +L G  +V
Subjt:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.7e-1831.52Show/hide
Query:  WHLRLGHINLNRIGRLVKNELPNELE-DHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS
        WH RLGH   + +  ++ N   + L   H    C  CL  K  K PF++    +  PL  I+ ++     +   + + Y++ F+D ++RY +LY ++ KS
Subjt:  WHLRLGHINLNRIGRLVKNELPNELE-DHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        +  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI    S   TP+ NG+SER+
Subjt:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-2134.55Show/hide
Query:  WHLRLGHINLNRIGRLVKN-ELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS
        WH RLGH +L  +  ++ N  LP     H L  C  C   K  K PF+     + +PL  I+ ++     + + + + Y++ F+D ++RY +LY ++ KS
Subjt:  WHLRLGHINLNRIGRLVKN-ELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR
        +  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S   TP+ NG+SER+
Subjt:  EALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERR

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.1e-0533.33Show/hide
Query:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNV
        + T LWH RL H++   +  LVK    +  +  SL  CE C+ GK  +  F+   +  K PL  +H +L G  +V
Subjt:  NNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAAAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATTGGGAG
ATTGGTAAAGAATGAACTTCCAAACGAGTTAGAAGATCATTCATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATA
GAGCCAAAGAGCCTTTAGGACTTATACATCCAAACCTTTCTGGTATGATGAATGTAAAAGTTAGAGAAGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGG
TATGGTTATTTATATTTAATGGAGCATAAATCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCAATC
TGATCGAGGTGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACTTGGTACACCTCAACAAAATGGTGTATCAG
AAAGGAGAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAAAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCGGATTGGGAG
ATTGGTAAAGAATGAACTTCCAAACGAGTTAGAAGATCATTCATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATA
GAGCCAAAGAGCCTTTAGGACTTATACATCCAAACCTTTCTGGTATGATGAATGTAAAAGTTAGAGAAGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGG
TATGGTTATTTATATTTAATGGAGCATAAATCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCAATC
TGATCGAGGTGGAGAGTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACTTGGTACACCTCAACAAAATGGTGTATCAG
AAAGGAGAAAGTAA
Protein sequenceShow/hide protein sequence
MFKTANTQNKRQRISPNNNTYLWHLRLGHINLNRIGRLVKNELPNELEDHSLPPCESCLEGKMTKRPFTRKGYRAKEPLGLIHPNLSGMMNVKVREGFEYFISFIDDYSR
YGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILQSDRGGEYMDLRFQDYMIEHGIQFQLSALGTPQQNGVSERRK