; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0098111 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0098111
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:13475164..13475604
RNA-Seq ExpressionCmc04g0098111
SyntenyCmc04g0098111
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.8e-6281.38Show/hide
Query:  LSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSD
        L +R+GI+LSKEQCPKTPQ+VED+R IPY+SAVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RT++YMLVYG KDLILT YTDSD
Subjt:  LSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSD

Query:  FQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        FQ+DKDARKS SGS+FTLN G+VVWR  KQTCIADSTMEAEYVAA
Subjt:  FQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

KAA0026227.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-6282.88Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LS RYGI+LSKEQCPKTPQ+VED+  IPYASAVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLILTRYT+ 
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLNEG+VVWR  KQ+ IADSTMEAEYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

KAA0050132.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-6281.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LSYRYGI+LSKEQCPKTPQ+VED+  I YA AVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLIL  YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLN G+VVWR  KQ+CIADSTME EYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

KAA0063026.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-6281.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LSYRY I+LSKEQCPKTPQKVEDI  IPYAS V SLMYAMLCTRP  C+S+G++SRYQSNPG DHWTT+KNILKYL RTKDYMLVYG+KDLILT YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKD RKS SGS+FTLN G VVWR  KQ+CIA+STMEAEYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

TYK06386.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-6281.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +L YRYGI+LSKEQCPKTPQ+VED+  IPYA AVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLIL  YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLN G+VVWR  KQ+CIADSTME EYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

TrEMBL top hitse value%identityAlignment
A0A5A7SNE2 Gag/pol protein6.2e-6382.88Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LS RYGI+LSKEQCPKTPQ+VED+  IPYASAVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLILTRYT+ 
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLNEG+VVWR  KQ+ IADSTMEAEYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

A0A5A7U2R8 Gag/pol protein2.3e-6281.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LSYRYGI+LSKEQCPKTPQ+VED+  I YA AVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLIL  YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLN G+VVWR  KQ+CIADSTME EYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

A0A5A7V9B0 Gag/pol protein8.0e-6381.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +LSYRY I+LSKEQCPKTPQKVEDI  IPYAS V SLMYAMLCTRP  C+S+G++SRYQSNPG DHWTT+KNILKYL RTKDYMLVYG+KDLILT YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKD RKS SGS+FTLN G VVWR  KQ+CIA+STMEAEYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

A0A5D3C7T2 Gag/pol protein1.8e-6281.51Show/hide
Query:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS
        +L YRYGI+LSKEQCPKTPQ+VED+  IPYA AVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RTKDYMLVYG+KDLIL  YTDS
Subjt:  MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDS

Query:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        DFQTDKDARKS SGS+FTLN G+VVWR  KQ+CIADSTME EYVAA
Subjt:  DFQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

E2GK51 Gag/pol protein (Fragment)1.4e-6281.38Show/hide
Query:  LSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSD
        L +R+GI+LSKEQCPKTPQ+VED+R IPY+SAVGSLMYAMLCTRP  C+SVG+VSRYQSNPG DHWT +KNILKYL RT++YMLVYG KDLILT YTDSD
Subjt:  LSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSD

Query:  FQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        FQ+DKDARKS SGS+FTLN G+VVWR  KQTCIADSTMEAEYVAA
Subjt:  FQTDKDARKSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-1437.7Show/hide
Query:  PYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLI----LTRYTDSDFQTDKDARKSISGSIFTLNEGSV
        P  S +G LMY MLCTRP    +V ++SRY S    + W  LK +L+YL  T D  L++  K+L     +  Y DSD+   +  RKS +G +F + + ++
Subjt:  PYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLI----LTRYTDSDFQTDKDARKSISGSIFTLNEGSV

Query:  V-WRCEKQTCIADSTMEAEYVA
        + W  ++Q  +A S+ EAEY+A
Subjt:  V-WRCEKQTCIADSTMEAEYVA

P0CV72 Secreted RxLR effector protein 1613.3e-2142.74Show/hide
Query:  IRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVY---GTKDLILTRYTDSDFQTDKDARKSISGSIFTLNE
        ++ +PY SAVG++MY M+ TRP    +VG++S++ S+P   HW  LK +L+YL  T+ Y L +   GT  L+   Y+D+D+  D ++R+S SG +F LN 
Subjt:  IRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVY---GTKDLILTRYTDSDFQTDKDARKSISGSIFTLNE

Query:  GSVVWRCEKQTCIADSTMEAEYVA
        G V WR +KQ  +A S+ E EY+A
Subjt:  GSVVWRCEKQTCIADSTMEAEYVA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-3350.36Show/hide
Query:  LSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSDFQTDKDAR
        LSK+ CP T ++  ++ K+PY+SAVGSLMYAM+CTRP    +VG+VSR+  NPG +HW  +K IL+YL  T    L +G  D IL  YTD+D   D D R
Subjt:  LSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSDFQTDKDAR

Query:  KSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA
        KS +G +FT + G++ W+ + Q C+A ST EAEY+AA
Subjt:  KSISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-0934.78Show/hide
Query:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDY-MLVYGTKDLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC
        Y   VGSL Y +  TRP   ++V  +S++   P  +H   LK IL+YL  T ++ + +     L L  Y+D+D+  DKD   S +G I  L    + W  
Subjt:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDY-MLVYGTKDLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC

Query:  EKQTCIADSTMEAEY
        +KQ  +  S+ EAEY
Subjt:  EKQTCIADSTMEAEY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.8e-1236.52Show/hide
Query:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDY-MLVYGTKDLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC
        Y   VGSL Y +  TRP   ++V  +S+Y   P  DHW  LK +L+YL  T D+ + +     L L  Y+D+D+  D D   S +G I  L    + W  
Subjt:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDY-MLVYGTKDLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC

Query:  EKQTCIADSTMEAEY
        +KQ  +  S+ EAEY
Subjt:  EKQTCIADSTMEAEY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-1031.62Show/hide
Query:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTK-DLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC
        Y   +G LMY  + TR    F+V  +S++   P   H   +  IL Y+  T    L Y ++ ++ L  ++D+ FQ+ KD R+S +G    L    + W+ 
Subjt:  YASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTK-DLILTRYTDSDFQTDKDARKSISGSIFTLNEGSVVWRC

Query:  EKQTCIADSTMEAEYVA
        +KQ  ++ S+ EAEY A
Subjt:  EKQTCIADSTMEAEYVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCGTACAGATATGGAATTTATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAAAAGTTGAGGATATAAGAAAAATTCCCTATGCTTCTGCTGTTGGAAGTTT
AATGTACGCAATGTTATGTACTAGACCTGCATTTTGCTTTTCAGTAGGGATGGTCAGTAGGTATCAGTCCAATCCTGGATGTGATCACTGGACAACCCTTAAGAATATTC
TAAAATATCTTACAAGAACAAAAGACTACATGCTCGTATATGGTACTAAGGATCTGATCCTTACTAGATACACTGATTCTGATTTCCAAACGGATAAAGATGCTAGAAAG
TCTATATCAGGATCAATATTCACTCTAAACGAAGGATCAGTAGTATGGAGATGCGAAAAACAAACTTGTATAGCCGACTCCACAATGGAAGCTGAATATGTAGCAGCTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGCTGTCGTACAGATATGGAATTTATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAAAAGTTGAGGATATAAGAAAAATTCCCTATGCTTCTGCTGTTGGAAGTTT
AATGTACGCAATGTTATGTACTAGACCTGCATTTTGCTTTTCAGTAGGGATGGTCAGTAGGTATCAGTCCAATCCTGGATGTGATCACTGGACAACCCTTAAGAATATTC
TAAAATATCTTACAAGAACAAAAGACTACATGCTCGTATATGGTACTAAGGATCTGATCCTTACTAGATACACTGATTCTGATTTCCAAACGGATAAAGATGCTAGAAAG
TCTATATCAGGATCAATATTCACTCTAAACGAAGGATCAGTAGTATGGAGATGCGAAAAACAAACTTGTATAGCCGACTCCACAATGGAAGCTGAATATGTAGCAGCTTG
A
Protein sequenceShow/hide protein sequence
MLSYRYGIYLSKEQCPKTPQKVEDIRKIPYASAVGSLMYAMLCTRPAFCFSVGMVSRYQSNPGCDHWTTLKNILKYLTRTKDYMLVYGTKDLILTRYTDSDFQTDKDARK
SISGSIFTLNEGSVVWRCEKQTCIADSTMEAEYVAA