CuGenDBv2

Cmc06g0168031 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0168031
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr06:21562843..21563295
RNA-Seq Expression: Cmc06g0168031
Synteny: Cmc06g0168031
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
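The genome location above can be split into sequence ID, start, and end to get the length of the locus. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str):
    """Split 'seqid:start..end' into its parts and compute the span length."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("CMiso1.1chr06:21562843..21563295")
print(seqid, start, end, length)
```

Under that assumption the span is 453 bp, which matches the length of the CDS listed in this record (150 codons plus a stop codon).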


Homology
GenBank top hits | e value | % identity | Alignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | 7.9e-68 | 87.33
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T YNYEIWQMDVKTAFLNGNLEESIYM+Q EGFI   QEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK+I+NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FLILY+DDILLIGNDV +LTD+KKWL TQFQMKDLG AQY+LGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
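The % identity column can be recomputed from the middle line of an alignment, where a letter marks an identical column and `+` or a space marks a non-identical one. The sketch below assumes the usual BLAST definition (identical columns divided by alignment length) and uses the two middle lines of the ADJ18449.1 alignment above; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midlines):
    """Percent identity from BLAST-style middle lines.

    Letters mark identical columns; '+' and spaces mark the rest.
    Leading and trailing spaces are alignment columns, so nothing is stripped.
    """
    midline = "".join(midlines)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)

# Middle lines of the ADJ18449.1 hit, without the 8-character left margin.
MIDLINES = [
    "MLK IRILLSI T YNYEIWQMDVKTAFLNGNLEESIYM+Q EGFI   QEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK+I+NS",
    " V+FLILY+DDILLIGNDV +LTD+KKWL TQFQMKDLG AQY+LGIQIV",
]

print(f"{percent_identity(MIDLINES):.2f}")
```

With these inputs the function counts 131 identical columns out of 150, in line with the 87.33 reported for this hit.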

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa] | 3.2e-69 | 88.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        M+K IRILLSI T Y+YEIWQMDVKTAFLNGNLEESIYM+Q EGFIQ GQEQKVCKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYKRIINS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
        TV+FL+LY+DDILLIGN+V HLTDIK+WL TQFQMKDLG+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

KAA0061927.1 gag/pol protein [Cucumis melo var. makuwa] | 8.2e-65 | 82.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T Y+YEIWQMDVKTAFLNGNLE S+YM Q +GFI+NGQEQKVCKLQKSIY LKQASRSWNI+FDT IK+YGFEQNV+EPCVYK+++NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FL+LY+DDILLIGNDVG+LTDI+KWLATQFQMK LG+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] | 1.5e-66 | 86
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        M+K IRILLSI T Y+YEIWQMDVKT FLN NLEESIYM+Q E FIQ GQEQK+CKLQKSIYGLKQASRS NIRFDTAIKSYG EQNVDEPCVYKRI+NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
        TV+FL+LY+DDILLIGNDVGHL DIKKWLA QFQMKDLGNAQYVLG+QIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa] | 2.3e-67 | 84.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T Y+YEIWQ+DVKTAFLNGNLEESIYM Q EGFI+ GQEQK+CK QKSIYGLKQASRSWNIRFDTAIKSYGFEQNVD+PCVYK+++NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FL+LY+DDILL+GNDVG+LTDIKKWLATQFQMKD G+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TTA2 Gag/pol protein | 1.6e-69 | 88.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        M+K IRILLSI T Y+YEIWQMDVKTAFLNGNLEESIYM+Q EGFIQ GQEQKVCKLQKSIYGLKQASRSWNIRFDT IKSYGFEQNVDEPCVYKRIINS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
        TV+FL+LY+DDILLIGN+V HLTDIK+WL TQFQMKDLG+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

A0A5A7V4T1 Gag/pol protein | 4.0e-65 | 82.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T Y+YEIWQMDVKTAFLNGNLE S+YM Q +GFI+NGQEQKVCKLQKSIY LKQASRSWNI+FDT IK+YGFEQNV+EPCVYK+++NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FL+LY+DDILLIGNDVG+LTDI+KWLATQFQMK LG+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

A0A5D3BX45 Gag/pol protein | 7.2e-67 | 86
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        M+K IRILLSI T Y+YEIWQMDVKT FLN NLEESIYM+Q E FIQ GQEQK+CKLQKSIYGLKQASRS NIRFDTAIKSYG EQNVDEPCVYKRI+NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
        TV+FL+LY+DDILLIGNDVGHL DIKKWLA QFQMKDLGNAQYVLG+QIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

A0A5D3CDT9 Gag/pol protein | 1.1e-67 | 84.67
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T Y+YEIWQ+DVKTAFLNGNLEESIYM Q EGFI+ GQEQK+CK QKSIYGLKQASRSWNIRFDTAIKSYGFEQNVD+PCVYK+++NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FL+LY+DDILL+GNDVG+LTDIKKWLATQFQMKD G+AQYVLGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

E2GK51 Gag/pol protein (Fragment) | 3.8e-68 | 87.33
Query:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS
        MLK IRILLSI T YNYEIWQMDVKTAFLNGNLEESIYM+Q EGFI   QEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYK+I+NS
Subjt:  MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINS

Query:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         V+FLILY+DDILLIGNDV +LTD+KKWL TQFQMKDLG AQY+LGIQIV
Subjt:  TVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 4.3e-24 | 40.82
Query:  RILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY---KRIINSTV
        R +LS+   YN ++ QMDVKTAFLNG L+E IYM   +G   N     VCKL K+IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  IN  +
Subjt:  RILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY---KRIINSTV

Query:  SFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI
         +++LY+DD+++   D+  + + K++L  +F+M DL   ++ +GI+I
Subjt:  SFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.0e-33 | 48.98
Query:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KRIINSTVS
        IR +LS+    + E+ Q+DVKTAFL+G+LEE IYM Q EGF   G++  VCKL KS+YGLKQA R W ++FD+ +KS  + +   +PCVY KR   +   
Subjt:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KRIINSTVS

Query:  FLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
         L+LY+DD+L++G D G +  +K  L+  F MKDLG AQ +LG++IV
Subjt:  FLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV

P25600 Putative transposon Ty5-1 protein YCL074W | 1.4e-14 | 34.38
Query:  MDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSFLILYLDDILLIGNDVGH
        MDV TAFLN  ++E IY+ Q  GF+       V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +    ++ +Y+DD+L+       
Subjt:  MDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSFLILYLDDILLIGNDVGH

Query:  LTDIKKWLATQFQMKDLGNAQYVLGIQI
           +K+ L   + MKDLG     LG+ I
Subjt:  LTDIKKWLATQFQMKDLGNAQYVLGIQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 3.4e-21 | 36.11
Query:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSF
        IRI+L +    ++ I Q+DV  AFL G L + +YM Q  GFI   +   VCKL+K++YGLKQA R+W +     + + GF  +V +  ++      ++ +
Subjt:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSF

Query:  LILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQ
        +++Y+DDIL+ GND   L +    L+ +F +KD     Y LGI+
Subjt:  LILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.4e-21 | 34.72
Query:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSF
        IRI+L +    ++ I Q+DV  AFL G L + +YM Q  GF+   +   VC+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++      ++ +
Subjt:  IRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSF

Query:  LILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQ
        +++Y+DDIL+ GND   L      L+ +F +K+  +  Y LGI+
Subjt:  LILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQ

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.0e-24 | 34.87
Query:  LKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQE----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRI
        L  ++++L+I+ +YN+ + Q+D+  AFLNG+L+E IYM    G+     +      VC L+KSIYGLKQASR W ++F   +  +GF Q+  +   + +I
Subjt:  LKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQE----QKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRI

Query:  INSTVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI
          +    +++Y+DDI++  N+   + ++K  L + F+++DLG  +Y LG++I
Subjt:  INSTVSFLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI

ATMG00810.1 DNA/RNA polymerases superfamily protein | 1.3e-04 | 50
Query:  FLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI
        +L+LY+DDILL G+    L  +   L++ F MKDLG   Y LGIQI
Subjt:  FLILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQI


Sequences
CDS sequence
ATGCTAAAGTTGATTAGAATACTCTTATCCATCACCACTCTTTATAATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTTTAAATGGAAATCTTGAGGAG
AGTATCTATATGATCCAACAAGAGGGGTTCATACAAAATGGTCAAGAACAAAAAGTTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCTAGATCT
TGGAACATAAGGTTTGATACTGCAATCAAATCTTATGGTTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTATCATTCTTA
ATTCTGTATCTAGATGACATACTACTCATTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACACAATTCCAAATGAAAGATTTGGGAAAT
GCTCAATACGTTCTTGGTATCCAAATAGTTTGA
mRNA sequence
ATGCTAAAGTTGATTAGAATACTCTTATCCATCACCACTCTTTATAATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTTTAAATGGAAATCTTGAGGAG
AGTATCTATATGATCCAACAAGAGGGGTTCATACAAAATGGTCAAGAACAAAAAGTTTGTAAGCTTCAAAAATCCATATATGGATTAAAACAAGCATCTAGATCT
TGGAACATAAGGTTTGATACTGCAATCAAATCTTATGGTTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTATCATTCTTA
ATTCTGTATCTAGATGACATACTACTCATTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACACAATTCCAAATGAAAGATTTGGGAAAT
GCTCAATACGTTCTTGGTATCCAAATAGTTTGA
Protein sequence
MLKLIRILLSITTLYNYEIWQMDVKTAFLNGNLEESIYMIQQEGFIQNGQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKRIINSTVSFL
ILYLDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIV
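The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used. For brevity it translates only the first 105 nt of the CDS; `translate` and `CDS_PREFIX` are hypothetical names.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order per position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 105 nt of the CDS listed above (35 codons).
CDS_PREFIX = (
    "ATGCTAAAGTTGATTAGAATACTCTTATCCATCACCACTCTTTATAATTATGAA"
    "ATTTGGCAGATGGATGTCAAGACAGCCTTTTTAAATGGAAATCTTGAGGAG"
)

print(translate(CDS_PREFIX))
```

The output should agree with the first 35 residues of the protein sequence above; the same function can be applied to the full 453 nt CDS, whose final codon TGA is a stop.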