; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0098121 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0098121
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:13475890..13476393
RNA-Seq ExpressionCmc04g0098121
SyntenyCmc04g0098121
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]6.5e-7986.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVDRDQWIKAM LEMESMYFNS+W LVD P+DVKPIGC WIYKRK+DQAGKVQTFKA+LV KGYTQ++GVDYEETFS V MLKSIRILLSIATFY+YE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SIYM QPEGFI + +EQKVCKLQKSI GLKQASRSWNIRFDTAIKSYGFE
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-7683.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-7681.93Show/hide
Query:  NDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEI
        NDVD DQWIKAM L+MESMY NS+W LVDQPN+++PIGC WIYKRK+DQ  KVQTF+A+LV KGYTQ++G+DYEETFS + M+KSIRILLSIATFYDYEI
Subjt:  NDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEI

Query:  WKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        W+MDVKTAFLNGNLE+SIYM QPEGFI+KG+EQKVCKLQKSI GLKQASRSWNIRFDT IKSYGFE
Subjt:  WKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-7683.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-7683.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein1.9e-7681.93Show/hide
Query:  NDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEI
        NDVD DQWIKAM L+MESMY NS+W LVDQPN+++PIGC WIYKRK+DQ  KVQTF+A+LV KGYTQ++G+DYEETFS + M+KSIRILLSIATFYDYEI
Subjt:  NDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEI

Query:  WKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        W+MDVKTAFLNGNLE+SIYM QPEGFI+KG+EQKVCKLQKSI GLKQASRSWNIRFDT IKSYGFE
Subjt:  WKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

A0A5A7TZD0 Gag/pol protein5.0e-7783.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

A0A5A7UYE8 Gag/pol protein5.0e-7783.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

A0A5D3CYF4 Gag/pol protein5.0e-7783.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVD+DQW+KAM LEMESMYFNS+W LVD P  VKPIGC WIYKRK+D AGKVQTFKA+LV KGYTQR+GVDYEETFS V MLKSIRILLSIATFYDYE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SI+M+QPEGFI +G+EQKVCKL +SI GLKQASRSWNIRFDTAIKSYGF+
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

E2GK51 Gag/pol protein (Fragment)3.1e-7986.83Show/hide
Query:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE
        MNDVDRDQWIKAM LEMESMYFNS+W LVD P+DVKPIGC WIYKRK+DQAGKVQTFKA+LV KGYTQ++GVDYEETFS V MLKSIRILLSIATFY+YE
Subjt:  MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYE

Query:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE
        IW+MDVKTAFLNGNLE+SIYM QPEGFI + +EQKVCKLQKSI GLKQASRSWNIRFDTAIKSYGFE
Subjt:  IWKMDVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.9e-3038.27Show/hide
Query:  DRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKM
        D+  W +A+  E+ +   N+ W +  +P +   +   W++  K ++ G    +KA+LV +G+TQ+  +DYEETF+ V  + S R +LS+   Y+ ++ +M
Subjt:  DRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKM

Query:  DVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF
        DVKTAFLNG L++ IYM  P+G         VCKL K+I GLKQA+R W   F+ A+K   F
Subjt:  DVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-3949.06Show/hide
Query:  DRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKM
        +++Q +KAM  EMES+  N  + LV+ P   +P+ C W++K K+D   K+  +KA+LVVKG+ Q+ G+D++E FS VV + SIR +LS+A   D E+ ++
Subjt:  DRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKM

Query:  DVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKS
        DVKTAFL+G+LE+ IYM QPEGF   G++  VCKL KS+ GLKQA R W ++FD+ +KS
Subjt:  DVKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKS

P92520 Uncharacterized mitochondrial protein AtMg008201.4e-1238.37Show/hide
Query:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIA
        W +AM  E++++  N  WILV  P +   +GC W++K K    G +   KA+LV KG+ Q +G+ + ET+S VV   +IR +L++A
Subjt:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.6e-3444.1Show/hide
Query:  DQWIKAMGLEMESMYFNSIWILV-DQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMD
        ++W  AMG E+ +   N  W LV   P+ V  +GC WI+ +K +  G +  +KA+LV KGY QR G+DY ETFS V+   SIRI+L +A    + I ++D
Subjt:  DQWIKAMGLEMESMYFNSIWILV-DQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMD

Query:  VKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF
        V  AFL G L   +YM+QP GFI K R   VCKL+K++ GLKQA R+W +     + + GF
Subjt:  VKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-3444.72Show/hide
Query:  DQWIKAMGLEMESMYFNSIWILV-DQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMD
        D+W +AMG E+ +   N  W LV   P  V  +GC WI+ +K +  G +  +KA+LV KGY QR G+DY ETFS V+   SIRI+L +A    + I ++D
Subjt:  DQWIKAMGLEMESMYFNSIWILV-DQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMD

Query:  VKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF
        V  AFL G L   +YM+QP GF+ K R   VC+L+K+I GLKQA R+W +   T + + GF
Subjt:  VKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.8e-3541.36Show/hide
Query:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMDVKT
        W  AM  E+ +M     W +   P + KPIGC W+YK K +  G ++ +KA+LV KGYTQ++G+D+ ETFS V  L S++++L+I+  Y++ + ++D+  
Subjt:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMDVKT

Query:  AFLNGNLEKSIYMTQPEGFIKKGRE----QKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF
        AFLNG+L++ IYM  P G+  +  +      VC L+KSI GLKQASR W ++F   +  +GF
Subjt:  AFLNGNLEKSIYMTQPEGFIKKGRE----QKVCKLQKSICGLKQASRSWNIRFDTAIKSYGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.0e-1338.37Show/hide
Query:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIA
        W +AM  E++++  N  WILV  P +   +GC W++K K    G +   KA+LV KG+ Q +G+ + ET+S VV   +IR +L++A
Subjt:  WIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATGTAGATCGTGACCAATGGATCAAAGCCATGGGCCTCGAAATGGAGTCTATGTATTTCAATTCTATCTGGATTTTAGTAGATCAACCAAATGACGTA
AAACCTATTGGTTGTACATGGATCTACAAAAGAAAACAAGACCAAGCCGGTAAAGTACAGACTTTTAAGGCTCAACTTGTGGTAAAAGGTTATACCCAAAGAGAT
GGAGTGGATTATGAAGAAACATTCTCTCATGTTGTCATGCTTAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGAAGATGGAT
GTCAAGACAGCTTTTTTGAATGGAAATCTTGAGAAGAGTATCTATATGACTCAACCAGAGGGGTTTATAAAAAAGGGTCGAGAACAAAAGGTTTGTAAGCTTCAG
AAATCCATTTGTGGTTTGAAGCAAGCATCTAGATCCTGGAATATAAGATTTGATACTGCGATAAAATCTTATGGTTTTGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGATGTAGATCGTGACCAATGGATCAAAGCCATGGGCCTCGAAATGGAGTCTATGTATTTCAATTCTATCTGGATTTTAGTAGATCAACCAAATGACGTA
AAACCTATTGGTTGTACATGGATCTACAAAAGAAAACAAGACCAAGCCGGTAAAGTACAGACTTTTAAGGCTCAACTTGTGGTAAAAGGTTATACCCAAAGAGAT
GGAGTGGATTATGAAGAAACATTCTCTCATGTTGTCATGCTTAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGAAGATGGAT
GTCAAGACAGCTTTTTTGAATGGAAATCTTGAGAAGAGTATCTATATGACTCAACCAGAGGGGTTTATAAAAAAGGGTCGAGAACAAAAGGTTTGTAAGCTTCAG
AAATCCATTTGTGGTTTGAAGCAAGCATCTAGATCCTGGAATATAAGATTTGATACTGCGATAAAATCTTATGGTTTTGAATAA
Protein sequenceShow/hide protein sequence
MNDVDRDQWIKAMGLEMESMYFNSIWILVDQPNDVKPIGCTWIYKRKQDQAGKVQTFKAQLVVKGYTQRDGVDYEETFSHVVMLKSIRILLSIATFYDYEIWKMD
VKTAFLNGNLEKSIYMTQPEGFIKKGREQKVCKLQKSICGLKQASRSWNIRFDTAIKSYGFE