; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0104581 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0104581
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:22458904..22459362
RNA-Seq ExpressionCmc04g0104581
SyntenyCmc04g0104581
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.2e-6686.84Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQWIKAM+ EMESMY NSVWTLVD P+DVKPI CKWIYKRKRDQAGKVQTFKARLVAK YTQKEG+DYEETFS VAM+KSIRILLSI TFY+YE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IWQM+VKT FLNGNLEESIYMVQPE FI + QEQKVCKLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

KAA0043610.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-6383.55Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVDCDQW+KAMD ++ESMYSNSVWTLVDQ NDVK I CKWIYKRKRDQAGKVQTFKARLV K Y +KEG++Y+ETFS V M+KSIRILLSI TFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        I QM+VKT FLNGNLEESIYMVQPE FIQK QEQKV KLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

KAA0045201.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-6378.95Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQW KAM+ EM+SMY NS+WTL+DQPND++PI CKWIYKRK+DQ GKVQTFKA+LVAK YTQ+EG+DY+ETFS VAM+KSIRILLSIVTFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IW M+VKT F+NGNLEESIYM QP+ FI+++QEQKVCKLQKSIYGLKQA RS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]3.6e-6888Show/hide
Query:  DVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIW
        DVD DQWIKAMD +MESMYSNSVWTLVDQPN+++PI CKWIYKRKRDQ  KVQTF+ARLVAK YTQKEGIDYEETFS +AMIKSIRILLSI TFYDYEIW
Subjt:  DVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIW

Query:  QMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        QM+VKT FLNGNLEESIYMVQPE FIQK QEQKVCKLQKSIYGLKQASRS
Subjt:  QMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-6990.79Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQWIKAMD EMESMYSNSVWTLVDQPNDVKPI CKWIYKRKRDQAGKVQTFKARLVAK YTQKEGIDYEE FS  AMIKSIRILLSI TFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTTFLN NLEESIYMVQPE FIQK QEQK+CKLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

TrEMBL top hitse value%identityAlignment
A0A5A7TK28 Gag/pol protein1.7e-6383.55Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVDCDQW+KAMD ++ESMYSNSVWTLVDQ NDVK I CKWIYKRKRDQAGKVQTFKARLV K Y +KEG++Y+ETFS V M+KSIRILLSI TFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        I QM+VKT FLNGNLEESIYMVQPE FIQK QEQKV KLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

A0A5A7TTA2 Gag/pol protein1.7e-6888Show/hide
Query:  DVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIW
        DVD DQWIKAMD +MESMYSNSVWTLVDQPN+++PI CKWIYKRKRDQ  KVQTF+ARLVAK YTQKEGIDYEETFS +AMIKSIRILLSI TFYDYEIW
Subjt:  DVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIW

Query:  QMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        QM+VKT FLNGNLEESIYMVQPE FIQK QEQKVCKLQKSIYGLKQASRS
Subjt:  QMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

A0A5D3BX45 Gag/pol protein2.7e-6990.79Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQWIKAMD EMESMYSNSVWTLVDQPNDVKPI CKWIYKRKRDQAGKVQTFKARLVAK YTQKEGIDYEE FS  AMIKSIRILLSI TFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTTFLN NLEESIYMVQPE FIQK QEQK+CKLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

A0A5D3DVP2 Gag/pol protein1.7e-6378.95Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQW KAM+ EM+SMY NS+WTL+DQPND++PI CKWIYKRK+DQ GKVQTFKA+LVAK YTQ+EG+DY+ETFS VAM+KSIRILLSIVTFYDYE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IW M+VKT F+NGNLEESIYM QP+ FI+++QEQKVCKLQKSIYGLKQA RS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

E2GK51 Gag/pol protein (Fragment)5.6e-6786.84Show/hide
Query:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE
        M DVD DQWIKAM+ EMESMY NSVWTLVD P+DVKPI CKWIYKRKRDQAGKVQTFKARLVAK YTQKEG+DYEETFS VAM+KSIRILLSI TFY+YE
Subjt:  MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYE

Query:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        IWQM+VKT FLNGNLEESIYMVQPE FI + QEQKVCKLQKSIYGLKQASRS
Subjt:  IWQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-2844Show/hide
Query:  YDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEI
        Y  D   W +A++ E+ +   N+ WT+  +P +   +D +W++  K ++ G    +KARLVA+ +TQK  IDYEETF+ VA I S R +LS+V  Y+ ++
Subjt:  YDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEI

Query:  WQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASR
         QM+VKT FLNG L+E IYM  P+          VCKL K+IYGLKQA+R
Subjt:  WQMNVKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-3349.66Show/hide
Query:  DQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMNV
        +Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV K + QK+GID++E FS V  + SIR +LS+    D E+ Q++V
Subjt:  DQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMNV

Query:  KTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASR
        KT FL+G+LEE IYM QPE F    ++  VCKL KS+YGLKQA R
Subjt:  KTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASR

P92520 Uncharacterized mitochondrial protein AtMg008202.5e-1138.82Show/hide
Query:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSI
        W +AM  E++++  N  W LV  P +   + CKW++K K    G +   KARLVAK + Q+EGI + ET+S V    +IR +L++
Subjt:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-2742.86Show/hide
Query:  DQWIKAMDFEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMN
        ++W  AM  E+ +   N  W LV   P+ V  + C+WI+ +K +  G +  +KARLVAK Y Q+ G+DY ETFS V    SIRI+L +     + I Q++
Subjt:  DQWIKAMDFEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMN

Query:  VKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        V   FL G L + +YM QP  FI K +   VCKL+K++YGLKQA R+
Subjt:  VKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-2742.86Show/hide
Query:  DQWIKAMDFEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMN
        D+W +AM  E+ +   N  W LV   P  V  + C+WI+ +K +  G +  +KARLVAK Y Q+ G+DY ETFS V    SIRI+L +     + I Q++
Subjt:  DQWIKAMDFEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMN

Query:  VKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS
        V   FL G L + +YM QP  F+ K +   VC+L+K+IYGLKQA R+
Subjt:  VKTTFLNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-3144.9Show/hide
Query:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMNVKT
        W  AMD E+ +M +   W +   P + KPI CKW+YK K +  G ++ +KARLVAK YTQ+EGID+ ETFS V  + S++++L+I   Y++ + Q+++  
Subjt:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMNVKT

Query:  TFLNGNLEESIYMVQPEWFIQKHQE----QKVCKLQKSIYGLKQASR
         FLNG+L+E IYM  P  +  +  +      VC L+KSIYGLKQASR
Subjt:  TFLNGNLEESIYMVQPEWFIQKHQE----QKVCKLQKSIYGLKQASR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-1238.82Show/hide
Query:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSI
        W +AM  E++++  N  W LV  P +   + CKW++K K    G +   KARLVAK + Q+EGI + ET+S V    +IR +L++
Subjt:  WIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGATGTGGACTGTGACCAATGGATCAAAGCCATGGACTTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGATTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGCAAAATGTTACACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCTTGTTGCCATGATAAAATCAATTAGAATACTCTTATCCATCGTCACTTTTTATGATTATGAAATTTGGCAGATGAATGTCAAGACAACCTTT
TTGAATGGAAATCTTGAGGAGAGTATCTATATGGTCCAACCAGAGTGGTTTATACAAAAGCATCAAGAACAAAAAGTTTGTAAGCTTCAAAAATCCATATATGGATTAAA
GCAAGCATCTAGATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATGATGTGGACTGTGACCAATGGATCAAAGCCATGGACTTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGATTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGCAAAATGTTACACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCTTGTTGCCATGATAAAATCAATTAGAATACTCTTATCCATCGTCACTTTTTATGATTATGAAATTTGGCAGATGAATGTCAAGACAACCTTT
TTGAATGGAAATCTTGAGGAGAGTATCTATATGGTCCAACCAGAGTGGTTTATACAAAAGCATCAAGAACAAAAAGTTTGTAAGCTTCAAAAATCCATATATGGATTAAA
GCAAGCATCTAGATCCTAG
Protein sequenceShow/hide protein sequence
MYDVDCDQWIKAMDFEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAGKVQTFKARLVAKCYTQKEGIDYEETFSLVAMIKSIRILLSIVTFYDYEIWQMNVKTTF
LNGNLEESIYMVQPEWFIQKHQEQKVCKLQKSIYGLKQASRS