; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0048201 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0048201
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr02:13709697..13710104
RNA-Seq ExpressionCmc02g0048201
SyntenyCmc02g0048201
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037081.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-6189.92Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAMDLEMESMY NSVWTLVDQPN  KPIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEG+DYEE FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV T FLNGNLEESIY+VQPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

KAA0050040.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-71100Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL
        IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL

KAA0051500.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-6189.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIK MDLEMESMY NSVWTLVDQPN+VKPI CKW+YKRKRDQAGKVQTF+ARLVAKGYTQKEGIDYEE FS +AMIKSIRI+LSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IW MDV TTFLNGNL+ESIYMVQPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

KAA0060572.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-6189.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAM+LEMESM+FNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQ+E +DY+E FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV TTFLN NLEESIYM QPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

TYK02298.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-6189.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAM+LEMESM+FNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQ+E +DY+E FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV TTFLN NLEESIYM QPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

TrEMBL top hitse value%identityAlignment
A0A5A7U428 Gag/pol protein5.1e-72100Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL
        IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIRSRTKDL

A0A5A7U8L5 Gag/pol protein1.1e-6189.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIK MDLEMESMY NSVWTLVDQPN+VKPI CKW+YKRKRDQAGKVQTF+ARLVAKGYTQKEGIDYEE FS +AMIKSIRI+LSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IW MDV TTFLNGNL+ESIYMVQPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

A0A5A7UXJ0 Gag/pol protein8.2e-6289.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAM+LEMESM+FNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQ+E +DY+E FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV TTFLN NLEESIYM QPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

A0A5D3BTJ6 Gag/pol protein8.2e-6289.15Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAM+LEMESM+FNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQ+E +DY+E FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV TTFLN NLEESIYM QPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

A0A5D3C5F1 Gag/pol protein1.1e-6189.92Show/hide
Query:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAMDLEMESMY NSVWTLVDQPN  KPIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEG+DYEE FS +AM+KSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYE

Query:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR
        IWQMDV T FLNGNLEESIY+VQPEGFI+
Subjt:  IWQMDVNTTFLNGNLEESIYMVQPEGFIR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-2340.98Show/hide
Query:  DCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQM
        D   W +A++ E+ +   N+ WT+  +P +   +  +W++  K ++ G    +KARLVA+G+TQK  IDYEE F+ +A I S R +LS+   Y+ ++ QM
Subjt:  DCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQM

Query:  DVNTTFLNGNLEESIYMVQPEG
        DV T FLNG L+E IYM  P+G
Subjt:  DVNTTFLNGNLEESIYMVQPEG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-2950.41Show/hide
Query:  DQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMDV
        +Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ QK+GID++E+FS +  + SIR +LS+A   D E+ Q+DV
Subjt:  DQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMDV

Query:  NTTFLNGNLEESIYMVQPEGF
         T FL+G+LEE IYM QPEGF
Subjt:  NTTFLNGNLEESIYMVQPEGF

P92520 Uncharacterized mitochondrial protein AtMg008204.0e-1339.53Show/hide
Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q+EGI + E +S +    +IR +L++A
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-2443.9Show/hide
Query:  DQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMD
        ++W  AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY E FS +    SIRI+L +A    + I Q+D
Subjt:  DQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMD

Query:  VNTTFLNGNLEESIYMVQPEGFI
        VN  FL G L + +YM QP GFI
Subjt:  VNTTFLNGNLEESIYMVQPEGFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-2443.9Show/hide
Query:  DQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMD
        D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY E FS +    SIRI+L +A    + I Q+D
Subjt:  DQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMD

Query:  VNTTFLNGNLEESIYMVQPEGFI
        VN  FL G L + +YM QP GF+
Subjt:  VNTTFLNGNLEESIYMVQPEGFI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-2945.38Show/hide
Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMDVNT
        W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EGID+ E FS +  + S++++L+I+  Y++ + Q+D++ 
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMDVNT

Query:  TFLNGNLEESIYMVQPEGF
         FLNG+L+E IYM  P G+
Subjt:  TFLNGNLEESIYMVQPEGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.8e-1439.53Show/hide
Query:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q+EGI + E +S +    +IR +L++A
Subjt:  WIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATGTGGACTGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAATCTATGTATTTCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGCAAAAGGTTACACACAAAAGGAGGGAATAGATT
ATGAAGAAGTTTTCTCTTCTATTGCCATGATAAAGTCGATTAGAATACTTTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAATACAACATTT
TTGAATGGAAATCTTGAAGAGAGTATCTATATGGTCCAACCAGAGGGGTTTATACGGTCAAGAACAAAAGATTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGATGTGGACTGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAATCTATGTATTTCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGCAAAAGGTTACACACAAAAGGAGGGAATAGATT
ATGAAGAAGTTTTCTCTTCTATTGCCATGATAAAGTCGATTAGAATACTTTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAATACAACATTT
TTGAATGGAAATCTTGAAGAGAGTATCTATATGGTCCAACCAGAGGGGTTTATACGGTCAAGAACAAAAGATTTGTAA
Protein sequenceShow/hide protein sequence
MNDVDCDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGIDYEEVFSSIAMIKSIRILLSIATFYDYEIWQMDVNTTF
LNGNLEESIYMVQPEGFIRSRTKDL