; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0017541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0017541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr01:15515513..15515914
RNA-Seq ExpressionCmc01g0017541
SyntenyCmc01g0017541
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.1e-5787.79Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        M+LEMESMY NS+WTLVD P++VKPIGCKWIYKRKRDQAGKVQTFKARLV KGYTQKE VDYEETFSPVA LKSIRILLSIATFYNYEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPEGFI + QEQ VC+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

KAA0037081.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5788.98Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLEMESMYSNS+WTLVDQPN+ KPIGCKWIYKRKRDQA KVQTFKARLV KGYTQKE VDYEETFSPVA LKSIRILLSIATFY+YEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCEL
         NLEESIY+VQPEGFIQK QEQ VC L
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCEL

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-5986.26Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDL+MESMYSNS+WTLVDQPNN++PIGCKWIYKRKRDQ  KVQTF+ARLV KGYTQKE +DYEETFSP+A +KSIRILLSIATFY+YEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPEGFIQKGQEQ VC+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

KAA0059715.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-5685.27Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLE+ESM+ N +WTLVDQPNN+KPIGCKWIYKRKRDQA KVQTFKARLV KGYTQ+E VDYEETFSPVA LKSIRILLSIATFY+YEIWQMDVK  F N
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQK
        ENLEESIYM QPEGFI+KGQEQ VC+LQK
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQK

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-5787.79Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLEMESMYSNS+WTLVDQPN+VKPIGCKWIYKRKRDQAGKVQTFKARLV KGYTQKE +DYEE FS  A +KSIRILLSIATFY+YEIWQMDVKTTFLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPE FIQKGQEQ +C+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein9.9e-6086.26Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDL+MESMYSNS+WTLVDQPNN++PIGCKWIYKRKRDQ  KVQTF+ARLV KGYTQKE +DYEETFSP+A +KSIRILLSIATFY+YEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPEGFIQKGQEQ VC+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

A0A5D3BX45 Gag/pol protein7.1e-5887.79Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLEMESMYSNS+WTLVDQPN+VKPIGCKWIYKRKRDQAGKVQTFKARLV KGYTQKE +DYEE FS  A +KSIRILLSIATFY+YEIWQMDVKTTFLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPE FIQKGQEQ +C+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

A0A5D3C5F1 Gag/pol protein2.7e-5788.98Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLEMESMYSNS+WTLVDQPN+ KPIGCKWIYKRKRDQA KVQTFKARLV KGYTQKE VDYEETFSPVA LKSIRILLSIATFY+YEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCEL
         NLEESIY+VQPEGFIQK QEQ VC L
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCEL

A0A5D3DR66 Gag/pol protein7.8e-5785.27Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MDLE+ESM+ N +WTLVDQPNN+KPIGCKWIYKRKRDQA KVQTFKARLV KGYTQ+E VDYEETFSPVA LKSIRILLSIATFY+YEIWQMDVK  F N
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQK
        ENLEESIYM QPEGFI+KGQEQ VC+LQK
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQK

E2GK51 Gag/pol protein (Fragment)5.4e-5887.79Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        M+LEMESMY NS+WTLVD P++VKPIGCKWIYKRKRDQAGKVQTFKARLV KGYTQKE VDYEETFSPVA LKSIRILLSIATFYNYEIWQMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         NLEESIYMVQPEGFI + QEQ VC+LQKSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-2441.22Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        ++ E+ +   N+ WT+  +P N   +  +W++  K ++ G    +KARLV +G+TQK ++DYEETF+PVA + S R +LS+   YN ++ QMDVKT FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
          L+E IYM  P+G         VC+L K+I
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-3048.09Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        M  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ QK+ +D++E FSPV  + SIR +LS+A   + E+ Q+DVKT FL+
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI
         +LEE IYM QPEGF   G++ +VC+L KS+
Subjt:  ENLEESIYMVQPEGFIQKGQEQIVCELQKSI

P92520 Uncharacterized mitochondrial protein AtMg008205.7e-1240.24Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIA
        M  E++++  N  W LV  P N   +GCKW++K K    G +   KARLV KG+ Q+E + + ET+SPV    +IR +L++A
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-2442.42Show/hide
Query:  MDLEMESMYSNSIWTLV-DQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFL
        M  E+ +   N  W LV   P++V  +GC+WI+ +K +  G +  +KARLV KGY Q+  +DY ETFSPV    SIRI+L +A   ++ I Q+DV   FL
Subjt:  MDLEMESMYSNSIWTLV-DQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFL

Query:  NENLEESIYMVQPEGFIQKGQEQIVCELQKSI
           L + +YM QP GFI K +   VC+L+K++
Subjt:  NENLEESIYMVQPEGFIQKGQEQIVCELQKSI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2342.42Show/hide
Query:  MDLEMESMYSNSIWTLV-DQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFL
        M  E+ +   N  W LV   P +V  +GC+WI+ +K +  G +  +KARLV KGY Q+  +DY ETFSPV    SIRI+L +A   ++ I Q+DV   FL
Subjt:  MDLEMESMYSNSIWTLV-DQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFL

Query:  NENLEESIYMVQPEGFIQKGQEQIVCELQKSI
           L + +YM QP GF+ K +   VC L+K+I
Subjt:  NENLEESIYMVQPEGFIQKGQEQIVCELQKSI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-2944.44Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN
        MD E+ +M +   W +   P N KPIGCKW+YK K +  G ++ +KARLV KGYTQ+E +D+ ETFSPV  L S++++L+I+  YN+ + Q+D+   FLN
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLN

Query:  ENLEESIYMVQPEGFIQKGQEQI----VCELQKSI
         +L+E IYM  P G+  +  + +    VC L+KSI
Subjt:  ENLEESIYMVQPEGFIQKGQEQI----VCELQKSI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.0e-1340.24Show/hide
Query:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIA
        M  E++++  N  W LV  P N   +GCKW++K K    G +   KARLV KG+ Q+E + + ET+SPV    +IR +L++A
Subjt:  MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTTGAAATGGAATCTATGTATTCCAATTCTATCTGGACTCTAGTAGATCAACCAAATAATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAA
CGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGACAAAAGGTTATACACAAAAGGAGAGAGTGGATTATGAAGAAACTTTCTCTCCTGTTGCA
TTTCTGAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATAATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTGAATGAAAATCTTGAGGAG
AGTATTTATATGGTCCAACCAGAAGGGTTTATACAAAAGGGTCAAGAACAAATTGTTTGTGAGCTTCAAAAATCCATAATGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTTGAAATGGAATCTATGTATTCCAATTCTATCTGGACTCTAGTAGATCAACCAAATAATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAA
CGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGACAAAAGGTTATACACAAAAGGAGAGAGTGGATTATGAAGAAACTTTCTCTCCTGTTGCA
TTTCTGAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATAATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTGAATGAAAATCTTGAGGAG
AGTATTTATATGGTCCAACCAGAAGGGTTTATACAAAAGGGTCAAGAACAAATTGTTTGTGAGCTTCAAAAATCCATAATGGATTAA
Protein sequenceShow/hide protein sequence
MDLEMESMYSNSIWTLVDQPNNVKPIGCKWIYKRKRDQAGKVQTFKARLVTKGYTQKERVDYEETFSPVAFLKSIRILLSIATFYNYEIWQMDVKTTFLNENLEE
SIYMVQPEGFIQKGQEQIVCELQKSIMD