; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0105321 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0105321
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:23449164..23449622
RNA-Seq ExpressionCmc04g0105321
SyntenyCmc04g0105321
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.0e-7088.82Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQWIKAMNLEMESMY NSVWTLVD P+DV+PIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEG++YEETFSP+AM+KSIRI LSIATFY+YE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESIYMVQPEGFI + QEQKVCKLQKSIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-6783.55Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQW+KAM+LEMESMY NSVW LVD P  V+PIGCKWIYKRKRD A KVQTFKARLVAKGYTQ+EG++YEETFSP+AM+KSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-7392.05Show/hide
Query:  NDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEI
        NDVDSDQWIKAM+L+MESMYSNSVWTLVDQPN++RPIGCKWIYKRKRDQ  KVQTF+ARLVAKGYTQKEGI+YEETFSPIAMIKSIRI LSIATFYDYEI
Subjt:  NDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEI

Query:  WQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        WQM+VKTAFLN NLEESIYMVQPEGFI+KGQEQKVCKLQKSIYGLKQASRS
Subjt:  WQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-7090.13Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQWIKAM+LEMESMYSNSVWTLVDQPNDV+PIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEGI+YEE FS  AMIKSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKT FLN NLEESIYMVQPE FI+KGQEQK+CKLQKSIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-6783.55Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQW+KAM+LEMESMY NSVW LVD P  V+PIGCKWIYKRKRD A KVQTFKARLVAKGYTQ+EG++YEETFSP+AM+KSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein8.1e-7492.05Show/hide
Query:  NDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEI
        NDVDSDQWIKAM+L+MESMYSNSVWTLVDQPN++RPIGCKWIYKRKRDQ  KVQTF+ARLVAKGYTQKEGI+YEETFSPIAMIKSIRI LSIATFYDYEI
Subjt:  NDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEI

Query:  WQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        WQM+VKTAFLN NLEESIYMVQPEGFI+KGQEQKVCKLQKSIYGLKQASRS
Subjt:  WQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

A0A5A7TZD0 Gag/pol protein3.3e-6783.55Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQW+KAM+LEMESMY NSVW LVD P  V+PIGCKWIYKRKRD A KVQTFKARLVAKGYTQ+EG++YEETFSP+AM+KSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

A0A5D3BX45 Gag/pol protein1.1e-7090.13Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQWIKAM+LEMESMYSNSVWTLVDQPNDV+PIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEGI+YEE FS  AMIKSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKT FLN NLEESIYMVQPE FI+KGQEQK+CKLQKSIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

A0A5D3CYF4 Gag/pol protein3.3e-6783.55Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQW+KAM+LEMESMY NSVW LVD P  V+PIGCKWIYKRKRD A KVQTFKARLVAKGYTQ+EG++YEETFSP+AM+KSIRI LSIATFYDYE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESI+M QPEGFI +GQEQKVCKL +SIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

E2GK51 Gag/pol protein (Fragment)4.9e-7188.82Show/hide
Query:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE
        MNDVD DQWIKAMNLEMESMY NSVWTLVD P+DV+PIGCKWIYKRKRDQA KVQTFKARLVAKGYTQKEG++YEETFSP+AM+KSIRI LSIATFY+YE
Subjt:  MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYE

Query:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        IWQM+VKTAFLN NLEESIYMVQPEGFI + QEQKVCKLQKSIYGLKQASRS
Subjt:  IWQMNVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.8e-2943.54Show/hide
Query:  DSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQM
        D   W +A+N E+ +   N+ WT+  +P +   +  +W++  K ++      +KARLVA+G+TQK  I+YEETF+P+A I S R  LS+   Y+ ++ QM
Subjt:  DSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQM

Query:  NVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASR
        +VKTAFLN  L+E IYM  P+G         VCKL K+IYGLKQA+R
Subjt:  NVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-3852.38Show/hide
Query:  DSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQM
        + +Q +KAM  EMES+  N  + LV+ P   RP+ CKW++K K+D  CK+  +KARLV KG+ QK+GI+++E FSP+  + SIR  LS+A   D E+ Q+
Subjt:  DSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQM

Query:  NVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASR
        +VKTAFL+ +LEE IYM QPEGF   G++  VCKL KS+YGLKQA R
Subjt:  NVKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASR

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-1340.7Show/hide
Query:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K      +   KARLVAKG+ Q+EGI + ET+SP+    +IR  L++A
Subjt:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.9e-3044.22Show/hide
Query:  DQWIKAMNLEMESMYSNSVWTLV-DQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMN
        ++W  AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +    +  +KARLVAKGY Q+ G++Y ETFSP+    SIRI L +A    + I Q++
Subjt:  DQWIKAMNLEMESMYSNSVWTLV-DQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMN

Query:  VKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        V  AFL   L + +YM QP GFI K +   VCKL+K++YGLKQA R+
Subjt:  VKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-3044.22Show/hide
Query:  DQWIKAMNLEMESMYSNSVWTLV-DQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMN
        D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +    +  +KARLVAKGY Q+ G++Y ETFSP+    SIRI L +A    + I Q++
Subjt:  DQWIKAMNLEMESMYSNSVWTLV-DQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMN

Query:  VKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS
        V  AFL   L + +YM QP GF+ K +   VC+L+K+IYGLKQA R+
Subjt:  VKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-3244.22Show/hide
Query:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMNVKT
        W  AM+ E+ +M +   W +   P + +PIGCKW+YK K +    ++ +KARLVAKGYTQ+EGI++ ETFSP+  + S+++ L+I+  Y++ + Q+++  
Subjt:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMNVKT

Query:  AFLNCNLEESIYMVQPEGFIKKGQE----QKVCKLQKSIYGLKQASR
        AFLN +L+E IYM  P G+  +  +      VC L+KSIYGLKQASR
Subjt:  AFLNCNLEESIYMVQPEGFIKKGQE----QKVCKLQKSIYGLKQASR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.4e-1540.7Show/hide
Query:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K      +   KARLVAKG+ Q+EGI + ET+SP+    +IR  L++A
Subjt:  WIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATGTAGACTCTGACCAATGGATCAAAGCCATGAACCTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTA
AGACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTTGCAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAG
GGAATAAATTATGAAGAAACTTTCTCTCCTATTGCCATGATAAAGTCGATTAGAATACCCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGAAT
GTCAAGACAGCCTTTTTGAACTGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATAAAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAA
AAATCCATTTATGGATTAAAGCAAGCTTCTAGATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGATGTAGACTCTGACCAATGGATCAAAGCCATGAACCTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTA
AGACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTTGCAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAG
GGAATAAATTATGAAGAAACTTTCTCTCCTATTGCCATGATAAAGTCGATTAGAATACCCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGAAT
GTCAAGACAGCCTTTTTGAACTGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATAAAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAA
AAATCCATTTATGGATTAAAGCAAGCTTCTAGATCCTAG
Protein sequenceShow/hide protein sequence
MNDVDSDQWIKAMNLEMESMYSNSVWTLVDQPNDVRPIGCKWIYKRKRDQACKVQTFKARLVAKGYTQKEGINYEETFSPIAMIKSIRIPLSIATFYDYEIWQMN
VKTAFLNCNLEESIYMVQPEGFIKKGQEQKVCKLQKSIYGLKQASRS