CuGenDBv2

Cmc04g0094241 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0094241
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr04:6775347..6775784
RNA-Seq Expression: Cmc04g0094241
Synteny: Cmc04g0094241
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
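The genome location string follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span length can be derived as below. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CMiso1.1chr04:6775347..6775784")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the span is 438 bp, which equals 145 codons plus one stop codon.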


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025729.1 gag/pol protein [Cucumis melo var. makuwa] | 1.1e-58 | 86.13
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKAIDL+MESMYSNSVWTLVDQ ++V+PI C+WIYKRKRDQA KVQ  KARLVAKGYTQ EGIDYEETFS V MIKSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV
        IW MDVKT FLNGNLEESIYMVQSEGFIQK QEQK V
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV

KAA0037081.1 gag/pol protein [Cucumis melo var. makuwa] | 4.5e-60 | 87.41
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKA+DLEMESMYSNSVWTLVDQPN  KPI CKWIYKRKRDQA+KVQ  KARLVAKGYTQKEG+DYEETFS V M+KSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK
        IWQMDVKT FLNGNLEESIY+VQ EGFIQK QEQK
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa] | 1.4e-58 | 85.82
Query:  NDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEI
        NDVD +QWIKA+DL+MESMYSNSVWTLVDQPN+++PI CKWIYKRKRDQ AKVQ  +ARLVAKGYTQKEGIDYEETFS + MIKSIRILLSIATFYD EI
Subjt:  NDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEI

Query:  WQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK
        WQMDVKT FLNGNLEESIYMVQ EGFIQK QEQK
Subjt:  WQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK

KAA0051500.1 gag/pol protein [Cucumis melo var. makuwa] | 3.4e-60 | 84.78
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIK +DLEMESMYSNSVWTLVDQPN+VKPI CKW+YKRKRDQA KVQ  +ARLVAKGYTQKEGIDYEETFS V MIKSIRI+LSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFVS
        IW MDVKTTFLNGNL+ESIYMVQ EGFIQK QEQK ++
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFVS

TYK06159.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-59 | 86.86
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKAIDL+MESMYSNSVWTLVDQ ++V+PI C+WIYKRKRDQA KVQ  KARLVAKGYTQKEGIDYEETFS V MIKSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV
        IW MDVKT FLNGNLEESIYMVQSEGFIQK QEQK V
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SKC9 Gag/pol protein | 5.4e-59 | 86.13
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKAIDL+MESMYSNSVWTLVDQ ++V+PI C+WIYKRKRDQA KVQ  KARLVAKGYTQ EGIDYEETFS V MIKSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV
        IW MDVKT FLNGNLEESIYMVQSEGFIQK QEQK V
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV

A0A5A7TTA2 Gag/pol protein | 7.0e-59 | 85.82
Query:  NDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEI
        NDVD +QWIKA+DL+MESMYSNSVWTLVDQPN+++PI CKWIYKRKRDQ AKVQ  +ARLVAKGYTQKEGIDYEETFS + MIKSIRILLSIATFYD EI
Subjt:  NDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEI

Query:  WQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK
        WQMDVKT FLNGNLEESIYMVQ EGFIQK QEQK
Subjt:  WQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK

A0A5A7U8L5 Gag/pol protein | 1.7e-60 | 84.78
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIK +DLEMESMYSNSVWTLVDQPN+VKPI CKW+YKRKRDQA KVQ  +ARLVAKGYTQKEGIDYEETFS V MIKSIRI+LSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFVS
        IW MDVKTTFLNGNL+ESIYMVQ EGFIQK QEQK ++
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFVS

A0A5D3C5F1 Gag/pol protein | 2.2e-60 | 87.41
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKA+DLEMESMYSNSVWTLVDQPN  KPI CKWIYKRKRDQA+KVQ  KARLVAKGYTQKEG+DYEETFS V M+KSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK
        IWQMDVKT FLNGNLEESIY+VQ EGFIQK QEQK
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQK

A0A5D3C701 Gag/pol protein | 1.4e-59 | 86.86
Query:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE
        MNDVDC+QWIKAIDL+MESMYSNSVWTLVDQ ++V+PI C+WIYKRKRDQA KVQ  KARLVAKGYTQKEGIDYEETFS V MIKSIRILLSIATFYD E
Subjt:  MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE

Query:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV
        IW MDVKT FLNGNLEESIYMVQSEGFIQK QEQK V
Subjt:  IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.7e-22 | 42.62
Query:  DCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQM
        D + W +AI+ E+ +   N+ WT+  +P +   +D +W++  K ++       KARLVA+G+TQK  IDYEETF+ V  I S R +LS+   Y+ ++ QM
Subjt:  DCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQM

Query:  DVKTTFLNGNLEESIYMVQSEG
        DVKT FLNG L+E IYM   +G
Subjt:  DVKTTFLNGNLEESIYMVQSEG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 7.3e-29 | 52.07
Query:  NQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDV
        NQ +KA+  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+   KARLV KG+ QK+GID++E FS VV + SIR +LS+A   D E+ Q+DV
Subjt:  NQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDV

Query:  KTTFLNGNLEESIYMVQSEGF
        KT FL+G+LEE IYM Q EGF
Subjt:  KTTFLNGNLEESIYMVQSEGF

P92520 Uncharacterized mitochondrial protein AtMg00820 | 3.6e-12 | 39.53
Query:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIA
        W +A+  E++++  N  W LV  P +   + CKW++K K      +   KARLVAKG+ Q+EGI + ET+S VV   +IR +L++A
Subjt:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 3.3e-21 | 42.06
Query:  QWIKAIDLEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDV
        +W  A+  E+ +   N  W LV   P+ V  + C+WI+ +K +    +   KARLVAKGY Q+ G+DY ETFS V+   SIRI+L +A      I Q+DV
Subjt:  QWIKAIDLEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDV

Query:  KTTFLNGNLEESIYMVQSEGFIQKDQ
           FL G L + +YM Q  GFI KD+
Subjt:  KTTFLNGNLEESIYMVQSEGFIQKDQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.6e-21 | 40.94
Query:  NQWIKAIDLEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMD
        ++W +A+  E+ +   N  W LV   P  V  + C+WI+ +K +    +   KARLVAKGY Q+ G+DY ETFS V+   SIRI+L +A      I Q+D
Subjt:  NQWIKAIDLEMESMYSNSVWTLV-DQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMD

Query:  VKTTFLNGNLEESIYMVQSEGFIQKDQ
        V   FL G L + +YM Q  GF+ KD+
Subjt:  VKTTFLNGNLEESIYMVQSEGFIQKDQ

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.2e-25 | 43.7
Query:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDVKT
        W  A+D E+ +M +   W +   P + KPI CKW+YK K +    ++  KARLVAKGYTQ+EGID+ ETFS V  + S++++L+I+  Y+  + Q+D+  
Subjt:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDVKT

Query:  TFLNGNLEESIYMVQSEGF
         FLNG+L+E IYM    G+
Subjt:  TFLNGNLEESIYMVQSEGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 2.6e-13 | 39.53
Query:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIA
        W +A+  E++++  N  W LV  P +   + CKW++K K      +   KARLVAKG+ Q+EGI + ET+S VV   +IR +L++A
Subjt:  WIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIA
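The % identity values can be related to the alignments listed above. The following is a minimal sketch, assuming the usual BLAST-style display in which the middle line repeats the residue at identical positions, shows "+" for a conservative substitution and a space otherwise, and assuming % identity is the number of identical columns divided by the alignment length. The helper name `percent_identity` is hypothetical; the two strings are the query and middle lines of the first GenBank hit (KAA0025729.1), with both alignment blocks joined.

```python
def percent_identity(query, midline):
    """Return (identical columns, percent identity) for one gapless alignment.

    Assumption: a midline character equal to the query residue marks an
    identical column; '+' and ' ' mark non-identical columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and q == m)
    return identities, 100.0 * identities / len(query)


# Query and middle lines of the KAA0025729.1 alignment, both blocks joined.
query = (
    "MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNE"
    "IWQMDVKTTFLNGNLEESIYMVQSEGFIQKDQEQKFV"
)
midline = (
    "MNDVDC+QWIKAIDL+MESMYSNSVWTLVDQ ++V+PI C+WIYKRKRDQA KVQ  KARLVAKGYTQ EGIDYEETFS V MIKSIRILLSIATFYD E"
    "IW MDVKT FLNGNLEESIYMVQSEGFIQK QEQK V"
)

identities, pct = percent_identity(query, midline)
print(identities, len(query), round(pct, 2))
```

For this hit the sketch counts 118 identical columns out of 137, which rounds to the listed 86.13.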


Sequences
CDS sequence
ATGAATGATGTGGACTGTAACCAATGGATCAAAGCCATAGACCTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGATTGTAAATGGATTTACAAGAGAAAACGAGACCAAGCTGCTAAAGTACAGAATTCCAAAGCTCGATTAGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCATGTTGTCATGATAAAGTCGATTAGAATTCTCTTATCCATCGCCACTTTTTATGATAATGAAATTTGGCAGATGGATGTCAAGACAACCTTT
TTGAATGGAAATCTTGAGGAGAGTATCTATATGGTCCAATCAGAGGGGTTTATACAAAAGGACCAAGAACAAAAGTTTGTAAGCTTCAAAAATCCATACATGGATTGA
mRNA sequence
ATGAATGATGTGGACTGTAACCAATGGATCAAAGCCATAGACCTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC
TATTGATTGTAAATGGATTTACAAGAGAAAACGAGACCAAGCTGCTAAAGTACAGAATTCCAAAGCTCGATTAGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCATGTTGTCATGATAAAGTCGATTAGAATTCTCTTATCCATCGCCACTTTTTATGATAATGAAATTTGGCAGATGGATGTCAAGACAACCTTT
TTGAATGGAAATCTTGAGGAGAGTATCTATATGGTCCAATCAGAGGGGTTTATACAAAAGGACCAAGAACAAAAGTTTGTAAGCTTCAAAAATCCATACATGGATTGA
Protein sequence
MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDVKTTF
LNGNLEESIYMVQSEGFIQKDQEQKFVSFKNPYMD
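The protein sequence above can be reproduced from the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and that the CDS as listed starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS of Cmc04g0094241 as listed on this page, one string per displayed line.
cds = (
    "ATGAATGATGTGGACTGTAACCAATGGATCAAAGCCATAGACCTCGAAATGGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACC"
    "TATTGATTGTAAATGGATTTACAAGAGAAAACGAGACCAAGCTGCTAAAGTACAGAATTCCAAAGCTCGATTAGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATT"
    "ATGAAGAAACTTTCTCTCATGTTGTCATGATAAAGTCGATTAGAATTCTCTTATCCATCGCCACTTTTTATGATAATGAAATTTGGCAGATGGATGTCAAGACAACCTTT"
    "TTGAATGGAAATCTTGAGGAGAGTATCTATATGGTCCAATCAGAGGGGTTTATACAAAAGGACCAAGAACAAAAGTTTGTAAGCTTCAAAAATCCATACATGGATTGA"
)

# Protein sequence as listed on this page.
listed_protein = (
    "MNDVDCNQWIKAIDLEMESMYSNSVWTLVDQPNDVKPIDCKWIYKRKRDQAAKVQNSKARLVAKGYTQKEGIDYEETFSHVVMIKSIRILLSIATFYDNEIWQMDVKTTF"
    "LNGNLEESIYMVQSEGFIQKDQEQKFVSFKNPYMD"
)

translated = translate(cds)
# Drop the terminal stop codon before comparing with the listed protein.
print(translated.rstrip("*") == listed_protein)
```

The comparison strips the trailing stop codon because the listed protein sequence does not include a stop symbol.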