CuGenDBv2

Cmc01g0026821 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0026821
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr01:27935792..27936295
RNA-Seq Expression: Cmc01g0026821
Synteny: Cmc01g0026821
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR043502 - DNA/RNA polymerase superfamily
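The genome location can be turned into a span length and compared with the sequence data. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CMiso1.1chr01:27935792..27936295")
span = end - start + 1  # assumption: both endpoints are included
print(seqid, span)
```

Under that assumption the span is 504 bp, which equals three times the 167-residue protein plus one stop codon.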


Homology
GenBank top hits (columns: e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 4.2e-78; identity: 88.02%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQWIKAM+LEME MY NSVWTLVD PS V+PIGCKWIY RKRDQAGKVQTFKARLVAKGYTQKEG+DYEETFSPVAM+KS RILLSIATFY+YE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE IYMVQ EGFI + QEQKVCKLQKSIYGLKQASRSWNI FDTAIKSYGFE
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.6e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-80; identity: 89.16%)
Query:  NDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEI
        NDVD DQWIKAMDL+ME MYSNSVWTLVDQP+ +RPIGCKWIY RKRDQ  KVQTF+ARLVAKGYTQKEGIDYEETFSP+AMIKS RILLSIATFYDYEI
Subjt:  NDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEI

Query:  WQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        WQ+DVKT FLNGNLEE IYMVQ EGFIQKGQEQKVCKLQKSIYGLKQASRSWNI FDT IKSYGFE
Subjt:  WQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.6e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.6e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7TTA2 Gag/pol protein (e value: 1.7e-80; identity: 89.16%)
Query:  NDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEI
        NDVD DQWIKAMDL+ME MYSNSVWTLVDQP+ +RPIGCKWIY RKRDQ  KVQTF+ARLVAKGYTQKEGIDYEETFSP+AMIKS RILLSIATFYDYEI
Subjt:  NDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEI

Query:  WQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        WQ+DVKT FLNGNLEE IYMVQ EGFIQKGQEQKVCKLQKSIYGLKQASRSWNI FDT IKSYGFE
Subjt:  WQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

A0A5A7TZD0 Gag/pol protein (e value: 1.2e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

A0A5A7UYE8 Gag/pol protein (e value: 1.2e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

A0A5D3CYF4 Gag/pol protein (e value: 1.2e-75; identity: 83.83%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQW+KAMDLEME MY NSVW LVD P  V+PIGCKWIY RKRD AGKVQTFKARLVAKGYTQ+EG+DYEETFSPVAM+KS RILLSIATFYDYE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE I+M Q EGFI +GQEQKVCKL +SIYGLKQASRSWNI FDTAIKSYGF+
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

E2GK51 Gag/pol protein (Fragment) (e value: 2.0e-78; identity: 88.02%)
Query:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE
        MNDVD DQWIKAM+LEME MY NSVWTLVD PS V+PIGCKWIY RKRDQAGKVQTFKARLVAKGYTQKEG+DYEETFSPVAM+KS RILLSIATFY+YE
Subjt:  MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYE

Query:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
        IWQ+DVKT FLNGNLEE IYMVQ EGFI + QEQKVCKLQKSIYGLKQASRSWNI FDTAIKSYGFE
Subjt:  IWQLDVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 1.6e-32; identity: 43.21%)
Query:  DCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQL
        D   W +A++ E+     N+ WT+  +P     +  +W+++ K ++ G    +KARLVA+G+TQK  IDYEETF+PVA I S R +LS+   Y+ ++ Q+
Subjt:  DCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQL

Query:  DVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF
        DVKT FLNG L+E IYM   +G         VCKL K+IYGLKQA+R W  +F+ A+K   F
Subjt:  DVKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.3e-39; identity: 51.59%)
Query:  DQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLDV
        +Q +KAM  EME +  N  + LV+ P   RP+ CKW++  K+D   K+  +KARLV KG+ QK+GID++E FSPV  + S R +LS+A   D E+ QLDV
Subjt:  DQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLDV

Query:  KTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKS
        KT FL+G+LEE IYM Q EGF   G++  VCKL KS+YGLKQA R W + FD+ +KS
Subjt:  KTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKS

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 9.9e-14; identity: 40.7%)
Query:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIA
        W +AM  E++ +  N  W LV  P     +GCKW++  K    G +   KARLVAKG+ Q+EGI + ET+SPV    + R +L++A
Subjt:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.0e-34; identity: 45.34%)
Query:  DQWIKAMDLEMEFMYSNSVWTLV-DQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLD
        ++W  AM  E+     N  W LV   PS V  +GC+WI+T+K +  G +  +KARLVAKGY Q+ G+DY ETFSPV    S RI+L +A    + I QLD
Subjt:  DQWIKAMDLEMEFMYSNSVWTLV-DQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLD

Query:  VKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF
        V   FL G L + +YM Q  GFI K +   VCKL+K++YGLKQA R+W +     + + GF
Subjt:  VKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-34; identity: 45.34%)
Query:  DQWIKAMDLEMEFMYSNSVWTLV-DQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLD
        D+W +AM  E+     N  W LV   P  V  +GC+WI+T+K +  G +  +KARLVAKGY Q+ G+DY ETFSPV    S RI+L +A    + I QLD
Subjt:  DQWIKAMDLEMEFMYSNSVWTLV-DQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLD

Query:  VKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF
        V   FL G L + +YM Q  GF+ K +   VC+L+K+IYGLKQA R+W +   T + + GF
Subjt:  VKTGFLNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.9e-37; identity: 45.06%)
Query:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLDVKT
        W  AMD E+  M +   W +   P   +PIGCKW+Y  K +  G ++ +KARLVAKGYTQ+EGID+ ETFSPV  + S +++L+I+  Y++ + QLD+  
Subjt:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLDVKT

Query:  GFLNGNLEEIIYMVQLEGFIQKGQE----QKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF
         FLNG+L+E IYM    G+  +  +      VC L+KSIYGLKQASR W + F   +  +GF
Subjt:  GFLNGNLEEIIYMVQLEGFIQKGQE----QKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 7.1e-15; identity: 40.7%)
Query:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIA
        W +AM  E++ +  N  W LV  P     +GCKW++  K    G +   KARLVAKG+ Q+EGI + ET+SPV    + R +L++A
Subjt:  WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIA
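The % identity figures can be reproduced from the alignments themselves. The sketch below assumes the middle row of each alignment follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that identity is computed over the full alignment length. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_row, midline):
    """Identical columns divided by alignment length, as a percentage."""
    # Assumption: every letter in the midline marks an identical residue pair.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query_row)

# Query row and midline copied from the ATMG00820.1 alignment above.
query_row = "WIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIA"
midline = "W +AM  E++ +  N  W LV  P     +GCKW++  K    G +   KARLVAKG+ Q+EGI + ET+SPV    + R +L++A"

print(round(percent_identity(query_row, midline), 2))
```

For this hit the midline holds 35 letters across 86 alignment columns, which rounds to the 40.7 listed for ATMG00820.1. Only letters are counted, so small differences in midline spacing do not affect the result.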


Sequences
CDS sequence
ATGAATGATGTGGATTGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAATTTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAGTAAGGTAAGACC
TATTGGTTGTAAATGGATCTACACGAGAAAACGAGACCAAGCTGGTAAAGTACAAACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAATCGACTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGCTGGATGTCAAGACAGGCTTT
TTGAACGGTAATCTTGAAGAGATTATTTATATGGTCCAACTAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCCATTTATGGATTGAA
ACAAGCATCTAGATCCTGGAATATAATATTTGATACTGCGATCAAATCTTATGGTTTTGAATAG
mRNA sequence
ATGAATGATGTGGATTGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAATTTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAACCAAGTAAGGTAAGACC
TATTGGTTGTAAATGGATCTACACGAGAAAACGAGACCAAGCTGGTAAAGTACAAACTTTCAAAGCTCGACTTGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATT
ATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAATCGACTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGCTGGATGTCAAGACAGGCTTT
TTGAACGGTAATCTTGAAGAGATTATTTATATGGTCCAACTAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCCATTTATGGATTGAA
ACAAGCATCTAGATCCTGGAATATAATATTTGATACTGCGATCAAATCTTATGGTTTTGAATAG
Protein sequence
MNDVDCDQWIKAMDLEMEFMYSNSVWTLVDQPSKVRPIGCKWIYTRKRDQAGKVQTFKARLVAKGYTQKEGIDYEETFSPVAMIKSTRILLSIATFYDYEIWQLDVKTGF
LNGNLEEIIYMVQLEGFIQKGQEQKVCKLQKSIYGLKQASRSWNIIFDTAIKSYGFE
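The protein sequence can be checked against the CDS by translating it. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0. `translate` is a hypothetical helper written for illustration rather than a library call.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 27 and last 12 bases of the CDS listed above.
print(translate("ATGAATGATGTGGATTGTGACCAATGG"))
print(translate("GGTTTTGAATAG"))
```

The two fragments translate to `MNDVDCDQW` and `GFE`, matching the start and end of the protein sequence. Passing the full CDS, with the line breaks removed, should give the complete 167-residue protein.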