CuGenDBv2

Cmc03g0068971 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0068971
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr03:13533546..13533968
RNA-Seq Expression: Cmc03g0068971
Synteny: Cmc03g0068971
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
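
The genome location string can be converted into a feature length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("CMiso1.1chr03:13533546..13533968")

# Assumption: both coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(seqid, span)
```

Under that assumption the span is 423 bp, which matches the length of the CDS listed under Sequences (140 codons plus a stop codon).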


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 2.8e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 2.8e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

KAA0063959.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.6e-70 | identity: 98.55%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCK
        IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQK  K
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCK

TYK06361.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 8.2e-67 | identity: 88.57%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDY+E FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 2.8e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A5A7TZD0 Gag/pol protein | e-value: 1.4e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

A0A5A7UYE8 Gag/pol protein | e-value: 1.4e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

A0A5D3C1D1 Gag/pol protein | e-value: 7.7e-71 | identity: 98.55%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCK
        IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQK  K
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCK

A0A5D3C378 Gag/pol protein | e-value: 4.0e-67 | identity: 88.57%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDY+E FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

A0A5D3CYF4 Gag/pol protein | e-value: 1.4e-67 | identity: 89.29%
Query:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE
        MNDVDKDQW+KAMDLE+ESMYFN VWE VDLP+G+KPIG KWIYK+KRDSAGKVQ FKARLVAKGYTQR+GVDYEE FSP+ MLKSIRILLSIATFYDYE
Subjt:  MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE

Query:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        IWQ+DVKTAFLNGNLEESIF+SQPEGFITQGQEQKVCKLN
Subjt:  IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

SwissProt top hits (columns: hit, e-value, % identity, alignment)
P04146 Copia protein | e-value: 1.5e-23 | identity: 38.24%
Query:  DKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQI
        DK  W +A++ EL +   N  W     P+    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEE F+P+  + S R +LS+   Y+ ++ Q+
Subjt:  DKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQI

Query:  DVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        DVKTAFLNG L+E I++  P+G         VCKLN
Subjt:  DVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.9e-35 | identity: 50.74%
Query:  DKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQI
        +K+Q +KAM  E+ES+  N  ++ V+LPKG +P+  KW++K K+D   K+  +KARLV KG+ Q++G+D++EIFSP+V + SIR +LS+A   D E+ Q+
Subjt:  DKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQI

Query:  DVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN
        DVKTAFL+G+LEE I++ QPEGF   G++  VCKLN
Subjt:  DVKTAFLNGNLEESIFISQPEGFITQGQEQKVCKLN

P92520 Uncharacterized mitochondrial protein AtMg00820 | e-value: 6.0e-12 | identity: 39.53%
Query:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIA
        W +AM  EL+++  N  W  V  P     +G KW++K K  S G +   KARLVAKG+ Q +G+ + E +SP+V   +IR +L++A
Subjt:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 4.3e-26 | identity: 43.28%
Query:  DQWIKAMDLELESMYFNLVWEFVDLPKG-IKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQID
        ++W  AM  E+ +   N  W+ V  P   +  +G +WI+ KK +S G +  +KARLVAKGY QR G+DY E FSP++   SIRI+L +A    + I Q+D
Subjt:  DQWIKAMDLELESMYFNLVWEFVDLPKG-IKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQID

Query:  VKTAFLNGNLEESIFISQPEGFITQGQEQKVCKL
        V  AFL G L + +++SQP GFI + +   VCKL
Subjt:  VKTAFLNGNLEESIFISQPEGFITQGQEQKVCKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 4.3e-26 | identity: 42.54%
Query:  DQWIKAMDLELESMYFNLVWEFV-DLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQID
        D+W +AM  E+ +   N  W+ V   P  +  +G +WI+ KK +S G +  +KARLVAKGY QR G+DY E FSP++   SIRI+L +A    + I Q+D
Subjt:  DQWIKAMDLELESMYFNLVWEFV-DLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQID

Query:  VKTAFLNGNLEESIFISQPEGFITQGQEQKVCKL
        V  AFL G L + +++SQP GF+ + +   VC+L
Subjt:  VKTAFLNGNLEESIFISQPEGFITQGQEQKVCKL

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 2.5e-29 | identity: 46.22%
Query:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQIDVKT
        W  AMD E+ +M     WE   LP   KPIG KW+YK K +S G ++ +KARLVAKGYTQ++G+D+ E FSP+  L S++++L+I+  Y++ + Q+D+  
Subjt:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQIDVKT

Query:  AFLNGNLEESIFISQPEGF
        AFLNG+L+E I++  P G+
Subjt:  AFLNGNLEESIFISQPEGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e-value: 4.2e-13 | identity: 39.53%
Query:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIA
        W +AM  EL+++  N  W  V  P     +G KW++K K  S G +   KARLVAKG+ Q +G+ + E +SP+V   +IR +L++A
Subjt:  WIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIA
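
The % identity values in the hit tables can be recovered from the middle line of each alignment. In BLAST-style output a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below recomputes the value for the KAA0063959.1 hit from its two middle-line blocks; `percent_identity` is a hypothetical helper, and it assumes the middle lines have not lost any trailing spaces during extraction.

```python
def percent_identity(midline_blocks: list[str]) -> float:
    """Percent identity from BLAST-style middle lines.

    Letters count as identical positions; '+' and spaces do not.
    The alignment length is the total number of middle-line columns.
    """
    midline = "".join(midline_blocks)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Middle lines of the KAA0063959.1 alignment, leading indentation removed.
blocks = [
    "MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYE",
    "IWQIDVKTAFLNGNLEESIFISQPEGFITQGQEQK  K",
]

identity = percent_identity(blocks)
print(f"{identity:.2f}")  # 136 identical columns out of 138
```

The result agrees with the 98.55% reported for that hit. The same calculation applied to a hit with 125 identical columns out of 140 gives 89.29%.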


Sequences
CDS sequence
ATGAATGATGTAGACAAGGACCAATGGATCAAAGCCATGGATCTTGAATTGGAGTCTATGTACTTCAATTTAGTGTGGGAGTTTGTAGATCTACCTAAAGGGATAAAACC
TATAGGGTACAAATGGATCTATAAAAAAAAGAGAGATTCAGCTGGGAAGGTACAGGCCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGCAAGGGGTTGACT
ATGAGGAAATTTTTTCCCCTATTGTTATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCCACATTTTATGATTATGAAATATGGCAAATAGATGTCAAGACTGCTTTT
CTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATTGA
mRNA sequence
ATGAATGATGTAGACAAGGACCAATGGATCAAAGCCATGGATCTTGAATTGGAGTCTATGTACTTCAATTTAGTGTGGGAGTTTGTAGATCTACCTAAAGGGATAAAACC
TATAGGGTACAAATGGATCTATAAAAAAAAGAGAGATTCAGCTGGGAAGGTACAGGCCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGCAAGGGGTTGACT
ATGAGGAAATTTTTTCCCCTATTGTTATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCCACATTTTATGATTATGAAATATGGCAAATAGATGTCAAGACTGCTTTT
CTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATTGA
Protein sequence
MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQIDVKTAF
LNGNLEESIFISQPEGFITQGQEQKVCKLN
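
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from the record above.
CDS = (
    "ATGAATGATGTAGACAAGGACCAATGGATCAAAGCCATGGATCTTGAATTGGAGTCTATGTACTTCAATTTAGTGTGGGAGTTTGTAGATCTACCTAAAGGGATAAAACC"
    "TATAGGGTACAAATGGATCTATAAAAAAAAGAGAGATTCAGCTGGGAAGGTACAGGCCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGCAAGGGGTTGACT"
    "ATGAGGAAATTTTTTCCCCTATTGTTATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCCACATTTTATGATTATGAAATATGGCAAATAGATGTCAAGACTGCTTTT"
    "CTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATTGA"
)
PROTEIN = (
    "MNDVDKDQWIKAMDLELESMYFNLVWEFVDLPKGIKPIGYKWIYKKKRDSAGKVQAFKARLVAKGYTQRQGVDYEEIFSPIVMLKSIRILLSIATFYDYEIWQIDVKTAF"
    "LNGNLEESIFISQPEGFITQGQEQKVCKLN"
)

translated = translate(CDS)
# The translation should end in a stop codon and otherwise equal the protein.
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 423 nt long (141 codons), and the listed protein has 140 residues, so the final codon is expected to be a stop.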