; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0053381 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0053381
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr02:20803976..20804371
RNA-Seq ExpressionCmc02g0053381
SyntenyCmc02g0053381
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-6798.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-6697.71Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-6697.71Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-6798.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-6798.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein4.8e-6797.71Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

A0A5A7TZD0 Gag/pol protein9.7e-6898.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

A0A5A7USZ2 Gag/pol protein6.3e-6797.71Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

A0A5A7UYE8 Gag/pol protein9.7e-6898.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

A0A5D3CYF4 Gag/pol protein9.7e-6898.47Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
        MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA
        K+AFLVLYVDDILLIGNDVGYLTD KAWLAA
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLAA

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-2040.46Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGK
        + S R +LS+   Y+ ++ QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y  ++KG 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGK

Query:  I---AFLVLYVDDILLIGNDVGYLTDAKAWL
        I    +++LYVDD+++   D+  + + K +L
Subjt:  I---AFLVLYVDDILLIGNDVGYLTDAKAWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-3048.46Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKG
        + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++ 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKG

Query:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLA
            L+LYVDD+L++G D G +   K  L+
Subjt:  KIAFLVLYVDDILLIGNDVGYLTDAKAWLA

P25600 Putative transposon Ty5-1 protein YCL074W5.6e-1236.56Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIAFLVLYVDDILL
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIAFLVLYVDDILL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2040.87Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIA
        SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++       I 
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIA

Query:  FLVLYVDDILLIGND
        ++++YVDDIL+ GND
Subjt:  FLVLYVDDILLIGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2040Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIA
        SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       I 
Subjt:  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIA

Query:  FLVLYVDDILLIGND
        ++++YVDDIL+ GND
Subjt:  FLVLYVDDILLIGND

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.8e-2136.36Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKI
        L S++++L+I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKI

Query:  NKGKIAFLVLYVDDILLIGNDVGYLTDAKAWL
               +++YVDDI++  N+   + + K+ L
Subjt:  NKGKIAFLVLYVDDILLIGNDVGYLTDAKAWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAGTCTATAAGGATTCTCTTATCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTAT
CTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTA
GGTTTGATACTGCGATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAGGGAAAAATAGCTTTCTTAGTACTTTATGTGGAC
GATATCCTCCTCATTGGAAATGATGTAGGATACCTTACTGATGCTAAAGCTTGGTTAGCAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAGTCTATAAGGATTCTCTTATCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTAT
CTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTA
GGTTTGATACTGCGATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAGGGAAAAATAGCTTTCTTAGTACTTTATGTGGAC
GATATCCTCCTCATTGGAAATGATGTAGGATACCTTACTGATGCTAAAGCTTGGTTAGCAGCCTAA
Protein sequenceShow/hide protein sequence
MLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKIAFLVLYVD
DILLIGNDVGYLTDAKAWLAA