; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0045541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0045541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr02:9423239..9423604
RNA-Seq ExpressionCmc02g0045541
SyntenyCmc02g0045541
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035790.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-4085.71Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWGI
        Y+MDVKSAFLN YLNEEVFVAQPKGFVDSEFP HVYKLNKALYGLKQAPRAWYERLTIYL DKGYS+ GT KTLFINRTSSELI+AQIY+D+  I+G+
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWGI

KAA0037650.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.1e-3969.57Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        ++ + LL  +  F     +RMDVKSAFLN YLNEEV+VAQPKGFVDSEFPQ+VYKLNKALYGLKQAPRAWYERLT+YL ++GYS+G T++TLFINRTS++
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LIVAQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

KAA0039035.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.1e-3981.44Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG
        Y+MD+KSAFLN YLN+EV+VAQPKGFVDSEFPQHVYKLNKALYGLKQAP+AWYE LTIYL +KGYS+GG DKTLFINRTSS+LIVAQIY+D+    G
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG

TYK01623.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.8e-3972.17Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        ++ + LL  +        Y+MDVKSAF+N YLNEEVFV QPK  V+SEFP+HVYKLNKALYGLKQAPRAWYERLTIYL DKGYS+GGTDKTLFINRTSSE
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LIVAQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

TYK29824.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.9e-4174.78Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        +K + LL  +        Y+MDVKSAFLN YLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWY+RLTIYL DKGYS+ GT KTLFINRTSSE
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LI+AQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

TrEMBL top hitse value%identityAlignment
A0A5A7SXM5 Putative gag-pol polyprotein9.4e-4185.71Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWGI
        Y+MDVKSAFLN YLNEEVFVAQPKGFVDSEFP HVYKLNKALYGLKQAPRAWYERLTIYL DKGYS+ GT KTLFINRTSSELI+AQIY+D+  I+G+
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWGI

A0A5A7T2Q0 Gag-pol polyprotein3.9e-3969.57Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        ++ + LL  +  F     +RMDVKSAFLN YLNEEV+VAQPKGFVDSEFPQ+VYKLNKALYGLKQAPRAWYERLT+YL ++GYS+G T++TLFINRTS++
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LIVAQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

A0A5A7TCA8 Gag-pol polyprotein3.9e-3981.44Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG
        Y+MD+KSAFLN YLN+EV+VAQPKGFVDSEFPQHVYKLNKALYGLKQAP+AWYE LTIYL +KGYS+GG DKTLFINRTSS+LIVAQIY+D+    G
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG

A0A5D3BPJ2 Gag-pol polyprotein2.3e-3972.17Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        ++ + LL  +        Y+MDVKSAF+N YLNEEVFV QPK  V+SEFP+HVYKLNKALYGLKQAPRAWYERLTIYL DKGYS+GGTDKTLFINRTSSE
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LIVAQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

A0A5D3E1T0 Putative gag-pol polyprotein1.9e-4174.78Show/hide
Query:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE
        +K + LL  +        Y+MDVKSAFLN YLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWY+RLTIYL DKGYS+ GT KTLFINRTSSE
Subjt:  MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSE

Query:  LIVAQIYIDEYYIWG
        LI+AQIY+D+    G
Subjt:  LIVAQIYIDEYYIWG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-1038.14Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFI--NRTSSELIVAQIYIDEYYI
        ++MDVK+AFLN  L EE+++  P+G   S    +V KLNKA+YGLKQA R W+E     L +  +     D+ ++I      +E I   +Y+D+  I
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFI--NRTSSELIVAQIYIDEYYI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-1539.6Show/hide
Query:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTS-SELIVAQIYIDEYYIWGISQ
        ++DVK+AFL+  L EE+++ QP+GF  +     V KLNK+LYGLKQAPR WY +   ++  + Y K  +D  ++  R S +  I+  +Y+D+  I G  +
Subjt:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTS-SELIVAQIYIDEYYIWGISQ

Query:  G
        G
Subjt:  G

P25600 Putative transposon Ty5-1 protein YCL074W1.8e-1236.56Show/hide
Query:  MDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYI
        MDV +AFLN  ++E ++V QP GFV+   P +V++L   +YGLKQAP  W E +   L   G+ +   +  L+   TS   I   +Y+D+  +
Subjt:  MDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-1842.71Show/hide
Query:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG
        ++DV +AFL   L ++V+++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+    +D +LF+ +    ++   +Y+D+  I G
Subjt:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-1843.75Show/hide
Query:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG
        ++DV +AFL   L +EV+++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+    +D +LF+ +    +I   +Y+D+  I G
Subjt:  RMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYIWG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-1233.33Show/hide
Query:  YRMDVKSAFLNDYLNEEVFVAQPKGFV----DSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYI
        +++D+ +AFLN  L+EE+++  P G+     DS  P  V  L K++YGLKQA R W+ + ++ L   G+ +  +D T F+  T++  +   +Y+D+  I
Subjt:  YRMDVKSAFLNDYLNEEVFVAQPKGFV----DSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDEYYI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATGTTTGCACCTGTTGCCAGACTTGAAGCTATTCACCTTCTGCTCGGAATATCGAATGGATGTCAAAAGTGCCTTTTTAAATGATTACTTGAATGAGGAAGTCTT
TGTAGCTCAACCAAAAGGGTTTGTTGATTCTGAATTTCCTCAGCATGTTTACAAACTAAATAAAGCTTTGTATGGGTTAAAGCAAGCTCCTCGAGCTTGGTATGAGCGCT
TAACAATCTATCTGAGTGATAAAGGATACTCTAAAGGTGGAACTGATAAGACATTATTTATAAATAGAACCAGTAGTGAGCTCATTGTAGCACAAATTTATATTGATGAA
TATTATATTTGGGGGATTTCCCAAGGCACTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATGTTTGCACCTGTTGCCAGACTTGAAGCTATTCACCTTCTGCTCGGAATATCGAATGGATGTCAAAAGTGCCTTTTTAAATGATTACTTGAATGAGGAAGTCTT
TGTAGCTCAACCAAAAGGGTTTGTTGATTCTGAATTTCCTCAGCATGTTTACAAACTAAATAAAGCTTTGTATGGGTTAAAGCAAGCTCCTCGAGCTTGGTATGAGCGCT
TAACAATCTATCTGAGTGATAAAGGATACTCTAAAGGTGGAACTGATAAGACATTATTTATAAATAGAACCAGTAGTGAGCTCATTGTAGCACAAATTTATATTGATGAA
TATTATATTTGGGGGATTTCCCAAGGCACTTGTTGA
Protein sequenceShow/hide protein sequence
MKCLHLLPDLKLFTFCSEYRMDVKSAFLNDYLNEEVFVAQPKGFVDSEFPQHVYKLNKALYGLKQAPRAWYERLTIYLSDKGYSKGGTDKTLFINRTSSELIVAQIYIDE
YYIWGISQGTC