; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227441 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227441
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:20515841..20516236
RNA-Seq ExpressionCmc08g0227441
SyntenyCmc08g0227441
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-5887.79Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIA FYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-5887.79Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLN NLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein8.3e-5987.79Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIA FYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

A0A5A7TZD0 Gag/pol protein1.7e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

A0A5A7USZ2 Gag/pol protein1.1e-5887.79Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLN NLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

A0A5A7UYE8 Gag/pol protein1.7e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

A0A5D3CYF4 Gag/pol protein1.7e-5988.55Show/hide
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIF+SQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNI+FDTAIKSY FDQNV+E C+Y+KINK 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA
         VAFLVLY DDILLIGND+GYL DVK WLAA
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVKTWLAA

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.6e-2040Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIY--EKINK
        + S R +LS+   Y+ ++ QMDVKT FLNG L+E I++  P+G  I      VCKLN++IYGLKQA+R W   F+ A+K  +F  +  + CIY  +K N 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIY--EKINK

Query:  STVAFLVLYADDILLIGNDMGYLIDVKTWL
        +   +++LY DD+++   DM  + + K +L
Subjt:  STVAFLVLYADDILLIGNDMGYLIDVKTWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-2845.24Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIY-EKINKS
        + SIR +LS+A   D E+ Q+DVKT FL+G+LEE I++ QPEGF + G++  VCKLN+S+YGLKQA R W +KFD+ +KS  + +  ++ C+Y ++ +++
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIY-EKINKS

Query:  TVAFLVLYADDILLIGNDMGYLIDVK
            L+LY DD+L++G D G +  +K
Subjt:  TVAFLVLYADDILLIGNDMGYLIDVK

P25600 Putative transposon Ty5-1 protein YCL074W6.8e-1033.33Show/hide
Query:  MDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVAFLVLYADDILL
        MDV T FLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K   F ++  E  +Y +       ++ +Y DD+L+
Subjt:  MDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVAFLVLYADDILL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-1836.52Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVA
        SIRI+L +A    + I Q+DV   FL G L + +++SQP GFI + +   VCKL +++YGLKQA R+W ++    + +  F  +V++  ++      ++ 
Subjt:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVA

Query:  FLVLYADDILLIGND
        ++++Y DDIL+ GND
Subjt:  FLVLYADDILLIGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-1835.65Show/hide
Query:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVA
        SIRI+L +A    + I Q+DV   FL G L + +++SQP GF+ + +   VC+L ++IYGLKQA R+W ++  T + +  F  ++++  ++      ++ 
Subjt:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVA

Query:  FLVLYADDILLIGND
        ++++Y DDIL+ GND
Subjt:  FLVLYADDILLIGND

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.3e-1934.09Show/hide
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFII-QGQE---QKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKI
        L S++++L+I+  Y++ + Q+D+   FLNG+L+E I++  P G+   QG       VC L +SIYGLKQASR W +KF   +  + F Q+ ++   + KI
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFII-QGQE---QKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKI

Query:  NKSTVAFLVLYADDILLIGNDMGYLIDVKTWL
          +    +++Y DDI++  N+   + ++K+ L
Subjt:  NKSTVAFLVLYADDILLIGNDMGYLIDVKTWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTAT
CTTTATATCTCAGCCCGAGGGGTTCATAATTCAAGGTCAGGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTA
AGTTTGATACTGCAATCAAATCCTACGATTTTGACCAAAACGTTAATGAACTTTGTATATATGAGAAAATCAACAAAAGTACAGTAGCTTTCTTAGTACTTTATGCGGAT
GATATCCTCCTCATTGGGAATGATATGGGATACCTCATCGACGTTAAAACTTGGTTAGCAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAGTCTATAAGGATTCTCTTGTCCATAGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTAT
CTTTATATCTCAGCCCGAGGGGTTCATAATTCAAGGTCAGGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTA
AGTTTGATACTGCAATCAAATCCTACGATTTTGACCAAAACGTTAATGAACTTTGTATATATGAGAAAATCAACAAAAGTACAGTAGCTTTCTTAGTACTTTATGCGGAT
GATATCCTCCTCATTGGGAATGATATGGGATACCTCATCGACGTTAAAACTTGGTTAGCAGCCTAA
Protein sequenceShow/hide protein sequence
MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIFISQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIKFDTAIKSYDFDQNVNELCIYEKINKSTVAFLVLYAD
DILLIGNDMGYLIDVKTWLAA