; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0222441 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0222441
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:11208293..11208709
RNA-Seq ExpressionCmc08g0222441
SyntenyCmc08g0222441
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]6.4e-4064.23Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FY+Y+IWQMDVKTA LN  L      +   G + + + ++    +  ++ LK+ASRSWNI+F+ AIKSYGFEQNVDEP VY+K+VNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        +VAFL+LYVDDILLIGNDV YLTD+KKWL  QF+MKD
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

KAA0061927.1 gag/pol protein [Cucumis melo var. makuwa]5.8e-4166.9Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L         +G +K  + ++    +  ++ LK+ASRSWNIKF+  IK+YGFEQNV+EP VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ
        IVAFLVLYVDDILLIGNDVGYLTDI+KWLA QF+MK   DAQ
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ

KAA0065369.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-4070.8Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSI
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L   SI++ Q   L + K K +   ++   LK+ASRSWNI+F+ AIKSY FEQNVDEP VY+KVVNSI
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSI

Query:  VAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKDA
        V FL LYVDDILLIGNDVGYLTDIKKWLA QF+MKD+
Subjt:  VAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKDA

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-4368.61Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQ+DVKTA LN  L          G +K+ + ++   F+  ++ LK+ASRSWNI+F+ AIKSYGFEQNVD+P VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        IVAFLVLYVDDILL+GNDVGYLTDIKKWLA QF+MKD
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

TYK24002.1 gag/pol protein [Cucumis melo var. makuwa]5.8e-4166.9Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L         +G +K  + ++    +  ++ LK+ASRSWNIKF+  IK+YGFEQNV+EP VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ
        IVAFLVLYVDDILLIGNDVGYLTDI+KWLA QF+MK   DAQ
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ

TrEMBL top hitse value%identityAlignment
A0A5A7V4T1 Gag/pol protein2.8e-4166.9Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L         +G +K  + ++    +  ++ LK+ASRSWNIKF+  IK+YGFEQNV+EP VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ
        IVAFLVLYVDDILLIGNDVGYLTDI+KWLA QF+MK   DAQ
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ

A0A5A7VIS5 Gag/pol protein2.4e-4070.8Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSI
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L   SI++ Q   L + K K +   ++   LK+ASRSWNI+F+ AIKSY FEQNVDEP VY+KVVNSI
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSI

Query:  VAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKDA
        V FL LYVDDILLIGNDVGYLTDIKKWLA QF+MKD+
Subjt:  VAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKDA

A0A5D3CDT9 Gag/pol protein3.0e-4368.61Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQ+DVKTA LN  L          G +K+ + ++   F+  ++ LK+ASRSWNI+F+ AIKSYGFEQNVD+P VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        IVAFLVLYVDDILL+GNDVGYLTDIKKWLA QF+MKD
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

A0A5D3DKH2 Gag/pol protein2.8e-4166.9Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FYDY+IWQMDVKTA LN  L         +G +K  + ++    +  ++ LK+ASRSWNIKF+  IK+YGFEQNV+EP VY+KVVNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ
        IVAFLVLYVDDILLIGNDVGYLTDI+KWLA QF+MK   DAQ
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMK---DAQ

E2GK51 Gag/pol protein (Fragment)3.1e-4064.23Show/hide
Query:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS
        MLKSIRILLSIA FY+Y+IWQMDVKTA LN  L      +   G + + + ++    +  ++ LK+ASRSWNI+F+ AIKSYGFEQNVDEP VY+K+VNS
Subjt:  MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNS

Query:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        +VAFL+LYVDDILLIGNDV YLTD+KKWL  QF+MKD
Subjt:  IVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.7e-0728.99Show/hide
Query:  LKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGF-EQNVDE-PYVYEKVVN
        + S R +LS+   Y+ ++ QMDVKTA LN  L+        +G+     +         ++ LK+A+R W   F  A+K   F   +VD   Y+ +K   
Subjt:  LKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGF-EQNVDE-PYVYEKVVN

Query:  SIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        +   +++LYVDD+++   D+  + + K++L  +F M D
Subjt:  SIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-1235.51Show/hide
Query:  LKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNP--LHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVY-EKVVN
        + SIR +LS+AA  D ++ Q+DVKTA L+  L    I++ Q    +    K  V   N     LK+A R W +KF+  +KS  + +   +P VY ++   
Subjt:  LKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNP--LHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVY-EKVVN

Query:  SIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        +    L+LYVDD+L++G D G +  +K  L+  F+MKD
Subjt:  SIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-0728.36Show/hide
Query:  SIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSIVA
        SIRI+L +A    + I Q+DV  A L   L          G + + +       R  L+ LK+A R+W ++    + + GF  +V +  ++       + 
Subjt:  SIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSIVA

Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        ++++YVDDIL+ GND   L +    L+ +F +KD
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.7e-0726.12Show/hide
Query:  SIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSIVA
        SIRI+L +A    + I Q+DV  A L   L          G + + +       R  ++ LK+A R+W ++    + + GF  ++ +  ++       + 
Subjt:  SIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLH-LKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSIVA

Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
        ++++YVDDIL+ GND   L      L+ +F +K+
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-1026.43Show/hide
Query:  LKSIRILLSIAAFYDYQIWQMDVKTAILN-----EILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKV
        L S++++L+I+A Y++ + Q+D+  A LN     EI  ++      R       N      ++   LK+ASR W +KF++ +  +GF Q+  +   + K+
Subjt:  LKSIRILLSIAAFYDYQIWQMDVKTAILN-----EILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKV

Query:  VNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD
          ++   +++YVDDI++  N+   + ++K  L   F+++D
Subjt:  VNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFEMKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAGTCGATTAGAATACTCTTATCCATCGCCGCCTTTTATGATTATCAAATTTGGCAGATGGATGTCAAGACAGCCATTTTGAATGAAATCTTGAGGAGAGTATC
TATATGGCTCAACCAGAGGGGTTTATTAAAAAGGGTCAAGAACAAAGAGTTTGTAAGCTTCAGAAATCCATTGCATTTGAAGAAAGCATCTAGATCCTGGAATATAAAAT
TTAATATTGCGATCAAGTCTTATGGCTTTGAACAAAATGTTGACGAACCTTATGTTTATGAAAAAGTCGTCAATTCCATTGTAGCATTCTTAGTATTATATGTAGATGAT
ATTCTACTCATTGGGAATGACGTAGGTTATCTTACTGATATTAAGAAATGGCTAGCTATGCAATTTGAAATGAAAGATGCACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTAAGTCGATTAGAATACTCTTATCCATCGCCGCCTTTTATGATTATCAAATTTGGCAGATGGATGTCAAGACAGCCATTTTGAATGAAATCTTGAGGAGAGTATC
TATATGGCTCAACCAGAGGGGTTTATTAAAAAGGGTCAAGAACAAAGAGTTTGTAAGCTTCAGAAATCCATTGCATTTGAAGAAAGCATCTAGATCCTGGAATATAAAAT
TTAATATTGCGATCAAGTCTTATGGCTTTGAACAAAATGTTGACGAACCTTATGTTTATGAAAAAGTCGTCAATTCCATTGTAGCATTCTTAGTATTATATGTAGATGAT
ATTCTACTCATTGGGAATGACGTAGGTTATCTTACTGATATTAAGAAATGGCTAGCTATGCAATTTGAAATGAAAGATGCACAATAA
Protein sequenceShow/hide protein sequence
MLKSIRILLSIAAFYDYQIWQMDVKTAILNEILRRVSIWLNQRGLLKRVKNKEFVSFRNPLHLKKASRSWNIKFNIAIKSYGFEQNVDEPYVYEKVVNSIVAFLVLYVDD
ILLIGNDVGYLTDIKKWLAMQFEMKDAQ