; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0224091 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0224091
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:13916024..13916509
RNA-Seq ExpressionCmc08g0224091
SyntenyCmc08g0224091
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.3e-7183.85Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI + QEQK+CKLQKSIYGLKQASRSWNI+FDT IKSY FEQNVDEPCVYKK+V+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        +VAFL+LY+DDILLIGNDV YLT++KKWL  QFQMKDLG+AQY+LGIQIVRN KN+TLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-7083.23Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        M+K IRILLSI TFYDYEIWQMDVKTAFLNGNLEESIYM QPEGFI+KGQEQK+CKLQKSIYGLKQASRSWNI+FDT IKSY FEQNVDEPCVYK++++S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
         VAFLVLY+DDILLIGN+V +LT+IK+WL  QFQMKDLG AQYVLGIQIV+N KN+TLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

KAA0061927.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-7286.96Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQMDVKTAFLNGNLE S+YMAQP+GFIK GQEQK+CKLQKSIY LKQASRSWNIKFDT IK+Y FEQNV+EPCVYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILLIGNDVGYLT+I+KWLA QFQMK LGDAQYVLGIQIVRN KNRTL +S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa]8.5e-7690.06Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQ+DVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICK QKSIYGLKQASRSWNI+FDT IKSY FEQNVD+PCVYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILL+GNDVGYLT+IKKWLA QFQMKD GDAQYVLGIQIVRN KNRTLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

TYK24002.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-7186.34Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQMDVKTAFLNGNLE S+YMAQP+GFIK GQEQK+CKLQKSIY LKQASRSWNIKFDT IK+Y FEQNV+EP VYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILLIGNDVGYLT+I+KWLA QFQMK LGDAQYVLGIQIVRN KNRTL +S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein5.2e-7183.23Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        M+K IRILLSI TFYDYEIWQMDVKTAFLNGNLEESIYM QPEGFI+KGQEQK+CKLQKSIYGLKQASRSWNI+FDT IKSY FEQNVDEPCVYK++++S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
         VAFLVLY+DDILLIGN+V +LT+IK+WL  QFQMKDLG AQYVLGIQIV+N KN+TLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

A0A5A7V4T1 Gag/pol protein1.6e-7286.96Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQMDVKTAFLNGNLE S+YMAQP+GFIK GQEQK+CKLQKSIY LKQASRSWNIKFDT IK+Y FEQNV+EPCVYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILLIGNDVGYLT+I+KWLA QFQMK LGDAQYVLGIQIVRN KNRTL +S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

A0A5D3CDT9 Gag/pol protein4.1e-7690.06Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQ+DVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICK QKSIYGLKQASRSWNI+FDT IKSY FEQNVD+PCVYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILL+GNDVGYLT+IKKWLA QFQMKD GDAQYVLGIQIVRN KNRTLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

A0A5D3DKH2 Gag/pol protein4.0e-7186.34Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFYDYEIWQMDVKTAFLNGNLE S+YMAQP+GFIK GQEQK+CKLQKSIY LKQASRSWNIKFDT IK+Y FEQNV+EP VYKKVV+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        IVAFLVLY+DDILLIGNDVGYLT+I+KWLA QFQMK LGDAQYVLGIQIVRN KNRTL +S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

E2GK51 Gag/pol protein (Fragment)6.1e-7283.85Show/hide
Query:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS
        MLK IRILLSI TFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI + QEQK+CKLQKSIYGLKQASRSWNI+FDT IKSY FEQNVDEPCVYKK+V+S
Subjt:  MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDS

Query:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
        +VAFL+LY+DDILLIGNDV YLT++KKWL  QFQMKDLG+AQY+LGIQIVRN KN+TLA+S
Subjt:  IVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-2338.1Show/hide
Query:  RILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVY---KKVVDSIV
        R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         +CKL K+IYGLKQA+R W   F+  +K  +F  +  + C+Y   K  ++  +
Subjt:  RILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVY---KKVVDSIV

Query:  AFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQI
         +++LY+DD+++   D+  + N K++L  +F+M DL + ++ +GI+I
Subjt:  AFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-3648.73Show/hide
Query:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVY-KKVVDSIVA
        IR +LS+    D E+ Q+DVKTAFL+G+LEE IYM QPEGF   G++  +CKL KS+YGLKQA R W +KFD+ +KS  + +   +PCVY K+  ++   
Subjt:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVY-KKVVDSIVA

Query:  FLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS
         L+LY+DD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR   +R L +S
Subjt:  FLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS

P25600 Putative transposon Ty5-1 protein YCL074W1.5e-1432.81Show/hide
Query:  MDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAFLVLYIDDILLIGNDVGY
        MDV TAFLN  ++E IY+ QP GF+ +     + +L   +YGLKQA   WN   +  +K   F ++  E  +Y +       ++ +Y+DD+L+       
Subjt:  MDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAFLVLYIDDILLIGNDVGY

Query:  LTNIKKWLAMQFQMKDLGDAQYVLGIQI
           +K+ L   + MKDLG     LG+ I
Subjt:  LTNIKKWLAMQFQMKDLGDAQYVLGIQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-2336.73Show/hide
Query:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAF
        IRI+L +     + I Q+DV  AFL G L + +YM+QP GFI K +   +CKL+K++YGLKQA R+W ++    + +  F  +V +  ++       + +
Subjt:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAF

Query:  LVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVR
        +++Y+DDIL+ GND   L N    L+ +F +KD  +  Y LGI+  R
Subjt:  LVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.6e-2335.37Show/hide
Query:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAF
        IRI+L +     + I Q+DV  AFL G L + +YM+QP GF+ K +   +C+L+K+IYGLKQA R+W ++  T + +  F  ++ +  ++       + +
Subjt:  IRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAF

Query:  LVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVR
        +++Y+DDIL+ GND   L +    L+ +F +K+  D  Y LGI+  R
Subjt:  LVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-2534.19Show/hide
Query:  LKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQE----QKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKV
        L  ++++L+I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      +C L+KSIYGLKQASR W +KF   +  + F Q+  +   + K+
Subjt:  LKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQE----QKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKV

Query:  VDSIVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRN
          ++   +++Y+DDI++  N+   +  +K  L   F+++DLG  +Y LG++I R+
Subjt:  VDSIVAFLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRN

ATMG00810.1 DNA/RNA polymerases superfamily protein8.3e-0550Show/hide
Query:  FLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQI
        +L+LY+DDILL G+    L  +   L+  F MKDLG   Y LGIQI
Subjt:  FLVLYIDDILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAGTTGATTAGAATACTCTTATCCATCACCACCTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTCTGAATGGAAATCTTGAGGAGAGTAT
CTATATGGCACAACCAGAGGGATTTATTAAAAAGGGTCAAGAACAAAAAATTTGTAAGCTTCAAAAATCCATATATGGTTTGAAGCAAGCATCTAGATCCTGGAATATAA
AATTTGATACTAGGATCAAGTCTTATGACTTTGAACAAAATGTTGACGAACCTTGTGTTTACAAAAAGGTTGTCGATTCCATTGTAGCATTCTTAGTATTATATATAGAT
GATATTCTACTCATTGGAAATGACGTAGGTTATCTAACTAATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATTTGGGAGATGCACAATATGTTCTTGGAAT
CCAAATTGTTCGGAACCATAAAAATAGAACACTAGCCGTGTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTAAGTTGATTAGAATACTCTTATCCATCACCACCTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTTTCTGAATGGAAATCTTGAGGAGAGTAT
CTATATGGCACAACCAGAGGGATTTATTAAAAAGGGTCAAGAACAAAAAATTTGTAAGCTTCAAAAATCCATATATGGTTTGAAGCAAGCATCTAGATCCTGGAATATAA
AATTTGATACTAGGATCAAGTCTTATGACTTTGAACAAAATGTTGACGAACCTTGTGTTTACAAAAAGGTTGTCGATTCCATTGTAGCATTCTTAGTATTATATATAGAT
GATATTCTACTCATTGGAAATGACGTAGGTTATCTAACTAATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATTTGGGAGATGCACAATATGTTCTTGGAAT
CCAAATTGTTCGGAACCATAAAAATAGAACACTAGCCGTGTCTTAA
Protein sequenceShow/hide protein sequence
MLKLIRILLSITTFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIKKGQEQKICKLQKSIYGLKQASRSWNIKFDTRIKSYDFEQNVDEPCVYKKVVDSIVAFLVLYID
DILLIGNDVGYLTNIKKWLAMQFQMKDLGDAQYVLGIQIVRNHKNRTLAVS