; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc12g0325061 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc12g0325061
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr12:14971637..14972065
RNA-Seq ExpressionCmc12g0325061
SyntenyCmc12g0325061
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]8.3e-5983.82Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEGFI +DQEQKVCKLQKSIYGLKQ SRSWNIRFD AI+SYGFEQ+VDEPCVYK+I+NS VAFL LYVDDILLIGN+V +
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L D+KKWL TQFQMKDL  AQY+LGIQIVRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-6086.76Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEGFIQK QEQKVCKLQKSIYGLKQ SRSWNIRFD  I+SYGFEQ+VDEPCVYKRIINSTVAFL LYVDDILLIGN V H
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIK+WL TQFQMKDL +AQYVLGIQIV+NRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

KAA0063026.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-5885.29Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEG IQK QEQKVCKLQKSIYGLKQ SRSWNIRFD AI+SYGF+Q+V+EPCVYKRI NSTVAFL LYVD+ILLIGN+V  
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLATQFQMKDL NAQYVLGIQ VRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-5884.56Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK  FLN NLEESI M+QPE FIQK QEQK+CKLQKSIYGLKQ SRS NIRFD AI+SYG EQ+VDEPCVYKRI+NSTVAFL LYVDDILLIGN+VGH
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLA QFQMKDL NAQYVLG+QIVRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

TYK09500.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-5780.88Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        +DVK AFLNGNLEESI M QPEGFI+K QEQK+CK QKSIYGLKQ SRSWNIRFD AI+SYGFEQ+VD+PCVYK+++NS VAFL LYVDDILL+GN+VG+
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLATQFQMKD  +AQYVLGIQIVRNRKN+T
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

TrEMBL top hitse value%identityAlignment
A0A5A7TTA2 Gag/pol protein2.8e-6086.76Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEGFIQK QEQKVCKLQKSIYGLKQ SRSWNIRFD  I+SYGFEQ+VDEPCVYKRIINSTVAFL LYVDDILLIGN V H
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIK+WL TQFQMKDL +AQYVLGIQIV+NRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

A0A5A7V9B0 Gag/pol protein1.5e-5885.29Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEG IQK QEQKVCKLQKSIYGLKQ SRSWNIRFD AI+SYGF+Q+V+EPCVYKRI NSTVAFL LYVD+ILLIGN+V  
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLATQFQMKDL NAQYVLGIQ VRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

A0A5D3BX45 Gag/pol protein2.6e-5884.56Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK  FLN NLEESI M+QPE FIQK QEQK+CKLQKSIYGLKQ SRS NIRFD AI+SYG EQ+VDEPCVYKRI+NSTVAFL LYVDDILLIGN+VGH
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLA QFQMKDL NAQYVLG+QIVRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

A0A5D3CDT9 Gag/pol protein5.8e-5880.88Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        +DVK AFLNGNLEESI M QPEGFI+K QEQK+CK QKSIYGLKQ SRSWNIRFD AI+SYGFEQ+VD+PCVYK+++NS VAFL LYVDDILL+GN+VG+
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L DIKKWLATQFQMKD  +AQYVLGIQIVRNRKN+T
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

E2GK51 Gag/pol protein (Fragment)4.0e-5983.82Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDVK AFLNGNLEESI M+QPEGFI +DQEQKVCKLQKSIYGLKQ SRSWNIRFD AI+SYGFEQ+VDEPCVYK+I+NS VAFL LYVDDILLIGN+V +
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT
        L D+KKWL TQFQMKDL  AQY+LGIQIVRNRKNKT
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNKT

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-1838.17Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVY---KRIINSTVAFLALYVDDILLIGNN
        MDVK AFLNG L+E I M  P+G         VCKL K+IYGLKQ +R W   F+ A++   F     + C+Y   K  IN  + ++ LYVDD+++   +
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVY---KRIINSTVAFLALYVDDILLIGNN

Query:  VGHLPDIKKWLATQFQMKDLENAQYVLGIQI
        +  + + K++L  +F+M DL   ++ +GI+I
Subjt:  VGHLPDIKKWLATQFQMKDLENAQYVLGIQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.9e-2846.32Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVY-KRIINSTVAFLALYVDDILLIGNNVG
        +DVK AFL+G+LEE I M QPEGF    ++  VCKL KS+YGLKQ  R W ++FD  ++S  + +   +PCVY KR   +    L LYVDD+L++G + G
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVY-KRIINSTVAFLALYVDDILLIGNNVG

Query:  HLPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNK
         +  +K  L+  F MKDL  AQ +LG++IVR R ++
Subjt:  HLPDIKKWLATQFQMKDLENAQYVLGIQIVRNRKNK

P25600 Putative transposon Ty5-1 protein YCL074W1.3e-1433.59Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        MDV  AFLN  ++E I + QP GF+ +     V +L   +YGLKQ    WN   +  ++  GF +H  E  +Y R  +    ++A+YVDD+L+   +   
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQI
           +K+ L   + MKDL      LG+ I
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-2037.69Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        +DV  AFL G L + + M QP GFI KD+   VCKL+K++YGLKQ  R+W +     + + GF   V +  ++      ++ ++ +YVDDIL+ GN+   
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVR
        L +    L+ +F +KD E   Y LGI+  R
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-1935.38Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH
        +DV  AFL G L + + M QP GF+ KD+   VC+L+K+IYGLKQ  R+W +     + + GF   + +  ++      ++ ++ +YVDDIL+ GN+   
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGH

Query:  LPDIKKWLATQFQMKDLENAQYVLGIQIVR
        L      L+ +F +K+ E+  Y LGI+  R
Subjt:  LPDIKKWLATQFQMKDLENAQYVLGIQIVR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.3e-2135.56Show/hide
Query:  MDVKAAFLNGNLEESICMIQPEGFIQKDQE----QKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGN
        +D+  AFLNG+L+E I M  P G+  +  +      VC L+KSIYGLKQ SR W ++F + +  +GF Q   +   + +I  +    + +YVDDI++  N
Subjt:  MDVKAAFLNGNLEESICMIQPEGFIQKDQE----QKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGN

Query:  NVGHLPDIKKWLATQFQMKDLENAQYVLGIQIVRN
        N   + ++K  L + F+++DL   +Y LG++I R+
Subjt:  NVGHLPDIKKWLATQFQMKDLENAQYVLGIQIVRN

ATMG00810.1 DNA/RNA polymerases superfamily protein8.1e-0450Show/hide
Query:  FLALYVDDILLIGNNVGHLPDIKKWLATQFQMKDLENAQYVLGIQI
        +L LYVDDILL G++   L  +   L++ F MKDL    Y LGIQI
Subjt:  FLALYVDDILLIGNNVGHLPDIKKWLATQFQMKDLENAQYVLGIQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCAAGGCAGCCTTTTTGAATGGAAATCTTGAGGAGAGTATTTGTATGATCCAACCAGAGGGGTTTATTCAAAAGGATCAAGAACAAAAAGTTTGTAAGCTTCA
AAAATCCATATATGGATTAAAGCAAGTATCTAGATCCTGGAATATAAGATTTGATATTGCGATCAGATCTTATGGTTTTGAACAACATGTTGATGAACCTTGTGTTTACA
AAAGGATTATCAATTCTACTGTAGCATTCTTAGCTCTGTATGTAGATGACATTCTACTCATTGGGAATAATGTAGGTCATCTACCTGATATTAAGAAATGGCTAGCTACG
CAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACCGAAAGAACAAAACACGAAATGTTGTTAAGATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCAAGGCAGCCTTTTTGAATGGAAATCTTGAGGAGAGTATTTGTATGATCCAACCAGAGGGGTTTATTCAAAAGGATCAAGAACAAAAAGTTTGTAAGCTTCA
AAAATCCATATATGGATTAAAGCAAGTATCTAGATCCTGGAATATAAGATTTGATATTGCGATCAGATCTTATGGTTTTGAACAACATGTTGATGAACCTTGTGTTTACA
AAAGGATTATCAATTCTACTGTAGCATTCTTAGCTCTGTATGTAGATGACATTCTACTCATTGGGAATAATGTAGGTCATCTACCTGATATTAAGAAATGGCTAGCTACG
CAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACCGAAAGAACAAAACACGAAATGTTGTTAAGATATAA
Protein sequenceShow/hide protein sequence
MDVKAAFLNGNLEESICMIQPEGFIQKDQEQKVCKLQKSIYGLKQVSRSWNIRFDIAIRSYGFEQHVDEPCVYKRIINSTVAFLALYVDDILLIGNNVGHLPDIKKWLAT
QFQMKDLENAQYVLGIQIVRNRKNKTRNVVKI