; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0102101 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0102101
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:18767850..18768425
RNA-Seq ExpressionCmc04g0102101
SyntenyCmc04g0102101
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040942.1 pol protein [Cucumis melo var. makuwa]2.0e-7779.06Show/hide
Query:  MSFGLTNAPTVFMDLMN----------------------------REHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMN                             EHLHQVLETLRAN+LYAKFFKCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMN----------------------------REHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE SFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

KAA0054634.1 pol protein [Cucumis melo var. makuwa]1.2e-7779.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

KAA0058812.1 pol protein [Cucumis melo var. makuwa]1.2e-7779.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

KAA0063098.1 pol protein [Cucumis melo var. makuwa]2.7e-7779.06Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE SFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

KAA0066951.1 pol protein [Cucumis melo var. makuwa]1.2e-7779.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

TrEMBL top hitse value%identityAlignment
A0A5A7TBV4 Pol protein9.9e-7879.06Show/hide
Query:  MSFGLTNAPTVFMDLMN----------------------------REHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMN                             EHLHQVLETLRAN+LYAKFFKCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMN----------------------------REHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE SFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

A0A5A7UHL7 Reverse transcriptase5.8e-7879.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

A0A5A7USG7 Reverse transcriptase5.8e-7879.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

A0A5A7V646 Reverse transcriptase1.3e-7779.06Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE SFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

A0A5A7VMR4 Reverse transcriptase5.8e-7879.58Show/hide
Query:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        MSFGLTNAP VFMDLMNR                            EHLHQVLETLRAN+LYAKF KCEFWL+KV+FLGHVVSSEGV VDPAKIEAV NW
Subjt:  MSFGLTNAPTVFMDLMNR----------------------------EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM
        PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAP+LTVPDGSG+FVIYSDAS K LGCVLM
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.2e-2636.65Show/hide
Query:  MSFGLTNAPTVFMDLMN-------------------------REHLHQ---VLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        M FGL NAP  F   MN                          EHL     V E L    L  +  KCEF  ++ +FLGHV++ +G+  +P KIEA+  +
Subjt:  MSFGLTNAPTVFMDLMN-------------------------REHLHQ---VLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL
        P P+   EI++FLGL GYYR+F+ +F+ IA P+T+  +K       +P  +S+F++LK  +   PIL VPD +  F + +DAS   LG VL
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL

P10394 Retrovirus-related Pol polyprotein from transposon 4123.3e-2234.03Show/hide
Query:  EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWS
        ++L +V    R   L     KC F++ +V+FLGH  + +G+  D  K + + N+P P      R F+    YYRRF+++F+  +  +T+L +K  PF W+
Subjt:  EHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWS

Query:  PACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL
          C+ +F  LK +L+   +L  PD S  F I +DAS +  G VL
Subjt:  PACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL

P20825 Retrovirus-related Pol polyprotein from transposon 2972.1e-2435.08Show/hide
Query:  MSFGLTNAPTVFMDLMNR-------------------------EHLHQ---VLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        M FGL NAP  F   MN                          EHL+    V   L    L  +  KCEF  K+ +FLGH+V+ +G+  +P K++A+ ++
Subjt:  MSFGLTNAPTVFMDLMNR-------------------------EHLHQ---VLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL
        P P+   EIR+FLGL GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  PIL +PD    FV+ +DAS   LG VL
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVL

P92523 Uncharacterized mitochondrial protein AtMg008603.2e-2541.98Show/hide
Query:  HLHQVLETLRANRLYAKFFKCEFWLKKVSFLG--HVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC F   ++++LG  H++S EGV  DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W
Subjt:  HLHQVLETLRANRLYAKFFKCEFWLKKVSFLG--HVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW

Query:  SPACESSFQELKQKLVTAPILTVPDGSGSFV
        +     +F+ LK  + T P+L +PD    FV
Subjt:  SPACESSFQELKQKLVTAPILTVPDGSGSFV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus7.4e-2230.85Show/hide
Query:  MSFGLTNAPTVFMDLMN---REH-------------------------LHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW
        + FGL NAP +F  +++   REH                         L  VL +L    L     K  F   +V FLG++V+++G+  DP K+ A++  
Subjt:  MSFGLTNAPTVFMDLMN---REH-------------------------LHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNW

Query:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR-----------KGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCV
        P P++V E++ FLG+  YYR+F++D++++A PLT LTR              P         SF +LK  L ++ IL  P  +  F + +DAS   +G V
Subjt:  PRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR-----------KGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCV

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.3e-2641.98Show/hide
Query:  HLHQVLETLRANRLYAKFFKCEFWLKKVSFLG--HVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC F   ++++LG  H++S EGV  DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W
Subjt:  HLHQVLETLRANRLYAKFFKCEFWLKKVSFLG--HVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW

Query:  SPACESSFQELKQKLVTAPILTVPDGSGSFV
        +     +F+ LK  + T P+L +PD    FV
Subjt:  SPACESSFQELKQKLVTAPILTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGGTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGGGAGCATTTGCACCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTACGCCAAGTT
CTTCAAGTGTGAGTTTTGGCTGAAGAAGGTATCTTTCCTTGGACATGTGGTGTCCAGTGAGGGAGTCTTTGTGGATCCAGCAAAGATCGAAGCGGTTAACAATTGGCCTC
GACCGTCTACAGTTAGTGAGATTCGTAGTTTTCTGGGCTTGGCAGGCTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGTCCCTTGACCCAGTTGACCAGG
AAGGGAACTCCGTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAAGAGCTTAAACAGAAGCTAGTGACTGCACCAATCCTGACAGTGCCCGATGGGTCGGGAAGTTT
TGTGATCTACAGTGATGCCTCCATAAAGAGACTGGGCTGTGTGCTGATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGGTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGGGAGCATTTGCACCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTACGCCAAGTT
CTTCAAGTGTGAGTTTTGGCTGAAGAAGGTATCTTTCCTTGGACATGTGGTGTCCAGTGAGGGAGTCTTTGTGGATCCAGCAAAGATCGAAGCGGTTAACAATTGGCCTC
GACCGTCTACAGTTAGTGAGATTCGTAGTTTTCTGGGCTTGGCAGGCTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGTCCCTTGACCCAGTTGACCAGG
AAGGGAACTCCGTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAAGAGCTTAAACAGAAGCTAGTGACTGCACCAATCCTGACAGTGCCCGATGGGTCGGGAAGTTT
TGTGATCTACAGTGATGCCTCCATAAAGAGACTGGGCTGTGTGCTGATGTAG
Protein sequenceShow/hide protein sequence
MSFGLTNAPTVFMDLMNREHLHQVLETLRANRLYAKFFKCEFWLKKVSFLGHVVSSEGVFVDPAKIEAVNNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR
KGTPFVWSPACESSFQELKQKLVTAPILTVPDGSGSFVIYSDASIKRLGCVLM