CuGenDBv2

Cmc04g0095941 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0095941
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr04:9808414..9808788
RNA-Seq Expression: Cmc04g0095941
Synteny: Cmc04g0095941
Gene Ontology terms:
GO:0000105 - histidine biosynthetic process (biological process)
GO:0098542 - defense response to other organism (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0060320 - rejection of self pollen (biological process)
GO:0015074 - DNA integration (biological process)
GO:0007165 - signal transduction (biological process)
GO:0006508 - proteolysis (biological process)
GO:0006351 - transcription, DNA-templated (biological process)
GO:0000428 - DNA-directed RNA polymerase complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005576 - extracellular region (cellular component)
GO:0000786 - nucleosome (cellular component)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
GO:0030527 - structural constituent of chromatin (molecular function)
GO:0043531 - ADP binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
GO:0003879 - ATP phosphoribosyltransferase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha] | e value: 5.0e-44 | % identity: 70.83
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+L DLP G+KPI  KWIFK+K + +G++++YKARLV+ G+TQK G+DYFDTYSPVTKITTIR+LI++AAI+ LLIHQMD+KT FLNGDL+EEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G E+KVCKLRKS
Subjt:  QPEGFKILGQENKVCKLRKS

PHT42147.1 hypothetical protein CQW23_16172 [Capsicum baccatum] | e value: 1.1e-43 | % identity: 71.67
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+LVDLP G+KP+  KWIFKRK K++G+I++YKARLVV G+ QK+G+DYFDTYSPVT+IT+IR LIALAA++ L IHQMD+KT FLNG+LEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G+ENKVCKL KS
Subjt:  QPEGFKILGQENKVCKLRKS

PHT43509.1 ATP phosphoribosyltransferase 1, chloroplastic [Capsicum baccatum] | e value: 2.3e-44 | % identity: 73.33
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+LVDLP G+KP+  KWIFKRK K++GSIE+YKARLVV G+ QK+G+DYFDTYSPVT+IT+IR LIALAA++ L IHQMD+KT FLNG+LEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G+ENKVCKL KS
Subjt:  QPEGFKILGQENKVCKLRKS

PNX71449.1 retrotransposon-related protein, partial [Trifolium pratense] | e value: 6.0e-45 | % identity: 71.67
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW++VDLP G+KPI CKWIFK+K K +GS+E+YKARLV  GYTQK+G DYFDTYSPV ++T+IR  IALA+IH+L+IHQMD+KT FLNGDLEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G+ENKVC+L KS
Subjt:  QPEGFKILGQENKVCKLRKS

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa] | e value: 6.8e-57 | % identity: 90.91
Query:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM
        MN TW+LVDLPMG+KPIRCKWIFKRKTK NG IERYKARLVVVGYTQKQG+DYFDTYSPVTKITTIRALIALAAIH+LLIHQMD+KT FLNG+LEEEIYM
Subjt:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM

Query:  TQPEGFKILGQENKVCKLRKS
        TQPEGFKI GQENKVCKLRKS
Subjt:  TQPEGFKILGQENKVCKLRKS

TrEMBL top hits (e value, % identity, alignment)
A0A2G2WEA7 ATP phosphoribosyltransferase 1, chloroplastic | e value: 1.1e-44 | % identity: 73.33
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+LVDLP G+KP+  KWIFKRK K++GSIE+YKARLVV G+ QK+G+DYFDTYSPVT+IT+IR LIALAA++ L IHQMD+KT FLNG+LEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G+ENKVCKL KS
Subjt:  QPEGFKILGQENKVCKLRKS

A0A2K3KYW3 Retrotransposon-related protein (Fragment) | e value: 2.9e-45 | % identity: 71.67
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW++VDLP G+KPI CKWIFK+K K +GS+E+YKARLV  GYTQK+G DYFDTYSPV ++T+IR  IALA+IH+L+IHQMD+KT FLNGDLEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + G+ENKVC+L KS
Subjt:  QPEGFKILGQENKVCKLRKS

A0A2N9EQT1 Integrase catalytic domain-containing protein | e value: 4.9e-45 | % identity: 71.67
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+LV+LP G K I  KW+FK+K K++GSIE++KARLV  GYTQK+GIDYFDTYSPVT++TTIR L+A+A+I+ L+IHQMD+KT FLNGDL+EEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + GQENKVCKLRKS
Subjt:  QPEGFKILGQENKVCKLRKS

A0A2N9H4B0 Uncharacterized protein | e value: 4.9e-45 | % identity: 71.67
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N TW+LV+LP G K I  KW+FK+K K++GSIE++KARLV  GYTQK+GIDYFDTYSPVT++TTIR L+A+A+I+ L+IHQMD+KT FLNGDL+EEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF + GQENKVCKLRKS
Subjt:  QPEGFKILGQENKVCKLRKS

A0A5D3C5T2 Ty1-copia retrotransposon protein | e value: 3.3e-57 | % identity: 90.91
Query:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM
        MN TW+LVDLPMG+KPIRCKWIFKRKTK NG IERYKARLVVVGYTQKQG+DYFDTYSPVTKITTIRALIALAAIH+LLIHQMD+KT FLNG+LEEEIYM
Subjt:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM

Query:  TQPEGFKILGQENKVCKLRKS
        TQPEGFKI GQENKVCKLRKS
Subjt:  TQPEGFKILGQENKVCKLRKS

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 3.0e-23 | % identity: 43.8
Query:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM
        +N TW +   P     +  +W+F  K    G+  RYKARLV  G+TQK  IDY +T++PV +I++ R +++L   ++L +HQMD+KT FLNG L+EEIYM
Subjt:  MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM

Query:  TQPEGFKILGQENKVCKLRKS
          P+G  I    + VCKL K+
Subjt:  TQPEGFKILGQENKVCKLRKS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.7e-34 | % identity: 55
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT
        N T+ LV+LP G +P++CKW+FK K   +  + RYKARLVV G+ QK+GID+ + +SPV K+T+IR +++LAA   L + Q+D+KT FL+GDLEEEIYM 
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMT

Query:  QPEGFKILGQENKVCKLRKS
        QPEGF++ G+++ VCKL KS
Subjt:  QPEGFKILGQENKVCKLRKS

P92520 Uncharacterized mitochondrial protein AtMg00820 | e value: 1.6e-13 | % identity: 45.83
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALA
        N+TW LV  P+    + CKW+FK K  S+G+++R KARLV  G+ Q++GI + +TYSPV +  TIR ++ +A
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.3e-26 | % identity: 48.76
Query:  NQTWDLVDLPMGSKPI-RCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM
        N TWDLV  P     I  C+WIF +K  S+GS+ RYKARLV  GY Q+ G+DY +T+SPV K T+IR ++ +A   S  I Q+D+   FL G L +++YM
Subjt:  NQTWDLVDLPMGSKPI-RCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM

Query:  TQPEGFKILGQENKVCKLRKS
        +QP GF    + N VCKLRK+
Subjt:  TQPEGFKILGQENKVCKLRKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.7e-26 | % identity: 48.76
Query:  NQTWDLVDLPMGSKPI-RCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM
        N TWDLV  P  S  I  C+WIF +K  S+GS+ RYKARLV  GY Q+ G+DY +T+SPV K T+IR ++ +A   S  I Q+D+   FL G L +E+YM
Subjt:  NQTWDLVDLPMGSKPI-RCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYM

Query:  TQPEGFKILGQENKVCKLRKS
        +QP GF    + + VC+LRK+
Subjt:  TQPEGFKILGQENKVCKLRKS

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 5.6e-33 | % identity: 50.82
Query:  TWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMTQP
        TW++  LP   KPI CKW++K K  S+G+IERYKARLV  GYTQ++GID+ +T+SPV K+T+++ ++A++AI++  +HQ+DI   FLNGDL+EEIYM  P
Subjt:  TWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMTQP

Query:  EGFKILGQE----NKVCKLRKS
         G+     +    N VC L+KS
Subjt:  EGFKILGQE----NKVCKLRKS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 1.2e-14 | % identity: 45.83
Query:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALA
        N+TW LV  P+    + CKW+FK K  S+G+++R KARLV  G+ Q++GI + +TYSPV +  TIR ++ +A
Subjt:  NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALA
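
The % identity values listed for each hit can be related to the alignment rows. The following is a minimal sketch, assuming the usual BLAST convention that the middle row shows a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch, and that percent identity is the number of identities divided by the alignment length. The function name percent_identity is hypothetical and is not part of CuGenDB; the two strings are copied from the ATMG00820.1 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style query row and its match row.

    Assumes letters in the match row mark identical residues and that the
    alignment length equals the length of the query row (gaps included).
    """
    if not query:
        raise ValueError("empty alignment")
    # Only letters count as identities; '+' and spaces do not.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)


# Query and match rows of the ATMG00820.1 hit, copied from the record.
QUERY = "NQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALA"
MIDLINE = "N+TW LV  P+    + CKW+FK K  S+G+++R KARLV  G+ Q++GI + +TYSPV +  TIR ++ +A"

print(round(percent_identity(QUERY, MIDLINE), 2))  # 45.83 (33 identities over 72 columns)
```

For alignments that span several Query blocks, the rows would need to be concatenated block by block before calling the function.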


Sequences
CDS sequence
ATGAATCAAACATGGGACTTAGTGGACCTTCCCATGGGAAGCAAGCCAATTAGGTGTAAGTGGATCTTCAAAAGAAAAACAAAATCAAATGGGTCAATAGAAAGATACAA
GGCTAGATTAGTGGTAGTAGGGTATACCCAGAAACAAGGCATTGATTACTTTGACACATATTCCCCTGTAACTAAGATAACCACAATTAGGGCCTTGATTGCATTAGCTG
CCATACATAGCCTTCTTATTCACCAAATGGACATAAAAACTGTCTTTCTAAATGGTGACTTAGAAGAAGAAATTTATATGACACAACCAGAAGGTTTTAAAATTCTTGGC
CAAGAAAACAAAGTGTGTAAACTGAGAAAATCCTGTGTAAACTGA
mRNA sequence
ATGAATCAAACATGGGACTTAGTGGACCTTCCCATGGGAAGCAAGCCAATTAGGTGTAAGTGGATCTTCAAAAGAAAAACAAAATCAAATGGGTCAATAGAAAGATACAA
GGCTAGATTAGTGGTAGTAGGGTATACCCAGAAACAAGGCATTGATTACTTTGACACATATTCCCCTGTAACTAAGATAACCACAATTAGGGCCTTGATTGCATTAGCTG
CCATACATAGCCTTCTTATTCACCAAATGGACATAAAAACTGTCTTTCTAAATGGTGACTTAGAAGAAGAAATTTATATGACACAACCAGAAGGTTTTAAAATTCTTGGC
CAAGAAAACAAAGTGTGTAAACTGAGAAAATCCTGTGTAAACTGA
Protein sequence
MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMTQPEGFKILG
QENKVCKLRKSCVN
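
The CDS, protein sequence, and genome location in this record can be cross-checked against each other. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1) and inclusive start..end coordinates. The helper names translate and span_length are hypothetical and not part of CuGenDB; the sequences and the location string are copied from the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bp of a 'seqid:start..end' location (inclusive coordinates assumed)."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# CDS and protein copied from the record (line wraps removed).
CDS = (
    "ATGAATCAAACATGGGACTTAGTGGACCTTCCCATGGGAAGCAAGCCAATTAGGTGTAAGTGGATCTTCAAAAGAAAAACAAAATCAAATGGGTCAATAGAAAGATACAA"
    "GGCTAGATTAGTGGTAGTAGGGTATACCCAGAAACAAGGCATTGATTACTTTGACACATATTCCCCTGTAACTAAGATAACCACAATTAGGGCCTTGATTGCATTAGCTG"
    "CCATACATAGCCTTCTTATTCACCAAATGGACATAAAAACTGTCTTTCTAAATGGTGACTTAGAAGAAGAAATTTATATGACACAACCAGAAGGTTTTAAAATTCTTGGC"
    "CAAGAAAACAAAGTGTGTAAACTGAGAAAATCCTGTGTAAACTGA"
)
PROTEIN = (
    "MNQTWDLVDLPMGSKPIRCKWIFKRKTKSNGSIERYKARLVVVGYTQKQGIDYFDTYSPVTKITTIRALIALAAIHSLLIHQMDIKTVFLNGDLEEEIYMTQPEGFKILG"
    "QENKVCKLRKSCVN"
)

print(translate(CDS) == PROTEIN)                      # True
print(len(CDS))                                       # 375
print(span_length("CMiso1.1chr04:9808414..9808788"))  # 375
```

The CDS length equals the genomic span, which is consistent with the CDS and mRNA sequences being identical in this record.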