; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0070231 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0070231
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr03:15949854..15950225
RNA-Seq ExpressionCmc03g0070231
SyntenyCmc03g0070231
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006952 - defense response (biological process)
GO:0007165 - signal transduction (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043531 - ADP binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033349.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.5e-5387.8Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQ YLKFNDTITSFGFKENI+DRCIY+KIS SKFI+L+LYVDDILLAT DF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI+IEIFRDR HGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

KAA0052755.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
         EASYVI IEIFRDRTHGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

RYE20331.1 hypothetical protein EOP45_11235, partial [Sphingobacteriaceae bacterium]1.0e-4982.93Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GF+V+GKEH V KLK+SIYGLKQASRQWYLKFNDT+TSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATND  LL  TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI IEI RDR+ G LGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

TYK00088.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.7e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI IEIFRDRTHGLL LS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

TYK04201.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
         EASYVI IEIFRDRTHGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

TrEMBL top hitse value%identityAlignment
A0A4V1T029 Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment)5.0e-5082.93Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GF+V+GKEH V KLK+SIYGLKQASRQWYLKFNDT+TSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATND  LL  TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI IEI RDR+ G LGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

A0A5A7SVZ5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-5387.8Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQ YLKFNDTITSFGFKENI+DRCIY+KIS SKFI+L+LYVDDILLAT DF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI+IEIFRDR HGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

A0A5A7UG95 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
         EASYVI IEIFRDRTHGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

A0A5D3BLU0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
        GEASYVI IEIFRDRTHGLL LS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

A0A5D3BWW5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-5590.24Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        MDQP+GFMVEGKEH V KLKRSIYGLKQASRQWYLKFNDTITSFGFKENI+DRCIY+KIS SKFIIL+LYVDDILLATNDF LLC TKEFLSKN EMKDM
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFRDRTHGLLGLS
         EASYVI IEIFRDRTHGLLGLS
Subjt:  GEASYVIEIEIFRDRTHGLLGLS

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.8e-1238.78Show/hide
Query:  VSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYI--KISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGEASYVIEIEI
        V KL ++IYGLKQA+R W+  F   +    F  + +DRCIYI  K + ++ I ++LYVDD+++AT D + + + K +L +   M D+ E  + I I I
Subjt:  VSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYI--KISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGEASYVIEIEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-2547.58Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIK-ISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKD
        M+QP+GF V GK+H V KL +S+YGLKQA RQWY+KF+  + S  + +   D C+Y K  S + FIIL+LYVDD+L+   D  L+   K  LSK+ +MKD
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIK-ISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKD

Query:  MGEASYVIEIEIFRDRTHGLLGLS
        +G A  ++ ++I R+RT   L LS
Subjt:  MGEASYVIEIEIFRDRTHGLLGLS

P25600 Putative transposon Ty5-1 protein YCL074W5.1e-0731.19Show/hide
Query:  QPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGE
        QP GF+ E     V +L   +YGLKQA   W    N+T+   GF  +  +  +Y + +    I + +YVDD+L+A     +    K+ L+K   MKD+G+
Subjt:  QPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGE

Query:  ASYVIEIEI
            + + I
Subjt:  ASYVIEIEI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-1535.4Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        M QP GF+ + + + V KL++++YGLKQA R WY++  + + + GF  ++ D  +++       + +++YVDDIL+  ND +LL +T + LS+   +KD 
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFR
         E  Y + IE  R
Subjt:  GEASYVIEIEIFR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-1335.4Show/hide
Query:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM
        M QP GF+ + +   V +L+++IYGLKQA R WY++    + + GF  +I D  +++       I +++YVDDIL+  ND  LL  T + LS+   +K+ 
Subjt:  MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDM

Query:  GEASYVIEIEIFR
         +  Y + IE  R
Subjt:  GEASYVIEIEIFR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.5e-1740Show/hide
Query:  HTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGEASYVIEIEIFR
        + V  LK+SIYGLKQASRQW+LKF+ T+  FGF ++  D   ++KI+ + F+ +++YVDDI++ +N+ + +   K  L    +++D+G   Y + +EI R
Subjt:  HTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGEASYVIEIEIFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAACCAGACGGCTTTATGGTTGAAGGAAAGGAACATACGGTGAGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAATGGTATCTTAAGTT
TAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCATTGATCGATGTATATACATAAAGATCAGTTGGAGTAAGTTTATAATTCTTATTCTATATGTTGATGACA
TCTTGCTTGCTACGAATGACTTTAGTTTGTTATGTTCAACCAAAGAATTTCTTTCTAAAAACCTTGAAATGAAAGATATGGGTGAAGCATCCTATGTGATTGAAATTGAA
ATATTCCGTGATCGAACGCATGGATTGTTAGGATTGTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAACCAGACGGCTTTATGGTTGAAGGAAAGGAACATACGGTGAGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAATGGTATCTTAAGTT
TAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCATTGATCGATGTATATACATAAAGATCAGTTGGAGTAAGTTTATAATTCTTATTCTATATGTTGATGACA
TCTTGCTTGCTACGAATGACTTTAGTTTGTTATGTTCAACCAAAGAATTTCTTTCTAAAAACCTTGAAATGAAAGATATGGGTGAAGCATCCTATGTGATTGAAATTGAA
ATATTCCGTGATCGAACGCATGGATTGTTAGGATTGTCTTAA
Protein sequenceShow/hide protein sequence
MDQPDGFMVEGKEHTVSKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIIDRCIYIKISWSKFIILILYVDDILLATNDFSLLCSTKEFLSKNLEMKDMGEASYVIEIE
IFRDRTHGLLGLS