; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0040341 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0040341
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr02:2244923..2245351
RNA-Seq ExpressionCmc02g0040341
SyntenyCmc02g0040341
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU21337.1 hypothetical protein TSUD_189240 [Trifolium subterraneum]1.8e-6184.17Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVDLP GCK IGCKW+L+KKLK DG+V+KYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISIA L NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

GAU47690.1 hypothetical protein TSUD_245810 [Trifolium subterraneum]6.8e-6183.45Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVDLP GCK IGCKW+L+KKLK DG+V+KYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISI  L NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

GAU49932.1 hypothetical protein TSUD_408340 [Trifolium subterraneum]8.9e-6182.73Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVD P GCK IGCKW+L+KKLK DG+V+KYK RL+ KGFRQRENIDFFDTF+PVTRITSIRV ISIATL NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

KAA0046026.1 putative Polyprotein [Cucumis melo var. makuwa]1.4e-6189.93Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESNRTWHLVDLP GCKAIGCKWVLRKKLK DGSVDKYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISI  LNNLLIHQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
         DLEEEIYMEQ EGFIV+ QESKVCKLDKSLYGLKQA K
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

TYK22685.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]8.0e-6288.49Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESNRTWHLVDLP  CKAIGCKWVLRKK K DGS+DKYK RLV KGFRQREN+DFFDTF+ VTRITSIRVLISIA LNNLLIHQMDVK  FLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGFIV  QESKVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

TrEMBL top hitse value%identityAlignment
A0A2Z6MWZ1 Reverse transcriptase Ty1/copia-type domain-containing protein8.6e-6284.17Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVDLP GCK IGCKW+L+KKLK DG+V+KYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISIA L NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

A0A2Z6PC97 CCHC-type domain-containing protein3.3e-6183.45Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVDLP GCK IGCKW+L+KKLK DG+V+KYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISI  L NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

A0A2Z6PHW1 CCHC-type domain-containing protein4.3e-6182.73Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESN+TWHLVD P GCK IGCKW+L+KKLK DG+V+KYK RL+ KGFRQRENIDFFDTF+PVTRITSIRV ISIATL NL++HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGF+++ QE+KVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

A0A5A7TV55 Putative Polyprotein6.6e-6289.93Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESNRTWHLVDLP GCKAIGCKWVLRKKLK DGSVDKYK RLV KGFRQRENIDFFDTF+PVTRITSIRVLISI  LNNLLIHQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
         DLEEEIYMEQ EGFIV+ QESKVCKLDKSLYGLKQA K
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

A0A5D3DGJ1 Retrotransposon protein, putative, Ty1-copia sub-class3.9e-6288.49Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        INDEM SLESNRTWHLVDLP  CKAIGCKWVLRKK K DGS+DKYK RLV KGFRQREN+DFFDTF+ VTRITSIRVLISIA LNNLLIHQMDVK  FLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        GDLEEEIYMEQPEGFIV  QESKVCKLDKSLYGLKQAPK
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.7e-2744.6Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        IN E+ + + N TW +   P     +  +WV   K    G+  +YK RLV +GF Q+  ID+ +TF PV RI+S R ++S+    NL +HQMDVK AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        G L+EEIYM  P+G  +      VCKL+K++YGLKQA +
Subjt:  GDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-3754.23Show/hide
Query:  MARINDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIA
        M  + +EM SL+ N T+ LV+LP G + + CKWV + K   D  + +YK RLV KGF Q++ IDF + F+PV ++TSIR ++S+A   +L + Q+DVK A
Subjt:  MARINDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIA

Query:  FLNGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
        FL+GDLEEEIYMEQPEGF V  ++  VCKL+KSLYGLKQAP+
Subjt:  FLNGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

P92520 Uncharacterized mitochondrial protein AtMg008203.4e-1543.9Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIA
        + +E+ +L  N+TW LV  P     +GCKWV + KL  DG++D+ K RLV KGF Q E I F +T++PV R  +IR ++++A
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-3045.71Show/hide
Query:  INDEMYSLESNRTWHLV-DLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFL
        +  E+ +   N TW LV   P+    +GC+W+  KK   DGS+++YK RLV KG+ QR  +D+ +TF+PV + TSIR+++ +A   +  I Q+DV  AFL
Subjt:  INDEMYSLESNRTWHLV-DLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFL

Query:  NGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
         G L +++YM QP GFI  D+ + VCKL K+LYGLKQAP+
Subjt:  NGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.4e-3044.29Show/hide
Query:  INDEMYSLESNRTWHLV-DLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFL
        +  E+ +   N TW LV   P     +GC+W+  KK   DGS+++YK RLV KG+ QR  +D+ +TF+PV + TSIR+++ +A   +  I Q+DV  AFL
Subjt:  INDEMYSLESNRTWHLV-DLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFL

Query:  NGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK
         G L +E+YM QP GF+  D+   VC+L K++YGLKQAP+
Subjt:  NGDLEEEIYMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-3546.15Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN
        ++DE+ ++E+  TW +  LP   K IGCKWV + K   DG++++YK RLV KG+ Q+E IDF +TF+PV ++TS++++++I+ + N  +HQ+D+  AFLN
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLN

Query:  GDLEEEIYMEQPEGFIVYDQES----KVCKLDKSLYGLKQAPK
        GDL+EEIYM+ P G+     +S     VC L KS+YGLKQA +
Subjt:  GDLEEEIYMEQPEGFIVYDQES----KVCKLDKSLYGLKQAPK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.4e-1643.9Show/hide
Query:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIA
        + +E+ +L  N+TW LV  P     +GCKWV + KL  DG++D+ K RLV KGF Q E I F +T++PV R  +IR ++++A
Subjt:  INDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAATCAATGATGAAATGTACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCACCGGATGTAAAGCTATAGGCTGTAAATGGGTTTTAAGGAA
GAAACTCAAACTTGATGGATCAGTAGATAAGTATAAGACTAGATTAGTGGAAAAAGGATTTAGGCAGAGGGAAAACATAGACTTCTTTGATACTTTTACCCCGGTTACTA
GAATCACCTCTATTAGAGTGTTGATATCTATAGCTACCCTAAACAATCTCTTAATCCATCAGATGGATGTTAAAATAGCTTTCCTAAATGGTGATTTAGAAGAAGAGATT
TACATGGAACAACCTGAAGGTTTCATAGTTTACGACCAAGAATCCAAAGTTTGCAAACTAGATAAATCCCTTTATGGCCTAAAACAAGCTCCCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAATCAATGATGAAATGTACTCTCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCACCGGATGTAAAGCTATAGGCTGTAAATGGGTTTTAAGGAA
GAAACTCAAACTTGATGGATCAGTAGATAAGTATAAGACTAGATTAGTGGAAAAAGGATTTAGGCAGAGGGAAAACATAGACTTCTTTGATACTTTTACCCCGGTTACTA
GAATCACCTCTATTAGAGTGTTGATATCTATAGCTACCCTAAACAATCTCTTAATCCATCAGATGGATGTTAAAATAGCTTTCCTAAATGGTGATTTAGAAGAAGAGATT
TACATGGAACAACCTGAAGGTTTCATAGTTTACGACCAAGAATCCAAAGTTTGCAAACTAGATAAATCCCTTTATGGCCTAAAACAAGCTCCCAAGTAA
Protein sequenceShow/hide protein sequence
MARINDEMYSLESNRTWHLVDLPTGCKAIGCKWVLRKKLKLDGSVDKYKTRLVEKGFRQRENIDFFDTFTPVTRITSIRVLISIATLNNLLIHQMDVKIAFLNGDLEEEI
YMEQPEGFIVYDQESKVCKLDKSLYGLKQAPK