; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0106821 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0106821
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCMiso1.1chr04:25017743..25023140
RNA-Seq ExpressionCmc04g0106821
SyntenyCmc04g0106821
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0007034 - vacuolar transport (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005768 - endosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004601 - peroxidase activity (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
GO:0047938 - glucose-6-phosphate 1-epimerase activity (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036831.1 peroxidase 64 [Cucumis melo var. makuwa]1.9e-1472.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQ LKKG + W+E A +AFERLK+AM+TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

KAA0039715.1 Transposon Tf2-6 polyprotein [Cucumis melo var. makuwa]2.5e-1474.14Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FVQNYGSIAAPL TQLLK GAY+W+E    AFE+LK AM+TLPVL +PD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

KAA0044875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-1472.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQ LKKG + W+E A +AFERLK+AM+TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

KAA0048504.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.5e-1475.86Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FVQNYG+IAAPL TQLLKKG Y+WS+ A  AFERLK AM +LPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

KAA0050169.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-1472.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQLLKKG + W+E A +AF+RLK+AMV+LPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

TrEMBL top hitse value%identityAlignment
A0A5A7TE42 Transposon Tf2-6 polyprotein1.2e-1474.14Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FVQNYGSIAAPL TQLLK GAY+W+E    AFE+LK AM+TLPVL +PD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

A0A5A7TU09 Glucose-6-phosphate 1-epimerase9.4e-1572.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQ LKKG + W+E A +AFERLK+AM+TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

A0A5A7TZP3 Transposon Tf2-1 polyprotein isoform X17.2e-1575.86Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FVQNYG+IAAPL TQLLKKG Y+WS+ A  AFERLK AM +LPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

A0A5A7U9J7 Ty3/gypsy retrotransposon protein9.4e-1572.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQLLKKG + W+E A +AF+RLK+AMV+LPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

A0A5D3BC83 Peroxidase 649.4e-1572.41Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG+IAAPL TQ LKKG + W+E A +AFERLK+AM+TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

SwissProt top hitse value%identityAlignment
P03359 Gag-Pol polyprotein8.5e-0534.48Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +G  G+ R ++  + S+AAPL     +   + W+E    AF+R+K A+++ P L LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

P20825 Retrovirus-related Pol polyprotein from transposon 2972.2e-0546.67Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANM--AFERLKTAMVTLPVLVLPD
        +GLTGYYR F+ NY  IA P MT  LKK     ++      AFE+LK  ++  P+L LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANM--AFERLKTAMVTLPVLVLPD

P92523 Uncharacterized mitochondrial protein AtMg008606.5e-1362.07Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG I  PL T+LLKK + +W+E A +AF+ LK A+ TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

Q7M732 Retrotransposon-like protein 16.5e-0549.02Show/hide
Query:  YRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLP
        YR FV+N+  IAAPL+ QLL    Y W E    A E LK A    PVL  P
Subjt:  YRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLP

Q9TTC1 Gag-Pol polyprotein2.2e-0536.21Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +G  G+ R ++  + S+AAPL     +K  + W+E    AF R+K A+++ P L LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.6e-1462.07Show/hide
Query:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD
        +GLTGYYR FV+NYG I  PL T+LLKK + +W+E A +AF+ LK A+ TLPVL LPD
Subjt:  MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAACCGGTTATTATAGATGCTTTGTCCAAAATTATGGAAGCATTGCAGCCCCATTGATGACACAGCTGTTAAAGAAGGGAGCTTACCAGTGGTCAGAAGGGGC
AAATATGGCATTCGAACGATTGAAGACAGCCATGGTGACTCTGCCAGTGTTGGTTTTACCAGACTAG
mRNA sequenceShow/hide mRNA sequence
CTATATATATATCATTTGAAATCACATAAAAAAAGAAAATACTTGAAGGGTCCTGTCCACTAACCTTTTTATAGGAATATCCAAGTCAGCAATATAATGAAGCGTAAAGT
TGATAATACTCTGCAATTTTTTGGCATTGGGTCAATTGAGAAAATCGATGTTTATATCAGTTGCTTTTCATGAGCACAAAAGGATGCAACATTTATGGAAGATGTCTCAT
TACACCGGTCACAAGGGAAGATTTTCGAAAATGGCGCACAAACGACTGGAGGAGTTGAGGCTAGCGAGGCGGTGATTTCTGCGACACGAGTGAAAATTCGGAAATTGCCG
AATTCGTTAGCGAAAAGTATCGAGCGATTAAATGATCAAGGCGATAATCAACATCGAAGTTCAAAAGAGATGAAGAAGTTGAAAAGAGATGAAGAAGTTGCTATCGAACT
TGTTGATTGAAATTGCGCACTCAAGATTCTAATCAATTCGGAAATACAGAAGCAGAGGCATCGAGCGGAAGTAAGAAAGGTAAGGAGGTGGCGGAGGATTCGACAACAGT
TTTGAATGAGTTGTTGTCATCAGGAGTAGAAGAAACTGGAATCGAGAGACGAAAATTTACGAAAGTTGAAACGTCGATATTCAACGGCACTGATCCGGATCCCCCATTCA
GAGCAGAGTGATCCTTCCAGATCCAAAAGTTAATCGAGGCAGAGAAATTAATGGAGGTTGTAATCAGTTTTGAATCAGTGAGGTTAGATTGGTATCGTGCGAAGGAGGAG
ATGGACTCCGGACTCGTTTAGTGATTGGAAGACATTGAAGTTGAGACTTCTTCAACGATTTCGATATCCTCGGCAAGGCACGGTATGCAGTCAATTCCTTGCGATTAAAC
AGGAATCTATAGTTGAAGAATATATAAACCTATTTGATAAGCTTTGTAGCGCCATTACCTCATTTATGAATGAAATATTAGAAAGTACTTTCATGAATGGGCTCGTTCCA
TGGGTTAGGGCTGAGGTAGAGTGTTTGAAGCCGGTTGGGAAATATGATGAAAGTGGCCCAACGAGCGGAGAATAGGGAGTTGATCCGGTCCAAAATGTTAGCAAAGTGTT
CTGAGAAGAGTCCAGAAAAATAAAAACTGTAGTGCGATGATTAACCCTAAACCTACGATTCCGACTGCTAGAAAAGAGATGCCGAAGGTGAATGAGCCAATTCTGCGGAG
AAGGAGAAGGGCTTGTGCTTTAGTTGTGACGAGCGGTATACGGTAGGGCATCGATGTAAAATTAAAAATCAGCGTGAGTTACGAGTTTTGTACAGAATAATGTTGATGAT
CTGGAAGTGATTGAGGATGCGGTTGAAGAATTTGAAACGGTAGAATTGCAGATGACAGAGGTAAACGAAATTGTGGAGCTTTCAATGAATTCAGTTTTCGATTTGTCAAT
TCCAGGTACAGTGAAGTTGAAGGGAAGATTGGGGAATAGGGATGTTGTAGTGTTAATCGGTTGAGAAGCAACTTACAATTTTATTGCCCAACGTCTGGTTTTGGAAGAGA
AGTTGAAAGTGGAGACGACTACGAATTATGGTGTTATAATGGGAACAGGAACAGAAGTGAAGGGAAAGGGACCGTGTAAGGGGGCCTGATTGCAACTGGGAGGATTGTCG
ATTGTGGAGGAGTTTTTGCTGCTGGAGTTGGGAGTGGACATCATTATGGGAATGCAGCGGTTGCATACCTTGGGTGTTACAGAGGTGGATTGAGAACATTGACAATGAAA
TTTGTTCACGAGGGAAAAACAATTGTGCTGAGAAGTTGGTGTAAAGACTTGGGAAGTTGGTGATCAAGGGTTTCTTGTAGTATGCAGATTGTTGCAAGGGGGAAACTTGG
GAGGAATTTTATGGGATAGAACCGAACATTAATCCTACTGAGGCTTTGACTAGAGTCATTTTGAAGTCTGATGACATTTTTCGATTGGCCTGAAGAATTACCTCCAAGTA
AAGGAATCGATCATTACATTCATTTGAAAGAGGGAACAGGTCCAGTGAATGTAAGGCCCTACAGGTACGCATAGCCAGAAGACTTAGATGGCTCAGATGGTAGCTTATAT
GTTATCTTCGAGTATAATTACGCCAAGTGCTAGCCCTTACTTGAGTCCGGTATATTGTTGGCCCTTACTTGAGTCCAGTATATTGTTGGTTAAAAAGAAGGATGCAGGGT
GGAGGTTCTGTGTGGATTATGTGTGGATTGTAGAGCTTTGAACAGTGTCACTGTCCTGGACAAATTCCCAATTCCTGTGATTGAGGAACTGTTTGATGAACTAAATGGAA
GGGTGACGTTTTCGAAGATTGATCATAAGACCATCAAATCAGAATGCACCCAACTAATGAAGAGAAGACACTGCATTTCATACGCACAAGGGCCGTTATGAGTTTTTAGC
AATGCCCTTTGGGTTGACGAAAATGCCATGAATGTTCTAAGCTCTAATGAACAAGATCTTGAATGTGTTTTTCGATGATATCCTAGTTTACGGCAAGGATTTGGAGGACC
ATATTCATCATCTGAAGGAGGTGCAGGAGGTTTTGCAAGAAAGCGATTTGTATGCTAATCAACATAAATGCAAATTTATGCATGGCCGTTTGGACTATTTAGGACATTAT
TTCAGGTGCAGGGGTAGAAGCGGATCCAAGAAAAATTCAAGCTATAATTGATTGACCACCACCTTGTATTCAAAGAAGTAAGGAGCTTCATGGGTTTAACCGGTTATTAT
AGATGCTTTGTCCAAAATTATGGAAGCATTGCAGCCCCATTGATGACACAGCTGTTAAAGAAGGGAGCTTACCAGTGGTCAGAAGGGGCAAATATGGCATTCGAACGATT
GAAGACAGCCATGGTGACTCTGCCAGTGTTGGTTTTACCAGACTAGTAAAACCTTCAAAATAGAAACTGATACATTAGGCTATCGACTAGGGGCGGTGATGATGCAGCAA
AAGCACCCGATTGCTTTGTTTAGCCATACTCTTTCAAATATTGATCCTGGAAAGCCGGTTTATGAAAGAGAATTGATGGCTGTCCAGCGTTGGTGTCCATATTTGTAGGG
ACACAAATTTATTATGAGGACAGAATAGAAAGCTCTCATGTTTTGGCTTGAACAACGAGTTATTCAGTTCCAGCATCAGCAGTGGTTATCTAAGTTGTTTGGTTTTGATT
TTGAGGTGATCGGGAGTTGAATTGATAACAAGGTTGCTGATGACTTGTCTCGGCAAGCCTGAGGGCATTGAATTGGCAAACCTAGTTGTGCCTACTTTGTTGGATGTGTC
TGCGATTAAGGAAGAGGTTTACAAAGACTCCAAATTGAAGGATATTATTGAGAAGTTGATGGTCAGTATTTCCAATTTTTCTCTTCAACAGGGGATTTTCAAGTATAGGG
GTCGTTTGGTGGTTTTTAAAACTTCTCTGTAGCCGGCAATATTACATACGTATAAAAGGTTGGTGGACATTCTGGTTTTTTGCGCACTTATAAAAGGTTGCTTGGGGAGG
TATTGGAGGGGCAAGAGTTAAAAAGCATGTGCCGAATGTGATCGATAAAGATCAATAGCTTTCACAATGAAATTCAGTTACTGTTTCTCATTGAAAATAAATAAAATAAA
AAACAAAAACTAAGGCTTCTCTTATCGAACCCTGTATGCCAAATCAGGTTGTGCAAGAAGATGATAAAGAATGGCCTCAAACATGAGAACCCAAGGTTACCCTGCCAATG
GTTTCTAAATAAATGGAATACGAGAAAAAAGAGTAGCTTATATTACCTTAAGATTTTCTTCTCCCAGATGTAATTTCAAGCTCTGTACAATTGGAAAGGAACATATTATG
ACGTAGGGATAGATCAAATCAAAAAACATGATAAAAGGAGCAATGTTACAAGGAATTCAAATGATAGTTTGTCTAAGTCGAATAAGCACATCATGTCAGAGAAGTTCTTA
AATAAAAGGCACCAATGGTATTAAAGGTCTTAAATGAAAATATAGCCATTCATAAGGGAATGGAAAGCTTCGGAGTTCAGAGTAAACAATGACTTTGTAAATCAGTAAAA
TCTCAGTATAGAAATGTTATATAACATAAATATTTAGCTAACTAATTCCTAACCACAATTTTACTTCGTATCAAATGTAATTAAACCCGTGGATTGTCATTCTTGAAATG
CAAGGATTTACTCTAATTTCCTTTCCTTTTTTATCATAAGCATGCTGTCTCTTTATTTTCCCTTTTGTATCTTTTGTTAACATGAGAAATTAATAAGAAAGTCATATCGT
GGTTTTTCTCTCGGTACTCGGGCTTTCACATAACTCAATGTCTATTTTCTTTACCACTTTTAACATGGTATTAGAGAGAGGTAGAGACTAAAGACGAAACCTTAGACACA
ATTATAAATGAAACTCGAATAGAGACAGACGAGATAGTCCCCACGGTCGGGATCAACACCGCCATGGAAAAGCTGCTCCACCAGCTTCAGAAGATGTTGGTGATCACAAT
GGGTCCACCCTTGGAGTTAGTCGCGAAGCTAGTGGAGGAAACCAATCGTATATCCCCACGCGCTATCGTCCAACCCCTTTGTCGTTCAGCCGACTTACTCTTACCTGTTT
TCCCAAGTCCAGCTGCAGTACCCTTTTGGTCAGCCACACACCCACGCATCGTCGCTTCCCTCGATTTACGGACTGCCATCATTTCACGCCCCGCTACCCTTTGATCCTTT
ACAACAACCACACATTCGTGGTATTGAGATCGATCAACTCCATAATAAATCAGGATTTGAAGTTGGTGAATCTCCAGCACAATCCAAACCAACCGACTTGCCGATTATAT
AACTAGTTCTCTGGCACCATCTACAGGTGTTTTTCCAAGGGTGAAGCTAAGTGGCTAAAACTACTTTTCCTGGTCTTAATCAATAAAAATGTTTCTTGAGGGGCGCACCA
ATTAGATCTTTTGACAAGGGAAACTATTCGACCCCTAC
Protein sequenceShow/hide protein sequence
MGLTGYYRCFVQNYGSIAAPLMTQLLKKGAYQWSEGANMAFERLKTAMVTLPVLVLPD