; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C032043 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C032043
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr06:14762978..14768169
RNA-Seq ExpressionMELO3C032043
SyntenyMELO3C032043
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0032991 - protein-containing complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR020847 - AP endonuclease 1, binding site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN47248.1 hypothetical protein Csa_016987 [Cucumis sativus]5.3e-2177.22Show/hide
Query:  TKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKA
        TKIESV ELEP AS+QN   LSKEVVA LLEYDLCI AVTRKGK QRKGG+  +SN+KL +EVKALLGSWERE Q AKA
Subjt:  TKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKA

RVW56511.1 putative ribonuclease H protein [Vitis vinifera]1.2e-1232.41Show/hide
Query:  DFLVKFNPDVVILQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITK---SGSFIDDNALEREDCRGALLDLIVKEQKIWIEKCNLQ
        DFL   NPDVV++QE+K  +  R+ V SVW +       L A G+SGGIL++W  +++ +    G    D   +R   +G L +LI++E+  W +K  ++
Subjt:  DFLVKFNPDVVILQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITK---SGSFIDDNALEREDCRGALLDLIVKEQKIWIEKCNLQ

Query:  WPCEGEENYSFFHRWISARKSKSIISSLMCIDGQALELGYPLTAQ
        W  EG+ N  F+H+  + R+++  I  L    G  L+    +T +
Subjt:  WPCEGEENYSFFHRWISARKSKSIISSLMCIDGQALELGYPLTAQ

TYK01560.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.7e-11864.95Show/hide
Query:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVI
        +RTKIESVSELEPVASLQNLL LSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQ AKACTSEGPDDFLVKFNPDVVI
Subjt:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVI

Query:  LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNALE------------REDCRGALLDLIVKEQKIWIEKCNLQWPC
        LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESIT   S      +               DCRGALLDLIVKEQKIWIEKCNLQWPC
Subjt:  LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNALE------------REDCRGALLDLIVKEQKIWIEKCNLQWPC

Query:  EGEENYSFFHRWISARKSKSIISSLMCIDGQAL------------------------------------------ELGYPLT------------------
        EGEENYSFFHRWISARKSKSIIS LM IDGQAL                                           L  P +                  
Subjt:  EGEENYSFFHRWISARKSKSIISSLMCIDGQAL------------------------------------------ELGYPLT------------------

Query:  -----------------------------------------AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE
                                                 AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE
Subjt:  -----------------------------------------AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE

TYK05527.1 eukaryotic translation initiation factor 3 subunit A-like [Cucumis melo var. makuwa]1.1e-1545.1Show/hide
Query:  SELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTR--KGKS-QRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVILQESK
        S L  VA+ Q L+ L+ +V    +        V +  KGK  +R   +  SSNKKL +E KALLGSWEREAQ A A   +G          D   L+   
Subjt:  SELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTR--KGKS-QRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVILQESK

Query:  VSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNAL
        + VD  KLVKS+W S LVG  +LEACGSS GILLMWKE+ I  + S  D N L
Subjt:  VSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNAL

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]2.6e-1271.19Show/hide
Query:  DFLVKFNPDVVILQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESIT
        + LVKFNPDVVILQ+SKVS   R LVKSVW S  VG   LEA GSSGGIL++WKE+SIT
Subjt:  DFLVKFNPDVVILQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESIT

TrEMBL top hitse value%identityAlignment
A0A0A0K5P3 Uncharacterized protein5.8e-1365.75Show/hide
Query:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWE
        K T+IES ++LE VA+L N  PLSKEVV   LEY+LCIR VTRKGK Q KG + +SS KKL +EVKALLG+WE
Subjt:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWE

A0A0A0KCH2 Uncharacterized protein2.6e-2177.22Show/hide
Query:  TKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKA
        TKIESV ELEP AS+QN   LSKEVVA LLEYDLCI AVTRKGK QRKGG+  +SN+KL +EVKALLGSWERE Q AKA
Subjt:  TKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKA

A0A0A0L9C9 Uncharacterized protein1.8e-2277.92Show/hide
Query:  KIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAK
        +IESV+ELEP AS+QN  PLSKEVVA LLEYDLCIRAVTRKGK  RKGG+ +++NKKL KEVKALLGSWEREAQ  K
Subjt:  KIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAK

A0A5D3BR42 LINE-1 retrotransposable element ORF2 protein1.3e-11864.95Show/hide
Query:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVI
        +RTKIESVSELEPVASLQNLL LSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQ AKACTSEGPDDFLVKFNPDVVI
Subjt:  KRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVI

Query:  LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNALE------------REDCRGALLDLIVKEQKIWIEKCNLQWPC
        LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESIT   S      +               DCRGALLDLIVKEQKIWIEKCNLQWPC
Subjt:  LQESKVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNALE------------REDCRGALLDLIVKEQKIWIEKCNLQWPC

Query:  EGEENYSFFHRWISARKSKSIISSLMCIDGQAL------------------------------------------ELGYPLT------------------
        EGEENYSFFHRWISARKSKSIIS LM IDGQAL                                           L  P +                  
Subjt:  EGEENYSFFHRWISARKSKSIISSLMCIDGQAL------------------------------------------ELGYPLT------------------

Query:  -----------------------------------------AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE
                                                 AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE
Subjt:  -----------------------------------------AQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKE

A0A5D3C150 Eukaryotic translation initiation factor 3 subunit A5.6e-1645.1Show/hide
Query:  SELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTR--KGKS-QRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVILQESK
        S L  VA+ Q L+ L+ +V    +        V +  KGK  +R   +  SSNKKL +E KALLGSWEREAQ A A   +G          D   L+   
Subjt:  SELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTR--KGKS-QRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVILQESK

Query:  VSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNAL
        + VD  KLVKS+W S LVG  +LEACGSS GILLMWKE+ I  + S  D N L
Subjt:  VSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGAACCAAAATTGAAAGCGTTAGTGAATTAGAACCAGTTGCCTCTCTGCAAAACCTGTTACCGTTAAGCAAAGAAGTGGTTGCGAGGCTTCTTGAGTAT
GACTTATGCATCCGAGCAGTGACTAGGAAAGGAAAATCTCAAAGAAAGGGTGGGTCCTTCATGAGCAGTAATAAGAAACTTATTAAAGAAGTTAAAGCTTTACTT
GGTAGTTGGGAAAGGGAAGCTCAAGTAGCCAAAGCTTGTACTTCTGAGGGCCCTGATGATTTTCTTGTGAAGTTCAATCCGGATGTTGTTATTCTCCAAGAATCT
AAGGTTAGTGTTGATGGTCGTAAGCTGGTGAAATCAGTTTGGGGCAGCTGTTTAGTTGGGTGTGGTATTTTAGAAGCCTGTGGCTCTTCAGGTGGTATTCTTCTT
ATGTGGAAAGAGGAGTCGATCACGAAATCTGGTTCATTTATTGATGATAATGCCTTGGAAAGAGAAGACTGTAGAGGGGCTTTACTTGATTTGATTGTTAAGGAA
CAAAAGATCTGGATTGAAAAATGTAACCTTCAATGGCCTTGTGAGGGGGAGGAGAATTATAGTTTCTTCCACAGATGGATTTCAGCTCGTAAAAGTAAAAGTATT
ATTTCTTCGTTGATGTGCATTGATGGGCAAGCTCTGGAGTTGGGTTATCCTTTAACAGCTCAAAAAACATCATTAATTGGTATTAACCTTGATAGCACTGTTCTG
GCTCCTTATGCTAGTACTTTGGGTTATGTGGTGGATAGCATCGCCTTTAACTATTTAGGTTTCTCTATAGGAGGGGGTCGTAATAGGAAAGAGCCTTTCTACGAT
TTTCAACAATTGGAAGGCCCTTATTCTTCAGTTTCTCTTGGGGAAGGCCCTCTTGTTTCTTGCACTCAGGCTGTTGAGCAATTTATTCGCATTAAAAGAGTTGGC
ACAAGTACGATCTTCCATAAATGTAAAGATTACTTCTACCATCAAAACTATTCCACAACTAACAGATTGTTTTCCATCAAACCTTCGCCAACAGTGGGAAATTCT
GGTGATAGCTGCCATTCTCCATCCACTAAGAACTTGATTTCGTACCTTCATATATATTACAGAAACAACAGCAAAAGAAATAATGTTCTACGTCATGAAGGTAGT
CATTTGATCAGTAAATCAAGAAAATCAAGACGAGAGGAAGGCAAGGGAGAGGAATTATACCTCCCAGGTCTAAGCCTTAAAGTTGTTGAGAACTTAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGAACCAAAATTGAAAGCGTTAGTGAATTAGAACCAGTTGCCTCTCTGCAAAACCTGTTACCGTTAAGCAAAGAAGTGGTTGCGAGGCTTCTTGAGTAT
GACTTATGCATCCGAGCAGTGACTAGGAAAGGAAAATCTCAAAGAAAGGGTGGGTCCTTCATGAGCAGTAATAAGAAACTTATTAAAGAAGTTAAAGCTTTACTT
GGTAGTTGGGAAAGGGAAGCTCAAGTAGCCAAAGCTTGTACTTCTGAGGGCCCTGATGATTTTCTTGTGAAGTTCAATCCGGATGTTGTTATTCTCCAAGAATCT
AAGGTTAGTGTTGATGGTCGTAAGCTGGTGAAATCAGTTTGGGGCAGCTGTTTAGTTGGGTGTGGTATTTTAGAAGCCTGTGGCTCTTCAGGTGGTATTCTTCTT
ATGTGGAAAGAGGAGTCGATCACGAAATCTGGTTCATTTATTGATGATAATGCCTTGGAAAGAGAAGACTGTAGAGGGGCTTTACTTGATTTGATTGTTAAGGAA
CAAAAGATCTGGATTGAAAAATGTAACCTTCAATGGCCTTGTGAGGGGGAGGAGAATTATAGTTTCTTCCACAGATGGATTTCAGCTCGTAAAAGTAAAAGTATT
ATTTCTTCGTTGATGTGCATTGATGGGCAAGCTCTGGAGTTGGGTTATCCTTTAACAGCTCAAAAAACATCATTAATTGGTATTAACCTTGATAGCACTGTTCTG
GCTCCTTATGCTAGTACTTTGGGTTATGTGGTGGATAGCATCGCCTTTAACTATTTAGGTTTCTCTATAGGAGGGGGTCGTAATAGGAAAGAGCCTTTCTACGAT
TTTCAACAATTGGAAGGCCCTTATTCTTCAGTTTCTCTTGGGGAAGGCCCTCTTGTTTCTTGCACTCAGGCTGTTGAGCAATTTATTCGCATTAAAAGAGTTGGC
ACAAGTACGATCTTCCATAAATGTAAAGATTACTTCTACCATCAAAACTATTCCACAACTAACAGATTGTTTTCCATCAAACCTTCGCCAACAGTGGGAAATTCT
GGTGATAGCTGCCATTCTCCATCCACTAAGAACTTGATTTCGTACCTTCATATATATTACAGAAACAACAGCAAAAGAAATAATGTTCTACGTCATGAAGGTAGT
CATTTGATCAGTAAATCAAGAAAATCAAGACGAGAGGAAGGCAAGGGAGAGGAATTATACCTCCCAGGTCTAAGCCTTAAAGTTGTTGAGAACTTAGAGTAA
Protein sequenceShow/hide protein sequence
MKRTKIESVSELEPVASLQNLLPLSKEVVARLLEYDLCIRAVTRKGKSQRKGGSFMSSNKKLIKEVKALLGSWEREAQVAKACTSEGPDDFLVKFNPDVVILQES
KVSVDGRKLVKSVWGSCLVGCGILEACGSSGGILLMWKEESITKSGSFIDDNALEREDCRGALLDLIVKEQKIWIEKCNLQWPCEGEENYSFFHRWISARKSKSI
ISSLMCIDGQALELGYPLTAQKTSLIGINLDSTVLAPYASTLGYVVDSIAFNYLGFSIGGGRNRKEPFYDFQQLEGPYSSVSLGEGPLVSCTQAVEQFIRIKRVG
TSTIFHKCKDYFYHQNYSTTNRLFSIKPSPTVGNSGDSCHSPSTKNLISYLHIYYRNNSKRNNVLRHEGSHLISKSRKSRREEGKGEELYLPGLSLKVVENLE