; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0109191 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0109191
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase, catalytic core
Genome locationCMiso1.1chr04:28696810..28697475
RNA-Seq ExpressionCmc04g0109191
SyntenyCmc04g0109191
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033603.1 Integrase, catalytic core [Cucumis melo var. makuwa]3.7e-8273.33Show/hide
Query:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV
        +VEN+ LALHAEE                     RKLQ+TIE    RV V ANN KL IAHVGKTMIVPRSN NQVELDNVFYV  MKKNL+++SQLTS 
Subjt:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV

Query:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ
         NF+VFG N+VKVYHN KVSGTPL+EGRRMD IYVMSAET Y+NK +KNET DLW+ARL HVSYSKLKTIINK +L GLPQLDIREDMVCVGCQYGKAHQ
Subjt:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ

Query:  LPFKESKFRAKQPLELVHSDVFGHV
         PFKE KFRAKQPLELVHSDVFG V
Subjt:  LPFKESKFRAKQPLELVHSDVFGHV

KAA0037684.1 Integrase, catalytic core [Cucumis melo var. makuwa]1.7e-12299.1Show/hide
Query:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSVGNFV
        MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNL+TISQLTSVGNFV
Subjt:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSVGNFV

Query:  VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFK
        VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKS+LNGLPQLDIREDMVCVGCQYGKAHQLPFK
Subjt:  VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFK

Query:  ESKFRAKQPLELVHSDVFGHV
        ESKFRAKQPLELVHSDVFGHV
Subjt:  ESKFRAKQPLELVHSDVFGHV

KAA0046986.1 Integrase, catalytic core [Cucumis melo var. makuwa]1.7e-10686.67Show/hide
Query:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV
        ++ENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE    RVVV ANN KLPIAHVGKTMIVPRSN NQVELDNVFYV GMKKNL+++SQLTS 
Subjt:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV

Query:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ
        GNFVVFGP+DVKVYHN KVSGTPL+EGRRMD IYVMSAETAYVNKTRKN+T DLW+ARL HVS SKLKTIINK++L GLPQLDIREDMVC GCQYGKAHQ
Subjt:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ

Query:  LPFKESKFRAKQPLELVHSDVFGHV
        LPFK+SKFRAKQPLELVHSDVF  V
Subjt:  LPFKESKFRAKQPLELVHSDVFGHV

KAE8684576.1 hypothetical protein F3Y22_tig00111127pilonHSYRG00074 [Hibiscus syriacus]4.9e-8265.93Show/hide
Query:  ENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTS
        E E LAL     E ++Y+NDWIVDSGCSNHMTGDK+KLQN  E    RVVV A+N +LPI H+GKT++ PR N NQV+L +V++VPGMKKNL++++QLTS
Subjt:  ENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTS

Query:  VGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAH
         G++V+FGP DVKVY + K++ TP +EGRR++ IYVMSAE+AYV++TRKNET DLW+ RL HVSYSKL  ++ KS+L GLPQLD+R D VC GCQYGKAH
Subjt:  VGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAH

Query:  QLPFKESKFRAKQPLELVHSDVFGHV
        QLP+ ESKF+AK+PLELVHSDVFG V
Subjt:  QLPFKESKFRAKQPLELVHSDVFGHV

KAE8705435.1 hypothetical protein F3Y22_tig00110429pilonHSYRG01243 [Hibiscus syriacus]4.9e-8265.93Show/hide
Query:  ENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTS
        E E LAL     E ++Y+NDWIVDSGCSNHMTGDK+KLQN  E    RVVV A+N +LPI H+GKT++ PR N NQV+L +V++VPGMKKNL++++QLTS
Subjt:  ENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTS

Query:  VGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAH
         G++V+FGP DVKVY + K++ TP +EGRR++ IYVMSAE+AYV++TRKNET DLW+ RL HVSYSKL  ++ KS+L GLPQLD+R D VC GCQYGKAH
Subjt:  VGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAH

Query:  QLPFKESKFRAKQPLELVHSDVFGHV
        QLP+ ESKF+AK+PLELVHSDVFG V
Subjt:  QLPFKESKFRAKQPLELVHSDVFGHV

TrEMBL top hitse value%identityAlignment
A0A2N9EJM7 Uncharacterized protein1.6e-8366.08Show/hide
Query:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT
        +E E LAL     E+++Y+NDWIVDSGCSNHMTGDK KLQN  E    RVVV A+N +LPIAH+GKT++ PR N NQV L +V++VPGMKKNL++++QLT
Subjt:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT

Query:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA
          G++V+FGP DVKVY + K+S TP++EG+R++ +YVMSAE+AYV+KTRKNET DLW+ RL HVSYSKL  ++ KS+L GLPQLD+R D VC GCQYGKA
Subjt:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA

Query:  HQLPFKESKFRAKQPLELVHSDVFGHV
        HQLP+KESKF+AK+PLELVHSDVFG V
Subjt:  HQLPFKESKFRAKQPLELVHSDVFGHV

A0A2N9FTW5 Uncharacterized protein1.6e-8366.08Show/hide
Query:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT
        +E E LAL     E+++Y+NDWIVDSGCSNHMTGDK KLQN  E    RVVV A+N +LPIAH+GKT++ PR N NQV L +V++VPGMKKNL++++QLT
Subjt:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT

Query:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA
          G++V+FGP DVKVY + K+S TP++EG+R++ +YVMSAE+AYV++TRKNET DLW+ RL HVSYSKL  ++ KS+L GLPQLD+R D VCVGCQYGKA
Subjt:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA

Query:  HQLPFKESKFRAKQPLELVHSDVFGHV
        HQLP+KESKF+AK+PLELVHSDVFG V
Subjt:  HQLPFKESKFRAKQPLELVHSDVFGHV

A0A2N9J3C6 Uncharacterized protein1.6e-8366.08Show/hide
Query:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT
        +E E LAL     E+++Y+NDWIVDSGCSNHMTGDK KLQN  E    RVVV A+N +LPIAH+GKT++ PR N NQV L +V++VPGMKKNL++++QLT
Subjt:  VENEVLALHA---EEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT

Query:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA
          G++V+FGP DVKVY + K+S TP++EG+R++ +YVMSAE+AYV+KTRKNET DLW+ RL HVSYSKL  ++ KS+L GLPQLD+R D VC GCQYGKA
Subjt:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKA

Query:  HQLPFKESKFRAKQPLELVHSDVFGHV
        HQLP+KESKF+AK+PLELVHSDVFG V
Subjt:  HQLPFKESKFRAKQPLELVHSDVFGHV

A0A5A7T2J1 Integrase, catalytic core8.0e-12399.1Show/hide
Query:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSVGNFV
        MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNL+TISQLTSVGNFV
Subjt:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSVGNFV

Query:  VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFK
        VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKS+LNGLPQLDIREDMVCVGCQYGKAHQLPFK
Subjt:  VFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFK

Query:  ESKFRAKQPLELVHSDVFGHV
        ESKFRAKQPLELVHSDVFGHV
Subjt:  ESKFRAKQPLELVHSDVFGHV

A0A5A7TYD2 Integrase, catalytic core8.1e-10786.67Show/hide
Query:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV
        ++ENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE    RVVV ANN KLPIAHVGKTMIVPRSN NQVELDNVFYV GMKKNL+++SQLTS 
Subjt:  MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIE----RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSV

Query:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ
        GNFVVFGP+DVKVYHN KVSGTPL+EGRRMD IYVMSAETAYVNKTRKN+T DLW+ARL HVS SKLKTIINK++L GLPQLDIREDMVC GCQYGKAHQ
Subjt:  GNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQ

Query:  LPFKESKFRAKQPLELVHSDVFGHV
        LPFK+SKFRAKQPLELVHSDVF  V
Subjt:  LPFKESKFRAKQPLELVHSDVFGHV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.8e-1025.99Show/hide
Query:  ENEVLALHAEE-----VNYENDWIVDSGCSNHMTGDKRKLQNTIE---RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT
        +N VL ++ EE        E++W+VD+  S+H T  +      +      V + N     IA +G   I        V L +V +VP ++ NL++   L 
Subjt:  ENEVLALHAEE-----VNYENDWIVDSGCSNHMTGDKRKLQNTIE---RVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLT

Query:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAY--VNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYG
          G    F     ++     V    + +G     +Y  +AE     +N  +   +VDLW+ R+ H+S   L+ +  KS+++      ++    C  C +G
Subjt:  SVGNFVVFGPNDVKVYHNFKVSGTPLIEGRRMDFIYVMSAETAY--VNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYG

Query:  KAHQLPFKESKFRAKQPLELVHSDVFG
        K H++ F+ S  R    L+LV+SDV G
Subjt:  KAHQLPFKESKFRAKQPLELVHSDVFG

P93293 Uncharacterized mitochondrial protein AtMg003006.8e-1033.65Show/hide
Query:  LIEGRRMDFIYVM--SAETAYVN--KTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFKESKFRAKQPLELVHS
        +++G R D +Y++  S ET   N  +T K+ET  LW++RL+H+S   ++ ++ K  L+      ++    C  C YGK H++ F   +   K PL+ VHS
Subjt:  LIEGRRMDFIYVM--SAETAYVN--KTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFKESKFRAKQPLELVHS

Query:  DVFG
        D++G
Subjt:  DVFG

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.8e-1133.65Show/hide
Query:  LIEGRRMDFIYVM--SAETAYVN--KTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFKESKFRAKQPLELVHS
        +++G R D +Y++  S ET   N  +T K+ET  LW++RL+H+S   ++ ++ K  L+      ++    C  C YGK H++ F   +   K PL+ VHS
Subjt:  LIEGRRMDFIYVM--SAETAYVN--KTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFKESKFRAKQPLELVHS

Query:  DVFG
        D++G
Subjt:  DVFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGAAAATGAAGTGTTAGCTCTCCATGCAGAAGAGGTAAATTATGAAAATGATTGGATTGTTGATTCAGGATGTTCTAACCATATGACAGGAGATAAGAGGAAGCT
ACAAAACACAATAGAGCGAGTTGTTGTAATTGCAAACAACTTGAAGTTGCCAATAGCTCACGTTGGCAAAACTATGATAGTGCCTCGTTCTAATTTCAATCAAGTGGAAT
TAGATAATGTATTTTATGTGCCTGGAATGAAGAAGAATTTGATGACAATATCTCAATTGACTTCAGTAGGCAACTTCGTTGTATTTGGACCTAACGATGTCAAGGTGTAT
CATAATTTTAAAGTAAGTGGTACACCATTAATAGAAGGACGAAGGATGGACTTCATCTACGTTATGTCAGCAGAGACCGCTTACGTGAACAAGACGCGGAAGAATGAAAC
AGTAGATTTGTGGAATGCAAGACTTAGTCATGTTAGCTACAGCAAATTAAAGACAATAATAAACAAGTCCATATTGAATGGGTTGCCACAACTTGATATCAGAGAAGACA
TGGTATGTGTTGGTTGCCAGTATGGAAAAGCACATCAACTACCATTCAAGGAGTCCAAATTCAGAGCAAAACAACCATTGGAGTTGGTGCATTCAGATGTATTTGGTCAT
GTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGAAAATGAAGTGTTAGCTCTCCATGCAGAAGAGGTAAATTATGAAAATGATTGGATTGTTGATTCAGGATGTTCTAACCATATGACAGGAGATAAGAGGAAGCT
ACAAAACACAATAGAGCGAGTTGTTGTAATTGCAAACAACTTGAAGTTGCCAATAGCTCACGTTGGCAAAACTATGATAGTGCCTCGTTCTAATTTCAATCAAGTGGAAT
TAGATAATGTATTTTATGTGCCTGGAATGAAGAAGAATTTGATGACAATATCTCAATTGACTTCAGTAGGCAACTTCGTTGTATTTGGACCTAACGATGTCAAGGTGTAT
CATAATTTTAAAGTAAGTGGTACACCATTAATAGAAGGACGAAGGATGGACTTCATCTACGTTATGTCAGCAGAGACCGCTTACGTGAACAAGACGCGGAAGAATGAAAC
AGTAGATTTGTGGAATGCAAGACTTAGTCATGTTAGCTACAGCAAATTAAAGACAATAATAAACAAGTCCATATTGAATGGGTTGCCACAACTTGATATCAGAGAAGACA
TGGTATGTGTTGGTTGCCAGTATGGAAAAGCACATCAACTACCATTCAAGGAGTCCAAATTCAGAGCAAAACAACCATTGGAGTTGGTGCATTCAGATGTATTTGGTCAT
GTCTAA
Protein sequenceShow/hide protein sequence
MVENEVLALHAEEVNYENDWIVDSGCSNHMTGDKRKLQNTIERVVVIANNLKLPIAHVGKTMIVPRSNFNQVELDNVFYVPGMKKNLMTISQLTSVGNFVVFGPNDVKVY
HNFKVSGTPLIEGRRMDFIYVMSAETAYVNKTRKNETVDLWNARLSHVSYSKLKTIINKSILNGLPQLDIREDMVCVGCQYGKAHQLPFKESKFRAKQPLELVHSDVFGH
V