; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0247021 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0247021
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr09:11389530..11390087
RNA-Seq ExpressionCmc09g0247021
SyntenyCmc09g0247021
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.7e-5682.95Show/hide
Query:  NTEMFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYK
        NTEMFR  ETQNK+QKVSSNA+LWH RLGHINLNRI RLVKSG+L+QLEDNSLPP +SCLEGKMTKRSFT KG RAK PLELVHSD+CGPMNVKA+GGY+
Subjt:  NTEMFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYK

Query:  YFISFIDDYLRHGHVYLIHKKSDSFEKFK
        YFISFIDD+ R+GHVYL+H KS+SFEKFK
Subjt:  YFISFIDDYLRHGHVYLIHKKSDSFEKFK

KAA0035526.1 gag/pol protein [Cucumis melo var. makuwa]9.8e-5285.25Show/hide
Query:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID
        AETQNKRQKVSSNAFL H RLGHINLN+IGRLVKSGLL+QL+DNSLPP DSCLEGKMTKRSFT KG RAK PLELVHSD+ GPMNVKA+GGY+YFISFID
Subjt:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID

Query:  DYLRHGHVYLIHKKSDSFEKFK
        +Y R+GHVYLI  KSDSFEKFK
Subjt:  DYLRHGHVYLIHKKSDSFEKFK

KAA0050872.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5585.71Show/hide
Query:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI
        +FR A+TQNKRQKVS NAFLWH RLGHINLN IGRLVKSGLLSQLEDNSLPP DSCLEGKMT+RSFT KG RAKTPLELVHSD+CGPMNVKA+GGYKYFI
Subjt:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI

Query:  SFIDDYLRHGHVYLIHKKSDSFEKFK
        SFID+Y R+GHVYLI  K DSFEKFK
Subjt:  SFIDDYLRHGHVYLIHKKSDSFEKFK

TYK08480.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5585.71Show/hide
Query:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI
        +FR A+TQNKRQKVS NAFLWH RLGHINLN IGRLVKSGLLSQLEDNSLPP DSCLEGKMT+RSFT KG RAKTPLELVHSD+CGPMNVKA+GGYKYFI
Subjt:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI

Query:  SFIDDYLRHGHVYLIHKKSDSFEKFK
        SFID+Y R+GHVYLI  K DSFEKFK
Subjt:  SFIDDYLRHGHVYLIHKKSDSFEKFK

TYK31009.1 gag/pol protein [Cucumis melo var. makuwa]9.8e-5285.25Show/hide
Query:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID
        AETQNKRQKVSSNAFL H RLGHINLN+IGRLVKSGLL+QL+DNSLPP DSCLEGKMTKRSFT KG RAK PLELVHSD+ GPMNVKA+GGY+YFISFID
Subjt:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID

Query:  DYLRHGHVYLIHKKSDSFEKFK
        +Y R+GHVYLI  KSDSFEKFK
Subjt:  DYLRHGHVYLIHKKSDSFEKFK

TrEMBL top hitse value%identityAlignment
A0A5A7T011 Gag/pol protein4.7e-5285.25Show/hide
Query:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID
        AETQNKRQKVSSNAFL H RLGHINLN+IGRLVKSGLL+QL+DNSLPP DSCLEGKMTKRSFT KG RAK PLELVHSD+ GPMNVKA+GGY+YFISFID
Subjt:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID

Query:  DYLRHGHVYLIHKKSDSFEKFK
        +Y R+GHVYLI  KSDSFEKFK
Subjt:  DYLRHGHVYLIHKKSDSFEKFK

A0A5A7U4V3 Gag/pol protein2.7e-5585.71Show/hide
Query:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI
        +FR A+TQNKRQKVS NAFLWH RLGHINLN IGRLVKSGLLSQLEDNSLPP DSCLEGKMT+RSFT KG RAKTPLELVHSD+CGPMNVKA+GGYKYFI
Subjt:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI

Query:  SFIDDYLRHGHVYLIHKKSDSFEKFK
        SFID+Y R+GHVYLI  K DSFEKFK
Subjt:  SFIDDYLRHGHVYLIHKKSDSFEKFK

A0A5D3CAW3 Gag/pol protein2.7e-5585.71Show/hide
Query:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI
        +FR A+TQNKRQKVS NAFLWH RLGHINLN IGRLVKSGLLSQLEDNSLPP DSCLEGKMT+RSFT KG RAKTPLELVHSD+CGPMNVKA+GGYKYFI
Subjt:  MFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFI

Query:  SFIDDYLRHGHVYLIHKKSDSFEKFK
        SFID+Y R+GHVYLI  K DSFEKFK
Subjt:  SFIDDYLRHGHVYLIHKKSDSFEKFK

A0A5D3E562 Gag/pol protein4.7e-5285.25Show/hide
Query:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID
        AETQNKRQKVSSNAFL H RLGHINLN+IGRLVKSGLL+QL+DNSLPP DSCLEGKMTKRSFT KG RAK PLELVHSD+ GPMNVKA+GGY+YFISFID
Subjt:  AETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFID

Query:  DYLRHGHVYLIHKKSDSFEKFK
        +Y R+GHVYLI  KSDSFEKFK
Subjt:  DYLRHGHVYLIHKKSDSFEKFK

E2GK51 Gag/pol protein (Fragment)8.4e-5782.95Show/hide
Query:  NTEMFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYK
        NTEMFR  ETQNK+QKVSSNA+LWH RLGHINLNRI RLVKSG+L+QLEDNSLPP +SCLEGKMTKRSFT KG RAK PLELVHSD+CGPMNVKA+GGY+
Subjt:  NTEMFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYK

Query:  YFISFIDDYLRHGHVYLIHKKSDSFEKFK
        YFISFIDD+ R+GHVYL+H KS+SFEKFK
Subjt:  YFISFIDDYLRHGHVYLIHKKSDSFEKFK

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.7e-1134.43Show/hide
Query:  KVSSNAFLWHSRLGHIN------LNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRA--KTPLELVHSDICGPMNVKAQGGYKYFISFID
        K  +N  LWH R GHI+      + R        LL+ LE  S    + CL GK  +  F +   +   K PL +VHSD+CGP+         YF+ F+D
Subjt:  KVSSNAFLWHSRLGHIN------LNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRA--KTPLELVHSDICGPMNVKAQGGYKYFISFID

Query:  DYLRHGHVYLIHKKSDSFEKFK
         +  +   YLI  KSD F  F+
Subjt:  DYLRHGHVYLIHKKSDSFEKFK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-1840.19Show/hide
Query:  LWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFIDDYLRHGHVYLIHKKS
        LWH R+GH++   +  L K  L+S  +  ++ P D CL GK  + SF     R    L+LV+SD+CGPM +++ GG KYF++FIDD  R   VY++  K 
Subjt:  LWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFIDDYLRHGHVYLIHKKS

Query:  DSFEKFK
          F+ F+
Subjt:  DSFEKFK

P93293 Uncharacterized mitochondrial protein AtMg003007.6e-0734.15Show/hide
Query:  NKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNV
        N  +       LWHSRL H++   +  LVK G L   + +SL   + C+ GK  + +F+      K PL+ VHSD+ G  +V
Subjt:  NKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNV

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein5.4e-0834.15Show/hide
Query:  NKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNV
        N  +       LWHSRL H++   +  LVK G L   + +SL   + C+ GK  + +F+      K PL+ VHSD+ G  +V
Subjt:  NKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLPPFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGTGCGTCGGGAAAACCTCTTATTCCCGATGTCTTTCTTTATGCGTTGAGAATAGCTTTTTCTCCTGACGTCTTACTTTATGCGTCAGGAGAACCTCTTATTCT
CGACGTCTTTTATGCTGACGTCTTTTTTTACGTTGGGAAATGTCCGATTTCTTGTAGAAATACTGAGATGTTTAGAATAGCTGAAACTCAGAATAAAAGACAAAAAGTTT
CTTCCAATGCCTTCTTATGGCACTCAAGACTCGGTCATATTAATCTCAATAGGATTGGGAGATTGGTTAAGAGTGGACTTCTAAGTCAGTTAGAAGATAACTCTTTACCT
CCTTTTGATTCCTGTCTTGAAGGGAAAATGACCAAAAGATCTTTTACTGAAAAAGGTTTTAGAGCTAAAACACCTTTAGAGCTCGTACATTCGGACATTTGTGGACCAAT
GAATGTCAAGGCTCAGGGAGGATACAAATATTTCATTAGTTTCATTGATGATTATTTGAGGCATGGTCATGTTTACCTAATTCATAAGAAGTCTGATTCTTTTGAAAAGT
TCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGTGCGTCGGGAAAACCTCTTATTCCCGATGTCTTTCTTTATGCGTTGAGAATAGCTTTTTCTCCTGACGTCTTACTTTATGCGTCAGGAGAACCTCTTATTCT
CGACGTCTTTTATGCTGACGTCTTTTTTTACGTTGGGAAATGTCCGATTTCTTGTAGAAATACTGAGATGTTTAGAATAGCTGAAACTCAGAATAAAAGACAAAAAGTTT
CTTCCAATGCCTTCTTATGGCACTCAAGACTCGGTCATATTAATCTCAATAGGATTGGGAGATTGGTTAAGAGTGGACTTCTAAGTCAGTTAGAAGATAACTCTTTACCT
CCTTTTGATTCCTGTCTTGAAGGGAAAATGACCAAAAGATCTTTTACTGAAAAAGGTTTTAGAGCTAAAACACCTTTAGAGCTCGTACATTCGGACATTTGTGGACCAAT
GAATGTCAAGGCTCAGGGAGGATACAAATATTTCATTAGTTTCATTGATGATTATTTGAGGCATGGTCATGTTTACCTAATTCATAAGAAGTCTGATTCTTTTGAAAAGT
TCAAATAA
Protein sequenceShow/hide protein sequence
MNCASGKPLIPDVFLYALRIAFSPDVLLYASGEPLILDVFYADVFFYVGKCPISCRNTEMFRIAETQNKRQKVSSNAFLWHSRLGHINLNRIGRLVKSGLLSQLEDNSLP
PFDSCLEGKMTKRSFTEKGFRAKTPLELVHSDICGPMNVKAQGGYKYFISFIDDYLRHGHVYLIHKKSDSFEKFK