; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0221461 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0221461
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:9272126..9273076
RNA-Seq ExpressionCmc08g0221461
SyntenyCmc08g0221461
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-10569.72Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS RRF P  SGS+KIQKRK GKGKG T+A E KGKAK                                        
Subjt:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------

Query:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE
                                  ETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSINFSMNE
Subjt:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE

Query:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR
        AFI KN VHIC AKLENNLYVL+PNEAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRL HINLDRI RLVKNGLL++L+D SLPPCESCLE KMTKR
Subjt:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR

Query:  FFTGKGDRAKEPLELIH
         FTGKG RAKEPLELIH
Subjt:  FFTGKGDRAKEPLELIH

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]7.9e-10569.4Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS RRF P  SGS+KIQKRK GKGKG T+A E KGKAK                                        
Subjt:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------

Query:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE
                                  ETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSINFSMNE
Subjt:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE

Query:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR
        AFI KN VHIC AKLENNLYVL+PNEAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRL HINLDRI RLVK+GLL++L+D SLPPCESCLE KMTKR
Subjt:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR

Query:  FFTGKGDRAKEPLELIH
         FTGKG RAKEPLELIH
Subjt:  FFTGKGDRAKEPLELIH

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-6366.67Show/hide
Query:  SSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPNEAK
        SS++QLE  EMT++VGTG V+SA AVG  +L+    F+ LEN+Y+VP +KRNL+ V CL+E  YS+ F++N+ FI KN V IC AKLENNLYVL+   +K
Subjt:  SSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPNEAK

Query:  AVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        A+LN EMF+TA TQNKR  ISP  N +LWHLRL HINL+RIERLVKNGLLSELE++SLP CESCLE KMTKR FTGKG RAKEPLEL+H
Subjt:  AVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-10468.99Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------
        MDSLR +FGQPSIQIKQE NVAHS+RFA  S GS+KIQKRK GKGKG T+A EGKGK K                                         
Subjt:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------

Query:  -------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEA
                                 ETSSFKQLEESEM LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSI+FSMNEA
Subjt:  -------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEA

Query:  FISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRF
        FISKN VHIC  KLE+NLYVLKPNE KAVLNHEMFRTANTQNKR+ IS NNNTYLWHLRL HINLDRI RLVKNGLL++LEDDSLPPCESCLE KMTKR 
Subjt:  FISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRF

Query:  FTGKGDRAKEPLELIH
        FTGKG RAKEPLELIH
Subjt:  FTGKGDRAKEPLELIH

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-10674.66Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS+RFAP SSGS+KIQKRK GKG+G T+A EGKGKAK                                         
Subjt:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------

Query:  -ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPN
         ETSSFKQLEESEMTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLVFVSCLIEH YSINFSMNEAFISKN      AKLE+NLYVL+PN
Subjt:  -ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPN

Query:  EAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        EAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRLDHINLDRI RLVKNGLL++L+DDSLPPCESCLE KMTKR FTGK  RAKEPLELIH
Subjt:  EAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein3.8e-10569.4Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS RRF P  SGS+KIQKRK GKGKG T+A E KGKAK                                        
Subjt:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------

Query:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE
                                  ETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSINFSMNE
Subjt:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE

Query:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR
        AFI KN VHIC AKLENNLYVL+PNEAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRL HINLDRI RLVK+GLL++L+D SLPPCESCLE KMTKR
Subjt:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR

Query:  FFTGKGDRAKEPLELIH
         FTGKG RAKEPLELIH
Subjt:  FFTGKGDRAKEPLELIH

A0A5A7TU93 Gag/pol protein9.5e-6466.67Show/hide
Query:  SSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPNEAK
        SS++QLE  EMT++VGTG V+SA AVG  +L+    F+ LEN+Y+VP +KRNL+ V CL+E  YS+ F++N+ FI KN V IC AKLENNLYVL+   +K
Subjt:  SSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPNEAK

Query:  AVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        A+LN EMF+TA TQNKR  ISP  N +LWHLRL HINL+RIERLVKNGLLSELE++SLP CESCLE KMTKR FTGKG RAKEPLEL+H
Subjt:  AVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH

A0A5A7TZD0 Gag/pol protein1.0e-10569.72Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS RRF P  SGS+KIQKRK GKGKG T+A E KGKAK                                        
Subjt:  MDSLRVIFGQPSIQIKQEANVAHS-RRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK----------------------------------------

Query:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE
                                  ETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSINFSMNE
Subjt:  --------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNE

Query:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR
        AFI KN VHIC AKLENNLYVL+PNEAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRL HINLDRI RLVKNGLL++L+D SLPPCESCLE KMTKR
Subjt:  AFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKR

Query:  FFTGKGDRAKEPLELIH
         FTGKG RAKEPLELIH
Subjt:  FFTGKGDRAKEPLELIH

A0A5A7VJG3 Gag/pol protein9.1e-10774.66Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------
        MDSLR +FGQPSIQIKQEANVAHS+RFAP SSGS+KIQKRK GKG+G T+A EGKGKAK                                         
Subjt:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------

Query:  -ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPN
         ETSSFKQLEESEMTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLVFVSCLIEH YSINFSMNEAFISKN      AKLE+NLYVL+PN
Subjt:  -ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPN

Query:  EAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        EAKAVLNHEMFRTANTQNKR+ ISPNNNTYLWHLRLDHINLDRI RLVKNGLL++L+DDSLPPCESCLE KMTKR FTGK  RAKEPLELIH
Subjt:  EAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH

A0A5D3BNE1 Gag/pol protein8.5e-10568.99Show/hide
Query:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------
        MDSLR +FGQPSIQIKQE NVAHS+RFA  S GS+KIQKRK GKGKG T+A EGKGK K                                         
Subjt:  MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAK-----------------------------------------

Query:  -------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEA
                                 ETSSFKQLEESEM LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLV VSCLIEH YSI+FSMNEA
Subjt:  -------------------------ETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVFVSCLIEHKYSINFSMNEA

Query:  FISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRF
        FISKN VHIC  KLE+NLYVLKPNE KAVLNHEMFRTANTQNKR+ IS NNNTYLWHLRL HINLDRI RLVKNGLL++LEDDSLPPCESCLE KMTKR 
Subjt:  FISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRF

Query:  FTGKGDRAKEPLELIH
        FTGKG RAKEPLELIH
Subjt:  FTGKGDRAKEPLELIH

SwissProt top hitse value%identityAlignment
P93293 Uncharacterized mitochondrial protein AtMg003004.8e-0436.36Show/hide
Query:  NNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        + T LWH RL H++   +E LVK G L   +  SL  CE C+  K  +  F+      K PL+ +H
Subjt:  NNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein3.4e-0536.36Show/hide
Query:  NNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH
        + T LWH RL H++   +E LVK G L   +  SL  CE C+  K  +  F+      K PL+ +H
Subjt:  NNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLPPCESCLEEKMTKRFFTGKGDRAKEPLELIH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCCTTAGAGTGATATTTGGGCAACCGTCCATTCAGATCAAACAAGAGGCAAATGTTGCCCATTCCAGGAGGTTTGCACCTTTATCTTCTGGATCTAAAAAAAT
TCAGAAGAGGAAAGGAGGGAAGGGGAAAGGTTCTACTGTTGCTGCTGAGGGCAAAGGGAAGGCTAAGGAAACTAGTTCCTTCAAGCAGCTTGAGGAGAGTGAGATGACAC
TCAAGGTTGGAACAGGGGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTTGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATT
AAAAGGAACTTAGTTTTCGTTTCTTGTCTTATTGAACATAAGTATTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGATGTACATATTTGTTTGGCTAA
GCTTGAAAACAACTTATATGTATTAAAACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGAAAATAATTTCTCCAA
ATAATAATACCTATCTTTGGCATTTAAGATTAGATCACATAAATCTCGATCGAATCGAGAGATTGGTAAAGAATGGACTTTTAAGCGAGTTAGAGGATGATTCATTACCT
CCATGTGAATCTTGTCTTGAAGAAAAAATGACAAAAAGATTTTTTACTGGAAAAGGTGATAGGGCCAAAGAGCCTTTAGAACTTATACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTCCCTTAGAGTGATATTTGGGCAACCGTCCATTCAGATCAAACAAGAGGCAAATGTTGCCCATTCCAGGAGGTTTGCACCTTTATCTTCTGGATCTAAAAAAAT
TCAGAAGAGGAAAGGAGGGAAGGGGAAAGGTTCTACTGTTGCTGCTGAGGGCAAAGGGAAGGCTAAGGAAACTAGTTCCTTCAAGCAGCTTGAGGAGAGTGAGATGACAC
TCAAGGTTGGAACAGGGGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTTGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATT
AAAAGGAACTTAGTTTTCGTTTCTTGTCTTATTGAACATAAGTATTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGATGTACATATTTGTTTGGCTAA
GCTTGAAAACAACTTATATGTATTAAAACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGAAAATAATTTCTCCAA
ATAATAATACCTATCTTTGGCATTTAAGATTAGATCACATAAATCTCGATCGAATCGAGAGATTGGTAAAGAATGGACTTTTAAGCGAGTTAGAGGATGATTCATTACCT
CCATGTGAATCTTGTCTTGAAGAAAAAATGACAAAAAGATTTTTTACTGGAAAAGGTGATAGGGCCAAAGAGCCTTTAGAACTTATACATTAG
Protein sequenceShow/hide protein sequence
MDSLRVIFGQPSIQIKQEANVAHSRRFAPLSSGSKKIQKRKGGKGKGSTVAAEGKGKAKETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKI
KRNLVFVSCLIEHKYSINFSMNEAFISKNDVHICLAKLENNLYVLKPNEAKAVLNHEMFRTANTQNKRKIISPNNNTYLWHLRLDHINLDRIERLVKNGLLSELEDDSLP
PCESCLEEKMTKRFFTGKGDRAKEPLELIH