; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0159831 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0159831
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:8066298..8066927
RNA-Seq ExpressionCmc06g0159831
SyntenyCmc06g0159831
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-10793.81Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSIN SMNEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRLGHI LD+IGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKARGGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-10592.86Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSIN SMNEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRLGHI LD+IGRLVK+GLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKARG FEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-7668.1Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT+ VGTG V+SA  VG  +L+    F+ LEN+Y+V  +KRNL+SV CL+E  YS+  ++N+ FI KNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFISL
        A TQNKR +I P  N +LWHLRLGHI L++I RLVKNGLL++L+++SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS LCGPMNVKARGGFEYFI+ 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFISL

Query:  -DDYSRYGYL
         DDYSRYGY+
Subjt:  -DDYSRYGYL

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-10290Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M L VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSI+ SMNEAFISKNGVHICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI  NNNTYLWHLRLGHI LD+IGRLVKNGLLNKL+DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKA GGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYS YGYL
Subjt:  LDDYSRYGYL

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-9990Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLMVGTGDVISAR VGD KLFFG KFMFLENLYIV KIKRNLV VSCLIEH+YSIN SMNEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRL HI LD+IGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGK YRAKEPLELIHS LCGPMNVKARGGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein8.4e-10692.86Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSIN SMNEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRLGHI LD+IGRLVK+GLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKARG FEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

A0A5A7TU93 Gag/pol protein2.4e-7668.1Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT+ VGTG V+SA  VG  +L+    F+ LEN+Y+V  +KRNL+SV CL+E  YS+  ++N+ FI KNGV ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFISL
        A TQNKR +I P  N +LWHLRLGHI L++I RLVKNGLL++L+++SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS LCGPMNVKARGGFEYFI+ 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFISL

Query:  -DDYSRYGYL
         DDYSRYGY+
Subjt:  -DDYSRYGYL

A0A5A7TZD0 Gag/pol protein4.5e-10793.81Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSIN SMNEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRLGHI LD+IGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKARGGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

A0A5A7VJG3 Gag/pol protein5.3e-10090Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLMVGTGDVISAR VGD KLFFG KFMFLENLYIV KIKRNLV VSCLIEH+YSIN SMNEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI PNNNTYLWHLRL HI LD+IGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGK YRAKEPLELIHS LCGPMNVKARGGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYSRYGYL
Subjt:  LDDYSRYGYL

A0A5D3BNE1 Gag/pol protein1.1e-10290Show/hide
Query:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M L VGTGDVISAR VGDAKLFFGNKFMFLENLYIV KIKRNLVSVSCLIEH+YSI+ SMNEAFISKNGVHICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-
        ANTQNKRQRI  NNNTYLWHLRLGHI LD+IGRLVKNGLLNKL+DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS LCGPMNVKA GGFEYFIS 
Subjt:  ANTQNKRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-

Query:  LDDYSRYGYL
        +DDYS YGYL
Subjt:  LDDYSRYGYL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-0932.09Show/hide
Query:  LENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRICPNNNTYLWHLRLGHI--
        LE++    +   NL+SV  L E   SI    +   ISKNG+ +  ++ + NN+          V+N + + + N ++K       NN  LWH R GHI  
Subjt:  LENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRICPNNNTYLWHLRLGHI--

Query:  -KLDQIGR---LVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSGLCGPMNVKARGGFEYF-ISLDDYSRY
         KL +I R        LLN L + S   CE CL GK  + PF     +   K PL ++HS +CGP+         YF I +D ++ Y
Subjt:  -KLDQIGR---LVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSGLCGPMNVKARGGFEYF-ISLDDYSRY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-1237.08Show/hide
Query:  LWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-LDDYSR
        LWH R+GH+    +  L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++S +CGPM +++ GG +YF++ +DD SR
Subjt:  LWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFIS-LDDYSR

P93293 Uncharacterized mitochondrial protein AtMg003001.3e-0738.67Show/hide
Query:  NNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNV
        + T LWH RL H+    +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HS L G  +V
Subjt:  NNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNV

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein9.5e-0938.67Show/hide
Query:  NNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNV
        + T LWH RL H+    +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HS L G  +V
Subjt:  NNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCATGGTTGGAACGGGAGATGTCATTTCAGCTCGTGTAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATA
GTTCTTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATTTACTCAATTAATATTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTA
CATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACAGCTAATACTCAAAAT
AAAAGGCAAAGAATTTGTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAAACTCGATCAGATCGGGAGATTGGTAAAGAATGGACTTCTA
AACAAGTTAAAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTA
GAACTTATACATTCAGGCCTTTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTAGATGATTATTCTAGGTATGGTTATTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGACACTCATGGTTGGAACGGGAGATGTCATTTCAGCTCGTGTAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATA
GTTCTTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATTTACTCAATTAATATTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTA
CATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACAGCTAATACTCAAAAT
AAAAGGCAAAGAATTTGTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAAACTCGATCAGATCGGGAGATTGGTAAAGAATGGACTTCTA
AACAAGTTAAAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTA
GAACTTATACATTCAGGCCTTTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTAGATGATTATTCTAGGTATGGTTATTTATAA
Protein sequenceShow/hide protein sequence
MTLMVGTGDVISARVVGDAKLFFGNKFMFLENLYIVLKIKRNLVSVSCLIEHIYSINISMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQN
KRQRICPNNNTYLWHLRLGHIKLDQIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSGLCGPMNVKARGGFEYFISLDDYSRYGYL