CuGenDBv2

Cmc02g0049741 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0049741
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr02:15622064..15622594
RNA-Seq Expression: Cmc02g0049741
Synteny: Cmc02g0049741
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
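
As a quick consistency check on the Genome location field, the span can be parsed and its length compared with the protein listed under Sequences. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr02:15622064..15622594")
length = span_length(start, end)
print(seqid, length)
```

Under that assumption the span is 531 bp, which equals the 176 residues of the protein sequence plus one stop codon (177 codons of 3 bp), so the record is consistent with a gene whose coding sequence covers the whole locus without introns.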


Homology
GenBank top hits    e value    % identity    Alignment
KAA0034962.1 gag/pol protein [Cucumis melo var. makuwa]    2.1e-80    89.2
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNILYA  VGSLMYAMLCTRPDICY + IVSRYQSN   DHWT VKNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYMLVYGSKD ILTGYTDS+FQTDKDARKSTSGSVFTLNGGAVVW SIKQSCI +STMEAEYVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

KAA0046766.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-87    98.2
Query:  MLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY
        M + YKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY
Subjt:  MLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY

Query:  GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
Subjt:  GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

KAA0050132.1 gag/pol protein [Cucumis melo var. makuwa]    1.5e-81    88.64
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSY+DKMLSRYKM+N KK LL YRYGIHLSKEQC KTPQEVEDMSNI YA+ VGSLMYAMLCTRPDICYS+ IVSRYQSN G DHWTA+KNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYMLVYGSKDLIL GYTDS FQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTME EYVAACEAAK+ V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

KAA0058854.1 gag/pol protein [Cucumis melo var. makuwa]    3.6e-80    87.5
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM+NSKK LLPYRYGIHLSKEQC KTPQEVEDMSNI YAF +GSLMYAMLC RPDICYS+ IVS YQSN G DHWT VKNI+KYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYM VYGSKDLILT YTDS FQTDKDARKSTSGSVF LNGGAVVWRSIKQSCIADSTME +YVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

KAA0063746.1 gag/pol protein [Cucumis melo var. makuwa]    4.1e-84    91.48
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM+NSKK +LPYRYGIHLSKEQC KTPQEVEDMSNILY  VVGSLMYA+LCTRPDICYS+ IVSRYQSN G DHWTAVKNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        RIKDYMLVYGSKDLILTGYTDS FQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIA+ TMEAEYVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
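
The % identity column can be related to the alignment midlines (the unlabeled line between Query and Subjt), where a letter marks an identical residue, `+` a similar one, and a space a mismatch or gap. The following is a minimal sketch, not the method the database itself uses; it assumes identity is counted over the full aligned length and that each midline keeps its full width (no trailing blanks stripped). The two midline blocks are copied from the KAA0046766.1 hit above; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline_blocks):
    """Return (identities, aligned_length, percent) from BLAST-style midlines."""
    midline = "".join(midline_blocks)
    # Letters are identical residues; '+' and spaces are not identities.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Assumption: each midline block is as wide as its Query line.
    aligned_length = len(midline)
    return identities, aligned_length, 100.0 * identities / aligned_length


# Midline blocks of the KAA0046766.1 alignment, copied verbatim.
blocks = [
    "M + YKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY",
    "GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV",
]
identities, aligned_length, pct = percent_identity(blocks)
print(identities, aligned_length, round(pct, 1))
```

For this hit the count is 164 identities over 167 aligned positions, which rounds to the 98.2 listed in the table.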

TrEMBL top hits    e value    % identity    Alignment
A0A5A7TXS5 Gag/pol protein    6.7e-88    98.2
Query:  MLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY
        M + YKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY
Subjt:  MLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVY

Query:  GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
Subjt:  GSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

A0A5A7U2R8 Gag/pol protein    7.1e-82    88.64
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSY+DKMLSRYKM+N KK LL YRYGIHLSKEQC KTPQEVEDMSNI YA+ VGSLMYAMLCTRPDICYS+ IVSRYQSN G DHWTA+KNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYMLVYGSKDLIL GYTDS FQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTME EYVAACEAAK+ V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

A0A5A7VBE3 Gag/pol protein    2.0e-84    91.48
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM+NSKK +LPYRYGIHLSKEQC KTPQEVEDMSNILY  VVGSLMYA+LCTRPDICYS+ IVSRYQSN G DHWTAVKNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        RIKDYMLVYGSKDLILTGYTDS FQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIA+ TMEAEYVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

A0A5D3DCF0 Gag/pol protein    1.0e-80    89.2
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNILYA  VGSLMYAMLCTRPDICY + IVSRYQSN   DHWT VKNILKYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYMLVYGSKD ILTGYTDS+FQTDKDARKSTSGSVFTLNGGAVVW SIKQSCI +STMEAEYVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

A0A5D3DJL5 Gag/pol protein    1.8e-80    87.5
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        MSQTSYIDKMLSRYKM+NSKK LLPYRYGIHLSKEQC KTPQEVEDMSNI YAF +GSLMYAMLC RPDICYS+ IVS YQSN G DHWT VKNI+KYLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        R KDYM VYGSKDLILT YTDS FQTDKDARKSTSGSVF LNGGAVVWRSIKQSCIADSTME +YVAACEAAKE V
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    1.1e-18    32.97
Query:  MSQTSYIDKMLSRYKMKNSKKGLLP----YRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNIL
        +SQ++Y+ K+LS++ M+N      P      Y +  S E C           N     ++G LMY MLCTRPD+  ++ I+SRY S    + W  +K +L
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLP----YRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNIL

Query:  KYLRRIKDYMLVYGSKDLI----LTGYTDSHFQTDKDARKSTSGSVFTL-NGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        +YL+   D  L++  K+L     + GY DS +   +  RKST+G +F + +   + W + +Q+ +A S+ EAEY+A  EA +E +
Subjt:  KYLRRIKDYMLVYGSKDLI----LTGYTDSHFQTDKDARKSTSGSVFTL-NGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

P0CV72 Secreted RxLR effector protein 161    3.0e-21    41.22
Query:  MSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVYGSKDLI-LTGYTDSHFQTDKDARKSTSGSVFTLNGGA
        M N+ Y   VG++MY M+ TRPD+  ++ ++S++ S+    HW A+K +L+YL+  + Y L +       L GY+D+ +  D ++R+STSG +F LNGG 
Subjt:  MSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVYGSKDLI-LTGYTDSHFQTDKDARKSTSGSVFTLNGGA

Query:  VVWRSIKQSCIADSTMEAEYVAACEAAKEVV
        V WRS KQ  +A S+ E EY+A  EA +E V
Subjt:  VVWRSIKQSCIADSTMEAEYVAACEAAKEVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    5.0e-40    47.73
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        +SQ  YI+++L R+ MKN+K    P    + LSK+ C  T +E  +M+ + Y+  VGSLMYAM+CTRPDI +++ +VSR+  N G +HW AVK IL+YLR
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
              L +G  D IL GYTD+    D D RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE++
Subjt:  RIKDYMLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    6.4e-11    30.68
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        +SQ  YI  +L+R  M  +K    P      LS     K     E      Y  +VGSL Y +  TRPDI Y++  +S++      +H  A+K IL+YL 
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDY-MLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEV
           ++ + +     L L  Y+D+ +  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +    + E+
Subjt:  RIKDY-MLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.4e-13    30.68
Query:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR
        +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y  +VGSL Y +  TRPD+ Y++  +S+Y      DHW A+K +L+YL 
Subjt:  MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLR

Query:  RIKDY-MLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEV
           D+ + +     L L  Y+D+ +  D D   ST+G +  L    + W S KQ  +  S+ EAEY +    + E+
Subjt:  RIKDY-MLVYGSKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEV

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    2.9e-11    33.33
Query:  YAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVYGSK-DLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRS
        Y  ++G LMY  + TR DI +++  +S++       H  AV  IL Y++      L Y S+ ++ L  ++D+ FQ+ KD R+ST+G    L    + W+S
Subjt:  YAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVYGSK-DLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRS

Query:  IKQSCIADSTMEAEYVAACEAAKEVV
         KQ  ++ S+ EAEY A   A  E++
Subjt:  IKQSCIADSTMEAEYVAACEAAKEVV


Sequences
CDS sequence
ATGTCTCAAACATCTTATATAGACAAAATGTTATCAAGATATAAGATGAAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTATCAAAAGAACAATG
TCTAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCTCTATGCTTTTGTTGTTGGGAGCCTGATGTATGCTATGTTATGTACTAGACCTGACATTTGCTATTCAA
TGGAGATTGTGAGTAGATATCAGTCCAATTTTGGATGTGATCATTGGACAGCCGTTAAGAATATTCTAAAATATCTTAGAAGAATAAAAGACTATATGCTTGTGTATGGT
TCTAAGGATCTGATTCTTACTGGATACACTGACTCCCATTTTCAAACTGATAAAGATGCTAGAAAGTCTACTTCAGGATCAGTTTTCACTCTGAACGGAGGAGCAGTAGT
ATGGAGAAGCATAAAACAGTCTTGTATTGCTGACTCTACTATGGAAGCTGAATATGTAGCTGCCTGTGAAGCAGCCAAAGAAGTAGTATGA
mRNA sequence
ATGTCTCAAACATCTTATATAGACAAAATGTTATCAAGATATAAGATGAAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTATCAAAAGAACAATG
TCTAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCTCTATGCTTTTGTTGTTGGGAGCCTGATGTATGCTATGTTATGTACTAGACCTGACATTTGCTATTCAA
TGGAGATTGTGAGTAGATATCAGTCCAATTTTGGATGTGATCATTGGACAGCCGTTAAGAATATTCTAAAATATCTTAGAAGAATAAAAGACTATATGCTTGTGTATGGT
TCTAAGGATCTGATTCTTACTGGATACACTGACTCCCATTTTCAAACTGATAAAGATGCTAGAAAGTCTACTTCAGGATCAGTTTTCACTCTGAACGGAGGAGCAGTAGT
ATGGAGAAGCATAAAACAGTCTTGTATTGCTGACTCTACTATGGAAGCTGAATATGTAGCTGCCTGTGAAGCAGCCAAAGAAGTAGTATGA
Protein sequence
MSQTSYIDKMLSRYKMKNSKKGLLPYRYGIHLSKEQCLKTPQEVEDMSNILYAFVVGSLMYAMLCTRPDICYSMEIVSRYQSNFGCDHWTAVKNILKYLRRIKDYMLVYG
SKDLILTGYTDSHFQTDKDARKSTSGSVFTLNGGAVVWRSIKQSCIADSTMEAEYVAACEAAKEVV
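
The relationship between the CDS and protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code, which this page does not state explicitly; `translate` and `CODON_TABLE` are hypothetical names for illustration. To keep the example short, only the first 108 nt and the last 12 nt of the CDS are used.

```python
from itertools import product

# Standard genetic code, laid out in T, C, A, G order at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 108 nt of the CDS sequence listed above.
cds_start = (
    "ATGTCTCAAACATCTTATATAGACAAAATGTTATCAAGATATAAGATGAAGAATTCC"
    "AAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTATCAAAAGAACAA"
)
# Last 12 nt of the CDS sequence, ending in the TGA stop codon.
cds_end = "GAAGTAGTATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The first call yields the opening 36 residues of the protein sequence (MSQTSY...SKEQ) and the second yields its final three residues (EVV), with the TGA stop codon left untranslated.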