; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0026841 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0026841
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr01:27938076..27938558
RNA-Seq ExpressionCmc01g0026841
SyntenyCmc01g0026841
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046768.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-7288.68Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MK  GNAQYVLGIQIVRN KNKTLAMSQT YIDK+LSRYKMQNSKKGLLPYRY   LSKEQCPKTPQEVEDMSNIPY S +GSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKST
        IVSRYQS+P  DHWTA KNILKY RRTKDYMLVYGSKDLILT YTDSDFQTDKDARKST
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKST

KAA0050132.1 gag/pol protein [Cucumis melo var. makuwa]9.6e-7287.5Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIVRNRKNKTLAMSQT Y+DK+LSRYKMQN KK LL YRY  +LSKEQCPKTPQEVEDMSNI Y   VGSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLIL  YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

KAA0052272.1 gag/pol protein [Cucumis melo var. makuwa]4.6e-7488.75Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD  NAQYVLGI+IVRNRKNKTLAMSQT YIDK+LSRYKMQNSKK LLPYRY  +LSKEQCPKTPQEV+DMSNIPY S VGSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQ+DKD RKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

KAA0063746.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-7287.5Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIV+N+KNKT  MSQT YIDK+LSRYKMQNSKK +LPYRY  +LSKEQCPKTPQEVEDMSNI YVSVVGSLMYA+LCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRR KDYMLVYGSKDLILT YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

TYK11050.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-7489.38Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIVRN KNKTLAMSQT YIDK+LSRYKMQNSKKGLLPYRY  +LSKEQCPKTPQEVEDMSNIPY S +GSLMYAMLCTR DICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IV+RYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLILT YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

TrEMBL top hitse value%identityAlignment
A0A5A7TV73 Gag/pol protein1.6e-7288.68Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MK  GNAQYVLGIQIVRN KNKTLAMSQT YIDK+LSRYKMQNSKKGLLPYRY   LSKEQCPKTPQEVEDMSNIPY S +GSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKST
        IVSRYQS+P  DHWTA KNILKY RRTKDYMLVYGSKDLILT YTDSDFQTDKDARKST
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKST

A0A5A7U2R8 Gag/pol protein4.7e-7287.5Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIVRNRKNKTLAMSQT Y+DK+LSRYKMQN KK LL YRY  +LSKEQCPKTPQEVEDMSNI Y   VGSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLIL  YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

A0A5A7U945 Gag/pol protein2.2e-7488.75Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD  NAQYVLGI+IVRNRKNKTLAMSQT YIDK+LSRYKMQNSKK LLPYRY  +LSKEQCPKTPQEV+DMSNIPY S VGSLMYAMLCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQ+DKD RKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

A0A5A7VBE3 Gag/pol protein1.2e-7287.5Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIV+N+KNKT  MSQT YIDK+LSRYKMQNSKK +LPYRY  +LSKEQCPKTPQEVEDMSNI YVSVVGSLMYA+LCTRPDICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IVSRYQS+PG DHWTA KNILKYLRR KDYMLVYGSKDLILT YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

A0A5D3CI71 Gag/pol protein1.0e-7489.38Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD GNAQYVLGIQIVRN KNKTLAMSQT YIDK+LSRYKMQNSKKGLLPYRY  +LSKEQCPKTPQEVEDMSNIPY S +GSLMYAMLCTR DICY VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        IV+RYQS+PG DHWTA KNILKYLRRTKDYMLVYGSKDLILT YTDSDFQTDKDARKSTS
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-1534.52Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLP----YRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDIC
        M D    ++ +GI+I    +   + +SQ+ Y+ KILS++ M+N      P      YE   S E C           N P  S++G LMY MLCTRPD+ 
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLP----YRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDIC

Query:  YLVGIVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLI----LTRYTDSDFQTDKDARKSTS
          V I+SRY S    + W   K +L+YL+ T D  L++  K+L     +  Y DSD+   +  RKST+
Subjt:  YLVGIVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLI----LTRYTDSDFQTDKDARKSTS

P0CV72 Secreted RxLR effector protein 1614.3e-1442.22Show/hide
Query:  MSNIPYVSVVGSLMYAMLCTRPDICYLVGIVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLI-LTRYTDSDFQTDKDARKSTS
        M N+PY+S VG++MY M+ TRPD+   VG++S++ S P   HW A K +L+YL+ T+ Y L +       L  Y+D+D+  D ++R+STS
Subjt:  MSNIPYVSVVGSLMYAMLCTRPDICYLVGIVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLI-LTRYTDSDFQTDKDARKSTS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-3749.38Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        MKD G AQ +LG++IVR R ++ L +SQ  YI+++L R+ M+N+K    P      LSK+ CP T +E  +M+ +PY S VGSLMYAM+CTRPDI + VG
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS
        +VSR+  +PG +HW A K IL+YLR T    L +G  D IL  YTD+D   D D RKS++
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-0933.54Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        +KD     Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS     K     E      Y  +VGSL Y +  TRPDI Y V 
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDY-MLVYGSKDLILTRYTDSDFQTDKDARKSTS
         +S++   P  +H  A K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDY-MLVYGSKDLILTRYTDSDFQTDKDARKSTS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.9e-1232.92Show/hide
Query:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG
        +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y  +VGSL Y +  TRPD+ Y V 
Subjt:  MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVG

Query:  IVSRYQSSPGHDHWTADKNILKYLRRTKDY-MLVYGSKDLILTRYTDSDFQTDKDARKSTS
         +S+Y   P  DHW A K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+
Subjt:  IVSRYQSSPGHDHWTADKNILKYLRRTKDY-MLVYGSKDLILTRYTDSDFQTDKDARKSTS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTTGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAATCGAAAGAACAAAACTCTAGCCATGTCTCAAACATTTTATATAGACAAAATATTGTC
AAGATATAAGATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGAAAGTTATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAGGATATGAGTA
ACATTCCCTATGTTTCTGTTGTTGGGAGCCTGATGTATGCAATGTTATGTACTAGACCTGACATTTGTTATTTAGTAGGGATTGTTAGTAGATATCAGTCCAGTCCTGGA
CATGATCATTGGACAGCCGATAAGAATATTCTAAAATATCTTAGAAGAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTAGATACACTGACTC
CGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTTGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAATCGAAAGAACAAAACTCTAGCCATGTCTCAAACATTTTATATAGACAAAATATTGTC
AAGATATAAGATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGAAAGTTATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAGGATATGAGTA
ACATTCCCTATGTTTCTGTTGTTGGGAGCCTGATGTATGCAATGTTATGTACTAGACCTGACATTTGTTATTTAGTAGGGATTGTTAGTAGATATCAGTCCAGTCCTGGA
CATGATCATTGGACAGCCGATAAGAATATTCTAAAATATCTTAGAAGAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTAGATACACTGACTC
CGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCATGA
Protein sequenceShow/hide protein sequence
MKDFGNAQYVLGIQIVRNRKNKTLAMSQTFYIDKILSRYKMQNSKKGLLPYRYESYLSKEQCPKTPQEVEDMSNIPYVSVVGSLMYAMLCTRPDICYLVGIVSRYQSSPG
HDHWTADKNILKYLRRTKDYMLVYGSKDLILTRYTDSDFQTDKDARKSTS