; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G036437 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G036437
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGag/pol protein
Genome locationCla97Chr02:19901477..19904859
RNA-Seq ExpressionCla97C02G036437
SyntenyCla97C02G036437
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.6e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

A0A5A7TU93 Gag/pol protein3.6e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

A0A5A7TWB9 Gag/pol protein3.6e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

A0A5A7V4M1 Gag/pol protein3.6e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

A0A5D3CPJ6 Gag/pol protein3.6e-4574.63Show/hide
Query:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV
        L SLSEVLAKKHE+M+TAREIMDSLQE+FGQ S QIKHD LK+IYNA MNE +SV+EHVLNMMVHFNVAE+NG +I+EASQ+SFI+E L ESFLQFR+N 
Subjt:  LPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERL-ESFLQFRTNV

Query:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV
        VMNK+ Y+LTTLLNELQ +ES+ K +GQ G+ NV
Subjt:  VMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAACTCATAACTCTGCAACTAGGCATGGAGAGGAGGAGGAAGAAGAAGAAGAATCAGGTCTAATATTTGGAAAACCACGAGCTGTAGAGATCGAGTTAAGCGC
TGAAACTTTAAGGTACTCAATGCTGAAGAATATTGACAAGGCATTTGAGTATCCTAAGGACCCAGCACTGAAGGACTCGGAGAAGCGAAAGAGTTCCCAGTGCTTAGCCT
ACCCAAGCCTCATCCTGCTGAAGTTTCAAGCTGTCAACAAATACCTTTCCTTCATAGTTGTTGTACGTGTAAATAATCTTAAGTTAGTCAAGTACGGATGGACAGTGAAC
CTCACAAAAGGGGTGAAGTTTGCTGCCTTCACATCTCTTCCCAGCTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCATGGTCACTGCTCGTGAAATCATGGATTCATT
GCAAGAGATATTTGGACAACCATCCTCACAAATCAAACATGATACTCTGAAATTCATTTATAATGCACATATGAACGAGGAGTCATCAGTGCAAGAACATGTTCTCAACA
TGATGGTCCACTTCAATGTGGCTGAAATTAATGGGGTCATTATCAATGAGGCTAGTCAAATTAGTTTTATTATGGAGAGACTTGAGAGTTTCCTTCAGTTCAGGACAAAT
GTTGTTATGAACAAACTGACCTACTCTCTTACTACCCTCCTTAACGAGCTACAAATATATGAGTCCATGCATAAAGGCGAAGGACAAGATGGGAAGACAAATGTTCCTCT
AAGAAGTTCCACAGAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAAACTCATAACTCTGCAACTAGGCATGGAGAGGAGGAGGAAGAAGAAGAAGAATCAGGTCTAATATTTGGAAAACCACGAGCTGTAGAGATCGAGTTAAGCGC
TGAAACTTTAAGGTACTCAATGCTGAAGAATATTGACAAGGCATTTGAGTATCCTAAGGACCCAGCACTGAAGGACTCGGAGAAGCGAAAGAGTTCCCAGTGCTTAGCCT
ACCCAAGCCTCATCCTGCTGAAGTTTCAAGCTGTCAACAAATACCTTTCCTTCATAGTTGTTGTACGTGTAAATAATCTTAAGTTAGTCAAGTACGGATGGACAGTGAAC
CTCACAAAAGGGGTGAAGTTTGCTGCCTTCACATCTCTTCCCAGCTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCATGGTCACTGCTCGTGAAATCATGGATTCATT
GCAAGAGATATTTGGACAACCATCCTCACAAATCAAACATGATACTCTGAAATTCATTTATAATGCACATATGAACGAGGAGTCATCAGTGCAAGAACATGTTCTCAACA
TGATGGTCCACTTCAATGTGGCTGAAATTAATGGGGTCATTATCAATGAGGCTAGTCAAATTAGTTTTATTATGGAGAGACTTGAGAGTTTCCTTCAGTTCAGGACAAAT
GTTGTTATGAACAAACTGACCTACTCTCTTACTACCCTCCTTAACGAGCTACAAATATATGAGTCCATGCATAAAGGCGAAGGACAAGATGGGAAGACAAATGTTCCTCT
AAGAAGTTCCACAGAAGTTTGA
Protein sequenceShow/hide protein sequence
MKKTHNSATRHGEEEEEEEESGLIFGKPRAVEIELSAETLRYSMLKNIDKAFEYPKDPALKDSEKRKSSQCLAYPSLILLKFQAVNKYLSFIVVVRVNNLKLVKYGWTVN
LTKGVKFAAFTSLPSLSEVLAKKHETMVTAREIMDSLQEIFGQPSSQIKHDTLKFIYNAHMNEESSVQEHVLNMMVHFNVAEINGVIINEASQISFIMERLESFLQFRTN
VVMNKLTYSLTTLLNELQIYESMHKGEGQDGKTNVPLRSSTEV