; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G013886 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G013886
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag/pol protein
Genome locationCG_Chr09:21746348..21746671
RNA-Seq ExpressionClCG09G013886
SyntenyClCG09G013886
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]7.0e-3475.51Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWLKKFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0035986.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-3372.45Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+ATKEAVWLKKFLTDL+VVPNMH+ ITLYCDN+G +ANS+E RSHK  +HIERKYH + EIV +GD+TV +I SE N+ DPFTKALTAKVF+ H
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-3374.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0053385.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]9.1e-3474.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C A KEA+WL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKESRSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0058279.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-3374.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

TrEMBL top hitse value%identityAlignment
A0A5A7T187 Gag/pol protein7.5e-3472.45Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+ATKEAVWLKKFLTDL+VVPNMH+ ITLYCDN+G +ANS+E RSHK  +HIERKYH + EIV +GD+TV +I SE N+ DPFTKALTAKVF+ H
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7T2V9 Gag/pol protein3.4e-3475.51Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWLKKFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7TZD0 Gag/pol protein7.5e-3474.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7UI63 Putative gag-pol polyprotein4.4e-3474.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C A KEA+WL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKESRSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7UYE8 Gag/pol protein7.5e-3474.49Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH + EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.0e-0835.79Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVF
        +A+ +A +EA+WLK  LT + +   +   I +Y DN G I+ +     HK  +HI+ KYH   E VQ   + ++ I +E+ + D FTK L A  F
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-0732.29Show/hide
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK
        +A  +  KE +WLK+FL +L +    ++   +YCD+  AI  SK S  H   +HI+ +YH + E+V    + V +I++  N  D  TK +    F+
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTTAGAAAAAGGGTGTGACAATGTAGCTGTATGTAAAGCAACAAAAGAAGCAGTGTGGCTCAAGAAGTTCTTAACGGATCTGAAAGTTGTTCCAAATATGCATCT
GTTTATCACTCTTTATTGTGATAATAATGGTGCAATTGCAAACTCCAAAGAATCTAGAAGTCATAAGCACGACCAACACATTGAGCGAAAATACCATCCCATGGGAGAAA
TTGTGCAGAAAGGTGACATGACTGTTCAGCAGATCGCGTCAGAGCACAACATTGATGATCCATTTACAAAGGCTCTCACGGCTAAAGTGTTTAAGGGTCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCTTAGAAAAAGGGTGTGACAATGTAGCTGTATGTAAAGCAACAAAAGAAGCAGTGTGGCTCAAGAAGTTCTTAACGGATCTGAAAGTTGTTCCAAATATGCATCT
GTTTATCACTCTTTATTGTGATAATAATGGTGCAATTGCAAACTCCAAAGAATCTAGAAGTCATAAGCACGACCAACACATTGAGCGAAAATACCATCCCATGGGAGAAA
TTGTGCAGAAAGGTGACATGACTGTTCAGCAGATCGCGTCAGAGCACAACATTGATGATCCATTTACAAAGGCTCTCACGGCTAAAGTGTTTAAGGGTCACTAG
Protein sequenceShow/hide protein sequence
MVLEKGCDNVAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPMGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH