CuGenDBv2

Clc09G15370 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G15370
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag/pol protein
Genome location: ClcChr09:20850695..20851018
RNA-Seq Expression: Clc09G15370
Synteny: Clc09G15370
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
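The genome location above has the form `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span length can be computed as follows. The function name `parse_location` is illustrative only and is not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string like 'ClcChr09:20850695..20851018' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr09:20850695..20851018")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, span_length)
```

Under that assumption the span is 324 bp, which is divisible by three, as expected for an uninterrupted coding region.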


Homology
GenBank top hits
KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.4e-34; % identity: 76.53
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWLKKFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0035986.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 5.3e-34; % identity: 73.47
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+ATKEAVWLKKFLTDL+VVPNMH+ ITLYCDN+G +ANS+E RSHK  +HIERKYH I EIV +GD+TV +I SE N+ DPFTKALTAKVF+ H
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 5.3e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0053385.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 3.1e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C A KEA+WL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKESRSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

KAA0058279.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 5.3e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

TrEMBL top hits
A0A5A7T187 Gag/pol protein; e value: 2.6e-34; % identity: 73.47
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+ATKEAVWLKKFLTDL+VVPNMH+ ITLYCDN+G +ANS+E RSHK  +HIERKYH I EIV +GD+TV +I SE N+ DPFTKALTAKVF+ H
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7T2V9 Gag/pol protein; e value: 1.2e-34; % identity: 76.53
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWLKKFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7TZD0 Gag/pol protein; e value: 2.6e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7UI63 Putative gag-pol polyprotein; e value: 1.5e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C A KEA+WL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKESRSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

A0A5A7UYE8 Gag/pol protein; e value: 2.6e-34; % identity: 75.51
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH
        VA C+A KEAVWL+KFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ+GD+ V +IASEHNI DPFTK LTAKVF+GH
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFKGH

SwissProt top hits
P04146 Copia protein; e value: 3.1e-08; % identity: 35.79
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVF
        +A+ +A +EA+WLK  LT + +   +   I +Y DN G I+ +     HK  +HI+ KYH   E VQ   + ++ I +E+ + D FTK L A  F
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 4.0e-08; % identity: 33.33
Query:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK
        +A  +  KE +WLK+FL +L +    ++   +YCD+  AI  SK S  H   +HI+ +YH I E+V    + V +I++  N  D  TK +    F+
Subjt:  VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK

Arabidopsis top hits
No hits found
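
The % identity values in the homology tables can be checked against the alignment midlines. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap) and uses the Query and midline rows of the first GenBank hit, KAA0035907.1. The helper name `percent_identity` is illustrative only.

```python
def percent_identity(midline, aligned_length):
    """Percent identity from a BLAST-style midline.

    Only letters count as identities; '+' (similar residue) and
    blanks (mismatch or gap) do not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and midline rows copied from the KAA0035907.1 alignment.
query = (
    "VAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQ"
    "KGDMTVQQIASEHNIDDPFTKALTAKVFKGH"
)
midline = (
    "VA C+A KEAVWLKKFL DL+VVPNM+L ITLYCDN+GA+ANSKE RSHK  +HIERKYH I EIVQ"
    "+GD+ V +IASEHNI DPFTK LTAKVF+GH"
)

# The query row length is used as the alignment length, so trailing
# blanks lost from the midline do not change the result.
identity = percent_identity(midline, len(query))
print(f"{identity:.2f}")
```

For this hit the midline has 75 identical positions over 98 aligned residues, which agrees with the reported 76.53.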

Sequences
CDS sequence
ATGGTCTTAGAAAAAGGGTGTGACAATGTAGCTGTATGTAAAGCAACAAAAGAAGCAGTGTGGCTCAAGAAGTTCTTAACGGATCTGAAAGTTGTTCCAAATATG
CATCTGTTTATCACTCTTTATTGTGATAATAATGGTGCAATTGCAAACTCCAAAGAATCTAGAAGTCATAAGCACGACCAACACATTGAGCGAAAATACCATCCC
ATCGGAGAAATTGTGCAGAAAGGTGACATGACTGTTCAGCAGATCGCGTCAGAGCACAACATTGATGATCCATTTACAAAGGCTCTCACGGCTAAAGTGTTTAAG
GGTCACTAG
mRNA sequence
ATGGTCTTAGAAAAAGGGTGTGACAATGTAGCTGTATGTAAAGCAACAAAAGAAGCAGTGTGGCTCAAGAAGTTCTTAACGGATCTGAAAGTTGTTCCAAATATG
CATCTGTTTATCACTCTTTATTGTGATAATAATGGTGCAATTGCAAACTCCAAAGAATCTAGAAGTCATAAGCACGACCAACACATTGAGCGAAAATACCATCCC
ATCGGAGAAATTGTGCAGAAAGGTGACATGACTGTTCAGCAGATCGCGTCAGAGCACAACATTGATGATCCATTTACAAAGGCTCTCACGGCTAAAGTGTTTAAG
GGTCACTAG
Protein sequence
MVLEKGCDNVAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK
GH
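
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and no ambiguous bases. The names `CODON_TABLE` and `translate` are illustrative and do not come from any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS codon by codon; a stop codon is written as '*'."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from the record above.
CDS = (
    "ATGGTCTTAGAAAAAGGGTGTGACAATGTAGCTGTATGTAAAGCAACAAAAGAAGCAGTGTGGCTCAAGAAGTTCTTAACGGATCTGAAAGTTGTTCCAAATATG"
    "CATCTGTTTATCACTCTTTATTGTGATAATAATGGTGCAATTGCAAACTCCAAAGAATCTAGAAGTCATAAGCACGACCAACACATTGAGCGAAAATACCATCCC"
    "ATCGGAGAAATTGTGCAGAAAGGTGACATGACTGTTCAGCAGATCGCGTCAGAGCACAACATTGATGATCCATTTACAAAGGCTCTCACGGCTAAAGTGTTTAAG"
    "GGTCACTAG"
)
PROTEIN = (
    "MVLEKGCDNVAVCKATKEAVWLKKFLTDLKVVPNMHLFITLYCDNNGAIANSKESRSHKHDQHIERKYHPIGEIVQKGDMTVQQIASEHNIDDPFTKALTAKVFK"
    "GH"
)

translated = translate(CDS)
# Drop the terminal stop before comparing with the listed protein.
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 324 nt long (107 codons plus a stop codon), so a matching translation yields the 107-residue protein listed above.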