; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G193350 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G193350
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGag/pol protein
Genome locationCla97Chr10:21731019..21734130
RNA-Seq ExpressionCla97C10G193350
SyntenyCla97C10G193350
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]6.7e-3481.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V+DMR+ PY+SAVG+L YAMLCTRP+ICY+V IVSR+QSN G DHWT VKNILKYLRRTRNYMLVYGAKDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-3381.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V+DMR  PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G DHWT VK ILKYLRRTR+YMLVYGAKDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

KAA0046416.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-3380.46Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQDV++MRH PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G  HWT VK ILKYLRRTR+YMLVYG+KDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-3380.46Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQDV++MRH PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G  HWT VK ILKYLRRTR+YMLVYG+KDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

KAA0050103.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-3381.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V DMR +PYASAVG+L Y MLCTRP+ICYAVRIVSR+QSN G DHWTVVK ILKYLRRTR+YMLVYGAKDLILTGY D DF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

TrEMBL top hitse value%identityAlignment
A0A5A7SIN2 Gag/pol protein2.7e-3381.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V+DMR  PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G DHWT VK ILKYLRRTR+YMLVYGAKDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

A0A5A7TWB9 Gag/pol protein2.7e-3380.46Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQDV++MRH PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G  HWT VK ILKYLRRTR+YMLVYG+KDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

A0A5A7U2H0 Gag/pol protein2.1e-3381.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V DMR +PYASAVG+L Y MLCTRP+ICYAVRIVSR+QSN G DHWTVVK ILKYLRRTR+YMLVYGAKDLILTGY D DF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

A0A5A7ULH1 Gag/pol protein2.7e-3380.46Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQDV++MRH PYASAVG+L YAMLCTRP+ICYAV IVSR+QSN G  HWT VK ILKYLRRTR+YMLVYG+KDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

E2GK51 Gag/pol protein (Fragment)3.2e-3481.61Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF
        P+TPQ+V+DMR+ PY+SAVG+L YAMLCTRP+ICY+V IVSR+QSN G DHWT VKNILKYLRRTRNYMLVYGAKDLILTGY DSDF
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSDF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-0835.05Show/hide
Query:  TPPPPRTPQDV---DDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYG---AKDLILTGYIDSDF
        TP P +   ++   D+  ++P  S +G L Y MLCTRP++  AV I+SR+ S    + W  +K +L+YL+ T +  L++    A +  + GY+DSD+
Subjt:  TPPPPRTPQDV---DDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYG---AKDLILTGYIDSDF

P0CV72 Secreted RxLR effector protein 1611.2e-0940.51Show/hide
Query:  MRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVY-GAKDLILTGYIDSDF
        M++ PY SAVG + Y M+ TRP++  AV ++S+F S+    HW  +K +L+YL+ T+ Y L +  A    L GY D+D+
Subjt:  MRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVY-GAKDLILTGYIDSDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-1852.33Show/hide
Query:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSD
        P T ++  +M   PY+SAVG+L YAM+CTRP+I +AV +VSRF  N G +HW  VK IL+YLR T    L +G  D IL GY D+D
Subjt:  PRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVYGAKDLILTGYIDSD

P25600 Putative transposon Ty5-1 protein YCL074W2.6e-0436.49Show/hide
Query:  SPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVY-GAKDLILTGYIDS
        +PY S VG L +     RP+I Y V ++SRF       H    + +L+YL  TR+  L Y     L LT Y D+
Subjt:  SPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNYMLVY-GAKDLILTGYIDS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-0433.78Show/hide
Query:  YASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNY-MLVYGAKDLILTGYIDSDF
        Y   VG+L+Y +  TRP++ YAV  +S++      DHW  +K +L+YL  T ++ + +     L L  Y D+D+
Subjt:  YASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLRRTRNY-MLVYGAKDLILTGYIDSDF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATCAAAAACAAGGTTGCTTTGGTGGGTATGGTAGTTGTGATGGTGGTTATGGCTCAATCCGCCACTGCTGCCGTGGATGAGGGGATTCAGCTCCCGGCCGAAGG
CGCAGCGAGTTATTACCATACGCCGCCGCCTCCGAGAACACCTCAAGATGTTGATGACATGAGACATAGTCCTTATGCATCAGCAGTGGGCAACTTGAGGTATGCTATGT
TGTGTACACGACCTAACATTTGCTATGCAGTAAGGATTGTCAGTAGATTTCAGTCCAATACTGGATATGATCATTGGACTGTCGTTAAAAATATCCTCAAGTATCTAAGG
AGAACAAGGAACTACATGCTTGTGTATGGCGCTAAGGATCTTATTCTTACAGGATACATTGACTCTGATTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGATCAAAAACAAGGTTGCTTTGGTGGGTATGGTAGTTGTGATGGTGGTTATGGCTCAATCCGCCACTGCTGCCGTGGATGAGGGGATTCAGCTCCCGGCCGAAGG
CGCAGCGAGTTATTACCATACGCCGCCGCCTCCGAGAACACCTCAAGATGTTGATGACATGAGACATAGTCCTTATGCATCAGCAGTGGGCAACTTGAGGTATGCTATGT
TGTGTACACGACCTAACATTTGCTATGCAGTAAGGATTGTCAGTAGATTTCAGTCCAATACTGGATATGATCATTGGACTGTCGTTAAAAATATCCTCAAGTATCTAAGG
AGAACAAGGAACTACATGCTTGTGTATGGCGCTAAGGATCTTATTCTTACAGGATACATTGACTCTGATTTCTAG
Protein sequenceShow/hide protein sequence
MEIKNKVALVGMVVVMVVMAQSATAAVDEGIQLPAEGAASYYHTPPPPRTPQDVDDMRHSPYASAVGNLRYAMLCTRPNICYAVRIVSRFQSNTGYDHWTVVKNILKYLR
RTRNYMLVYGAKDLILTGYIDSDF