; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G11120 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G11120
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr02:19853252..19853620
RNA-Seq ExpressionClc02G11120
SyntenyClc02G11120
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

KAA0050437.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQCS-TPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQ+V+G+QIIRDRKNK+LALSQ +YIDKMLVRYSM NSK+GLL F HG+ LSKEQCS TPQEVE+MR I Y S +GSLMY MLCTRP+ICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQCS-TPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G DH
Subjt:  SNLGFDH

KAA0062886.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-3876.64Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQ+V+G+QIIRDRKNK LALSQ +YIDKMLVRY M NSK+GLL F HG+ LSKEQC  TPQEVE+MR I Y S +GSLMYAMLCTRPDICYV+GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G DH
Subjt:  SNLGFDH

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.5e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

A0A5A7TWB9 Gag/pol protein1.5e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

A0A5A7U7T0 Gag/pol protein8.8e-3975.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQCS-TPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQ+V+G+QIIRDRKNK+LALSQ +YIDKMLVRYSM NSK+GLL F HG+ LSKEQCS TPQEVE+MR I Y S +GSLMY MLCTRP+ICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQCS-TPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G DH
Subjt:  SNLGFDH

A0A5A7V8W0 Gag/pol protein8.8e-3976.64Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQ+V+G+QIIRDRKNK LALSQ +YIDKMLVRY M NSK+GLL F HG+ LSKEQC  TPQEVE+MR I Y S +GSLMYAMLCTRPDICYV+GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G DH
Subjt:  SNLGFDH

A0A5D3CPJ6 Gag/pol protein1.5e-3875.7Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQFV+G+QI RDRKNKMLALSQ SYIDK++V+YSM NSKRGLL F HG+ LSKEQC  TPQ+VEEMR I Y S +GSLMYAMLCTRPDICY +GIVSRY+
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
        SN G  H
Subjt:  SNLGFDH

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-0735Show/hide
Query:  IGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHN----SKRGLLSFGHGIILSKEQCSTPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYES
        IG++I  + +   + LSQ++Y+ K+L +++M N    S        + ++ S E C+TP            S IG LMY MLCTRPD+   + I+SRY S
Subjt:  IGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHN----SKRGLLSFGHGIILSKEQCSTPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-1942.99Show/hide
Query:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE
        AQ ++G++I+R+R ++ L LSQ  YI+++L R++M N+K         + LSK+ C +T +E   M  + Y S +GSLMYAM+CTRPDI + +G+VSR+ 
Subjt:  AQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQC-STPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGIVSRYE

Query:  SNLGFDH
         N G +H
Subjt:  SNLGFDH

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACAAATGGCTTACTGCACAATTCCAAATAAAAGATTTGAGTGGGCACAGTTTGTTATAGGGCTCCAGATCATAAGAGATCGTAAGAACAAAATGTTAGCCTTGTC
TCAAACATCGTATATTGACAAGATGCTTGTCAGGTATTCGATGCATAATTCTAAAAGAGGCTTATTATCTTTTGGACATGGAATTATATTGTCTAAGGAACAGTGTTCTA
CTCCTCAAGAGGTTGAGGAAATGAGATGGATTTCCTATGTATCGACTATTGGTAGCCTTATGTATGCCATGTTGTGTACAAGGCCTGACATTTGTTATGTAATTGGGATT
GTTAGTAGATATGAATCCAATCTAGGATTTGATCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACAAATGGCTTACTGCACAATTCCAAATAAAAGATTTGAGTGGGCACAGTTTGTTATAGGGCTCCAGATCATAAGAGATCGTAAGAACAAAATGTTAGCCTTGTC
TCAAACATCGTATATTGACAAGATGCTTGTCAGGTATTCGATGCATAATTCTAAAAGAGGCTTATTATCTTTTGGACATGGAATTATATTGTCTAAGGAACAGTGTTCTA
CTCCTCAAGAGGTTGAGGAAATGAGATGGATTTCCTATGTATCGACTATTGGTAGCCTTATGTATGCCATGTTGTGTACAAGGCCTGACATTTGTTATGTAATTGGGATT
GTTAGTAGATATGAATCCAATCTAGGATTTGATCACTAG
Protein sequenceShow/hide protein sequence
MSQMAYCTIPNKRFEWAQFVIGLQIIRDRKNKMLALSQTSYIDKMLVRYSMHNSKRGLLSFGHGIILSKEQCSTPQEVEEMRWISYVSTIGSLMYAMLCTRPDICYVIGI
VSRYESNLGFDH