CuGenDBv2

Clc04G01160 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G01160
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag/pol protein
Genome location: ClcChr04:3240952..3241344
RNA-Seq Expression: Clc04G01160
Synteny: Clc04G01160
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.7e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

KAA0040406.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-52; identity: 82.95%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY TKDLIL  YTDS+FQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IR+IV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.7e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.7e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.6e-54; identity: 84.5%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLNEGA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TE91 Gag/pol protein (e value: 5.1e-53; identity: 82.95%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY TKDLIL  YTDS+FQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IR+IV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

A0A5A7TKM4 Gag/pol protein (e value: 1.8e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

A0A5A7TZD0 Gag/pol protein (e value: 1.8e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

A0A5A7UYE8 Gag/pol protein (e value: 1.8e-53; identity: 83.72%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

A0A5A7V1F5 Gag/pol protein (e value: 2.7e-54; identity: 84.5%)
Query:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS
        RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLNEGA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS
Subjt:  RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS

Query:  DVVANSKEPRSHKRGKHIERKYHIIREIV
          VANSKEPRSHKRGKHIERKYH+IREIV
Subjt:  DVVANSKEPRSHKRGKHIERKYHIIREIV

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 9.1e-15; identity: 38.46%)
Query:  LIRYTDSDFQTDIDSRKFTSGLVFTLNE-GAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEPRSH
        +I Y DSD+      RK T+G +F + +   I W + +Q  +  S+ EAEY+A   A  EA+WLK LL ++ +   +  PI +Y DN   ++ +  P  H
Subjt:  LIRYTDSDFQTDIDSRKFTSGLVFTLNE-GAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEPRSH

Query:  KRGKHIERKYHIIREIV
        KR KHI+ KYH  RE V
Subjt:  KRGKHIERKYHIIREIV

P0CV72 Secreted RxLR effector protein 161 (e value: 2.6e-09; identity: 47.62%)
Query:  LIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWL
        L+ Y+D+D+  D++SR+ TSG +F LN G + WRS KQ  +  S+ E EY+A   A  EAVWL
Subjt:  LIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.5e-20; identity: 42.4%)
Query:  LVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVA
        L +   D IL  YTD+D   DID+RK ++G +FT + GAI W+S  Q C+  ST EAEY+AA     E +WLK+ L  L +    ++   +YCD+   + 
Subjt:  LVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVA

Query:  NSKEPRSHKRGKHIERKYHIIREIV
         SK    H R KHI+ +YH IRE+V
Subjt:  NSKEPRSHKRGKHIERKYHIIREIV

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.5e-12; identity: 34.23%)
Query:  YTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEPRSHKRGK
        ++D+ FQ+  D+R+ T+G    L    I W+S KQ  +  S+ EAEY A   A +E +WL +    L++   +  P  L+CDN+  +  +     H+R K
Subjt:  YTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEPRSHKRGK

Query:  HIERKYHIIRE
        HIE   H +RE
Subjt:  HIERKYHIIRE
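
The % identity figures in the hit lists appear consistent with counting identical residues on each alignment's midline and dividing by the aligned length. The Python sketch below illustrates that reading for the KAA0025945.1 hit. It assumes that letters on the midline mark identities while `+` and spaces do not; the function name `percent_identity` is hypothetical and not part of CuGenDB or BLAST.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity of a BLAST-style alignment given as wrapped rows.

    Assumption: letters on the midline mark identical residues, while '+'
    (conservative substitution) and spaces (mismatch or gap) do not count.
    The aligned length is taken as the total query-row length, gaps included.
    """
    aligned_length = sum(len(row) for row in query_rows)
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identities / aligned_length, 2)


# Query and midline rows of the KAA0025945.1 hit, copied from the record.
query = [
    "RDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNS",
    "DVVANSKEPRSHKRGKHIERKYHIIREIV",
]
midline = [
    "RDYMLVY  KDLIL  YTDSDFQTD DSRK TSG VFTLN GA+VWRSIKQGCI DST+EAEYVAAC AA EAVWL+K L +LEVVPNM+LPITLYCDNS",
    "  VANSKEPRSHKRGKHIERKYH+IREIV",
]

print(percent_identity(query, midline))  # 108 identities over 129 positions
```

Because only letters are counted, the result does not depend on the exact number of spaces preserved in the midline rows.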


Sequences
CDS sequence
ATGAGGGATTATATGCTCGTGTATGACACTAAGGATCTAATCCTTATAAGATACACTGACTCTGATTTTCAAACTGATATAGATTCGAGGAAATTTACATCAGGATTAGT
GTTCACTCTAAATGAAGGAGCAATAGTTTGGAGGAGTATCAAACAAGGTTGTATAGTTGACTCCACCTTGGAAGCTGAGTACGTAGCTGCATGTTTAGCAGCAAATGAAG
CAGTATGGCTCAAGAAGCTCTTAGCAAATCTGGAAGTTGTTCCAAATATGCATTTGCCTATCACTCTCTATTGTGATAATAGTGATGTAGTTGCAAATTCCAAAGAACCT
AGAAGCCATAAGCGCGGCAAACACATTGAACGCAAATATCATATCATTAGAGAAATTGTGTAG
mRNA sequence
ATGAGGGATTATATGCTCGTGTATGACACTAAGGATCTAATCCTTATAAGATACACTGACTCTGATTTTCAAACTGATATAGATTCGAGGAAATTTACATCAGGATTAGT
GTTCACTCTAAATGAAGGAGCAATAGTTTGGAGGAGTATCAAACAAGGTTGTATAGTTGACTCCACCTTGGAAGCTGAGTACGTAGCTGCATGTTTAGCAGCAAATGAAG
CAGTATGGCTCAAGAAGCTCTTAGCAAATCTGGAAGTTGTTCCAAATATGCATTTGCCTATCACTCTCTATTGTGATAATAGTGATGTAGTTGCAAATTCCAAAGAACCT
AGAAGCCATAAGCGCGGCAAACACATTGAACGCAAATATCATATCATTAGAGAAATTGTGTAG
Protein sequence
MRDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEP
RSHKRGKHIERKYHIIREIV
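
The sequences above can be cross-checked against one another. The Python sketch below assumes the standard genetic code and inclusive 1-based genome coordinates. The helper names `translate` and `span_length` are hypothetical, not part of any CuGenDB tooling. It checks that the CDS translates to the listed protein and that the genome location span has the same length as the CDS.

```python
import re

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(location):
    """Length of a 'chrom:start..end' span, assuming inclusive coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return int(match.group(3)) - int(match.group(2)) + 1


# CDS and protein copied from the record, with line wraps removed.
CDS = (
    "ATGAGGGATTATATGCTCGTGTATGACACTAAGGATCTAATCCTTATAAGATACACTGACTCTGATTTTCAAACTGATATAGATTCGAGGAAATTTACATCAGGATTAGT"
    "GTTCACTCTAAATGAAGGAGCAATAGTTTGGAGGAGTATCAAACAAGGTTGTATAGTTGACTCCACCTTGGAAGCTGAGTACGTAGCTGCATGTTTAGCAGCAAATGAAG"
    "CAGTATGGCTCAAGAAGCTCTTAGCAAATCTGGAAGTTGTTCCAAATATGCATTTGCCTATCACTCTCTATTGTGATAATAGTGATGTAGTTGCAAATTCCAAAGAACCT"
    "AGAAGCCATAAGCGCGGCAAACACATTGAACGCAAATATCATATCATTAGAGAAATTGTGTAG"
)
PROTEIN = (
    "MRDYMLVYDTKDLILIRYTDSDFQTDIDSRKFTSGLVFTLNEGAIVWRSIKQGCIVDSTLEAEYVAACLAANEAVWLKKLLANLEVVPNMHLPITLYCDNSDVVANSKEP"
    "RSHKRGKHIERKYHIIREIV"
)

print(translate(CDS) == PROTEIN + "*")
print(len(CDS), span_length("ClcChr04:3240952..3241344"))
```

If the two printed lengths agree, that suggests a single-exon gene model without annotated UTRs, which would also explain why the mRNA and CDS sequences are identical. That is an inference from the lengths, not something the record states.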