CuGenDBv2

Clc11G11120 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G11120
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DNA-directed DNA polymerase
Genome location: ClcChr11:16579322..16579690
RNA-Seq Expression: Clc11G11120
Synteny: Clc11G11120
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0071897 - DNA biosynthetic process (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
InterPro domains: NA
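As a quick consistency check on the genome location above, the following Python sketch parses the coordinate string and derives the feature length. It assumes the coordinates are 1-based and inclusive, which the page does not state; the helper name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'chrom:start..end' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("ClcChr11:16579322..16579690")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length_bp = end - start + 1
codons = length_bp // 3
print(chrom, length_bp, codons)
```

Under that assumption the span is 369 bp, or 123 codons, which is consistent with a 122-residue protein plus a stop codon.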


Homology
GenBank top hits (columns: e value, % identity, alignment)
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002] (e value: 1.5e-53; % identity: 88.52)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN
        MDFEKA EKR+LELNEMEEFRAQAY+NAKLYK+RTARWHDKKIT  TFLP QR+LLFNSRLRLFPGKLRTRWSGPF+IVKVSPHGAVELQGNNGTTFKVN
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN

Query:  GQRLKHYIRDEERRLENLTFIA
        G RLKHYI DEER LENL F A
Subjt:  GQRLKHYIRDEERRLENLTFIA

XP_030498073.1 uncharacterized protein LOC115713732 [Cannabis sativa] (e value: 1.2e-34; % identity: 65.74)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN
        M+ + A EKR+L+LNE+EEFR +AY+NAK+YKERT +WHD+ +  + F PGQ+VLLFNSRL+LFPGKL++RWSGPF +VKV P+GAVEL+G+   TFKVN
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN

Query:  GQRLKHYI
        GQRLK Y+
Subjt:  GQRLKHYI

XP_030504949.1 uncharacterized protein LOC115719915 [Cannabis sativa] (e value: 1.2e-34; % identity: 65.74)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN
        M+ + A EKR+L+LNE++EFR +AY+NAK+YKERT +WHD+ +  + F PGQ+VLLFNSRL+LFPGKL++RWSGPF +VKV P+GAVEL+G + TTFKVN
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVN

Query:  GQRLKHYI
        GQRLK Y+
Subjt:  GQRLKHYI

XP_038885822.1 uncharacterized protein LOC120076116 [Benincasa hispida] (e value: 1.5e-40; % identity: 74.36)
Query:  EKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNGQR
        +K  EKR+LEL E+EEF  QAY+NAKLYKER ARWHDKKI   TF  GQ VLLFNSRLRLFP KLRTRW GPFV+VK SPHGAVE+QG +G  FKVNGQR
Subjt:  EKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNGQR

Query:  LKHYIRDEERRLENLTF
        L+HY  DEER+LENL F
Subjt:  LKHYIRDEERRLENLTF

XP_038885946.1 uncharacterized protein LOC120076251 [Benincasa hispida] (e value: 2.9e-36; % identity: 69.37)
Query:  DFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNG
        D + A + R+L+LNEMEEF+ QAY+N+K+YKERT +WHD  I  R FLPGQRVLLFNSRLRLFPGKL++RW GPFVI  V+P+ AVEL G +GTTFKVN 
Subjt:  DFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNG

Query:  QRLKHYIRDEE
        QRLKHY  DEE
Subjt:  QRLKHYIRDEE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S4DFS8 uncharacterized protein LOC107829365 (e value: 8.6e-34; % identity: 68.52)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGT-TFKV
        MD + A EKR+L+LNE++EFR  AY+NAKLYK +T RWHDK I  R F PGQ VLLFNSRL+LFPGKL++RWSGPFV+V V PHGAVEL+  + T TF V
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGT-TFKV

Query:  NGQRLKHY
        NGQR+KHY
Subjt:  NGQRLKHY

A0A1U7YB01 uncharacterized protein LOC104243015 (e value: 1.9e-33; % identity: 64.81)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKV
        MD E ACEKR+++LNE++EF+  +Y+NAKLYKE T RWHDK I  R F PGQ VLLFNSRL+LFPGKL++RWSGPF +V+V+P+GA+EL+  NG   F V
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKV

Query:  NGQRLKHY
        NG R+KHY
Subjt:  NGQRLKHY

A0A1U8BL36 uncharacterized protein LOC104612034 (e value: 1.9e-33; % identity: 66.04)
Query:  ACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNGQRLK
        A +KR L+LNEMEE+R QAY+NA+ YKE+T  WHD +I  + F  G +V LFNSRL LFPGKL+TRWSGPFV+ KV PHGA+EL+   G TFKVNGQRLK
Subjt:  ACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNGQRLK

Query:  HYIRDE
        HYI  E
Subjt:  HYIRDE

A0A2G9H400 Reverse transcriptase (e value: 5.0e-34; % identity: 68.22)
Query:  DFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVN
        D + A EKR+L+LNE++EFR QAY+NAK+YKE+T RWHDKKI  R F PGQ VLLFNSRL+LFPGKL++RWSGPF I +V PHGAVEL+  N    FKVN
Subjt:  DFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVN

Query:  GQRLKHY
         QR+KHY
Subjt:  GQRLKHY

A0A6P4D1N6 uncharacterized protein LOC107484509 (e value: 1.1e-33; % identity: 63.39)
Query:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKV
        +D + A EKR+L+LNE+EEFR +AY+NA++YKER  RWHDK+I+ RTF PGQRVLLFNSRL++FPGKLR+RW+GP+ I+KVSPHG VEL    +  TF  
Subjt:  MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKV

Query:  NGQRLKHYIRDE
        NG R+KHY   E
Subjt:  NGQRLKHYIRDE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
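The % identity column can be related to the alignment blocks with a short Python sketch. It assumes the alignments follow BLAST-style pairwise conventions, where a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `alignment_stats` is hypothetical and not part of any CuGenDB tool.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes BLAST-style midline conventions: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Second block of the first GenBank hit (WP_217833156.1), copied from the record.
query = "GQRLKHYIRDEERRLENLTFIA"
midline = "G RLKHYI DEER LENL F A"
identities, positives, columns = alignment_stats(query, midline)
percent = round(100 * identities / columns, 2)
print(f"{identities}/{columns} identical ({percent}%)")
```

The reported % identity is computed over the whole alignment, so a figure for a single block will generally differ from it; summing identities and columns across all blocks of a hit gives the comparable value.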

Sequences
CDS sequence
ATGGATTTTGAGAAAGCCTGTGAGAAACGCGTTTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATAAGAATGCCAAACTTTACAAAGAGCGCACTGCGAG
ATGGCATGACAAGAAGATCACATCACGAACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTAAGCTTAGGACACGATGGTCGG
GACCCTTTGTTATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACTTTTAAAGTGAATGGTCAACGATTAAAGCACTACATACGGGAT
GAAGAACGCAGACTTGAGAACCTGACTTTTATTGCATGA
mRNA sequence
ATGGATTTTGAGAAAGCCTGTGAGAAACGCGTTTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATAAGAATGCCAAACTTTACAAAGAGCGCACTGCGAG
ATGGCATGACAAGAAGATCACATCACGAACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTAAGCTTAGGACACGATGGTCGG
GACCCTTTGTTATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACTTTTAAAGTGAATGGTCAACGATTAAAGCACTACATACGGGAT
GAAGAACGCAGACTTGAGAACCTGACTTTTATTGCATGA
Protein sequence
MDFEKACEKRVLELNEMEEFRAQAYKNAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNGQRLKHYIRD
EERRLENLTFIA
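
The relationship between the CDS and the protein sequence can be checked with a minimal Python sketch. It assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons in TCAG order:
# TTT, TTC, TTA, TTG, TCT, ... with '*' marking stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, keeping '*' for stops."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons of the CDS.
head = translate("ATGGATTTTGAGAAAGCC")
# Last line of the CDS; it starts in frame because 330 nt (110 codons) precede it.
tail = translate("GAAGAACGCAGACTTGAGAACCTGACTTTTATTGCATGA")
print(head, tail)
```

The two fragments translate to `MDFEKA` and `EERRLENLTFIA*`, matching the start and end of the protein sequence, with the final `TGA` read as the stop codon.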