; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G000875 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G000875
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCG_Chr05:940633..940899
RNA-Seq ExpressionClCG05G000875
SyntenyClCG05G000875
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]8.3e-3378.89Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++TVPAKLWCDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKTGEQ GDILT+A+NG RISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]8.3e-3378.89Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++TVPAKLWCDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKTGEQ GDILT+A+NG RISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]8.3e-3378.89Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++TVPAKLWCDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKTGEQ GDILT+A+NG RISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]8.3e-3378.89Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++TVPAKLWCDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKTGEQ GDILT+A+NG RISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]8.3e-3378.89Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++TVPAKLWCDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKTGEQ GDILT+A+NG RISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

TrEMBL top hitse value%identityAlignment
A0A5A7SZ99 Putative mitochondrial protein2.9e-3176.14Show/hide
Query:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        F +TVP+KLWCDNQAAL   SN VF E T+HIE++C FI EKIQ+ LV+ GYVKTGE+FGDILT+A+NGARISYLCNKLDMIDIFAPA
Subjt:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

A0A5A7T406 Copia protein4.4e-3278.41Show/hide
Query:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        F++TVPAKLWCDNQ AL  ASNPVF E T+++EV+C+FI EKIQ+ LV TGYVKTGE+ GDILT+AVNGARISYLCNKLDMIDIFAPA
Subjt:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

A0A5D3CVN0 Putative mitochondrial protein4.9e-3173.33Show/hide
Query:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        +GF++T+PAKLWCDNQ AL  ASN VF E T++I+V+C+FIHEKIQ+++VFTGYVKTGEQ GDI T  V+GARISYLCNKL MIDIFAPA
Subjt:  MGFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

A0A5D3E4V8 Putative mitochondrial protein2.9e-3179.78Show/hide
Query:  GFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        GFN+TVP KL CDNQAAL  ASNPVF E T+HIEV+C+FI EKIQ+ LV TGYVKT EQ  DILT+AVNGARISYLCNKLDMIDIFAPA
Subjt:  GFNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

A0A5D3E5M8 Copia protein4.4e-3278.41Show/hide
Query:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA
        F++TVPAKLWCDNQ AL  ASNPVF E T+++EV+C+FI EKIQ+ LV TGYVKTGE+ GDILT+AVNGARISYLCNKLDMIDIFAPA
Subjt:  FNVTVPAKLWCDNQAAL--ASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-0631.17Show/hide
Query:  PAKLWCDNQA--ALASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMI
        P K++ DNQ   ++A+NP   +  +HI++  +F  E++Q  ++   Y+ T  Q  DI T+ +  AR   L +KL ++
Subjt:  PAKLWCDNQA--ALASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.6e-0448.65Show/hide
Query:  PAKLWCDNQAA--LASNPVFQEGTEHIEVNCNFIHEK
        P  L+CDN AA  +A+N VF E T+HIE +C+ + E+
Subjt:  PAKLWCDNQAA--LASNPVFQEGTEHIEVNCNFIHEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTCAATGTTACTGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTGCATCTAACCCAGTGTTTCAGGAAGGGACTGAACATATTGAAGTGAATTGTAA
TTTCATTCACGAGAAAATACAAGAAAGGTTGGTGTTCACAGGATATGTGAAGACTGGAGAACAATTTGGAGATATTCTTACCAGAGCTGTAAATGGAGCAAGAATAAGCT
ATCTATGCAACAAGCTGGACATGATTGACATATTTGCTCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATTCAATGTTACTGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTGCATCTAACCCAGTGTTTCAGGAAGGGACTGAACATATTGAAGTGAATTGTAA
TTTCATTCACGAGAAAATACAAGAAAGGTTGGTGTTCACAGGATATGTGAAGACTGGAGAACAATTTGGAGATATTCTTACCAGAGCTGTAAATGGAGCAAGAATAAGCT
ATCTATGCAACAAGCTGGACATGATTGACATATTTGCTCCAGCTTGA
Protein sequenceShow/hide protein sequence
MGFNVTVPAKLWCDNQAALASNPVFQEGTEHIEVNCNFIHEKIQERLVFTGYVKTGEQFGDILTRAVNGARISYLCNKLDMIDIFAPA