; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G004780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G004780
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCG_Chr08:15057016..15059021
RNA-Seq ExpressionClCG08G004780
SyntenyClCG08G004780
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]3.9e-0969.57Show/hide
Query:  RDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFR
        RDC LI  GESG+  DSI L F GQDR+ SWEHN TRWN LLP FR
Subjt:  RDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFR

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]2.6e-1369.23Show/hide
Query:  DCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSL
        DC LIY  ESG+ ADS+ L FWGQDRV SWEHN+TRWNSLLP FRE+ + S+
Subjt:  DCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSL

KAA0049822.1 reverse transcriptase [Cucumis melo var. makuwa]1.7e-0443.04Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSLKWCLQTINRLFIRGALVL
        +G   ++ RDC LI  GESG+ ADSI L FWGQDRV  W+H         P                   LFIRGALVL
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSLKWCLQTINRLFIRGALVL

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-1160Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFRE
        +G   ++ RDC L   GESG+  DSI L FWGQDRV SWEHN+TRWNS +P FR+
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFRE

TYK08698.1 reverse transcriptase [Cucumis melo var. makuwa]2.9e-0458.54Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEH
        +G   ++ RDC LI  GESG+ ADSI L FWGQDRV  W+H
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEH

TrEMBL top hitse value%identityAlignment
A0A5A7SPG1 Pol protein1.9e-0969.57Show/hide
Query:  RDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFR
        RDC LI  GESG+  DSI L F GQDR+ SWEHN TRWN LLP FR
Subjt:  RDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFR

A0A5A7U202 Reverse transcriptase8.3e-0543.04Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSLKWCLQTINRLFIRGALVL
        +G   ++ RDC LI  GESG+ ADSI L FWGQDRV  W+H         P                   LFIRGALVL
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSLKWCLQTINRLFIRGALVL

A0A5A7U2P4 Integrase catalytic domain-containing protein1.3e-1369.23Show/hide
Query:  DCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSL
        DC LIY  ESG+ ADS+ L FWGQDRV SWEHN+TRWNSLLP FRE+ + S+
Subjt:  DCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSL

A0A5D3C3J6 Gag/pol protein1.6e-1160Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFRE
        +G   ++ RDC L   GESG+  DSI L FWGQDRV SWEHN+TRWNS +P FR+
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFRE

A0A5D3CBX3 Reverse transcriptase1.4e-0458.54Show/hide
Query:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEH
        +G   ++ RDC LI  GESG+ ADSI L FWGQDRV  W+H
Subjt:  VGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGACTTTATCAAATCCAAGATGTCGAGCTATTGGGCTTGGTATCAGATTCCTCCTTCCATCTCGAGTCGAAAAGATTATAGAGGAGCCTTCTTTTGGCGA
GTTAGCCCCAAACAAATGGTGGTGACCAACTCCTGGACCATGATGTGGGCACACAATGGGGATGAGGTTTACCTATCCATGGAGCAATTCCTTATTATACATGAG
CTAAAGCAAGTCCCAAAGCTTGAGGACTGGCTTTACTTGTCAATTCGACCTAGGAGGCCTTTGTTGATTTTGAAGCCTTCCTCCAACAAGTCTTTAGAAGGACAA
ACTGTTCTACACTTGGTACCTTATCCTGGTGACACTATGGATACGACCCACTTTGTATTTGATACAAACGCAATGATCCAACATGTTTTTGTAGGGGACACGCGA
GTGGGGTCTCAACCCTGGGTGGGTTATGAACTCCTATTCACGAGGGATTGTTCTTTGATTTATATGGGTGAGAGTGGGCGGTTCGCTGACTCAATAAGCCTACCA
TTTTGGGGACAAGACCGAGTGAGAAGCTGGGAGCATAATAACACAAGATGGAATTCACTCCTTCCTTGCTTTAGGGAAGTAGATGAGTGTTCCCTTAAGTGGTGC
CTCCAGACTATAAACAGGTTGTTCATTAGAGGAGCATTGGTACTTACGGATGTAAAGGTAACTCGAGGGACTAACTTGTTATTGTTGATCTATGTCCGAGGACAC
AAAAATATATCTGCAGTGAGAAGAGTGCAACTGTGGATCCATTATGTCCCACTGGTAGCTCGTAAAGGGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGACTTTATCAAATCCAAGATGTCGAGCTATTGGGCTTGGTATCAGATTCCTCCTTCCATCTCGAGTCGAAAAGATTATAGAGGAGCCTTCTTTTGGCGA
GTTAGCCCCAAACAAATGGTGGTGACCAACTCCTGGACCATGATGTGGGCACACAATGGGGATGAGGTTTACCTATCCATGGAGCAATTCCTTATTATACATGAG
CTAAAGCAAGTCCCAAAGCTTGAGGACTGGCTTTACTTGTCAATTCGACCTAGGAGGCCTTTGTTGATTTTGAAGCCTTCCTCCAACAAGTCTTTAGAAGGACAA
ACTGTTCTACACTTGGTACCTTATCCTGGTGACACTATGGATACGACCCACTTTGTATTTGATACAAACGCAATGATCCAACATGTTTTTGTAGGGGACACGCGA
GTGGGGTCTCAACCCTGGGTGGGTTATGAACTCCTATTCACGAGGGATTGTTCTTTGATTTATATGGGTGAGAGTGGGCGGTTCGCTGACTCAATAAGCCTACCA
TTTTGGGGACAAGACCGAGTGAGAAGCTGGGAGCATAATAACACAAGATGGAATTCACTCCTTCCTTGCTTTAGGGAAGTAGATGAGTGTTCCCTTAAGTGGTGC
CTCCAGACTATAAACAGGTTGTTCATTAGAGGAGCATTGGTACTTACGGATGTAAAGGTAACTCGAGGGACTAACTTGTTATTGTTGATCTATGTCCGAGGACAC
AAAAATATATCTGCAGTGAGAAGAGTGCAACTGTGGATCCATTATGTCCCACTGGTAGCTCGTAAAGGGTATTGA
Protein sequenceShow/hide protein sequence
MPDFIKSKMSSYWAWYQIPPSISSRKDYRGAFFWRVSPKQMVVTNSWTMMWAHNGDEVYLSMEQFLIIHELKQVPKLEDWLYLSIRPRRPLLILKPSSNKSLEGQ
TVLHLVPYPGDTMDTTHFVFDTNAMIQHVFVGDTRVGSQPWVGYELLFTRDCSLIYMGESGRFADSISLPFWGQDRVRSWEHNNTRWNSLLPCFREVDECSLKWC
LQTINRLFIRGALVLTDVKVTRGTNLLLLIYVRGHKNISAVRRVQLWIHYVPLVARKGY