CuGenDBv2

ClCG09G016580 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G016580
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Reverse transcriptase domain-containing protein
Genome location: CG_Chr09:32476909..32481424
RNA-Seq Expression: ClCG09G016580
Synteny: ClCG09G016580
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
BBH07150.1 TatD related DNase [Prunus dulcis] | e value: 4.3e-09 | % identity: 32.21
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D     L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis] | e value: 4.3e-09 | % identity: 32.21
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D     L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia] | e value: 1.0e-07 | % identity: 43.94
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSD
        FSIS+  +    F  W+TGVY P   K R+   QE++DL GLC   W +G  FN+ RWS E  S++
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSD

XP_028093340.1 uncharacterized protein LOC114293463 [Camellia sinensis] | e value: 8.0e-08 | % identity: 50.82
Query:  SISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEK
        SI+ RNR N V+  W+TGVYGP    +R++   E+  L+GLC+  WC+GG FNVVR + EK
Subjt:  SISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEK

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus] | e value: 1.4e-12 | % identity: 47.19
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIAD
        FSISI  + N  F  W+TGVYGP+  + R++   E+  LYGLCN  WC+GG FNVVRW  EK S      SMI       L EE ++ D
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIAD

TrEMBL top hits (e value, % identity, alignment)
A0A4Y1RS61 TatD related DNase | e value: 2.1e-09 | % identity: 32.21
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D     L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

M5VS59 Reverse transcriptase domain-containing protein (Fragment) | e value: 7.1e-10 | % identity: 32.21
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D     L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

M5WJ76 Reverse transcriptase domain-containing protein (Fragment) | e value: 2.2e-11 | % identity: 32.89
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D I   L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

M5WPQ5 Reverse transcriptase domain-containing protein | e value: 4.9e-11 | % identity: 33.56
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D I   L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E++S
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

M5XHS0 Reverse transcriptase domain-containing protein (Fragment) | e value: 9.2e-10 | % identity: 32.21
Query:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ
        FS+SIR   N   D W++G+YGP   + R    +E+ DLYG C   WC+GG FNVVR+S EK +      SM    D  D  +E  + D     L   S 
Subjt:  FSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQ

Query:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES
           +L     C+      ++GS         + A P++ S+H   E+++
Subjt:  ASGSLAPLGFCQSF--ININGSNVTFLQDTSNLAHPQLCSEHSDAEVES

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
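
The % identity values listed with the hits can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides the number of identical positions by the aligned length. The function name `percent_identity` is hypothetical and not part of CuGenDB. The example uses the Query and midline lines of the XP_028093340.1 (Camellia sinensis) hit.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Assumes letters in the midline mark identical residues, while '+'
    (conservative substitution) and spaces (mismatch/gap) do not count.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query and midline copied from the XP_028093340.1 alignment on this page.
query = "SISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEK"
midline = "SI+ RNR N V+  W+TGVYGP    +R++   E+  L+GLC+  WC+GG FNVVR + EK"

print(percent_identity(midline, len(query)))
```

The aligned length is taken from the Query line, so the result does not depend on trailing whitespace in the midline. For alignments that span several blocks, the midlines and Query lines would need to be concatenated first.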

Sequences
CDS sequence
ATGTTCTCAATCTCAATTCGTAATCGCACAAATCATGTTTTTGACAGTTGGGTGACAGGTGTCTACGGGCCAGCTTGTACAAAAAACAGGGAAAAGTTGTGCCAAGAAAT
CTATGATCTGTATGGGTTGTGCAATGGTTTTTGGTGCATTGGTGGAGTCTTCAATGTTGTTAGATGGTCGTTTGAAAAGAGGTCTTCAGACATAGATGTGATTTCCATGA
TTGAATGTGACGATGGTAGAGACCTCTTTGAAGAGAGGGAGATTGCTGATAAGATTGTGGCTCTCCTCCCTCAGGAATCTCAAGCGTCGGGTTCCTTAGCGCCTCTTGGC
TTCTGTCAATCCTTCATCAATATCAATGGCTCAAATGTAACCTTCTTGCAAGATACATCAAATCTTGCTCATCCTCAACTATGCTCAGAGCACTCGGATGCGGAAGTCGA
GTCAATAGTTAGTGCAAGTAGCGATGACATCGAGCTTTGTGACACCATGGAGTATCAAGAAAAGGAGGCAACTTTTGATTCATTTGGAAAAGACATCACGAAATTGTTCC
AAACAGAATCACCCTTGATTAAAAACAAGGCTAAAGCCTCAGTATTCTCTCCCATTTGCAAGAAAATAGTCCCTCAAAACCTGCTATCAATCATGGAAGCTTGTGATATT
ACCCTCAGTTAG
mRNA sequence
ATGTTCTCAATCTCAATTCGTAATCGCACAAATCATGTTTTTGACAGTTGGGTGACAGGTGTCTACGGGCCAGCTTGTACAAAAAACAGGGAAAAGTTGTGCCAAGAAAT
CTATGATCTGTATGGGTTGTGCAATGGTTTTTGGTGCATTGGTGGAGTCTTCAATGTTGTTAGATGGTCGTTTGAAAAGAGGTCTTCAGACATAGATGTGATTTCCATGA
TTGAATGTGACGATGGTAGAGACCTCTTTGAAGAGAGGGAGATTGCTGATAAGATTGTGGCTCTCCTCCCTCAGGAATCTCAAGCGTCGGGTTCCTTAGCGCCTCTTGGC
TTCTGTCAATCCTTCATCAATATCAATGGCTCAAATGTAACCTTCTTGCAAGATACATCAAATCTTGCTCATCCTCAACTATGCTCAGAGCACTCGGATGCGGAAGTCGA
GTCAATAGTTAGTGCAAGTAGCGATGACATCGAGCTTTGTGACACCATGGAGTATCAAGAAAAGGAGGCAACTTTTGATTCATTTGGAAAAGACATCACGAAATTGTTCC
AAACAGAATCACCCTTGATTAAAAACAAGGCTAAAGCCTCAGTATTCTCTCCCATTTGCAAGAAAATAGTCCCTCAAAACCTGCTATCAATCATGGAAGCTTGTGATATT
ACCCTCAGTTAG
Protein sequence
MFSISIRNRTNHVFDSWVTGVYGPACTKNREKLCQEIYDLYGLCNGFWCIGGVFNVVRWSFEKRSSDIDVISMIECDDGRDLFEEREIADKIVALLPQESQASGSLAPLG
FCQSFININGSNVTFLQDTSNLAHPQLCSEHSDAEVESIVSASSDDIELCDTMEYQEKEATFDSFGKDITKLFQTESPLIKNKAKASVFSPICKKIVPQNLLSIMEACDI
TLS
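
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The helper name `translate` is hypothetical and not a CuGenDB function. The two short inputs are the first 27 and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTCTCAATCTCAATTCGTAATCGC"))  # start of the CDS: MFSISIRNR
print(translate("ACCCTCAGTTAG"))                 # end of the CDS: TLS
```

Pasting the complete CDS (line breaks included) into `translate` and comparing the result with the protein sequence above gives a quick consistency check of the record.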