; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G14610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G14610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr09:19277834..19280012
RNA-Seq ExpressionClc09G14610
SyntenyClc09G14610
Gene Ontology termsGO:0009058 - biosynthetic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052767.1 hypothetical protein E6C27_scaffold43055G00210 [Cucumis melo var. makuwa]2.9e-2454.55Show/hide
Query:  ISLLDGRALWVVRLVPSLRPILASPKEVIRELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERV
        +S  DG    +VRLVPSL  +    ++  R  S +    EKIRKRL  WKK FF K GRLTLIRS+ +GI +YY SL R   SV +S+EK  RDFLWE V
Subjt:  ISLLDGRALWVVRLVPSLRPILASPKEVIRELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERV

Query:  EEGKGCHLVSWDDVGKPMNQG
        +EG+G HLVSW+ VG+PM QG
Subjt:  EEGKGCHLVSWDDVGKPMNQG

TYK19487.1 NADH dehydrogenase (ubiquinone) complex I, assembly factor 6 isoform X1 [Cucumis melo var. makuwa]8.1e-1957.61Show/hide
Query:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        R +S +    EKI KRLA WKK  F K GRLTLIRS+ +GIP+YYFSLF+   SV + I+KLMRDFLW+ ++ G+G  LVSW  V  P+NQG
Subjt:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

VVA31869.1 Hypothetical predicted protein, partial [Prunus dulcis]6.4e-1648.91Show/hide
Query:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        R L+ +    +K+ KRL  WK++   K GRLTLI+++ + IP YY SLF+  I V   +E+LMR+FLWE VEEGK CHLV W  V K   +G
Subjt:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

XP_022151711.1 uncharacterized protein LOC111019624 [Momordica charantia]2.1e-1962.03Show/hide
Query:  RKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        +K L+SWKK+FF K GRLTLI+S+  GIP YY SLFR  + V E +EK+MRDFLWE VEEG G HLV+W +V KP+  G
Subjt:  RKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

XP_028056784.1 uncharacterized protein LOC114260796 [Camellia sinensis]4.9e-1653.09Show/hide
Query:  KIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        KIRKRL  WK++   K GR TL++S+   +PIY+ SLF+  +S+   IEKLMRDFLWE  EEGKG HLV W+ V     +G
Subjt:  KIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

TrEMBL top hitse value%identityAlignment
A0A5D3C1E6 Uncharacterized protein1.4e-2454.55Show/hide
Query:  ISLLDGRALWVVRLVPSLRPILASPKEVIRELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERV
        +S  DG    +VRLVPSL  +    ++  R  S +    EKIRKRL  WKK FF K GRLTLIRS+ +GI +YY SL R   SV +S+EK  RDFLWE V
Subjt:  ISLLDGRALWVVRLVPSLRPILASPKEVIRELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERV

Query:  EEGKGCHLVSWDDVGKPMNQG
        +EG+G HLVSW+ VG+PM QG
Subjt:  EEGKGCHLVSWDDVGKPMNQG

A0A5D3D7E5 NADH dehydrogenase (Ubiquinone) complex I, assembly factor 6 isoform X13.9e-1957.61Show/hide
Query:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        R +S +    EKI KRLA WKK  F K GRLTLIRS+ +GIP+YYFSLF+   SV + I+KLMRDFLW+ ++ G+G  LVSW  V  P+NQG
Subjt:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

A0A6J1DFI2 uncharacterized protein LOC1110196241.0e-1962.03Show/hide
Query:  RKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        +K L+SWKK+FF K GRLTLI+S+  GIP YY SLFR  + V E +EK+MRDFLWE VEEG G HLV+W +V KP+  G
Subjt:  RKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

M5X4S0 Reverse transcriptase domain-containing protein (Fragment)1.8e-1648.91Show/hide
Query:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        R L+ +    EK+ KRL  WK++   K GRLTLI+++ + IP YY SLF+  I V   +E+LMR+FLWE +EEGK CHLV W+ V K   +G
Subjt:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

M5XV38 zf-RVT domain-containing protein1.8e-1648.91Show/hide
Query:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        R L+ +    EK+ KRL  WK++   K GRLTLI+++ + IP YY SLF+  I V   +E+LMR+FLWE +EEGK CHLV W+ V K   +G
Subjt:  RELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.1e-0831.03Show/hide
Query:  FGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        FG   E++  R++ W++      GRLTL +++ + +P++  S      S+   +++L R FLW    E K  HLV W  V  P  +G
Subjt:  FGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.6e-0731.46Show/hide
Query:  SCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG
        S +G   EKIR R+  W        GRL LI S+ + +  ++ S FR   +  + I+ +   FLW   E       V+W DV  P ++G
Subjt:  SCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHLVSWDDVGKPMNQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGGGGAGGGTTAGAGGTTGGGACTTTCTTTTGTAAAGGGGTAGATGAAAGGAAAGGGTCGCATTTGGTTAGTTGGGAAGCCATTGGGACACTTGTGAATTGGGG
AGGTTTAGAGGTTGGGAATTTCTTTGGGAAGGGGTGCATGAAGGGAAAAGGGTCGCATTTGGTTTGTTGGGAAGCCATTGGGAAACTTCTCAATCAAGGAGGTTCAGAGG
TTGGGATTTTCCTTGGGAAGGGGTGGAGAAAGGGAAAAGGGTCGCATTTGGTTAGTTGGGAAGCCATTGGGAAACTTGTGAATTGGGGAGGTTTAGAGCTTGGAAACTTA
AAGAGGATAAGCTTGCTAGATGGGCGAGCTTTGTGGGTTGTGAGGTTGGTTCCTTCCCTTCGTCCTATCTTGGCCTCCCCTAAGGAGGTAATTCGAGAGCTATCTTGTTT
TGGGACACCCCCCGAGAAGATTAGGAAGAGATTGGCTTCATGGAAGAAGAGTTTCTTTCCCAAAGTTGGGAGGTTAACACTTATTAGATCAATGTCAAATGGAATCCCTA
TCTATTACTTCTCCCTTTTCAGAGCAGCAATCTCAGTTTGGGAGAGCATTGAAAAGCTAATGAGAGACTTTCTCTGGGAAAGGGTGGAGGAAGGGAAAGGTTGTCATTTG
GTTAGCTGGGATGATGTGGGGAAACCTATGAATCAGGGGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTGGGGAGGGTTAGAGGTTGGGACTTTCTTTTGTAAAGGGGTAGATGAAAGGAAAGGGTCGCATTTGGTTAGTTGGGAAGCCATTGGGACACTTGTGAATTGGGG
AGGTTTAGAGGTTGGGAATTTCTTTGGGAAGGGGTGCATGAAGGGAAAAGGGTCGCATTTGGTTTGTTGGGAAGCCATTGGGAAACTTCTCAATCAAGGAGGTTCAGAGG
TTGGGATTTTCCTTGGGAAGGGGTGGAGAAAGGGAAAAGGGTCGCATTTGGTTAGTTGGGAAGCCATTGGGAAACTTGTGAATTGGGGAGGTTTAGAGCTTGGAAACTTA
AAGAGGATAAGCTTGCTAGATGGGCGAGCTTTGTGGGTTGTGAGGTTGGTTCCTTCCCTTCGTCCTATCTTGGCCTCCCCTAAGGAGGTAATTCGAGAGCTATCTTGTTT
TGGGACACCCCCCGAGAAGATTAGGAAGAGATTGGCTTCATGGAAGAAGAGTTTCTTTCCCAAAGTTGGGAGGTTAACACTTATTAGATCAATGTCAAATGGAATCCCTA
TCTATTACTTCTCCCTTTTCAGAGCAGCAATCTCAGTTTGGGAGAGCATTGAAAAGCTAATGAGAGACTTTCTCTGGGAAAGGGTGGAGGAAGGGAAAGGTTGTCATTTG
GTTAGCTGGGATGATGTGGGGAAACCTATGAATCAGGGGGTTTAG
Protein sequenceShow/hide protein sequence
MNWGGLEVGTFFCKGVDERKGSHLVSWEAIGTLVNWGGLEVGNFFGKGCMKGKGSHLVCWEAIGKLLNQGGSEVGIFLGKGWRKGKGSHLVSWEAIGKLVNWGGLELGNL
KRISLLDGRALWVVRLVPSLRPILASPKEVIRELSCFGTPPEKIRKRLASWKKSFFPKVGRLTLIRSMSNGIPIYYFSLFRAAISVWESIEKLMRDFLWERVEEGKGCHL
VSWDDVGKPMNQGV