; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G08575 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G08575
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr09:7212807..7218564
RNA-Seq ExpressionClc09G08575
SyntenyClc09G08575
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07150.1 TatD related DNase [Prunus dulcis]4.6e-0836.22Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWM---DQKTEISL
           WG S  +++NMW+   D K +I L
Subjt:  EQNWGLSYLKYKNMWM---DQKTEISL

RVX10571.1 putative ribonuclease H protein [Vitis vinifera]7.3e-0629.73Show/hide
Query:  LLYPSLQSQAPWFWKNGLCIMAIPNKPKKGGNDHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNLI-----------------------DRFL
        +L+P   S    FW     I+ +        +    +GGD NV R S EK  G + T +M  F+ FI DC LI                       DRFL
Subjt:  LLYPSLQSQAPWFWKNGLCIMAIPNKPKKGGNDHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNLI-----------------------DRFL

Query:  ISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQNWGLSYLKYKNMWM
         S+     F + I   LPR TS+  P+ L      WG +  +++NMW+
Subjt:  ISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQNWGLSYLKYKNMWM

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]4.6e-0836.22Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWM---DQKTEISL
           WG S  +++NMW+   D K +I L
Subjt:  EQNWGLSYLKYKNMWM---DQKTEISL

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium]8.7e-0734.51Show/hide
Query:  IGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQNW
        IGGD NV R   +KSNG   T++M  FN FI+D NL                       +DRFL +    D F     K L R TS+ CP+ L   +  W
Subjt:  IGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQNW

Query:  GLSYLKYKNMWMD
        G    +++NMW++
Subjt:  GLSYLKYKNMWMD

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.5e-1441.03Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDC--------------------NLIDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQN
        +H I+ GD NV+R S+EKSNG+  T +M LFN FI D                     +LID FL+++ CIDK    I KR+ R TS+  P+ L  G+ N
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDC--------------------NLIDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQN

Query:  WGLSYLKYKNMWMDQKT
        WGL+  +++NMW+  KT
Subjt:  WGLSYLKYKNMWMDQKT

TrEMBL top hitse value%identityAlignment
A0A4Y1RS61 TatD related DNase2.2e-0836.22Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWM---DQKTEISL
           WG S  +++NMW+   D K +I L
Subjt:  EQNWGLSYLKYKNMWM---DQKTEISL

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment)2.2e-0836.22Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWM---DQKTEISL
           WG S  +++NMW+   D K +I L
Subjt:  EQNWGLSYLKYKNMWM---DQKTEISL

A0A6J1E2G6 uncharacterized protein LOC1110254057.1e-1541.03Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDC--------------------NLIDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQN
        +H I+ GD NV+R S+EKSNG+  T +M LFN FI D                     +LID FL+++ CIDK    I KR+ R TS+  P+ L  G+ N
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDC--------------------NLIDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQN

Query:  WGLSYLKYKNMWMDQKT
        WGL+  +++NMW+  KT
Subjt:  WGLSYLKYKNMWMDQKT

M5VS59 Reverse transcriptase domain-containing protein (Fragment)2.9e-0835.9Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWMD
           WG S  +++NMW++
Subjt:  EQNWGLSYLKYKNMWMD

M5XHS0 Reverse transcriptase domain-containing protein (Fragment)2.2e-0836.22Show/hide
Query:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG
        D   +GGD NV R S EKSN  + T +M  FN FI + NL                       +DRFL+S    D F     K LPR TS+ CP+ L   
Subjt:  DHRIIGGDLNVSRLSFEKSNGKQRTANMNLFNRFIHDCNL-----------------------IDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLG

Query:  EQNWGLSYLKYKNMWM---DQKTEISL
           WG S  +++NMW+   D K +I L
Subjt:  EQNWGLSYLKYKNMWM---DQKTEISL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTTCCTTCTCAGGGGCACGAGACACTCAAGAAGTCGATGGATTTCGACAAAGATGGCTTGGCCCCCGATCATCAAGCAAAGGTAACTAAGAACTTACCAATTTC
TCACCCTCAAGAAGGTGACAAGAAGTTCAACACTTTAGATACAAAGGGAGAAGTCGATGGAACTGATTCTTTACTTTACCCGAGTTTGCAATCACAGGCACCTTGGTTTT
GGAAAAACGGTCTTTGTATCATGGCAATTCCTAACAAACCCAAGAAGGGAGGCAACGATCATAGGATTATTGGGGGAGATTTAAATGTCTCAAGATTGTCATTTGAAAAA
TCCAATGGCAAACAGAGAACTGCTAATATGAATCTTTTCAACAGATTCATTCATGATTGCAATCTCATTGATCGCTTTCTTATTTCTAGCCCTTGTATTGATAAATTCAA
GAGGGACATTGTCAAGCGTCTACCTCGACCCACCTCTAATCTTTGCCCGATGCATCTCACTCTTGGAGAGCAAAATTGGGGTCTTAGTTACCTTAAATACAAGAACATGT
GGATGGATCAGAAAACAGAAATTTCATTGTTAAGTATTGACAAAAGAGCTATACTAGCAAAGAGAGACATGAGAGGGGAGGATAGCCAAACCCAAAGGGAGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTTCCTTCTCAGGGGCACGAGACACTCAAGAAGTCGATGGATTTCGACAAAGATGGCTTGGCCCCCGATCATCAAGCAAAGGTAACTAAGAACTTACCAATTTC
TCACCCTCAAGAAGGTGACAAGAAGTTCAACACTTTAGATACAAAGGGAGAAGTCGATGGAACTGATTCTTTACTTTACCCGAGTTTGCAATCACAGGCACCTTGGTTTT
GGAAAAACGGTCTTTGTATCATGGCAATTCCTAACAAACCCAAGAAGGGAGGCAACGATCATAGGATTATTGGGGGAGATTTAAATGTCTCAAGATTGTCATTTGAAAAA
TCCAATGGCAAACAGAGAACTGCTAATATGAATCTTTTCAACAGATTCATTCATGATTGCAATCTCATTGATCGCTTTCTTATTTCTAGCCCTTGTATTGATAAATTCAA
GAGGGACATTGTCAAGCGTCTACCTCGACCCACCTCTAATCTTTGCCCGATGCATCTCACTCTTGGAGAGCAAAATTGGGGTCTTAGTTACCTTAAATACAAGAACATGT
GGATGGATCAGAAAACAGAAATTTCATTGTTAAGTATTGACAAAAGAGCTATACTAGCAAAGAGAGACATGAGAGGGGAGGATAGCCAAACCCAAAGGGAGAGATAA
Protein sequenceShow/hide protein sequence
MDLPSQGHETLKKSMDFDKDGLAPDHQAKVTKNLPISHPQEGDKKFNTLDTKGEVDGTDSLLYPSLQSQAPWFWKNGLCIMAIPNKPKKGGNDHRIIGGDLNVSRLSFEK
SNGKQRTANMNLFNRFIHDCNLIDRFLISSPCIDKFKRDIVKRLPRPTSNLCPMHLTLGEQNWGLSYLKYKNMWMDQKTEISLLSIDKRAILAKRDMRGEDSQTQRER