; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G005000 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G005000
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRNase H domain-containing protein
Genome locationCG_Chr04:18539864..18540555
RNA-Seq ExpressionClCG04G005000
SyntenyClCG04G005000
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4273082.1 unnamed protein product [Prunus armeniaca]3.6e-1233.12Show/hide
Query:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE
        GK  +N  +  PP     T +SW  P +G  K+N DAAW      GG+GW+VRD  G  V  G K   +      +EA+ I   L   ++  L     LE
Subjt:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE

Query:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS
        VE D+L+V+ ++  +W     +D I+ DI   +       F +  R  NKAAH     VS
Subjt:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS

CAB4279811.1 unnamed protein product [Prunus armeniaca]6.1e-1233.12Show/hide
Query:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE
        GK  +N  +  PP     T +SW    +G  K+N DAAW      GG+GW+VRD  G  V  GCK   +      +EAK I   L   ++  L     LE
Subjt:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE

Query:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS
        VE D+L+++ +L  +W     ++ I+ DI   +       F +  R  NKAAH     VS
Subjt:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]5.5e-1335.56Show/hide
Query:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIF
        T + W PP+    KLN +AAW      GG+GWI+RD KG  +   C+       ++ LE   I  GL  + +E   PI    +E D+LE + LL+R    
Subjt:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIF

Query:  FAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH
          EI  ++E+I   +  + I    H SRE NK AH
Subjt:  FAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.2e-1539.72Show/hide
Query:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI
        W PP      LN DA+WSD   RGG+GWI+R   G  V  G +F    + V +LEA  IL GL  L    LG + PL +E D+ EV  LLNR      + 
Subjt:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI

Query:  DVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVSSLR
          ++E+IL    S  I  F    RE N  AH      S LR
Subjt:  DVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVSSLR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]5.0e-1435.71Show/hide
Query:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIP-----PLEVEFDALEVVGLLN
        T + W PP+    KLN DAAW      GG+GWI+RD KG  +   C+       ++ LE   I  GL  + +E   PI      P+ +E D+LE + LL+
Subjt:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIP-----PLEVEFDALEVVGLLN

Query:  RDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH
        R      EI  ++E+I   +  + I    H SRE NK AH
Subjt:  RDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.7e-1335.56Show/hide
Query:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIF
        T + W PP+    KLN +AAW      GG+GWI+RD KG  +   C+       ++ LE   I  GL  + +E   PI    +E D+LE + LL+R    
Subjt:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIF

Query:  FAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH
          EI  ++E+I   +  + I    H SRE NK AH
Subjt:  FAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH

A0A6J1D4B6 uncharacterized protein LOC1110171811.5e-1135.37Show/hide
Query:  GRVNGVISNPPTPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGC-KFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDAL
        G +N    N P    W+PP+    KLNVDA W D    GGLGWIVRD +G  +   C K  S+S     LEA +                  +E+E D L
Subjt:  GRVNGVISNPPTPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGC-KFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDAL

Query:  EVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH
        EVV ++N+  +   E+ +I+EDI   + S+ I+ F H   + N  AH
Subjt:  EVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH

A0A6J1DNV9 uncharacterized protein LOC1110224035.7e-1639.72Show/hide
Query:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI
        W PP      LN DA+WSD   RGG+GWI+R   G  V  G +F    + V +LEA  IL GL  L    LG + PL +E D+ EV  LLNR      + 
Subjt:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI

Query:  DVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVSSLR
          ++E+IL    S  I  F    RE N  AH      S LR
Subjt:  DVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVSSLR

A0A6J1DSV1 uncharacterized protein LOC1110236082.4e-1435.71Show/hide
Query:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIP-----PLEVEFDALEVVGLLN
        T + W PP+    KLN DAAW      GG+GWI+RD KG  +   C+       ++ LE   I  GL  + +E   PI      P+ +E D+LE + LL+
Subjt:  TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIP-----PLEVEFDALEVVGLLN

Query:  RDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH
        R      EI  ++E+I   +  + I    H SRE NK AH
Subjt:  RDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAH

A0A6J5UAY2 Reverse transcriptase domain-containing protein1.7e-1233.12Show/hide
Query:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE
        GK  +N  +  PP     T +SW  P +G  K+N DAAW      GG+GW+VRD  G  V  G K   +      +EA+ I   L   ++  L     LE
Subjt:  GKGRVNGVISNPP-----TPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLE

Query:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS
        VE D+L+V+ ++  +W     +D I+ DI   +       F +  R  NKAAH     VS
Subjt:  VEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0427.4Show/hide
Query:  KGRVNGVISNPPTPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDAL
        +G+ +G          W  P +   K N DA W     R G+GWI+R+  G  ++ G +   ++  V  LEA++  +   +L          +  E DA 
Subjt:  KGRVNGVISNPPTPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDAL

Query:  EVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAA
         +V LLN D  F+  +   +EDI   L       F    R  NK A
Subjt:  EVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAA

AT4G29090.1 Ribonuclease H-like superfamily protein5.9e-0528.46Show/hide
Query:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI
        W PP     K N DA W+    R G+GW++R+ KG   + G +   K    S+LEA++  +   +L          +  E D+  ++ +LN D I +  +
Subjt:  WSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEI

Query:  DVIIEDILYFLSSINIKFFCHYSREDNKAA
           I+D+   LS      F    RE N  A
Subjt:  DVIIEDILYFLSSINIKFFCHYSREDNKAA

AT5G19270.1 unknown protein6.9e-0637.7Show/hide
Query:  SWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVIL
        +W+PP FG  K NV+A W +     G+ WI RD  G A++      ++SS   M E + IL
Subjt:  SWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKVIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTCTGGTTAGTGTTGGCAGGGGCAGGTGGTCCTTGTTGGATTATTGGGAGAACCTCTGGAGTGCTGATTAATTCCCTCCTCCCTATCGATGGTGTTGCTGGTAA
AGGTCGTGTTAATGGTGTTATCTCTAATCCTCCGACCCCTTCTAGCTGGTCTCCCCCTTCTTTTGGGTGTTCGAAGTTGAATGTGGATGCTGCTTGGTCCGACATCAAAG
GAAGGGGAGGCCTTGGGTGGATTGTCCGGGACCACAAGGGACATGCTGTATTTGGAGGCTGCAAGTTCACTTCTAAAAGCAGTGGTGTGAGCATGCTAGAAGCTAAGGTC
ATCCTTGTGGGGTTGACATTGTTGATTGAGGAGTGCTTGGGTCCTATTCCACCATTGGAGGTGGAGTTCGATGCTTTGGAGGTTGTCGGGCTGCTAAACAGGGATTGGAT
TTTCTTTGCTGAGATCGATGTTATTATTGAGGACATTTTGTACTTTTTGAGCTCTATAAATATTAAATTTTTTTGTCATTACTCCCGTGAAGATAACAAAGCGGCCCATA
TGACTGTCACTTTGGTATCCTCTTTGCGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTCTGGTTAGTGTTGGCAGGGGCAGGTGGTCCTTGTTGGATTATTGGGAGAACCTCTGGAGTGCTGATTAATTCCCTCCTCCCTATCGATGGTGTTGCTGGTAA
AGGTCGTGTTAATGGTGTTATCTCTAATCCTCCGACCCCTTCTAGCTGGTCTCCCCCTTCTTTTGGGTGTTCGAAGTTGAATGTGGATGCTGCTTGGTCCGACATCAAAG
GAAGGGGAGGCCTTGGGTGGATTGTCCGGGACCACAAGGGACATGCTGTATTTGGAGGCTGCAAGTTCACTTCTAAAAGCAGTGGTGTGAGCATGCTAGAAGCTAAGGTC
ATCCTTGTGGGGTTGACATTGTTGATTGAGGAGTGCTTGGGTCCTATTCCACCATTGGAGGTGGAGTTCGATGCTTTGGAGGTTGTCGGGCTGCTAAACAGGGATTGGAT
TTTCTTTGCTGAGATCGATGTTATTATTGAGGACATTTTGTACTTTTTGAGCTCTATAAATATTAAATTTTTTTGTCATTACTCCCGTGAAGATAACAAAGCGGCCCATA
TGACTGTCACTTTGGTATCCTCTTTGCGGTGA
Protein sequenceShow/hide protein sequence
MLVWLVLAGAGGPCWIIGRTSGVLINSLLPIDGVAGKGRVNGVISNPPTPSSWSPPSFGCSKLNVDAAWSDIKGRGGLGWIVRDHKGHAVFGGCKFTSKSSGVSMLEAKV
ILVGLTLLIEECLGPIPPLEVEFDALEVVGLLNRDWIFFAEIDVIIEDILYFLSSINIKFFCHYSREDNKAAHMTVTLVSSLR