; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G013790 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G013790
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRNase H domain-containing protein
Genome locationCG_Chr01:27611263..27612250
RNA-Seq ExpressionClCG01G013790
SyntenyClCG01G013790
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4358383.1 hypothetical protein G4B88_008543 [Cannabis sativa]1.9e-1432.89Show/hide
Query:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI
        WS PP G F +NTD +        G+  +IRD  G +V A  E+IP  +  L AE L +   +K+     +  +L+ +D Q+++  + G++R ++  G I
Subjt:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSPS
        LE+     +  N  SF +  R CN VAH LA+W+      +VWT   P+
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSPS

KAF4371092.1 hypothetical protein F8388_020819 [Cannabis sativa]6.3e-1830.99Show/hide
Query:  VTSCSDNGAIVKWWKRMW-SMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDG
        +   SDN A   WWK +  S+ IP  +   E   H+   +K S Q  K +  ++           WSSPP G F +NTD +        G+  +IRD  G
Subjt:  VTSCSDNGAIVKWWKRMW-SMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDG

Query:  SMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAW
        ++V A  ++IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R ++  G+I+E+     +  N  SFI+I R CN VAH LA+W+ 
Subjt:  SMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAW

Query:  ANGEKEVWTHCSP
             +VWT   P
Subjt:  ANGEKEVWTHCSP

KAF4381263.1 hypothetical protein G4B88_009591 [Cannabis sativa]1.7e-1827.91Show/hide
Query:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQS--ETTNHVFLQYKRSK-QLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDS
        +  CSDN AI  WWK +W   +  K++       + +++ +     ++  A  P+          + WS PP G F +NTD +        G+  +IRD 
Subjt:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQS--ETTNHVFLQYKRSK-QLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDS

Query:  DGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW
         G +V A  ++IP  +  L AE L +   LK+     +  + + +D Q+++  + G++R ++  G ILE+     +  N  SF +  R CN VAH  A+W
Subjt:  DGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW

Query:  AWANGEKEVWTHCSP
        +      +VWT   P
Subjt:  AWANGEKEVWTHCSP

KAF4383359.1 hypothetical protein G4B88_023933 [Cannabis sativa]4.5e-1633.78Show/hide
Query:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI
        WSSPP G F +NTD +        G+  +IRD  G++V A  ++IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R ++  G+I
Subjt:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSP

KAF4392530.1 hypothetical protein F8388_000657 [Cannabis sativa]3.1e-1727.36Show/hide
Query:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGS
        +  CSDN A   WWK +W   +  K++       +F+      +++   +P +   +V   ++ WS PP G F +NT  +        G+R +IRD  G+
Subjt:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGS

Query:  MVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWA
        +V A  ++IP  +  L AE L +   LK+V   ++  + + +D+Q ++  + G++R ++  G+++++     ++    SF +  R CN VAH LA+W+  
Subjt:  MVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWA

Query:  NGEKEVWTHCSP
             VWT   P
Subjt:  NGEKEVWTHCSP

TrEMBL top hitse value%identityAlignment
A0A7J6FK63 RNase H domain-containing protein3.0e-1830.99Show/hide
Query:  VTSCSDNGAIVKWWKRMW-SMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDG
        +   SDN A   WWK +  S+ IP  +   E   H+   +K S Q  K +  ++           WSSPP G F +NTD +        G+  +IRD  G
Subjt:  VTSCSDNGAIVKWWKRMW-SMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDG

Query:  SMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAW
        ++V A  ++IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R ++  G+I+E+     +  N  SFI+I R CN VAH LA+W+ 
Subjt:  SMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAW

Query:  ANGEKEVWTHCSP
             +VWT   P
Subjt:  ANGEKEVWTHCSP

A0A7J6GEF1 RNase H domain-containing protein8.0e-1927.91Show/hide
Query:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQS--ETTNHVFLQYKRSK-QLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDS
        +  CSDN AI  WWK +W   +  K++       + +++ +     ++  A  P+          + WS PP G F +NTD +        G+  +IRD 
Subjt:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQS--ETTNHVFLQYKRSK-QLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDS

Query:  DGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW
         G +V A  ++IP  +  L AE L +   LK+     +  + + +D Q+++  + G++R ++  G ILE+     +  N  SF +  R CN VAH  A+W
Subjt:  DGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW

Query:  AWANGEKEVWTHCSP
        +      +VWT   P
Subjt:  AWANGEKEVWTHCSP

A0A7J6GMP0 RNase H domain-containing protein2.2e-1633.78Show/hide
Query:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI
        WSSPP G F +NTD +        G+  +IRD  G++V A  ++IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R ++  G+I
Subjt:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSP

A0A7J6HDR6 Uncharacterized protein1.5e-1727.36Show/hide
Query:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGS
        +  CSDN A   WWK +W   +  K++       +F+      +++   +P +   +V   ++ WS PP G F +NT  +        G+R +IRD  G+
Subjt:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGS

Query:  MVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWA
        +V A  ++IP  +  L AE L +   LK+V   ++  + + +D+Q ++  + G++R ++  G+++++     ++    SF +  R CN VAH LA+W+  
Subjt:  MVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWA

Query:  NGEKEVWTHCSP
             VWT   P
Subjt:  NGEKEVWTHCSP

A0A803QGC3 Uncharacterized protein2.8e-1625.31Show/hide
Query:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEI---------------FPDVVVGISL-----------------VWSSP
        +  CS+   +  WWK +W   +  K++     N V+  Y +    W  T  E+               F D  +  +L                  W  P
Subjt:  VTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEI---------------FPDVVVGISL-----------------VWSSP

Query:  PNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEI
        PNG + +NTD +        G+ AVIRDS G++V+A   ++P  +  L A+A  +L  + +  +  +  + V +D+QT++  ++ E+   +  G ++ EI
Subjt:  PNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEI

Query:  KTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSPS
        K         +F++  R CN+VA+ LASW+    + E+WT   P+
Subjt:  KTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.4e-0728.79Show/hide
Query:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI
        W+ PP G  K N D               IR+ +G +VL     + +   +L AEAL  L  L+V+    +  +  E+D+++L+ +IN      S +G +
Subjt:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLAS
        + +I+ WM K+   S  ++ R  N  A  LAS
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLAS

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-0530.83Show/hide
Query:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI
        W +PP    K N D       +      +IR+  G+ +      + +    LEAE   LL  L+         + +E D QTL+N+ING S   SS+   
Subjt:  WSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW
        LE+I  W  K     F +I+R  N++AH LA +
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTACTTCATGTTCTGATAATGGAGCCATTGTGAAGTGGTGGAAAAGAATGTGGAGTATGTCAATCCCAAATAAGATACGGCAATCTGAAACAACTAATCATGTATT
TCTTCAGTATAAAAGATCTAAACAACTATGGAAAGCTACTTTACCTGAGATATTTCCCGATGTGGTTGTTGGTATTTCTCTGGTTTGGTCGTCACCACCGAATGGGGAGT
TCAAACTCAACACTGATGTTGCATGCTGTCCTGATTTGATAAGAGTAGGGATGAGAGCAGTGATTAGAGATTCCGATGGATCGATGGTTTTAGCTATTGTTGAGTGGATT
CCAAATGGTGTTCTAGCGTTGGAGGCAGAAGCTTTGGTTCTTTTGTTTCGCTTGAAGGTGGTGCATCAGATGGTTGTCCCGATTCTTCTGGTGGAAACAGATGCCCAGAC
GCTAATGAACATCATTAATGGCGAATCAAGGAAGTCTTCATCTATGGGGGTGATCCTTGAAGAAATCAAAACATGGATGAGGAAAGTTAATATTAAGAGCTTTATTTATA
TCCAAAGATCTTGTAATCAAGTGGCCCATAAATTGGCTTCTTGGGCTTGGGCTAATGGGGAAAAAGAGGTGTGGACTCATTGCTCTCCCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTACTTCATGTTCTGATAATGGAGCCATTGTGAAGTGGTGGAAAAGAATGTGGAGTATGTCAATCCCAAATAAGATACGGCAATCTGAAACAACTAATCATGTATT
TCTTCAGTATAAAAGATCTAAACAACTATGGAAAGCTACTTTACCTGAGATATTTCCCGATGTGGTTGTTGGTATTTCTCTGGTTTGGTCGTCACCACCGAATGGGGAGT
TCAAACTCAACACTGATGTTGCATGCTGTCCTGATTTGATAAGAGTAGGGATGAGAGCAGTGATTAGAGATTCCGATGGATCGATGGTTTTAGCTATTGTTGAGTGGATT
CCAAATGGTGTTCTAGCGTTGGAGGCAGAAGCTTTGGTTCTTTTGTTTCGCTTGAAGGTGGTGCATCAGATGGTTGTCCCGATTCTTCTGGTGGAAACAGATGCCCAGAC
GCTAATGAACATCATTAATGGCGAATCAAGGAAGTCTTCATCTATGGGGGTGATCCTTGAAGAAATCAAAACATGGATGAGGAAAGTTAATATTAAGAGCTTTATTTATA
TCCAAAGATCTTGTAATCAAGTGGCCCATAAATTGGCTTCTTGGGCTTGGGCTAATGGGGAAAAAGAGGTGTGGACTCATTGCTCTCCCTCTTAG
Protein sequenceShow/hide protein sequence
MVTSCSDNGAIVKWWKRMWSMSIPNKIRQSETTNHVFLQYKRSKQLWKATLPEIFPDVVVGISLVWSSPPNGEFKLNTDVACCPDLIRVGMRAVIRDSDGSMVLAIVEWI
PNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKSSSMGVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGEKEVWTHCSPS