; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G012700 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G012700
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionRNase H domain-containing protein
Genome locationCiama_Chr01:24830279..24834922
RNA-Seq ExpressionCaUC01G012700
SyntenyCaUC01G012700
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4274760.1 unnamed protein product [Prunus armeniaca]5.3e-0929.61Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W  P  G  K+N D A            +IR   G ++ A  +   +G  A+  E L +   L       +  ++VE+D+Q  + ++NG +   S +E I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLY
        + +I+  +  ++  SF++  RSCNQ AH +A +A   GG  VW H  P WL+
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLY

KAF4371092.1 hypothetical protein F8388_020819 [Cannabis sativa]1.7e-1031.08Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W SPP G F +NTD +            +IR  +G +V A  + IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R  +   +I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP

KAF4383359.1 hypothetical protein G4B88_023933 [Cannabis sativa]1.7e-1031.08Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W SPP G F +NTD +            +IR  +G +V A  + IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R  +   +I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP

KMS97072.1 hypothetical protein BVRB_7g179330 [Beta vulgaris subsp. vulgaris]1.1e-0929.22Show/hide
Query:  WLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        WL P  G  K+N D A   +  RVG+        G ++ A +  + +   A +AEA  +L+  +         +++E+DAQ ++N IN   ++   ++ +
Subjt:  WLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLYQL
        LE++   +       F +  R CN++AH+LA WA +N   EVW    PSW+  L
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLYQL

XP_021808158.1 uncharacterized protein LOC110751913 [Prunus avium]2.6e-1127.49Show/hide
Query:  QLWKATLPEIFPNVLWAHKVPIAEASMWANYVMAYLNEFWEAQ----VALAVGISPVWLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAE
        ++WK     +F +   A   PI    +  N V  Y     E Q     A++  ++  W  PP    K+N DVA    L+R G+        G ++ A  E
Subjt:  QLWKATLPEIFPNVLWAHKVPIAEASMWANYVMAYLNEFWEAQ----VALAVGISPVWLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAE

Query:  CIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVW
            G  AL  E L +   L       +  ++VE+D Q  ++++NG++   S +E I+ +I+  +   +  SF++  +SCN+VAH +A +   +GG  VW
Subjt:  CIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVW

Query:  THCSPSWLYQL
         H    WL+ +
Subjt:  THCSPSWLYQL

TrEMBL top hitse value%identityAlignment
A0A0J8BAU9 Uncharacterized protein5.2e-1029.22Show/hide
Query:  WLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        WL P  G  K+N D A   +  RVG+        G ++ A +  + +   A +AEA  +L+  +         +++E+DAQ ++N IN   ++   ++ +
Subjt:  WLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLYQL
        LE++   +       F +  R CN++AH+LA WA +N   EVW    PSW+  L
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLYQL

A0A6J5UE59 Reverse transcriptase domain-containing protein2.6e-0929.61Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W  P  G  K+N D A            +IR   G ++ A  +   +G  A+  E L +   L       +  ++VE+D+Q  + ++NG +   S +E I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLY
        + +I+  +  ++  SF++  RSCNQ AH +A +A   GG  VW H  P WL+
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLY

A0A6P5S2T0 uncharacterized protein LOC1107519131.2e-1127.49Show/hide
Query:  QLWKATLPEIFPNVLWAHKVPIAEASMWANYVMAYLNEFWEAQ----VALAVGISPVWLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAE
        ++WK     +F +   A   PI    +  N V  Y     E Q     A++  ++  W  PP    K+N DVA    L+R G+        G ++ A  E
Subjt:  QLWKATLPEIFPNVLWAHKVPIAEASMWANYVMAYLNEFWEAQ----VALAVGISPVWLSPPNGEFKLNTDVACCPDLIRVGM--------GPMVLATAE

Query:  CIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVW
            G  AL  E L +   L       +  ++VE+D Q  ++++NG++   S +E I+ +I+  +   +  SF++  +SCN+VAH +A +   +GG  VW
Subjt:  CIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVW

Query:  THCSPSWLYQL
         H    WL+ +
Subjt:  THCSPSWLYQL

A0A7J6FK63 RNase H domain-containing protein8.0e-1131.08Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W SPP G F +NTD +            +IR  +G +V A  + IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R  +   +I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP

A0A7J6GMP0 RNase H domain-containing protein8.0e-1131.08Show/hide
Query:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI
        W SPP G F +NTD +            +IR  +G +V A  + IP  +  L AE L +   LK+     +  + + +D+Q+++  + G++R  +   +I
Subjt:  WLSPPNGEFKLNTDVACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVI

Query:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP
        +E+     +  N  SFI+I R CN VAH LA+W+      +VWT   P
Subjt:  LEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein4.5e-0625.32Show/hide
Query:  MKQSETTNHV--FLQYKTSKQLWKATLPEIFPNVL----WAHKVPIAEASMWANYVMAYLNEFWEAQVALAVGISPV---------WLSPPNGEFKLNTD
        + +++TTN +  FL +    +LWK+    +F        +  +  I +A+ W N      NE  E    + V  +P+         W  PP G  K N D
Subjt:  MKQSETTNHV--FLQYKTSKQLWKATLPEIFPNVL----WAHKVPIAEASMWANYVMAYLNEFWEAQVALAVGISPV---------WLSPPNGEFKLNTD

Query:  VACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIK
                       IR   G +VL     + +   +L AEAL  L  L+V+    +  +  E+D+++L+ +IN      S +  ++ +I+ WM K+   
Subjt:  VACCPD--------LIRVGMGPMVLATAECIPNGVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIK

Query:  SFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWL
        S  ++ R  N  A  LAS   A           PSWL
Subjt:  SFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWL

AT3G09510.1 Ribonuclease H-like superfamily protein1.9e-0434.29Show/hide
Query:  LEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWL
        LEAE   LL  L+         + +E D QTL+N+ING S   SS+   LE+I  W  K     F +I+R  N++AH LA +          +   P WL
Subjt:  LEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWL

Query:  YQLGC
         +  C
Subjt:  YQLGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAATCTGAAACAACTAATCATGTATTTCTTCAGTATAAAACATCTAAACAACTATGGAAAGCTACTTTACCTGAGATATTTCCCAATGTGTTGTGGGCTCACAA
GGTCCCGATTGCTGAAGCTTCGATGTGGGCTAATTATGTGATGGCGTATCTCAATGAGTTTTGGGAGGCTCAGGTGGCTTTGGCTGTTGGTATTTCTCCGGTTTGGTTGT
CACCACCGAATGGGGAGTTCAAACTCAACACTGATGTTGCATGCTGTCCTGATTTGATAAGAGTAGGGATGGGACCAATGGTTTTAGCTACTGCTGAGTGCATTCCAAAT
GGTGTTCTAGCGTTGGAGGCAGAAGCTTTGGTTCTTTTGTTTCGCTTGAAGGTGGTGCATCAGATGGTTGTCCCGATTCTTCTGGTGGAAACAGATGCCCAGACGCTAAT
GAACATCATTAATGGCGAATCGAGGAAGTTTTCATCTATGGAGGTGATCCTTGAAGAAATCAAAACATGGATGAGGAAAGTTAATATTAAGAGTTTTATTTATATCCAAA
GATCTTGTAATCAAGTGGCCCATAAATTGGCTTCTTGGGCTTGGGCTAATGGGGGAAAAGAGGTGTGGACTCATTGCTCTCCCTCTTGGTTATATCAGTTAGGGTGCTTG
CTTGGCTTTTTCAACAAGATGGCTTCCCCGACAAAGACGCCATTTTTAAAGAATTCTTTTCGATGTCATTTTATTCCCAACGAGAATCAATCAAAAGAAATTTATAATAT
CGAATCCTACTACTATTACTACGATGGCGTAGCAACAGTGTCAAGTCTGGGTCGATCTGCACAAAAGCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAATCTGAAACAACTAATCATGTATTTCTTCAGTATAAAACATCTAAACAACTATGGAAAGCTACTTTACCTGAGATATTTCCCAATGTGTTGTGGGCTCACAA
GGTCCCGATTGCTGAAGCTTCGATGTGGGCTAATTATGTGATGGCGTATCTCAATGAGTTTTGGGAGGCTCAGGTGGCTTTGGCTGTTGGTATTTCTCCGGTTTGGTTGT
CACCACCGAATGGGGAGTTCAAACTCAACACTGATGTTGCATGCTGTCCTGATTTGATAAGAGTAGGGATGGGACCAATGGTTTTAGCTACTGCTGAGTGCATTCCAAAT
GGTGTTCTAGCGTTGGAGGCAGAAGCTTTGGTTCTTTTGTTTCGCTTGAAGGTGGTGCATCAGATGGTTGTCCCGATTCTTCTGGTGGAAACAGATGCCCAGACGCTAAT
GAACATCATTAATGGCGAATCGAGGAAGTTTTCATCTATGGAGGTGATCCTTGAAGAAATCAAAACATGGATGAGGAAAGTTAATATTAAGAGTTTTATTTATATCCAAA
GATCTTGTAATCAAGTGGCCCATAAATTGGCTTCTTGGGCTTGGGCTAATGGGGGAAAAGAGGTGTGGACTCATTGCTCTCCCTCTTGGTTATATCAGTTAGGGTGCTTG
CTTGGCTTTTTCAACAAGATGGCTTCCCCGACAAAGACGCCATTTTTAAAGAATTCTTTTCGATGTCATTTTATTCCCAACGAGAATCAATCAAAAGAAATTTATAATAT
CGAATCCTACTACTATTACTACGATGGCGTAGCAACAGTGTCAAGTCTGGGTCGATCTGCACAAAAGCGCTAG
Protein sequenceShow/hide protein sequence
MKQSETTNHVFLQYKTSKQLWKATLPEIFPNVLWAHKVPIAEASMWANYVMAYLNEFWEAQVALAVGISPVWLSPPNGEFKLNTDVACCPDLIRVGMGPMVLATAECIPN
GVLALEAEALVLLFRLKVVHQMVVPILLVETDAQTLMNIINGESRKFSSMEVILEEIKTWMRKVNIKSFIYIQRSCNQVAHKLASWAWANGGKEVWTHCSPSWLYQLGCL
LGFFNKMASPTKTPFLKNSFRCHFIPNENQSKEIYNIESYYYYYDGVATVSSLGRSAQKR