; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G08993 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G08993
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRNase H domain-containing protein
Genome locationClcChr01:9651781..9652616
RNA-Seq ExpressionClc01G08993
SyntenyClc01G08993
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7041153.1 unnamed protein product [Microthlaspi erraticum]8.1e-1227.09Show/hide
Query:  FQGR---IKAEVSKINAAITYWAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEF
        FQG+   ++  + ++      WA   +  N   ++ +       W+RP  G +K N D AW  +    GLG ++R+ + +                 AE 
Subjt:  FQGR---IKAEVSKINAAITYWAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEF

Query:  LAVKEGLRLAIPFRQKKLIVESDS----------DCW------FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLS
         A++  +R    F  +K++ ESDS          + W       NEIR L   F  V+F + +R  NG ADR+AK A S  I+A   +S  P W+ + + 
Subjt:  LAVKEGLRLAIPFRQKKLIVESDS----------DCW------FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLS

Query:  LDN
        +DN
Subjt:  LDN

MBA0758538.1 hypothetical protein [Gossypium trilobum]2.0e-1031.49Show/hide
Query:  WAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVE
        W  WNDRNN   K   +  + K W +P +G +K+NVDA  +     TG+GA+ RDH                +   AE  A K GL+L       +LIVE
Subjt:  WAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVE

Query:  SDSDCWFNEIR-----------------DLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD
        SDS    N +                  D+F  F SV   + NR SN  AD + K A   + +  +   YP  I N +  D
Subjt:  SDSDCWFNEIR-----------------DLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD

XP_038902513.1 uncharacterized protein LOC120089172 [Benincasa hispida]2.4e-1135.57Show/hide
Query:  KLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVESD-------------SD----
        + W  P    +KLNVDAAW   P S+G  AI+RD+Q              +  PLAE   V +GLRL      KK+IV+SD             SD    
Subjt:  KLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVESD-------------SD----

Query:  CWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSY
         W  EI ++   F  ++F Y  R  N  AD +AK  R   IN  W  SY
Subjt:  CWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSY

XP_042972859.1 uncharacterized protein LOC122304666 [Carya illinoinensis]2.6e-1030.72Show/hide
Query:  KKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPLA--------------EFLAVKEGLRLAIPFRQKKLIVESDS------------
        K++     W  P IG LKLNVD A   D  S G+G ILRDH  +  +A              E +A+  GL+L + +   K+I+ESDS            
Subjt:  KKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPLA--------------EFLAVKEGLRLAIPFRQKKLIVESDS------------

Query:  -----DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD
             D    +IR L  GF      + +R  NG A ++A+ A+       W+ S PS++   + LD
Subjt:  -----DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD

XP_042988669.1 uncharacterized protein LOC122316201 [Carya illinoinensis]6.9e-1128.71Show/hide
Query:  ITYWAIWNDRNNVRAKEAK-------------KLEITKL---------------WQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPLA------
        +  W IWN RN    ++A              K E TK+               W  P IG LKLNVD A   D  S G+G ILRDH  +  +A      
Subjt:  ITYWAIWNDRNNVRAKEAK-------------KLEITKL---------------WQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPLA------

Query:  --------EFLAVKEGLRLAIPFRQKKLIVESDS-----------------DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS
                E + +  GL+L   +   K+I+ESDS                 D    +IR L  GF      + +R  NG   ++A+ AR  +    W+ S
Subjt:  --------EFLAVKEGLRLAIPFRQKKLIVESDS-----------------DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS

Query:  YPSWIVNHL
         PS++V H+
Subjt:  YPSWIVNHL

TrEMBL top hitse value%identityAlignment
A0A446K5I3 RNase H domain-containing protein4.1e-0931.68Show/hide
Query:  LWQRPSIGTLKLNVDAAWILDPPSTGLGAILRD--------------HQHEAPLAEFLAVKEGLRLAIPFRQKKLIVESDSD----------------CW
        +W++P+ G +K+NVDAA+  +  +   GA+ RD              H   A  AE +A++ GL LA+      LI+ESDS                   
Subjt:  LWQRPSIGTLKLNVDAAWILDPPSTGLGAILRD--------------HQHEAPLAEFLAVKEGLRLAIPFRQKKLIVESDSD----------------CW

Query:  FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSW----IVNHLSL
        + E R L   FA V   +C R +N  AD IAKNA S + +  W    P +    IVN L++
Subjt:  FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSW----IVNHLSL

A0A5J9U9H2 RNase H domain-containing protein (Fragment)3.1e-0926.49Show/hide
Query:  NNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEFLAVKEGLRLAIPFRQKKLIVESDSDC--
        ++V+ KE   +   K W +P  G  K+NVDAA+ +D    G+G I+R+   E  L              AE LA KEGL L +  +   +  E +SDC  
Subjt:  NNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEFLAVKEGLRLAIPFRQKKLIVESDSDC--

Query:  -----------------WFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLDNCPCNALVA
                            E+++L      V  T+CNR  N  +  +A  A     +  W  S+P ++ + ++ D   CN +++
Subjt:  -----------------WFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLDNCPCNALVA

A0A6D2JVB2 Uncharacterized protein3.9e-1227.09Show/hide
Query:  FQGR---IKAEVSKINAAITYWAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEF
        FQG+   ++  + ++      WA   +  N   ++ +       W+RP  G +K N D AW  +    GLG ++R+ + +                 AE 
Subjt:  FQGR---IKAEVSKINAAITYWAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPL--------------AEF

Query:  LAVKEGLRLAIPFRQKKLIVESDS----------DCW------FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLS
         A++  +R    F  +K++ ESDS          + W       NEIR L   F  V+F + +R  NG ADR+AK A S  I+A   +S  P W+ + + 
Subjt:  LAVKEGLRLAIPFRQKKLIVESDS----------DCW------FNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLS

Query:  LDN
        +DN
Subjt:  LDN

A0A7J9DD45 RNase H domain-containing protein9.7e-1131.49Show/hide
Query:  WAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVE
        W  WNDRNN   K   +  + K W +P +G +K+NVDA  +     TG+GA+ RDH                +   AE  A K GL+L       +LIVE
Subjt:  WAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQ--------------HEAPLAEFLAVKEGLRLAIPFRQKKLIVE

Query:  SDSDCWFNEIR-----------------DLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD
        SDS    N +                  D+F  F SV   + NR SN  AD + K A   + +  +   YP  I N +  D
Subjt:  SDSDCWFNEIR-----------------DLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLD

Q10IA6 Retrotransposon protein, putative, unclassified2.2e-1029.81Show/hide
Query:  NNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDH------------QH--EAPLAEFLAVKEGLRLAIPFRQKKLIVESDSDCWF
        N + + +  K +I   W++P  G L +NVDAA+  D  S G+G +LRDH             H  +AP+AE  A+++GL LA     K +I+        
Subjt:  NNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDH------------QH--EAPLAEFLAVKEGLRLAIPFRQKKLIVESDSDCWF

Query:  NEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLDNCPC
             ++DGF ++S  +CNR SN  A  +A+     K +   F ++  W  + + L +  C
Subjt:  NEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLDNCPC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.7e-0427.48Show/hide
Query:  RPSIGTLKLNVDAAWILDPPSTGLGAILRD-------HQHEAP-------LAEFLAVKEGLRLAIPFRQKKLIVESDSDCWFNEIRD----------LFD
        RP +  + +  DAAW  +    G G ++R+       H   A        +AE +A+   L+ A      KL + SDS      I            +FD
Subjt:  RPSIGTLKLNVDAAWILDPPSTGLGAILRD-------HQHEAP-------LAEFLAVKEGLRLAIPFRQKKLIVESDSDCWFNEIRD----------LFD

Query:  ------GFASVSFTYCNRGSNGTADRIAKNA
              GFA VSF++  R  N  AD +AK++
Subjt:  ------GFASVSFTYCNRGSNGTADRIAKNA

AT5G38920.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.1e-0424.42Show/hide
Query:  WAIWNDRNNVRAK----EAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAIL-RDHQHEAPLAEFLAVKEGLRLAIPFRQKKLIVESDS------
        W +W +RN++  K    +A  +E+  +    S+ +      A  +  P  T  G +        A + E  A++  +        +K+I ESDS      
Subjt:  WAIWNDRNNVRAK----EAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAIL-RDHQHEAPLAEFLAVKEGLRLAIPFRQKKLIVESDS------

Query:  ----------DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLSLD
                  D    +I+ L   F    F + +RG NG ADRIAK + S +      +S  P W+ + + LD
Subjt:  ----------DCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFS-YPSWIVNHLSLD

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-0426.42Show/hide
Query:  WQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQH---EAPLAEF---LAVKE--------GLRLAIPFRQKKLIVESDSDC----------------WF
        W  P    LK N DA+       +GLG ILR+ Q    E  + +F   +  +E         ++ +  F  KK+I E D+                  + 
Subjt:  WQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQH---EAPLAEF---LAVKE--------GLRLAIPFRQKKLIVESDSDC----------------WF

Query:  NEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADW--FFSYPSWIVNHLSLD
        + I+     F S+ F++ +R  NG AD +AK A   K N  W  F S P ++  +++ D
Subjt:  NEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADW--FFSYPSWIVNHLSLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCTTGTCTCCTGTCAGAGAGCCTACCTTCTTTGAGAGGGTCTTCCTTCCCCTTTGCTTCCAAGGGAGAATCAAAGCGGAAGTTTCAAAGATAAATGCGGCTATTAC
CTACTGGGCCATTTGGAATGACAGGAATAATGTTCGGGCAAAAGAAGCAAAGAAATTAGAGATTACCAAATTGTGGCAACGCCCTTCGATTGGAACTCTCAAATTGAATG
TTGATGCTGCTTGGATTTTGGATCCCCCTTCTACAGGACTGGGAGCTATTCTTCGAGACCATCAACATGAAGCCCCTCTGGCTGAGTTTCTTGCAGTCAAAGAAGGCCTT
CGCCTCGCGATTCCTTTCCGCCAGAAGAAACTTATTGTGGAGTCGGACTCGGATTGCTGGTTTAATGAAATCAGAGATCTTTTTGACGGTTTTGCTTCAGTATCTTTCAC
ATATTGTAATCGGGGGAGCAACGGTACTGCTGATAGAATAGCTAAAAATGCAAGGTCATGTAAAATCAATGCTGATTGGTTTTTCTCCTATCCAAGCTGGATTGTAAACC
ATCTGTCTCTAGACAATTGTCCTTGTAATGCCCTTGTGGCGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCTTGTCTCCTGTCAGAGAGCCTACCTTCTTTGAGAGGGTCTTCCTTCCCCTTTGCTTCCAAGGGAGAATCAAAGCGGAAGTTTCAAAGATAAATGCGGCTATTAC
CTACTGGGCCATTTGGAATGACAGGAATAATGTTCGGGCAAAAGAAGCAAAGAAATTAGAGATTACCAAATTGTGGCAACGCCCTTCGATTGGAACTCTCAAATTGAATG
TTGATGCTGCTTGGATTTTGGATCCCCCTTCTACAGGACTGGGAGCTATTCTTCGAGACCATCAACATGAAGCCCCTCTGGCTGAGTTTCTTGCAGTCAAAGAAGGCCTT
CGCCTCGCGATTCCTTTCCGCCAGAAGAAACTTATTGTGGAGTCGGACTCGGATTGCTGGTTTAATGAAATCAGAGATCTTTTTGACGGTTTTGCTTCAGTATCTTTCAC
ATATTGTAATCGGGGGAGCAACGGTACTGCTGATAGAATAGCTAAAAATGCAAGGTCATGTAAAATCAATGCTGATTGGTTTTTCTCCTATCCAAGCTGGATTGTAAACC
ATCTGTCTCTAGACAATTGTCCTTGTAATGCCCTTGTGGCGCTTTAA
Protein sequenceShow/hide protein sequence
MPLSPVREPTFFERVFLPLCFQGRIKAEVSKINAAITYWAIWNDRNNVRAKEAKKLEITKLWQRPSIGTLKLNVDAAWILDPPSTGLGAILRDHQHEAPLAEFLAVKEGL
RLAIPFRQKKLIVESDSDCWFNEIRDLFDGFASVSFTYCNRGSNGTADRIAKNARSCKINADWFFSYPSWIVNHLSLDNCPCNALVAL