CuGenDBv2

HG10017601 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017601
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: RNase H domain-containing protein
Genome location: Chr03:16647964..16651311
RNA-Seq Expression: HG10017601
Synteny: HG10017601
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
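
The genome location string in the record above can be split into a chromosome name and two coordinates. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

# Matches strings of the form "<chromosome>:<start>..<end>".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


chrom, start, end = parse_location("Chr03:16647964..16651311")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr03 16647964 16651311 3348
```

Under that assumption the gene spans 3,348 bp on Chr03.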


Homology
GenBank top hits | e value | % identity
KAG7599443.1 Reverse transcriptase zinc-binding domain [Arabidopsis suecica] | 6.3e-10 | 27.46
Query:  WCIWGDRNKVVHGEALPS--------VIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKY
        W IW  RNK +  + + S        V    +W+   L A   P  +   +     V++    +SC     DA+    +   G+GW+  +  D +  A  
Subjt:  WCIWGDRNKVVHGEALPS--------VIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKY

Query:  GFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLARQGL
           S   SP+ A+AI +   L  A+ +G   ++I SD   LI+ +N +  P  E+ G++  I +   ++    FHFT RE N  A SLA+  L
Subjt:  GFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLARQGL

PON64686.1 Ribonuclease H-like domain containing protein [Trema orientale] | 6.3e-10 | 30.29
Query:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEA---LPSVIFRCDWIVNYLEAFVDPTSVCLAIS----KRGHVLSPFGKLSCAVLFVDASCKPGVDVM
        M  K++LSK+DF L  V +W I  DRN + HG A     S++   D+ +   + FV    V  + +     +       G+L    L VDA+ K   D +
Subjt:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEA---LPSVIFRCDWIVNYLEAFVDPTSVCLAIS----KRGHVLSPFGKLSCAVLFVDASCKPGVDVM

Query:  GFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWN
        G G V+ D    +  A     +  LSP  A+ + + EG+ FA + GL    I +D   ++  + +K    LE   +V  I R +    +    FT R  N
Subjt:  GFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWN

Query:  TAAHSLAR
         AAHSLA+
Subjt:  TAAHSLAR

RYR79715.1 hypothetical protein Ahy_A01g004533 [Arachis hypogaea] | 4.8e-10 | 23.96
Query:  RGCRS--FRFDELWSFHSNCRDIIT---NCVSEHSHGIRCHLEEVLSYCAVNLRKQGR-NVNRI---LNSNIRELRLTIQSEYNRPPPFNFRLISEMEGK
        RG R+  F+F+  W+ H+ C +II    N V   S     +L   ++ C   L K  + N  R    +   +  L+L  + +Y+       + IS ++  
Subjt:  RGCRS--FRFDELWSFHSNCRDIIT---NCVSEHSHGIRCHLEEVLSYCAVNLRKQGR-NVNRI---LNSNIRELRLTIQSEYNRPPPFNFRLISEMEGK

Query:  LHHLLLEEEMYWNREQE----RIGCSGVIKILLDSTNKHLGGGSKTRFRGFTTLMEFGLKMKGKIRL-SKKDFALAC--VG--AWCIWGDRNKVVHGEAL
        +  L  +EE +W +       + G          ST +        + +  +    +G      +++ +  D+ L+   VG   W IW  RN+ VH  + 
Subjt:  LHHLLLEEEMYWNREQE----RIGCSGVIKILLDSTNKHLGGGSKTRFRGFTTLMEFGLKMKGKIRL-SKKDFALAC--VG--AWCIWGDRNKVVHGEAL

Query:  PSVIFRCD----WIVNYLEAFVDPTSVCLAISKRGHVLS---------PFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPIC
        PS +   +      +++ +   +P     AIS   H  +         P G + C    VDA+ +          V  D    + AA    R    SP+ 
Subjt:  PSVIFRCD----WIVNYLEAFVDPTSVCLAISKRGHVLS---------PFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPIC

Query:  AKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLARQGLSG
        A+A  + E L  A +  +DRI + SD L LI  +  K  P+ E++ +++ I   V S  +C F +  RE N  AH +AR  + G
Subjt:  AKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLARQGLSG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 1.3e-15 | 29.13
Query:  RLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAF--VDPTSVCLAISKRGHVLSPFGKLSCAV---LFVDASCKPGVDVMGFGWVIL
        +L  KD  LA +  W IW DRN ++HG+ +  V F+C+W+  +L++      ++           +  + + S +V   L  DA+C+       FG +I 
Subjt:  RLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAF--VDPTSVCLAISKRGHVLSPFGKLSCAV---LFVDASCKPGVDVMGFGWVIL

Query:  DEGDKIKAAKYGFRSPF-LSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSL
        D    + AA    R PF LSP+ A+   ILEGL FA       + + SD L  I ++  ++    + +  V  I      +A   F  + R+ N AAH L
Subjt:  DEGDKIKAAKYGFRSPF-LSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSL

Query:  ARQGLS
        A+ G++
Subjt:  ARQGLS

XP_030970837.1 uncharacterized protein LOC115991254 [Quercus lobata] | 1.4e-09 | 24.88
Query:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVIL
        +K +++L  +  AL  V AWC+W   N+V  G A  S      W   YLE F     +         VL    K     + +D++    +  +G G V+ 
Subjt:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVIL

Query:  DEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLA
        D+   + AA     +  L P+  +A+ + E + FA   G   +   SD L++I  +N  + P   +  ++    + +  +   +F +  R  N AAH LA
Subjt:  DEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSLA

Query:  R
        +
Subjt:  R

TrEMBL top hits | e value | % identity
A0A2P5CUQ4 Ribonuclease H-like domain containing protein | 3.1e-10 | 30.29
Query:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEA---LPSVIFRCDWIVNYLEAFVDPTSVCLAIS----KRGHVLSPFGKLSCAVLFVDASCKPGVDVM
        M  K++LSK+DF L  V +W I  DRN + HG A     S++   D+ +   + FV    V  + +     +       G+L    L VDA+ K   D +
Subjt:  MKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEA---LPSVIFRCDWIVNYLEAFVDPTSVCLAIS----KRGHVLSPFGKLSCAVLFVDASCKPGVDVM

Query:  GFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWN
        G G V+ D    +  A     +  LSP  A+ + + EG+ FA + GL    I +D   ++  + +K    LE   +V  I R +    +    FT R  N
Subjt:  GFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWN

Query:  TAAHSLAR
         AAHSLA+
Subjt:  TAAHSLAR

A0A6J1DX30 uncharacterized protein LOC111024874 | 6.4e-16 | 29.13
Query:  RLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAF--VDPTSVCLAISKRGHVLSPFGKLSCAV---LFVDASCKPGVDVMGFGWVIL
        +L  KD  LA +  W IW DRN ++HG+ +  V F+C+W+  +L++      ++           +  + + S +V   L  DA+C+       FG +I 
Subjt:  RLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAF--VDPTSVCLAISKRGHVLSPFGKLSCAV---LFVDASCKPGVDVMGFGWVIL

Query:  DEGDKIKAAKYGFRSPF-LSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSL
        D    + AA    R PF LSP+ A+   ILEGL FA       + + SD L  I ++  ++    + +  V  I      +A   F  + R+ N AAH L
Subjt:  DEGDKIKAAKYGFRSPF-LSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCDFHFTQREWNTAAHSL

Query:  ARQGLS
        A+ G++
Subjt:  ARQGLS

A0A803PV25 Uncharacterized protein | 1.0e-10 | 27.23
Query:  GGGSKTRFRGFTTLMEFGLKMKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVL
        G  SK + +G   ++ F ++M     L+K+DF    V  W +W  RN V HG   P      +W   +L  F D      A + R              +
Subjt:  GGGSKTRFRGFTTLMEFGLKMKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVL

Query:  FVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSY
         VDA  K G  +     V+ D   ++K A        LSP+ A+   I +G+       L    + +DCL  +++V K  G   +V GLV  I   +   
Subjt:  FVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSY

Query:  AHCDFHFTQREWNTAAHSLARQGL
              F  RE N  AH LA + L
Subjt:  AHCDFHFTQREWNTAAHSLARQGL

A0A803Q0L5 Uncharacterized protein | 1.1e-12 | 30.96
Query:  KKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKA
        K  F    V  W IW  RN V+HG   P      DW   YL  F    +  ++  +R +      +L    + VDA  K G  V G G VI D G +   
Subjt:  KKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKA

Query:  AKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCD-FHFTQREWNTAAHSLARQGL
        A        L+P+  +   IL GL    H  + R +I SDCL  + ++ +K     +V  L++ I R +L Y   D   F  RE N  A+ LA   L
Subjt:  AKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQVLSYAHCD-FHFTQREWNTAAHSLARQGL

A0A803Q8A7 Uncharacterized protein | 1.0e-10 | 25.55
Query:  KHLGGGSKTRFRGFTTLMEFGLKMKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSC
        K  G   K   +G   ++ F ++M   + L+ +DF    V +W +W  RN + HG   P      +W   +   F D T+     SKR            
Subjt:  KHLGGGSKTRFRGFTTLMEFGLKMKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISKRGHVLSPFGKLSC

Query:  AVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQV
          + VDA  K G  +      + D   ++  A        L+P+ A+  TI  GL    H  L   T+ ++CL  +++V K  G   +V GL+  I   +
Subjt:  AVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIWRQV

Query:  LSYAHCDFHFTQREWNTAAHSLARQGL
                 F  R+ N  AH LA + L
Subjt:  LSYAHCDFHFTQREWNTAAHSLARQGL

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 4.8e-08 | 27.69
Query:  LSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIW
        L    +F DA+ K     +GFGWVI +  +               P+ A+AI +   L +A  +G+ ++++ SD   LI  +  +  P  E  G++  I 
Subjt:  LSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIW

Query:  RQVLSYAHCDFHFTQREWNTAAHSLARQGL
           L +A   F F  R  N  A  LA+  L
Subjt:  RQVLSYAHCDFHFTQREWNTAAHSLARQGL
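
The % identity values reported for each hit relate to the middle row of an alignment: a letter marks a column where both sequences carry the same residue, `+` marks a similar residue, and a space marks a mismatch or gap. The sketch below illustrates the usual calculation (identical columns divided by aligned columns) on the last segment of the Arabidopsis alignment above. The function name `percent_identity` is hypothetical, and the figure it prints covers only this 30-column segment, not the whole alignment.

```python
def percent_identity(query_row, midline):
    """Return (identical, columns, percent) for one alignment segment.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    The number of aligned columns is taken from the query row, gaps included.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment segment")
    return identical, columns, 100.0 * identical / columns


# Last segment of the AT4G09490.1 alignment, copied from the record above.
query_row = "RQVLSYAHCDFHFTQREWNTAAHSLARQGL"
midline = "   L +A   F F  R  N  A  LA+  L"

identical, columns, pct = percent_identity(query_row, midline)
print(identical, columns, round(pct, 2))  # 10 30 33.33
```

To reproduce a reported whole-alignment value, the counts from all segments of that alignment would need to be summed before dividing.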


Sequences
CDS sequence
ATGAGATCTAGTTCCATTAGAAGCCGTCGGGGTTGTAGGTCCTTCCGTTTTGATGAGTTGTGGTCTTTCCATTCTAATTGTAGAGATATTATCACTAACTGTGTTTCTGA
GCATTCTCATGGGATTCGTTGTCATCTTGAGGAGGTTCTCTCCTATTGTGCTGTTAATTTAAGAAAGCAGGGAAGGAACGTCAATCGTATCCTGAATTCGAACATTAGAG
AATTGAGGTTGACCATTCAATCTGAATATAATCGCCCTCCTCCTTTCAATTTTAGACTTATTTCTGAAATGGAAGGAAAGCTCCATCATTTACTTCTGGAAGAGGAGATG
TATTGGAACAGAGAGCAAGAGAGAATTGGTTGTAGTGGGGTGATAAAAATACTACTTGATTCCACAAACAAGCATCTTGGCGGAGGAAGCAAAACACGATTTCGAGGATT
CACAACTCTGATGGAGTTTGGACTGAAGATGAAGGGCAAAATAAGGCTGTCTAAAAAGGATTTTGCTTTGGCTTGTGTGGGTGCGTGGTGTATTTGGGGTGATCGGAATA
AAGTGGTTCATGGTGAGGCTTTACCTTCGGTTATTTTTCGGTGTGACTGGATTGTTAATTATCTGGAGGCTTTTGTTGATCCAACATCTGTCTGTTTGGCGATCTCTAAG
AGGGGCCATGTGCTTAGCCCTTTTGGCAAACTGTCTTGTGCGGTTCTTTTTGTTGATGCCTCTTGCAAACCTGGGGTTGATGTTATGGGCTTTGGTTGGGTGATCCTTGA
TGAAGGAGACAAGATCAAGGCTGCTAAGTATGGGTTTCGAAGCCCTTTTCTTTCTCCTATTTGTGCTAAAGCCATTACAATTTTAGAAGGTTTAGATTTTGCTACCCATG
TGGGGTTAGATCGTATCACGATTATGTCTGATTGTCTTGCGCTTATTGATATGGTAAATAAAAAGGTTGGTCCAATTTTGGAGGTTCGTGGCCTTGTGGAGGCTATCTGG
AGACAGGTTCTTTCCTATGCGCATTGTGATTTTCATTTTACTCAACGTGAGTGGAATACTGCTGCTCACTCTTTGGCGCGCCAGGGTCTGTCTGGAGTTGAATACTCCTG
A
mRNA sequence
ATGAGATCTAGTTCCATTAGAAGCCGTCGGGGTTGTAGGTCCTTCCGTTTTGATGAGTTGTGGTCTTTCCATTCTAATTGTAGAGATATTATCACTAACTGTGTTTCTGA
GCATTCTCATGGGATTCGTTGTCATCTTGAGGAGGTTCTCTCCTATTGTGCTGTTAATTTAAGAAAGCAGGGAAGGAACGTCAATCGTATCCTGAATTCGAACATTAGAG
AATTGAGGTTGACCATTCAATCTGAATATAATCGCCCTCCTCCTTTCAATTTTAGACTTATTTCTGAAATGGAAGGAAAGCTCCATCATTTACTTCTGGAAGAGGAGATG
TATTGGAACAGAGAGCAAGAGAGAATTGGTTGTAGTGGGGTGATAAAAATACTACTTGATTCCACAAACAAGCATCTTGGCGGAGGAAGCAAAACACGATTTCGAGGATT
CACAACTCTGATGGAGTTTGGACTGAAGATGAAGGGCAAAATAAGGCTGTCTAAAAAGGATTTTGCTTTGGCTTGTGTGGGTGCGTGGTGTATTTGGGGTGATCGGAATA
AAGTGGTTCATGGTGAGGCTTTACCTTCGGTTATTTTTCGGTGTGACTGGATTGTTAATTATCTGGAGGCTTTTGTTGATCCAACATCTGTCTGTTTGGCGATCTCTAAG
AGGGGCCATGTGCTTAGCCCTTTTGGCAAACTGTCTTGTGCGGTTCTTTTTGTTGATGCCTCTTGCAAACCTGGGGTTGATGTTATGGGCTTTGGTTGGGTGATCCTTGA
TGAAGGAGACAAGATCAAGGCTGCTAAGTATGGGTTTCGAAGCCCTTTTCTTTCTCCTATTTGTGCTAAAGCCATTACAATTTTAGAAGGTTTAGATTTTGCTACCCATG
TGGGGTTAGATCGTATCACGATTATGTCTGATTGTCTTGCGCTTATTGATATGGTAAATAAAAAGGTTGGTCCAATTTTGGAGGTTCGTGGCCTTGTGGAGGCTATCTGG
AGACAGGTTCTTTCCTATGCGCATTGTGATTTTCATTTTACTCAACGTGAGTGGAATACTGCTGCTCACTCTTTGGCGCGCCAGGGTCTGTCTGGAGTTGAATACTCCTG
A
Protein sequence
MRSSSIRSRRGCRSFRFDELWSFHSNCRDIITNCVSEHSHGIRCHLEEVLSYCAVNLRKQGRNVNRILNSNIRELRLTIQSEYNRPPPFNFRLISEMEGKLHHLLLEEEM
YWNREQERIGCSGVIKILLDSTNKHLGGGSKTRFRGFTTLMEFGLKMKGKIRLSKKDFALACVGAWCIWGDRNKVVHGEALPSVIFRCDWIVNYLEAFVDPTSVCLAISK
RGHVLSPFGKLSCAVLFVDASCKPGVDVMGFGWVILDEGDKIKAAKYGFRSPFLSPICAKAITILEGLDFATHVGLDRITIMSDCLALIDMVNKKVGPILEVRGLVEAIW
RQVLSYAHCDFHFTQREWNTAAHSLARQGLSGVEYS
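
The protein sequence corresponds to the CDS read three nucleotides at a time. As a minimal sketch of that relationship, the code below translates the first 24 and last 18 nucleotides of the CDS listed above, assuming the standard genetic code (the record does not name the translation table). The `translate` helper is illustrative and not a CuGenDB tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


cds_start = "ATGAGATCTAGTTCCATTAGAAGC"  # first 24 nt of the CDS above
cds_end = "GGAGTTGAATACTCCTGA"          # last 18 nt, ending in the TGA stop codon

print(translate(cds_start))  # MRSSSIRS
print(translate(cds_end))    # GVEYS
```

Both fragments match the corresponding ends of the listed protein sequence, which begins MRSSSIRS and ends GVEYS.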