CuGenDBv2

HG10019562 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10019562
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: RNase H domain-containing protein
Genome location: Chr04:23202386..23202805
RNA-Seq Expression: HG10019562
Synteny: HG10019562
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
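
The genome location above can be converted into a feature length with a few lines of Python. This is only a minimal sketch: it assumes the coordinates are 1-based and inclusive (the page does not say so), and `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr04:23202386..23202805' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(location):
    """Length of the span in base pairs."""
    # Assumption: 1-based inclusive coordinates, so both end points are counted.
    _, start, end = parse_location(location)
    return end - start + 1

print(span_length("Chr04:23202386..23202805"))  # 420
```

Under that assumption the span is 420 bp, the same length as the CDS listed under Sequences (139 codons plus a stop codon).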


Homology
GenBank top hits (e value, % identity, alignment)
KAG2635063.1 hypothetical protein PVAP13_2NG311303 [Panicum virgatum] | e value: 3.8e-08 | % identity: 33.9
Query:  EEVLKKSKVASSFKGMVENSVNRQKTGV-----WSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAK
        +EV   ++ A S   +  N  N  K  V     W  P   + KLNVDA +S    +    A++R+  G+  A A   ++   DAP+AE FA+++ LRLA+
Subjt:  EEVLKKSKVASSFKGMVENSVNRQKTGV-----WSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAK

Query:  KCGASRIMVESDCIQAIN
        + G +RI+V +DC+Q ++
Subjt:  KCGASRIMVESDCIQAIN

KAG2635966.1 hypothetical protein PVAP13_2NG409203 [Panicum virgatum] | e value: 5.0e-08 | % identity: 38.82
Query:  WSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN
        W  PP  +  LNVDA +          A+LR+  G + AA+ +HI   +DAP+AE +A+ E L LA+  GA+R++V+SDC++ ++
Subjt:  WSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN

XP_023917750.1 uncharacterized protein LOC112029294 [Quercus suber] | e value: 2.2e-08 | % identity: 39.77
Query:  TGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN
        T  WSPPP G FK+NVD   S I  S+    ++R+ KG+  AA    ++S     L E+FA+ + +RLA++   SR++ ESD +  IN
Subjt:  TGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN

XP_023920453.1 uncharacterized protein LOC112031983 [Quercus suber] | e value: 3.5e-09 | % identity: 40.91
Query:  TGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN
        T  WSPPP G FK+NVD   S I  S+    ++R+ KG++ AA    ++S   A L E+FA+ + +RLA++   SR++ ESD +  IN
Subjt:  TGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN

XP_038902513.1 uncharacterized protein LOC120089172 [Benincasa hispida] | e value: 1.7e-11 | % identity: 36
Query:  WIVKYVEEVLKKSKVASSFKGMVENSVNRQKTG----VWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEE
        W+   +E    +   AS F+       + +K+G     WS PP  F KLNVDA W   P S+G+SA++R+++G L     + I +    PLAE F +++ 
Subjt:  WIVKYVEEVLKKSKVASSFKGMVENSVNRQKTG----VWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEE

Query:  LRLAKKCGASRIMVESDCIQAINQF
        LRL  K    +I+V+SDC  AI+ F
Subjt:  LRLAKKCGASRIMVESDCIQAINQF
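
The % identity column can be cross-checked against an alignment. In BLAST-style output the middle line shows a letter where the query and subject residues are identical and `+` where the substitution is conservative, so counting the letters and dividing by the alignment length gives the identity. The sketch below does this for the KAG2635966.1 hit. `percent_identity` is a hypothetical helper, and taking the alignment length from the query line (gap columns included) is an assumption.

```python
def percent_identity(query_line, midline):
    """Percent identity from a BLAST-style query line and its midline."""
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identities = sum(ch.isalpha() for ch in midline)
    # Assumption: every column of the query line (gaps included) is an aligned column.
    return round(100 * identities / len(query_line), 2)

query = "WSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN"
midline = "W  PP  +  LNVDA +          A+LR+  G + AA+ +HI   +DAP+AE +A+ E L LA+  GA+R++V+SDC++ ++"

print(percent_identity(query, midline))  # 38.82
```

Here 33 identical positions over 85 aligned columns give 38.82, which matches the value reported for that hit.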

TrEMBL top hits (e value, % identity, alignment)
A0A453QKC2 RNase H domain-containing protein | e value: 5.4e-08 | % identity: 37.5
Query:  ASSFKGMVENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCI
        A+ FK M + S   ++ G W+ P  GF KLNVDA + ++        +LR+ KG   AA++  ++   DA  AE +A+   L LA++ G S+++VESDC+
Subjt:  ASSFKGMVENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCI

Query:  QAIN
        + IN
Subjt:  QAIN

A0A5B7BI33 Uncharacterized protein (Fragment) | e value: 6.0e-07 | % identity: 36.96
Query:  NRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN
        +R  + VWSPPP   FKLNVD  W     S G   ++R+S+G + A  A  +K    A  AE  A+   +  AK+ G   +++ESDC+  +N
Subjt:  NRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN

M5WZF6 RNase H domain-containing protein (Fragment) | e value: 7.0e-08 | % identity: 40
Query:  QKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN
        +K+  WSPPP G +KLNVDA +       G  A++RN KGE+ AA A  + S+  +  AEI A +  ++ A+  G S I++ESD    +N
Subjt:  QKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAIN

M5X848 Uncharacterized protein | e value: 7.8e-07 | % identity: 37.37
Query:  ENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAINQFH
        E+++  +K   WS PP+G FKLNVDA +       G  A++RN KGE+  A A  + ++     AEI A+   L  A   G S I+VESD    +N  +
Subjt:  ENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVESDCIQAINQFH

M5XIC5 RNase H domain-containing protein | e value: 2.0e-07 | % identity: 36.45
Query:  VASSFKGMVENSVNR--QKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVES
        +A  +  +V+   N   +K   WSPPP G +KLNVDA +       G   ++RN KGE+ AA A  + S+  +  AEI A + E++ A   G S I++ES
Subjt:  VASSFKGMVENSVNR--QKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAIIEELRLAKKCGASRIMVES

Query:  DCIQAIN
        D    +N
Subjt:  DCIQAIN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 1.2e-04 | % identity: 31.11
Query:  IVKYVEEVLKKSKVASSFKGM-VENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGE---LCAAAANHIKSSMDAPL
        +++  E+ L++ ++ +  +    +  VNR   G W PPP  + K N DA W+      G   +LRN KGE   + A A   +KS ++A L
Subjt:  IVKYVEEVLKKSKVASSFKGM-VENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGE---LCAAAANHIKSSMDAPL


Sequences
CDS sequence
AACAAGGTGGTCCATAGTTCTCAAATTGTAAAAGTGGAAGGCCGTTGTGGTTGGATTGTGAAGTATGTTGAGGAAGTTTTAAAGAAGAGCAAGGTTGCTTCTTCTTTTAA
GGGGATGGTGGAGAATTCTGTGAACAGGCAGAAGACTGGGGTTTGGTCTCCTCCTCCGATTGGCTTTTTCAAATTAAATGTGGATGCACCTTGGTCATCTATTCCCCCTT
CTACTGGTTGGAGTGCTATGTTGAGAAATTCTAAGGGTGAGTTGTGTGCTGCAGCTGCAAACCATATCAAAAGTTCAATGGATGCCCCGTTAGCAGAGATTTTTGCTATC
ATAGAAGAGCTGCGGTTAGCGAAAAAGTGTGGGGCTTCTCGGATCATGGTGGAGTCTGATTGCATTCAAGCAATCAATCAATTTCATTAA
mRNA sequence
AACAAGGTGGTCCATAGTTCTCAAATTGTAAAAGTGGAAGGCCGTTGTGGTTGGATTGTGAAGTATGTTGAGGAAGTTTTAAAGAAGAGCAAGGTTGCTTCTTCTTTTAA
GGGGATGGTGGAGAATTCTGTGAACAGGCAGAAGACTGGGGTTTGGTCTCCTCCTCCGATTGGCTTTTTCAAATTAAATGTGGATGCACCTTGGTCATCTATTCCCCCTT
CTACTGGTTGGAGTGCTATGTTGAGAAATTCTAAGGGTGAGTTGTGTGCTGCAGCTGCAAACCATATCAAAAGTTCAATGGATGCCCCGTTAGCAGAGATTTTTGCTATC
ATAGAAGAGCTGCGGTTAGCGAAAAAGTGTGGGGCTTCTCGGATCATGGTGGAGTCTGATTGCATTCAAGCAATCAATCAATTTCATTAA
Protein sequence
NKVVHSSQIVKVEGRCGWIVKYVEEVLKKSKVASSFKGMVENSVNRQKTGVWSPPPIGFFKLNVDAPWSSIPPSTGWSAMLRNSKGELCAAAANHIKSSMDAPLAEIFAI
IEELRLAKKCGASRIMVESDCIQAINQFH
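
The protein sequence above corresponds to a frame-1 translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, not the tool used by the database.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nucleotides and last 9 nucleotides of the CDS listed above.
print(translate("AACAAGGTGGTCCATAGTTCTCAAATTGTA"))  # NKVVHSSQIV
print(translate("TTTCATTAA"))                       # FH*
```

The two fragments reproduce the start (`NKVVHSSQIV`) and end (`FH`, followed by the stop codon) of the listed protein sequence.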