; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021112 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021112
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNase H domain-containing protein
Genome locationChr05:5519187..5519711
RNA-Seq ExpressionHG10021112
SyntenyHG10021112
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015381856.1 uncharacterized protein LOC107175052 [Citrus sinensis]6.9e-1534.44Show/hide
Query:  HLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEG
        H +W PP EG +K+NVD A  +     G   VIRN KG++IA A    K+S    +AE  A+L G+++A   + + +++E D    + LIN K  TL E 
Subjt:  HLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEG

Query:  LCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFP
        +  + +I +  + F  F    ++RS N  A  +AK A     S +W + FP
Subjt:  LCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFP

XP_015383531.1 uncharacterized protein LOC107176039 [Citrus sinensis]3.1e-1536.91Show/hide
Query:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC
        +W PP E  +K+NVD A  ++    G  AVIRN +G+++  A N +KS     + E  A L GI+ A +   S I++ESD    + LIN K STL E   
Subjt:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC

Query:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFP
         + +I +  + F  F    + R+ N+L   +AK A   S S IW D FP
Subjt:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFP

XP_024033533.1 uncharacterized protein LOC112095662 [Citrus clementina]6.9e-1534Show/hide
Query:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC
        +W PPP   LK+NVD A        G  AVIRNS+G ++  A   +       + E  A+  G+ +A +   + +++E+DC +  NL N K S   E   
Subjt:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC

Query:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPS
         + EI    QDF       + R  N+ A  +AK+A  SS S IW+D FP+
Subjt:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPS

XP_024046567.1 uncharacterized protein LOC112100924 [Citrus clementina]9.6e-1735.48Show/hide
Query:  WIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLCW
        W PPP   +K+NVD A  +   S G   VIRN +G ++A A   +       +AE  AI  G+ +AK   +S ++IE+DC    +L NKK S + E    
Subjt:  WIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLCW

Query:  LTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALA
        +++I K +QDF + +   + R  N  A  +AK+A  SS S IW ++FP+ +  +A
Subjt:  LTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALA

XP_038902513.1 uncharacterized protein LOC120089172 [Benincasa hispida]1.2e-1935.37Show/hide
Query:  WIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLCW
        W  PP   +KLNVD AW   P+S+G+SA+IR+++G L  V  + + +    PLAE   +L+G+RL  +     I+++SDC  A++L  K   + +    W
Subjt:  WIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLCW

Query:  LTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHF
        L EIW++S  F      +I R++N LAD +AK+ +    + +W D +
Subjt:  LTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHF

TrEMBL top hitse value%identityAlignment
A0A1R3G9C4 Reverse transcriptase3.1e-1337.5Show/hide
Query:  LKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGL
        ++W+PP  GS KLNVD A+       G  AVIR+  GD+ + A   L        AE+ AI  G++LAK   +   L+ESDCL+ ++ IN  S+   EG 
Subjt:  LKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGL

Query:  CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQA
        C + EI  L+  F       +NR  N  A  +AK A
Subjt:  CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQA

A0A200PV73 Reverse transcriptase zinc-binding domain3.1e-1334.32Show/hide
Query:  THLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVA----ANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSS
        +H +WIPPP GS+K+N D A  D   + G   VIRNS+G ++A       NH+    KA  AE  A L+GI LAK    + I++E D L+ VN+++  S+
Subjt:  THLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVA----ANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSS

Query:  TLNEGL-CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCI
        T+   +   + +   LS  F+ F+  ++ R  N +A  +AK A  S     W +  P  +  L   D I
Subjt:  TLNEGL-CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCI

A0A200R8X5 Reverse transcriptase zinc-binding domain1.8e-1334.32Show/hide
Query:  THLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVA----ANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSS
        +H +WIPPP GS+K+N D A  D   + G   VIRNS+G ++A       NH+    KA  AE  A L+GI LAK    + I++E D L+ VN+++  S+
Subjt:  THLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVA----ANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSS

Query:  TLNEGL-CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCI
        T+   +   +++   LS  F+ F+  ++ R  N +A  +AK A  S     W +  P  +  L   D I
Subjt:  TLNEGL-CWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCI

A0A6P5RZ31 uncharacterized protein LOC1107515791.3e-1432.34Show/hide
Query:  PTHLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLN
        P+   W PPP G LKLNVD A S   +  G  AV+RN  GDL+   +  +     A + ELLAI EG++         +++E+D   A+N I   +    
Subjt:  PTHLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLN

Query:  EGLCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCIHL
             + +I  LS++F   + +F  R  N++AD +AK A  S    +W +  P WL      D   L
Subjt:  EGLCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCIHL

A0A6P6U6V4 uncharacterized protein LOC1137081833.7e-1432.7Show/hide
Query:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC
        +W+ P  G +K+N D A   +    GW  V RNS G+L+   A   +   +A + E LAI E + +AK+     + +ESDC L V+ IN +   ++    
Subjt:  KWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLC

Query:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKD
         L +IW++ +DF    C F  R+ N ++  +AK A        W   FP+WL  LA  D
Subjt:  WLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein9.6e-0729.03Show/hide
Query:  KWIPPPEGSLKLNVDVAWSD-RPFS-TGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINK-KSSTLNE
        +W PPPEG +K N D  ++   P++ +GW+  IR   G ++      L+SS  +  AE L  L  +++     +  +  ESD    V LIN  +  +L  
Subjt:  KWIPPPEGSLKLNVDVAWSD-RPFS-TGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINK-KSSTLNE

Query:  GLCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWL
         L +    W L   +   +  F+NR  NS AD +A            +   PSWL
Subjt:  GLCWLTEIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGTTTTTGCCCAACCCACTTAAAATGGATCCCTCCTCCTGAAGGTTCCTTGAAGTTAAATGTTGATGTTGCGTGGTCAGATCGCCCCTTTTCCACTGGTTGGAGTGCTGT
CATCAGGAACTCTAAAGGAGACCTCATTGCGGTTGCTGCAAACCATCTTAAAAGCTCTCTTAAAGCCCCTCTTGCTGAACTCTTAGCTATTCTTGAAGGTATTCGTTTAG
CAAAGAGGTGTAAGGTCTCCTGCATTTTGATTGAATCGGATTGTCTTTTAGCAGTTAATCTGATCAACAAAAAATCCTCAACCTTGAATGAAGGTCTCTGCTGGCTTACT
GAAATTTGGAAGCTCTCTCAAGATTTTAGTATTTTTACCTGCCTCTTTATAAACAGATCTCTAAATTCTTTAGCAGATTGTATAGCTAAACAGGCTAAATTTTCGTCTTT
CTCTGGAATTTGGTTTGATCATTTTCCTTCCTGGTTGAAAGCTTTAGCTGTTAAAGATTGTATTCATCTTGCCCGTGTGGCGTAA
mRNA sequenceShow/hide mRNA sequence
TGTTTTTGCCCAACCCACTTAAAATGGATCCCTCCTCCTGAAGGTTCCTTGAAGTTAAATGTTGATGTTGCGTGGTCAGATCGCCCCTTTTCCACTGGTTGGAGTGCTGT
CATCAGGAACTCTAAAGGAGACCTCATTGCGGTTGCTGCAAACCATCTTAAAAGCTCTCTTAAAGCCCCTCTTGCTGAACTCTTAGCTATTCTTGAAGGTATTCGTTTAG
CAAAGAGGTGTAAGGTCTCCTGCATTTTGATTGAATCGGATTGTCTTTTAGCAGTTAATCTGATCAACAAAAAATCCTCAACCTTGAATGAAGGTCTCTGCTGGCTTACT
GAAATTTGGAAGCTCTCTCAAGATTTTAGTATTTTTACCTGCCTCTTTATAAACAGATCTCTAAATTCTTTAGCAGATTGTATAGCTAAACAGGCTAAATTTTCGTCTTT
CTCTGGAATTTGGTTTGATCATTTTCCTTCCTGGTTGAAAGCTTTAGCTGTTAAAGATTGTATTCATCTTGCCCGTGTGGCGTAA
Protein sequenceShow/hide protein sequence
CFCPTHLKWIPPPEGSLKLNVDVAWSDRPFSTGWSAVIRNSKGDLIAVAANHLKSSLKAPLAELLAILEGIRLAKRCKVSCILIESDCLLAVNLINKKSSTLNEGLCWLT
EIWKLSQDFSIFTCLFINRSLNSLADCIAKQAKFSSFSGIWFDHFPSWLKALAVKDCIHLARVA