; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G009680 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G009680
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRNase H domain-containing protein
Genome locationchr09:12732538..12794854
RNA-Seq ExpressionLsi09G009680
SyntenyLsi09G009680
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4356945.1 hypothetical protein F8388_015921 [Cannabis sativa]2.4e-0444.26Show/hide
Query:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS
        A+IRDS G +VVA++ FL   +    A+A  +LHG+ L R W + NV V +D+Q ++KA+S
Subjt:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS

KAF4379885.1 hypothetical protein G4B88_021018 [Cannabis sativa]1.9e-0431.75Show/hide
Query:  HSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKA-----
        HS       WSPP  G+ K+   +           G +IRD  G+VVVA + ++   L    AE+LA+  GL L   W L ++ + +D Q ++ A     
Subjt:  HSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKA-----

Query:  --ISENRFVLVVVAAVQPLPFSHRRC
          IS+   +LV +A  +P  F  RRC
Subjt:  --ISENRFVLVVVAAVQPLPFSHRRC

KAF4401592.1 hypothetical protein G4B88_001786 [Cannabis sativa]2.4e-0444.26Show/hide
Query:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS
        A+IRDS G +VVA++ FL   +    A+A  +LHG+ L R W + NV V +D+Q ++KA+S
Subjt:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS

XP_030943489.1 uncharacterized protein LOC115968280 [Quercus lobata]4.9e-0531.19Show/hide
Query:  SSSSPEERHSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQAL
        +SSS +++   +   + W PP  G  K+ V             G IIRDS G++V A +++L  Q  +F  E LA+  G+ L +  GL  + +E+DA ++
Subjt:  SSSSPEERHSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQAL

Query:  VKAISENRF
        +++I+ N F
Subjt:  VKAISENRF

XP_037439691.1 uncharacterized protein LOC119307722 isoform X1 [Triticum dicoccoides]3.4e-0638.89Show/hide
Query:  RWSPPVVGSLKLKV-----------RSGAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAISE
        RW  P  G +KL V            +GA++RDSSG  + AS  F  Y ++  + EALALL GL+L  H G+  + VE+D+Q +V+A+++
Subjt:  RWSPPVVGSLKLKV-----------RSGAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAISE

TrEMBL top hitse value%identityAlignment
A0A2K2CVF0 RNase H domain-containing protein5.8e-0431.78Show/hide
Query:  PESCSNLILLLSSSSPEERHSANVGRDRWSPPVVGSLKLKV-----------RSGAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLV
        P S  ++  L  +       S+ + R  W  P  G  KL V            +GA+IRD+SG+ V AS+ F+    ++  AEALAL HG++L +++G  
Subjt:  PESCSNLILLLSSSSPEERHSANVGRDRWSPPVVGSLKLKV-----------RSGAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLV

Query:  NVPVEAD
        N+ + +D
Subjt:  NVPVEAD

A0A7J6EES1 RNase H domain-containing protein1.2e-0444.26Show/hide
Query:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS
        A+IRDS G +VVA++ FL   +    A+A  +LHG+ L R W + NV V +D+Q ++KA+S
Subjt:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS

A0A7J6GAB1 RNase H domain-containing protein9.0e-0531.75Show/hide
Query:  HSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKA-----
        HS       WSPP  G+ K+   +           G +IRD  G+VVVA + ++   L    AE+LA+  GL L   W L ++ + +D Q ++ A     
Subjt:  HSANVGRDRWSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKA-----

Query:  --ISENRFVLVVVAAVQPLPFSHRRC
          IS+   +LV +A  +P  F  RRC
Subjt:  --ISENRFVLVVVAAVQPLPFSHRRC

A0A7J6I1Z3 RNase H domain-containing protein1.2e-0444.26Show/hide
Query:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS
        A+IRDS G +VVA++ FL   +    A+A  +LHG+ L R W + NV V +D+Q ++KA+S
Subjt:  AIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAIS

A0A803QGC3 Uncharacterized protein3.4e-0429.7Show/hide
Query:  WSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAISEN----------
        W PP  G   +   +            A+IRDS G +VVA++ FL   +    A+A  +LHG+ L R W + NV V +D+Q ++KA+S            
Subjt:  WSPPVVGSLKLKVRS-----------GAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPVEADAQALVKAISEN----------

Query:  -RFVLVVVAAVQPLP--FSHRRCVEQQWAVAVVAVVQSSIHGASSCWSSLV-RTVAAVAVAVVPS
           + ++ ++ Q L   F HR C +    VA      S +   S  W+ L+    AA+ +A VPS
Subjt:  -RFVLVVVAAVQPLP--FSHRRCVEQQWAVAVVAVVQSSIHGASSCWSSLV-RTVAAVAVAVVPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGATTTTAAGCTATCGCAACTCCTATCTACGCTTAAATCTTTCCCCTGAATCTTGTTCAAATCTCATTCTTCTTTTATCTTCTTCTTCGCCGGAAGAGAGACA
TTCTGCAAATGTGGGAAGAGATCGGTGGTCTCCTCCGGTTGTTGGAAGTCTTAAACTCAAAGTGAGGTCAGGGGCTATCATCCGCGACTCGTCTGGGGTGGTGGTGGTTG
CTTCTTCTCATTTTCTTCTTTATCAGTTGGAGTCCTTTGCAGCGGAGGCTTTGGCCCTTCTTCACGGACTGAAGCTGGTTCGACATTGGGGGTTAGTGAATGTTCCTGTT
GAAGCTGATGCTCAAGCGCTCGTCAAAGCCATTTCTGAGAATAGGTTTGTGCTCGTGGTCGTCGCTGCTGTCCAGCCATTACCGTTCAGCCATCGTCGTTGCGTGGAGCA
GCAGTGGGCCGTCGCTGTCGTTGCCGTCGTGCAGTCGAGCATTCATGGAGCGTCGTCGTGCTGGAGCAGTCTCGTCCGTACCGTCGCTGCTGTTGCCGTTGCTGTCGTGC
CATCTCGTCCAGCCGCCGGATCTGTCCAGTTCATCCCGGATCTCACAAATCTAGCTCTTATCCTTTGGGAGAGGTTCGAGGCAAATAAGGAAAACCTCCATATTGGCATG
ATTTTGGTTGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATGATTTTAAGCTATCGCAACTCCTATCTACGCTTAAATCTTTCCCCTGAATCTTGTTCAAATCTCATTCTTCTTTTATCTTCTTCTTCGCCGGAAGAGAGACA
TTCTGCAAATGTGGGAAGAGATCGGTGGTCTCCTCCGGTTGTTGGAAGTCTTAAACTCAAAGTGAGGTCAGGGGCTATCATCCGCGACTCGTCTGGGGTGGTGGTGGTTG
CTTCTTCTCATTTTCTTCTTTATCAGTTGGAGTCCTTTGCAGCGGAGGCTTTGGCCCTTCTTCACGGACTGAAGCTGGTTCGACATTGGGGGTTAGTGAATGTTCCTGTT
GAAGCTGATGCTCAAGCGCTCGTCAAAGCCATTTCTGAGAATAGGTTTGTGCTCGTGGTCGTCGCTGCTGTCCAGCCATTACCGTTCAGCCATCGTCGTTGCGTGGAGCA
GCAGTGGGCCGTCGCTGTCGTTGCCGTCGTGCAGTCGAGCATTCATGGAGCGTCGTCGTGCTGGAGCAGTCTCGTCCGTACCGTCGCTGCTGTTGCCGTTGCTGTCGTGC
CATCTCGTCCAGCCGCCGGATCTGTCCAGTTCATCCCGGATCTCACAAATCTAGCTCTTATCCTTTGGGAGAGGTTCGAGGCAAATAAGGAAAACCTCCATATTGGCATG
ATTTTGGTTGTATGA
Protein sequenceShow/hide protein sequence
MDMILSYRNSYLRLNLSPESCSNLILLLSSSSPEERHSANVGRDRWSPPVVGSLKLKVRSGAIIRDSSGVVVVASSHFLLYQLESFAAEALALLHGLKLVRHWGLVNVPV
EADAQALVKAISENRFVLVVVAAVQPLPFSHRRCVEQQWAVAVVAVVQSSIHGASSCWSSLVRTVAAVAVAVVPSRPAAGSVQFIPDLTNLALILWERFEANKENLHIGM
ILVV