; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037397 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037397
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold8:1666887..1667495
RNA-Seq ExpressionSpg037397
SyntenySpg037397
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PRQ37083.1 putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-containing protein [Rosa chinensis]2.6e-1330.86Show/hide
Query:  QSGEPDGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNL
        Q G    DV     W+P + GS KLNCDA+      + G+GWICR+  GR + A    I  + + +  E L +  GL+  +    S +R+E D L+ V L
Subjt:  QSGEPDGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNL

Query:  INEVDQDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWLL--LENEADI
        +N  ++ + +    ++  + LL    I  I HV R  N  AH +A      N    W+   P+WL+  ++N+  I
Subjt:  INEVDQDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWLL--LENEADI

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]9.8e-2138.13Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQDVTELIYF
        W P    SWKLN +A W      GG+GWI RD +G  + A  R+I  +  I +LE +AIC+GL++I +     + +E D+L+ ++L++   QD TE+I+ 
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQDVTELIYF

Query:  IKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE
        ++E   ++    I S+ H+ R  N++AH LARRA +EN+
Subjt:  IKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]2.7e-1836.18Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILET-FQSTVRMECDALQMVNLINEVDQDVTELIY
        W P     W LN DA+WS++  RGG+GWI R   G  ++AG R +E    ++ LEA AI +GL+++        + +E D+ ++ +L+N   +D+T+  +
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILET-FQSTVRMECDALQMVNLINEVDQDVTELIY

Query:  FIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWL
         ++E  +L     I +   V R  N  AH LA+RA V  ES  W++ FPNWL
Subjt:  FIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]4.1e-1936.73Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVR--------MECDALQMVNLINEVDQ
        W P    SWKLN DA W      GG+GWI RD +G  + A  R+I  +  I +LE +AIC+GL++I +     ++        +E D+L+ ++L++   Q
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVR--------MECDALQMVNLINEVDQ

Query:  DVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE
        D TE+I+ ++E   ++    I S+ H+ R  N++AH LARRA +EN+
Subjt:  DVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE

XP_024957833.1 uncharacterized protein LOC112499264 [Citrus sinensis]3.4e-1327.74Show/hide
Query:  DVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQD
        ++ R+  W+P  EG +K+N +A  ++  R+GG+G + RD+ GR +    + I+    +  +EA AI  G++   +   + + +E D+ ++++L+      
Subjt:  DVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQD

Query:  VTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFP
         TE+++ I++ Q+ +   ++  I H+PR  N MAH +A+ A   ++   W  SFP
Subjt:  VTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFP

TrEMBL top hitse value%identityAlignment
A0A2P6QSB8 Putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-containing protein1.2e-1330.86Show/hide
Query:  QSGEPDGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNL
        Q G    DV     W+P + GS KLNCDA+      + G+GWICR+  GR + A    I  + + +  E L +  GL+  +    S +R+E D L+ V L
Subjt:  QSGEPDGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNL

Query:  INEVDQDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWLL--LENEADI
        +N  ++ + +    ++  + LL    I  I HV R  N  AH +A      N    W+   P+WL+  ++N+  I
Subjt:  INEVDQDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWLL--LENEADI

A0A6J1CP26 uncharacterized protein LOC1110134124.7e-2138.13Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQDVTELIYF
        W P    SWKLN +A W      GG+GWI RD +G  + A  R+I  +  I +LE +AIC+GL++I +     + +E D+L+ ++L++   QD TE+I+ 
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVDQDVTELIYF

Query:  IKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE
        ++E   ++    I S+ H+ R  N++AH LARRA +EN+
Subjt:  IKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE

A0A6J1D4B6 uncharacterized protein LOC1110171813.6e-1335.17Show/hide
Query:  DGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVD
        D ++ R   W P  +  WKLN DATW ++L  GG+GWI RD  GR ++A    ++   Q   LEA     G+K         + ME D L++VN+IN+  
Subjt:  DGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRMECDALQMVNLINEVD

Query:  QDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACV
          +TE+   +++    +    I    H+P   N +AH +ARRACV
Subjt:  QDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACV

A0A6J1DNV9 uncharacterized protein LOC1110224031.3e-1836.18Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILET-FQSTVRMECDALQMVNLINEVDQDVTELIY
        W P     W LN DA+WS++  RGG+GWI R   G  ++AG R +E    ++ LEA AI +GL+++        + +E D+ ++ +L+N   +D+T+  +
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILET-FQSTVRMECDALQMVNLINEVDQDVTELIY

Query:  FIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWL
         ++E  +L     I +   V R  N  AH LA+RA V  ES  W++ FPNWL
Subjt:  FIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWL

A0A6J1DSV1 uncharacterized protein LOC1110236082.0e-1936.73Show/hide
Query:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVR--------MECDALQMVNLINEVDQ
        W P    SWKLN DA W      GG+GWI RD +G  + A  R+I  +  I +LE +AIC+GL++I +     ++        +E D+L+ ++L++   Q
Subjt:  WIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVR--------MECDALQMVNLINEVDQ

Query:  DVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE
        D TE+I+ ++E   ++    I S+ H+ R  N++AH LARRA +EN+
Subjt:  DVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAATATATGGGAGAAGAATCAACTAACCAGGTGTCGGCGACATCTCGACGGCTTCAATCGGGCGAACCTGATGGCGACGTCGTGAGACATCGCGACTGGATCCC
AATTGTCGAGGGGTCATGGAAACTCAACTGTGACGCTACTTGGAGTGAAGCTCTCAGGCGAGGCGGCGTCGGTTGGATTTGCAGAGACCGGAGAGGGAGACCGATGGTGG
CAGGCTACCGCGTGATCGAGCAACAATGGCAGATTCAGTGGCTTGAAGCTTTAGCGATATGTGATGGTTTGAAATCGATTCTAGAGACCTTTCAGTCGACTGTGCGCATG
GAGTGTGATGCGTTGCAAATGGTTAATTTGATTAATGAGGTAGATCAGGATGTAACTGAATTGATCTACTTCATTAAAGAAGCCCAATCTCTCCTAGCTTTGAACAATAT
TGGGTCTATCTTTCATGTTCCTAGAGCTCAAAATGAAATGGCCCACCGTCTAGCCCGTAGGGCATGTGTGGAGAATGAGTCCAAATGCTGGATAAATTCTTTCCCAAATT
GGCTTTTATTAGAAAATGAGGCTGATATTGGGTGTGCTAGTCACACTAGTGGGGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAATATATGGGAGAAGAATCAACTAACCAGGTGTCGGCGACATCTCGACGGCTTCAATCGGGCGAACCTGATGGCGACGTCGTGAGACATCGCGACTGGATCCC
AATTGTCGAGGGGTCATGGAAACTCAACTGTGACGCTACTTGGAGTGAAGCTCTCAGGCGAGGCGGCGTCGGTTGGATTTGCAGAGACCGGAGAGGGAGACCGATGGTGG
CAGGCTACCGCGTGATCGAGCAACAATGGCAGATTCAGTGGCTTGAAGCTTTAGCGATATGTGATGGTTTGAAATCGATTCTAGAGACCTTTCAGTCGACTGTGCGCATG
GAGTGTGATGCGTTGCAAATGGTTAATTTGATTAATGAGGTAGATCAGGATGTAACTGAATTGATCTACTTCATTAAAGAAGCCCAATCTCTCCTAGCTTTGAACAATAT
TGGGTCTATCTTTCATGTTCCTAGAGCTCAAAATGAAATGGCCCACCGTCTAGCCCGTAGGGCATGTGTGGAGAATGAGTCCAAATGCTGGATAAATTCTTTCCCAAATT
GGCTTTTATTAGAAAATGAGGCTGATATTGGGTGTGCTAGTCACACTAGTGGGGGATAG
Protein sequenceShow/hide protein sequence
MEEYMGEESTNQVSATSRRLQSGEPDGDVVRHRDWIPIVEGSWKLNCDATWSEALRRGGVGWICRDRRGRPMVAGYRVIEQQWQIQWLEALAICDGLKSILETFQSTVRM
ECDALQMVNLINEVDQDVTELIYFIKEAQSLLALNNIGSIFHVPRAQNEMAHRLARRACVENESKCWINSFPNWLLLENEADIGCASHTSGG