; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012501 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012501
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr1:41689436..41690200
RNA-Seq ExpressionLag0012501
SyntenyLag0012501
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MBA0756060.1 hypothetical protein [Gossypium gossypioides]6.1e-1229.5Show/hide
Query:  CVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFS
        C   W IW  RN ++ +R++ S       I  YL+E      K ++ +          S GE  I+  D       S    GIV+RD+   +KA ++   
Subjt:  CVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFS

Query:  NANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWLGD
        +  SS   AEAFA LEA  L   + +  +T++ DS  VIN     ++ +S I  ++ DI+  +  F  I F F  R  N  AH++A+  L     ++L D
Subjt:  NANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWLGD

XP_016747102.1 uncharacterized protein LOC107955819 [Gossypium hirsutum]8.0e-1224.65Show/hide
Query:  WLSLADNSMEALERICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVL
        W+    NS +  +  C   W IW+ RN ++ + +  +     + ++ Y++E      +  +    +    N  ++     ++ DA      S    G+++
Subjt:  WLSLADNSMEALERICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVL

Query:  RDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSV
        R + +   A+++   +A SSP  AEA+A L+A+ L   L  + +T++ DS TVI   N     +S+I  ++ DI+  ++ F    F F  R  N  AH +
Subjt:  RDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSV

Query:  ARIGLSTSSHLWLGD
        A+  L T   ++L D
Subjt:  ARIGLSTSSHLWLGD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.8e-1629.1Show/hide
Query:  CVSMDIKDKWLSLADNSMEALE-----RICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNII-----SKGENLIM
        C+S +    +L L  +  E LE        +  W IWNDRN++I  +QV     +CEW+  +L    +A   + SP +T  +   ++     S   +L +
Subjt:  CVSMDIKDKWLSLADNSMEALE-----RICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNII-----SKGENLIM

Query:  HTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLF
        +TDA   G  ++   G ++RD   +L A  +       SPL AE   +LE +  A   N   L V SDSL  I  I  +I         + +I+ +   F
Subjt:  HTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLF

Query:  DCITFHFTSRRFNGFAHSVARIGLSTSS--HLWLGDYPEWMVGL
          I+F  +SR+ N  AH +A+ G+++ S  + WL ++P W++ L
Subjt:  DCITFHFTSRRFNGFAHSVARIGLSTSS--HLWLGDYPEWMVGL

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]6.1e-1227.05Show/hide
Query:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMS-----PSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAF
        W++WNDR+ +I ++++P  +++ EWI  Y  E        M       ++    +G ++       M+TDA V   +   G+G++LR+    +      F
Subjt:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMS-----PSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAF

Query:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLS-TSSHLWL
         +   +PL A+  A+ E + LAT L + ++ V +DSL  +N I +K   +   V+ + DI+   + F  I F    R  N  A+ + R  +S     LW 
Subjt:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLS-TSSHLWL

Query:  GDYPEWM
         D+P W+
Subjt:  GDYPEWM

XP_027096164.1 uncharacterized protein LOC113716063 [Coffea arabica]4.7e-1228.04Show/hide
Query:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVG----NIISKGENLIMHTDATVMGGQSTCGIGIVLRD-KQENLKAVQNAF
        W IW  RN  I       P    +   +  LEF +          T    G    N   + + + ++TDA +       G GI+ R+ K   L+A  N  
Subjt:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVG----NIISKGENLIMHTDATVMGGQSTCGIGIVLRD-KQENLKAVQNAF

Query:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHL-WL
            ++ +E EA A+  A+ +A     K++ V SD  +V+  IN   + E +I T+L D++E++K FD  +F F SR  N  +H++A   +    ++ W 
Subjt:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHL-WL

Query:  GDYPEWMVGLSLKE
         D+P W++ L +KE
Subjt:  GDYPEWMVGLSLKE

TrEMBL top hitse value%identityAlignment
A0A1U8P7A7 uncharacterized protein LOC1079558193.9e-1224.65Show/hide
Query:  WLSLADNSMEALERICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVL
        W+    NS +  +  C   W IW+ RN ++ + +  +     + ++ Y++E      +  +    +    N  ++     ++ DA      S    G+++
Subjt:  WLSLADNSMEALERICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVL

Query:  RDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSV
        R + +   A+++   +A SSP  AEA+A L+A+ L   L  + +T++ DS TVI   N     +S+I  ++ DI+  ++ F    F F  R  N  AH +
Subjt:  RDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSV

Query:  ARIGLSTSSHLWLGD
        A+  L T   ++L D
Subjt:  ARIGLSTSSHLWLGD

A0A6J1DX30 uncharacterized protein LOC1110248748.9e-1729.1Show/hide
Query:  CVSMDIKDKWLSLADNSMEALE-----RICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNII-----SKGENLIM
        C+S +    +L L  +  E LE        +  W IWNDRN++I  +QV     +CEW+  +L    +A   + SP +T  +   ++     S   +L +
Subjt:  CVSMDIKDKWLSLADNSMEALE-----RICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNII-----SKGENLIM

Query:  HTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLF
        +TDA   G  ++   G ++RD   +L A  +       SPL AE   +LE +  A   N   L V SDSL  I  I  +I         + +I+ +   F
Subjt:  HTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLF

Query:  DCITFHFTSRRFNGFAHSVARIGLSTSS--HLWLGDYPEWMVGL
          I+F  +SR+ N  AH +A+ G+++ S  + WL ++P W++ L
Subjt:  DCITFHFTSRRFNGFAHSVARIGLSTSS--HLWLGDYPEWMVGL

A0A6P6UIN1 uncharacterized protein LOC1137113515.0e-1224.79Show/hide
Query:  WLSLADNSMEA--LERICVGE---WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGEN-----LIMHTDATVMGG
        W+++  ++ EA  L+RI +     W +W  RN +  Q +  +  +  +  +   LE+   N  +  P+   E+ G    K E      + +HTDA +   
Subjt:  WLSLADNSMEA--LERICVGE---WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGEN-----LIMHTDATVMGG

Query:  QSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTS
            G+GI+ R+ +  +   +   +     P   EA A+  A+ +A       + V SD   V++ IN   + +  + T+L DI++++K FDC  F F  
Subjt:  QSTCGIGIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTS

Query:  RRFNGFAHSVARIGLSTSSHL-WLGDYPEWMVGLSLKE
        R  N  +H++A+  + +   + W G +P W+  L+ K+
Subjt:  RRFNGFAHSVARIGLSTSSHL-WLGDYPEWMVGLSLKE

A0A6P6UZA6 uncharacterized protein LOC1137160632.3e-1228.04Show/hide
Query:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVG----NIISKGENLIMHTDATVMGGQSTCGIGIVLRD-KQENLKAVQNAF
        W IW  RN  I       P    +   +  LEF +          T    G    N   + + + ++TDA +       G GI+ R+ K   L+A  N  
Subjt:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVG----NIISKGENLIMHTDATVMGGQSTCGIGIVLRD-KQENLKAVQNAF

Query:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHL-WL
            ++ +E EA A+  A+ +A     K++ V SD  +V+  IN   + E +I T+L D++E++K FD  +F F SR  N  +H++A   +    ++ W 
Subjt:  SNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHL-WL

Query:  GDYPEWMVGLSLKE
         D+P W++ L +KE
Subjt:  GDYPEWMVGLSLKE

A0A7J9D5S3 RNase H domain-containing protein3.0e-1229.5Show/hide
Query:  CVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFS
        C   W IW  RN ++ +R++ S       I  YL+E      K ++ +          S GE  I+  D       S    GIV+RD+   +KA ++   
Subjt:  CVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQNAFS

Query:  NANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWLGD
        +  SS   AEAFA LEA  L   + +  +T++ DS  VIN     ++ +S I  ++ DI+  +  F  I F F  R  N  AH++A+  L     ++L D
Subjt:  NANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWLGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.5e-0826.53Show/hide
Query:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQ--NAFSNA
        W IW  RN +I Q    S +          L +  A               +  +   + + + DA      S  G G V +    + K +   +A    
Subjt:  WSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGIGIVLRDKQENLKAVQ--NAFSNA

Query:  NSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWL
          SPL AEA+A+  AM  A  L    L VLSDS ++++++N  + + + I  LL +I+ I+  F  I+F F  R  N  A + A++ L  S ++ L
Subjt:  NSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLSTSSHLWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGGAATTTATGTGTTTCTATGGATATCAAAGACAAGTGGTTAAGCTTGGCAGACAATTCGATGGAAGCATTAGAGAGGATTTGTGTGGGAGAGTGGTCAATTTG
GAACGATAGGAATAATGTGATTCAACAGCGCCAGGTTCCTAGCCCAATGGTTAGATGTGAATGGATTAATGATTACCTGTTAGAATTCTTGAAGGCCAATCCGAAAAGCA
TGTCTCCTTCTCAAACAATGGAGGACGTTGGAAATATAATCTCAAAGGGAGAAAATTTGATTATGCACACAGATGCAACTGTCATGGGAGGGCAAAGTACATGCGGTATT
GGGATTGTGCTGCGTGATAAACAAGAGAATTTAAAGGCGGTGCAAAATGCATTTTCCAATGCGAATTCATCTCCTTTGGAAGCGGAAGCATTTGCAGTGCTTGAAGCAAT
GTGTTTGGCTACATTGTTAAATATAAAGCAGCTGACTGTCTTGTCTGATTCGTTGACTGTAATAAATTCAATAAACGAGAAAATACAAGTGGAGTCTTCTATTGTGACGT
TGTTGTGGGACATTAAAGAAATTCAAAAGCTATTTGACTGTATAACTTTCCATTTTACGAGTCGTAGATTTAATGGCTTTGCTCATAGTGTGGCCCGAATAGGTTTATCA
ACATCATCACATTTGTGGTTAGGAGACTATCCTGAGTGGATGGTAGGATTGTCGCTCAAGGAGCGATGTTTATTTGTATCCCCTGGGGATTCTATTATGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGGAATTTATGTGTTTCTATGGATATCAAAGACAAGTGGTTAAGCTTGGCAGACAATTCGATGGAAGCATTAGAGAGGATTTGTGTGGGAGAGTGGTCAATTTG
GAACGATAGGAATAATGTGATTCAACAGCGCCAGGTTCCTAGCCCAATGGTTAGATGTGAATGGATTAATGATTACCTGTTAGAATTCTTGAAGGCCAATCCGAAAAGCA
TGTCTCCTTCTCAAACAATGGAGGACGTTGGAAATATAATCTCAAAGGGAGAAAATTTGATTATGCACACAGATGCAACTGTCATGGGAGGGCAAAGTACATGCGGTATT
GGGATTGTGCTGCGTGATAAACAAGAGAATTTAAAGGCGGTGCAAAATGCATTTTCCAATGCGAATTCATCTCCTTTGGAAGCGGAAGCATTTGCAGTGCTTGAAGCAAT
GTGTTTGGCTACATTGTTAAATATAAAGCAGCTGACTGTCTTGTCTGATTCGTTGACTGTAATAAATTCAATAAACGAGAAAATACAAGTGGAGTCTTCTATTGTGACGT
TGTTGTGGGACATTAAAGAAATTCAAAAGCTATTTGACTGTATAACTTTCCATTTTACGAGTCGTAGATTTAATGGCTTTGCTCATAGTGTGGCCCGAATAGGTTTATCA
ACATCATCACATTTGTGGTTAGGAGACTATCCTGAGTGGATGGTAGGATTGTCGCTCAAGGAGCGATGTTTATTTGTATCCCCTGGGGATTCTATTATGTGTTGA
Protein sequenceShow/hide protein sequence
MLRNLCVSMDIKDKWLSLADNSMEALERICVGEWSIWNDRNNVIQQRQVPSPMVRCEWINDYLLEFLKANPKSMSPSQTMEDVGNIISKGENLIMHTDATVMGGQSTCGI
GIVLRDKQENLKAVQNAFSNANSSPLEAEAFAVLEAMCLATLLNIKQLTVLSDSLTVINSINEKIQVESSIVTLLWDIKEIQKLFDCITFHFTSRRFNGFAHSVARIGLS
TSSHLWLGDYPEWMVGLSLKERCLFVSPGDSIMC