CuGenDBv2

Lag0033679 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033679
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr3:1037979..1038707
RNA-Seq Expression: Lag0033679
Synteny: Lag0033679
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
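
The genome location above uses a `chromosome:start..end` notation. The following is a minimal sketch of how such a string could be split and the span computed. It assumes the coordinates are 1-based and inclusive, which is a common convention for gene records but is not stated on this page. The function name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a string such as 'chr3:1037979..1038707' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:1037979..1038707")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr3 1037979 1038707 729
```

Under that assumption the span is 729 bp, that is 243 codons, which is consistent with the 242-residue protein listed under Sequences plus one stop codon.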


Homology
GenBank top hits (e value, % identity, alignment)
SPT16885.1 unnamed protein product [Triticum aestivum] | e value: 9.9e-20 | identity: 30.84%
Query:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE
        W +W +RNA R G  +   P++         EF++M     T  +  R + + S   W  P  G + +NVDA+  S+  + G G+V+R   GR+  A   
Subjt:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE

Query:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAMGSFLWS
         F  V  P VAEA+A+   L L    G+ ++ V SDCL LI+ + G G      G +V +I   A+ F      HV R  N  AH LA  A       W 
Subjt:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAMGSFLWS

Query:  SNFPQWVTKLVLQD
        + FP  +  ++  +
Subjt:  SNFPQWVTKLVLQD

TXG47194.1 hypothetical protein EZV62_026488 [Acer yangbiense] | e value: 7.6e-20 | identity: 33.5%
Query:  WIENYLAEF----LSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGL
        W  ++L EF      +Q R  ++R VP         +W  PP G  K+N DA+ + + +  GIG+V+R   G++  +  + F   +SP +AEALA+L GL
Subjt:  WIENYLAEF----LSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGL

Query:  RLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACA-MGSFLWSSNFPQWVTKLVLQDFPSAL
        RL +  G     +ESD L ++  +  +     + G ++++IL     F    FRHV R  N  AH LA  A + +G F+WS + P  V  LV+ DFP  +
Subjt:  RLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACA-MGSFLWSSNFPQWVTKLVLQDFPSAL

XP_023903785.1 uncharacterized protein LOC112015598 [Quercus suber] | e value: 5.8e-20 | identity: 30.4%
Query:  SDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLR
        +  D  L+    W +W  RN  R  +         +  + Y+AEF  +  +       PR +   S  RW  P  G VK+N D + S + +K+GIG+V+R
Subjt:  SDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLR

Query:  TASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLA
          +G + A+ TE    +Y+    EA+     L     +G     +E+D L L + L  +      DG L+E+I   AS F Q+++ HV RE N  AH LA
Subjt:  TASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLA

Query:  SHACAMGSFL-WSSNFPQWVTKLVLQD
         HA  +  FL W  + P  +  +V  D
Subjt:  SHACAMGSFL-WSSNFPQWVTKLVLQD

XP_042942894.1 uncharacterized protein LOC122277073 [Carya illinoinensis] | e value: 1.3e-19 | identity: 34.15%
Query:  SDHDFGLWCVGCWAMWNDRNAAR--GGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIV
        ++ D  L+ V CW MW  RN  +  G R+ P+     +     ++   S  +    N  V R        RW+ PP   +KLNVD +   ++ K GIG++
Subjt:  SDHDFGLWCVGCWAMWNDRNAAR--GGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIV

Query:  LRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHC
        LR ++G +  A T   + V  P   E LAIL G+++   LG+ ++ VESDCL L++ L  S      +  LV E+  L S F +V+F HV R  N  AH 
Subjt:  LRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHC

Query:  LASHA
        LA +A
Subjt:  LASHA

XP_042974894.1 uncharacterized protein LOC122306534 [Carya illinoinensis] | e value: 9.0e-21 | identity: 29.96%
Query:  SDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLR
        ++ D   +    W +W  RN               S   +  +EF  +Q   + NR +P+       L W+ PP G +K+N+D +    F+K G+G+VLR
Subjt:  SDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLR

Query:  TASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLA
          +GR+  A ++    V +P + EA A+L GL+     G ++V +E+DCL L+  LN    +  E   ++ EI  L   F+++ F HV R+ N+ AHCLA
Subjt:  TASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLA

Query:  SHACAMGSFL-WSSNFPQWVTKLVLQD
         +A  +   + W    P +V++ V  D
Subjt:  SHACAMGSFL-WSSNFPQWVTKLVLQD

TrEMBL top hits (e value, % identity, alignment)
A0A5C7GQX3 RNase H domain-containing protein | e value: 3.7e-20 | identity: 33.5%
Query:  WIENYLAEF----LSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGL
        W  ++L EF      +Q R  ++R VP         +W  PP G  K+N DA+ + + +  GIG+V+R   G++  +  + F   +SP +AEALA+L GL
Subjt:  WIENYLAEF----LSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGL

Query:  RLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACA-MGSFLWSSNFPQWVTKLVLQDFPSAL
        RL +  G     +ESD L ++  +  +     + G ++++IL     F    FRHV R  N  AH LA  A + +G F+WS + P  V  LV+ DFP  +
Subjt:  RLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACA-MGSFLWSSNFPQWVTKLVLQDFPSAL

A0A5N5GJP1 RNase H domain-containing protein | e value: 1.1e-19 | identity: 32.26%
Query:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGL--RWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQ
        WA+W  +NA   G +    PI         A+FL +            +Q V  G+  RW CPP+G +KLNVD +   +    G+GI++R A G    A 
Subjt:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGL--RWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQ

Query:  TEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHAC-AMGSF
        T VF  ++SP   EALA+ VGL LV   GL ++ +ESD LH++  +  S ++    G ++E+   L +L  +    H+  + N  AH LA  A  + G++
Subjt:  TEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHAC-AMGSF

Query:  LWSSNFPQWVTKLVLQD
         W +  P  ++ L++ D
Subjt:  LWSSNFPQWVTKLVLQD

A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 6.3e-20 | identity: 31.78%
Query:  LSDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQR----VVSGLRWECPPLGCVKLNVDASCSSKFHKTGI
        L   D  L  +  W +WNDRN+   G+ +  +  K  W    L  FL    + Q +   PR Q     VV    W       +KLN DA+C      T  
Subjt:  LSDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQR----VVSGLRWECPPLGCVKLNVDASCSSKFHKTGI

Query:  GIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHP
        G ++R +S  L AA +       SP +AE   IL GL+         +EVESD L  I ++     +  ++   V EI  L   F  + F H  R+ N  
Subjt:  GIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHP

Query:  AHCLASHACAMGS--FLWSSNFPQWVTKLVLQDFPS
        AH LA       S  + W  NFP W+  LV +DFPS
Subjt:  AHCLASHACAMGS--FLWSSNFPQWVTKLVLQDFPS

A0A7H4LE47 Genome assembly, chromosome: II | e value: 4.8e-20 | identity: 30.84%
Query:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE
        W +W +RNA R G  +   P++         EF++M     T  +  R + + S   W  P  G + +NVDA+  S+  + G G+V+R   GR+  A   
Subjt:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE

Query:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAMGSFLWS
         F  V  P VAEA+A+   L L    G+ ++ V SDCL LI+ + G G      G +V +I   A+ F      HV R  N  AH LA  A       W 
Subjt:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAMGSFLWS

Query:  SNFPQWVTKLVLQD
        + FP  +  ++  +
Subjt:  SNFPQWVTKLVLQD

A0A803NGI9 Uncharacterized protein | e value: 4.8e-20 | identity: 29.77%
Query:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE
        W +WNDRN    G+  P +   + W ++ +A F + Q +  +++ +  +    +   W  PPL  +K+NVDA+C    +K G+GI++R +SG++ AA ++
Subjt:  WAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTE

Query:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAM-GSFLW
          +    P   EA A+L+G+       L+    ESD L L++ +N    +    G LV +I    S    V   HV R+ N  AH LA HA  +    +W
Subjt:  VFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAM-GSFLW

Query:  SSNFPQWVTKLVLQD
            P  +  +V+ D
Subjt:  SSNFPQWVTKLVLQD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 5.4e-08 | identity: 25%
Query:  RWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGT
        +WE PP+G +K N D S + +  +T  G ++R   G    A   V   + +   +E  A+++ ++   + G  +V  E D   +  +LN   + +     
Subjt:  RWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGT

Query:  LVEEILVLASLFKQVVFRHVLRERNHPAHCLA-SHACAMGSFLWSSNFPQWVTKLVLQDFPSALCSHY
         + E    +  F++V+F    R  N PA  LA SH     SF++    P ++T        +  C+HY
Subjt:  LVEEILVLASLFKQVVFRHVLRERNHPAHCLA-SHACAMGSFLWSSNFPQWVTKLVLQDFPSALCSHY

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 6.2e-04 | identity: 27.34%
Query:  PPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEE
        P L  V +  DA+   +    G G V+R                V  P +AEA+A+ + L+  +++G+ ++ + SD   LI+ +     S    G ++ +
Subjt:  PPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHAAQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEE

Query:  ILVLASLFKQVVFRHVLRERNHPAHCLA
        IL L+  F  V F  V R  N  A  LA
Subjt:  ILVLASLFKQVVFRHVLRERNHPAHCLA


Sequences
CDS sequence
ATGCTTTCTGACCATGACTTCGGTTTATGGTGTGTTGGTTGTTGGGCGATGTGGAATGATCGTAATGCAGCTCGAGGGGGCCGAATGATTCCGGATATTCCGATTAAACG
TTCATGGATTGAGAATTATCTGGCTGAGTTTCTTTCGATGCAGGATCGTGGTCAAACTAATCGGGTTGTTCCGAGAGCTCAGCGTGTGGTTTCTGGTTTACGATGGGAAT
GTCCTCCGTTGGGATGTGTTAAATTAAATGTGGATGCTTCTTGTTCTTCCAAGTTTCATAAGACAGGTATTGGAATTGTTCTTCGTACTGCGTCTGGTCGTCTCCATGCG
GCTCAAACGGAGGTTTTTTCTGTTGTTTATAGTCCCCCAGTTGCTGAAGCTTTGGCTATTCTTGTCGGTCTCCGACTGGTTAGGACGCTGGGTCTGGCTCGTGTTGAAGT
GGAGTCTGATTGTCTTCATCTTATTTCCATGTTGAATGGTTCTGGAATTTCTTACCATGAAGACGGCACTCTGGTAGAAGAGATTTTGGTGCTAGCTTCTTTGTTTAAGC
AGGTGGTGTTTCGTCATGTTCTTCGAGAAAGAAACCATCCAGCTCATTGTTTGGCATCTCATGCTTGTGCTATGGGTTCTTTTTTGTGGAGTTCTAATTTCCCTCAATGG
GTGACAAAACTTGTGTTGCAGGATTTCCCTTCTGCTTTGTGTAGCCACTACTACAAAAACGGACTTTAA
mRNA sequence
ATGCTTTCTGACCATGACTTCGGTTTATGGTGTGTTGGTTGTTGGGCGATGTGGAATGATCGTAATGCAGCTCGAGGGGGCCGAATGATTCCGGATATTCCGATTAAACG
TTCATGGATTGAGAATTATCTGGCTGAGTTTCTTTCGATGCAGGATCGTGGTCAAACTAATCGGGTTGTTCCGAGAGCTCAGCGTGTGGTTTCTGGTTTACGATGGGAAT
GTCCTCCGTTGGGATGTGTTAAATTAAATGTGGATGCTTCTTGTTCTTCCAAGTTTCATAAGACAGGTATTGGAATTGTTCTTCGTACTGCGTCTGGTCGTCTCCATGCG
GCTCAAACGGAGGTTTTTTCTGTTGTTTATAGTCCCCCAGTTGCTGAAGCTTTGGCTATTCTTGTCGGTCTCCGACTGGTTAGGACGCTGGGTCTGGCTCGTGTTGAAGT
GGAGTCTGATTGTCTTCATCTTATTTCCATGTTGAATGGTTCTGGAATTTCTTACCATGAAGACGGCACTCTGGTAGAAGAGATTTTGGTGCTAGCTTCTTTGTTTAAGC
AGGTGGTGTTTCGTCATGTTCTTCGAGAAAGAAACCATCCAGCTCATTGTTTGGCATCTCATGCTTGTGCTATGGGTTCTTTTTTGTGGAGTTCTAATTTCCCTCAATGG
GTGACAAAACTTGTGTTGCAGGATTTCCCTTCTGCTTTGTGTAGCCACTACTACAAAAACGGACTTTAA
Protein sequence
MLSDHDFGLWCVGCWAMWNDRNAARGGRMIPDIPIKRSWIENYLAEFLSMQDRGQTNRVVPRAQRVVSGLRWECPPLGCVKLNVDASCSSKFHKTGIGIVLRTASGRLHA
AQTEVFSVVYSPPVAEALAILVGLRLVRTLGLARVEVESDCLHLISMLNGSGISYHEDGTLVEEILVLASLFKQVVFRHVLRERNHPAHCLASHACAMGSFLWSSNFPQW
VTKLVLQDFPSALCSHYYKNGL
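
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The sketch below assumes the standard genetic code (a reasonable assumption for a nuclear plant gene, but not stated on this page) and translates only the first ten codons and the last five codons of the listed CDS. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumption: standard code applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGCTTTCTGACCATGACTTCGGTTTATGG"  # first 30 nt of the CDS above
last_codons = "AAAAACGGACTTTAA"                  # last 15 nt of the CDS above
print(translate(first_codons))  # MLSDHDFGLW
print(translate(last_codons))   # KNGL
```

Both results agree with the start (MLSDHDFGLW) and end (KNGL) of the protein sequence listed above.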