CuGenDBv2

Lag0035129 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035129
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr3:15400977..15401900
RNA-Seq Expression: Lag0035129
Synteny: Lag0035129
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
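
The genome location above is written as chromosome:start..end. The following is a minimal sketch of how such a string could be parsed to get the length of the gene span. The helper names (Location, parse_location) are hypothetical, and the code assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr3:15400977..15401900")
print(loc.chrom, loc.length)  # chr3 924
```

Under the inclusive-coordinate assumption the span is 924 bp, which can be compared against the length of the CDS listed under Sequences.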


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 6.7e-21; % identity: 29.05)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL
        E   R  +++ WQIW  RN  +     P+   I+  I  Y++   +    NL   S   D    R +E+ +  +W P +   W+LN   +W       G+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL

Query:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVM
        GW+LR   G+++ A  R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++ + +D+TE++  + E   ++    I S+ H+ R  N +
Subjt:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVM

Query:  AHNLARKATE
        AH LAR+A E
Subjt:  AHNLARKATE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 4.9e-16; % identity: 29.9)
Query:  SNSWYDDNMNAWETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEER
        +N +Y D  N W T+EY   W WL DK    EE   R  ++ C QIW  RN  +      +   I+  I  Y++   +  + NL   S+     R + + 
Subjt:  SNSWYDDNMNAWETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEER

Query:  SRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETE
        +R RW P +   W+LN + +W      DG+GW+LR   G+++  G R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++   + + +
Subjt:  SRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETE

Query:  LLNF
        L  F
Subjt:  LLNF

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia] (e value: 4.2e-15; % identity: 29.53)
Query:  WETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEERSRNRWLPISDG
        W T+EY   W WL DK    EE   R  ++ C QIW  RN  +      +   I+  I  Y++   +  + NL   S+     R + + +R RW P +  
Subjt:  WETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEERSRNRWLPISDG

Query:  KWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNF
         W+LN + +W      DG+GW+LR   G+++  G R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++   + + +L  F
Subjt:  KWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNF

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 2.8e-19; % identity: 29.2)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWV
        +  L + L+  W IW  RN ++          + QQ++ ++ E     E +L +  +T++N       + +W P     W LN + SW +     G+GW+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWV

Query:  LRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL--VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAH
        +R  DGD++ AG RF+    ++  LE  A+ EGL+ +    +L  + +E DS +V   +N K ED T+    V E   L     I +   V R  N  AH
Subjt:  LRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL--VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAH

Query:  NLARKATECRGSF---NCFGNFPSCI
        +LA++A+  R S    +CF N+ S +
Subjt:  NLARKATECRGSF---NCFGNFPSCI

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 4.8e-19; % identity: 27.52)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL
        E   R  +++ WQIW  RN  +      +   I+  I  Y++   +  + NL   S   D    R + + +  RW P +   W+LN + +W       G+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL

Query:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL---------VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVH
        GW+LR   G+++ A  R I    +I +LE +A+ EGL+ I  +            + +E DSL+    ++ + +D+TE++  + E   ++    I S+ H
Subjt:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL---------VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVH

Query:  VPRVNNVMAHNLARKATE
        + R  N +AH+LAR+A E
Subjt:  VPRVNNVMAHNLARKATE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 3.2e-21; % identity: 29.05)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL
        E   R  +++ WQIW  RN  +     P+   I+  I  Y++   +    NL   S   D    R +E+ +  +W P +   W+LN   +W       G+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL

Query:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVM
        GW+LR   G+++ A  R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++ + +D+TE++  + E   ++    I S+ H+ R  N +
Subjt:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVM

Query:  AHNLARKATE
        AH LAR+A E
Subjt:  AHNLARKATE

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 2.4e-16; % identity: 29.9)
Query:  SNSWYDDNMNAWETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEER
        +N +Y D  N W T+EY   W WL DK    EE   R  ++ C QIW  RN  +      +   I+  I  Y++   +  + NL   S+     R + + 
Subjt:  SNSWYDDNMNAWETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEER

Query:  SRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETE
        +R RW P +   W+LN + +W      DG+GW+LR   G+++  G R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++   + + +
Subjt:  SRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETE

Query:  LLNF
        L  F
Subjt:  LLNF

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 1.4e-19; % identity: 29.2)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWV
        +  L + L+  W IW  RN ++          + QQ++ ++ E     E +L +  +T++N       + +W P     W LN + SW +     G+GW+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWV

Query:  LRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL--VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAH
        +R  DGD++ AG RF+    ++  LE  A+ EGL+ +    +L  + +E DS +V   +N K ED T+    V E   L     I +   V R  N  AH
Subjt:  LRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL--VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAH

Query:  NLARKATECRGSF---NCFGNFPSCI
        +LA++A+  R S    +CF N+ S +
Subjt:  NLARKATECRGSF---NCFGNFPSCI

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X2 (e value: 2.0e-15; % identity: 29.53)
Query:  WETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEERSRNRWLPISDG
        W T+EY   W WL DK    EE   R  ++ C QIW  RN  +      +   I+  I  Y++   +  + NL   S+     R + + +R RW P +  
Subjt:  WETQEYCGRW-WLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDN-RGVEERSRNRWLPISDG

Query:  KWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNF
         W+LN + +W      DG+GW+LR   G+++  G R I    +I +LE +A+ EGL+ I  +    + +E DSL+    ++   + + +L  F
Subjt:  KWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL-VRVEIDSLQVARFINDKDEDETELLNF

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 2.3e-19; % identity: 27.52)
Query:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL
        E   R  +++ WQIW  RN  +      +   I+  I  Y++   +  + NL   S   D    R + + +  RW P +   W+LN + +W       G+
Subjt:  EHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTID---NRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGL

Query:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL---------VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVH
        GW+LR   G+++ A  R I    +I +LE +A+ EGL+ I  +            + +E DSL+    ++ + +D+TE++  + E   ++    I S+ H
Subjt:  GWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLL---------VRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVH

Query:  VPRVNNVMAHNLARKATE
        + R  N +AH+LAR+A E
Subjt:  VPRVNNVMAHNLARKATE

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.9e-06; % identity: 22.38)
Query:  LCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLL
        L W+IW + N L+ N+ +   +   +   N   E       N + + +   NR  +     +W P    K + N + S  E+    GLGW+LR   G ++
Subjt:  LCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERSRNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLL

Query:  AAGYRFINRPWHINWLETLALSEGLQ-TIPTDSLLVRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAHNLARKATECR
          G             E   L   +Q +       V  E D+  + R IN K  +   L +F+   ++ I            R  N  A  LA++A +  
Subjt:  AAGYRFINRPWHINWLETLALSEGLQ-TIPTDSLLVRVEIDSLQVARFINDKDEDETELLNFVMEAKALISGKNIYSLVHVPRVNNVMAHNLARKATECR

Query:  GSFNCFGNFP
          ++ F + P
Subjt:  GSFNCFGNFP


Sequences
CDS sequence
ATGCCTTCAAGGATCCTCAAATTTCTTCCTTGTTCTAATTCATGGTATGATGATAACATGAATGCTTGGGAGACTCAAGAATACTGTGGCAGATGGTGGCTGGATGATAA
GGGCAACATCAGAGAAGAGCATAGCTTAAGGATTTGCCTAGTTCTATGTTGGCAAATATGGACGGCTAGAAACATATTACTCCACAACAACTACCAGCCAGATGTGGAGC
ACATAGAGCAGCAGATTTCAAACTACTTAATGGAGATGAAGTCAAGTGGAGAGGCGAACCTGATAATCCATAGCCGAACGATCGATAACAGAGGCGTAGAGGAGAGATCG
AGGAATCGGTGGCTGCCGATCTCTGATGGAAAGTGGCGGCTCAACATTGAAACATCTTGGATCGAAAAGGAGGGGTGTGACGGTCTTGGCTGGGTTCTCAGAATAGGGGA
TGGAGATCTTCTGGCGGCAGGGTACCGGTTCATTAATCGGCCATGGCACATCAACTGGCTGGAAACCCTTGCCCTGTCGGAAGGCCTTCAAACCATTCCCACTGATTCGC
TTTTGGTGCGGGTAGAAATCGACTCCCTCCAGGTGGCTCGCTTCATCAACGACAAGGATGAAGATGAGACTGAGTTGCTGAACTTTGTTATGGAAGCTAAAGCCCTAATT
TCGGGCAAGAATATTTACTCTTTGGTTCATGTGCCTAGAGTTAACAATGTCATGGCCCACAATCTGGCGAGGAAGGCTACTGAATGTAGGGGTTCTTTTAATTGTTTTGG
AAATTTTCCTTCTTGCATCCTTTCTCTTAATGAGGAAGACATTGGTTTTGTATCTCATGCACATGGGGTCCCTGTCCCATGGGCATTCACTTTTCAGGAGCTATTGCTCG
AGTCTAATGCTTATTTCTTATTCTCAAAAAAAATAAAAAAATAA
mRNA sequence
ATGCCTTCAAGGATCCTCAAATTTCTTCCTTGTTCTAATTCATGGTATGATGATAACATGAATGCTTGGGAGACTCAAGAATACTGTGGCAGATGGTGGCTGGATGATAA
GGGCAACATCAGAGAAGAGCATAGCTTAAGGATTTGCCTAGTTCTATGTTGGCAAATATGGACGGCTAGAAACATATTACTCCACAACAACTACCAGCCAGATGTGGAGC
ACATAGAGCAGCAGATTTCAAACTACTTAATGGAGATGAAGTCAAGTGGAGAGGCGAACCTGATAATCCATAGCCGAACGATCGATAACAGAGGCGTAGAGGAGAGATCG
AGGAATCGGTGGCTGCCGATCTCTGATGGAAAGTGGCGGCTCAACATTGAAACATCTTGGATCGAAAAGGAGGGGTGTGACGGTCTTGGCTGGGTTCTCAGAATAGGGGA
TGGAGATCTTCTGGCGGCAGGGTACCGGTTCATTAATCGGCCATGGCACATCAACTGGCTGGAAACCCTTGCCCTGTCGGAAGGCCTTCAAACCATTCCCACTGATTCGC
TTTTGGTGCGGGTAGAAATCGACTCCCTCCAGGTGGCTCGCTTCATCAACGACAAGGATGAAGATGAGACTGAGTTGCTGAACTTTGTTATGGAAGCTAAAGCCCTAATT
TCGGGCAAGAATATTTACTCTTTGGTTCATGTGCCTAGAGTTAACAATGTCATGGCCCACAATCTGGCGAGGAAGGCTACTGAATGTAGGGGTTCTTTTAATTGTTTTGG
AAATTTTCCTTCTTGCATCCTTTCTCTTAATGAGGAAGACATTGGTTTTGTATCTCATGCACATGGGGTCCCTGTCCCATGGGCATTCACTTTTCAGGAGCTATTGCTCG
AGTCTAATGCTTATTTCTTATTCTCAAAAAAAATAAAAAAATAA
Protein sequence
MPSRILKFLPCSNSWYDDNMNAWETQEYCGRWWLDDKGNIREEHSLRICLVLCWQIWTARNILLHNNYQPDVEHIEQQISNYLMEMKSSGEANLIIHSRTIDNRGVEERS
RNRWLPISDGKWRLNIETSWIEKEGCDGLGWVLRIGDGDLLAAGYRFINRPWHINWLETLALSEGLQTIPTDSLLVRVEIDSLQVARFINDKDEDETELLNFVMEAKALI
SGKNIYSLVHVPRVNNVMAHNLARKATECRGSFNCFGNFPSCILSLNEEDIGFVSHAHGVPVPWAFTFQELLLESNAYFLFSKKIKK
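
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (an assumption, since the page does not name a translation table); translate and CODON_TABLE are illustrative names, not part of any CuGenDB tool. It is applied only to the first 21 and last 24 nucleotides of the CDS shown above.

```python
# Standard genetic code in the conventional T, C, A, G ordering
# (assumed here; the page does not state which table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map every codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last eight codons of the CDS listed above.
print(translate("ATGCCTTCAAGGATCCTCAAA"))     # MPSRILK
print(translate("TTCTCAAAAAAAATAAAAAAATAA"))  # FSKKIKK*
```

The two fragments reproduce the start (MPSRILK) and the end (FSKKIKK, followed by the stop codon) of the listed protein sequence.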