; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006385 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006385
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr6:41943813..41944553
RNA-Seq ExpressionLag0006385
SyntenyLag0006385
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]5.4e-1332.74Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGN----------QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKG
        +E+   ++ I   IW  RN        +D   + RSI   +  N          + S+Q  G + +      + L   WS   +NCWK+NTDASWSEE+ 
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGN----------QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKG

Query:  IGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAIL
        +GG+GWI+ D  G  +  G  K+++K  I  LEL  I  GL+ +  +  +P   I++ESD VE I ++
Subjt:  IGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAIL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.8e-1429.91Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEA----SLGGN-QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG
        EE++ +++ I   IW  RN      V  +T  I  +I+     S G N  +  +   K      R E      W P  SN WK+NT+A+W  +   GG+G
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEA----SLGGN-QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG

Query:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTN
        WI+RD  G  I    + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+   +  +E   +  ++   ++ +  V      R  N
Subjt:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTN

Query:  TEAHLLARRAWAHE
          AH LARRA  ++
Subjt:  TEAHLLARRAWAHE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]5.9e-1232.1Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI
        EE++ +++ I   IW  RN      V  +T  I  +I+  +  +   +  L + SK      R        W P  SN WK+NTDA+W  +    G+GWI
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI

Query:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN
        +RD  G  I  G + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+
Subjt:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]5.9e-1232.1Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI
        EE++ +++ I   IW  RN      V  +T  I  +I+  +  +   +  L + SK      R        W P  SN WK+NTDA+W  +    G+GWI
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI

Query:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN
        +RD  G  I  G + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+
Subjt:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.4e-1329.68Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTN----TIIRSIEASLG-GNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG
        EE++ +++ I   IW  RN      V  +T      I R I  S G    +  +   K      R        W P  SN WK+NTDA+W  +   GG+G
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTN----TIIRSIEASLG-GNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG

Query:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPK-----IFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFS
        WI+RD  G  I    + ++ + +I  LE+ AI EGL+ +  +   P+ +     I +ESD +EAI +L+   +  +E   +  ++   +E +  V     
Subjt:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPK-----IFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFS

Query:  RRVTNTEAHLLARRAWAHE
         R  N  AH LARRA  ++
Subjt:  RRVTNTEAHLLARRAWAHE

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134121.4e-1429.91Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEA----SLGGN-QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG
        EE++ +++ I   IW  RN      V  +T  I  +I+     S G N  +  +   K      R E      W P  SN WK+NT+A+W  +   GG+G
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEA----SLGGN-QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG

Query:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTN
        WI+RD  G  I    + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+   +  +E   +  ++   ++ +  V      R  N
Subjt:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTN

Query:  TEAHLLARRAWAHE
          AH LARRA  ++
Subjt:  TEAHLLARRAWAHE

A0A6J1CQG0 uncharacterized protein LOC1110132162.6e-1332.74Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGN----------QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKG
        +E+   ++ I   IW  RN        +D   + RSI   +  N          + S+Q  G + +      + L   WS   +NCWK+NTDASWSEE+ 
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGN----------QISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKG

Query:  IGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAIL
        +GG+GWI+ D  G  +  G  K+++K  I  LEL  I  GL+ +  +  +P   I++ESD VE I ++
Subjt:  IGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAIL

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X12.9e-1232.1Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI
        EE++ +++ I   IW  RN      V  +T  I  +I+  +  +   +  L + SK      R        W P  SN WK+NTDA+W  +    G+GWI
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI

Query:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN
        +RD  G  I  G + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+
Subjt:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X22.9e-1232.1Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI
        EE++ +++ I   IW  RN      V  +T  I  +I+  +  +   +  L + SK      R        W P  SN WK+NTDA+W  +    G+GWI
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSK---TPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWI

Query:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN
        +RD  G  I  G + ++ + +I  LE+ AI EGL+ +  +   P   I +ESD +EAI +L+
Subjt:  VRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILN

A0A6J1DSV1 uncharacterized protein LOC1110236086.8e-1429.68Show/hide
Query:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTN----TIIRSIEASLG-GNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG
        EE++ +++ I   IW  RN      V  +T      I R I  S G    +  +   K      R        W P  SN WK+NTDA+W  +   GG+G
Subjt:  EEDQGKAISIISSIWNHRNLVKQNAVKVDTN----TIIRSIEASLG-GNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLG

Query:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPK-----IFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFS
        WI+RD  G  I    + ++ + +I  LE+ AI EGL+ +  +   P+ +     I +ESD +EAI +L+   +  +E   +  ++   +E +  V     
Subjt:  WIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPK-----IFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFS

Query:  RRVTNTEAHLLARRAWAHE
         R  N  AH LARRA  ++
Subjt:  RRVTNTEAHLLARRAWAHE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein4.8e-0425Show/hide
Query:  LGKVSKTPSRFESPLNQSWSPSHSNC-WKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVES
        +GK S+     E+ ++ S  PS   C  K N DAS  E   + GLGW++R+S G+ +  G  K + + +    E  A+   +  +         K+  E 
Subjt:  LGKVSKTPSRFESPLNQSWSPSHSNC-WKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVES

Query:  DFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA
        D      ++N  S+     K   + + + +    + EF+F+ R  N  A  L ++A
Subjt:  DFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.7e-0722.92Show/hide
Query:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKM
        +W  RN +     + D   ++R             +  GK S    + E  L+  W        K NTDA+W  E    G+GWI+R+ +G  + +G + +
Subjt:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKM

Query:  KKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA
         +  +++  EL+A+      +L        +I  ESD  +A+  L ++ + +   +    D+   +     V+F F+ R  N  A  +AR +
Subjt:  KKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)3.7e-0430.43Show/hide
Query:  SKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLE
        ++ P+   S  ++ WSP      K N D+ + + +      WI+RDSNG  IH G  K+++ +S +  E
Subjt:  SKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKMKKKWSIICLE

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-0925.51Show/hide
Query:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQS----WSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVG
        +W +RN +     + +   ++R  E  L      E++  +        +  +N+S    W P      K NTDA+W+ +    G+GW++R+  G    +G
Subjt:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQS----WSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVG

Query:  FKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA
         + + K  S++  EL+A+   +  L     N    +  ESD    I ILN+  E++   K    D+   +     V+FVF  R  NT A  +AR +
Subjt:  FKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.4e-0825.52Show/hide
Query:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKM
        ++NH     Q  V++  N      +  L     +EQ  G  +  PSR     N  WSP   +  K N DAS  E   + GLGWI+R+S G+ I  G  K 
Subjt:  IWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDSNGSFIHVGFKKM

Query:  KKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA
        + + +    E   +   +  +    G    K+  E D  + I  + +T       +   + + + +    ++EF F  R  N  A  LA++A
Subjt:  KKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACCTTTCCAAAAACTTCAAAGAAGAAGATCAAGGGAAAGCTATTAGCATCATATCTAGCATTTGGAATCATAGGAATCTGGTGAAACAAAACGCAGTCAAAGT
AGATACCAACACGATTATTAGATCAATTGAAGCAAGTTTGGGTGGAAATCAAATTTCAGAGCAGTACCTGGGCAAAGTTAGCAAAACCCCTTCGAGATTTGAGAGCCCGT
TGAATCAGAGCTGGTCTCCCTCTCATTCGAATTGCTGGAAAATTAACACAGACGCTTCATGGTCTGAAGAAAAAGGAATTGGGGGTTTAGGATGGATTGTTCGTGACTCA
AATGGATCTTTTATCCATGTGGGGTTCAAGAAAATGAAGAAGAAATGGTCGATTATCTGCCTGGAATTGAAAGCCATTGAAGAAGGCCTGAAATGCTTACTTGACAAAAT
GGGTAATCCCCTTCCTAAAATCTTTGTGGAATCTGACTTCGTAGAGGCGATCGCCATTCTCAATCACACTTCCGAAGTCTTCTCAGAATGCAAAATGGTGGCGAACGATG
TTGTAGCCGCGGTGGAGATCTTGGGCAATGTTGAGTTCGTATTCAGCCGGCGGGTTACCAACACTGAAGCGCATCTTCTGGCGCGAAGGGCCTGGGCTCACGAGTGTGTG
GGTGGGTCTGATACAGGGTTTTTGGCCTCTTCCAATTCGGAAGAGGATATTTTTTGTATGGCAATTATGCCCCCTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACCTTTCCAAAAACTTCAAAGAAGAAGATCAAGGGAAAGCTATTAGCATCATATCTAGCATTTGGAATCATAGGAATCTGGTGAAACAAAACGCAGTCAAAGT
AGATACCAACACGATTATTAGATCAATTGAAGCAAGTTTGGGTGGAAATCAAATTTCAGAGCAGTACCTGGGCAAAGTTAGCAAAACCCCTTCGAGATTTGAGAGCCCGT
TGAATCAGAGCTGGTCTCCCTCTCATTCGAATTGCTGGAAAATTAACACAGACGCTTCATGGTCTGAAGAAAAAGGAATTGGGGGTTTAGGATGGATTGTTCGTGACTCA
AATGGATCTTTTATCCATGTGGGGTTCAAGAAAATGAAGAAGAAATGGTCGATTATCTGCCTGGAATTGAAAGCCATTGAAGAAGGCCTGAAATGCTTACTTGACAAAAT
GGGTAATCCCCTTCCTAAAATCTTTGTGGAATCTGACTTCGTAGAGGCGATCGCCATTCTCAATCACACTTCCGAAGTCTTCTCAGAATGCAAAATGGTGGCGAACGATG
TTGTAGCCGCGGTGGAGATCTTGGGCAATGTTGAGTTCGTATTCAGCCGGCGGGTTACCAACACTGAAGCGCATCTTCTGGCGCGAAGGGCCTGGGCTCACGAGTGTGTG
GGTGGGTCTGATACAGGGTTTTTGGCCTCTTCCAATTCGGAAGAGGATATTTTTTGTATGGCAATTATGCCCCCTGGTTAA
Protein sequenceShow/hide protein sequence
MGDLSKNFKEEDQGKAISIISSIWNHRNLVKQNAVKVDTNTIIRSIEASLGGNQISEQYLGKVSKTPSRFESPLNQSWSPSHSNCWKINTDASWSEEKGIGGLGWIVRDS
NGSFIHVGFKKMKKKWSIICLELKAIEEGLKCLLDKMGNPLPKIFVESDFVEAIAILNHTSEVFSECKMVANDVVAAVEILGNVEFVFSRRVTNTEAHLLARRAWAHECV
GGSDTGFLASSNSEEDIFCMAIMPPG