CuGenDBv2

Lag0035519 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035519
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr3:23237674..23241599
RNA-Seq Expression: Lag0035519
Synteny: Lag0035519
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
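The genome location field uses a `seqid:start..end` layout. A minimal sketch for parsing it and computing the locus span follows. The names `Location` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not confirm.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval in seqid:start..end form (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates; this is not stated in the record.
        return self.end - self.start + 1


# Matches strings such as "chr3:23237674..23241599".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("chr3:23237674..23241599")
    print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on the chromosome, which may be longer than the CDS if the model contains introns.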


Homology
GenBank top hits (hit; e value; % identity; alignment)
KAA8530045.1 hypothetical protein F0562_004754 [Nyssa sinensis]; e value 1.5e-08; identity 35.11%
Query:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGK-IRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDL
        WK    G ++LN D SW+D   RG VG L+RD QG+ I   G  ++D G   + +E  AI  GL        + + +E DSL  +  I    EDF+ L +
Subjt:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGK-IRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDL

Query:  FIKEAQHLISLRHVKSISHVSRDHNLVAHSL
        +++E        +  S +HV R+ N VAH L
Subjt:  FIKEAQHLISLRHVKSISHVSRDHNLVAHSL

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]; e value 3.4e-08; identity 35.19%
Query:  PCSTSNLNT--HHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLI
        P    NLN     W       W+LN DASW++    G +GW+L D +G+I + G   I     I+ LE   I  GLQ        P+ +E DS++V+RL+
Subjt:  PCSTSNLNT--HHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLI

Query:  NEEEEDFT
         +E+ D T
Subjt:  NEEEEDFT

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]; e value 1.5e-16; identity 34.62%
Query:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF
        WKP    +W+LN +A+W   T  G +GW+LRD +G++     ++I     I++LE  AI EGL+A   +   P+ +E DSL+ + L++ + +D TE+   
Subjt:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF

Query:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL
        ++E   ++    + S+ H+SR+ N VAH L
Subjt:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]; e value 1.4e-14; identity 35.56%
Query:  NTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPS-DPPYPVCIECDSLQVVRLINEEEEDFT
        N   W+P     W LN DASW+D T RG +GW++R W G I + G + ++    +  LE++AI EGL+   +     P+ IE DS +V  L+N + ED T
Subjt:  NTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPS-DPPYPVCIECDSLQVVRLINEEEEDFT

Query:  ELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL
        +    ++E  +L     + + + V R+ N  AHSL
Subjt:  ELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL

XP_024957833.1 uncharacterized protein LOC112499264 [Citrus sinensis]; e value 8.9e-09; identity 29.23%
Query:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF
        W P  EG +++N +A+ N  TR+G +G ++RD  G++     K I     ++ +E+ AI+ G+Q        P+ +E DS +V+ L+  ++   TE+   
Subjt:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF

Query:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL
        I++ Q+ I    +  I H+ R+ N +AH++
Subjt:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL

TrEMBL top hits (hit; e value; % identity; alignment)
A0A3P6ESY2 RNase H domain-containing protein (Fragment); e value 2.1e-08; identity 33.86%
Query:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF
        W+P  E   + N DA+W    R G  GW+LRD +G++   G K +         E+ A+   LQ         V IE DSLQ++R+IN EEE +  L+  
Subjt:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF

Query:  IKEAQHLISLRHVKSISHVSRDHNLVA
        I+E   L+S+R   ++ +  R  N  A
Subjt:  IKEAQHLISLRHVKSISHVSRDHNLVA

A0A5C7IFK3 RNase H domain-containing protein; e value 4.8e-08; identity 31.06%
Query:  WEKEGEIFDLGSTNLHRPCSTSNLNTH---HWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSD
        W K   +F  G   + R  ++S L +H   +W P   GT ++N DA+ +    R   G ++RD  G+I      V      +   E+ AI EG+    + 
Subjt:  WEKEGEIFDLGSTNLHRPCSTSNLNTH---HWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSD

Query:  PPYPVCIECDSLQVVRLINEEEEDFTELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL
            + IE DSL VVRL N E     ++   I + Q L+S     SIS++ R  N+VAH +
Subjt:  PPYPVCIECDSLQVVRLINEEEEDFTELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL

A0A6J1CP26 uncharacterized protein LOC111013412; e value 7.4e-17; identity 34.62%
Query:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF
        WKP    +W+LN +A+W   T  G +GW+LRD +G++     ++I     I++LE  AI EGL+A   +   P+ +E DSL+ + L++ + +D TE+   
Subjt:  WKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLF

Query:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL
        ++E   ++    + S+ H+SR+ N VAH L
Subjt:  IKEAQHLISLRHVKSISHVSRDHNLVAHSL

A0A6J1CQG0 uncharacterized protein LOC111013216; e value 1.6e-08; identity 35.19%
Query:  PCSTSNLNT--HHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLI
        P    NLN     W       W+LN DASW++    G +GW+L D +G+I + G   I     I+ LE   I  GLQ        P+ +E DS++V+RL+
Subjt:  PCSTSNLNT--HHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLI

Query:  NEEEEDFT
         +E+ D T
Subjt:  NEEEEDFT

A0A6J1DNV9 uncharacterized protein LOC111022403; e value 6.9e-15; identity 35.56%
Query:  NTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPS-DPPYPVCIECDSLQVVRLINEEEEDFT
        N   W+P     W LN DASW+D T RG +GW++R W G I + G + ++    +  LE++AI EGL+   +     P+ IE DS +V  L+N + ED T
Subjt:  NTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPS-DPPYPVCIECDSLQVVRLINEEEEDFT

Query:  ELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL
        +    ++E  +L     + + + V R+ N  AHSL
Subjt:  ELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSL

SwissProt top hits (hit; e value; % identity; alignment)
No hits found

Arabidopsis top hits (hit; e value; % identity; alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein; e value 7.1e-04; identity 23.56%
Query:  WEVGRKRGSELRKRGKNERLGEGRGRVEKDEERMGGWEKEGEIFDLGSTNLHRPCSTSNLNTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGK
        W + + R +EL  RG+     E   R E D E    W    E    G+    +P   +  +   W+P      + N DA+WN    R  +GW+LR+ +G+
Subjt:  WEVGRKRGSELRKRGKNERLGEGRGRVEKDEERMGGWEKEGEIFDLGSTNLHRPCSTSNLNTHHWKPILEGTWRLNCDASWNDRTRRGEVGWLLRDWQGK

Query:  IRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLFIKEAQHLISLRHVKSISHVSRDHNLVA
        ++ +G + +     +   E  A+   + +        V  E DS  ++ ++N  +E +  L   I++ Q L+S         + R+ N +A
Subjt:  IRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLFIKEAQHLISLRHVKSISHVSRDHNLVA


Sequences
CDS sequence
ATGAGAGGTTGGGAGAAGGAAGGGGGAGAGTTGAGAAAGATGGGGAAAGAATGGGAGGTTGGGAGAAAGAGGGGGAGTGAGTTGAGAAAGAGGGGAAAGAATGAGAGGTT
GGGAGAAGGAAGGGGGAGAGTTGAGAAAGATGAGGAAAGAATGGGAGGTTGGGAGAAAGAGGGGGAAATTTTTGATCTGGGATCGACGAACCTACATCGGCCCTGTTCTA
CTAGTAACCTGAATACTCATCACTGGAAGCCGATCTTAGAAGGAACTTGGAGGCTAAACTGCGACGCCAGCTGGAACGACAGGACACGACGAGGTGAAGTAGGTTGGCTG
CTACGCGACTGGCAGGGCAAGATTCGAATGGTTGGGTATAAAGTTATCGACCTTGGATGGAGAATTTCTTGGTTGGAGTCAACGGCGATATCTGAAGGGTTGCAGGCGTT
TCCCTCAGATCCCCCTTACCCTGTGTGTATCGAGTGTGACTCCCTGCAAGTGGTTCGCCTAATTAATGAAGAAGAGGAAGATTTTACTGAGTTAGATCTTTTCATTAAAG
AAGCCCAACATCTTATTTCTTTACGGCATGTGAAATCCATCTCCCATGTATCAAGAGATCATAACCTTGTGGCCCATTCGCTGCCCGCTGGGCCTGTGAAGAGAATGAGT
CCAAAATTTGAACTGCCGCGAAACCGAGACGTTGTGGCCGCTGTTGCTACTCCGCCGAACGCTACCGGAGAAGACCCCATGCACCACCGTGCGCTAAGCACCTGTTGGAC
TGAGAGACTCGTCGGAAAATTGCGTGATGGACTGCTGCTAGCTAGGGGAGACAACTGA
mRNA sequence
ATGAGAGGTTGGGAGAAGGAAGGGGGAGAGTTGAGAAAGATGGGGAAAGAATGGGAGGTTGGGAGAAAGAGGGGGAGTGAGTTGAGAAAGAGGGGAAAGAATGAGAGGTT
GGGAGAAGGAAGGGGGAGAGTTGAGAAAGATGAGGAAAGAATGGGAGGTTGGGAGAAAGAGGGGGAAATTTTTGATCTGGGATCGACGAACCTACATCGGCCCTGTTCTA
CTAGTAACCTGAATACTCATCACTGGAAGCCGATCTTAGAAGGAACTTGGAGGCTAAACTGCGACGCCAGCTGGAACGACAGGACACGACGAGGTGAAGTAGGTTGGCTG
CTACGCGACTGGCAGGGCAAGATTCGAATGGTTGGGTATAAAGTTATCGACCTTGGATGGAGAATTTCTTGGTTGGAGTCAACGGCGATATCTGAAGGGTTGCAGGCGTT
TCCCTCAGATCCCCCTTACCCTGTGTGTATCGAGTGTGACTCCCTGCAAGTGGTTCGCCTAATTAATGAAGAAGAGGAAGATTTTACTGAGTTAGATCTTTTCATTAAAG
AAGCCCAACATCTTATTTCTTTACGGCATGTGAAATCCATCTCCCATGTATCAAGAGATCATAACCTTGTGGCCCATTCGCTGCCCGCTGGGCCTGTGAAGAGAATGAGT
CCAAAATTTGAACTGCCGCGAAACCGAGACGTTGTGGCCGCTGTTGCTACTCCGCCGAACGCTACCGGAGAAGACCCCATGCACCACCGTGCGCTAAGCACCTGTTGGAC
TGAGAGACTCGTCGGAAAATTGCGTGATGGACTGCTGCTAGCTAGGGGAGACAACTGA
Protein sequence
MRGWEKEGGELRKMGKEWEVGRKRGSELRKRGKNERLGEGRGRVEKDEERMGGWEKEGEIFDLGSTNLHRPCSTSNLNTHHWKPILEGTWRLNCDASWNDRTRRGEVGWL
LRDWQGKIRMVGYKVIDLGWRISWLESTAISEGLQAFPSDPPYPVCIECDSLQVVRLINEEEEDFTELDLFIKEAQHLISLRHVKSISHVSRDHNLVAHSLPAGPVKRMS
PKFELPRNRDVVAAVATPPNATGEDPMHHRALSTCWTERLVGKLRDGLLLARGDN
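
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard nuclear genetic code applies to this gene; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, and only the first 60 nucleotides of the CDS are used as sample input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, dropping one trailing stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    # Remove the terminal stop symbol, if present, to match protein records.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First 60 nucleotides of the CDS sequence listed in this record.
    cds_prefix = "ATGAGAGGTTGGGAGAAGGAAGGGGGAGAGTTGAGAAAGATGGGGAAAGAATGGGAGGTT"
    print(translate(cds_prefix))  # MRGWEKEGGELRKMGKEWEV
```

Passing the full CDS, with its line breaks left in, should reproduce the listed protein sequence if the standard-code assumption holds.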