CuGenDBv2

Lag0033040 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033040
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr11:40272777..40274938
RNA-Seq Expression: Lag0033040
Synteny: Lag0033040
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
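
As a small illustration of how the Genome location field can be read, the sketch below splits a chrom:start..end string into its parts and computes the span. It assumes the coordinates are 1-based and inclusive, which this page does not state; parse_location and span_length are hypothetical helper names, not part of any database API.

```python
import re

# Hypothetical helpers; the "chrom:start..end" layout is copied from the
# Genome location field of this record.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'chr11:40272777..40274938' into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:40272777..40274938")
print(chrom, start, end, span_length(start, end))
# prints: chr11 40272777 40274938 2162
```

Under the inclusive-coordinate assumption the locus covers 2,162 bp; with a different coordinate convention the figure would shift by one.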


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]; e value 6.6e-18; identity 33.85%
Query:  MWSTRNKARFQGAE-RPSGLVEWAKGYVMAFR-----------EVGRS-REEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLG-IIVRNPMGQ
        +W+ RN   F         L  W   Y+  F+           +V +S ++  ++   +   +W     G +K+  DASF S    AGLG II+R+  GQ
Subjt:  MWSTRNKARFQGAE-RPSGLVEWAKGYVMAFR-----------EVGRS-REEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLG-IIVRNPMGQ

Query:  VMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        V+ SAT   ++V  VD AE  AAV+ L +  + G  P +LETDS R+  L  R++E  S+ G  I        + LQV + FT R GN  AH LA
Subjt:  VMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]; e value 8.0e-32; identity 49.08%
Query:  LVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRD
        LVEWA  YVM FRE   +   GRVT   E ++W  P    YK+N DASFL+    AGLGII+RN  GQVM SAT   +N++ VDMAE   AV+ L+L   
Subjt:  LVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRD

Query:  MGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        +G +P ILETDSSR+  L ++  ED SE G  +  A   +   L   F F  REGN+AAH LA
Subjt:  MGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e value 1.1e-25; identity 40.32%
Query:  MWSTRNKARFQGAERP-----SGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK
        +W+ RN   F  + +        LVEWA  Y M FRE   +   GRVT   E I+W+ P  G YK+N DASFL+    AGLGII+ N  GQVM +AT   
Subjt:  MWSTRNKARFQGAERP-----SGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK

Query:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        +N++ VDMAE  AAV+ L+L  ++G  P++                ED SE G  +  A   +   L   F F  REGN+AAH LA
Subjt:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]; e value 1.6e-19; identity 34.76%
Query:  MWSTRNKA--RFQGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERI-VWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN
        +W+ RN+    F      S LV W++ Y+  ++   RS     V     R+ +WR P+    KVN DA+F  +   AG+G+I+R+  G V L+A      
Subjt:  MWSTRNKA--RFQGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERI-VWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN

Query:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQ-VLFGFTHREGNQAAHRLASL
          DVD  EG+A  + + L  + GF    +ETDS R+  LL  +  D SE+G+  +   +   S+ + V F FTHR GN  AH LA L
Subjt:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQ-VLFGFTHREGNQAAHRLASL

XP_042962737.1 uncharacterized protein LOC122297016 [Carya illinoinensis]; e value 6.6e-18; identity 33.52%
Query:  MWSTRNKARF-QGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVR
        +WS RN++   QG + P+ +V  AK  ++ ++EV  SR+E  +   ++   WR PS G YKVN DA+  S+    GLG++VR+  G V+ +    +    
Subjt:  MWSTRNKARF-QGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVR

Query:  DVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        +   AE Y  V +    R++G     LE DS  V  LL     + S  G+ + DA +   S          REGNQA+H+LA
Subjt:  DVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1CDQ4 uncharacterized protein LOC111010533; e value 3.2e-18; identity 33.85%
Query:  MWSTRNKARFQGAE-RPSGLVEWAKGYVMAFR-----------EVGRS-REEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLG-IIVRNPMGQ
        +W+ RN   F         L  W   Y+  F+           +V +S ++  ++   +   +W     G +K+  DASF S    AGLG II+R+  GQ
Subjt:  MWSTRNKARFQGAE-RPSGLVEWAKGYVMAFR-----------EVGRS-REEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLG-IIVRNPMGQ

Query:  VMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        V+ SAT   ++V  VD AE  AAV+ L +  + G  P +LETDS R+  L  R++E  S+ G  I        + LQV + FT R GN  AH LA
Subjt:  VMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

A0A6J1CIF1 uncharacterized protein LOC111011237; e value 3.9e-32; identity 49.08%
Query:  LVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRD
        LVEWA  YVM FRE   +   GRVT   E ++W  P    YK+N DASFL+    AGLGII+RN  GQVM SAT   +N++ VDMAE   AV+ L+L   
Subjt:  LVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRD

Query:  MGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        +G +P ILETDSSR+  L ++  ED SE G  +  A   +   L   F F  REGN+AAH LA
Subjt:  MGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

A0A6J1DAR4 uncharacterized protein LOC111018954; e value 5.4e-26; identity 40.32%
Query:  MWSTRNKARFQGAERP-----SGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK
        +W+ RN   F  + +        LVEWA  Y M FRE   +   GRVT   E I+W+ P  G YK+N DASFL+    AGLGII+ N  GQVM +AT   
Subjt:  MWSTRNKARFQGAERP-----SGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK

Query:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        +N++ VDMAE  AAV+ L+L  ++G  P++                ED SE G  +  A   +   L   F F  REGN+AAH LA
Subjt:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

A0A6J1DBJ7 uncharacterized protein LOC111018973; e value 7.6e-20; identity 34.76%
Query:  MWSTRNKA--RFQGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERI-VWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN
        +W+ RN+    F      S LV W++ Y+  ++   RS     V     R+ +WR P+    KVN DA+F  +   AG+G+I+R+  G V L+A      
Subjt:  MWSTRNKA--RFQGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERI-VWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN

Query:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQ-VLFGFTHREGNQAAHRLASL
          DVD  EG+A  + + L  + GF    +ETDS R+  LL  +  D SE+G+  +   +   S+ + V F FTHR GN  AH LA L
Subjt:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQ-VLFGFTHREGNQAAHRLASL

A0A7J7GYW5 Uncharacterized protein; e value 7.1e-18; identity 35.81%
Query:  EGRVTICRER---IVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQ
        + R   C  R   + W AP+NGW K+N D +   ++   G+G+++RN +G+VM + +       D D AE YAA K++EL RD+GF    LE DS R+ +
Subjt:  EGRVTICRER---IVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQ

Query:  LLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLASL
         L  E E  SE G  +   + A  S+ +       R+GN  AH LA +
Subjt:  LLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLASL

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G33160.1 glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein; e value 7.3e-07; identity 25.54%
Query:  MWSTRNKARFQGAERP-SGLVEWAKGYVMAFRE-VGRSREEGRVTICRERIV-WRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN
        +W++RN   FQ    P    ++ A+  +  + E V  S      T  R  I  WR P+NGW K N D SF++       G IVR+  G+   +     + 
Subjt:  MWSTRNKARFQGAERP-SGLVEWAKGYVMAFRE-VGRSREEGRVTICRERIV-WRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDN

Query:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
        + +    E  A + +++     G+     E D+  +  L+N  +         I D L     + Q  F +T+R+ N+ A  LA
Subjt:  VRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 1.1e-10; identity 28.49%
Query:  MWSTRNKARFQGAERPSGLVEWAKGYVMAFREVGRSRE-EGRVTICR-ER---IVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK
        +W +RN+  F+G E  +   E  +  +  F E    RE EG+ +  + ER   + W+AP   W K N DA++  +  R G+G I+RN  G V+       
Subjt:  MWSTRNKARFQGAERPSGLVEWAKGYVMAFREVGRSRE-EGRVTICR-ER---IVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTK

Query:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
           ++V  AE  A   ++  +    +   I E+D+  +  LLN + +    L   + D       + +V F FT R GN+ A R+A
Subjt:  DNVRDVDMAEGYAAVKSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLA
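
The % identity values reported with each hit can be related to the alignment blocks with a short sketch. It assumes the usual BLAST conventions, which this page does not spell out: in the middle line a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap, and identity is the number of identical positions divided by the alignment length. percent_identity is a hypothetical helper name.

```python
def percent_identity(query_line, midline):
    """Percent identity of one alignment segment.

    ASSUMPTION: BLAST-style midline, where letters mark identical residues.
    The query line (gaps included) is used as the alignment length because
    trailing blanks of the midline are often lost when a page is copied.
    """
    aligned = len(query_line)
    if aligned == 0:
        raise ValueError("empty alignment segment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned


# First ten columns of the XP_022140628.1 alignment shown above.
print(percent_identity("LVEWAKGYVM", "LVEWA  YVM"))  # prints: 80.0
```

For a hit split over several Query/middle/Subjt blocks, the segments would be concatenated before calling the function so that the ratio is taken over the full alignment.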


Sequences
CDS sequence
ATGTGGTCGACCCGGAACAAAGCTCGTTTTCAAGGGGCTGAAAGGCCGAGTGGGCTAGTGGAATGGGCTAAAGGGTATGTAATGGCTTTCCGTGAGGTTGGGAGGAGTAG
GGAGGAAGGTCGAGTGACGATTTGTCGTGAGAGGATAGTGTGGCGGGCGCCGAGTAATGGCTGGTATAAGGTGAATTGTGATGCCTCTTTTCTGTCTGATGTGTCCAGGG
CTGGTCTAGGGATAATTGTGAGAAACCCTATGGGCCAAGTGATGCTTTCAGCGACTTTCACTAAGGATAATGTGAGAGATGTCGACATGGCTGAGGGATATGCAGCTGTG
AAAAGTCTGGAGCTCGTGAGAGATATGGGTTTCGACCCATCAATTCTTGAGACGGACTCGAGTAGAGTTTGCCAGCTTCTTAATCGAGAGCGCGAGGATGCGTCTGAGCT
AGGTATGACTATTGCAGATGCACTGATGGCTTTTCCTTCTTATTTGCAGGTATTGTTTGGTTTTACTCACCGTGAGGGGAATCAGGCGGCGCACCGATTGGCGAGCTTGG
TTGGAGGCCATGCATCTGAGCATTGGACCTTTTTGTTTCTGGGATTAGTGGAAACTGTGGTGAGACTAATAGCTGAGTTCTGGGGGTTTGGGTTCTTAGCCATGATACTG
TTGTTTGAGTACTCGTTCACTCTTTTTTTAGTGTATAGACTTGAAGGTCTAGATCTGCTTTATGGGGTTCATAAATCTGTTGTCACTCCGATAGGGGTAAGGTTTCGATT
GGGAGTATCTGATTTCTCCAAGTTGGCGTGCCTAGCAAACTCACATGGACTAATCCATACTTATGTAAGGAGGCAGGAGGATGGAGGTTTTGGATGGCTAGCTGAATCCG
GCTGA
mRNA sequence
ATGTGGTCGACCCGGAACAAAGCTCGTTTTCAAGGGGCTGAAAGGCCGAGTGGGCTAGTGGAATGGGCTAAAGGGTATGTAATGGCTTTCCGTGAGGTTGGGAGGAGTAG
GGAGGAAGGTCGAGTGACGATTTGTCGTGAGAGGATAGTGTGGCGGGCGCCGAGTAATGGCTGGTATAAGGTGAATTGTGATGCCTCTTTTCTGTCTGATGTGTCCAGGG
CTGGTCTAGGGATAATTGTGAGAAACCCTATGGGCCAAGTGATGCTTTCAGCGACTTTCACTAAGGATAATGTGAGAGATGTCGACATGGCTGAGGGATATGCAGCTGTG
AAAAGTCTGGAGCTCGTGAGAGATATGGGTTTCGACCCATCAATTCTTGAGACGGACTCGAGTAGAGTTTGCCAGCTTCTTAATCGAGAGCGCGAGGATGCGTCTGAGCT
AGGTATGACTATTGCAGATGCACTGATGGCTTTTCCTTCTTATTTGCAGGTATTGTTTGGTTTTACTCACCGTGAGGGGAATCAGGCGGCGCACCGATTGGCGAGCTTGG
TTGGAGGCCATGCATCTGAGCATTGGACCTTTTTGTTTCTGGGATTAGTGGAAACTGTGGTGAGACTAATAGCTGAGTTCTGGGGGTTTGGGTTCTTAGCCATGATACTG
TTGTTTGAGTACTCGTTCACTCTTTTTTTAGTGTATAGACTTGAAGGTCTAGATCTGCTTTATGGGGTTCATAAATCTGTTGTCACTCCGATAGGGGTAAGGTTTCGATT
GGGAGTATCTGATTTCTCCAAGTTGGCGTGCCTAGCAAACTCACATGGACTAATCCATACTTATGTAAGGAGGCAGGAGGATGGAGGTTTTGGATGGCTAGCTGAATCCG
GCTGA
Protein sequence
MWSTRNKARFQGAERPSGLVEWAKGYVMAFREVGRSREEGRVTICRERIVWRAPSNGWYKVNCDASFLSDVSRAGLGIIVRNPMGQVMLSATFTKDNVRDVDMAEGYAAV
KSLELVRDMGFDPSILETDSSRVCQLLNREREDASELGMTIADALMAFPSYLQVLFGFTHREGNQAAHRLASLVGGHASEHWTFLFLGLVETVVRLIAEFWGFGFLAMIL
LFEYSFTLFLVYRLEGLDLLYGVHKSVVTPIGVRFRLGVSDFSKLACLANSHGLIHTYVRRQEDGGFGWLAESG
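
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is a minimal, dependency-free translator that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used for this annotation. CODON_TABLE and translate are hypothetical names.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A/C/G/T are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence of Lag0033040.
print(translate("ATGTGGTCGACCCGGAACAAAGCTCGT"))  # prints: MWSTRNKAR
```

Pasting the full CDS (line breaks included) into translate and comparing the result with the Protein sequence field is a quick way to confirm that the two entries agree.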