; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034692 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034692
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr3:9828816..9829433
RNA-Seq ExpressionLag0034692
SyntenyLag0034692
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8478291.1 hypothetical protein CXB51_028103 [Gossypium anomalum]7.1e-1124.87Show/hide
Query:  YNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRTR
        +N+++++  +   +  W IW N+N I H+     V     +++ Y  E   +     R VQ + N      G D + ++ DATF++ +RR     + R +
Subjt:  YNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRTR

Query:  EGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE
        +G  M      +   S P+ AEA A L+ V ++ +  F  + +  DSL++I  L   ++ +++  + V +I+     F+ + F H+ RE
Subjt:  EGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE

MBA0701905.1 hypothetical protein [Gossypium aridum]9.9e-1325.26Show/hide
Query:  WLC-IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMI-LHVDATFDEESRRCGVR
        WL  ++   S    +  C   WAIW +RNN +H +          ++  Y+ E   +G  +   V+S ++ S      D+++ ++ DA +DE+S+     
Subjt:  WLC-IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMI-LHVDATFDEESRRCGVR

Query:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR
         V R  E K +      +   +SP  AE +A  + +K   +  ++K+ +  DSLS+I         ++     ++DI  L+++FQ+  F HV R
Subjt:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR

XP_012487919.1 PREDICTED: uncharacterized protein LOC105801131 [Gossypium raimondii]7.6e-1326.8Show/hide
Query:  WLCIYNNVSSQTPERI-CMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMI-LHVDATFDEESRRCGVR
        WL    + +S++  RI C   WAIW +RNN +H +          ++  Y+ E   +G  +    QS +  S      D+++ ++ DA +D +S++    
Subjt:  WLCIYNNVSSQTPERI-CMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMI-LHVDATFDEESRRCGVR

Query:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR
         V R  EGK +      +   +SP  AEA+A  + +K+  +  ++K+ +  DSLS+I         ++     ++DI  L++ FQ+  F HV R
Subjt:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR

XP_030969948.1 uncharacterized protein LOC115990241 [Quercus lobata]1.2e-1027.84Show/hide
Query:  MVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRTREGKFMFILHKGFL
        MV WA+WN RNN+   +   P+ +  +  + +L ++      R   V      S    G  +  +++D    E     G R VIR   G+ M  L +   
Subjt:  MVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRTREGKFMFILHKGFL

Query:  LFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE
        L SS L  EA+A   G++L  +  F+ + + SDS  LI  L  G    +   + V DI+++ +    +N+ HV R+
Subjt:  LFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE

XP_030975730.1 uncharacterized protein LOC115995344 [Quercus lobata]7.1e-1127.32Show/hide
Query:  CIYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKR----GRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVR
        C++    ++ P    MVAWAIWN RNN+   +P  P+       QD + E+    +      GR   S +  S+ I       ++ D          G+ 
Subjt:  CIYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKR----GRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVR

Query:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR
         VIR  EG+ M  L +   L  + +  EA+A    +KL+ +  F+++++  DS  LI  L++     +   + + DIQ+L + F  +N+ HV R
Subjt:  AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR

TrEMBL top hitse value%identityAlignment
A0A1U8JJX9 uncharacterized protein LOC1079077829.4e-0924.76Show/hide
Query:  ESIWNIKDRWLCI-YNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI---AGEDEMILHVDAT
        ES W    +WL + + N S++  +   +  WA+W N N I H+           +I  Y  E  ++    G ++++ +     +     +D + ++ DA+
Subjt:  ESIWNIKDRWLCI-YNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI---AGEDEMILHVDAT

Query:  FDEESRRCGVRAVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNF
        F++ SRR     + R +EG  M      +   S P+ AEA+A L+ V ++ +  F+ + V  D+L++I  L   K+ ++   + + +I      F++M F
Subjt:  FDEESRRCGVRAVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNF

Query:  CHVNRE
          V RE
Subjt:  CHVNRE

A0A2P5EP22 Ribonuclease H-like domain containing protein5.5e-0928.57Show/hide
Query:  ERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI----AGEDEMILHVDATFDEESRRCGVRAVIRTREGKFM
        E  C++ WAIWN RN+++H +         +W+Q +LFE+   G K    V S+    N++         + L+VDA   ++S   GV   IR  EG  +
Subjt:  ERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI----AGEDEMILHVDATFDEESRRCGVRAVIRTREGKFM

Query:  FILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQY
               + +   + +E +A  EGV+L+ +      S+ SDSLS +  +     C A   + V DI +
Subjt:  FILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQY

A0A5B6VK73 Reverse transcriptase2.2e-1025.26Show/hide
Query:  IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRT
        ++   S+Q    I    WA+W +RN +IH+R +        +I DYL E    G  +   +  L N S     +  + ++ DA FD++ +R     VIR 
Subjt:  IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVRAVIRT

Query:  REGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE
          G+ +   H       S   AEA+  ++ V+  A+  F +V V   +LS+I  +L     +++  + + D + L++ F      H  R+
Subjt:  REGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE

A0A5B6VXJ9 Reverse transcriptase1.2e-0825.25Show/hide
Query:  DRWLC-IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGV
        ++WL  ++  V+       C   WAIW +RN+ IH + V        +I +YL E   L  ++   +   +  +      D + ++VD  FD  +     
Subjt:  DRWLC-IYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGV

Query:  RAVIRTREGKFMFI---LHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR
          V+R  EG  +     +HKG     S   AEA+A  E V++  +  + K+ +  DSL++I    +  + ++     + DIQ + ++ +K  F HV R
Subjt:  RAVIRTREGKFMFI---LHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNR

A0A6J1DX30 uncharacterized protein LOC1110248741.7e-1026.19Show/hide
Query:  MGAESIWNIKDRWLCIYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI-----AGEDEMILH
        + AE   +  + W  +   +  +      +  W IWN+RN++IH + V PV   CEW+  +L + H          ++  N   ++     +    + L+
Subjt:  MGAESIWNIKDRWLCIYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMI-----AGEDEMILH

Query:  VDATFDEESRRCGVRAVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQ
         DA     S   G   +IR      +           SPL AE    LEG+K +A  NF  + V SDSL  I ++ +    + D  N V +IQ L   F 
Subjt:  VDATFDEESRRCGVRAVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQ

Query:  KMNFCHVNRE
         ++F H +R+
Subjt:  KMNFCHVNRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTGAATCTATTTGGAATATCAAAGATCGATGGCTTTGTATTTATAATAATGTTTCTTCACAGACCCCAGAGCGGATATGTATGGTTGCATGGGCTATTTGGAA
CAACCGTAATAATATAATTCATCAAAGGCCAGTCCCTCCTGTGGGGGTTTTCTGTGAATGGATTCAGGACTATCTCTTTGAATACCACAGATTAGGTTATAAGCGTGGTC
GTATGGTTCAATCTTTGGAGAATGTATCGAACATGATCGCTGGAGAGGACGAGATGATTTTGCATGTTGACGCGACATTTGATGAAGAGTCTCGACGTTGTGGAGTGAGG
GCAGTTATCAGAACAAGGGAAGGTAAATTTATGTTTATTTTGCATAAAGGTTTCCTTTTGTTCTCTTCCCCTTTATGCGCCGAGGCGGTGGCGGATTTGGAGGGTGTTAA
ATTGTCAGCTCAACAGAATTTTAAGAAAGTGTCGGTCTTCTCAGATTCTCTATCACTAATCCCTATTCTTCTTCATGGTAAGCAGTGTCAAGCAGATTGTCTCAACACGG
TTACGGATATCCAATATTTAAGGAACACTTTCCAGAAGATGAACTTTTGTCATGTCAATCGTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCTGAATCTATTTGGAATATCAAAGATCGATGGCTTTGTATTTATAATAATGTTTCTTCACAGACCCCAGAGCGGATATGTATGGTTGCATGGGCTATTTGGAA
CAACCGTAATAATATAATTCATCAAAGGCCAGTCCCTCCTGTGGGGGTTTTCTGTGAATGGATTCAGGACTATCTCTTTGAATACCACAGATTAGGTTATAAGCGTGGTC
GTATGGTTCAATCTTTGGAGAATGTATCGAACATGATCGCTGGAGAGGACGAGATGATTTTGCATGTTGACGCGACATTTGATGAAGAGTCTCGACGTTGTGGAGTGAGG
GCAGTTATCAGAACAAGGGAAGGTAAATTTATGTTTATTTTGCATAAAGGTTTCCTTTTGTTCTCTTCCCCTTTATGCGCCGAGGCGGTGGCGGATTTGGAGGGTGTTAA
ATTGTCAGCTCAACAGAATTTTAAGAAAGTGTCGGTCTTCTCAGATTCTCTATCACTAATCCCTATTCTTCTTCATGGTAAGCAGTGTCAAGCAGATTGTCTCAACACGG
TTACGGATATCCAATATTTAAGGAACACTTTCCAGAAGATGAACTTTTGTCATGTCAATCGTGAGTAG
Protein sequenceShow/hide protein sequence
MGAESIWNIKDRWLCIYNNVSSQTPERICMVAWAIWNNRNNIIHQRPVPPVGVFCEWIQDYLFEYHRLGYKRGRMVQSLENVSNMIAGEDEMILHVDATFDEESRRCGVR
AVIRTREGKFMFILHKGFLLFSSPLCAEAVADLEGVKLSAQQNFKKVSVFSDSLSLIPILLHGKQCQADCLNTVTDIQYLRNTFQKMNFCHVNRE