; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029115 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029115
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr8:35557709..35562630
RNA-Seq ExpressionLag0029115
SyntenyLag0029115
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006473914.2 uncharacterized protein LOC102629934 [Citrus sinensis]3.1e-1738.24Show/hide
Query:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSM
        KW  PP+GW+KVN+DA    E+   G+G+V+RN  GKVMAA ++   ++  VD AE  AA  GL++A  +G FP +IE+DS+ V EL+  ++   +++  
Subjt:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSM

Query:  LLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
        ++++  D       ++   + R  N LAH LA LA+
Subjt:  LLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

XP_015388332.1 uncharacterized protein LOC107178077 [Citrus sinensis]1.2e-1636.42Show/hide
Query:  YSPIVRGNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVF
        +S I+   + +R+ V W  PP GW+K+N+DA  + +  I G+G+++RN +G+V+AA VQ   +  S    E  A   G+K A   GF P +IETDS+ V 
Subjt:  YSPIVRGNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVF

Query:  ELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
        +L   ++V +++ S L+AD  +    S       + RE N  AH LAK A+
Subjt:  ELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]2.8e-1836.55Show/hide
Query:  GNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE
        G   +   V W+ P K  YK+N DA+F       G+G+++RN  G+VMA+  ++   ++SVDMAE   A +GL+LA+++G  P ++ETDS R+F L    
Subjt:  GNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE

Query:  RVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
          +LS+   ++  A + W+ S         REGN+ AH LA+ A+
Subjt:  RVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.7e-1828.89Show/hide
Query:  IRCMVGGKDAGKLEIPSKIKIFPHDGTATFFVLSTQGRLCLDRLPSVDNLIKRGVDMMNLCSFCGWESQ-------LCMF---------FG-YSPIV---
        +RC   G    K+ IP+KIK+F               RLCLDRLP+  NL KRGV++ N C FCG   +       +C F         FG  SP +   
Subjt:  IRCMVGGKDAGKLEIPSKIKIFPHDGTATFFVLSTQGRLCLDRLPSVDNLIKRGVDMMNLCSFCGWESQ-------LCMF---------FG-YSPIV---

Query:  ------------------------RGNRADRESVK---------------------------------------WVVPPKGWYKVNIDATFDGEKIIVGV
                                R  RA  +S K                                       W  P +G YK+N DA+F       G+
Subjt:  ------------------------RGNRADRESVK---------------------------------------WVVPPKGWYKVNIDATFDGEKIIVGV

Query:  GLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRL
        G+++ N  G+VMAA  ++   ++SVDMAE  AA +GL+LA+E+G  PAL                 +LS+   ++  A + W+ S         REGN+ 
Subjt:  GLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRL

Query:  AHHLAK--LAIHEQS
        AH LA+  L +HE S
Subjt:  AHHLAK--LAIHEQS

XP_024042007.1 uncharacterized protein LOC112099133 [Citrus clementina]3.1e-1738.97Show/hide
Query:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSM
        KW  PP+GW+KVNIDA    E+   G+G+V+RN  GKVMAA ++   ++  VD AE  AA  GL++A  +G FP +IE+DS+ V EL+  ++   +++  
Subjt:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSM

Query:  LLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
        ++++  D       ++   + R  N LAH LA LA+
Subjt:  LLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

TrEMBL top hitse value%identityAlignment
A0A2N9J3G5 Uncharacterized protein2.2e-1334.97Show/hide
Query:  NRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGER
        ++  RES++W  P +GWYKVN+D     +   VG+G+V+RN  G+ + A  +   Y      AE  AA   ++ ATE+  F  + E D  +V + L    
Subjt:  NRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGER

Query:  VELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLA
         +LS +  L A A  ++SL      V   REGN +AH LA+ A
Subjt:  VELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLA

A0A6J1CIF1 uncharacterized protein LOC1110112371.4e-1836.55Show/hide
Query:  GNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE
        G   +   V W+ P K  YK+N DA+F       G+G+++RN  G+VMA+  ++   ++SVDMAE   A +GL+LA+++G  P ++ETDS R+F L    
Subjt:  GNRADRESVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE

Query:  RVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
          +LS+   ++  A + W+ S         REGN+ AH LA+ A+
Subjt:  RVELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

A0A6J1DAR4 uncharacterized protein LOC1110189541.8e-1828.89Show/hide
Query:  IRCMVGGKDAGKLEIPSKIKIFPHDGTATFFVLSTQGRLCLDRLPSVDNLIKRGVDMMNLCSFCGWESQ-------LCMF---------FG-YSPIV---
        +RC   G    K+ IP+KIK+F               RLCLDRLP+  NL KRGV++ N C FCG   +       +C F         FG  SP +   
Subjt:  IRCMVGGKDAGKLEIPSKIKIFPHDGTATFFVLSTQGRLCLDRLPSVDNLIKRGVDMMNLCSFCGWESQ-------LCMF---------FG-YSPIV---

Query:  ------------------------RGNRADRESVK---------------------------------------WVVPPKGWYKVNIDATFDGEKIIVGV
                                R  RA  +S K                                       W  P +G YK+N DA+F       G+
Subjt:  ------------------------RGNRADRESVK---------------------------------------WVVPPKGWYKVNIDATFDGEKIIVGV

Query:  GLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRL
        G+++ N  G+VMAA  ++   ++SVDMAE  AA +GL+LA+E+G  PAL                 +LS+   ++  A + W+ S         REGN+ 
Subjt:  GLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSMLLADALDEWSLSWPLQSVISLREGNRL

Query:  AHHLAK--LAIHEQS
        AH LA+  L +HE S
Subjt:  AHHLAK--LAIHEQS

A0A7J7GYW5 Uncharacterized protein2.9e-1335.97Show/hide
Query:  VKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLS
        V W  P  GW K+N D     E   VGVG+V+RNH G+VMAA  +   +    D AE +AA   ++LA ++GF    +E DS R+ + LR E   +S+  
Subjt:  VKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLS

Query:  MLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAIHE
         +L   +               R+GN LAH LA++A H+
Subjt:  MLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAIHE

V4UMR3 RNase H domain-containing protein (Fragment)4.5e-1435.29Show/hide
Query:  WVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSML
        W  PP  WYKVN+DA     +   G+G++V N +G+VMAA++    + R ++  E  A  +GL+L  ++G  PA+IE+DS  V  L+  +     ++  L
Subjt:  WVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSML

Query:  LADALDEWSLS-WPLQSVISLREGNRLAHHLAKLAI
        ++D  +  S S    Q   + R  N +AHHL+KLA+
Subjt:  LADALDEWSLS-WPLQSVISLREGNRLAHHLAKLAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0627.52Show/hide
Query:  VRGNRADRESV--KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFEL
        VR    +RES   KW  PP GW K N D +F+        G ++R+  G    A     G + +   +E  A    ++     G+   + E DS++V EL
Subjt:  VRGNRADRESV--KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFEL

Query:  LRGERVELSDLSMLLADALDEWSLSWPLQSVI---SLREGNRLAHHLAK
        L  +++     + +     + WS S   + VI   + R  N+ A  LAK
Subjt:  LRGERVELSDLSMLLADALDEWSLSWPLQSVI---SLREGNRLAHHLAK

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-0525.69Show/hide
Query:  SVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE------R
        SV+W  PP  W K N DAT+  E    G+G ++RN SG V+    +     ++V  AE  A    +   +   +   + E+D++ +  LL  +      +
Subjt:  SVKWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGE------R

Query:  VELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI
          L D+  LL    +       ++   + R GN++A  +A+ +I
Subjt:  VELSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAKLAI

AT4G29090.1 Ribonuclease H-like superfamily protein9.0e-0727.34Show/hide
Query:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERV------E
        +W  PP  W K N DAT++ +    G+G V+RN  G+V     +    ++SV  AE  A    +   +   +   + E+DS+ + E+L  + +       
Subjt:  KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERV------E

Query:  LSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAK
        + DL  LL+   +       ++ V   REGN LA  +A+
Subjt:  LSDLSMLLADALDEWSLSWPLQSVISLREGNRLAHHLAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCTCGGCCCATGGCCGAGGCGACCCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTG
GTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCAAAAACCCTAAAAAGGCTAGGAGGATGAACAGGCCACGTATTCCCC
CCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGACACTAGGGACCAAATGGAGGAGGAGAGACTCGGCCCACACGAGTGGGTTGAGTGGTCGATAGGCCGA
GACCGAGCATGGGATTGGGCTTCGACTCGCCTTCCTCTGAGGGATGGTGATGAGTCTCGGATTCGCTGCATGGTTGGTGGCAAGGATGCTGGAAAATTGGAGATCCCAAG
TAAGATTAAAATTTTCCCTCATGATGGAACGGCTACCTTCTTTGTGTTGTCTACTCAAGGGCGATTATGTTTGGATCGTCTTCCATCTGTTGATAATTTAATCAAACGTG
GGGTTGACATGATGAATCTTTGTTCCTTTTGTGGTTGGGAGAGTCAACTTTGCATGTTTTTTGGCTATAGTCCTATTGTGAGAGGGAATCGAGCTGACCGTGAGTCTGTA
AAGTGGGTTGTCCCACCCAAAGGTTGGTATAAAGTCAATATAGACGCAACCTTTGATGGGGAGAAAATAATTGTTGGAGTAGGCTTGGTAGTGAGAAACCATTCTGGGAA
AGTTATGGCTGCGACAGTTCAATTCCATGGGTATGTCAGGAGCGTTGATATGGCGGAAGGTTGGGCGGCTGCCGACGGTCTGAAACTCGCTACTGAGATGGGGTTTTTTC
CAGCTTTGATCGAGACTGATTCGAGACGTGTGTTTGAGCTTCTGCGAGGCGAGAGAGTAGAACTCTCTGATCTGAGTATGCTGCTTGCTGATGCCTTGGATGAATGGTCG
TTGTCTTGGCCTTTGCAATCTGTCATCTCCTTACGTGAAGGGAATCGTCTAGCTCATCATTTAGCGAAATTGGCTATTCATGAGCAAAGTGACTGTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCTCGGCCCATGGCCGAGGCGACCCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTG
GTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCAAAAACCCTAAAAAGGCTAGGAGGATGAACAGGCCACGTATTCCCC
CCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGACACTAGGGACCAAATGGAGGAGGAGAGACTCGGCCCACACGAGTGGGTTGAGTGGTCGATAGGCCGA
GACCGAGCATGGGATTGGGCTTCGACTCGCCTTCCTCTGAGGGATGGTGATGAGTCTCGGATTCGCTGCATGGTTGGTGGCAAGGATGCTGGAAAATTGGAGATCCCAAG
TAAGATTAAAATTTTCCCTCATGATGGAACGGCTACCTTCTTTGTGTTGTCTACTCAAGGGCGATTATGTTTGGATCGTCTTCCATCTGTTGATAATTTAATCAAACGTG
GGGTTGACATGATGAATCTTTGTTCCTTTTGTGGTTGGGAGAGTCAACTTTGCATGTTTTTTGGCTATAGTCCTATTGTGAGAGGGAATCGAGCTGACCGTGAGTCTGTA
AAGTGGGTTGTCCCACCCAAAGGTTGGTATAAAGTCAATATAGACGCAACCTTTGATGGGGAGAAAATAATTGTTGGAGTAGGCTTGGTAGTGAGAAACCATTCTGGGAA
AGTTATGGCTGCGACAGTTCAATTCCATGGGTATGTCAGGAGCGTTGATATGGCGGAAGGTTGGGCGGCTGCCGACGGTCTGAAACTCGCTACTGAGATGGGGTTTTTTC
CAGCTTTGATCGAGACTGATTCGAGACGTGTGTTTGAGCTTCTGCGAGGCGAGAGAGTAGAACTCTCTGATCTGAGTATGCTGCTTGCTGATGCCTTGGATGAATGGTCG
TTGTCTTGGCCTTTGCAATCTGTCATCTCCTTACGTGAAGGGAATCGTCTAGCTCATCATTTAGCGAAATTGGCTATTCATGAGCAAAGTGACTGTGGATAG
Protein sequenceShow/hide protein sequence
MGRAKTEGVGFSARPPARPRPMAEATLGPLVRAESVWSRLVPTASGCPGFAWFDLKRLKNPKKARRMNRPRIPPSTTNLPLVAREGQDTRDQMEEERLGPHEWVEWSIGR
DRAWDWASTRLPLRDGDESRIRCMVGGKDAGKLEIPSKIKIFPHDGTATFFVLSTQGRLCLDRLPSVDNLIKRGVDMMNLCSFCGWESQLCMFFGYSPIVRGNRADRESV
KWVVPPKGWYKVNIDATFDGEKIIVGVGLVVRNHSGKVMAATVQFHGYVRSVDMAEGWAAADGLKLATEMGFFPALIETDSRRVFELLRGERVELSDLSMLLADALDEWS
LSWPLQSVISLREGNRLAHHLAKLAIHEQSDCG