; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018801 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018801
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease h domain
Genome locationchr5:34713677..34714279
RNA-Seq ExpressionLag0018801
SyntenyLag0018801
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH81978.1 ribonuclease H protein [Trifolium medium]4.1e-1137.41Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W+PP QG+ K N DAS+ D+  A G GW  RD  G FI AG   I  + +    E  A+ E +  L   I R +  ++IESDS + I  +       SE 
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYS
          L    + L L V      F RR  N  AH+LAR  YS
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.2e-1237.04Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W+PP    WK+NT+A+W      GGIGWI RD  G  I A  + I  + ++  +E+ AI EGLR++  +  RP   I +ESDS  AI +L+ + ++ +E 
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
          L +E   +  ++  VS     R+ N  AH LAR
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]5.7e-1337.04Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W PP    W +N DASWSDS   GGIGWI R   G  + AG + +    ++K +E  AI+EGLR+ LT +G   P + IE+DS+   ++LN + E+ ++ 
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
          + +E   L      ++F+   R+ N  AH+LA+
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]3.3e-1337.14Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQ-----ILIESDSSVAIAILNHEAE
        W+PP    WK+NTDA+W      GGIGWI RD  G  I A  + I  + ++  +E+ AI EGLR++  +  RP  Q     I +ESDS  AI +L+ + +
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQ-----ILIESDSSVAIAILNHEAE

Query:  EFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
        + +E   L +E   +  ++  VS     R+ N  AH+LAR
Subjt:  EFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

XP_028949310.1 uncharacterized protein LOC114821394 [Malus domestica]4.1e-1132.41Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W  P  G  K+NTDASW  +    G+GW+ R+  G    AGG  ++        E  AI     +LL  I   +  ++IESD+ V I ++ HE +     
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSHNFDYG
          +  + E LA  +  VSFS+  R+ N  AH++A+ V+    +YG
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSHNFDYG

TrEMBL top hitse value%identityAlignment
A0A1J3ELU2 RNase H domain-containing protein1.8e-0931.72Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLR-SLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSE
        W+PP  GW K N D +W+      GIGW+ RDS G  ++ GG+KISR   +    L+  +E LR ++L      +  I+ E+DS  A+  L+ + E + +
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLR-SLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSE

Query:  ARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSH-NFD
         R +  +   L  ++      F  RD N  A  +A+   S+ N+D
Subjt:  ARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSH-NFD

A0A6J1CP26 uncharacterized protein LOC1110134121.0e-1237.04Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W+PP    WK+NT+A+W      GGIGWI RD  G  I A  + I  + ++  +E+ AI EGLR++  +  RP   I +ESDS  AI +L+ + ++ +E 
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
          L +E   +  ++  VS     R+ N  AH LAR
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

A0A6J1CQG0 uncharacterized protein LOC1110132161.4e-0939.78Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHE
        W  P    WK+NTDASWS+ ++ GGIGWI  D  G  + AG  KI  K ++  +EL  I+ GL+ +  +   P   I +ESDS   I ++  E
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHE

A0A6J1DNV9 uncharacterized protein LOC1110224032.8e-1337.04Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W PP    W +N DASWSDS   GGIGWI R   G  + AG + +    ++K +E  AI+EGLR+ LT +G   P + IE+DS+   ++LN + E+ ++ 
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
          + +E   L      ++F+   R+ N  AH+LA+
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

A0A6J1DSV1 uncharacterized protein LOC1110236081.6e-1337.14Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQ-----ILIESDSSVAIAILNHEAE
        W+PP    WK+NTDA+W      GGIGWI RD  G  I A  + I  + ++  +E+ AI EGLR++  +  RP  Q     I +ESDS  AI +L+ + +
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQ-----ILIESDSSVAIAILNHEAE

Query:  EFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
        + +E   L +E   +  ++  VS     R+ N  AH+LAR
Subjt:  EFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0627.21Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKC-MELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSE
        WR P++GW K N D S+ +       GW+ RDS GS++ A G+ I RK D     E++A++  ++   +     + ++  E D+ +   ++N     F  
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKC-MELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSE

Query:  ARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR
           + D    L  +   + F++ RR  N  A  LA+
Subjt:  ARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLAR

AT4G29090.1 Ribonuclease H-like superfamily protein3.0e-1232.32Show/hide
Query:  SSQGPWRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGR-PFPQILIESDSSVAIAILNHEA
        SS G WRPP   W K NTDA+W+   +  GIGW+ R+  G   + G + + +   +K + L+A +E +R  +  + R  +  ++ ESDS V I ILN++ 
Subjt:  SSQGPWRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGR-PFPQILIESDSSVAIAILNHEA

Query:  EEFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSH-NFD---YGICENFLASS
        E +   +    + + L  +   V F F  R+ N  A  +AR   S  N+D   Y I  ++  SS
Subjt:  EEFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSH-NFD---YGICENFLASS

AT5G19270.1 unknown protein2.5e-0627.03Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAI---VEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEF
        W PP  G  K N +A W ++    G+ WIARD  G  ++      +R  +    EL+ I   V+ LR L  +       ++I SD   AIA L +   ++
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAI---VEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEF

Query:  SEARVLADEAEGLALEVGCVSFSFSRRD-QNCQAHNLARLVYSHNFDYGICENFLASSPSEEGFFCLAALPPWLASSIYSEKEVG
            +  D      + + C SF F   +  +C A+++AR +       G    +L    SE G       P WL + + SE + G
Subjt:  SEARVLADEAEGLALEVGCVSFSFSRRD-QNCQAHNLARLVYSHNFDYGICENFLASSPSEEGFFCLAALPPWLASSIYSEKEVG

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-0623.87Show/hide
Query:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA
        W PP +   K N DAS  +     G+GWI R+S G+ I  G  K   +   +  E   ++  +++     G    +++ E D+     ++N ++      
Subjt:  WRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNHEAEEFSEA

Query:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSHNFDYGI---CENFLA
        +   D  +        + FSF  R+QN  A  LA+     N  + +   C  FL+
Subjt:  RVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSHNFDYGI---CENFLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCAGAGCTTCTCCGATCAGTTGTGAGAGCTGGTCGAGTCAGGGACCGTGGAGACCCCCTCAGCAAGGTTGGTGGAAGATAAACACGGATGCTTCCTGGTCAGA
CTCCAAAAAAGCGGGGGGTATCGGGTGGATTGCCCGTGACTCGGGCGGGTCTTTCATCTTCGCCGGAGGAAAAAAAATCAGCAGAAAGTGGGATATGAAATGCATGGAGT
TGAAAGCGATTGTGGAAGGTCTCAGAAGCTTACTAACCAAAATTGGTAGGCCTTTCCCGCAGATCCTCATCGAATCAGACTCCTCTGTTGCGATCGCCATTTTGAATCAC
GAAGCCGAAGAGTTTTCTGAAGCGCGGGTTCTGGCAGACGAAGCAGAGGGATTGGCGCTGGAAGTGGGTTGTGTTTCCTTCTCTTTCAGCCGGCGAGACCAAAATTGTCA
AGCGCACAATCTGGCGCGCTTAGTTTATAGCCATAATTTTGATTATGGAATCTGTGAGAATTTTTTGGCCTCTTCCCCTTCAGAAGAGGGATTTTTTTGTTTGGCTGCTT
TGCCTCCCTGGTTGGCTTCTTCGATTTACTCGGAGAAAGAAGTGGGTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCAGAGCTTCTCCGATCAGTTGTGAGAGCTGGTCGAGTCAGGGACCGTGGAGACCCCCTCAGCAAGGTTGGTGGAAGATAAACACGGATGCTTCCTGGTCAGA
CTCCAAAAAAGCGGGGGGTATCGGGTGGATTGCCCGTGACTCGGGCGGGTCTTTCATCTTCGCCGGAGGAAAAAAAATCAGCAGAAAGTGGGATATGAAATGCATGGAGT
TGAAAGCGATTGTGGAAGGTCTCAGAAGCTTACTAACCAAAATTGGTAGGCCTTTCCCGCAGATCCTCATCGAATCAGACTCCTCTGTTGCGATCGCCATTTTGAATCAC
GAAGCCGAAGAGTTTTCTGAAGCGCGGGTTCTGGCAGACGAAGCAGAGGGATTGGCGCTGGAAGTGGGTTGTGTTTCCTTCTCTTTCAGCCGGCGAGACCAAAATTGTCA
AGCGCACAATCTGGCGCGCTTAGTTTATAGCCATAATTTTGATTATGGAATCTGTGAGAATTTTTTGGCCTCTTCCCCTTCAGAAGAGGGATTTTTTTGTTTGGCTGCTT
TGCCTCCCTGGTTGGCTTCTTCGATTTACTCGGAGAAAGAAGTGGGTGGTTAG
Protein sequenceShow/hide protein sequence
MEFRASPISCESWSSQGPWRPPQQGWWKINTDASWSDSKKAGGIGWIARDSGGSFIFAGGKKISRKWDMKCMELKAIVEGLRSLLTKIGRPFPQILIESDSSVAIAILNH
EAEEFSEARVLADEAEGLALEVGCVSFSFSRRDQNCQAHNLARLVYSHNFDYGICENFLASSPSEEGFFCLAALPPWLASSIYSEKEVGG