CuGenDBv2

Lag0035474 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035474
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr3:22255990..22261065
RNA-Seq Expression: Lag0035474
Synteny: Lag0035474
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
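
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so add 1 to the span.
    length = end - start + 1
    return chrom, start, end, length


print(parse_location("chr3:22255990..22261065"))
```

Under that assumption the gene spans 5076 bp on chr3.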


Homology
GenBank top hits (e value, % identity, alignment)
KAF4360260.1 hypothetical protein F8388_020551 [Cannabis sativa] (e value: 2.5e-10; % identity: 21.99)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHH-QITPDPMKMKTTI
        C  C N+ ET TH  W C   K +W    L+P           S  D L S+      N+FE      ++ + W IW++RN  +++  +      +    
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHH-QITPDPMKMKTTI

Query:  QKYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG------------------------------FFGIGSGLRSIPN
          Y          T +  ++ T+     T + WI PP G   ++C   +  G    G                               + I   L ++PN
Subjt:  QKYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG------------------------------FFGIGSGLRSIPN

Query:  ETPK-IQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
         T   ++++ D  ++V  ++++   LT ++  I   +  ++  N  ++ +V R +N  AH+LA K    +T++I+T  FP W
Subjt:  ETPK-IQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

KAF4363292.1 hypothetical protein F8388_001833 [Cannabis sativa] (e value: 3.9e-11; % identity: 23.02)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ
        C  C ++ ET TH  W C   K +W    L+P        KN S  D + ++ +    ++FE+     ++ + W IW++RN  +      + + +   IQ
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ

Query:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS
             F    +  N  ++  T          WI PP G   ++C   +  G       + I   L+  P+     ++++ D   +V  + +    L+ +S
Subjt:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS

Query:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
          +   Q  ++  N   + +V R +N  AH+LA K  D + + I+T  FP W
Subjt:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

KAF4391449.1 hypothetical protein G4B88_005520 [Cannabis sativa] (e value: 6.7e-11; % identity: 23.02)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ
        C  C ++ ET TH  W C   K +W    L+P        KN S  D + ++ +    ++FE+     ++ + W IW++RN  +      + + +   IQ
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ

Query:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS
             F    +  N  ++  T          WI PP G   ++C   +  G       + I   L+  P+     ++++ D   +V  + +    L+ +S
Subjt:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS

Query:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
          +   Q  ++  N   + +V R +N  AH+LA K  D + + I+T  FP W
Subjt:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 1.5e-10; % identity: 28.9)
Query:  RNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQKYLA
        R K ETT H+ WECK+ K +W     +     +  R NW+  +Y E  W   +  + E+RR   SM++  QIW+ RN      +  +   ++  I +Y+ 
Subjt:  RNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQKYLA

Query:  KFNIQEEETNLAQEIATSFP----APTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGIGSGLRSIPNETPK
          N   ++TNL ++     P       T + W PP    WKL+          +D   GIG  LR    E  K
Subjt:  KFNIQEEETNLAQEIATSFP----APTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGIGSGLRSIPNETPK

XP_027109098.1 uncharacterized protein LOC113728951 [Coffea arabica] (e value: 1.5e-10; % identity: 23.73)
Query:  MCYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTI
        +C  CR   ET  H+F+ C     +W    L  +G+     K W   + L    KD+   +  K R+  ++ + WQIWK RN +  ++   DP   +TT+
Subjt:  MCYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTI

Query:  QKYLAKF-NIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-------FFGIGSGLRSIPNET--------------------
         K +  +    E +   A+ +  S         W  P EG  K++    L +  +  G       + G   G  ++P                       
Subjt:  QKYLAKF-NIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-------FFGIGSGLRSIPNET--------------------

Query:  ----PKIQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEWKLILEPIGCSGA
             K+  E D   VVR L    + +   +  + D + L+   +     +  RA+N ++H LA KA  L+ S  W  +FP W L L    C G+
Subjt:  ----PKIQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEWKLILEPIGCSGA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GI95 Reverse transcriptase domain-containing protein (e value: 2.1e-10; % identity: 24.37)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ
        C  C N+ ETT H  W CK  + +W     LP G    G    + +D+++ +W    T       +    ++ W IW HRN V  HQ T    ++    Q
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ

Query:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGI---------GSGLRSIPNE---------------------
        + + KF  +++       ++ S    +    W PP EG +K++    + R     G   I         GS  + +P                       
Subjt:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGI---------GSGLRSIPNE---------------------

Query:  -TPKIQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFP
          P+I++E D+  VV  LL  G   T   + I D + + +  +    ++V R  N +AHLLA +A   ++ E+W    P
Subjt:  -TPKIQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFP

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 7.2e-11; % identity: 28.9)
Query:  RNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQKYLA
        R K ETT H+ WECK+ K +W     +     +  R NW+  +Y E  W   +  + E+RR   SM++  QIW+ RN      +  +   ++  I +Y+ 
Subjt:  RNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQKYLA

Query:  KFNIQEEETNLAQEIATSFP----APTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGIGSGLRSIPNETPK
          N   ++TNL ++     P       T + W PP    WKL+          +D   GIG  LR    E  K
Subjt:  KFNIQEEETNLAQEIATSFP----APTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGIGSGLRSIPNETPK

A0A7J6ERF5 Uncharacterized protein (e value: 1.2e-10; % identity: 21.99)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHH-QITPDPMKMKTTI
        C  C N+ ET TH  W C   K +W    L+P           S  D L S+      N+FE      ++ + W IW++RN  +++  +      +    
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHH-QITPDPMKMKTTI

Query:  QKYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG------------------------------FFGIGSGLRSIPN
          Y          T +  ++ T+     T + WI PP G   ++C   +  G    G                               + I   L ++PN
Subjt:  QKYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG------------------------------FFGIGSGLRSIPN

Query:  ETPK-IQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
         T   ++++ D  ++V  ++++   LT ++  I   +  ++  N  ++ +V R +N  AH+LA K    +T++I+T  FP W
Subjt:  ETPK-IQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

A0A7J6EXZ1 RNase H domain-containing protein (e value: 1.9e-11; % identity: 23.02)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ
        C  C ++ ET TH  W C   K +W    L+P        KN S  D + ++ +    ++FE+     ++ + W IW++RN  +      + + +   IQ
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ

Query:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS
             F    +  N  ++  T          WI PP G   ++C   +  G       + I   L+  P+     ++++ D   +V  + +    L+ +S
Subjt:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS

Query:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
          +   Q  ++  N   + +V R +N  AH+LA K  D + + I+T  FP W
Subjt:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

A0A7J6HAE5 RNase H domain-containing protein (e value: 3.2e-11; % identity: 23.02)
Query:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ
        C  C ++ ET TH  W C   K +W    L+P        KN S  D + ++ +    ++FE+     ++ + W IW++RN  +      + + +   IQ
Subjt:  CYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQ

Query:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS
             F    +  N  ++  T          WI PP G   ++C   +  G       + I   L+  P+     ++++ D   +V  + +    L+ +S
Subjt:  KYLAKFNIQEEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDG-FFGIGSGLRSIPN-ETPKIQVEIDASNVVRLLLDEGQDLTEIS

Query:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW
          +   Q  ++  N   + +V R +N  AH+LA K  D + + I+T  FP W
Subjt:  NFIADAQALIKVHNIEAVRYVPRAHNRMAHLLASKACDLQTSEIWTIDFPEW

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTGTTATTTTTGCAGGAACAAATATGAGACAACAACTCACTTGTTCTGGGAATGCAAATTGACTAAGGGCTTATGGTCATATTTTGGTCTTCTTCCTAATGGAATGTG
TTTTAATGGCAGGAAAAATTGGAGCCCTTCGGATTATTTAGAAAGCATTTGGAAGGATAGCAGGACAAACCAGTTCGAGAAGAGAAGAATATGCGCTAGTATGGTTATGT
GCTGGCAGATATGGAAGCATAGGAATGACGTTTTTCATCATCAAATCACACCAGACCCAATGAAGATGAAGACTACAATTCAGAAATACTTGGCAAAGTTCAACATTCAA
GAGGAAGAAACGAACTTGGCTCAAGAGATAGCAACATCGTTTCCAGCTCCGACGACGAACTCGACTTGGATCCCGCCGCCTGAAGGCATTTGGAAATTGAGTTGTGGTCT
GAAACTCGAAAGAGGGGAGGAATCAGATGGATTCTTTGGGATTGGTTCGGGTCTTCGATCCATTCCGAATGAAACTCCAAAAATACAAGTGGAGATTGATGCATCAAATG
TCGTTCGTCTTCTATTAGATGAGGGCCAAGATCTGACTGAGATTTCAAACTTTATTGCCGATGCCCAAGCCCTGATAAAGGTACATAATATTGAAGCAGTGAGATATGTC
CCTAGAGCACACAATAGGATGGCACATCTGTTGGCTTCTAAAGCTTGTGACCTTCAAACTTCTGAAATTTGGACTATTGATTTTCCTGAGTGGAAATTAATATTGGAGCC
AATTGGATGTTCGGGGGCGAAAAGAGGTCAAAGAATGAAAAAGAATCAAAGAAGAAAAAGTCAAAATAGGGTCAGCCGCAGACCAGCGTCTCGTCGCCGCCTTTCCTTAT
CTGAATTAGACGGCAACAGCGCACAGCGTCGAGACGCTGCGACCTTAGCGTCTCGACGCTCTCGAAATTCCCTTAAACAGAATGTGCGGCAGGCGATAGCGTCATGA
mRNA sequence
ATGTGTTATTTTTGCAGGAACAAATATGAGACAACAACTCACTTGTTCTGGGAATGCAAATTGACTAAGGGCTTATGGTCATATTTTGGTCTTCTTCCTAATGGAATGTG
TTTTAATGGCAGGAAAAATTGGAGCCCTTCGGATTATTTAGAAAGCATTTGGAAGGATAGCAGGACAAACCAGTTCGAGAAGAGAAGAATATGCGCTAGTATGGTTATGT
GCTGGCAGATATGGAAGCATAGGAATGACGTTTTTCATCATCAAATCACACCAGACCCAATGAAGATGAAGACTACAATTCAGAAATACTTGGCAAAGTTCAACATTCAA
GAGGAAGAAACGAACTTGGCTCAAGAGATAGCAACATCGTTTCCAGCTCCGACGACGAACTCGACTTGGATCCCGCCGCCTGAAGGCATTTGGAAATTGAGTTGTGGTCT
GAAACTCGAAAGAGGGGAGGAATCAGATGGATTCTTTGGGATTGGTTCGGGTCTTCGATCCATTCCGAATGAAACTCCAAAAATACAAGTGGAGATTGATGCATCAAATG
TCGTTCGTCTTCTATTAGATGAGGGCCAAGATCTGACTGAGATTTCAAACTTTATTGCCGATGCCCAAGCCCTGATAAAGGTACATAATATTGAAGCAGTGAGATATGTC
CCTAGAGCACACAATAGGATGGCACATCTGTTGGCTTCTAAAGCTTGTGACCTTCAAACTTCTGAAATTTGGACTATTGATTTTCCTGAGTGGAAATTAATATTGGAGCC
AATTGGATGTTCGGGGGCGAAAAGAGGTCAAAGAATGAAAAAGAATCAAAGAAGAAAAAGTCAAAATAGGGTCAGCCGCAGACCAGCGTCTCGTCGCCGCCTTTCCTTAT
CTGAATTAGACGGCAACAGCGCACAGCGTCGAGACGCTGCGACCTTAGCGTCTCGACGCTCTCGAAATTCCCTTAAACAGAATGTGCGGCAGGCGATAGCGTCATGA
Protein sequence
MCYFCRNKYETTTHLFWECKLTKGLWSYFGLLPNGMCFNGRKNWSPSDYLESIWKDSRTNQFEKRRICASMVMCWQIWKHRNDVFHHQITPDPMKMKTTIQKYLAKFNIQ
EEETNLAQEIATSFPAPTTNSTWIPPPEGIWKLSCGLKLERGEESDGFFGIGSGLRSIPNETPKIQVEIDASNVVRLLLDEGQDLTEISNFIADAQALIKVHNIEAVRYV
PRAHNRMAHLLASKACDLQTSEIWTIDFPEWKLILEPIGCSGAKRGQRMKKNQRRKSQNRVSRRPASRRRLSLSELDGNSAQRRDAATLASRRSRNSLKQNVRQAIAS
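
The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. The sketch below is illustrative and not part of the database record: the `translate` helper is hypothetical, and it assumes the standard nuclear genetic code and a CDS that starts in frame at its first base.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Strip whitespace so line-wrapped sequences can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTGTTATTTTTGCAGGAACAAATATGAG"))
```

The first ten codons translate to MCYFCRNKYE, which matches the start of the protein sequence listed above. Pasting the full CDS into `translate` gives the complete comparison.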