; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021221 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021221
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr7:5653118..5653807
RNA-Seq ExpressionLag0021221
SyntenyLag0021221
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia]4.2e-2836.59Show/hide
Query:  ESTDHIQIQCPKAQEIWKLTFNHELLL-VNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIKNYLREYLQENSNNNR
        E T H    C +A++IW L F       +N N SF D W  L+  L+ ++  L A T WAIW DRN+  H   V +P +R  WI +Y + Y Q   N   
Subjt:  ESTDHIQIQCPKAQEIWKLTFNHELLL-VNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIKNYLREYLQENSNNNR

Query:  SPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGYENVILDSGCLHVINL
        SP  +  V      W PP  + +K+N DAA +     TG+G+I RD FG+LL A S F+ +  +P  AE++ + E +KL+ S  Y  ++++S C   I L
Subjt:  SPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGYENVILDSGCLHVINL

Query:  IKGEL
        ++G+L
Subjt:  IKGEL

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]2.3e-2634.47Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR
        + K GI     C  C  + E+TDH   +C +A+E+W +         +FN+S  D    L   LS+ +  L+ +  WAIW DRN +  ++ +P   IR+ 
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR

Query:  WIKNYLREY------------LQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAEL
        WI  Y+R++            +QE++++N   + N         W+PPPA W+KINVDAA K    RTGIG++CR+  G +L A+S       DP +AE 
Subjt:  WIKNYLREY------------LQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAEL

Query:  QALSEG
         AL +G
Subjt:  QALSEG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]5.5e-2030.22Show/hide
Query:  GIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNH-ELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIK
        GI     C  C  + ES  H    C +A++IW+  F     L    N SF + W  L   L  +++ L A+T W IW DRN+++H K V   + +  W+ 
Subjt:  GIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNH-ELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIK

Query:  NYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGY
         +L  + Q    +N SP      +  +  W P  +  +K+N DAA +     T  G I RDS   L+ A+S  V     P LAE++ + EG+K + +  +
Subjt:  NYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGY

Query:  ENVILDSGCLHVINLIKGELEVRNE
         ++ ++S  L  I LI+ E+  R +
Subjt:  ENVILDSGCLHVINLIKGELEVRNE

XP_030483228.1 uncharacterized protein LOC115699823 [Cannabis sativa]1.2e-1926.91Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI
        +FK    T+  C  C   WES  H+   C  A+ +WK+   +FN++ ++   +    D  F+     +  E++ I  T W+IW+DRNNV+H K+   P +
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI

Query:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL
             +N+L  Y      +    ++         AW PPP   +K+NVD A+     + G G I RDS G ++ A S  +N    P   E + L   +K 
Subjt:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL

Query:  SISLGYENVILDSGCLHVINLIK
        +    ++  ++++  L + N ++
Subjt:  SISLGYENVILDSGCLHVINLIK

XP_034228798.1 uncharacterized protein LOC117637836 [Prunus dulcis]1.6e-1930.96Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIW-----KLTFNHELLLVNFNHSFSDRWFELHHVLS--------SEEMQLIAMTCWAIWTDRNNVV
        + K  I  +  CP C    E+ +HI + CP  + +W      L  NH+ +      SF  +W  L +V+         S  + +IA  CW IW DR   V
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIW-----KLTFNHELLLVNFNHSFSDRWFELHHVLS--------SEEMQLIAMTCWAIWTDRNNVV

Query:  HEKVVPSPDIRNRWIKNYLREYLQENSNNNRSPVN-NLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASS--AFVNLDFDPH
         E + PSP          + E+L    +     V+  L    Q   WNPP + +VK+NVDA+W     R GIG++ R++ G  +  SS  +  N   +  
Subjt:  HEKVVPSPDIRNRWIKNYLREYLQENSNNNRSPVN-NLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASS--AFVNLDFDPH

Query:  LAELQALSEGMKLSISLGYENVILDSGCLHVINLIKGEL
         AE QA  EG KL+  +G+  V  +S C  +I+ +KG L
Subjt:  LAELQALSEGMKLSISLGYENVILDSGCLHVINLIKGEL

TrEMBL top hitse value%identityAlignment
A0A6J1BQ49 uncharacterized protein LOC1110047862.0e-2836.59Show/hide
Query:  ESTDHIQIQCPKAQEIWKLTFNHELLL-VNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIKNYLREYLQENSNNNR
        E T H    C +A++IW L F       +N N SF D W  L+  L+ ++  L A T WAIW DRN+  H   V +P +R  WI +Y + Y Q   N   
Subjt:  ESTDHIQIQCPKAQEIWKLTFNHELLL-VNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIKNYLREYLQENSNNNR

Query:  SPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGYENVILDSGCLHVINL
        SP  +  V      W PP  + +K+N DAA +     TG+G+I RD FG+LL A S F+ +  +P  AE++ + E +KL+ S  Y  ++++S C   I L
Subjt:  SPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGYENVILDSGCLHVINL

Query:  IKGEL
        ++G+L
Subjt:  IKGEL

A0A6J1CTE3 uncharacterized protein LOC1110145781.1e-2634.47Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR
        + K GI     C  C  + E+TDH   +C +A+E+W +         +FN+S  D    L   LS+ +  L+ +  WAIW DRN +  ++ +P   IR+ 
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR

Query:  WIKNYLREY------------LQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAEL
        WI  Y+R++            +QE++++N   + N         W+PPPA W+KINVDAA K    RTGIG++CR+  G +L A+S       DP +AE 
Subjt:  WIKNYLREY------------LQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAEL

Query:  QALSEG
         AL +G
Subjt:  QALSEG

A0A803Q8J4 Uncharacterized protein3.5e-2028.25Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI
        +FK    T+  C  C   WES  H    C  A+ +WK+   TFN++  +   + +  D  F++    +  E++ I  T W+IW+DRNNV+H K    P +
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI

Query:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL
         +   +N+L  Y      +  + ++         AW+PPP   +K+NVDAA+     + G G I RDS G +  A S  +N    P   E + L   +K 
Subjt:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL

Query:  SISLGYENVILDSGCLHVINLIK
        +  L ++  ++++  L + N ++
Subjt:  SISLGYENVILDSGCLHVINLIK

A0A803QCM2 Uncharacterized protein1.3e-1926.82Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR
        +FK    T+  C  C   WES  H    C  A+ +WK+           +    D  F++    +  E++ I  T W+IW+DRNNV+H K+   P +   
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNR

Query:  WIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSIS
          +N+L  Y         + ++         AW PPP   +K+NVDAA+  G  + G G I RDS G +  + S  +N    P   E + L   +K +  
Subjt:  WIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSIS

Query:  LGYENVILDSGCLHVINLIK
          ++  ++++  L + N ++
Subjt:  LGYENVILDSGCLHVINLIK

A0A803QIY9 Uncharacterized protein5.9e-2026.91Show/hide
Query:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI
        +FK    T+  C  C   WES  H+   C  A+ +WK+   +FN++ ++   +    D  F+     +  E++ I  T W+IW+DRNNV+H K+   P +
Subjt:  MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKL---TFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDI

Query:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL
             +N+L  Y      +    ++         AW PPP   +K+NVD A+     + G G I RDS G ++ A S  +N    P   E + L   +K 
Subjt:  RNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKL

Query:  SISLGYENVILDSGCLHVINLIK
        +    ++  ++++  L + N ++
Subjt:  SISLGYENVILDSGCLHVINLIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein3.2e-1025.34Show/hide
Query:  CPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNF---NHSFSD---RWFELHHVLSSEEMQ--LIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIK
        C RC  + E+  HI   CP  Q +W+   +  +++ N      SF D   R  +L    ++  +   L     W +W  RN  + ++   SPD   R   
Subjt:  CPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNF---NHSFSD---RWFELHHVLSSEEMQ--LIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIK

Query:  NYLREYLQEN---SNNNRSPVNNLV--VQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLS
            E+L  N    N N     N +   +   S WNPPP  WVK N D+ +  G P T  G   R+  G ++   +A +        AE       +++ 
Subjt:  NYLREYLQEN---SNNNRSPVNNLV--VQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLS

Query:  ISLGYENVILDSGCLHVINLI
         + G   V  +S    ++ LI
Subjt:  ISLGYENVILDSGCLHVINLI

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.4e-1225Show/hide
Query:  CPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEM--------QLIAMTCWAIWTDRNNVVHE-KVVPSPDIRNRWI
        C RC    E+ +H+  +C  A+ +W ++    +            +  L+ VL+ E           L+    W +W  RN ++ + K   +P++  R +
Subjt:  CPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEM--------QLIAMTCWAIWTDRNNVVHE-KVVPSPDIRNRWI

Query:  KNY----LREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLS
        +++     R  L E   +      NL VQ     W  PP  WVK N DA W++  PR GIG I R+  G +L   +  +    +   AEL+AL   +   
Subjt:  KNY----LREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLS

Query:  ISLGYENVILDSGCLHVINLIKGE
            Y+ +I +S    ++NL+  +
Subjt:  ISLGYENVILDSGCLHVINLIKGE

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-0722.17Show/hide
Query:  MKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDR-WFELHHVLS--------SEEMQLIAMTCWAIWTDRNNVV--------HEKVV
        ++CP C+   E+ +H+  +C  A+  W ++     + +     ++D  +  L+ V +         +  QL+    W +W +RN +V         E + 
Subjt:  MKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDR-WFELHHVLS--------SEEMQLIAMTCWAIWTDRNNVV--------HEKVV

Query:  PSPDIRNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALS
         + D    W     R   +  S   +  VN    +     W PPP  WVK N DA W     R GIG + R+  G +    +  +        AEL+A+ 
Subjt:  PSPDIRNRWIKNYLREYLQENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALS

Query:  EGMKLSISLGYENVILDSGCLHVINLIKGE
          +       Y  VI +S    +I ++  +
Subjt:  EGMKLSISLGYENVILDSGCLHVINLIKGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAGTCGGGGATTCAAACAACTATGAAATGCCCTAGATGTCAAGGGAAATGGGAATCAACAGATCATATCCAAATCCAATGCCCCAAAGCCCAAGAGATTTGGAA
GCTTACCTTCAATCATGAATTGCTGTTGGTGAATTTCAACCATAGCTTCTCGGATAGATGGTTTGAGCTTCATCACGTTCTTTCTTCTGAGGAGATGCAGCTGATTGCTA
TGACCTGTTGGGCAATTTGGACAGATAGAAACAATGTTGTACATGAGAAAGTTGTTCCTTCTCCAGATATTCGCAACAGATGGATTAAAAATTATCTGAGGGAGTACTTA
CAGGAGAATTCTAATAATAATCGATCCCCTGTTAATAATTTAGTAGTTCAAGAGCAAATGTCAGCTTGGAATCCGCCTCCGGCGAACTGGGTAAAGATTAATGTTGACGC
TGCTTGGAAGGTAGGTTTGCCCCGCACAGGAATTGGCGTTATATGCCGCGATTCTTTCGGATTATTGCTGGGAGCTTCATCTGCTTTTGTCAACCTGGATTTTGACCCTC
ATTTAGCCGAATTACAGGCTCTTTCAGAAGGCATGAAGCTGTCGATCTCCCTAGGTTACGAAAATGTGATTTTGGATTCTGGTTGTCTCCATGTGATTAATCTCATTAAG
GGAGAATTGGAGGTGCGAAATGAACTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAGTCGGGGATTCAAACAACTATGAAATGCCCTAGATGTCAAGGGAAATGGGAATCAACAGATCATATCCAAATCCAATGCCCCAAAGCCCAAGAGATTTGGAA
GCTTACCTTCAATCATGAATTGCTGTTGGTGAATTTCAACCATAGCTTCTCGGATAGATGGTTTGAGCTTCATCACGTTCTTTCTTCTGAGGAGATGCAGCTGATTGCTA
TGACCTGTTGGGCAATTTGGACAGATAGAAACAATGTTGTACATGAGAAAGTTGTTCCTTCTCCAGATATTCGCAACAGATGGATTAAAAATTATCTGAGGGAGTACTTA
CAGGAGAATTCTAATAATAATCGATCCCCTGTTAATAATTTAGTAGTTCAAGAGCAAATGTCAGCTTGGAATCCGCCTCCGGCGAACTGGGTAAAGATTAATGTTGACGC
TGCTTGGAAGGTAGGTTTGCCCCGCACAGGAATTGGCGTTATATGCCGCGATTCTTTCGGATTATTGCTGGGAGCTTCATCTGCTTTTGTCAACCTGGATTTTGACCCTC
ATTTAGCCGAATTACAGGCTCTTTCAGAAGGCATGAAGCTGTCGATCTCCCTAGGTTACGAAAATGTGATTTTGGATTCTGGTTGTCTCCATGTGATTAATCTCATTAAG
GGAGAATTGGAGGTGCGAAATGAACTGTGA
Protein sequenceShow/hide protein sequence
MFKSGIQTTMKCPRCQGKWESTDHIQIQCPKAQEIWKLTFNHELLLVNFNHSFSDRWFELHHVLSSEEMQLIAMTCWAIWTDRNNVVHEKVVPSPDIRNRWIKNYLREYL
QENSNNNRSPVNNLVVQEQMSAWNPPPANWVKINVDAAWKVGLPRTGIGVICRDSFGLLLGASSAFVNLDFDPHLAELQALSEGMKLSISLGYENVILDSGCLHVINLIK
GELEVRNEL