; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021547 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021547
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr7:9010347..9012653
RNA-Seq ExpressionLag0021547
SyntenyLag0021547
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG71533.1 hypothetical protein EZV62_000112 [Acer yangbiense]2.0e-1726.84Show/hide
Query:  FALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN
        FA    S  +  F E + +   +L   +FEL+ I WW VW  RN            +++ +  +++  +       G  +  + Q  E      W  PP 
Subjt:  FALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN

Query:  RVLKLNIDASRCWSV---------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVA
          +K+N DA+  + +          +AE  A+ +G++ A    F    +E+D+L +V+ +N +    SEVG++++DI  +L  +      F+PR GN+VA
Subjt:  RVLKLNIDASRCWSV---------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVA

Query:  HVLARLAFSYV-DRVWLEEWPSEVSDVLRGD
        H LA+L  ++  + VWL + P  V +++ GD
Subjt:  HVLARLAFSYV-DRVWLEEWPSEVSDVLRGD

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.6e-1728.57Show/hide
Query:  FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNN-LFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRC
        F   I  M  + +  + EL++++ W +WS RN  +F G +SD R L A +   L A+     R     ++        ++  W+PP   VLKLN+DA+  
Subjt:  FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNN-LFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRC

Query:  WS------------------------------VDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLF
                                        V LAE  A++ G+Q+A Q+     +VE+D   +V++LN      +E+  ++ D+RR    +   +  F
Subjt:  WS------------------------------VDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLF

Query:  IPRQGNRVAHVLARLAF--SYVDRVWLEEWPSEVSDVL
        IPR  N  AH LA+ A   S  D VW+  +P+EV +VL
Subjt:  IPRQGNRVAHVLARLAF--SYVDRVWLEEWPSEVSDVL

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]3.9e-2133.62Show/hide
Query:  VVIFWWSVWSLRN----NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEER-GVWRPPPNRVLKLNIDA------------------
        +V+  W++W+ RN    +   GG     DL ++S +YL  +     +   R S  A    +  R  +WRPP   +LK+N+DA                  
Subjt:  VVIFWWSVWSLRN----NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEER-GVWRPPPNRVLKLNIDA------------------

Query:  ------------SRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFIPRQGNRVAHVLAR
                    +R   VD  EG+AVY+GI LA + GF+ F +ETDSLR+  +L  +  D SEVG+L   I+  LS         F  R GN  AH+LA+
Subjt:  ------------SRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFIPRQGNRVAHVLAR

Query:  LAFSYVD-RVWLEEWPSEVSDVLRGDFAS
        LA +    ++W+EEWP E+S VL  D  S
Subjt:  LAFSYVD-RVWLEEWPSEVSDVLRGDFAS

XP_023884925.1 uncharacterized protein LOC111997106 [Quercus suber]2.4e-1528.63Show/hide
Query:  DKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWSV------
        ++L+  + EL     W VW+ RN L  GGQ      L   + +Y+  F     R GV      Q  +Q     W+PPP    K+N DA+    +      
Subjt:  DKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWSV------

Query:  ------------------------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS--PWVNGKVLFIPRQGNR
                                D AE  A  + I+ A   GF   +VE D++ +V+ ++  + ++S +G+++DDIR +L    WV+  +  I R GN 
Subjt:  ------------------------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS--PWVNGKVLFIPRQGNR

Query:  VAHVLARLAFSYVDR--VWLEEWPSEVSDVLRGD
        VAHVLA+ A + +D    WLE+ P   ++ L  D
Subjt:  VAHVLARLAFSYVDR--VWLEEWPSEVSDVLRGD

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber]5.8e-1729.75Show/hide
Query:  FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDA---
        F+E++G    KL+  +FEL VI  W +W+ RN + +GG+  D + L  ++ ++L  FH   G+  +  S    S       VWRPPP+   KLN DA   
Subjt:  FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDA---

Query:  --SRCWSVD-------------------------LAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLF
            C  V                          +AE  A  + ++ A + GF D VVE D+L ++K L     D+S +G ++ DI+ +   +      +
Subjt:  --SRCWSVD-------------------------LAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLF

Query:  IPRQGNRVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFA
        + R  N VA+ LAR A   + D  W+E+ P  V + L  DF+
Subjt:  IPRQGNRVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFA

TrEMBL top hitse value%identityAlignment
A0A2P6SAP1 Putative RNA-directed DNA polymerase7.7e-1529.13Show/hide
Query:  EEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---
        EE +  + D +T     L  ++ W +WS RNNL W G   +  +   ++S +L  +        V  +   +SG    R  W  PP   LK+NID S   
Subjt:  EEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---

Query:  ---------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFI
                                      S    E  A   G+ LA    + +F++E+D   +V  LN  L D SEVG ++DD +  L+   + K+  +
Subjt:  ---------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFI

Query:  PRQGNRVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFAS---GRFSVSLSL
         R+ N VA+ LA LA S +++ VWLEE P  + DVL  D  +   G+ S S S+
Subjt:  PRQGNRVAHVLARLAFS-YVDRVWLEEWPSEVSDVLRGDFAS---GRFSVSLSL

A0A5B7BI33 Uncharacterized protein (Fragment)4.3e-1827.31Show/hide
Query:  FHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVL
        F  +  H  F +    + +KL     EL  +  W VW  RN ++   Q  D   +   +  YL+ +H    R  +  S+ +++G      VW PPP  + 
Subjt:  FHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVL

Query:  KLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS
        KLN+D S                               C S D AE  A++ G+  A+++G VD ++E+D L LV  +     D S +G + DDIRR + 
Subjt:  KLNIDAS------------------------------RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS

Query:  PWVNGKVLFIPRQGNRVAHVLARLAFSYVDR-VWLEEWPSEVSDVLRGD
           + +V  + R  N+ AH +A  A    +  +W+E  PS    VL  D
Subjt:  PWVNGKVLFIPRQGNRVAHVLARLAFSYVDR-VWLEEWPSEVSDVLRGD

A0A5C7IQ65 RNase H domain-containing protein9.7e-1826.84Show/hide
Query:  FALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN
        FA    S  +  F E + +   +L   +FEL+ I WW VW  RN            +++ +  +++  +       G  +  + Q  E      W  PP 
Subjt:  FALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ-SDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPN

Query:  RVLKLNIDASRCWSV---------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVA
          +K+N DA+  + +          +AE  A+ +G++ A    F    +E+D+L +V+ +N +    SEVG++++DI  +L  +      F+PR GN+VA
Subjt:  RVLKLNIDASRCWSV---------DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVA

Query:  HVLARLAFSYV-DRVWLEEWPSEVSDVLRGD
        H LA+L  ++  + VWL + P  V +++ GD
Subjt:  HVLARLAFSYV-DRVWLEEWPSEVSDVLRGD

A0A6J1C467 uncharacterized protein LOC1110077751.2e-1528.33Show/hide
Query:  DKLTGPDFELVVIFWWSVWSLRN-NLFWGGQSDGR-DLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDA-----------
        D++     E + +F W++W+ RN ++F  G      ++  + +DYL  +        +   L  + G       W PP     K+N+DA           
Subjt:  DKLTGPDFELVVIFWWSVWSLRN-NLFWGGQSDGR-DLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDA-----------

Query:  --------------SRCW-----SVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSP-WVNGKVLFIPRQGNR
                      + C+      V LAE  A  +G+ LA + G + F +ETDS ++  +L  +  D SE+G+L   IR I+S   + G   F+ R+GN 
Subjt:  --------------SRCW-----SVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSP-WVNGKVLFIPRQGNR

Query:  VAHVLARLAF-SYVDRVWLEEWPSEVSDVLRGD
         AH LAR+   S    VW+EEW S++S+V+  D
Subjt:  VAHVLARLAF-SYVDRVWLEEWPSEVSDVLRGD

A0A6J1DBJ7 uncharacterized protein LOC1110189731.9e-2133.62Show/hide
Query:  VVIFWWSVWSLRN----NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEER-GVWRPPPNRVLKLNIDA------------------
        +V+  W++W+ RN    +   GG     DL ++S +YL  +     +   R S  A    +  R  +WRPP   +LK+N+DA                  
Subjt:  VVIFWWSVWSLRN----NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEER-GVWRPPPNRVLKLNIDA------------------

Query:  ------------SRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFIPRQGNRVAHVLAR
                    +R   VD  EG+AVY+GI LA + GF+ F +ETDSLR+  +L  +  D SEVG+L   I+  LS         F  R GN  AH+LA+
Subjt:  ------------SRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFIPRQGNRVAHVLAR

Query:  LAFSYVD-RVWLEEWPSEVSDVLRGDFAS
        LA +    ++W+EEWP E+S VL  D  S
Subjt:  LAFSYVD-RVWLEEWPSEVSDVLRGDFAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-1025.73Show/hide
Query:  ELVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV--GGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLA
        +LV    W +W  RN L F G + + +++   + D L  + +      CG +  +      +   G WRPPP++ +K N DA+      RC   W +   
Subjt:  ELVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV--GGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLA

Query:  EGWAVYKGIQLARQLGFV--------------------DFVV-ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVAHVLARL
        +G   + G +   +L  V                    ++V+ E+DS  L++ILN +      +   + D++R+LS +   K +FIPR+GN +A  +AR 
Subjt:  EGWAVYKGIQLARQLGFV--------------------DFVV-ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVAHVLARL

Query:  AFSYVD
        + S+++
Subjt:  AFSYVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTTGGGTTCCAAATTTGCCCTCTTCCACCAATCCTTTTCTCATTTCAGGTTCGAGGAAATCATTGGGGCGATGAGGGACAAACTGACAGGGCCTGATTTTGAGCT
TGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGGAATAATCTGTTTTGGGGTGGGCAGTCAGACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCCATGCCT
TCCATGTCGGTGGGGGACGTTGCGGGGTAAGGGACTCCTTATGGGCTCAGTCGGGGGAGCAAGAAGAGCGCGGTGTATGGAGACCGCCCCCTAATAGGGTGCTGAAACTT
AATATTGATGCTTCAAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGAC
TGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGACGGATCCTCAGTCCTTGGGTCAACGGTA
AGGTGTTGTTTATTCCACGTCAGGGGAATAGGGTGGCGCATGTTCTGGCCCGCCTGGCCTTTTCATATGTTGATCGTGTATGGCTTGAGGAGTGGCCTAGCGAGGTCTCG
GATGTCCTGAGGGGTGATTTTGCGTCAGGAAGGTTTTCTGTTTCCCTTTCACTTGTCGTTTATACATTGTCTTATGGTTGCATTGTAGATGTTCCTGGGAATAATTTGAA
TTGTGGTCCGTTATGTGGGTGCTGGGTCTGGTCTGAGTGGGGTTTGGTTTTGGCCTTAGAAGGGGGTGCGAAAGCTAGTGCCCTGCCTCTGAGTGGAAAGAAAATGGGGG
AAGATGGACGGTGGGTGCTTATAAATCTGGTCAGGGCTTCTAGGGCTGCTCTAGGTATATTTCGCCTCTTCTGGGTAGGATTTAGCTTTTGCCATGCAAGGGCGTTATCT
ATGAGGGTTCCCGATTCCAGAGCCCTTCAGTGGAAAGTGTGTATGATGGCTTTCGCTAGAATGTCGGCTCCTATGGTTTCTTGTTGGGAAGCTCAAGGGGCTGGAGGTGT
GTCGTGGCAGATGGGCCTGTGGATAGGATGGATCAAGATAATGCGTCATCCTGCCATATTGTGCCATCAAGATTCCAGGTGGATGGACCAGTGCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTTGGGTTCCAAATTTGCCCTCTTCCACCAATCCTTTTCTCATTTCAGGTTCGAGGAAATCATTGGGGCGATGAGGGACAAACTGACAGGGCCTGATTTTGAGCT
TGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGGAATAATCTGTTTTGGGGTGGGCAGTCAGACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCCATGCCT
TCCATGTCGGTGGGGGACGTTGCGGGGTAAGGGACTCCTTATGGGCTCAGTCGGGGGAGCAAGAAGAGCGCGGTGTATGGAGACCGCCCCCTAATAGGGTGCTGAAACTT
AATATTGATGCTTCAAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGAC
TGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGACGGATCCTCAGTCCTTGGGTCAACGGTA
AGGTGTTGTTTATTCCACGTCAGGGGAATAGGGTGGCGCATGTTCTGGCCCGCCTGGCCTTTTCATATGTTGATCGTGTATGGCTTGAGGAGTGGCCTAGCGAGGTCTCG
GATGTCCTGAGGGGTGATTTTGCGTCAGGAAGGTTTTCTGTTTCCCTTTCACTTGTCGTTTATACATTGTCTTATGGTTGCATTGTAGATGTTCCTGGGAATAATTTGAA
TTGTGGTCCGTTATGTGGGTGCTGGGTCTGGTCTGAGTGGGGTTTGGTTTTGGCCTTAGAAGGGGGTGCGAAAGCTAGTGCCCTGCCTCTGAGTGGAAAGAAAATGGGGG
AAGATGGACGGTGGGTGCTTATAAATCTGGTCAGGGCTTCTAGGGCTGCTCTAGGTATATTTCGCCTCTTCTGGGTAGGATTTAGCTTTTGCCATGCAAGGGCGTTATCT
ATGAGGGTTCCCGATTCCAGAGCCCTTCAGTGGAAAGTGTGTATGATGGCTTTCGCTAGAATGTCGGCTCCTATGGTTTCTTGTTGGGAAGCTCAAGGGGCTGGAGGTGT
GTCGTGGCAGATGGGCCTGTGGATAGGATGGATCAAGATAATGCGTCATCCTGCCATATTGTGCCATCAAGATTCCAGGTGGATGGACCAGTGCCCATGA
Protein sequenceShow/hide protein sequence
MWLGSKFALFHQSFSHFRFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGVRDSLWAQSGEQEERGVWRPPPNRVLKL
NIDASRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFIPRQGNRVAHVLARLAFSYVDRVWLEEWPSEVS
DVLRGDFASGRFSVSLSLVVYTLSYGCIVDVPGNNLNCGPLCGCWVWSEWGLVLALEGGAKASALPLSGKKMGEDGRWVLINLVRASRAALGIFRLFWVGFSFCHARALS
MRVPDSRALQWKVCMMAFARMSAPMVSCWEAQGAGGVSWQMGLWIGWIKIMRHPAILCHQDSRWMDQCP