; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012919 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012919
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr1:45727460..45730042
RNA-Seq ExpressionLag0012919
SyntenyLag0012919
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]5.4e-1029.92Show/hide
Query:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPD--------IIHIKSSIEEGI-------REWQDSYLKRSSPDSARSQASHVQWS
        R  WT K+ WNW+V   S E++A   ++ W +W  +N+++   +  D        ++ I S+I++G         +  D YL R   +        V+WS
Subjt:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPD--------IIHIKSSIEEGI-------REWQDSYLKRSSPDSARSQASHVQWS

Query:  KPKSKFWKLNADATWFDKLGRGGVGWV
         P +  WKLN DA+W ++   GG+GW+
Subjt:  KPKSKFWKLNADATWFDKLGRGGVGWV

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia]1.3e-1126.77Show/hide
Query:  EYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADN-KIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASHVQWSKPKSKFWKLNADATWFDKLGR
        +Y++W+  +        G +L+WS+W ++N+ V    + P    I++  E  I E+  + L  +     ++    V W+ P    WKLN DATW D L  
Subjt:  EYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADN-KIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASHVQWSKPKSKFWKLNADATWFDKLGR

Query:  GGVGWV-------------------------KISLEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVAHWVAREASVF
        GG+GW+                          I +E+E D L V+  + + S  L+++  I ++I    + L    F H   + + VAH +AR A VF
Subjt:  GGVGWV-------------------------KISLEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVAHWVAREASVF

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]1.8e-0830.51Show/hide
Query:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFWKL
        R  WT KEYW W+++    E+  +  I+   +W  +NK++      +   I+ +I+  I     QD+ LKR S D          +  +W  P S  WKL
Subjt:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFWKL

Query:  NADATWFDKLGRGGVGWV
        N DA W       G+GW+
Subjt:  NADATWFDKLGRGGVGWV

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]1.8e-0830.83Show/hide
Query:  EGRKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFW
        E R  WT KEYW W+++    E+  +  I+   +W  +NK++      +   I+ +I+  I     QD+ LKR S D          +  +W  P S  W
Subjt:  EGRKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFW

Query:  KLNADATWFDKLGRGGVGWV
        KLN DA W       G+GW+
Subjt:  KLNADATWFDKLGRGGVGWV

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.5e-0926.79Show/hide
Query:  MVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASH-VQWSKPKSKFWKLNADATWFDKLGRGGVGW
        M++  S EDL    I  W +WN +N  +   +      +   + + + E   SY   +S        ++ ++W  P    W LNADA+W D   RGG+GW
Subjt:  MVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASH-VQWSKPKSKFWKLNADATWFDKLGRGGVGW

Query:  VKIS------------------------------------------LEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVA
        +  S                                          L IE DS  V   L R+  DL+K   + +EIL+L D      F+   RET+  A
Subjt:  VKIS------------------------------------------LEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVA

Query:  HWVAREASV
        H +A+ ASV
Subjt:  HWVAREASV

TrEMBL top hitse value%identityAlignment
A0A6J1CQG0 uncharacterized protein LOC1110132162.6e-1029.92Show/hide
Query:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPD--------IIHIKSSIEEGI-------REWQDSYLKRSSPDSARSQASHVQWS
        R  WT K+ WNW+V   S E++A   ++ W +W  +N+++   +  D        ++ I S+I++G         +  D YL R   +        V+WS
Subjt:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPD--------IIHIKSSIEEGI-------REWQDSYLKRSSPDSARSQASHVQWS

Query:  KPKSKFWKLNADATWFDKLGRGGVGWV
         P +  WKLN DA+W ++   GG+GW+
Subjt:  KPKSKFWKLNADATWFDKLGRGGVGWV

A0A6J1D4B6 uncharacterized protein LOC1110171816.3e-1226.77Show/hide
Query:  EYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADN-KIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASHVQWSKPKSKFWKLNADATWFDKLGR
        +Y++W+  +        G +L+WS+W ++N+ V    + P    I++  E  I E+  + L  +     ++    V W+ P    WKLN DATW D L  
Subjt:  EYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADN-KIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASHVQWSKPKSKFWKLNADATWFDKLGR

Query:  GGVGWV-------------------------KISLEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVAHWVAREASVF
        GG+GW+                          I +E+E D L V+  + + S  L+++  I ++I    + L    F H   + + VAH +AR A VF
Subjt:  GGVGWV-------------------------KISLEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVAHWVAREASVF

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X18.5e-0930.51Show/hide
Query:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFWKL
        R  WT KEYW W+++    E+  +  I+   +W  +NK++      +   I+ +I+  I     QD+ LKR S D          +  +W  P S  WKL
Subjt:  RKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFWKL

Query:  NADATWFDKLGRGGVGWV
        N DA W       G+GW+
Subjt:  NADATWFDKLGRGGVGWV

A0A6J1DNV9 uncharacterized protein LOC1110224031.7e-0926.79Show/hide
Query:  MVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASH-VQWSKPKSKFWKLNADATWFDKLGRGGVGW
        M++  S EDL    I  W +WN +N  +   +      +   + + + E   SY   +S        ++ ++W  P    W LNADA+W D   RGG+GW
Subjt:  MVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASH-VQWSKPKSKFWKLNADATWFDKLGRGGVGW

Query:  VKIS------------------------------------------LEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVA
        +  S                                          L IE DS  V   L R+  DL+K   + +EIL+L D      F+   RET+  A
Subjt:  VKIS------------------------------------------LEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVA

Query:  HWVAREASV
        H +A+ ASV
Subjt:  HWVAREASV

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X28.5e-0930.83Show/hide
Query:  EGRKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFW
        E R  WT KEYW W+++    E+  +  I+   +W  +NK++      +   I+ +I+  I     QD+ LKR S D          +  +W  P S  W
Subjt:  EGRKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGI--REWQDSYLKRSSPD----SARSQASHVQWSKPKSKFW

Query:  KLNADATWFDKLGRGGVGWV
        KLN DA W       G+GW+
Subjt:  KLNADATWFDKLGRGGVGWV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAGAGCCCGACAGAAACAAGTTAGAAGAAGAGAATGAGGAGGATGGTGGAATGAACGAAGGTAGGAAAGGATGGACGGAAAAGGAATATTGGAATTGGATGGTTGA
AAACCAAAGCGGGGAAGATTTAGCAAAAGGCTCGATTTTAATGTGGAGCTTATGGAATTTTCAGAACAAGGCAGTAGCAGACAACAAAATCCCCGATATTATCCACATCA
AGTCTTCGATCGAGGAAGGCATACGAGAATGGCAAGACTCTTACCTTAAGAGGAGCAGTCCAGATTCGGCTAGGAGCCAAGCGAGTCATGTTCAGTGGAGCAAACCGAAG
TCAAAATTCTGGAAATTAAACGCGGACGCTACCTGGTTCGACAAATTGGGCAGAGGAGGTGTTGGCTGGGTCAAGATTTCCCTCGAAATTGAATTTGATTCGCTCGCTGT
GATTCAAGCGTTGAAGAGGGAATCAAACGATCTGTCGAAGTTGAGGCCCATCACAGACGAAATCCTCTCCCTCGCAGATCGCCTGCACACCGGGATATTCTCGCACTGCT
TCAGAGAAACCAGCTCAGTCGCCCACTGGGTTGCTAGGGAAGCCTCTGTTTTTTATTTTGATTTTGGTTGTATACATGAGACATCATTGTCTATGGAAAAAGGGCAATCT
TTTTGGGTCCTTGAAGTTCCTTCGTTTATTTGGCCCCTCATTAATGAGGGTAGTTGTTCAGGTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAGAGCCCGACAGAAACAAGTTAGAAGAAGAGAATGAGGAGGATGGTGGAATGAACGAAGGTAGGAAAGGATGGACGGAAAAGGAATATTGGAATTGGATGGTTGA
AAACCAAAGCGGGGAAGATTTAGCAAAAGGCTCGATTTTAATGTGGAGCTTATGGAATTTTCAGAACAAGGCAGTAGCAGACAACAAAATCCCCGATATTATCCACATCA
AGTCTTCGATCGAGGAAGGCATACGAGAATGGCAAGACTCTTACCTTAAGAGGAGCAGTCCAGATTCGGCTAGGAGCCAAGCGAGTCATGTTCAGTGGAGCAAACCGAAG
TCAAAATTCTGGAAATTAAACGCGGACGCTACCTGGTTCGACAAATTGGGCAGAGGAGGTGTTGGCTGGGTCAAGATTTCCCTCGAAATTGAATTTGATTCGCTCGCTGT
GATTCAAGCGTTGAAGAGGGAATCAAACGATCTGTCGAAGTTGAGGCCCATCACAGACGAAATCCTCTCCCTCGCAGATCGCCTGCACACCGGGATATTCTCGCACTGCT
TCAGAGAAACCAGCTCAGTCGCCCACTGGGTTGCTAGGGAAGCCTCTGTTTTTTATTTTGATTTTGGTTGTATACATGAGACATCATTGTCTATGGAAAAAGGGCAATCT
TTTTGGGTCCTTGAAGTTCCTTCGTTTATTTGGCCCCTCATTAATGAGGGTAGTTGTTCAGGTGGTTAG
Protein sequenceShow/hide protein sequence
MREPDRNKLEEENEEDGGMNEGRKGWTEKEYWNWMVENQSGEDLAKGSILMWSLWNFQNKAVADNKIPDIIHIKSSIEEGIREWQDSYLKRSSPDSARSQASHVQWSKPK
SKFWKLNADATWFDKLGRGGVGWVKISLEIEFDSLAVIQALKRESNDLSKLRPITDEILSLADRLHTGIFSHCFRETSSVAHWVAREASVFYFDFGCIHETSLSMEKGQS
FWVLEVPSFIWPLINEGSCSGG