CuGenDBv2

Lag0001894 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001894
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr4:36675950..36682315
RNA-Seq Expression: Lag0001894
Synteny: Lag0001894
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain


Homology

GenBank top hits

KAG6599977.1 hypothetical protein SDJN03_05210, partial [Cucurbita argyrosperma subsp. sororia]
e value: 2.3e-04
% identity: 35.71
Alignment:
Query:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKK---ISDTC--NQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHI
        A  SE  G+GG+ WV+HD  G  IC    ++ + W +K LE KA++EGLK    I +T   + R I        LE+  ILN    D+ E+   +  I
Subjt:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKK---ISDTC--NQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHI

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]
e value: 1.5e-06
% identity: 31.78
Alignment:
Query:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHIKSIVESS
        A  S++  RGG+GW++   +G ++  G   +    ++K LEA AILEGL+ +++    R   L +E+++ EV ++LN K ED+++    ++ I ++ +S 
Subjt:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHIKSIVESS

Query:  FGMAF-RGRRFLNTYAHYVARKALDLSSS
          +AF +  R  N  AH +A++A  L  S
Subjt:  FGMAF-RGRRFLNTYAHYVARKALDLSSS

TrEMBL top hits

A0A445CQM1 RNase H domain-containing protein
e value: 9.6e-04
% identity: 26.42
Alignment:
Query:  PTSLLPLVVSESYRCSHSGPTEQRRIFGGGWKLILAKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEV
        PT+  PL+  +    S   P+  R    G +K+ +   S N  +GG+G V+ D  G ++   M ++    S++  EA +  +G+  ++ TC    + +E+
Subjt:  PTSLLPLVVSESYRCSHSGPTEQRRIFGGGWKLILAKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEV

Query:  ESEALEVINILNEKSEDISEVKTFIKHIKSIVES-SFGMAFRGRRFLNTYAHYVARKAL
        E + +EV+N LN  S       T I + K+++    F      +R  N+  H +A+ AL
Subjt:  ESEALEVINILNEKSEDISEVKTFIKHIKSIVES-SFGMAFRGRRFLNTYAHYVARKAL

A0A6J1DNV9 uncharacterized protein LOC111022403
e value: 7.1e-07
% identity: 31.78
Alignment:
Query:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHIKSIVESS
        A  S++  RGG+GW++   +G ++  G   +    ++K LEA AILEGL+ +++    R   L +E+++ EV ++LN K ED+++    ++ I ++ +S 
Subjt:  AKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEVESEALEVINILNEKSEDISEVKTFIKHIKSIVESS

Query:  FGMAF-RGRRFLNTYAHYVARKALDLSSS
          +AF +  R  N  AH +A++A  L  S
Subjt:  FGMAF-RGRRFLNTYAHYVARKALDLSSS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGTTTCGGTTTACGATTACTTTAATTTCAGACCCAAAAAATCTGAAATCTCAATTCCGTTTAGAATTTTCGATTTCTGGAAGCTTCAATGTTGGGCTGTTAGATTTACC
AAATTCCCCGATGGTCGAAATTACGACCTTGTGTATTTATGAGGTGGCACTAATGGGTTTCATTTACAGCGACAAGGGGTTTAACGTTGTCAGGTCAGCACTGGAAATTA
GTAGAAATCGTGAAGTTTTCTACGACAACGTGGGTGATTGGAGTATTCTTCTACAGAAGAACCCCTGGGGTGAGGTGAAAGACATCTCATTCATTCTCCTACTCTCAGAG
AATTCTCCAAAGTCTCCCACCAGTCTCTTGCCTCTAGTAGTCTCAGAGTCATACCGGTGTAGTCATAGTGGTCCGACTGAACAACGAAGGATTTTTGGTGGTGGATGGAA
GCTAATTTTAGCAAAGAAGAGTGAGAATCAAGGAAGAGGCGGCCTGGGTTGGGTTGTTCATGACTTGGAAGGATTCCTCATCTGTTTCGGAATGCATCAAATTCACAAAA
ATTGGTCAATGAAATCGTTGGAAGCAAAAGCTATTCTCGAAGGCCTTAAGAAGATTTCTGATACCTGTAATCAACGAGCCATCTTCCTCGAGGTCGAATCAGAAGCCCTC
GAGGTTATCAACATCCTGAATGAAAAATCAGAGGACATTTCAGAGGTGAAAACTTTCATCAAACATATCAAATCCATCGTCGAGAGCTCCTTTGGCATGGCGTTTCGTGG
TAGGCGGTTTTTGAACACATACGCGCATTATGTTGCGAGGAAGGCCTTGGATCTCTCTTCGTCTTCAGGCGATCTCCATGGCAGTAGCCCCTCGTGGGAAGTGGAGCATG
TCTTTTGGGCTCCTAACCTCCCTTTGTGGATTTTCCCTCTCTTAAATGAGGGAGGTGCGCGCATTCGGGTCAGCGTTGAGACGCTACCTCCACAGCGTCTCGACGCTGCA
AACTTTCCAGATTTAATTAGGGCGTACGCGCGACAGCGTCACGACGCTATGTCCACAGCGTCTCGACGCTGTCACGCAGAATCAGCAGTATATATATTTGCGATGCTTTC
AGAATAG

mRNA sequence
ATGTTTCGGTTTACGATTACTTTAATTTCAGACCCAAAAAATCTGAAATCTCAATTCCGTTTAGAATTTTCGATTTCTGGAAGCTTCAATGTTGGGCTGTTAGATTTACC
AAATTCCCCGATGGTCGAAATTACGACCTTGTGTATTTATGAGGTGGCACTAATGGGTTTCATTTACAGCGACAAGGGGTTTAACGTTGTCAGGTCAGCACTGGAAATTA
GTAGAAATCGTGAAGTTTTCTACGACAACGTGGGTGATTGGAGTATTCTTCTACAGAAGAACCCCTGGGGTGAGGTGAAAGACATCTCATTCATTCTCCTACTCTCAGAG
AATTCTCCAAAGTCTCCCACCAGTCTCTTGCCTCTAGTAGTCTCAGAGTCATACCGGTGTAGTCATAGTGGTCCGACTGAACAACGAAGGATTTTTGGTGGTGGATGGAA
GCTAATTTTAGCAAAGAAGAGTGAGAATCAAGGAAGAGGCGGCCTGGGTTGGGTTGTTCATGACTTGGAAGGATTCCTCATCTGTTTCGGAATGCATCAAATTCACAAAA
ATTGGTCAATGAAATCGTTGGAAGCAAAAGCTATTCTCGAAGGCCTTAAGAAGATTTCTGATACCTGTAATCAACGAGCCATCTTCCTCGAGGTCGAATCAGAAGCCCTC
GAGGTTATCAACATCCTGAATGAAAAATCAGAGGACATTTCAGAGGTGAAAACTTTCATCAAACATATCAAATCCATCGTCGAGAGCTCCTTTGGCATGGCGTTTCGTGG
TAGGCGGTTTTTGAACACATACGCGCATTATGTTGCGAGGAAGGCCTTGGATCTCTCTTCGTCTTCAGGCGATCTCCATGGCAGTAGCCCCTCGTGGGAAGTGGAGCATG
TCTTTTGGGCTCCTAACCTCCCTTTGTGGATTTTCCCTCTCTTAAATGAGGGAGGTGCGCGCATTCGGGTCAGCGTTGAGACGCTACCTCCACAGCGTCTCGACGCTGCA
AACTTTCCAGATTTAATTAGGGCGTACGCGCGACAGCGTCACGACGCTATGTCCACAGCGTCTCGACGCTGTCACGCAGAATCAGCAGTATATATATTTGCGATGCTTTC
AGAATAG

Protein sequence
MFRFTITLISDPKNLKSQFRLEFSISGSFNVGLLDLPNSPMVEITTLCIYEVALMGFIYSDKGFNVVRSALEISRNREVFYDNVGDWSILLQKNPWGEVKDISFILLLSE
NSPKSPTSLLPLVVSESYRCSHSGPTEQRRIFGGGWKLILAKKSENQGRGGLGWVVHDLEGFLICFGMHQIHKNWSMKSLEAKAILEGLKKISDTCNQRAIFLEVESEAL
EVINILNEKSEDISEVKTFIKHIKSIVESSFGMAFRGRRFLNTYAHYVARKALDLSSSSGDLHGSSPSWEVEHVFWAPNLPLWIFPLLNEGGARIRVSVETLPPQRLDAA
NFPDLIRAYARQRHDAMSTASRRCHAESAVYIFAMLSE
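
The protein sequence above appears to be the standard-genetic-code translation of the CDS. The following is a minimal Python sketch of that check, not the database's own pipeline. It assumes the standard nuclear codon table, and the names `CODON_TABLE`, `translate`, `cds_start`, and `cds_end` are hypothetical, introduced only for this illustration. Only the first 30 and last 18 nucleotides of the CDS are used, and the last 18 are assumed to be in frame.

```python
# Build the standard genetic code from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS sequence listed in this record.
cds_start = "ATGTTTCGGTTTACGATTACTTTAATTTCA"
cds_end = "GCGATGCTTTCAGAATAG"

print(translate(cds_start))  # MFRFTITLIS
print(translate(cds_end))    # AMLSE
```

The two printed fragments match the beginning (MFRFTITLIS) and end (AMLSE) of the protein sequence listed above. Passing the full CDS to `translate` would extend the same comparison to the whole sequence.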