; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010202 (gene) of Snake gourd v1 genome

Gene IDTan0010202
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H domain
Genome locationLG07:4279324..4283000
RNA-Seq ExpressionTan0010202
SyntenyTan0010202
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021722108.1 uncharacterized protein LOC110689646 [Chenopodium quinoa]2.2e-1330.16Show/hide
Query:  CSKAEEIQADRH---VIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEA-STSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGL
        C     +  D H   ++  +LW +W +RN   +        D++ K VS +  +++A +A S+S       +  W+P    + K+N+D A     +  GL
Subjt:  CSKAEEIQADRH---VIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEA-STSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGL

Query:  GWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLEFLERRTLGSGVTHALAALTSSLGDSRVWNE
        G ++RDA G++++  C  L S++ V + +ALA+   L+  L    S+   L     S  F   +  G+G  HALA L+SS GD RVW E
Subjt:  GWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLEFLERRTLGSGVTHALAALTSSLGDSRVWNE

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]4.1e-1230.06Show/hide
Query:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN
        KA E +  R +I  + W +W  RN+  F        D+ L     ++++   +      ST+    L ++        W+P  +N WKLNT+ AW    N
Subjt:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN

Query:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         GG+GWI+RD  G++I   C+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]1.1e-1228.07Show/hide
Query:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD
        W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +    + +       PI  +    +  W+P  +N WKLNTD
Subjt:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD

Query:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         AW    N  G+GWI+RD  G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]1.1e-1228.07Show/hide
Query:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD
        W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +    + +       PI  +    +  W+P  +N WKLNTD
Subjt:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD

Query:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         AW    N  G+GWI+RD  G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.5e-1434.69Show/hide
Query:  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLI
        D  V+ +  W +W+ RN + F    SS   ++ ++   +      SE S S +   L  K  W+P   ++W LN D +WS   +RGG+GWI+R   G ++
Subjt:  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLI

Query:  LEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
        L G + + +   VKLL+A AILEGL+ +    +  L  L +ETDS E
Subjt:  LEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.0e-1230.06Show/hide
Query:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN
        KA E +  R +I  + W +W  RN+  F        D+ L     ++++   +      ST+    L ++        W+P  +N WKLNT+ AW    N
Subjt:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN

Query:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         GG+GWI+RD  G++I   C+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X15.2e-1328.07Show/hide
Query:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD
        W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +    + +       PI  +    +  W+P  +N WKLNTD
Subjt:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD

Query:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         AW    N  G+GWI+RD  G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

A0A6J1DNV9 uncharacterized protein LOC1110224037.3e-1534.69Show/hide
Query:  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLI
        D  V+ +  W +W+ RN + F    SS   ++ ++   +      SE S S +   L  K  W+P   ++W LN D +WS   +RGG+GWI+R   G ++
Subjt:  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLI

Query:  LEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
        L G + + +   VKLL+A AILEGL+ +    +  L  L +ETDS E
Subjt:  LEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X25.2e-1328.07Show/hide
Query:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD
        W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +    + +       PI  +    +  W+P  +N WKLNTD
Subjt:  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VSALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTD

Query:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE
         AW    N  G+GWI+RD  G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Subjt:  VAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE

A0A6J1DSV1 uncharacterized protein LOC1110236082.2e-1131.25Show/hide
Query:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVL----KVVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN
        KA E +  R +I  + W +W  RN+  F    S   D+ L     ++++          S +    L ++        W+P  +N WKLNTD AW    N
Subjt:  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVL----KVVSALRIHQRASEASTSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLN

Query:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEI
         GG+GWI+RD  G++I   C+I+ +   +  L+ +AI EGL+ I
Subjt:  RGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.4e-0723.65Show/hide
Query:  YWTCSKAEEIQADRHVIHLV---LWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPICPLTQKR---SWQPSKANVWKLNTDVAWSCKL
        YW  +   EI     + +LV   LW LW  RN++ F        +++ + +          E       P  ++     W+       K NTD  W  + 
Subjt:  YWTCSKAEEIQADRHVIHLV---LWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPICPLTQKR---SWQPSKANVWKLNTDVAWSCKL

Query:  NRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAW
         R G+GWI+R+ +G ++  G + L         +   +LE   E L W
Subjt:  NRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAW

AT4G29090.1 Ribonuclease H-like superfamily protein1.0e-0826Show/hide
Query:  VLWNLWSQRNQIHFNAGMSSPDDLVLKVVSAL---RIHQRASEASTSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCK
        +LW LW  RN++ F     +  +++ +    L   RI   A    T P    +    W+P      K NTD  W+    R G+GW++R+  G++   G +
Subjt:  VLWNLWSQRNQIHFNAGMSSPDDLVLKVVSAL---RIHQRASEASTSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCK

Query:  ILSSRWPVKLLKALAILEGLKEILAWKMSQLPS-----LIVETDSLEFLE
         L         K  ++LE   E + W +  L       +I E+DS   +E
Subjt:  ILSSRWPVKLLKALAILEGLKEILAWKMSQLPS-----LIVETDSLEFLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCTCGCGCCTGCAACCCGATTCTGAACCTCTGGGAACCGTGGGATTACTGGACTTGCTCCAAGGCTGAGGAAATCCAAGCAGATCGTCATGTTATTCATTTGGT
GTTGTGGAACTTGTGGAGTCAAAGGAACCAAATCCATTTTAATGCAGGAATGTCGAGTCCTGATGATCTTGTCCTGAAAGTTGTTTCTGCCCTTCGCATACACCAAAGAG
CTTCTGAAGCTTCTACCTCTCCAATTTGCCCCCTTACCCAAAAAAGGTCCTGGCAACCTTCGAAGGCGAATGTTTGGAAACTCAACACAGATGTCGCTTGGTCTTGTAAA
TTGAATCGTGGTGGTTTGGGGTGGATTGTTCGGGATGCTGCAGGAAAATTGATCTTGGAGGGATGCAAAATCCTCTCTTCTAGATGGCCTGTTAAACTGCTTAAAGCGCT
TGCTATTCTGGAAGGTCTGAAGGAAATCCTAGCTTGGAAAATGAGTCAACTCCCGTCTTTGATCGTTGAAACTGACTCGCTGGAGTTCCTAGAGAGGAGAACTCTCGGGT
CTGGAGTCACTCACGCTCTTGCAGCTTTGACATCCTCTTTAGGAGACTCTCGGGTCTGGAATGAGGGCTTTCTAGAGGACATTATCTCTCTCATTTTTGAGATGGGTGTG
GATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCTCGCGCCTGCAACCCGATTCTGAACCTCTGGGAACCGTGGGATTACTGGACTTGCTCCAAGGCTGAGGAAATCCAAGCAGATCGTCATGTTATTCATTTGGT
GTTGTGGAACTTGTGGAGTCAAAGGAACCAAATCCATTTTAATGCAGGAATGTCGAGTCCTGATGATCTTGTCCTGAAAGTTGTTTCTGCCCTTCGCATACACCAAAGAG
CTTCTGAAGCTTCTACCTCTCCAATTTGCCCCCTTACCCAAAAAAGGTCCTGGCAACCTTCGAAGGCGAATGTTTGGAAACTCAACACAGATGTCGCTTGGTCTTGTAAA
TTGAATCGTGGTGGTTTGGGGTGGATTGTTCGGGATGCTGCAGGAAAATTGATCTTGGAGGGATGCAAAATCCTCTCTTCTAGATGGCCTGTTAAACTGCTTAAAGCGCT
TGCTATTCTGGAAGGTCTGAAGGAAATCCTAGCTTGGAAAATGAGTCAACTCCCGTCTTTGATCGTTGAAACTGACTCGCTGGAGTTCCTAGAGAGGAGAACTCTCGGGT
CTGGAGTCACTCACGCTCTTGCAGCTTTGACATCCTCTTTAGGAGACTCTCGGGTCTGGAATGAGGGCTTTCTAGAGGACATTATCTCTCTCATTTTTGAGATGGGTGTG
GATGTTTGA
Protein sequenceShow/hide protein sequence
MAPRACNPILNLWEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPICPLTQKRSWQPSKANVWKLNTDVAWSCK
LNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLEFLERRTLGSGVTHALAALTSSLGDSRVWNEGFLEDIISLIFEMGV
DV