; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016849 (gene) of Snake gourd v1 genome

Gene IDTan0016849
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG08:75558366..75559134
RNA-Seq ExpressionTan0016849
SyntenyTan0016849
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023873208.1 uncharacterized protein LOC111985788 [Quercus suber]1.8e-1226.16Show/hide
Query:  MNWHATV--EWNADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNF
        M W+  V  EW+A+L            + K  ++ WS+W  RN+   +NGG  +      N  L YL ++++      +ET++    +   WKPP    F
Subjt:  MNWHATV--EWNADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNF

Query:  KLNTGATIFKNESRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIAPELG-------------------QIIEEIKEM-----------------AR
        KLN    +F ++  +GVGALIRDE+G  +  L + +      +  +A A E+G                    +I  +KE+                 + 
Subjt:  KLNTGATIFKNESRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIAPELG-------------------QIIEEIKEM-----------------AR

Query:  KMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMED
        +     FS   R+ N  AHSLA +AS   + ++W+E+
Subjt:  KMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMED

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber]2.1e-1326.83Show/hide
Query:  VMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGNAMITL
        V  W +W+ RN  +  +GG  +   +     + +L++F     +++ + L+P      +W+PP  P FK+N  A IF +  RSG GA+IR+E G  M  L
Subjt:  VMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGNAMITL

Query:  MQLVPWMTDIVTAKAIA---------------------------------PEL---GQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEA
          + P ++    A+ +A                                 P+L   G ++ +IK +   +   +FSW +R  N +AH+LA+ A+   E+ 
Subjt:  MQLVPWMTDIVTAKAIA---------------------------------PEL---GQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEA

Query:  MWMED
         WMED
Subjt:  MWMED

XP_023920779.1 uncharacterized protein LOC112032247 [Quercus suber]4.3e-1427.7Show/hide
Query:  SQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPP-IFPNFKLNTGATIFKNESRSGVGALIRDE
        ++ E   +  W +WN RN  +   GG  +   +   +   +L +FR    + +   +  +     +W+PP +FP FKLN  A +F   S SG+GA+IR++
Subjt:  SQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPP-IFPNFKLNTGATIFKNESRSGVGALIRDE

Query:  KGNAMITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARH
         G  M  +    P + D   A+ +A                                      LG II++IK +AR     +FS+  R AN++AH LAR+
Subjt:  KGNAMITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARH

Query:  ASDFPEEAMWMED
        A    E+  WMED
Subjt:  ASDFPEEAMWMED

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber]2.1e-1629.25Show/hide
Query:  SQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEK
        ++ E   +  W +WN RN  +   GG  +   +   +   +L +F   ++      +  +     +W+PP    FKLN  A +F   S SGVGA+IR+E 
Subjt:  SQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEK

Query:  GNAMITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHA
        G  M T++  VP + D V A+ IA                                      LG II++IK + R     +FS+  R ANS+A+ LAR+A
Subjt:  GNAMITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHA

Query:  SDFPEEAMWMED
         D  E+  WMED
Subjt:  SDFPEEAMWMED

XP_030929162.1 uncharacterized protein LOC115955232 [Quercus lobata]4.8e-1328.23Show/hide
Query:  EKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGNA
        E   V  W +WN RNK +  +GG            L YL + +  N + +   +  T      WKPP    +KLN  A IF N + SG GA+IR+E+G  
Subjt:  EKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGNA

Query:  MITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF
        M  +    PW+ +   A+A+                                       LG I E+I+ +   + + + S   R  N +AH LARHA   
Subjt:  MITLMQLVPWMTDIVTAKAIAPE------------------------------------LGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF

Query:  PEEAMWMED
         +E  WMED
Subjt:  PEEAMWMED

TrEMBL top hitse value%identityAlignment
A0A2N9EV70 Uncharacterized protein1.1e-1227.43Show/hide
Query:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE
        ADL     ++  +   E   V+ WS+W  RNK  + N  VE  +    +    YL +F   N    S   EPT  +   W+PP   NFK N    IFK  
Subjt:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE

Query:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIA-------------PE-----------------------LGQIIEEIKEMARKMMNCTFSWCDR
        + +G+G ++R+ +G  M +L+Q V +   + + +A A             PE                        G +IE++K     +++  F+   R
Subjt:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIA-------------PE-----------------------LGQIIEEIKEMARKMMNCTFSWCDR

Query:  RANSLAHSLARHASDFPEEAMWMEDV
        + N++AH+LAR A +     +WMEDV
Subjt:  RANSLAHSLARHASDFPEEAMWMEDV

A0A2N9F7G9 zf-RVT domain-containing protein7.4e-1228Show/hide
Query:  VEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGN
        +EK  V  W +W+ RN  + +     +    +WN    YL +F       K E  +P   R   WKPP+   +K N    IFK  +  G+G +IRD  G 
Subjt:  VEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKGN

Query:  AMITLMQLVPWMTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMEDV
         + T+                        E+ K + ++    + S   R  NS+AH+LAR ASD     +W+E+V
Subjt:  AMITLMQLVPWMTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMEDV

A0A2N9HRK6 Uncharacterized protein4.4e-1225.66Show/hide
Query:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE
        ADL     ++  +   E   ++ WS+W  RNK  + +  V   +         YL +F + N +    T EP   +   W+PP   NFK N    IFK  
Subjt:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE

Query:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIA---------------------------------PEL---GQIIEEIKEMARKMMNCTFSWCDR
        + +G+G ++R+ +G  M +L+Q V +   + + +A A                                 P L   G +IE++K +   +++  F+   R
Subjt:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIA---------------------------------PEL---GQIIEEIKEMARKMMNCTFSWCDR

Query:  RANSLAHSLARHASDFPEEAMWMEDV
        + N++AH+LAR A +     +WMEDV
Subjt:  RANSLAHSLARHASDFPEEAMWMEDV

A0A2N9IVV8 Uncharacterized protein3.6e-1430.1Show/hide
Query:  SNLLKSQVEKLFVML-WSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGA
        S  LK    +LF+++ W++W  RNK  +Q       I    +    YL+Q+    E++K    +P       W+PP    +K+N    +FK  + +G+G 
Subjt:  SNLLKSQVEKLFVML-WSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNESRSGVGA

Query:  LIRDEKGNAMITLMQLVPWMTDIVTAKAIA----------PEL---GQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMEDV
        ++RD  G  M +L Q V +   + + +A A          P L   G +I + K +A K+ N +FS   R+ N LAH+LAR A       +WME V
Subjt:  LIRDEKGNAMITLMQLVPWMTDIVTAKAIA----------PEL---GQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMEDV

A0A7J6HML3 Uncharacterized protein7.4e-1222.77Show/hide
Query:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE
        AD++WW + +L   +  K   + W VW  RN  + Q+  ++  I  SW   L        + E ++  +  P ++ +  W PP    F +NT A++   +
Subjt:  ADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE

Query:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIAPEL------------------------------------GQIIEEIKEMARKMMNCTFSWCDR
           G+ A+IRD KG  ++     +P    ++ A+A A  L                                    GQ++++IK +  K     F +  R
Subjt:  SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIAPEL------------------------------------GQIIEEIKEMARKMMNCTFSWCDR

Query:  RANSLAHSLARHASDFPEEAMWME
          N +A+SLA+ +    +  MW +
Subjt:  RANSLAHSLARHASDFPEEAMWME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.6e-0621.21Show/hide
Query:  MLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSE--TLEPTVERH--CLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKG---
        +LW +W +RN+ + +  G E   P         ++ F +++ R + E     P VER+    WK P +   K NT AT      R G+G ++R+E G   
Subjt:  MLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSE--TLEPTVERH--CLWKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKG---

Query:  -----------NAMITLMQLVPW---------------------MTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF
                   N +   ++ + W                     + +++ +    P L   +E+I+++        F +  R  N +A  +AR +  F
Subjt:  -----------NAMITLMQLVPW---------------------MTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF

AT4G29090.1 Ribonuclease H-like superfamily protein3.5e-0621.94Show/hide
Query:  MLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCL--WKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKG-----
        +LW +W  RN+ + +  G E             L+++R   E     T +P V R     W+PP     K NT AT  ++  R G+G ++R+EKG     
Subjt:  MLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCL--WKPPIFPNFKLNTGATIFKNESRSGVGALIRDEKG-----

Query:  ---------NAMITLMQLVPW---------------------MTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF
                 + +   ++ + W                     + +I+    I P L   I++++ +  +     F +  R  N+LA  +AR +  F
Subjt:  ---------NAMITLMQLVPW---------------------MTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGGCATGCAACAGTTGAATGGAATGCTGACTTAATGTGGTGGTTCTATTCAAACCTACTGAAGAGTCAGGTTGAGAAGTTGTTTGTCATGCTGTGGAGTGTGTG
GAATACTCGAAATAAAGCTATCATTCAAAATGGAGGAGTGGAAAGGGGCATTCCTTATAGTTGGAATTTCTGCCTAACCTATTTACAACAATTTCGAGATTATAATGAGA
GGAATAAATCAGAAACTTTGGAACCAACTGTAGAACGACACTGCTTATGGAAACCCCCAATATTTCCCAACTTCAAATTAAATACAGGTGCAACGATTTTCAAAAATGAG
AGTCGAAGTGGAGTGGGTGCATTGATACGAGATGAAAAGGGGAATGCGATGATTACCTTGATGCAGCTTGTTCCATGGATGACAGACATTGTGACAGCCAAAGCAATTGC
GCCGGAGTTAGGGCAGATTATCGAAGAAATTAAAGAAATGGCAAGGAAGATGATGAACTGTACCTTCTCTTGGTGTGATCGAAGAGCAAACTCCCTGGCTCACTCTCTAG
CAAGGCATGCAAGCGACTTTCCTGAAGAGGCGATGTGGATGGAAGACGTTCTTGTTTTCTCGAGAGACTTCTACGAAGTTGAGAGAAATGAAGAAGGAACTTGCTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACTGGCATGCAACAGTTGAATGGAATGCTGACTTAATGTGGTGGTTCTATTCAAACCTACTGAAGAGTCAGGTTGAGAAGTTGTTTGTCATGCTGTGGAGTGTGTG
GAATACTCGAAATAAAGCTATCATTCAAAATGGAGGAGTGGAAAGGGGCATTCCTTATAGTTGGAATTTCTGCCTAACCTATTTACAACAATTTCGAGATTATAATGAGA
GGAATAAATCAGAAACTTTGGAACCAACTGTAGAACGACACTGCTTATGGAAACCCCCAATATTTCCCAACTTCAAATTAAATACAGGTGCAACGATTTTCAAAAATGAG
AGTCGAAGTGGAGTGGGTGCATTGATACGAGATGAAAAGGGGAATGCGATGATTACCTTGATGCAGCTTGTTCCATGGATGACAGACATTGTGACAGCCAAAGCAATTGC
GCCGGAGTTAGGGCAGATTATCGAAGAAATTAAAGAAATGGCAAGGAAGATGATGAACTGTACCTTCTCTTGGTGTGATCGAAGAGCAAACTCCCTGGCTCACTCTCTAG
CAAGGCATGCAAGCGACTTTCCTGAAGAGGCGATGTGGATGGAAGACGTTCTTGTTTTCTCGAGAGACTTCTACGAAGTTGAGAGAAATGAAGAAGGAACTTGCTCGTAA
Protein sequenceShow/hide protein sequence
MNWHATVEWNADLMWWFYSNLLKSQVEKLFVMLWSVWNTRNKAIIQNGGVERGIPYSWNFCLTYLQQFRDYNERNKSETLEPTVERHCLWKPPIFPNFKLNTGATIFKNE
SRSGVGALIRDEKGNAMITLMQLVPWMTDIVTAKAIAPELGQIIEEIKEMARKMMNCTFSWCDRRANSLAHSLARHASDFPEEAMWMEDVLVFSRDFYEVERNEEGTCS