CuGenDBv2

Tan0013366 (gene) of Snake gourd v1 genome

Gene ID: Tan0013366
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG08:4068779..4070008
RNA-Seq Expression: Tan0013366
Synteny: Tan0013366
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
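
The genome location in this record follows a "sequence:start..end" pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The helper names parse_location and span_length are hypothetical (they are not part of CuGenDB), and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

# Pattern for strings such as "LG08:4068779..4070008".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG08:4068779..4070008")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span is longer than the CDS listed under Sequences, so the two lengths should not be expected to agree.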


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] (e value 5.5e-08, identity 28.92%)
Query:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMS---RETVGRILGDTTVVRG----------WTPPVWGTWSLCIDASWCPVLN
        S  ++ T MV+ W+IW  RN+      +I+ +++ RS  L +   +       +T      D  + RG          W+ P    W L  DASW     
Subjt:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMS---RETVGRILGDTTVVRG----------WTPPVWGTWSLCIDASWCPVLN

Query:  CRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVS-DTKIPLVVESYSLEAIQLIE
          G+GWI+ D  G I+  G   +     I  LEL  II GL+ ++  ++ P+ +ES S+E I+L++
Subjt:  CRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVS-DTKIPLVVESYSLEAIQLIE

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia] (e value 2.7e-07, identity 36.89%)
Query:  MVVLWKIWTWRNKETRTSSSINREEMIRS-TKLHLQEF-------LQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG
        +V+LW IWT+RN+   ++      + IR+ T+  + EF       L M  + + R +       GWTPP    W L +DA+W   L+  GLGWIVRD +G
Subjt:  MVVLWKIWTWRNKETRTSSSINREEMIRS-TKLHLQEF-------LQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG

Query:  RII
        R I
Subjt:  RII

XP_022148737.1 uncharacterized protein LOC111017329 [Momordica charantia] (e value 1.1e-05, identity 39.77%)
Query:  WSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLK-VVSDTKIPLVVESYSLEAIQLIEGLVEYCME
        W L  DA+W       GLGWI+R+    I   G   +T    I +LEL  I  G++ VVS + + L++ES SLEAI LI+G+ +   E
Subjt:  WSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLK-VVSDTKIPLVVESYSLEAIQLIEGLVEYCME

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value 5.5e-08, identity 28.1%)
Query:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG
        S  DL   ++  W IW  RN         +   MI+     + E    S  ++  +         W PP    W+L  DASW    +  G+GWI+R WDG
Subjt:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG

Query:  RIINVGRHVVTVGWSILFLELWGIITGLKVVSDTKI--PLVVESYSLEAIQLI
         I+  G   V    ++  LE   I+ GL+ +++  +  PL +E+ S E   L+
Subjt:  RIINVGRHVVTVGWSILFLELWGIITGLKVVSDTKI--PLVVESYSLEAIQLI

XP_028948114.1 uncharacterized protein LOC114820933 [Malus domestica] (e value 1.5e-05, identity 26.95%)
Query:  IRPILGELLGGWRLFW--LDDRCDSHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRG----------W
        I  I  + L  W +F   + DR D+   ++ F   LW+IW  RN      S     E++   + ++ E+    RE +     D   VR           W
Subjt:  IRPILGELLGGWRLFW--LDDRCDSHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRG----------W

Query:  TPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVSD
          P +GT+ L  DASWC      G GW++RD+ G +   G    +        E   I TGL+  ++
Subjt:  TPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVSD

TrEMBL top hits (e value, % identity, alignment)
A0A5J5A364 RNase H domain-containing protein (e value 1.2e-05, identity 30.67%)
Query:  WKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRET-VGRILGDTTVV---RGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDGRII-NVGR
        W+IW  RN     +SS N         LH +E++ + ++T V +++G + VV     W+P   G + L +D SW P L    +G ++RDW G +I   G+
Subjt:  WKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRET-VGRILGDTTVV---RGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDGRII-NVGR

Query:  HVVTVGWSILFLELWGIITGLKVVSDTKI-PLVVESYSLEAIQLIEGLVE
        H + +  S    E+  I+ GL    D  I  + +ES  L A+  I+   E
Subjt:  HVVTVGWSILFLELWGIITGLKVVSDTKI-PLVVESYSLEAIQLIEGLVE

A0A6J1CQG0 uncharacterized protein LOC111013216 (e value 2.7e-08, identity 28.92%)
Query:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMS---RETVGRILGDTTVVRG----------WTPPVWGTWSLCIDASWCPVLN
        S  ++ T MV+ W+IW  RN+      +I+ +++ RS  L +   +       +T      D  + RG          W+ P    W L  DASW     
Subjt:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMS---RETVGRILGDTTVVRG----------WTPPVWGTWSLCIDASWCPVLN

Query:  CRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVS-DTKIPLVVESYSLEAIQLIE
          G+GWI+ D  G I+  G   +     I  LEL  II GL+ ++  ++ P+ +ES S+E I+L++
Subjt:  CRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVS-DTKIPLVVESYSLEAIQLIE

A0A6J1D4B6 uncharacterized protein LOC111017181 (e value 1.3e-07, identity 36.89%)
Query:  MVVLWKIWTWRNKETRTSSSINREEMIRS-TKLHLQEF-------LQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG
        +V+LW IWT+RN+   ++      + IR+ T+  + EF       L M  + + R +       GWTPP    W L +DA+W   L+  GLGWIVRD +G
Subjt:  MVVLWKIWTWRNKETRTSSSINREEMIRS-TKLHLQEF-------LQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG

Query:  RII
        R I
Subjt:  RII

A0A6J1D5W1 uncharacterized protein LOC111017329 (e value 5.5e-06, identity 39.77%)
Query:  WSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLK-VVSDTKIPLVVESYSLEAIQLIEGLVEYCME
        W L  DA+W       GLGWI+R+    I   G   +T    I +LEL  I  G++ VVS + + L++ES SLEAI LI+G+ +   E
Subjt:  WSLCIDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLK-VVSDTKIPLVVESYSLEAIQLIEGLVEYCME

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value 2.7e-08, identity 28.1%)
Query:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG
        S  DL   ++  W IW  RN         +   MI+     + E    S  ++  +         W PP    W+L  DASW    +  G+GWI+R WDG
Subjt:  SHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRGWTPPVWGTWSLCIDASWCPVLNCRGLGWIVRDWDG

Query:  RIINVGRHVVTVGWSILFLELWGIITGLKVVSDTKI--PLVVESYSLEAIQLI
         I+  G   V    ++  LE   I+ GL+ +++  +  PL +E+ S E   L+
Subjt:  RIINVGRHVVTVGWSILFLELWGIITGLKVVSDTKI--PLVVESYSLEAIQLI
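
The % identity values in the hit tables can be read as the share of alignment columns in which the query and subject carry the same residue. The sketch below illustrates that calculation, assuming the common BLAST-style definition (identical columns divided by alignment length, gaps included). The function name percent_identity and the toy sequences are hypothetical and are not taken from this record.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity of two gapped, equal-length alignment rows.

    ASSUMPTION: identity = identical columns / total alignment columns,
    with gap characters ('-') never counted as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Toy alignment rows (hypothetical, for illustration only).
print(round(percent_identity("ACDE-G", "ACDFHG"), 2))
```

Values computed this way from a displayed alignment may differ slightly from the reported figures if the page shows only part of the aligned region.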

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAATTTTCTTAATGCAGAGTTGGGCTCTAATTCGTCCTATACTTGGCGAACTCTTGGGTGGTTGGAGATTATTTTGGCTGGATGATCGATGCGACAGTCATGGGGA
TCTTCGTACATTTATGGTTGTTTTGTGGAAGATTTGGACTTGGAGAAACAAAGAAACCAGAACTTCTAGTTCAATAAATAGGGAGGAGATGATCAGAAGCACGAAATTAC
ACCTTCAGGAGTTCCTTCAGATGTCCAGGGAGACGGTTGGTAGAATTCTTGGGGATACGACTGTTGTTCGTGGATGGACCCCACCAGTCTGGGGGACGTGGAGTTTGTGT
ATCGATGCATCTTGGTGTCCCGTTTTAAATTGCAGGGGCTTGGGTTGGATTGTTCGAGACTGGGACGGACGGATCATTAATGTTGGTCGTCATGTTGTTACTGTGGGTTG
GTCCATTCTCTTTCTTGAGCTTTGGGGAATCATTACAGGGTTGAAAGTCGTTTCAGACACAAAAATCCCACTAGTCGTGGAGTCATACTCGCTCGAGGCCATTCAGTTGA
TCGAGGGATTGGTGGAATATTGTATGGAAACACAAGAGTTTCTGGAATAA
mRNA sequence
ATGAGAATTTTCTTAATGCAGAGTTGGGCTCTAATTCGTCCTATACTTGGCGAACTCTTGGGTGGTTGGAGATTATTTTGGCTGGATGATCGATGCGACAGTCATGGGGA
TCTTCGTACATTTATGGTTGTTTTGTGGAAGATTTGGACTTGGAGAAACAAAGAAACCAGAACTTCTAGTTCAATAAATAGGGAGGAGATGATCAGAAGCACGAAATTAC
ACCTTCAGGAGTTCCTTCAGATGTCCAGGGAGACGGTTGGTAGAATTCTTGGGGATACGACTGTTGTTCGTGGATGGACCCCACCAGTCTGGGGGACGTGGAGTTTGTGT
ATCGATGCATCTTGGTGTCCCGTTTTAAATTGCAGGGGCTTGGGTTGGATTGTTCGAGACTGGGACGGACGGATCATTAATGTTGGTCGTCATGTTGTTACTGTGGGTTG
GTCCATTCTCTTTCTTGAGCTTTGGGGAATCATTACAGGGTTGAAAGTCGTTTCAGACACAAAAATCCCACTAGTCGTGGAGTCATACTCGCTCGAGGCCATTCAGTTGA
TCGAGGGATTGGTGGAATATTGTATGGAAACACAAGAGTTTCTGGAATAA
Protein sequence
MRIFLMQSWALIRPILGELLGGWRLFWLDDRCDSHGDLRTFMVVLWKIWTWRNKETRTSSSINREEMIRSTKLHLQEFLQMSRETVGRILGDTTVVRGWTPPVWGTWSLC
IDASWCPVLNCRGLGWIVRDWDGRIINVGRHVVTVGWSILFLELWGIITGLKVVSDTKIPLVVESYSLEAIQLIEGLVEYCMETQEFLE
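
The CDS and protein sequences above are related by translation. As a minimal sketch, the snippet below translates a coding sequence with the standard genetic code (NCBI translation table 1), which is assumed here because the page does not name the table used. The function name translate is hypothetical, and the example input is only the first 27 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS shown above.
print(translate("ATGAGAATTTTCTTAATGCAGAGTTGG"))
```

Running the same function on the full CDS (with its line breaks joined) gives a string that can be compared against the protein sequence on this page as a consistency check.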