CuGenDBv2

Tan0016910 (gene) of Snake gourd v1 genome

Gene ID: Tan0016910
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG08:72312815..72313899
RNA-Seq Expression: Tan0016910
Synteny: Tan0016910
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
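
The genome location above is a `SEQID:START..END` string. As a minimal sketch, it can be split into its parts and turned into a span length. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG08:72312815..72313899")
print(seqid, start, end, span_length(start, end))
```

If the coordinates were instead 0-based half-open, the `+ 1` would have to be dropped.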


Homology
GenBank top hits (e value, % identity, alignment)
XP_015386106.1 uncharacterized protein LOC107177190 [Citrus sinensis] (e value: 2.8e-10; % identity: 28.79)
Query:  VWKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTY
        +WK  N+ +F  K    LG+         S  K +         GN    +QWSPP  GW  +  DA+   E    G+G +VR+  G    A    +   
Subjt:  VWKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTY

Query:  WPILVMELFGIIKGMRSISDKGIPL-MVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWLGE
          + + E   +  G++      I   + ESDSLE I LI  K    TE    I  I    +++ +   +H PR  N  AH LA+ A   ++  IWL E
Subjt:  WPILVMELFGIIKGMRSISDKGIPL-MVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWLGE

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 1.8e-17; % identity: 40.74)
Query:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARD
        QW PP    W L T+A+W  + N+GGIGW++R+ KG VI A    I     I  +E+  I +G+R+I  +   P+ +ESDSLEAI L+  + +D TE   
Subjt:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARD

Query:  FIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA
         ++ I  M +D   +  RHI R +N+ AH LA+RA
Subjt:  FIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA

XP_022148737.1 uncharacterized protein LOC111017329 [Momordica charantia] (e value: 1.3e-10; % identity: 35)
Query:  SIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRS-ISDKGIPLMVESDSLEAILLIEGKIEDCT
        S+ Q  SP    +W L TDA+W      GG+GW++RN K  +  A    I     I  +EL  I  G+ + +S   + L++ES+SLEAI LI+G  ++ T
Subjt:  SIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRS-ISDKGIPLMVESDSLEAILLIEGKIEDCT

Query:  EARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRAS
        E    +  I N  E      F+H+ R  N  A ++A RA+
Subjt:  EARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRAS

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 6.0e-21; % identity: 31.16)
Query:  IVWKHNNRRIFTVK-SAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYID
        ++W H N  IF  + S++   +         SS + +          N  +  +W PP    W+L  DASWSD  + GGIGW++R+W G ++ A + +++
Subjt:  IVWKHNNRRIFTVK-SAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYID

Query:  TYWPILVMELFGIIKGMRSISDKGI--PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL
            + ++E   I++G+R++++ G+  PL +E+DS E   L+  K ED T+    ++ I N+R+    + F  + R +N  AH LAQRAS L++  IW+
Subjt:  TYWPILVMELFGIIKGMRSISDKGI--PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 1.1e-17; % identity: 39.86)
Query:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI---------PLMVESDSLEAILLIEGKI
        +W PP    W L TDA+W  + N+GGIGW++R+ KG VI A    I T   I  +E+  I +G+R+I  +           P+ +ESDSLEAI L+  + 
Subjt:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI---------PLMVESDSLEAILLIEGKI

Query:  EDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA
        +D TE    ++ I  M ED   +  RHI R +N+ AH LA+RA
Subjt:  EDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA

TrEMBL top hits (e value, % identity, alignment)
A0A5B7BU49 RNase H domain-containing protein (Fragment) (e value: 5.0e-13; % identity: 29.23)
Query:  WKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYW
        W H N  +F  +      +              ++R +  A++  Q     WSPP +G + L   ASW    + GGIG ++R+WKG VI      I T  
Subjt:  WKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYW

Query:  PILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL
         I   E   I+ G+    D G+  L VE D L  ++ IE  +ED +E     D I   R+ +    F H+ R++N  AH++A  A  +     WL
Subjt:  PILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL

A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 8.7e-18; % identity: 40.74)
Query:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARD
        QW PP    W L T+A+W  + N+GGIGW++R+ KG VI A    I     I  +E+  I +G+R+I  +   P+ +ESDSLEAI L+  + +D TE   
Subjt:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDCTEARD

Query:  FIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA
         ++ I  M +D   +  RHI R +N+ AH LA+RA
Subjt:  FIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA

A0A6J1D5W1 uncharacterized protein LOC111017329 (e value: 6.1e-11; % identity: 35)
Query:  SIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRS-ISDKGIPLMVESDSLEAILLIEGKIEDCT
        S+ Q  SP    +W L TDA+W      GG+GW++RN K  +  A    I     I  +EL  I  G+ + +S   + L++ES+SLEAI LI+G  ++ T
Subjt:  SIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRS-ISDKGIPLMVESDSLEAILLIEGKIEDCT

Query:  EARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRAS
        E    +  I N  E      F+H+ R  N  A ++A RA+
Subjt:  EARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRAS

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 2.9e-21; % identity: 31.16)
Query:  IVWKHNNRRIFTVK-SAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYID
        ++W H N  IF  + S++   +         SS + +          N  +  +W PP    W+L  DASWSD  + GGIGW++R+W G ++ A + +++
Subjt:  IVWKHNNRRIFTVK-SAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYID

Query:  TYWPILVMELFGIIKGMRSISDKGI--PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL
            + ++E   I++G+R++++ G+  PL +E+DS E   L+  K ED T+    ++ I N+R+    + F  + R +N  AH LAQRAS L++  IW+
Subjt:  TYWPILVMELFGIIKGMRSISDKGI--PLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWL

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 5.1e-18; % identity: 39.86)
Query:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI---------PLMVESDSLEAILLIEGKI
        +W PP    W L TDA+W  + N+GGIGW++R+ KG VI A    I T   I  +E+  I +G+R+I  +           P+ +ESDSLEAI L+  + 
Subjt:  QWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI---------PLMVESDSLEAILLIEGKI

Query:  EDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA
        +D TE    ++ I  M ED   +  RHI R +N+ AH LA+RA
Subjt:  EDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.5e-06; % identity: 27.11)
Query:  QSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDC
        +S  Q+W  P  GW     D S++        GW++R+ KG    A  +   T    L  EL  ++  M+    +G   ++ E DS +   L+  K    
Subjt:  QSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILVMELFGIIKGMRSISDKGI-PLMVESDSLEAILLIEGKIEDC

Query:  TEARDFIDIIHNMREDWT------DIVFRHIPRSSNQEAHKLAQRASHLQQYDIWLGESFGLHYFI
           +      + +RE W+      +++F   PR++NQ A  LA+  SHL Q       SF  HYF+
Subjt:  TEARDFIDIIHNMREDWT------DIVFRHIPRSSNQEAHKLAQRASHLQQYDIWLGESFGLHYFI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.5e-09; % identity: 24.24)
Query:  VWKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALA------GNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARH
        +WK+ N  +F  +       N +  L  +  D E+ R R  A +       N+S   +W PP   W    TDA+W+ +    GIGW++RN KG V     
Subjt:  VWKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALA------GNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARH

Query:  SYIDTYWPILVMELFGIIKGMRSISDKGIPLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYD
          +     +L  EL  +   + S+S      ++     + ++ I    E     +  I  +  +   +T++ F  IPR  N  A ++A+ +     YD
Subjt:  SYIDTYWPILVMELFGIIKGMRSISDKGIPLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYD
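
The % identity values listed with the hits are conventionally the number of identical aligned positions divided by the alignment length. In BLAST-style output the middle line of each block shows identical residues as letters, conservative substitutions as `+`, and mismatches or gaps as spaces. The sketch below computes the percentage from such a middle line; `percent_identity` is a hypothetical helper, the input is a toy string rather than data from this record, and it assumes the middle line is kept at full alignment width, including trailing spaces.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters count as identical positions; '+' and spaces do not.
    The middle line must span the full alignment width.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Hypothetical toy middle line (not from this record):
# 3 identical positions out of 5 aligned columns.
toy_midline = "A+ EF"
print(round(percent_identity(toy_midline), 2))  # 60.0
```

For an alignment shown in several blocks, the middle lines would be concatenated (without the leading label padding) before calling the function.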


Sequences
CDS sequence
ATGAATGATGAAATAGTGTGGAAACATAACAATCGGAGAATCTTTACAGTCAAAAGTGCGTACCGACTAGGCCTTAATCACCGATGTCATCTAGATGCCTCAAGCTCTGA
TAAGGAGAAAGTCAGGGGACGAGCCCGTGCATTGGCGGGAAATCAAAGTATCCCTCAACAATGGAGCCCTCCCATTCAGGGCTGGTGGAGCCTTTGTACGGATGCCTCTT
GGAGTGACGAATTGAATAGTGGAGGTATTGGTTGGATGGTACGAAACTGGAAAGGTCGTGTGATCTGTGCAAGACATTCTTACATTGACACTTACTGGCCTATTTTGGTT
ATGGAACTATTTGGGATAATTAAAGGAATGAGATCGATCTCGGATAAAGGTATTCCTTTGATGGTGGAATCAGATTCTCTTGAAGCCATCCTTTTGATAGAAGGAAAGAT
TGAAGATTGCACAGAGGCACGAGATTTCATAGATATAATTCACAACATGCGAGAGGACTGGACTGACATTGTCTTCCGGCACATCCCTCGGTCATCGAATCAAGAAGCTC
ACAAGCTGGCACAAAGAGCATCTCATCTTCAACAATACGATATTTGGTTGGGGGAGTCTTTTGGACTCCATTACTTTATTTCATAA
mRNA sequence
ATGAATGATGAAATAGTGTGGAAACATAACAATCGGAGAATCTTTACAGTCAAAAGTGCGTACCGACTAGGCCTTAATCACCGATGTCATCTAGATGCCTCAAGCTCTGA
TAAGGAGAAAGTCAGGGGACGAGCCCGTGCATTGGCGGGAAATCAAAGTATCCCTCAACAATGGAGCCCTCCCATTCAGGGCTGGTGGAGCCTTTGTACGGATGCCTCTT
GGAGTGACGAATTGAATAGTGGAGGTATTGGTTGGATGGTACGAAACTGGAAAGGTCGTGTGATCTGTGCAAGACATTCTTACATTGACACTTACTGGCCTATTTTGGTT
ATGGAACTATTTGGGATAATTAAAGGAATGAGATCGATCTCGGATAAAGGTATTCCTTTGATGGTGGAATCAGATTCTCTTGAAGCCATCCTTTTGATAGAAGGAAAGAT
TGAAGATTGCACAGAGGCACGAGATTTCATAGATATAATTCACAACATGCGAGAGGACTGGACTGACATTGTCTTCCGGCACATCCCTCGGTCATCGAATCAAGAAGCTC
ACAAGCTGGCACAAAGAGCATCTCATCTTCAACAATACGATATTTGGTTGGGGGAGTCTTTTGGACTCCATTACTTTATTTCATAA
Protein sequence
MNDEIVWKHNNRRIFTVKSAYRLGLNHRCHLDASSSDKEKVRGRARALAGNQSIPQQWSPPIQGWWSLCTDASWSDELNSGGIGWMVRNWKGRVICARHSYIDTYWPILV
MELFGIIKGMRSISDKGIPLMVESDSLEAILLIEGKIEDCTEARDFIDIIHNMREDWTDIVFRHIPRSSNQEAHKLAQRASHLQQYDIWLGESFGLHYFIS
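
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below shows this on the first ten codons and the last four codons of the CDS listed above. The `translate` helper and the table construction are illustrative and assume the standard genetic code (NCBI translation table 1); the record does not state which table was used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGAATGATGAAATAGTGTGGAAACATAAC"))  # MNDEIVWKHN
# Last four codons of the CDS above, including the TAA stop.
print(translate("TTTATTTCATAA"))  # FIS
```

Passing the full CDS, with its line breaks left in, works the same way because whitespace is removed before translation.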