CuGenDBv2

Tan0018429 (gene) of Snake gourd v1 genome

Gene ID: Tan0018429
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG04:11007350..11007778
RNA-Seq Expression: Tan0018429
Synteny: Tan0018429
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
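
The genome location field can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:11007350..11007778")
print(seqid, span_length(start, end))  # LG04 429
```

Under that assumption the gene spans 429 bp on LG04, which can be compared against the length of the CDS listed under Sequences.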


Homology
GenBank top hits (e value, % identity, alignment)
CAB4273082.1 unnamed protein product [Prunus armeniaca] (e value: 5.5e-10; % identity: 31.16)
Query:  DDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH------ESQGDWTPPDPDYWKLNCDVSWMNKVNVGGIG
        D L +    +W++W +RN       V+   +   A+        EF  A+ H  PL            E+   W  P     K+NCD +W+++  +GG+G
Subjt:  DDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH------ESQGDWTPPDPDYWKLNCDVSWMNKVNVGGIG

Query:  WVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA
        WV+RDS+G ++ AGGK   R      +EA+AI E L A
Subjt:  WVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA

XP_021808158.1 uncharacterized protein LOC110751913 [Prunus avium] (e value: 1.2e-09; % identity: 31.11)
Query:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQG-DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVI
        +A+ +   +  +W++W  RN     +   + AD  +A   +     E+  A++   P        S    W  P P   K+NCDV+W  ++  GG+GWVI
Subjt:  SADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQG-DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVI

Query:  RDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA
        RDS+G L+CAGG+   R     ++E  AI   L A
Subjt:  RDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 7.1e-10; % identity: 30.77)
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFL-------------AEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNKVNVGG
        ++II W++W +R     N  +      E   RDI LA + + +               K    + R+E + +   W PP  + WKLN + +W    N GG
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFL-------------AEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNKVNVGG

Query:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK
        IGW++RD  G +I A  + I+    I  LE  AI EGL A  +
Subjt:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 7.6e-12; % identity: 33.57)
Query:  MTKNLSADDLDLAIIIIWKVWSVRNFI------SSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNK
        M    S +DLD+ +I  W +W+ RN++      SS + ++    K       S  + E  L+  HK       T  ++  W PP    W LN D SW + 
Subjt:  MTKNLSADDLDLAIIIIWKVWSVRNFI------SSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNK

Query:  VNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL
         + GGIGW+IR  +G ++ AG + ++    +K+LEA AILEGL
Subjt:  VNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 7.1e-10; % identity: 34.51)
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAE-------KHKFPLTRLETHESQGD-----WTPPDPDYWKLNCDVSWMNKVNVGGI
        ++II W++W +RN  S   GV S        RDI L  + + +         K K     L      GD     W PP  + WKLN D +W    N GGI
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAE-------KHKFPLTRLETHESQGD-----WTPPDPDYWKLNCDVSWMNKVNVGGI

Query:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK
        GW++RD  G +I A  + I+    I  LE  AI EGL A  +
Subjt:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 3.4e-10; % identity: 30.77)
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFL-------------AEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNKVNVGG
        ++II W++W +R     N  +      E   RDI LA + + +               K    + R+E + +   W PP  + WKLN + +W    N GG
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFL-------------AEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNKVNVGG

Query:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK
        IGW++RD  G +I A  + I+    I  LE  AI EGL A  +
Subjt:  IGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 3.7e-12; % identity: 33.57)
Query:  MTKNLSADDLDLAIIIIWKVWSVRNFI------SSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNK
        M    S +DLD+ +I  W +W+ RN++      SS + ++    K       S  + E  L+  HK       T  ++  W PP    W LN D SW + 
Subjt:  MTKNLSADDLDLAIIIIWKVWSVRNFI------SSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNK

Query:  VNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL
         + GGIGW+IR  +G ++ AG + ++    +K+LEA AILEGL
Subjt:  VNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 3.4e-10; % identity: 34.51)
Query:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAE-------KHKFPLTRLETHESQGD-----WTPPDPDYWKLNCDVSWMNKVNVGGI
        ++II W++W +RN  S   GV S        RDI L  + + +         K K     L      GD     W PP  + WKLN D +W    N GGI
Subjt:  AIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAE-------KHKFPLTRLETHESQGD-----WTPPDPDYWKLNCDVSWMNKVNVGGI

Query:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK
        GW++RD  G +I A  + I+    I  LE  AI EGL A  +
Subjt:  GWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEAFNK

A0A6J5UAY2 Reverse transcriptase domain-containing protein (e value: 2.6e-10; % identity: 31.16)
Query:  DDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH------ESQGDWTPPDPDYWKLNCDVSWMNKVNVGGIG
        D L +    +W++W +RN       V+   +   A+        EF  A+ H  PL            E+   W  P     K+NCD +W+++  +GG+G
Subjt:  DDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH------ESQGDWTPPDPDYWKLNCDVSWMNKVNVGGIG

Query:  WVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA
        WV+RDS+G ++ AGGK   R      +EA+AI E L A
Subjt:  WVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLEA

A0A6J5V7L7 RNase H domain-containing protein (e value: 2.2e-09; % identity: 30.94)
Query:  ADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH--------ESQGDWTPPDPDYWKLNCDVSWMNKVNVG
        A+ L L   ++W++W  RN +     +V   D   A+  +    +EF   E      T+L+          E+   W+ P P + KLNCD +W+ +   G
Subjt:  ADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETH--------ESQGDWTPPDPDYWKLNCDVSWMNKVNVG

Query:  GIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL
        G+GWV+RD  G  I AGG    R     + EA+A+ E L
Subjt:  GIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) (e value: 2.5e-05; % identity: 32.31)
Query:  DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLE
        +W+PP   Y K N D  ++   +     W+IRDSNG +I +G  ++++S+     EA   L  L+
Subjt:  DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLE

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.5e-05; % identity: 28.07)
Query:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQG-----------DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVI
        ++W++W       S N +V N  + K    + +A  +       K  L    T+E Q             W+PP  D  K N D S   +  V G+GW++
Subjt:  IIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQG-----------DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVI

Query:  RDSNGSLI-CAGGK
        R+S G++I C  GK
Subjt:  RDSNGSLI-CAGGK
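
The % identity values can be checked against the alignment midlines. The sketch below assumes the usual BLAST convention for the middle row (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes % identity is identical columns divided by alignment length; `percent_identity` is a hypothetical helper, and the strings are the Query and midline rows of the AT4G09775.1 hit.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity of one alignment block.

    Letters in the midline count as identities; the length of the
    query row (gap characters included) is the alignment length.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(query_row)


# Query and midline rows copied from the AT4G09775.1 alignment.
query = "DWTPPDPDYWKLNCDVSWMNKVNVGGIGWVIRDSNGSLICAGGKQIKRSWPIKVLEAKAILEGLE"
midline = "+W+PP   Y K N D  ++   +     W+IRDSNG +I +G  ++++S+     EA   L  L+"

print(round(percent_identity(query, midline), 2))  # 32.31
```

Here 21 identical columns over 65 alignment columns reproduce the 32.31 reported for this hit. For hits split across several Query blocks, the counts would need to be summed over all blocks before dividing.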


Sequences
CDS sequence
ATGACTAAAAATTTATCTGCGGATGACCTGGATCTGGCCATTATTATTATCTGGAAGGTTTGGAGTGTCAGAAACTTTATTTCTTCTAACAATGGTGTGGTTTCAAACGC
AGATAAGGAGAAGGCCATTAGAGATATCTCGCTCGCAAAGGAGGAATTCTTCTTAGCAGAGAAGCATAAGTTCCCTTTGACAAGATTGGAGACTCACGAGAGTCAAGGAG
ATTGGACCCCTCCGGACCCAGACTATTGGAAGCTAAATTGCGATGTTTCCTGGATGAATAAAGTCAATGTTGGTGGTATTGGTTGGGTTATCCGTGACTCTAATGGCTCT
CTGATTTGTGCAGGAGGGAAGCAAATTAAAAGAAGTTGGCCAATTAAAGTGCTGGAAGCGAAGGCGATTCTTGAGGGTCTTGAAGCGTTTAACAAGTGA
mRNA sequence
ATGACTAAAAATTTATCTGCGGATGACCTGGATCTGGCCATTATTATTATCTGGAAGGTTTGGAGTGTCAGAAACTTTATTTCTTCTAACAATGGTGTGGTTTCAAACGC
AGATAAGGAGAAGGCCATTAGAGATATCTCGCTCGCAAAGGAGGAATTCTTCTTAGCAGAGAAGCATAAGTTCCCTTTGACAAGATTGGAGACTCACGAGAGTCAAGGAG
ATTGGACCCCTCCGGACCCAGACTATTGGAAGCTAAATTGCGATGTTTCCTGGATGAATAAAGTCAATGTTGGTGGTATTGGTTGGGTTATCCGTGACTCTAATGGCTCT
CTGATTTGTGCAGGAGGGAAGCAAATTAAAAGAAGTTGGCCAATTAAAGTGCTGGAAGCGAAGGCGATTCTTGAGGGTCTTGAAGCGTTTAACAAGTGA
Protein sequence
MTKNLSADDLDLAIIIIWKVWSVRNFISSNNGVVSNADKEKAIRDISLAKEEFFLAEKHKFPLTRLETHESQGDWTPPDPDYWKLNCDVSWMNKVNVGGIGWVIRDSNGS
LICAGGKQIKRSWPIKVLEAKAILEGLEAFNK
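
The protein sequence can be cross-checked against the CDS by translating it. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and the two test fragments are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACTAAAAATTTATCTGCGGATGACCTG"))  # MTKNLSADDL
print(translate("GAAGCGTTTAACAAGTGA"))              # EAFNK
```

Both fragments agree with the start (MTKNLSADDL) and end (EAFNK) of the protein sequence above. Passing the full CDS, with line breaks removed, should give the complete protein under the same assumption.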