CuGenDBv2

Tan0012569 (gene) of Snake gourd v1 genome

Gene ID: Tan0012569
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG08:10533245..10533676
RNA-Seq Expression: Tan0012569
Synteny: Tan0012569
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
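
As a rough illustration of how the genome location string in this record can be read, the sketch below splits it into a sequence name and coordinates and computes the span length. The function names are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a string such as 'LG08:10533245..10533676' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seq_name, start, end = parse_location("LG08:10533245..10533676")
length = span_length(start, end)
print(seq_name, length)  # LG08 432
```

Under that assumption the span is 432 bp, which equals 3 x (143 + 1): the 143 residues of the protein sequence listed in this record plus one stop codon.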


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e-value: 9.1e-13; identity: 36.72%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK
        WKLN++A+W  + NT G+ W++RD +G +I + C   +++ +I  LE  AI EGL  + +   RP   I +ESDSL  ++LL+ +  D TEI   + E+ 
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK

Query:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK
         + + ME VS      + N  AH LA++
Subjt:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia] (e-value: 3.6e-09; identity: 35.94%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK
        WKLN DA+W++  +  GL W+VRDSEG  I + C++  SQ     LEA     G+            +I +ESD L VVN++N   M LTE+S  + ++ 
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK

Query:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK
           E++    F   P K N  AH +A++
Subjt:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e-value: 4.2e-10; identity: 34.11%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHL-HRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEV
        W LN+DASW +  +  G+ W++R  +G ++ +G    ++  +++ LEA AI EGL +L +    RP   + +E+DS  V +LLN K  DLT+    + E+
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHL-HRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEV

Query:  KDLAEAMEDVSFTFCPWKKNVGAHSLAKK
         +L ++ E ++F     + N  AHSLA++
Subjt:  KDLAEAMEDVSFTFCPWKKNVGAHSLAKK

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e-value: 4.1e-13; identity: 36.84%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQ-----IIVESDSLGVVNLLNGKEMDLTEISSN
        WKLN+DA+W  + NT G+ W++RD +G +I + C   +++ +I  LE  AI EGL  + +   RP  Q     I +ESDSL  ++LL+ +  D TEI   
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQ-----IIVESDSLGVVNLLNGKEMDLTEISSN

Query:  ITEVKDLAEAMEDVSFTFCPWKKNVGAHSLAKK
        + E+  + E M+ VS      + N  AH LA++
Subjt:  ITEVKDLAEAMEDVSFTFCPWKKNVGAHSLAKK

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis] (e-value: 7.2e-10; identity: 38.93%)
Query:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD
        KLN+DA+   +   A L  +VRD EG L C+G        SI  +EA A+Y GLL L+R  +     ++VESDS  V++ LN  E+DL+     + ++K 
Subjt:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD

Query:  LAEAMEDVSFTFCPWKK-----NVGAHSLAK
        LA   E V     PWKK     N+ AH +AK
Subjt:  LAEAMEDVSFTFCPWKK-----NVGAHSLAK

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e-value: 4.4e-13; identity: 36.72%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK
        WKLN++A+W  + NT G+ W++RD +G +I + C   +++ +I  LE  AI EGL  + +   RP   I +ESDSL  ++LL+ +  D TEI   + E+ 
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK

Query:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK
         + + ME VS      + N  AH LA++
Subjt:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK

A0A6J1D4B6 uncharacterized protein LOC111017181 (e-value: 1.7e-09; identity: 35.94%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK
        WKLN DA+W++  +  GL W+VRDSEG  I + C++  SQ     LEA     G+            +I +ESD L VVN++N   M LTE+S  + ++ 
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVK

Query:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK
           E++    F   P K N  AH +A++
Subjt:  DLAEAMEDVSFTFCPWKKNVGAHSLAKK

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e-value: 3.8e-09; identity: 42.17%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLN
        WKLN+DA+W  + NT G+ W++RD +G +I +GC   +++ +I  LE  AI EGL  + +   RP   I +ESDSL  ++LL+
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLN

A0A6J1DNV9 uncharacterized protein LOC111022403 (e-value: 2.0e-10; identity: 34.11%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHL-HRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEV
        W LN+DASW +  +  G+ W++R  +G ++ +G    ++  +++ LEA AI EGL +L +    RP   + +E+DS  V +LLN K  DLT+    + E+
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHL-HRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEV

Query:  KDLAEAMEDVSFTFCPWKKNVGAHSLAKK
         +L ++ E ++F     + N  AHSLA++
Subjt:  KDLAEAMEDVSFTFCPWKKNVGAHSLAKK

A0A6J1DSV1 uncharacterized protein LOC111023608 (e-value: 2.0e-13; identity: 36.84%)
Query:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQ-----IIVESDSLGVVNLLNGKEMDLTEISSN
        WKLN+DA+W  + NT G+ W++RD +G +I + C   +++ +I  LE  AI EGL  + +   RP  Q     I +ESDSL  ++LL+ +  D TEI   
Subjt:  WKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQ-----IIVESDSLGVVNLLNGKEMDLTEISSN

Query:  ITEVKDLAEAMEDVSFTFCPWKKNVGAHSLAKK
        + E+  + E M+ VS      + N  AH LA++
Subjt:  ITEVKDLAEAMEDVSFTFCPWKKNVGAHSLAKK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G52990.1 thioredoxin family protein (e-value: 6.7e-06; identity: 25.98%)
Query:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD
        K N DAS  E +  +GL W++R+S+G+++  G  +F+ + +    E  A+   +  +  +S   + ++I E D+  V  L+N K  D   +   +  +K 
Subjt:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD

Query:  LAEAMEDVSFTFCPWKKNVGAHSLAKK
           +     F F   ++N  A +L KK
Subjt:  LAEAMEDVSFTFCPWKKNVGAHSLAKK

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 2.2e-09; identity: 28.35%)
Query:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD
        K N+DA+W  EN   G+ W++R+  G ++  G       +++   E  A+   +L + R +   + +II ESD+  +VNLLN  +   T +   + +++ 
Subjt:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD

Query:  LAEAMEDVSFTFCPWKKNVGAHSLAKK
        L    E+V F F P   N  A  +A++
Subjt:  LAEAMEDVSFTFCPWKKNVGAHSLAKK

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 3.0e-06; identity: 27.01%)
Query:  AGW-KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNIT
        A W K N D S       +GL W++R+S+G+ +  GC +F+ +++I+  E  A+   +  +  +    + ++  E D++ V  L+  KE +   +   + 
Subjt:  AGW-KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNIT

Query:  EVKDLAEAMEDVSFTFCPWKKNVGAHSLAKKK-NNHL
         ++  ++A   V FTF   ++NV    LAKK   NH+
Subjt:  EVKDLAEAMEDVSFTFCPWKKNVGAHSLAKKK-NNHL

AT4G29090.1 Ribonuclease H-like superfamily protein (e-value: 1.9e-08; identity: 26.52%)
Query:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD
        K N+DA+W  +N   G+ W++R+ +G +   G       +S+   E  A+   +L L R     +  +I ESDS  ++ +LN  E+    +   I +++ 
Subjt:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD

Query:  LAEAMEDVSFTFCPWKKNVGAHSLAKKKNNHL
        L     +V F F P + N  A  +A++  + L
Subjt:  LAEAMEDVSFTFCPWKKNVGAHSLAKKKNNHL

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 2.3e-06; identity: 25.2%)
Query:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD
        K N DAS  E N  +GL W++R+S+G++I  G  +F+ + +    E   +   +  +  S    H ++I E D+  +  ++N K  +   +   +  ++ 
Subjt:  KLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKD

Query:  LAEAMEDVSFTFCPWKKNVGAHSLAKK
           + E + F+F   ++N  A  LAK+
Subjt:  LAEAMEDVSFTFCPWKKNVGAHSLAKK


Sequences
CDS sequence
ATGGAAAACAACACGGCGGGATGGAAGCTGAATTCGGATGCGTCGTGGATCGAGGAGAACAACACGGCGGGGCTGAGTTGGATGGTTCGTGACTCTGAGGGATCTCTAAT
CTGTTCGGGTTGCATGCAGTTTAAAAGTCAAGAGTCGATAAGGAATTTGGAGGCAAGAGCGATTTATGAAGGTCTATTGCACCTTCATCGTTCGAGCCAAAGACCGCACC
CGCAAATCATCGTGGAATCTGATTCGCTGGGGGTTGTTAACCTGCTAAACGGAAAAGAGATGGATCTTACTGAAATTTCGAGTAACATAACAGAAGTCAAAGATCTTGCT
GAAGCCATGGAAGATGTTTCGTTTACCTTCTGTCCCTGGAAGAAGAATGTGGGCGCTCACTCCTTGGCAAAAAAAAAAAACAACCACCTTAAAGTCAAATGA
mRNA sequence
ATGGAAAACAACACGGCGGGATGGAAGCTGAATTCGGATGCGTCGTGGATCGAGGAGAACAACACGGCGGGGCTGAGTTGGATGGTTCGTGACTCTGAGGGATCTCTAAT
CTGTTCGGGTTGCATGCAGTTTAAAAGTCAAGAGTCGATAAGGAATTTGGAGGCAAGAGCGATTTATGAAGGTCTATTGCACCTTCATCGTTCGAGCCAAAGACCGCACC
CGCAAATCATCGTGGAATCTGATTCGCTGGGGGTTGTTAACCTGCTAAACGGAAAAGAGATGGATCTTACTGAAATTTCGAGTAACATAACAGAAGTCAAAGATCTTGCT
GAAGCCATGGAAGATGTTTCGTTTACCTTCTGTCCCTGGAAGAAGAATGTGGGCGCTCACTCCTTGGCAAAAAAAAAAAACAACCACCTTAAAGTCAAATGA
Protein sequence
MENNTAGWKLNSDASWIEENNTAGLSWMVRDSEGSLICSGCMQFKSQESIRNLEARAIYEGLLHLHRSSQRPHPQIIVESDSLGVVNLLNGKEMDLTEISSNITEVKDLA
EAMEDVSFTFCPWKKNVGAHSLAKKKNNHLKVK
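
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code (the record does not say which table was used), and the names `translate`, `cds_start`, and `cds_end` are illustrative. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGAAAACAACACGGCGGGATGGAAGCTG"  # first 30 nt of the CDS above
cds_end = "AAAGTCAAATGA"                      # last 12 nt, including the stop codon
print(translate(cds_start))  # MENNTAGWKL
print(translate(cds_end))    # KVK
```

Both fragments agree with the start (MENNTAGWKL) and end (KVK) of the listed protein sequence. Passing the full CDS, with its line breaks removed, to the same function would allow a complete comparison.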