; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021576 (gene) of Snake gourd v1 genome

Gene IDTan0021576
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG06:45230497..45230985
RNA-Seq ExpressionTan0021576
SyntenyTan0021576
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU50545.1 hypothetical protein TSUD_409890 [Trifolium subterraneum]5.6e-1131.91Show/hide
Query:  HTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKAND
        H  WQ P   G +K NVDA + ++ G    GW +RD +G  I AG  ++P+K SI+  EA+ ++E +K++    ++ F  +  E+DS+ + + IQ   N 
Subjt:  HTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKAND

Query:  ITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
        ++  S +V ++  +L++     V    R  N  AH L R A
Subjt:  ITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]5.8e-1634.78Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDITG
        W+PP     WKLN +A+W       G+GW+LRD  G  I A C+ +  + +I +LE + I EGL+ I     RP   IH+ESDS E ++L+  +  D T 
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDITG

Query:  TSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
          +++ E+ +++  +    + H  R  N  AH L R A
Subjt:  TSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]4.9e-1536.17Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQGKANDIT
        W+PP P  +W LN DASW +S    G+GW++R   G  + AG +F+   +++  LEA  I+EGL+ + +    RP   +HIE+DS E+ +L+  K  D+T
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQGKANDIT

Query:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAI
         T +VV E+  L +           R  NG AH+L + A++
Subjt:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAI

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]5.8e-1634.97Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFP---C--IHIESDSQEIVNLIQGKA
        W+PP     WKLN DA+W       G+GW+LRD  G  I A C+ +  + +I +LE + I EGL+ I     RP     C  IH+ESDS E ++L+  + 
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFP---C--IHIESDSQEIVNLIQGKA

Query:  NDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
         D T   +++ E+ +++  +    + H  R  N  AH+L R A
Subjt:  NDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

XP_025877173.1 uncharacterized protein LOC107278050 isoform X1 [Oryza sativa Japonica Group]6.6e-1233.54Show/hide
Query:  SSSHTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGK
        +S+  +W+ PR  G  KLNVD S+    G  G+G +LRD+ GS ++A CK L    + +  E    MEGL   +    RP   I IE+D   +VNL++  
Subjt:  SSSHTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGK

Query:  ANDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQEIPRDLSKEHLNFL
          D++  + +V E+ RLL+      V    R QNG +H L   A    I     +   NF+
Subjt:  ANDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQEIPRDLSKEHLNFL

TrEMBL top hitse value%identityAlignment
A0A2Z6PJB8 Uncharacterized protein2.7e-1131.91Show/hide
Query:  HTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKAND
        H  WQ P   G +K NVDA + ++ G    GW +RD +G  I AG  ++P+K SI+  EA+ ++E +K++    ++ F  +  E+DS+ + + IQ   N 
Subjt:  HTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKAND

Query:  ITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
        ++  S +V ++  +L++     V    R  N  AH L R A
Subjt:  ITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

A0A6J1CP26 uncharacterized protein LOC1110134122.8e-1634.78Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDITG
        W+PP     WKLN +A+W       G+GW+LRD  G  I A C+ +  + +I +LE + I EGL+ I     RP   IH+ESDS E ++L+  +  D T 
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDITG

Query:  TSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
          +++ E+ +++  +    + H  R  N  AH L R A
Subjt:  TSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

A0A6J1DNV9 uncharacterized protein LOC1110224032.4e-1536.17Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQGKANDIT
        W+PP P  +W LN DASW +S    G+GW++R   G  + AG +F+   +++  LEA  I+EGL+ + +    RP   +HIE+DS E+ +L+  K  D+T
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQGKANDIT

Query:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAI
         T +VV E+  L +           R  NG AH+L + A++
Subjt:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAI

A0A6J1DSV1 uncharacterized protein LOC1110236082.8e-1634.97Show/hide
Query:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFP---C--IHIESDSQEIVNLIQGKA
        W+PP     WKLN DA+W       G+GW+LRD  G  I A C+ +  + +I +LE + I EGL+ I     RP     C  IH+ESDS E ++L+  + 
Subjt:  WQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFP---C--IHIESDSQEIVNLIQGKA

Query:  NDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA
         D T   +++ E+ +++  +    + H  R  N  AH+L R A
Subjt:  NDITGTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMA

M8AUK9 Protein transport protein SEC247.9e-1134.27Show/hide
Query:  NWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDIT
        +W+PP P    KLN D S++ + G AG G VLRD  G  IY  C++L   H  +  E     EGLK  +   Q+P   + +E+D  EI++LI  +  D +
Subjt:  NWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDIT

Query:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQE
           + V E+  LL     A +    R QN  AH +  +   QE
Subjt:  GTSFVVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G33330.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-0523.08Show/hide
Query:  VDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQ--GKANDITGTSFV--VNEL
        VDASW+    CAG+GWVL +     +  G   +P  +S +  EA  +   + Q+    ++    C+      Q +   +Q   KA   T +     + ++
Subjt:  VDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMS-RFQRPFPCIHIESDSQEIVNLIQ--GKANDITGTSFV--VNEL

Query:  DRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQEIPRDLSKEH
         +L  I     V+      N  A +L ++A  +++   +S  H
Subjt:  DRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQEIPRDLSKEH

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.9e-0530.39Show/hide
Query:  SSHTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKA
        S +T W PP  R   K N DAS  E    +GLGW+LR+S G+ I  G      + +    E  T+   +  I + +      +  E D+Q I  +I  K+
Subjt:  SSHTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKA

Query:  ND
        ++
Subjt:  ND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCGAGTCACACGAACTGGCAACCCCCCCGCCCTCGAGGTGTCTGGAAATTGAATGTCGATGCTTCTTGGATCGAATCGGTCGGCTGCGCTGGTCTTGGGTGGGT
TCTCCGTGACTCTATTGGATCTCCAATCTATGCAGGTTGTAAGTTCCTTCCCAAAAAACATTCAATTGTCTGGCTAGAAGCAATCACAATTATGGAAGGTTTGAAGCAAA
TTATGTCCCGGTTCCAAAGACCTTTCCCCTGCATTCACATAGAGTCTGATTCTCAAGAAATCGTAAATCTAATTCAAGGAAAGGCAAACGACATTACCGGAACTTCCTTT
GTGGTTAACGAGTTAGATCGGTTGCTGAACATTGTGGGTGGTGCGTTTGTGAACCATTGCCTAAGAATTCAGAATGGTGAAGCCCACAATTTAACTCGAATGGCTGCCAT
ACAGGAGATCCCCCGTGATCTGTCGAAGGAGCATCTAAATTTCTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCGAGTCACACGAACTGGCAACCCCCCCGCCCTCGAGGTGTCTGGAAATTGAATGTCGATGCTTCTTGGATCGAATCGGTCGGCTGCGCTGGTCTTGGGTGGGT
TCTCCGTGACTCTATTGGATCTCCAATCTATGCAGGTTGTAAGTTCCTTCCCAAAAAACATTCAATTGTCTGGCTAGAAGCAATCACAATTATGGAAGGTTTGAAGCAAA
TTATGTCCCGGTTCCAAAGACCTTTCCCCTGCATTCACATAGAGTCTGATTCTCAAGAAATCGTAAATCTAATTCAAGGAAAGGCAAACGACATTACCGGAACTTCCTTT
GTGGTTAACGAGTTAGATCGGTTGCTGAACATTGTGGGTGGTGCGTTTGTGAACCATTGCCTAAGAATTCAGAATGGTGAAGCCCACAATTTAACTCGAATGGCTGCCAT
ACAGGAGATCCCCCGTGATCTGTCGAAGGAGCATCTAAATTTCTTATAG
Protein sequenceShow/hide protein sequence
MSSSHTNWQPPRPRGVWKLNVDASWIESVGCAGLGWVLRDSIGSPIYAGCKFLPKKHSIVWLEAITIMEGLKQIMSRFQRPFPCIHIESDSQEIVNLIQGKANDITGTSF
VVNELDRLLNIVGGAFVNHCLRIQNGEAHNLTRMAAIQEIPRDLSKEHLNFL