; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008420 (gene) of Snake gourd v1 genome

Gene IDTan0008420
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG01:102474454..102475371
RNA-Seq ExpressionTan0008420
SyntenyTan0008420
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599977.1 hypothetical protein SDJN03_05210, partial [Cucurbita argyrosperma subsp. sororia]3.4e-1439.83Show/hide
Query:  KLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYM----KISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIAN
        KLN  ASW+E   +GG+ W I D  GS IC   K+L+R+W +K LEGKA++EGL TY+     +S  H   +      +E+ ++LN+  +DL EL+++ +
Subjt:  KLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYM----KISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIAN

Query:  DIHSIASDVGIISFTKCP
        +I  +    GIISF + P
Subjt:  DIHSIASDVGIISFTKCP

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]4.3e-1732.62Show/hide
Query:  ECKLAKERYL-----------NETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKA
        + +LA +RY+            ++  +D  L RR    +   W+PP    WKLN +A+W    + GG+GW +RD +G +I A  + ++ +  I  LE  A
Subjt:  ECKLAKERYL-----------NETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKA

Query:  ILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE
        I EGL     I +EH + + +ESDS+E I +L+ +  D +E+  +  +I  +  D+ I+S     R +N+  H LAR A  N L EE
Subjt:  ILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.2e-1431.71Show/hide
Query:  RYLNETHRQDEALTRRENQTSHE--GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA
        +++ E+  Q E      ++T +    W PP    W LN DASW+++  +GG+GW IR + G ++ AG + ++    +K LE  AILEGL     +     
Subjt:  RYLNETHRQDEALTRRENQTSHE--GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA

Query:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAA
        + L +E+DS EV  +LN +  DL++   +  +I ++     I++F K  R +N   H LA+ A+
Subjt:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.1e-1734.94Show/hide
Query:  QDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMK-----ISKEHAQSLMV
        +D  L RR    +   W+PP    WKLN DA+W    + GG+GW +RD +G +I A  + ++ +  I  LE  AI EGL    +     I +EH + + +
Subjt:  QDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMK-----ISKEHAQSLMV

Query:  ESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE
        ESDS+E I +L+ +  D +E+  +  +I  +  D+ I+S     R +N+  H LAR A  N L EE
Subjt:  ESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE

XP_038886170.1 uncharacterized protein LOC120076417 [Benincasa hispida]1.1e-1238.76Show/hide
Query:  GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHAQSLMVESDSV-EVIKVLNDETLDLS
        GW P     WKLN+DASWN  I   GLGW   D    +  AG+K + R   +  LE  AI  GL     +S     ++MVESD + EVI +LND+ +DLS
Subjt:  GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHAQSLMVESDSV-EVIKVLNDETLDLS

Query:  ELNDIANDIHSIASDVGIISFTKCPRSSN
        E++  + +      ++G+ISF+   R  N
Subjt:  ELNDIANDIHSIASDVGIISFTKCPRSSN

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.1e-1732.62Show/hide
Query:  ECKLAKERYL-----------NETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKA
        + +LA +RY+            ++  +D  L RR    +   W+PP    WKLN +A+W    + GG+GW +RD +G +I A  + ++ +  I  LE  A
Subjt:  ECKLAKERYL-----------NETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKA

Query:  ILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE
        I EGL     I +EH + + +ESDS+E I +L+ +  D +E+  +  +I  +  D+ I+S     R +N+  H LAR A  N L EE
Subjt:  ILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE

A0A6J1CQG0 uncharacterized protein LOC1110132163.4e-1240.82Show/hide
Query:  WRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLS
        W  P   CWKLN DASW+E    GG+GW + D RG ++ AG  K++ +  I  LE   I+ GL     I+ +    + +ESDSVEVI+++  E +DL+
Subjt:  WRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHAQSLMVESDSVEVIKVLNDETLDLS

A0A6J1DNV9 uncharacterized protein LOC1110224035.7e-1531.71Show/hide
Query:  RYLNETHRQDEALTRRENQTSHE--GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA
        +++ E+  Q E      ++T +    W PP    W LN DASW+++  +GG+GW IR + G ++ AG + ++    +K LE  AILEGL     +     
Subjt:  RYLNETHRQDEALTRRENQTSHE--GWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA

Query:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAA
        + L +E+DS EV  +LN +  DL++   +  +I ++     I++F K  R +N   H LA+ A+
Subjt:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAA

A0A6J1DSV1 uncharacterized protein LOC1110236085.5e-1834.94Show/hide
Query:  QDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMK-----ISKEHAQSLMV
        +D  L RR    +   W+PP    WKLN DA+W    + GG+GW +RD +G +I A  + ++ +  I  LE  AI EGL    +     I +EH + + +
Subjt:  QDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMK-----ISKEHAQSLMV

Query:  ESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE
        ESDS+E I +L+ +  D +E+  +  +I  +  D+ I+S     R +N+  H LAR A  N L EE
Subjt:  ESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEE

M5X3Z4 RNase H domain-containing protein (Fragment)1.5e-1231.14Show/hide
Query:  AKERYLNETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEH
        AK   +NE  R  E   R +++     W  P  G  K+N D +WN    +GG GW IRDF G ++ AG K+ +R       E  AI   +    +    H
Subjt:  AKERYLNETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEH

Query:  AQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFN
           ++VESDS+  I+++  + +  +E+  +  DIH++  ++  ++FT  PRS N + H++A  A  N
Subjt:  AQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.4e-0431.76Show/hide
Query:  HILWECKLAKERYLNETHRQDEALTRRENQT---------SHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAG
        HI WE  L   +   + H   EA T  +  T         +H+ WR P +G  K N D S+     +   GW +RD  GS + AG
Subjt:  HILWECKLAKERYLNETHRQDEALTRRENQT---------SHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAG

AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0425.52Show/hide
Query:  QTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGK---AILEGLHTYMKISKEHAQSLMVESDSVEVIKVLN
        +++H+ W  P  G  K N D S+N    Q   GW IRD +G     G  +         LE +    ++   HT+     +  + ++ E DS +V ++LN
Subjt:  QTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGK---AILEGLHTYMKISKEHAQSLMVESDSVEVIKVLN

Query:  DETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARA
         + +     N I  +  S +     + F+  PR++NQ    LA++
Subjt:  DETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.4e-0422.47Show/hide
Query:  KERYLNETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA
        KE   N    + +   R  + + +  W PP +   K N DAS +E  +  GLGW +R+ +G++I  G+ K + +   +  E   ++  +           
Subjt:  KERYLNETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYMKISKEHA

Query:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEENVSSSFF
        + ++ E D+  + +++N ++ +   L    + I S       I F+   R  N     LA+ A     ++EN   S F
Subjt:  QSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEENVSSSFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATATCCCGAGTCCACAGTCCATATTTTGTGGGAGTGCAAATTAGCAAAAGAGAGGTACCTGAATGAAACGCATCGGCAGGACGAGGCTCTGACGAGGAGGGAGAA
CCAAACGAGTCATGAAGGTTGGCGTCCTCCTCTAAAAGGATGCTGGAAACTCAACGTCGACGCCTCCTGGAACGAAGCGATCTCGCAAGGGGGTTTGGGTTGGACAATTC
GTGACTTTCGGGGTTCTCTCATCTGCGCAGGAATTAAAAAACTCAAACGTCAATGGCCAATTAAATGTCTCGAAGGGAAAGCCATCCTTGAAGGGCTTCACACTTATATG
AAGATATCTAAGGAACATGCACAAAGTTTGATGGTGGAATCCGACTCTGTAGAAGTAATTAAGGTGTTAAACGATGAAACACTCGATCTCTCCGAATTGAATGACATCGC
AAACGACATTCACTCAATAGCCAGCGACGTTGGTATAATTTCCTTCACCAAATGCCCCAGATCGAGCAATCAATCGACCCATAAGCTAGCGAGAGCAGCTGCTTTCAATC
TCCTTTTGGAAGAAAACGTCTCTTCTTCTTTTTTTGTCGAAGAAGACTCTCTTTTGGATTGTAAATATCCCCTCTTGGGTGTATTCCCTCGTCGAGAAAGTTGGTGTACT
GGTTTCGTTGTTTTTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATATCCCGAGTCCACAGTCCATATTTTGTGGGAGTGCAAATTAGCAAAAGAGAGGTACCTGAATGAAACGCATCGGCAGGACGAGGCTCTGACGAGGAGGGAGAA
CCAAACGAGTCATGAAGGTTGGCGTCCTCCTCTAAAAGGATGCTGGAAACTCAACGTCGACGCCTCCTGGAACGAAGCGATCTCGCAAGGGGGTTTGGGTTGGACAATTC
GTGACTTTCGGGGTTCTCTCATCTGCGCAGGAATTAAAAAACTCAAACGTCAATGGCCAATTAAATGTCTCGAAGGGAAAGCCATCCTTGAAGGGCTTCACACTTATATG
AAGATATCTAAGGAACATGCACAAAGTTTGATGGTGGAATCCGACTCTGTAGAAGTAATTAAGGTGTTAAACGATGAAACACTCGATCTCTCCGAATTGAATGACATCGC
AAACGACATTCACTCAATAGCCAGCGACGTTGGTATAATTTCCTTCACCAAATGCCCCAGATCGAGCAATCAATCGACCCATAAGCTAGCGAGAGCAGCTGCTTTCAATC
TCCTTTTGGAAGAAAACGTCTCTTCTTCTTTTTTTGTCGAAGAAGACTCTCTTTTGGATTGTAAATATCCCCTCTTGGGTGTATTCCCTCGTCGAGAAAGTTGGTGTACT
GGTTTCGTTGTTTTTGAATGA
Protein sequenceShow/hide protein sequence
MKYPESTVHILWECKLAKERYLNETHRQDEALTRRENQTSHEGWRPPLKGCWKLNVDASWNEAISQGGLGWTIRDFRGSLICAGIKKLKRQWPIKCLEGKAILEGLHTYM
KISKEHAQSLMVESDSVEVIKVLNDETLDLSELNDIANDIHSIASDVGIISFTKCPRSSNQSTHKLARAAAFNLLLEENVSSSFFVEEDSLLDCKYPLLGVFPRRESWCT
GFVVFE