; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011871 (gene) of Snake gourd v1 genome

Gene IDTan0011871
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG11:20778381..20778986
RNA-Seq ExpressionTan0011871
SyntenyTan0011871
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021807757.1 uncharacterized protein LOC110751579 [Prunus avium]1.3e-1735.8Show/hide
Query:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI
        W PP  G +KLNVD AC  S+   GLGA++R+  G  +GA SV + G       E +A+ +G++   + G++ L +E+D +  +  I   +      G I
Subjt:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI

Query:  LEDIRNCMITNKFAGLS--FSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD
        + DI   M++  F  LS  FSPR  N VA  LA +A  S     WL + P WL+  + AD D
Subjt:  LEDIRNCMITNKFAGLS--FSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD

XP_021809818.1 uncharacterized protein LOC110753261 [Prunus avium]1.6e-1832.58Show/hide
Query:  GEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQ
        GE K I+  P        KW+PP  G  KLNVD A  P   ++G+GA++R+  G  + A ++ +        AE MA+  GM  A   GF+   +ESD Q
Subjt:  GEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQ

Query:  VLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
         +V  +  D     +  G +++DI+  +   +   +SFSPR  N+VAH LA  A +      W+ + P WL  ++  D
Subjt:  VLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

XP_021824838.1 uncharacterized protein LOC110765903 [Prunus avium]2.4e-1934.38Show/hide
Query:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI
        W PP  G  K+NVD A      + G+G ++R+  G F+GA    +Q  +     + MA ++GM  ++ MG N + +E D QV ++GI          G +
Subjt:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI

Query:  LEDIRNCMITNKFAGL--SFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
         E+ +N  + +KF G+   ++PRS NK AH LAH+A S  +  +W+ D P WL  ++ AD
Subjt:  LEDIRNCMITNKFAGL--SFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

XP_023893303.1 uncharacterized protein LOC112005292 [Quercus suber]4.1e-1936.42Show/hide
Query:  EKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQG
        +KW PPK   VK+N D A   + +  G+G ++RDS G  LG+ S  +   + P   EAMA+   M+ A+ +GF  + +E+D  VLV+ + D +   S  G
Subjt:  EKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQG

Query:  PILEDIRNCMITNKFAGLSFS--PRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
         +L++IR  ++ N F  L +S   R  N VAH LAH A    +   W+ D P  L  ++LAD
Subjt:  PILEDIRNCMITNKFAGLSFS--PRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

XP_038704726.1 uncharacterized protein LOC120000672 [Tripterygium wilfordii]9.4e-2433.87Show/hide
Query:  YMRDYRLPGEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNK
        YM +Y     A + +P P  +  +  KW PP  G +KL+VD A     H  G+GA++RD +G+   A S  + G   P   +A A++KG +LA+ MG + 
Subjt:  YMRDYRLPGEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNK

Query:  LWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADT
        L IESD  VLV  I   ++ L+P G IL+DIR+ +       L ++ R  N+VAH LA  + +      W+ + P+++   ++ D+
Subjt:  LWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADT

TrEMBL top hitse value%identityAlignment
A0A6P5RZ31 uncharacterized protein LOC1107515796.4e-1835.8Show/hide
Query:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI
        W PP  G +KLNVD AC  S+   GLGA++R+  G  +GA SV + G       E +A+ +G++   + G++ L +E+D +  +  I   +      G I
Subjt:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI

Query:  LEDIRNCMITNKFAGLS--FSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD
        + DI   M++  F  LS  FSPR  N VA  LA +A  S     WL + P WL+  + AD D
Subjt:  LEDIRNCMITNKFAGLS--FSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD

A0A6P5S7Q3 uncharacterized protein LOC1107532617.5e-1932.58Show/hide
Query:  GEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQ
        GE K I+  P        KW+PP  G  KLNVD A  P   ++G+GA++R+  G  + A ++ +        AE MA+  GM  A   GF+   +ESD Q
Subjt:  GEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQ

Query:  VLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
         +V  +  D     +  G +++DI+  +   +   +SFSPR  N+VAH LA  A +      W+ + P WL  ++  D
Subjt:  VLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

A0A6P5TBX9 uncharacterized protein LOC1107659031.2e-1934.38Show/hide
Query:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI
        W PP  G  K+NVD A      + G+G ++R+  G F+GA    +Q  +     + MA ++GM  ++ MG N + +E D QV ++GI          G +
Subjt:  WNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPI

Query:  LEDIRNCMITNKFAGL--SFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
         E+ +N  + +KF G+   ++PRS NK AH LAH+A S  +  +W+ D P WL  ++ AD
Subjt:  LEDIRNCMITNKFAGL--SFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

A0A803P6S3 Uncharacterized protein2.9e-1831.44Show/hide
Query:  MAEWIVDYMRDYR--LPGEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGME
        M EW   Y+ +YR    G   K            ++W PP    VK+NVD       H +GLG + RD +G    A + +VQ    P   E MA+ +G++
Subjt:  MAEWIVDYMRDYR--LPGEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGME

Query:  LAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
        L +Q  + +  IESD    V  I            +L  IR+ M    F G+SF  R  N+VAH LA++A   + S  W+   P   H  +L D
Subjt:  LAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

M5VP36 Uncharacterized protein (Fragment)1.7e-1832.28Show/hide
Query:  VDYMRDYRLPGEAKKIIP-NPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMG
        V + R + L GE   ++   P+    +  KW+PP  G  KLNVD A  P   + G+GA++R+  G  + A ++ +        AE MA L  M+ A   G
Subjt:  VDYMRDYRLPGEAKKIIP-NPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMG

Query:  FNKLWIESDFQVLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD
        F+ + IESD Q  V  +  D     +  G ++ DI+  +   +   +SFSPR  N+VAH LA  A +      W+ + P WL  +I  D
Subjt:  FNKLWIESDFQVLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILAD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein2.5e-0626Show/hide
Query:  VKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQ-GPILEDIRNC
        VK N D + H    ++GLG ++R+S G  L       QGR  P  AE  A++  ++     G+ K+  E D   + R I+  S   +P+    L+ I++ 
Subjt:  VKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDFQVLVRGIHDHSHRLSPQ-GPILEDIRNC

Query:  MITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILA
        + +       F+ R  N+ A  L   A  S    +  +  P +L+ ++L+
Subjt:  MITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILA

AT3G09510.1 Ribonuclease H-like superfamily protein2.0e-0828.57Show/hide
Query:  EAKKIIPNP-RNAAVQEEKWNPPKEGNVKLNVDVACH-PSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDF
        ++ K  P+P R  A  + +W  P    VK N D       L  TG G I+R+  G  +   S+ +    +P  AE  A+L  ++     G+ ++++E D 
Subjt:  EAKKIIPNP-RNAAVQEEKWNPPKEGNVKLNVDVACH-PSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWIESDF

Query:  QVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFS--PRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD
        Q L+  I+  S   S     LEDI      NKFA + F    R  NK+AH LA +  +     +     P WL      D++
Subjt:  QVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFS--PRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTD

AT4G03292.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-0626.97Show/hide
Query:  KIIPNPRNAAVQEEKWNPPK---------EGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWI
        ++I NP+   ++   WN P           G  K N D         T  G I+R+S+G  +   S  +        AE +  L  +++    G   +W 
Subjt:  KIIPNPRNAAVQEEKWNPPK---------EGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWI

Query:  ESDFQVLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVA
        E+D + LV  + +D  HRL   GP+L DIR  M+   +  + F  R  N  A
Subjt:  ESDFQVLVRGI-HDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.2e-1027.17Show/hide
Query:  EWIVDYMRDYRLPGEAKKIIPNPRNA-AVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAV
        EW+ + M + +  G         RNA   +  KW+PP    +K N D + H    ++GLG ILR+S G  +       QGR     AE   ++  ++ + 
Subjt:  EWIVDYMRDYRLPGEAKKIIPNPRNA-AVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAV

Query:  QMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWL
          G  K+  E D Q + R I+  S     Q   L+ I++ + + +    SF  R  N  A  LA  A       +  H  P +L
Subjt:  QMGFNKLWIESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAATGGATCGTTGACTATATGAGGGATTATCGCTTACCAGGAGAAGCAAAGAAGATCATACCTAATCCAAGAAATGCAGCGGTTCAGGAAGAGAAATGGAACCC
GCCGAAGGAAGGAAATGTGAAGCTCAACGTAGATGTTGCATGTCATCCTTCTCTTCATCTCACCGGCTTAGGGGCGATTCTCCGAGATTCTTCTGGAAGATTTTTAGGAG
CTCAATCGGTGATCGTTCAGGGACGCCACGACCCCTTCTCCGCAGAAGCTATGGCCATGCTGAAAGGTATGGAGCTGGCAGTCCAAATGGGGTTCAACAAACTATGGATT
GAATCAGATTTCCAAGTCTTGGTTAGGGGCATCCACGACCATTCTCACAGACTTTCTCCTCAAGGTCCAATTCTGGAGGACATACGCAACTGTATGATCACAAACAAATT
CGCAGGGCTTAGTTTTAGTCCTAGATCCACTAACAAGGTAGCCCACAACCTCGCTCATTGGGCTTTTTCATCACAAGAGTCTTGTAATTGGTTACACGATGAGCCCGAGT
GGCTCCATCATCTTATTTTGGCTGACACTGATTTGGTTAGGCCTTTTATTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAATGGATCGTTGACTATATGAGGGATTATCGCTTACCAGGAGAAGCAAAGAAGATCATACCTAATCCAAGAAATGCAGCGGTTCAGGAAGAGAAATGGAACCC
GCCGAAGGAAGGAAATGTGAAGCTCAACGTAGATGTTGCATGTCATCCTTCTCTTCATCTCACCGGCTTAGGGGCGATTCTCCGAGATTCTTCTGGAAGATTTTTAGGAG
CTCAATCGGTGATCGTTCAGGGACGCCACGACCCCTTCTCCGCAGAAGCTATGGCCATGCTGAAAGGTATGGAGCTGGCAGTCCAAATGGGGTTCAACAAACTATGGATT
GAATCAGATTTCCAAGTCTTGGTTAGGGGCATCCACGACCATTCTCACAGACTTTCTCCTCAAGGTCCAATTCTGGAGGACATACGCAACTGTATGATCACAAACAAATT
CGCAGGGCTTAGTTTTAGTCCTAGATCCACTAACAAGGTAGCCCACAACCTCGCTCATTGGGCTTTTTCATCACAAGAGTCTTGTAATTGGTTACACGATGAGCCCGAGT
GGCTCCATCATCTTATTTTGGCTGACACTGATTTGGTTAGGCCTTTTATTTCTTAA
Protein sequenceShow/hide protein sequence
MAEWIVDYMRDYRLPGEAKKIIPNPRNAAVQEEKWNPPKEGNVKLNVDVACHPSLHLTGLGAILRDSSGRFLGAQSVIVQGRHDPFSAEAMAMLKGMELAVQMGFNKLWI
ESDFQVLVRGIHDHSHRLSPQGPILEDIRNCMITNKFAGLSFSPRSTNKVAHNLAHWAFSSQESCNWLHDEPEWLHHLILADTDLVRPFIS