CuGenDBv2

Tan0017583 (gene) of Snake gourd v1 genome

Gene ID: Tan0017583
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ribonuclease H-like superfamily protein
Genome location: LG05:74805540..74806368
RNA-Seq Expression: Tan0017583
Synteny: Tan0017583
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
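
The genome location string can be split into a sequence ID and a coordinate pair for downstream use. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:74805540..74806368")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span on LG05 is 829 bases long.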


Homology
GenBank top hits (with e value and % identity)
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 4.3e-11, identity: 30.81%)
Query:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE-------------PDTCYRENVPPLRSI----GSTSKNKTS-WWKLNTNATRNKEKQQGGLG
        + ++ +++ I W +WE RNK++  G+HP   +I   I+                   +++  +R I    G+  K  TS  WKLNTNA    +   GG+G
Subjt:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE-------------PDTCYRENVPPLRSI----GSTSKNKTS-WWKLNTNATRNKEKQQGGLG

Query:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGE
         I+RD  G  I    + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++   +D  E
Subjt:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 1.8e-12, identity: 28.65%)
Query:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT
        +W   + W+ L ++  + ++ +++ I   +WE RNK++  G+H    +I   I+         DT  +   ++  P+R IG  ++ +     ++ WKLNT
Subjt:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT

Query:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN
        +A    +    G+G I+RD  G  I T  + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++
Subjt:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia] (e value: 1.8e-12, identity: 28.65%)
Query:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT
        +W   + W+ L ++  + ++ +++ I   +WE RNK++  G+H    +I   I+         DT  +   ++  P+R IG  ++ +     ++ WKLNT
Subjt:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT

Query:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN
        +A    +    G+G I+RD  G  I T  + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++
Subjt:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 2.1e-10, identity: 28.81%)
Query:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR-----ENVPPLRSIGSTSKNK-----TSWWKLNTNATRNKEKQQGGLG
        + ++ +++ I W +WE RNK++  G+H    +I   I+         DT  +     +++  +R IG  +  +     ++ WKLNT+A    +   GG+G
Subjt:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR-----ENVPPLRSIGSTSKNK-----TSWWKLNTNATRNKEKQQGGLG

Query:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRD-----LPPDLCPPICVESDAAEAINLINLTSEDLGE
         I+RD  G  I  D + I+   +I  LE+  I  GL++ R      +  + C PI +ESD+ EAI+L++   +D  E
Subjt:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRD-----LPPDLCPPICVESDAAEAINLINLTSEDLGE

XP_024033458.1 uncharacterized protein LOC112095586 [Citrus clementina] (e value: 1.8e-09, identity: 31.87%)
Query:  RLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIEP--DTCYRENVPPLRSIGSTSKNKTSWW--------KLNTNATRNKEKQQGGLGSIVRD
        R KD D    +TI+W +W ARN  +  G+  + +    + E   +   R  +P    IG  S +K   W        K+N NA  N EKQ  GLG+++RD
Subjt:  RLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIEP--DTCYRENVPPLRSIGSTSKNKTSWW--------KLNTNATRNKEKQQGGLGSIVRD

Query:  SLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLC-PPICVESDAAEAINLIN
         +G+ I+  ++  K    + I E++ I  GL+    L  + C   + VESDA E + L+N
Subjt:  SLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLC-PPICVESDAAEAINLIN

TrEMBL top hits (with e value and % identity)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 2.1e-11, identity: 30.81%)
Query:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE-------------PDTCYRENVPPLRSI----GSTSKNKTS-WWKLNTNATRNKEKQQGGLG
        + ++ +++ I W +WE RNK++  G+HP   +I   I+                   +++  +R I    G+  K  TS  WKLNTNA    +   GG+G
Subjt:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE-------------PDTCYRENVPPLRSI----GSTSKNKTS-WWKLNTNATRNKEKQQGGLG

Query:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGE
         I+RD  G  I    + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++   +D  E
Subjt:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGE

A0A6J1CQG0 uncharacterized protein LOC111013216 (e value: 1.5e-09, identity: 26.98%)
Query:  WAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKEN--------IAKRIEPDTCYRENVPPLRSIGSTSKNK--------------TSW
        W   D W  L N L D + + ++ I W +WE+RN+++  G   +++         I   I+  TC  +     ++ G   + +              T+ 
Subjt:  WAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKEN--------IAKRIEPDTCYRENVPPLRSIGSTSKNK--------------TSW

Query:  WKLNTNATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDL
        WKLNT+A+ ++E++ GG+G I+ D  G  +      I+    I  LEL  I+ GL+    +      PI +ESD+ E I L+     DL
Subjt:  WKLNTNATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDL

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 8.5e-13, identity: 28.65%)
Query:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT
        +W   + W+ L ++  + ++ +++ I   +WE RNK++  G+H    +I   I+         DT  +   ++  P+R IG  ++ +     ++ WKLNT
Subjt:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT

Query:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN
        +A    +    G+G I+RD  G  I T  + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++
Subjt:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X2 (e value: 8.5e-13, identity: 28.65%)
Query:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT
        +W   + W+ L ++  + ++ +++ I   +WE RNK++  G+H    +I   I+         DT  +   ++  P+R IG  ++ +     ++ WKLNT
Subjt:  DWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR---ENVPPLRSIGSTSKNK-----TSWWKLNT

Query:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN
        +A    +    G+G I+RD  G  I T  + I+   +I  LE+  I  GL++ R    + C PI +ESD+ EAI+L++
Subjt:  NATRNKEKQQGGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLIN

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 1.0e-10, identity: 28.81%)
Query:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR-----ENVPPLRSIGSTSKNK-----TSWWKLNTNATRNKEKQQGGLG
        + ++ +++ I W +WE RNK++  G+H    +I   I+         DT  +     +++  +R IG  +  +     ++ WKLNT+A    +   GG+G
Subjt:  DADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIE--------PDTCYR-----ENVPPLRSIGSTSKNK-----TSWWKLNTNATRNKEKQQGGLG

Query:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRD-----LPPDLCPPICVESDAAEAINLINLTSEDLGE
         I+RD  G  I  D + I+   +I  LE+  I  GL++ R      +  + C PI +ESD+ EAI+L++   +D  E
Subjt:  SIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRD-----LPPDLCPPICVESDAAEAINLINLTSEDLGE

SwissProt top hits (with e value and % identity)
No hits found
Arabidopsis top hits (with e value and % identity)
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 6.4e-05, identity: 22.53%)
Query:  IMWSLWEARNKALISGLHPNKENIAKRIEPD-----------TCYRENVPPLRSIGSTSKNKTSWWKLNTNATRNKEKQQGGLGSIVRDSLGSTISTDIQ
        ++W LW+ RN+ +  G   N + + +R E D           +C  +      S G        W K NT+AT N++ ++ G+G ++R+  G       +
Subjt:  IMWSLWEARNKALISGLHPNKENIAKRIEPD-----------TCYRENVPPLRSIGSTSKNKTSWWKLNTNATRNKEKQQGGLGSIVRDSLGSTISTDIQ

Query:  FIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGEAKSLALAVKTIASSLCKVTFAWHPRE
         +    S+   EL+ +   + S      +    +  ESD+   I ++N   E     K     ++ + S   +V F + PRE
Subjt:  FIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGEAKSLALAVKTIASSLCKVTFAWHPRE


Sequences
CDS sequence
ATGAGTGAAACAGATGGAAACCTTTACGCACGTAATCTGGGGTTGGAAGATTGGGCTCCTAGTGATTGCTGGAAAACGCTGACTAACCGCCTTAAGGATGCAGATCAAAG
CAAAGCAATAACAATCATGTGGAGCTTATGGGAAGCCAGAAATAAAGCTCTAATTAGCGGACTACATCCTAACAAAGAGAACATAGCGAAAAGAATCGAACCCGATACCT
GCTATCGAGAAAATGTCCCTCCTCTTCGATCAATTGGCAGCACTTCGAAGAACAAGACGAGCTGGTGGAAATTGAACACCAACGCGACTCGGAACAAAGAAAAGCAACAA
GGCGGCTTAGGATCGATCGTTCGTGACTCCTTAGGTTCTACGATCAGTACCGACATCCAATTTATCAAAATCAGCTGGAGCATCAAAATTCTGGAATTACAAGTTATTCT
CTCAGGTTTAAAGAGTCCGAGAGACCTACCTCCCGATCTTTGTCCACCAATCTGCGTTGAATCCGATGCGGCTGAAGCGATCAACTTGATCAACCTCACCTCGGAAGATC
TTGGTGAAGCGAAATCTTTGGCATTAGCAGTCAAAACAATTGCTTCCTCTCTATGCAAAGTGACTTTTGCATGGCATCCTCGGGAGTAG
mRNA sequence
ATGAGTGAAACAGATGGAAACCTTTACGCACGTAATCTGGGGTTGGAAGATTGGGCTCCTAGTGATTGCTGGAAAACGCTGACTAACCGCCTTAAGGATGCAGATCAAAG
CAAAGCAATAACAATCATGTGGAGCTTATGGGAAGCCAGAAATAAAGCTCTAATTAGCGGACTACATCCTAACAAAGAGAACATAGCGAAAAGAATCGAACCCGATACCT
GCTATCGAGAAAATGTCCCTCCTCTTCGATCAATTGGCAGCACTTCGAAGAACAAGACGAGCTGGTGGAAATTGAACACCAACGCGACTCGGAACAAAGAAAAGCAACAA
GGCGGCTTAGGATCGATCGTTCGTGACTCCTTAGGTTCTACGATCAGTACCGACATCCAATTTATCAAAATCAGCTGGAGCATCAAAATTCTGGAATTACAAGTTATTCT
CTCAGGTTTAAAGAGTCCGAGAGACCTACCTCCCGATCTTTGTCCACCAATCTGCGTTGAATCCGATGCGGCTGAAGCGATCAACTTGATCAACCTCACCTCGGAAGATC
TTGGTGAAGCGAAATCTTTGGCATTAGCAGTCAAAACAATTGCTTCCTCTCTATGCAAAGTGACTTTTGCATGGCATCCTCGGGAGTAG
Protein sequence
MSETDGNLYARNLGLEDWAPSDCWKTLTNRLKDADQSKAITIMWSLWEARNKALISGLHPNKENIAKRIEPDTCYRENVPPLRSIGSTSKNKTSWWKLNTNATRNKEKQQ
GGLGSIVRDSLGSTISTDIQFIKISWSIKILELQVILSGLKSPRDLPPDLCPPICVESDAAEAINLINLTSEDLGEAKSLALAVKTIASSLCKVTFAWHPRE
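
The protein sequence is expected to follow from the CDS by translation. The following is a minimal sketch of that check, assuming the standard genetic code; the `translate` helper and the codon-table construction are illustrative and not part of the database. The example input is the first 30 bases of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed in this record.
print(translate("ATGAGTGAAACAGATGGAAACCTTTACGCA"))  # MSETDGNLYA
```

Passing the full CDS (with its line breaks, which the function strips) and comparing the result with the listed protein sequence is a quick consistency check on the record.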