; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005432 (gene) of Snake gourd v1 genome

Gene IDTan0005432
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H domain
Genome locationLG05:881935..882948
RNA-Seq ExpressionTan0005432
SyntenyTan0005432
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015382610.1 uncharacterized protein LOC107175578 [Citrus sinensis]6.0e-1629.35Show/hide
Query:  DLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIE--LHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSG
        DL   I I W +W ARN+ L  G   N + +  + E  L  + R  +P+   + K S+   + + W+PP  G+ K+N DA    E H  GLG  +RD  G
Subjt:  DLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIE--LHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSG

Query:  SSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR
        + I A+ ++ K +  +   E + +  GL+  RN    S   + VESDA   V L+N+      E   ++  I+++      V+  +  R +N   H +A+
Subjt:  SSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR

Query:  L
        L
Subjt:  L

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]2.7e-1630.81Show/hide
Query:  ECWKALTTHLKDVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELH-----------TWDREFRPQIGSLDKSSKN-QMSHRHWDPPPTGWWKLN
        + W  L   L D +++ ++ I W IWE+RN+++  G   +++ + + I L            +  R  +   G L +  +N  M    W  PPT  WKLN
Subjt:  ECWKALTTHLKDVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELH-----------TWDREFRPQIGSLDKSSKN-QMSHRHWDPPPTGWWKLN

Query:  SDATWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDL
        +DA+W EE   GG+GW + D  G  + A    I+    I  LEL  I+ GL+ +    + S  PI +ESD+ E + L+     DL
Subjt:  SDATWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]4.5e-1932.2Show/hide
Query:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV
        +++ I W IWE RNK++  G  P   DI   I+ +  +   R    +L   S N+  H            W PP +  WKLN++A W  + + GG+GW +
Subjt:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV

Query:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTA
        RD  G  I AS ++I+ +  I  LE+ AI  GL+++R    +   PI +ESD+ EA++L++   +D  E   L+  I  +   +  V+     RE N  A
Subjt:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTA

Query:  HKIAR
        H +AR
Subjt:  HKIAR

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.8e-2029.3Show/hide
Query:  DVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQ--IGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDS
        D DL   +   W IW  RN  +  G   +   + +++     +  ++ +  +  L K+  N++    W+PPP   W LN+DA+W +  H+GG+GW +R  
Subjt:  DVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQ--IGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDS

Query:  SGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKI
         G  + A  + ++    +K+LE  AI+ GL++L NL +    P+ +E+D+AE  +L+N   EDL +   +V  I ++  +   + FA   RE N  AH +
Subjt:  SGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKI

Query:  ARLPS---SPIFWSD
        A+  S     + W D
Subjt:  ARLPS---SPIFWSD

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]2.9e-1831.43Show/hide
Query:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV
        +++ I W IWE RNK++  G      DI   I+ +  +   R    +L   S N+  H            W PP +  WKLN+DA W  + + GG+GW +
Subjt:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV

Query:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRN-----LPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPRE
        RD  G  I A  ++I+T+  I  LE+ AI  GL+++R      +  +   PI +ESD+ EA++L++   +D  E   L+  I  +   +  V+     RE
Subjt:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRN-----LPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPRE

Query:  KNTTAHKIAR
         N  AH +AR
Subjt:  KNTTAHKIAR

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.2e-1932.2Show/hide
Query:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV
        +++ I W IWE RNK++  G  P   DI   I+ +  +   R    +L   S N+  H            W PP +  WKLN++A W  + + GG+GW +
Subjt:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV

Query:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTA
        RD  G  I AS ++I+ +  I  LE+ AI  GL+++R    +   PI +ESD+ EA++L++   +D  E   L+  I  +   +  V+     RE N  A
Subjt:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTA

Query:  HKIAR
        H +AR
Subjt:  HKIAR

A0A6J1CQG0 uncharacterized protein LOC1110132161.3e-1630.81Show/hide
Query:  ECWKALTTHLKDVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELH-----------TWDREFRPQIGSLDKSSKN-QMSHRHWDPPPTGWWKLN
        + W  L   L D +++ ++ I W IWE+RN+++  G   +++ + + I L            +  R  +   G L +  +N  M    W  PPT  WKLN
Subjt:  ECWKALTTHLKDVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELH-----------TWDREFRPQIGSLDKSSKN-QMSHRHWDPPPTGWWKLN

Query:  SDATWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDL
        +DA+W EE   GG+GW + D  G  + A    I+    I  LEL  I+ GL+ +    + S  PI +ESD+ E + L+     DL
Subjt:  SDATWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDL

A0A6J1D4B6 uncharacterized protein LOC1110171813.2e-1533.67Show/hide
Query:  ITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDR--EFRPQI-GSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSIC
        + ++WSIW  RN+ +   H   ++  +++I   T  +  EF   + G+L+   KN      W PP    WKLN DATW++  H GGLGW VRDS G  I 
Subjt:  ITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDR--EFRPQI-GSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSIC

Query:  ASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR
        A               LKA+       ++L +++   I +ESD  E VN+IN  S  L E + +V  I     +L    F   P + N  AH IAR
Subjt:  ASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR

A0A6J1DNV9 uncharacterized protein LOC1110224038.8e-2129.3Show/hide
Query:  DVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQ--IGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDS
        D DL   +   W IW  RN  +  G   +   + +++     +  ++ +  +  L K+  N++    W+PPP   W LN+DA+W +  H+GG+GW +R  
Subjt:  DVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQ--IGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDS

Query:  SGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKI
         G  + A  + ++    +K+LE  AI+ GL++L NL +    P+ +E+D+AE  +L+N   EDL +   +V  I ++  +   + FA   RE N  AH +
Subjt:  SGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKI

Query:  ARLPS---SPIFWSD
        A+  S     + W D
Subjt:  ARLPS---SPIFWSD

A0A6J1DSV1 uncharacterized protein LOC1110236081.4e-1831.43Show/hide
Query:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV
        +++ I W IWE RNK++  G      DI   I+ +  +   R    +L   S N+  H            W PP +  WKLN+DA W  + + GG+GW +
Subjt:  KAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSH----------RHWDPPPTGWWKLNSDATWLEEAHQGGLGWTV

Query:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRN-----LPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPRE
        RD  G  I A  ++I+T+  I  LE+ AI  GL+++R      +  +   PI +ESD+ EA++L++   +D  E   L+  I  +   +  V+     RE
Subjt:  RDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRN-----LPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPRE

Query:  KNTTAHKIAR
         N  AH +AR
Subjt:  KNTTAHKIAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.9e-0823.64Show/hide
Query:  IECWKALTTHLKDVDLSKAITIMWSIWEARN------KALKSGHPPNK--EDITKRIELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDA
        I+  K  TT+   +D      IMW +W++RN      K     +   K  +D T+ +  +         + + +    ++     W+PPP GW K N D+
Subjt:  IECWKALTTHLKDVDLSKAITIMWSIWEARN------KALKSGHPPNK--EDITKRIELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDA

Query:  TWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLG-LKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTL
         + + +     GWT+R+ +G  +      +++        L A  LG L +L+ +       +  ESD+   V LIN+  ED     +L+  I      L
Subjt:  TWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLG-LKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTL

Query:  GKVTFAWCPREKNTTAHKIA
           +  +  RE+N+ A  +A
Subjt:  GKVTFAWCPREKNTTAHKIA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.0e-1427.55Show/hide
Query:  IMWSIWEARNKALKSGHPPNKEDITKRI--ELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICAST
        ++W +W++RN+ +  G   +  ++ +R   +   W    R   G        +     W  PP  W K N+DATW  E  + G+GW +R+ SG  +    
Subjt:  IMWSIWEARNKALKSGHPPNKEDITKRI--ELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICAST

Query:  QLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTL---GKVTFAWCPREKNTTAHKIAR
        + +     +   EL+A+   + ++      ++  I  ESDA   VNL+N  S+D     +L  A+ED+   L    +V F + PR  N  A +IAR
Subjt:  QLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTL---GKVTFAWCPREKNTTAHKIAR

AT4G03566.1 unknown protein1.2e-0624.43Show/hide
Query:  IMWSIWEARNKALKSGHPPNKEDITKRI--ELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICAST
        ++W IW+ RN  + +G   +   +      E   W++       S+ + S N      W  PP    K N   +WL + H  G  W VR+  G +   + 
Subjt:  IMWSIWEARNKALKSGHPPNKEDITKRI--ELHTWDREFRPQIGSLDKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICAST

Query:  QLIKTDWTIKILELKAIVLGLKSLRNLPIDS
        ++          EL+ ++  L SLR+L +D+
Subjt:  QLIKTDWTIKILELKAIVLGLKSLRNLPIDS

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-1526.04Show/hide
Query:  IMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSL-DKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICASTQ
        ++W +W+ RN+ +  G   N +++ +R E    +   R +  S   K   N+ S   W PPP  W K N+DATW  +  + G+GW +R+  G       +
Subjt:  IMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSL-DKSSKNQMSHRHWDPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICASTQ

Query:  LIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR
         +    ++   EL+A+   + SL       +  +  ESD+   + ++N+  E        +  ++ + S   +V F + PRE NT A ++AR
Subjt:  LIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTTAHKIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTGGATTCTATCCCGAGCCACAAGACAACCGTCTGGGAAAAACTGGGAGAATGTTACTCATGTCATTTGGGGTAGAAGACAGGAACCCTATCGAATGCTGGAA
AGCTCTCACAACTCACCTCAAGGATGTTGATCTAAGTAAAGCAATAACTATTATGTGGAGTATCTGGGAAGCAAGGAATAAAGCTTTAAAGAGTGGTCATCCTCCTAACA
AAGAAGACATCACAAAGCGAATTGAACTTCATACCTGGGACCGCGAGTTCCGTCCTCAAATCGGCTCTCTGGACAAATCTTCGAAGAACCAAATGAGTCACAGACATTGG
GATCCTCCCCCGACTGGTTGGTGGAAGCTGAATTCCGATGCGACCTGGCTTGAAGAAGCACACCAAGGAGGCTTAGGGTGGACTGTCCGTGACTCTTCAGGTTCTTCGAT
CTGTGCCAGCACTCAATTGATCAAAACAGATTGGACCATCAAAATTCTGGAATTGAAAGCCATTGTTTTGGGTTTGAAGAGCTTGAGAAACTTACCCATCGATTCCTTCC
CTCCTATCTGTGTCGAATCCGATGCGGCGGAAGCAGTCAATCTGATCAACCACATATCCGAAGATCTAGGTGAAGCCAATTCCTTGGTTGTTGCTATCGAAGATGTAGCC
TCCACCTTAGGCAAAGTGACCTTTGCTTGGTGCCCCCGGGAGAAGAACACGACGGCTCACAAGATCGCTAGACTCCCTTCCTCCCCTATTTTTTGGTCGGATCTTAACCG
ATCCTTCATTGCGGAAGATGATCCGGTAGTCTGGACTCACCCGCTTCTTCCGTGTATCACCTCTGTCATACATGAGGCAGGTGTTTTTGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTGGATTCTATCCCGAGCCACAAGACAACCGTCTGGGAAAAACTGGGAGAATGTTACTCATGTCATTTGGGGTAGAAGACAGGAACCCTATCGAATGCTGGAA
AGCTCTCACAACTCACCTCAAGGATGTTGATCTAAGTAAAGCAATAACTATTATGTGGAGTATCTGGGAAGCAAGGAATAAAGCTTTAAAGAGTGGTCATCCTCCTAACA
AAGAAGACATCACAAAGCGAATTGAACTTCATACCTGGGACCGCGAGTTCCGTCCTCAAATCGGCTCTCTGGACAAATCTTCGAAGAACCAAATGAGTCACAGACATTGG
GATCCTCCCCCGACTGGTTGGTGGAAGCTGAATTCCGATGCGACCTGGCTTGAAGAAGCACACCAAGGAGGCTTAGGGTGGACTGTCCGTGACTCTTCAGGTTCTTCGAT
CTGTGCCAGCACTCAATTGATCAAAACAGATTGGACCATCAAAATTCTGGAATTGAAAGCCATTGTTTTGGGTTTGAAGAGCTTGAGAAACTTACCCATCGATTCCTTCC
CTCCTATCTGTGTCGAATCCGATGCGGCGGAAGCAGTCAATCTGATCAACCACATATCCGAAGATCTAGGTGAAGCCAATTCCTTGGTTGTTGCTATCGAAGATGTAGCC
TCCACCTTAGGCAAAGTGACCTTTGCTTGGTGCCCCCGGGAGAAGAACACGACGGCTCACAAGATCGCTAGACTCCCTTCCTCCCCTATTTTTTGGTCGGATCTTAACCG
ATCCTTCATTGCGGAAGATGATCCGGTAGTCTGGACTCACCCGCTTCTTCCGTGTATCACCTCTGTCATACATGAGGCAGGTGTTTTTGGCTGA
Protein sequenceShow/hide protein sequence
MESGFYPEPQDNRLGKTGRMLLMSFGVEDRNPIECWKALTTHLKDVDLSKAITIMWSIWEARNKALKSGHPPNKEDITKRIELHTWDREFRPQIGSLDKSSKNQMSHRHW
DPPPTGWWKLNSDATWLEEAHQGGLGWTVRDSSGSSICASTQLIKTDWTIKILELKAIVLGLKSLRNLPIDSFPPICVESDAAEAVNLINHISEDLGEANSLVVAIEDVA
STLGKVTFAWCPREKNTTAHKIARLPSSPIFWSDLNRSFIAEDDPVVWTHPLLPCITSVIHEAGVFG