CuGenDBv2

MS024083 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS024083
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: RNase H domain-containing protein
Genome location: scaffold26:136375..137419
RNA-Seq Expression: MS024083
Synteny: MS024083
Gene Ontology terms:
  GO:0001172 - transcription, RNA-templated (biological process)
  GO:0005739 - mitochondrion (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003968 - RNA-directed 5'-3' RNA polymerase activity (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
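
The genome location field uses a `scaffold:start..end` form. As a minimal sketch, the snippet below splits that string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold26:136375..137419")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 1045 positions on scaffold26.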


Homology
GenBank top hits (e-value, % identity, alignment)
KAF4363068.1 hypothetical protein F8388_013432 [Cannabis sativa] | e-value: 8.1e-05 | % identity: 24.04
Query:  LELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPRSKSEQPWCLPLLAHSVHPRFTLSPL
        +E++  C  C    E   HAL   S+++ +W+  L+      D L + S    H   H      E         + KS       +L   + P    +P 
Subjt:  LELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPRSKSEQPWCLPLLAHSVHPRFTLSPL

Query:  LAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLS
        +AE  AIL+ +    +   G   + S     +  +H     L A    +H +K +  +F   + +HV  E N  A++LAK+GL++ +   +   FP WL+
Subjt:  LAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLS

Query:  NLILNESP
        N    + P
Subjt:  NLILNESP

KAF7837133.1 uncharacterized protein G2W53_005615 [Senna tora] | e-value: 5.0e-07 | % identity: 22.13
Query:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPR-
        K+      +   ILPT  NL+ + +E+   C  C  + E T+HAL+     + +W + +       +    F   + +  + +D+ ++ +     + P  
Subjt:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPR-

Query:  ----------------------SKSEQPWCLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLK
                               ++ Q  CL   A  +  +   SP L E  A LKGL    +L   +I++   + + + LIHGSS  L   G+ + D++
Subjt:  ----------------------SKSEQPWCLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLK

Query:  VKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLSNLILNE
             F+      VS   N +AD +A       +   WL +FP ++S++++N+
Subjt:  VKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLSNLILNE

KAF7845245.1 transcription factor SCREAM2-like isoform X1 [Senna tora] | e-value: 4.7e-05 | % identity: 20.87
Query:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPR-
        K+      +   ILPT  NL+ + +++   C  C    E T HAL++    + +W + +  V  P +  + F   + +++    + ++ +     + P  
Subjt:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPR-

Query:  ----------------------SKSEQPWCLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLK
                               ++ Q  CL   A  +  +   S  L E  A L+GL    +L   +I++   + +   LIHGSS  L   G+ + D++
Subjt:  ----------------------SKSEQPWCLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLK

Query:  VKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLSNLILNES
             F+      V    N +AD +A           WL++FP ++S++++++S
Subjt:  VKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLSNLILNES

TXG71533.1 hypothetical protein EZV62_000112 [Acer yangbiense] | e-value: 8.1e-05 | % identity: 33.33
Query:  FTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDN
        F   P +AE LAIL+GL F  S +   + + S +   +  I+  S P    GV ++D+ +    F+  SF  V    N +A  LAK  L     S WL++
Subjt:  FTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDN

Query:  FPLWLSNLILNESP
         PL + NL+L + P
Subjt:  FPLWLSNLILNESP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e-value: 2.8e-13 | % identity: 40.3
Query:  CLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQG
        C  + A S+   F LSPLLAEI  IL+GL F  + +   + V S S  AI LI       G E  W+ +++   C F + SF H S + N  A  LAK G
Subjt:  CLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQG

Query:  LQSLSFSF-WLDNFPLWLSNLILNESPSFVSHVA
        + S S ++ WL NFP WL +L+  + PS  +HVA
Subjt:  LQSLSFSF-WLDNFPLWLSNLILNESPSFVSHVA

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9I609 Uncharacterized protein | e-value: 6.7e-05 | % identity: 26.89
Query:  NILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIP-------NQIPRSK---
        N LPTL NLQR+ +  S  C  C  + E   HAL +  +   +W H  LT  I   +   F  + F HL  +  SD  LA +        N  PR K   
Subjt:  NILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIP-------NQIPRSK---

Query:  -SEQPWCLP----------LLAHSVHPRFTLSPLLAE-----ILAILKGLHFGHSL-SIGSILVHSV--STRAIGL----IHGSSLPL-----------G
          +Q W  P                H   ++  ++ +     I+ + K  H  H +    +I VH      R +GL    + G SL +            
Subjt:  -SEQPWCLP----------LLAHSVHPRFTLSPLLAE-----ILAILKGLHFGHSL-SIGSILVHSV--STRAIGL----IHGSSLPL-----------G

Query:  AEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQ-SLSFSFWLDNFPLWLSNLILNE
        + G  I D+   A       F HV    N +A +LA+ GL  S  F  WL++ P +L ++I +E
Subjt:  AEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQ-SLSFSFWLDNFPLWLSNLILNE

A0A5C7IQ65 RNase H domain-containing protein | e-value: 3.9e-05 | % identity: 33.33
Query:  FTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDN
        F   P +AE LAIL+GL F  S +   + + S +   +  I+  S P    GV ++D+ +    F+  SF  V    N +A  LAK  L     S WL++
Subjt:  FTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDN

Query:  FPLWLSNLILNESP
         PL + NL+L + P
Subjt:  FPLWLSNLILNESP

A0A6J1DX30 uncharacterized protein LOC111024874 | e-value: 1.3e-13 | % identity: 40.3
Query:  CLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQG
        C  + A S+   F LSPLLAEI  IL+GL F  + +   + V S S  AI LI       G E  W+ +++   C F + SF H S + N  A  LAK G
Subjt:  CLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQG

Query:  LQSLSFSF-WLDNFPLWLSNLILNESPSFVSHVA
        + S S ++ WL NFP WL +L+  + PS  +HVA
Subjt:  LQSLSFSF-WLDNFPLWLSNLILNESPSFVSHVA

A0A7J6EXH9 RNase H domain-containing protein | e-value: 3.9e-05 | % identity: 24.04
Query:  LELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPRSKSEQPWCLPLLAHSVHPRFTLSPL
        +E++  C  C    E   HAL   S+++ +W+  L+      D L + S    H   H      E         + KS       +L   + P    +P 
Subjt:  LELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAIPNQIPRSKSEQPWCLPLLAHSVHPRFTLSPL

Query:  LAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLS
        +AE  AIL+ +    +   G   + S     +  +H     L A    +H +K +  +F   + +HV  E N  A++LAK+GL++ +   +   FP WL+
Subjt:  LAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVKACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLS

Query:  NLILNESP
        N    + P
Subjt:  NLILNESP

A0A7N2LEC9 zf-RVT domain-containing protein | e-value: 1.9e-04 | % identity: 35.48
Query:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAI
        KV   A  + +N LPT+ NLQ++ +  S  C  CTT+ E T HAL + S+  ++WR +  T+      L  F+ L    L  KD+  KE+  I
Subjt:  KVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHLLKFSGLWFHHLSHKDNSDKELAAI

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCAGCTCCTCCACCGACTACACAGGAATCGCCAGCATCACGAGCACCATCTGCAGCACTCAAGGCACCTGAGCAAGAGCAGGCTGCCTCGAAGAAACGGTCGGGCAG
AAGGTGGTTTGTGAAGGTGTCCAAGATTGCTAGATCGAGCACGCACAATATTCTTCCTACATTATATAATTTACAGAGGAAAGAGTTAGAGTTATCATTGTACTGCCCTT
GCTGTACCACAAAGCTGGAAAAAACAGCTCATGCCCTTCTTTCCCACTCTAGATCCAGGGACATTTGGCGGCACGTTCTACTGACAGTTTCTATCCCAAATGATCATCTC
TTGAAGTTCTCTGGTTTATGGTTTCATCATCTAAGTCACAAGGATAATTCAGATAAGGAGCTTGCTGCAATTCCTAATCAGATCCCTAGGAGCAAATCGGAGCAGCCGTG
GTGCCTTCCTCTGCTGGCCCACTCCGTTCATCCTAGATTCACTCTTAGCCCTTTACTCGCAGAAATCTTGGCTATCCTTAAGGGTCTTCATTTTGGGCATTCGTTGAGTA
TTGGTAGCATCCTCGTTCATTCTGTTTCAACCCGAGCTATTGGGCTCATACATGGTTCTTCTCTTCCTCTTGGTGCTGAAGGCGTGTGGATCCATGATTTAAAGGTTAAA
GCTTGCGAGTTCACCTATTCCTCCTTCGTGCATGTTTCCCATGAGAGGAATCATCTAGCTGACTTGTTAGCGAAACAGGGGCTTCAATCATTGTCTTTTAGCTTCTGGCT
TGATAACTTTCCTCTTTGGCTTTCCAATCTTATCCTTAATGAAAGCCCCTCATTTGTATCCCATGTGGCATTTTCTTCTTAA
mRNA sequence
ATGTCAGCTCCTCCACCGACTACACAGGAATCGCCAGCATCACGAGCACCATCTGCAGCACTCAAGGCACCTGAGCAAGAGCAGGCTGCCTCGAAGAAACGGTCGGGCAG
AAGGTGGTTTGTGAAGGTGTCCAAGATTGCTAGATCGAGCACGCACAATATTCTTCCTACATTATATAATTTACAGAGGAAAGAGTTAGAGTTATCATTGTACTGCCCTT
GCTGTACCACAAAGCTGGAAAAAACAGCTCATGCCCTTCTTTCCCACTCTAGATCCAGGGACATTTGGCGGCACGTTCTACTGACAGTTTCTATCCCAAATGATCATCTC
TTGAAGTTCTCTGGTTTATGGTTTCATCATCTAAGTCACAAGGATAATTCAGATAAGGAGCTTGCTGCAATTCCTAATCAGATCCCTAGGAGCAAATCGGAGCAGCCGTG
GTGCCTTCCTCTGCTGGCCCACTCCGTTCATCCTAGATTCACTCTTAGCCCTTTACTCGCAGAAATCTTGGCTATCCTTAAGGGTCTTCATTTTGGGCATTCGTTGAGTA
TTGGTAGCATCCTCGTTCATTCTGTTTCAACCCGAGCTATTGGGCTCATACATGGTTCTTCTCTTCCTCTTGGTGCTGAAGGCGTGTGGATCCATGATTTAAAGGTTAAA
GCTTGCGAGTTCACCTATTCCTCCTTCGTGCATGTTTCCCATGAGAGGAATCATCTAGCTGACTTGTTAGCGAAACAGGGGCTTCAATCATTGTCTTTTAGCTTCTGGCT
TGATAACTTTCCTCTTTGGCTTTCCAATCTTATCCTTAATGAAAGCCCCTCATTTGTATCCCATGTGGCATTTTCTTCTTAA
Protein sequence
MSAPPPTTQESPASRAPSAALKAPEQEQAASKKRSGRRWFVKVSKIARSSTHNILPTLYNLQRKELELSLYCPCCTTKLEKTAHALLSHSRSRDIWRHVLLTVSIPNDHL
LKFSGLWFHHLSHKDNSDKELAAIPNQIPRSKSEQPWCLPLLAHSVHPRFTLSPLLAEILAILKGLHFGHSLSIGSILVHSVSTRAIGLIHGSSLPLGAEGVWIHDLKVK
ACEFTYSSFVHVSHERNHLADLLAKQGLQSLSFSFWLDNFPLWLSNLILNESPSFVSHVAFSS
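
The protein sequence above can be related to the CDS by translating codons in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The `translate` function and the variable names are illustrative only, and for brevity the example translates just the first 30 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGTCAGCTCCTCCACCGACTACACAGGAA"
print(translate(cds_prefix))  # MSAPPPTTQE
```

The output matches the first ten residues of the protein sequence; pasting the full CDS into `translate` would be the way to check the remaining residues.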