; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005272 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005272
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold83:492217..492888
RNA-Seq ExpressionMS005272
SyntenyMS005272
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142956.1 uncharacterized protein LOC111012950 [Momordica charantia]7.9e-6499.21Show/hide
Query:  TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR
        TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLAD LNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR
Subjt:  TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR

Query:  VLARNFIDFNFLHVCRSSNKAARRVA
        VLARNFIDFNFLHVCRSSNKAARRVA
Subjt:  VLARNFIDFNFLHVCRSSNKAARRVA

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]2.2e-6985.71Show/hide
Query:  PQCDFNNSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGW
        PQCDFNNSVQD LLY VESLST DFDLVGVGVW IW DRNAIR+QRQIPDAKIRSDWILTYVRDFQ RDVPSLGDFRIQEDASSN  T E+YWSPPP GW
Subjt:  PQCDFNNSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGW

Query:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEG
        IKINVD ACKKHQFRT IGIVCRNEKGQILAAAS      HDPLMAESLAL +G
Subjt:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEG

XP_022154440.1 uncharacterized protein LOC111021711 [Momordica charantia]2.7e-2745.86Show/hide
Query:  SYWSPPPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSN
        +YW+PP   W+KINVD ACK    RT IG+VC+N+KG+I+ A  R+++ +++PL  E+LA+LEGL L   L++  VQ +SDS   INL++++       N
Subjt:  SYWSPPPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSN

Query:  VWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA
        VW++DI + A NFID  F+H+ RSS+    R+A
Subjt:  VWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]8.9e-2333.82Show/hide
Query:  VESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRT
        +++LST +  L G+  W +W DR+A+  +++IP+A I+ +WIL Y  + + +            +         + W PP  G +K+N D A  +    +
Subjt:  VESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRT

Query:  AIGIVCRNEKGQILAAASRRMEAYH---DPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNV-WLEDIRVLARNFIDFNFLHVC
         +G++ R    +I+ A    M  +H    PL+A+ LA+ EGL LA RL + +V  E+DSL+ +NL+  D   W G  V W+EDIR  AR+F +  F HV 
Subjt:  AIGIVCRNEKGQILAAASRRMEAYH---DPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNV-WLEDIRVLARNFIDFNFLHVC

Query:  RSSNKAA
        R SN  A
Subjt:  RSSNKAA

XP_023872391.1 uncharacterized protein LOC111985003 [Quercus suber]5.4e-2031.68Show/hide
Query:  LDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTAIGIVC
        LD++L  V  W++W  RN ++   Q  +A   +  +  Y+++F+ ++  S G   I           +  W+PP  GW KIN+D A         +G+V 
Subjt:  LDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTAIGIVC

Query:  RNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTE-DPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAARR
        RNE G ++ A S+R +     L  E+LA+ EG+ LA  L +++++ ESDS  +++ LT+ D + W  S V +E +R+  R F  +   H CR +N AA +
Subjt:  RNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTE-DPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAARR

Query:  VA
        +A
Subjt:  VA

TrEMBL top hitse value%identityAlignment
A0A2N9EEE3 Uncharacterized protein9.3e-1829.15Show/hide
Query:  NSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQ--RQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESY----WSPPPTGW
        +SV + + Y +E  +T+D ++  V  W IW+ RN+++ Q   + PD        L  +++FQ   V             S     +SY    W PPPTG 
Subjt:  NSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQ--RQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESY----WSPPPTGW

Query:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLA
        + IN D A  K +    IG++ R++ G  +A  S+++   H     E+ A  E   LA  L +++V FE DS  +I+ L    +        +ED +V+ 
Subjt:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLA

Query:  RNFIDFNFLHVCRSSNKAARRVA
         NF   +F+H+ R  N AA  +A
Subjt:  RNFIDFNFLHVCRSSNKAARRVA

A0A6J1CNR4 uncharacterized protein LOC1110129503.8e-6499.21Show/hide
Query:  TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR
        TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLAD LNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR
Subjt:  TGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIR

Query:  VLARNFIDFNFLHVCRSSNKAARRVA
        VLARNFIDFNFLHVCRSSNKAARRVA
Subjt:  VLARNFIDFNFLHVCRSSNKAARRVA

A0A6J1CTE3 uncharacterized protein LOC1110145781.0e-6985.71Show/hide
Query:  PQCDFNNSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGW
        PQCDFNNSVQD LLY VESLST DFDLVGVGVW IW DRNAIR+QRQIPDAKIRSDWILTYVRDFQ RDVPSLGDFRIQEDASSN  T E+YWSPPP GW
Subjt:  PQCDFNNSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGW

Query:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEG
        IKINVD ACKKHQFRT IGIVCRNEKGQILAAAS      HDPLMAESLAL +G
Subjt:  IKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEG

A0A6J1DJL6 uncharacterized protein LOC1110217111.3e-2745.86Show/hide
Query:  SYWSPPPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSN
        +YW+PP   W+KINVD ACK    RT IG+VC+N+KG+I+ A  R+++ +++PL  E+LA+LEGL L   L++  VQ +SDS   INL++++       N
Subjt:  SYWSPPPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSN

Query:  VWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA
        VW++DI + A NFID  F+H+ RSS+    R+A
Subjt:  VWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA

A0A6J1DZK3 uncharacterized protein LOC1110249684.3e-2333.82Show/hide
Query:  VESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRT
        +++LST +  L G+  W +W DR+A+  +++IP+A I+ +WIL Y  + + +            +         + W PP  G +K+N D A  +    +
Subjt:  VESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRT

Query:  AIGIVCRNEKGQILAAASRRMEAYH---DPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNV-WLEDIRVLARNFIDFNFLHVC
         +G++ R    +I+ A    M  +H    PL+A+ LA+ EGL LA RL + +V  E+DSL+ +NL+  D   W G  V W+EDIR  AR+F +  F HV 
Subjt:  AIGIVCRNEKGQILAAASRRMEAYH---DPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNV-WLEDIRVLARNFIDFNFLHVC

Query:  RSSNKAA
        R SN  A
Subjt:  RSSNKAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25270.1 Ribonuclease H-like superfamily protein9.2e-1025.65Show/hide
Query:  VWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARD--VPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTA-IGIVCRNEKGQ
        +W +WK RN +  Q++    +         V++++  +  V SL   ++            + W  PP+ WIK N D A   HQ R A  G + R+E G 
Subjt:  VWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARD--VPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTA-IGIVCRNEKGQ

Query:  ILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAA
         + +         D L +E  AL+  ++ A     ++V FE DS Q+  L+  + +++ G   W+ + R   + F +  F  V R++N+ A
Subjt:  ILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAA

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-0530.47Show/hide
Query:  PPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLED
        P    + I  D A K        G V RN                  PLMAE++AL   L+ A  + I ++   SDS QLI  +T +  S     + + D
Subjt:  PPTGWIKINVDVACKKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLED

Query:  IRVLARNFIDFNFLHVCRSSNKAARRVA
        I  L+  F D +F  V RS N+ A  +A
Subjt:  IRVLARNFIDFNFLHVCRSSNKAARRVA

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)7.3e-0721.48Show/hide
Query:  VWTIWKDRNAIRIQR--QIP------DAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTAIGIVCR
        +W +WK RN    Q+  + P        +  ++W+ T + D          +    E  +         WSPPP G++K N D    + +  T+   + R
Subjt:  VWTIWKDRNAIRIQR--QIP------DAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVACKKHQFRTAIGIVCR

Query:  NEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDS
        +  G ++ +   +++  +  L AE+L  L  L++      + V FE ++
Subjt:  NEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDS

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1729.56Show/hide
Query:  VWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDV-----PSLGDFRIQEDASSNGWTTE------SYWSPPPTGWIKINVDVACKKHQFRTAIGI
        +W +WK+RN +  +                 R+F A++V       L ++RI+ +A S G   +        W PPP  W+K N D    +   R  IG 
Subjt:  VWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDV-----PSLGDFRIQEDASSNGWTTE------SYWSPPPTGWIKINVDVACKKHQFRTAIGI

Query:  VCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAAR
        V RNEKG++    +R +      L AE  A+   +    R     V FESDS  LI +L  D I W      ++D++ L   F +  F+ + R  N  A 
Subjt:  VCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAAR

Query:  RVA
        RVA
Subjt:  RVA

AT5G38920.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.4e-0433.33Show/hide
Query:  LMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTE-------DPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA
        L+ E+ A+   +    RLN ++V FESDS QL+++L +       DPI        L+DI++L ++F +  F+ + R  N  A R+A
Subjt:  LMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTE-------DPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAARRVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTCCCGCAATGTGATTTTAATAATTCCGTTCAAGATAGATTGTTGTATCCAGTGGAGTCGCTCTCTACTCTAGACTTTGATTTGGTTGGAGTTGGTGTTTGGACAATCTG
GAAGGATCGAAATGCAATTAGAATTCAAAGACAAATTCCAGACGCTAAGATTAGGAGCGATTGGATCCTTACCTATGTGAGAGATTTTCAAGCCCGTGATGTCCCTTCTC
TTGGTGATTTTCGGATTCAGGAGGATGCAAGTTCGAATGGTTGGACTACTGAGAGTTACTGGTCGCCTCCTCCGACTGGTTGGATTAAAATAAACGTGGATGTAGCATGC
AAGAAACATCAATTTAGAACTGCAATAGGGATTGTTTGCAGAAATGAAAAGGGGCAAATTTTGGCAGCAGCATCTCGTAGAATGGAGGCTTACCATGATCCGCTAATGGC
AGAATCTCTTGCTCTCCTTGAAGGCCTGCGTCTGGCTGACCGTCTGAATATCCAACAAGTACAGTTTGAATCAGATTCTTTGCAATTGATCAATTTGCTAACTGAGGATC
CTATTTCTTGGTGCGGTTCCAATGTCTGGTTAGAAGATATACGTGTATTGGCGAGAAATTTCATTGATTTCAATTTTTTGCATGTCTGTCGGTCCTCAAACAAGGCAGCC
CGCAGGGTTGCT
mRNA sequenceShow/hide mRNA sequence
GTCCCGCAATGTGATTTTAATAATTCCGTTCAAGATAGATTGTTGTATCCAGTGGAGTCGCTCTCTACTCTAGACTTTGATTTGGTTGGAGTTGGTGTTTGGACAATCTG
GAAGGATCGAAATGCAATTAGAATTCAAAGACAAATTCCAGACGCTAAGATTAGGAGCGATTGGATCCTTACCTATGTGAGAGATTTTCAAGCCCGTGATGTCCCTTCTC
TTGGTGATTTTCGGATTCAGGAGGATGCAAGTTCGAATGGTTGGACTACTGAGAGTTACTGGTCGCCTCCTCCGACTGGTTGGATTAAAATAAACGTGGATGTAGCATGC
AAGAAACATCAATTTAGAACTGCAATAGGGATTGTTTGCAGAAATGAAAAGGGGCAAATTTTGGCAGCAGCATCTCGTAGAATGGAGGCTTACCATGATCCGCTAATGGC
AGAATCTCTTGCTCTCCTTGAAGGCCTGCGTCTGGCTGACCGTCTGAATATCCAACAAGTACAGTTTGAATCAGATTCTTTGCAATTGATCAATTTGCTAACTGAGGATC
CTATTTCTTGGTGCGGTTCCAATGTCTGGTTAGAAGATATACGTGTATTGGCGAGAAATTTCATTGATTTCAATTTTTTGCATGTCTGTCGGTCCTCAAACAAGGCAGCC
CGCAGGGTTGCT
Protein sequenceShow/hide protein sequence
VPQCDFNNSVQDRLLYPVESLSTLDFDLVGVGVWTIWKDRNAIRIQRQIPDAKIRSDWILTYVRDFQARDVPSLGDFRIQEDASSNGWTTESYWSPPPTGWIKINVDVAC
KKHQFRTAIGIVCRNEKGQILAAASRRMEAYHDPLMAESLALLEGLRLADRLNIQQVQFESDSLQLINLLTEDPISWCGSNVWLEDIRVLARNFIDFNFLHVCRSSNKAA
RRVA