; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS028270 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS028270
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold47:1681955..1682629
RNA-Seq ExpressionMS028270
SyntenyMS028270
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135916.1 uncharacterized protein LOC111007754 [Momordica charantia]5.8e-8398.04Show/hide
Query:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE
        MLDFIRGWK LIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE
Subjt:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE

Query:  GEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVDDSE
        G GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG+DDSE
Subjt:  GEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVDDSE

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia]5.7e-3038.7Show/hide
Query:  IPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQ-GVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GEGVGVILR
        I W  LE L V LW+IW ARN+S+  +  G LL  +  W  +YL +Y+ AQ G SL  +   R      W PPL PF KVNVDAA  K     G+ +++R
Subjt:  IPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQ-GVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GEGVGVILR

Query:  DSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV---------------------DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHML
        DS   V L+A+  ++ +  V  AEC A  +G+ L +EAG+                     D+SE+GVL+S I+  + S +I   FSF  R GN+ AH L
Subjt:  DSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV---------------------DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHML

Query:  ARRAVSSPGFQVWLEEAPLELSGALEEDRE
        AR  + S  F VW+EE   +LS  +  DR+
Subjt:  ARRAVSSPGFQVWLEEAPLELSGALEEDRE

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]8.3e-2133.64Show/hide
Query:  IRGWKELIPWADLELLLVMLWSIWCARN-RSISVSHAGGLLEGIR--AWSDNYLSIYKQAQGVSL-HEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKE
        +R   E +  AD E L V++W +W  RN R+ + S       G+    W++ Y   +++A+   +   V    ++    W PP     K+N DA+ ++ +
Subjt:  IRGWKELIPWADLELLLVMLWSIWCARN-RSISVSHAGGLLEGIR--AWSDNYLSIYKQAQGVSL-HEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKE

Query:  EGEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-----DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHMLARRAVS
        +  G+G+I+ +  G V  AA   L  I SVD AE  A  +GL+L  E G+     D SE G +    K F  + ++H +F+F +R GN  AHMLARRA+ 
Subjt:  EGEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-----DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHMLARRAVS

Query:  SPGFQVWLEEAPLELSGALE
           F +W+E+ PLEL   LE
Subjt:  SPGFQVWLEEAPLELSGALE

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]1.3e-4545.96Show/hide
Query:  FIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GE
        F+R W +L+ W  +  ++V+LW+IW ARN++      GG L  + +WS+NYL +Y+ AQ  S   +   R      W PP  P LKVNVDAA  KE    
Subjt:  FIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GE

Query:  GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG---------------------VDDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAG
        GVGVI+RDS G+VYL A+  L+    VDW E FAV++G+ L VEAG                     VDDSEVGVL SVIK FL SH   V+FSFT R G
Subjt:  GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG---------------------VDDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAG

Query:  NTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED
        N  AH+LA+ A++SP  Q+W+EE P E+S  L  D
Subjt:  NTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED

XP_030940241.1 uncharacterized protein LOC115965200 [Quercus lobata]9.5e-1732.91Show/hide
Query:  WKELIPWAD-----LELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKEEG
        +KEL+ W       LE+  V +WSIW  RNR + ++     +  I   S  +L+ Y QA+ +S + V    Q     W PP +   K+N D A    E+ 
Subjt:  WKELIPWAD-----LELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKEEG

Query:  EGVGVILRDSYGVVY------------------LAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-DDSEVGVLSSVIKE---FLHSHNIHVTFSFTRR
         G+GV++RDS G+V                   +AA W LSF  + D     AV +G  L V AG+ +D  V V   ++ E   FL      + +S T+R
Subjt:  EGVGVILRDSYGVVY------------------LAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-DDSEVGVLSSVIKE---FLHSHNIHVTFSFTRR

Query:  AGNTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED
         GN  AH LAR AV  P F VW+E+ P +     + D
Subjt:  AGNTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED

TrEMBL top hitse value%identityAlignment
A0A6J1C225 uncharacterized protein LOC1110077542.8e-8398.04Show/hide
Query:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE
        MLDFIRGWK LIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE
Subjt:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE

Query:  GEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVDDSE
        G GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG+DDSE
Subjt:  GEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVDDSE

A0A6J1C467 uncharacterized protein LOC1110077752.8e-3038.7Show/hide
Query:  IPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQ-GVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GEGVGVILR
        I W  LE L V LW+IW ARN+S+  +  G LL  +  W  +YL +Y+ AQ G SL  +   R      W PPL PF KVNVDAA  K     G+ +++R
Subjt:  IPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQ-GVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GEGVGVILR

Query:  DSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV---------------------DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHML
        DS   V L+A+  ++ +  V  AEC A  +G+ L +EAG+                     D+SE+GVL+S I+  + S +I   FSF  R GN+ AH L
Subjt:  DSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV---------------------DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHML

Query:  ARRAVSSPGFQVWLEEAPLELSGALEEDRE
        AR  + S  F VW+EE   +LS  +  DR+
Subjt:  ARRAVSSPGFQVWLEEAPLELSGALEEDRE

A0A6J1CDQ4 uncharacterized protein LOC1110105333.9e-1630.57Show/hide
Query:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQA--------QGVSLHEVDQPRQLWAGG----WNPPLNPFL
        M++ +R W++++ W D E L+V LWS+W  RN  +  +      + +  W   Y++ +K          Q VS     Q  Q+        W P      
Subjt:  MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQA--------QGVSLHEVDQPRQLWAGG----WNPPLNPFL

Query:  KVNVDAAVSK-EEGEGVGV-ILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVD-----------------DSEVGVLSSVIKEFLHSH-
        K+  DA+ S  +   G+GV I+RD  G V  +A   L  + SVD AE  A  +GLR+ +E G+                  D E    +  I E++ +H 
Subjt:  KVNVDAAVSK-EEGEGVGV-ILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVD-----------------DSEVGVLSSVIKEFLHSH-

Query:  --NIHVTFSFTRRAGNTTAHMLARRAVSS
           + V++SFT+R GNT AH+LARRA+ S
Subjt:  --NIHVTFSFTRRAGNTTAHMLARRAVSS

A0A6J1DAR4 uncharacterized protein LOC1110189544.0e-2133.64Show/hide
Query:  IRGWKELIPWADLELLLVMLWSIWCARN-RSISVSHAGGLLEGIR--AWSDNYLSIYKQAQGVSL-HEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKE
        +R   E +  AD E L V++W +W  RN R+ + S       G+    W++ Y   +++A+   +   V    ++    W PP     K+N DA+ ++ +
Subjt:  IRGWKELIPWADLELLLVMLWSIWCARN-RSISVSHAGGLLEGIR--AWSDNYLSIYKQAQGVSL-HEVDQPRQLWAGGWNPPLNPFLKVNVDAA-VSKE

Query:  EGEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-----DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHMLARRAVS
        +  G+G+I+ +  G V  AA   L  I SVD AE  A  +GL+L  E G+     D SE G +    K F  + ++H +F+F +R GN  AHMLARRA+ 
Subjt:  EGEGVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGV-----DDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHMLARRAVS

Query:  SPGFQVWLEEAPLELSGALE
           F +W+E+ PLEL   LE
Subjt:  SPGFQVWLEEAPLELSGALE

A0A6J1DBJ7 uncharacterized protein LOC1110189736.2e-4645.96Show/hide
Query:  FIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GE
        F+R W +L+ W  +  ++V+LW+IW ARN++      GG L  + +WS+NYL +Y+ AQ  S   +   R      W PP  P LKVNVDAA  KE    
Subjt:  FIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEE-GE

Query:  GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG---------------------VDDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAG
        GVGVI+RDS G+VYL A+  L+    VDW E FAV++G+ L VEAG                     VDDSEVGVL SVIK FL SH   V+FSFT R G
Subjt:  GVGVILRDSYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAG---------------------VDDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAG

Query:  NTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED
        N  AH+LA+ A++SP  Q+W+EE P E+S  L  D
Subjt:  NTTAHMLARRAVSSPGFQVWLEEAPLELSGALEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGATTTTATTCGGGGGTGGAAGGAGCTAATTCCCTGGGCTGACTTGGAGCTTTTGTTGGTTATGTTATGGTCGATTTGGTGCGCACGGAATCGATCTATTTCGGT
TTCTCATGCGGGTGGCTTACTCGAGGGGATCAGAGCATGGTCGGATAATTATCTTAGTATATATAAGCAGGCGCAGGGCGTTTCTCTCCATGAGGTGGATCAACCAAGGC
AGCTTTGGGCCGGAGGTTGGAACCCACCGTTGAACCCCTTTCTCAAGGTGAATGTAGACGCGGCCGTCTCGAAAGAGGAGGGAGAAGGAGTGGGGGTTATCCTCCGAGAT
TCTTATGGGGTCGTCTATCTTGCTGCGGTTTGGCCCCTTTCTTTCATTCCAAGCGTCGATTGGGCGGAATGTTTCGCGGTTTTTGACGGTTTGCGACTAGGAGTGGAGGC
TGGTGTGGACGATTCGGAGGTTGGGGTCCTGAGTTCTGTCATCAAAGAATTTCTTCATTCTCATAATATTCATGTCACGTTTAGTTTTACTCGCAGAGCCGGCAATACCA
CTGCTCATATGCTTGCTCGCCGGGCCGTATCTTCTCCTGGATTTCAAGTTTGGTTGGAGGAAGCGCCTCTGGAGCTTTCTGGAGCGTTGGAGGAGGACCGCGAGTTTTGT
TTTCGCTTTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGATTTTATTCGGGGGTGGAAGGAGCTAATTCCCTGGGCTGACTTGGAGCTTTTGTTGGTTATGTTATGGTCGATTTGGTGCGCACGGAATCGATCTATTTCGGT
TTCTCATGCGGGTGGCTTACTCGAGGGGATCAGAGCATGGTCGGATAATTATCTTAGTATATATAAGCAGGCGCAGGGCGTTTCTCTCCATGAGGTGGATCAACCAAGGC
AGCTTTGGGCCGGAGGTTGGAACCCACCGTTGAACCCCTTTCTCAAGGTGAATGTAGACGCGGCCGTCTCGAAAGAGGAGGGAGAAGGAGTGGGGGTTATCCTCCGAGAT
TCTTATGGGGTCGTCTATCTTGCTGCGGTTTGGCCCCTTTCTTTCATTCCAAGCGTCGATTGGGCGGAATGTTTCGCGGTTTTTGACGGTTTGCGACTAGGAGTGGAGGC
TGGTGTGGACGATTCGGAGGTTGGGGTCCTGAGTTCTGTCATCAAAGAATTTCTTCATTCTCATAATATTCATGTCACGTTTAGTTTTACTCGCAGAGCCGGCAATACCA
CTGCTCATATGCTTGCTCGCCGGGCCGTATCTTCTCCTGGATTTCAAGTTTGGTTGGAGGAAGCGCCTCTGGAGCTTTCTGGAGCGTTGGAGGAGGACCGCGAGTTTTGT
TTTCGCTTTATTTAA
Protein sequenceShow/hide protein sequence
MLDFIRGWKELIPWADLELLLVMLWSIWCARNRSISVSHAGGLLEGIRAWSDNYLSIYKQAQGVSLHEVDQPRQLWAGGWNPPLNPFLKVNVDAAVSKEEGEGVGVILRD
SYGVVYLAAVWPLSFIPSVDWAECFAVFDGLRLGVEAGVDDSEVGVLSSVIKEFLHSHNIHVTFSFTRRAGNTTAHMLARRAVSSPGFQVWLEEAPLELSGALEEDREFC
FRFI