CuGenDBv2

Moc02g18450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g18450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: RNase H domain-containing protein
Genome location: chr2:13749548..13750291
RNA-Seq Expression: Moc02g18450
Synteny: Moc02g18450
Gene Ontology terms:
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
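
The genome location field uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and measured as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end = parse_location("chr2:13749548..13750291")
print(seqid, span_length(start, end))
```

Under that assumption the span is 744 bp, which appears to equal the length of the CDS listed under Sequences once its wrapped lines are joined. That would be consistent with a single-exon gene model.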


Homology
GenBank top hits (e value, % identity, alignment)
TXG50387.1 hypothetical protein EZV62_022911 [Acer yangbiense] (e value: 5.5e-18; % identity: 32.44)
Query:  PFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEA--CEWANNYVMDFRGANSNPFLGRFTNTVEIL-----WQPPNEGIYKIN
        P +IL  W  S+    F  L +  W LW  RN       +V      W  A    W +N+  +FR AN         N  E+L     W+ P  G +KIN
Subjt:  PFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEA--CEWANNYVMDFRGANSNPFLGRFTNTVEIL-----WQPPNEGIYKIN

Query:  TDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTL
         DASF      AG+G+II + +G  +A  +  V    S +M EA A +EG+ LA +IGV+ VI+E+D++ +  L S  +   +E+G I+  +        
Subjt:  TDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTL

Query:  HASFNFVKREDNKAAHLLARRGLLL
          S+  V+RE N  AH +A+  L L
Subjt:  HASFNFVKREDNKAAHLLARRGLLL

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia] (e value: 4.0e-24; % identity: 34.23)
Query:  ESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFL--------GRFTNTVEILWQPPNEGIYKINTDASFLAS
        + + W G EEL V +W +WN RN    S+ T    FL  N   +W  +Y+  ++ A   P L        GR +      W PP    +K+N DA+F   
Subjt:  ESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFL--------GRFTNTVEILWQPPNEGIYKINTDASFLAS

Query:  DQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH--ASFNF
        +  AGL I+I +    V+ +A  ++ ++    + E +AA EG+ LA E G+ P  +ETDSS +FNL     ED SE+G +    ++    +LH    F+F
Subjt:  DQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH--ASFNF

Query:  VKREDNKAAHLLARRGLLLREF
        V RE N  AH LAR G++   F
Subjt:  VKREDNKAAHLLARRGLLLREF

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia] (e value: 2.5e-42; % identity: 42.42)
Query:  ILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGR------FTNTVEI-------LWQPPNEGIY
        ILR+W + L+W  FEEL V +W LWN+RNA         K+ +  ++   W + Y+  F+  N+N           F  + +I       +W P  EG++
Subjt:  ILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGR------FTNTVEI-------LWQPPNEGIY

Query:  KINTDASFLASDQHAGLGIIICND-RGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFW
        K+ TDASF + D +AGLG+II  D RGQV+A+ATKY+E++ S D  EA+AAVEGL++A E G++P++LETDS  I+NLF++  E LS+ G I+   K   
Subjt:  KINTDASFLASDQHAGLGIIICND-RGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFW

Query:  TQTLHASFNFVKREDNKAAHLLARRGLLLRE
           L  S++F KR  N  AHLLARR L  +E
Subjt:  TQTLHASFNFVKREDNKAAHLLARRGLLLRE

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia] (e value: 2.5e-74; % identity: 85.8)
Query:  EWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGV
        EWAN YVM+FR ANSNPF GR TNT E+LW PP++ IYKINTDASFLASDQHAGLGIII NDRGQVMA+ATKY+ENIQS DM EAI AVEGLQLAS+IGV
Subjt:  EWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGV

Query:  NPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF
        NPVILETDSS IFNLFSQPSEDLSE GEIVLKAKNFWTQ+LHASFNFVKRE NKAAH+LARR LLLREF
Subjt:  NPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 3.4e-84; % identity: 71.97)
Query:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILW
        +WINSKFGKL       SPFLILRE HESLS A FEELCVVIWGLWNQRNAR  +  T +  F    E  EWAN Y M+FR A SNP  GR TNT EILW
Subjt:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILW

Query:  QPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIV
        QPP+EGIYKINTDASFLASDQHAGLGIII NDRGQVMA ATKY+ENIQS DM EAIAAVEGLQLASEIG++P +                EDLSE GEIV
Subjt:  QPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIV

Query:  LKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF
        LKAKNFWTQ+LHASFNFVKRE NKAAH+LARR LLL EF
Subjt:  LKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF

TrEMBL top hits (e value, % identity, alignment)
A0A2N9I1D9 Uncharacterized protein (e value: 8.4e-20; % identity: 27.8)
Query:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNP--FLGRFTNTVEI
        +W +  +G+     HY +   +    H  LS +  +      W +W +RN + L+     +Q    N+    A + + +F+ A  +P    G+     ++
Subjt:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNP--FLGRFTNTVEI

Query:  LWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGE
         W+PP  G YK N D +F      AG+G+II N RG  MA+  + +    S +  EA AA   ++L++++G+  V +E DS  I N    P    +  G 
Subjt:  LWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGE

Query:  IVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF
        +V   K     +L   F  VKR+ N  AH LA+R    + F
Subjt:  IVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF

A0A6J1C467 uncharacterized protein LOC111007775 (e value: 1.9e-24; % identity: 34.23)
Query:  ESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFL--------GRFTNTVEILWQPPNEGIYKINTDASFLAS
        + + W G EEL V +W +WN RN    S+ T    FL  N   +W  +Y+  ++ A   P L        GR +      W PP    +K+N DA+F   
Subjt:  ESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFL--------GRFTNTVEILWQPPNEGIYKINTDASFLAS

Query:  DQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH--ASFNF
        +  AGL I+I +    V+ +A  ++ ++    + E +AA EG+ LA E G+ P  +ETDSS +FNL     ED SE+G +    ++    +LH    F+F
Subjt:  DQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH--ASFNF

Query:  VKREDNKAAHLLARRGLLLREF
        V RE N  AH LAR G++   F
Subjt:  VKREDNKAAHLLARRGLLLREF

A0A6J1CDQ4 uncharacterized protein LOC111010533 (e value: 1.2e-42; % identity: 42.42)
Query:  ILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGR------FTNTVEI-------LWQPPNEGIY
        ILR+W + L+W  FEEL V +W LWN+RNA         K+ +  ++   W + Y+  F+  N+N           F  + +I       +W P  EG++
Subjt:  ILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGR------FTNTVEI-------LWQPPNEGIY

Query:  KINTDASFLASDQHAGLGIIICND-RGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFW
        K+ TDASF + D +AGLG+II  D RGQV+A+ATKY+E++ S D  EA+AAVEGL++A E G++P++LETDS  I+NLF++  E LS+ G I+   K   
Subjt:  KINTDASFLASDQHAGLGIIICND-RGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFW

Query:  TQTLHASFNFVKREDNKAAHLLARRGLLLRE
           L  S++F KR  N  AHLLARR L  +E
Subjt:  TQTLHASFNFVKREDNKAAHLLARRGLLLRE

A0A6J1CIF1 uncharacterized protein LOC111011237 (e value: 1.2e-74; % identity: 85.8)
Query:  EWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGV
        EWAN YVM+FR ANSNPF GR TNT E+LW PP++ IYKINTDASFLASDQHAGLGIII NDRGQVMA+ATKY+ENIQS DM EAI AVEGLQLAS+IGV
Subjt:  EWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGV

Query:  NPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF
        NPVILETDSS IFNLFSQPSEDLSE GEIVLKAKNFWTQ+LHASFNFVKRE NKAAH+LARR LLLREF
Subjt:  NPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 1.7e-84; % identity: 71.97)
Query:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILW
        +WINSKFGKL       SPFLILRE HESLS A FEELCVVIWGLWNQRNAR  +  T +  F    E  EWAN Y M+FR A SNP  GR TNT EILW
Subjt:  MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILW

Query:  QPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIV
        QPP+EGIYKINTDASFLASDQHAGLGIII NDRGQVMA ATKY+ENIQS DM EAIAAVEGLQLASEIG++P +                EDLSE GEIV
Subjt:  QPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIV

Query:  LKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF
        LKAKNFWTQ+LHASFNFVKRE NKAAH+LARR LLL EF
Subjt:  LKAKNFWTQTLHASFNFVKREDNKAAHLLARRGLLLREF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.3e-06; % identity: 24.27)
Query:  VIWGLWNQRN-----ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQ
        ++W LW  RN      +E     V ++ +   E  EW+     +  G  S P + R    + + W+ P     K NTDA++   +   G+G I+ N+ G 
Subjt:  VIWGLWNQRN-----ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQ

Query:  VM---ATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH----ASFNFVKREDNKAAHL
        V+   A A    +N+   ++     AV  +   S      +I E+D+  + NL +       +    +  A     Q LH      F F  R  NK A  
Subjt:  VM---ATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLH----ASFNFVKREDNKAAHL

Query:  LARRGL
        +AR  +
Subjt:  LARRGL

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 3.6e-07; % identity: 26.79)
Query:  VIWGLWNQRN------ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRG
        +IW +W  RN       RE    TV        E  +W N      +     P   R     +I W+ P     K N DA F      A  G II N  G
Subjt:  VIWGLWNQRN------ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRG

Query:  QVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHA-SFNFVKREDNKAAHLLARRG
          ++  +  + +  +    E  A +  LQ     G   V +E D   + NL +  S   S      L+  +FW     +  F F++R+ NK AH+LA+ G
Subjt:  QVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHA-SFNFVKREDNKAAHLLARRG

Query:  LLLREFRSG
             F SG
Subjt:  LLLREFRSG

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.6e-07; % identity: 23.12)
Query:  VIWGLWNQRN-----ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQ
        ++W LW  RN      RE +   V ++    ++  EW      +     + P + R +      W+PP     K NTDA++   ++  G+G ++ N++G+
Subjt:  VIWGLWNQRN-----ARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKINTDASFLASDQHAGLGIIICNDRGQ

Query:  VMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGL
        V     + +  ++S    E  A    +   S    N VI E+DS  +  + +   E    +   +   +   +Q     F F+ RE N  A  +AR  L
Subjt:  VMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKREDNKAAHLLARRGL


Sequences
CDS sequence
ATGTGGATTAATTCGAAATTTGGAAAGCTATCTCCTATTCAGCATTATCACTCTCCATTTCTTATTCTTCGAGAGTGGCATGAAAGTCTAAGCTGGGCGGGTTTTGAGGA
ATTATGTGTTGTTATTTGGGGGCTGTGGAATCAAAGGAATGCACGCGAGCTTTCAATTCTGACAGTACAAAAACAGTTTTTAAGATGGAATGAGGCTTGTGAATGGGCAA
ATAATTATGTCATGGACTTTAGGGGAGCTAACTCTAATCCTTTTCTTGGGAGATTTACAAATACAGTAGAGATTTTATGGCAACCACCAAACGAAGGAATATATAAAATT
AACACTGATGCCTCTTTTTTAGCTTCAGATCAGCATGCAGGATTGGGAATCATCATCTGTAATGACAGAGGGCAAGTTATGGCTACAGCTACGAAGTACGTGGAGAATAT
TCAATCAGGGGATATGCCGGAAGCAATTGCTGCAGTGGAGGGACTTCAACTAGCGTCGGAAATTGGTGTCAACCCAGTGATTTTGGAGACCGATTCATCTTGTATTTTCA
ATCTTTTCTCTCAACCTTCGGAGGACCTGTCAGAAATGGGAGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAAACTTTACATGCAAGTTTCAATTTCGTGAAGAGG
GAGGATAATAAAGCGGCTCACTTGTTGGCTCGGCGGGGTCTCCTTCTTCGTGAGTTTCGATCTGGATGGAGGATTGGCCATTAG
mRNA sequence
ATGTGGATTAATTCGAAATTTGGAAAGCTATCTCCTATTCAGCATTATCACTCTCCATTTCTTATTCTTCGAGAGTGGCATGAAAGTCTAAGCTGGGCGGGTTTTGAGGA
ATTATGTGTTGTTATTTGGGGGCTGTGGAATCAAAGGAATGCACGCGAGCTTTCAATTCTGACAGTACAAAAACAGTTTTTAAGATGGAATGAGGCTTGTGAATGGGCAA
ATAATTATGTCATGGACTTTAGGGGAGCTAACTCTAATCCTTTTCTTGGGAGATTTACAAATACAGTAGAGATTTTATGGCAACCACCAAACGAAGGAATATATAAAATT
AACACTGATGCCTCTTTTTTAGCTTCAGATCAGCATGCAGGATTGGGAATCATCATCTGTAATGACAGAGGGCAAGTTATGGCTACAGCTACGAAGTACGTGGAGAATAT
TCAATCAGGGGATATGCCGGAAGCAATTGCTGCAGTGGAGGGACTTCAACTAGCGTCGGAAATTGGTGTCAACCCAGTGATTTTGGAGACCGATTCATCTTGTATTTTCA
ATCTTTTCTCTCAACCTTCGGAGGACCTGTCAGAAATGGGAGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAAACTTTACATGCAAGTTTCAATTTCGTGAAGAGG
GAGGATAATAAAGCGGCTCACTTGTTGGCTCGGCGGGGTCTCCTTCTTCGTGAGTTTCGATCTGGATGGAGGATTGGCCATTAG
Protein sequence
MWINSKFGKLSPIQHYHSPFLILREWHESLSWAGFEELCVVIWGLWNQRNARELSILTVQKQFLRWNEACEWANNYVMDFRGANSNPFLGRFTNTVEILWQPPNEGIYKI
NTDASFLASDQHAGLGIIICNDRGQVMATATKYVENIQSGDMPEAIAAVEGLQLASEIGVNPVILETDSSCIFNLFSQPSEDLSEMGEIVLKAKNFWTQTLHASFNFVKR
EDNKAAHLLARRGLLLREFRSGWRIGH
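
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies and that translation starts at the first base; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS read from its first base; '*' marks a stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing characters other than T, C, A, G are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

print(translate("ATGTGGATTAATTCG"))  # first five codons of the CDS
print(translate("GGCCATTAG"))        # last three codons of the CDS
```

The two calls print `MWINS` and `GH*`, which agree with the first and last residues of the listed protein sequence followed by the stop codon. Passing the full CDS, with its line breaks, to `translate` would allow the whole protein to be compared in the same way.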