A hybrid discretization method for naïve Bayesian classifiers

Research output: Contribution to journalArticle

29 Citations (Scopus)

Abstract

Since naïve Bayesian classifiers are suitable for processing discrete attributes, many methods have been proposed for discretizing continuous ones. However, none of the previous studies apply more than one discretization method to the continuous attributes in a data set for naïve Bayesian classifiers. Different approaches employ different information embedded in continuous attributes to determine the boundaries for discretization. It is likely that discretizing the continuous attributes in a data set using different methods can utilize the information embedded in the attributes more thoroughly and thus improve the performance of naïve Bayesian classifiers. In this study, we propose a nonparametric measure to evaluate the dependence level between a continuous attribute and the class. The nonparametric measure is then used to develop a hybrid method for discretizing continuous attributes so that the accuracy of the naïve Bayesian classifier can be enhanced. This hybrid method is tested on 20 data sets, and the results demonstrate that discretizing the continuous attributes in a data set by various methods can generally have a higher prediction accuracy.

Original languageEnglish
Pages (from-to)2321-2325
Number of pages5
JournalPattern Recognition
Volume45
Issue number6
DOIs
Publication statusPublished - 2012 Jun 1

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint Dive into the research topics of 'A hybrid discretization method for naïve Bayesian classifiers'. Together they form a unique fingerprint.

  • Cite this