An efficient parameter estimation method for generalized Dirichlet priors in naïve Bayesian classifiers with multinomial models

Tzu Tsung Wong, Chao Rui Liu

研究成果: Article同行評審

3 引文 斯高帕斯(Scopus)

摘要

Generalized Dirichlet priors have been shown to be an effective way for improving the performance of naïve Bayesian classifiers with multinomial models, called multinomial naïve Bayesian classifiers, in document classification. For the sake of computational efficiency, a previous study divided distinct words into groups, and proposed a searching mechanism to determine the values of the parameters in a generalized Dirichlet prior group by group. That searching approach increases the computational cost of the multinomial naïve Bayesian classifier. In this paper, the covariance matrices for word groups are first calculated from available documents. A parameter estimation method and four strategies for choosing the value of a parameter corresponding to a word group are then proposed to solve for the parameters of the noninformative generalized Dirichlet priors for distinct words. The experimental results on two document sets show that the best strategy is to choose the largest value calculated from the statistics in a row, and that our parameter estimation method can efficiently solve for the parameters of generalized Dirichlet priors to significantly improve the performance of the multinomial naïve Bayesian classifier with respect to the searching approach.

原文English
頁(從 - 到)62-71
頁數10
期刊Pattern Recognition
60
DOIs
出版狀態Published - 2016 十二月 1

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

指紋 深入研究「An efficient parameter estimation method for generalized Dirichlet priors in naïve Bayesian classifiers with multinomial models」主題。共同形成了獨特的指紋。

引用此