Abstract
A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.
| Original language | English |
|---|---|
| Pages | 519-523 |
| Number of pages | 5 |
| Publication status | Published - 1994 Dec 1 |
| Event | Proceedings of the 1994 IEEE Asia-Pacific Conference on Circuits and Systems - Taipei, Taiwan Duration: 1994 Dec 5 → 1994 Dec 8 |
Other
| Other | Proceedings of the 1994 IEEE Asia-Pacific Conference on Circuits and Systems |
|---|---|
| City | Taipei, Taiwan |
| Period | 94-12-05 → 94-12-08 |
All Science Journal Classification (ASJC) codes
- Electrical and Electronic Engineering