摘要
In this paper, we present a low bit rate speech coder with better speaker recognizability using the selection of glottal excitation for each speaker. In order to suitably represent the glottal excitation for each speaker, the excitation pulse determination algorithm used in multi-pulse excited linear predictive coding is adopted. In this system, 25 periodic pulses are determined for a voiced frame. A period of speaker-specific excitation pattern with only 3 pulses is chosen from the 25 pulses using a proposed pattern selection method. In the decoder, the 3-pulse pattern is smoothed using an FIR low pass filter in order to obtain a more smooth and continuous pattern. For unvoiced speech, random white noise is used as the excitation pattern. Informal listening test indicates that the speaker recognizability performance of our coder is better than that of LPC-10e.
原文 | English |
---|---|
頁面 | 617-620 |
頁數 | 4 |
出版狀態 | Published - 1997 |
事件 | Proceedings of the 1997 IEEE TENCON Conference. Part 1 (of 2) - Brisbane, Australia 持續時間: 1997 12月 2 → 1997 12月 4 |
Other
Other | Proceedings of the 1997 IEEE TENCON Conference. Part 1 (of 2) |
---|---|
城市 | Brisbane, Australia |
期間 | 97-12-02 → 97-12-04 |
All Science Journal Classification (ASJC) codes
- 電腦科學應用
- 電氣與電子工程