Perceptual speech modeling for noisy speech recognition

Chung Hsien Wu, Yu Hsien Chiu, Huigan Lim

研究成果: Conference article同行評審

摘要

This paper proposes a perceptual modeling approach with a two-stage recognition to deal with the issues of recognition degradation in noisy environment. The auditory masking effect is used for speech enhancement and acoustic modeling in order to overcome the model inconsistencies between training speech and noisy input. In the two-stage recognition, the maximum a posteriori (MAP) based adaptation algorithm is used to incrementally adapt the noise model. In order to evaluate our proposed approach, a Mandarin keyword spotting system was constructed. The experimental results show our proposed method achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods for both in 70km/hr (10.3dB) and 90km/hr (6.4dB) car environments.

原文English
頁(從 - 到)I/385-I/388
期刊ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
1
出版狀態Published - 2002 七月 11
事件2002 IEEE International Conference on Acustics, Speech, and Signal Processing - Orlando, FL, United States
持續時間: 2002 五月 132002 五月 17

All Science Journal Classification (ASJC) codes

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「Perceptual speech modeling for noisy speech recognition」主題。共同形成了獨特的指紋。

引用此