An efficient pre-processing scheme to improve the sound source localization system in noisy environment

Sheng Chieh Lee, K. Bharanitharan, Bo Wei Chen, Jhing Fa Wang, Chung Hsien Wu, Min Jian Liao

研究成果: Conference article

1 引文 斯高帕斯(Scopus)


In this study, we introduce an efficient pre-processing scheme for direction of arrival (DOA) estimation, which is capable of reducing the noise and reverberation effects in speech sound source localization. Furthermore, this presented system is also suitable for far-field speech localization. The adopted method of this proposed system can be simply subdivided into three stages: Linear phase-difference approximation, covariance matrix reconstruction, and frequency bin selection. The first two stages can initially decrease the influences of noise and reverberation; the last stage is used to filter the noise frequency bands according to the eigenvalue decomposition (EVD) of the covariance matrix. The experimental results show that our proposed system has effective performance of detecting different directions of speeches. For different signalto- noise ratios (SNRs) speech signals, the average estimation errors can be decreased by about 5 to 7.5 degrees.

頁(從 - 到)2493-2496
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
出版狀態Published - 2011 十二月 1
事件12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
持續時間: 2011 八月 272011 八月 31


All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation