An efficient pre-processing scheme to improve the sound source localization system in noisy environment

Sheng Chieh Lee, K. Bharanitharan, Bo Wei Chen, Jhing Fa Wang, Chung Hsien Wu, Min Jian Liao

Research output: Contribution to journal › Conference article › peer-review

1 Citation (Scopus)

Abstract

In this study, we introduce an efficient pre-processing scheme for direction of arrival (DOA) estimation that reduces the effects of noise and reverberation in speech sound source localization. The presented system is also suitable for far-field speech localization. The method is divided into three stages: linear phase-difference approximation, covariance matrix reconstruction, and frequency bin selection. The first two stages provide an initial reduction of the influence of noise and reverberation; the last stage filters out noisy frequency bands according to the eigenvalue decomposition (EVD) of the covariance matrix. The experimental results show that the proposed system is effective at detecting speech arriving from different directions. For speech signals at different signal-to-noise ratios (SNRs), the average estimation error is decreased by about 5 to 7.5 degrees.
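The abstract does not state the exact rule used in the frequency bin selection stage, so the following is only a minimal sketch of one plausible reading. It assumes a multichannel STFT as input and keeps the bins whose spatial covariance matrix is most dominated by a single eigenvalue. The function name `select_frequency_bins`, the `keep_ratio` parameter, and the dominance score are illustrative assumptions, not the authors' method.

```python
import numpy as np

def select_frequency_bins(stft, keep_ratio=0.5):
    """Pick frequency bins that look dominated by one coherent source.

    stft: complex array of shape (n_mics, n_bins, n_frames).
    keep_ratio: fraction of bins to keep (hypothetical parameter).
    Returns the sorted indices of the selected bins.
    """
    n_mics, n_bins, n_frames = stft.shape
    scores = np.empty(n_bins)
    for k in range(n_bins):
        x = stft[:, k, :]                    # (n_mics, n_frames) snapshots for bin k
        cov = x @ x.conj().T / n_frames      # Hermitian spatial covariance estimate
        eigvals = np.linalg.eigvalsh(cov)    # real eigenvalues in ascending order
        total = eigvals.sum()
        # Assumed criterion: share of energy carried by the largest eigenvalue.
        scores[k] = eigvals[-1] / total if total > 0 else 0.0
    n_keep = max(1, int(round(keep_ratio * n_bins)))
    return np.sort(np.argsort(scores)[-n_keep:])
```

Under this assumption, a bin containing mostly diffuse noise spreads its energy across all eigenvalues and scores low, while a bin dominated by a single directional source scores close to 1. A DOA estimator would then be run only on the selected bins.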

Original language: English
Pages (from-to): 2493-2496
Number of pages: 4
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication status: Published - 2011 Dec 1
Event: 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
Duration: 2011 Aug 27 to 2011 Aug 31

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation
