Novel binaural spectro-temporal algorithm for speech enhancement in low SNR environments

Po Hsun Sung, Bo Wei Chen, Ling Sheng Jang, Jhing Fa Wang

Research output: Contribution to journalConference articlepeer-review

Abstract

A novel BInaural Spectro-Temporal (BIST) algorithm is proposed in this paper to increase the speech intelligibility in low or negative SNR noisy environments. The BIST algorithm consists of two modules. One is the spatial mask for receiving sound from the specific direction, and the other is the spectro-temporal modulation filter for noise reduction. Most speech enhancement algorithms are not applicable in harsh environments because the energy of speech is covered by the noise. To increase the speech intelligibility in low or negative SNR noisy environments, a distinctive approach is proposed to solve this problem. First, the BIST algorithm takes binaural auditory processing as a spatial mask to separate the speech and noise according to their locations. Next, the modulation filter is applied to reduce the noise source in the scale-rate (spectro-temporal modulation) domain according to their different acoustic feature. It works like the spectro-temporal receptive field (STRF) which is the perception response of human auditory cortex. The experimental results demonstrate that the proposed BIST speech enhancement algorithm can improve 20% from the noisy speech at SNR-10dB.

Original languageEnglish
Article number6298537
Pages (from-to)1021-1026
Number of pages6
JournalProceedings - IEEE International Conference on Multimedia and Expo
DOIs
Publication statusPublished - 2012 Nov 5
Event2012 13th IEEE International Conference on Multimedia and Expo, ICME 2012 - Melbourne, VIC, Australia
Duration: 2012 Jul 92012 Jul 13

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Novel binaural spectro-temporal algorithm for speech enhancement in low SNR environments'. Together they form a unique fingerprint.

Cite this