Generation of phonetic units for mixed-language speech recognition based on acoustic and contextual analysis

Chien Lin Huang, Chung Hsien Wu

Research output: Contribution to journalArticlepeer-review

22 Citations (Scopus)

Abstract

This work presents a novel approach to generating phonetic units in order to recognize mixed-language or multilingual speech. Acoustic and contextual analysis is performed to characterize multilingual phonetic units for phone set creation. Acoustic likelihood is utilized for similarity estimation of phone models. The hyperspace analog to language (HAL) model is adopted for contextual modeling and contextual similarity estimation. A confusion matrix combining acoustic and contextual similarities between every two phonetic units is built for phonetic unit clustering. The multidimensional scaling (MDS) method is applied to the confusion matrix for reducing dimensionality. Experimental results indicate that the created phonetic set provides a compact and robust set that considers acoustic and contextual information for mixed-language or multilingual speech recognition.

Original languageEnglish
Pages (from-to)1225-1233
Number of pages9
JournalIEEE Transactions on Computers
Volume56
Issue number9
DOIs
Publication statusPublished - 2007 Sep 1

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint Dive into the research topics of 'Generation of phonetic units for mixed-language speech recognition based on acoustic and contextual analysis'. Together they form a unique fingerprint.

Cite this