Language boundary detection and identification of mixed-language speech based on map estimation

Chi Jiun Shia, Yu Hsien Chiu, Jia Hsin Hsieh, Chung Hsien Wu

Research output: Contribution to journalConference article

14 Citations (Scopus)

Abstract

This paper proposes a Maximum a Posteriori (MAP) based approach to jointly segment and identify an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bi-gram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.

Original languageEnglish
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
Publication statusPublished - 2004 Sep 28
EventProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: 2004 May 172004 May 21

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

@article{20dde400097c4f4ea94f872fb4fb1de8,
title = "Language boundary detection and identification of mixed-language speech based on map estimation",
abstract = "This paper proposes a Maximum a Posteriori (MAP) based approach to jointly segment and identify an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bi-gram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.",
author = "Shia, {Chi Jiun} and Chiu, {Yu Hsien} and Hsieh, {Jia Hsin} and Wu, {Chung Hsien}",
year = "2004",
month = "9",
day = "28",
language = "English",
volume = "1",
journal = "Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
issn = "0736-7791",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Language boundary detection and identification of mixed-language speech based on map estimation

AU - Shia, Chi Jiun

AU - Chiu, Yu Hsien

AU - Hsieh, Jia Hsin

AU - Wu, Chung Hsien

PY - 2004/9/28

Y1 - 2004/9/28

N2 - This paper proposes a Maximum a Posteriori (MAP) based approach to jointly segment and identify an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bi-gram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.

AB - This paper proposes a Maximum a Posteriori (MAP) based approach to jointly segment and identify an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bi-gram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.

UR - http://www.scopus.com/inward/record.url?scp=4544379390&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=4544379390&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:4544379390

VL - 1

JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

SN - 0736-7791

ER -