CECOS: A Chinese-English code-switching speech database

Han Ping Shen, Chung Hsien Wu, Yan Ting Yang, Chun Shan Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

18 Citations (Scopus)

Abstract

With the increase on the demands for code-switching automatic speech recognition (ASR), the design and development of a code-switching speech database becomes highly desirable. However, it is not easy to collect sufficient code-switched utterances for model training for code-switching ASR. This study presents the procedure and experience for the design and development of a Chinese-English COde-switching Speech database (CECOS). Two different methods for collecting Chinese-English code-switched utterances are employed in this work. The applications of the collected database are also introduced. The CECOS database not only contains the speech data with code-switch properties but also accents due to non-native speakers. This database can be applied to several applications, such as code-switching speech recognition, language identification, named entity detection, etc.

Original languageEnglish
Title of host publication2011 International Conference on Speech Database and Assessments, Oriental COCOSDA 2011 - Proceedings
Pages120-123
Number of pages4
DOIs
Publication statusPublished - 2011 Dec 20
Event14th Annual International Conference on Speech Database and Assessments, Oriental COCOSDA 2011 - Hsinchu, Taiwan
Duration: 2011 Oct 262011 Oct 28

Publication series

Name2011 International Conference on Speech Database and Assessments, Oriental COCOSDA 2011 - Proceedings

Other

Other14th Annual International Conference on Speech Database and Assessments, Oriental COCOSDA 2011
Country/TerritoryTaiwan
CityHsinchu
Period11-10-2611-10-28

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'CECOS: A Chinese-English code-switching speech database'. Together they form a unique fingerprint.

Cite this