Acoustic and Textual Data Augmentation for Code-Switching Speech Recognition in Under-Resourced Language

I. Ting Hsieh, Chung Hsien Wu, Chun Huang Wang

研究成果: Conference contribution

4 引文 斯高帕斯(Scopus)

摘要

Under-resourced and code-switching speech recognition have recently received research interest, resulting in several robust acoustic and language modeling approaches. As Taiwanese and Mandarin have been popularly and widely used in Taiwan, this paper aims to address the under-resourced and codeswitching issues. First, phone sharing between Taiwanese and Mandarin is employed for acoustic data augmentation to construct the acoustic models of Taiwanese speech recognizer. Regarding the lack of Taiwanese text corpus, this paper translates Mandarin corpus into Taiwanese corpus based on word-to-word translation. Moreover, additional translation rules for codeswitching text are manually designed. The augmented text corpus is then used for training the code-switching language models. In the experimental results, the word error rate for code-switching speech recognition was 26.02%, which was better than that trained by the pure Taiwanese corpus.

原文English
主出版物標題2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面302-307
頁數6
ISBN(電子)9789881476883
出版狀態Published - 2020 12月 7
事件2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Virtual, Auckland, New Zealand
持續時間: 2020 12月 72020 12月 10

出版系列

名字2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings

Conference

Conference2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020
國家/地區New Zealand
城市Virtual, Auckland
期間20-12-0720-12-10

All Science Journal Classification (ASJC) codes

  • 人工智慧
  • 電腦網路與通信
  • 電腦視覺和模式識別
  • 硬體和架構
  • 訊號處理
  • 決策科學(雜項)
  • 儀器

指紋

深入研究「Acoustic and Textual Data Augmentation for Code-Switching Speech Recognition in Under-Resourced Language」主題。共同形成了獨特的指紋。

引用此