Automatic pronunciation clustering using a World English archive and pronunciation structure analysis

H. P. Shen, N. Minematsu, T. Makino, S. H. Weinberger, T. Pongkittiphan, Chung-Hsien Wu

研究成果: Conference contribution

3 引文 斯高帕斯(Scopus)

摘要

English is the only language available for global communication. Due to the influence of speakers' mother tongue, however, those from different regions inevitably have different accents in their pronunciation of English. The ultimate goal of our project is creating a global pronunciation map of World Englishes on an individual basis, for speakers to use to locate similar English pronunciations. If the speaker is a learner, he can also know how his pronunciation compares to other varieties. Creating the map mathematically requires a matrix of pronunciation distances among all the speakers considered. This paper investigates invariant pronunciation structure analysis and Support Vector Regression (SVR) to predict the inter-speaker pronunciation distances. In experiments, the Speech Accent Archive (SAA), which contains speech data of worldwide accented English, is used as training and testing samples. IPA narrow transcriptions in the archive are used to prepare reference pronunciation distances, which are then predicted based on structural analysis and SVR, not with IPA transcriptions. Correlation between the reference distances and the predicted distances is calculated. Experimental results show very promising results and our proposed method outperforms by far a baseline system developed using an HMM-based phoneme recognizer.

原文English
主出版物標題2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings
頁面222-227
頁數6
DOIs
出版狀態Published - 2013 十二月 1
事件2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Olomouc, Czech Republic
持續時間: 2013 十二月 82013 十二月 13

出版系列

名字2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings

Other

Other2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013
國家/地區Czech Republic
城市Olomouc
期間13-12-0813-12-13

All Science Journal Classification (ASJC) codes

  • 言語和聽力

指紋

深入研究「Automatic pronunciation clustering using a World English archive and pronunciation structure analysis」主題。共同形成了獨特的指紋。

引用此