TY - CONF
T1 - Using Chi-square testing in modeling confusion characteristics for robust phonetic set generation
AU - Chen, Yeou Jiunn
AU - Wu, Chung Hsien
N1 - Funding Information:
The authors would like to thank the National Science Council, R.O.C., for its financial support of this work, under Contract No. NSC89-2614-H-006-004-F20. The paper is also a partial result of Project 3XS1B11 conducted by ITRI under sponsorship of the Ministry of Economic Affairs, R.O.C.
Publisher Copyright:
© 2001 Proceedings of the 14th Conference on Computational Linguistics and Speech Processing, ROCLING 2001. All rights reserved.
PY - 2001
Y1 - 2001
N2 - A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. To obtain a better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly used in speech recognition. However, this representation generally increases the number of recognition units. A phonetic representation with smaller phonetic units, such as SAMPA-C for Mandarin Chinese, can be applied to reduce the number of recognition units. Nevertheless, smaller phonetic units such as SAMPA-C will contain confusion characteristics and generally degrade the recognition performance. In this paper, a statistical method based on chi-square testing is used to investigate the confusion characteristics among phonetic units and develop a more reliable phonetic set, named modified SAMPA-C. Finally, experiments on continuous Mandarin telephone speech recognition were conducted. Experimental results show that an encouraging improvement in recognition performance can be obtained. In addition, the proposed approaches represent a good compromise between the demands of accurate acoustic modeling.
AB - A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. To obtain a better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly used in speech recognition. However, this representation generally increases the number of recognition units. A phonetic representation with smaller phonetic units, such as SAMPA-C for Mandarin Chinese, can be applied to reduce the number of recognition units. Nevertheless, smaller phonetic units such as SAMPA-C will contain confusion characteristics and generally degrade the recognition performance. In this paper, a statistical method based on chi-square testing is used to investigate the confusion characteristics among phonetic units and develop a more reliable phonetic set, named modified SAMPA-C. Finally, experiments on continuous Mandarin telephone speech recognition were conducted. Experimental results show that an encouraging improvement in recognition performance can be obtained. In addition, the proposed approaches represent a good compromise between the demands of accurate acoustic modeling.
UR - http://www.scopus.com/inward/record.url?scp=85121348630&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121348630&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:85121348630
ER -