On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies

Jau Hung Chen, Chung Hsien Wu

研究成果: Paper

摘要

This study proposes a novel approach based on Bayesian networks and the LSP frequencies to generate syllable prosody and the coarticulation between two concatenated syllables respectively. The Bayesian network is employed to model the relation between the prosodic information and the linguistic features. Given a Chinese character sequence, the Bayesian network can provide appropriate prosodic information, including pitch contour, syllable intensity, syllable duration and pause duration. Furthermore, the coarticulation is generated by adjusting the LSP frequencies in a CELP-based synthesizer. The synthesized speech is tested on twenty subjects. The test results indicate that the average correct rate is 95.8% for intelligibility, and the mean opinion score (MOS) is 3.2 for naturalness.

原文English
頁面37-40
頁數4
出版狀態Published - 1996 十二月 1
事件Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2) - Perth, Aust
持續時間: 1996 十一月 261996 十一月 29

Other

OtherProceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2)
城市Perth, Aust
期間96-11-2696-11-29

指紋

Bayesian networks
Linguistics

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Electrical and Electronic Engineering

引用此文

Chen, J. H., & Wu, C. H. (1996). On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies. 37-40. 論文發表於 Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2), Perth, Aust, .
Chen, Jau Hung ; Wu, Chung Hsien. / On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies. 論文發表於 Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2), Perth, Aust, .4 p.
@conference{2a8bb5b53807460d90e53f9a1d5b0a11,
title = "On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies",
abstract = "This study proposes a novel approach based on Bayesian networks and the LSP frequencies to generate syllable prosody and the coarticulation between two concatenated syllables respectively. The Bayesian network is employed to model the relation between the prosodic information and the linguistic features. Given a Chinese character sequence, the Bayesian network can provide appropriate prosodic information, including pitch contour, syllable intensity, syllable duration and pause duration. Furthermore, the coarticulation is generated by adjusting the LSP frequencies in a CELP-based synthesizer. The synthesized speech is tested on twenty subjects. The test results indicate that the average correct rate is 95.8{\%} for intelligibility, and the mean opinion score (MOS) is 3.2 for naturalness.",
author = "Chen, {Jau Hung} and Wu, {Chung Hsien}",
year = "1996",
month = "12",
day = "1",
language = "English",
pages = "37--40",
note = "Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2) ; Conference date: 26-11-1996 Through 29-11-1996",

}

Chen, JH & Wu, CH 1996, 'On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies', 論文發表於 Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2), Perth, Aust, 96-11-26 - 96-11-29 頁 37-40.

On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies. / Chen, Jau Hung; Wu, Chung Hsien.

1996. 37-40 論文發表於 Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2), Perth, Aust, .

研究成果: Paper

TY - CONF

T1 - On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies

AU - Chen, Jau Hung

AU - Wu, Chung Hsien

PY - 1996/12/1

Y1 - 1996/12/1

N2 - This study proposes a novel approach based on Bayesian networks and the LSP frequencies to generate syllable prosody and the coarticulation between two concatenated syllables respectively. The Bayesian network is employed to model the relation between the prosodic information and the linguistic features. Given a Chinese character sequence, the Bayesian network can provide appropriate prosodic information, including pitch contour, syllable intensity, syllable duration and pause duration. Furthermore, the coarticulation is generated by adjusting the LSP frequencies in a CELP-based synthesizer. The synthesized speech is tested on twenty subjects. The test results indicate that the average correct rate is 95.8% for intelligibility, and the mean opinion score (MOS) is 3.2 for naturalness.

AB - This study proposes a novel approach based on Bayesian networks and the LSP frequencies to generate syllable prosody and the coarticulation between two concatenated syllables respectively. The Bayesian network is employed to model the relation between the prosodic information and the linguistic features. Given a Chinese character sequence, the Bayesian network can provide appropriate prosodic information, including pitch contour, syllable intensity, syllable duration and pause duration. Furthermore, the coarticulation is generated by adjusting the LSP frequencies in a CELP-based synthesizer. The synthesized speech is tested on twenty subjects. The test results indicate that the average correct rate is 95.8% for intelligibility, and the mean opinion score (MOS) is 3.2 for naturalness.

UR - http://www.scopus.com/inward/record.url?scp=0030311837&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030311837&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0030311837

SP - 37

EP - 40

ER -

Chen JH, Wu CH. On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies. 1996. 論文發表於 Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2), Perth, Aust, .