On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies

Jau Hung Chen, Chung Hsien Wu

Research output: Contribution to conference › Paper › peer-review

Abstract

This study proposes a novel approach that uses Bayesian networks to generate syllable prosody and line spectrum pair (LSP) frequencies to generate the coarticulation between two concatenated syllables. The Bayesian network models the relation between prosodic information and linguistic features. Given a Chinese character sequence, it provides appropriate prosodic information, including pitch contour, syllable intensity, syllable duration, and pause duration. Coarticulation is generated by adjusting the LSP frequencies in a code-excited linear prediction (CELP) synthesizer. The synthesized speech was evaluated by twenty subjects: the average correct rate was 95.8% for intelligibility, and the mean opinion score (MOS) was 3.2 for naturalness.
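The abstract does not say how the LSP frequencies are adjusted at a syllable boundary. As an illustration only, the following minimal sketch assumes one common approach, linear interpolation of LSP vectors across the frames on either side of the boundary. It is not the authors' method. The function name `smooth_lsp_boundary`, the parameter `n_transition`, and the sample LSP values are all hypothetical.

```python
import numpy as np

def smooth_lsp_boundary(left, right, n_transition=3):
    """Blend LSP frames across a syllable boundary (illustrative assumption).

    left, right: arrays of shape (frames, order), each row an ascending
    LSP vector in radians. The last n frames of `left` and the first n
    frames of `right` are replaced by a linear ramp between two anchor
    frames, so the spectrum changes gradually instead of jumping.
    """
    left = np.asarray(left, dtype=float).copy()
    right = np.asarray(right, dtype=float).copy()
    n = min(n_transition, len(left), len(right))
    if n == 0:
        return left, right
    a = left[-n].copy()        # anchor: first frame of the transition region
    b = right[n - 1].copy()    # anchor: last frame of the transition region
    # Weights run from 0 to 1 over the 2n transition frames.
    w = np.linspace(0.0, 1.0, 2 * n)[:, None]
    ramp = (1.0 - w) * a + w * b   # shape (2n, order)
    left[-n:] = ramp[:n]
    right[:n] = ramp[n:]
    return left, right

# Hypothetical 4th-order LSP tracks for two concatenated syllables.
left = np.tile([0.3, 0.9, 1.6, 2.4], (5, 1))
right = np.tile([0.5, 1.1, 1.9, 2.7], (5, 1))
new_left, new_right = smooth_lsp_boundary(left, right, n_transition=2)
```

Interpolation is convenient for LSP parameters because a weighted average of two ascending LSP vectors is itself ascending, so each blended frame still corresponds to a stable synthesis filter.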

Original language: English
Pages: 37-40
Number of pages: 4
Publication status: Published - 1996 Dec 1
Event: Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2) - Perth, Aust
Duration: 1996 Nov 26 to 1996 Nov 29

Other

Other: Proceedings of the 1996 IEEE Region 10 TENCON - Digital Signal Processing Applications Conference. Part 2 (of 2)
City: Perth, Aust
Period: 96-11-26 to 96-11-29

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Electrical and Electronic Engineering
