TY - GEN
T1 - Variable-length unit selection using LSA-based syntactic structure cost
AU - Wu, Chung Hsien
AU - Hsia, Chi Chun
AU - Chen, Jiun Fu
AU - Liu, Te Hsien
PY - 2004
Y1 - 2004
N2 - This paper introduces a variable-length unit selection method based on LSA-based syntactic structure for concatenative speech synthesis. First, a probabilistic context free grammar (PCFG) based parser is used to construct the syntactic structure of the input text sentence. Second, the synthesizer selects the candidate units for each node of the syntactic structure. Latent Semantic Analysis (LSA) is then adopted to estimate the syntactic cost between the target unit and the candidate units in the database. Finally, the concatenation of units with minimum cost is selected using dynamic programming algorithm. Experimental results show that variable-length unit selection based on syntactic structure outperforms the synthesizer without considering syntactic structure. Also, the LSA-based syntactic cost provides better estimation of substitution cost than that calculated only from acoustic features.
AB - This paper introduces a variable-length unit selection method based on LSA-based syntactic structure for concatenative speech synthesis. First, a probabilistic context free grammar (PCFG) based parser is used to construct the syntactic structure of the input text sentence. Second, the synthesizer selects the candidate units for each node of the syntactic structure. Latent Semantic Analysis (LSA) is then adopted to estimate the syntactic cost between the target unit and the candidate units in the database. Finally, the concatenation of units with minimum cost is selected using dynamic programming algorithm. Experimental results show that variable-length unit selection based on syntactic structure outperforms the synthesizer without considering syntactic structure. Also, the LSA-based syntactic cost provides better estimation of substitution cost than that calculated only from acoustic features.
UR - http://www.scopus.com/inward/record.url?scp=21444449037&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=21444449037&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:21444449037
SN - 0780386787
SN - 9780780386785
T3 - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
SP - 201
EP - 204
BT - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
T2 - 2004 International Symposium on Chinese Spoken Language Processing
Y2 - 15 December 2004 through 18 December 2004
ER -