Affective structure modeling of speech using probabilistic context free grammar for emotion recognition

Kun Yi Huang, Jia Kuan Lin, Yu Hsien Chiu, Chung Hsien Wu

研究成果: Conference contribution

3 引文 斯高帕斯(Scopus)

摘要

A complete emotional expression typically contains a complex temporal course in a natural conversation. Related research on utterance-level and segment-level processing lacks understanding of the underlying structure of emotional speech. In this study, a hierarchical affective structure of an emotional utterance characterized by the probabilistic context free grammars (PCFGs) is proposed for emotion modeling. SVM-based emotion profiles are obtained and employed to segment the utterance into emotionally consistent segments. Vector quantization is applied to convert the emotion profile of each segment into codewords. A binary tree in which each node represents a codeword is constructed to characterize the affective structure of the utterance modeled by PCFG. Given an input utterance, the output emotion is determined according to the PCFG-based emotion model with the highest likelihood of the speech segments along with the score of the affective structure. For evaluation, the EMO-DB database and its expansion in utterance length were conducted. Experimental results show that the proposed method achieved emotion recognition accuracy of 87.22% for long utterances and outperformed the SVM-based method.

原文English
主出版物標題2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面5286-5290
頁數5
ISBN(電子)9781467369978
DOIs
出版狀態Published - 2015 8月 4
事件40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, Australia
持續時間: 2014 4月 192014 4月 24

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2015-August
ISSN(列印)1520-6149

Other

Other40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
國家/地區Australia
城市Brisbane
期間14-04-1914-04-24

All Science Journal Classification (ASJC) codes

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「Affective structure modeling of speech using probabilistic context free grammar for emotion recognition」主題。共同形成了獨特的指紋。

引用此