Emotion recognition of conversational affective speech using temporal course modeling-based error weighted cross-correlation model

Jen Chun Lin, Wen Li Wei, Chung Hsien Wu, Hsin Min Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A complete emotional expression in natural face-to-face conversation typically contains a complex temporal course. In this paper, we propose a temporal course modeling-based error weighted cross-correlation model (TCM-EWCCM) for speech emotion recognition. In TCM-EWCCM, a TCM-based cross-correlation model (CCM) is first used to not only model the temporal evolution of the extracted acoustic and prosodie features individually but also construct the statistical dependencies among paired acoustic-prosodic features in different emotional states. Then, a Bayesian classifier weighting scheme named error weighted classifier combination is adopted to explore the contributions of the individual TCM-based CCM classifiers for different acoustic-prosodic feature pairs to enhance the speech emotion recognition accuracy. The results of experiments on the NCKU-CASC corpus demonstrate that modeling the complex temporal structure and considering the statistical dependencies as well as contributions among paired features in natural conversation speech can indeed improve the speech emotion recognition performance.

Original languageEnglish
Title of host publication2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9786163618238
DOIs
Publication statusPublished - 2014 Feb 12
Event2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai, Thailand
Duration: 2014 Dec 92014 Dec 12

Publication series

Name2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Other

Other2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
CountryThailand
CityChiang Mai
Period14-12-0914-12-12

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Information Systems

Fingerprint Dive into the research topics of 'Emotion recognition of conversational affective speech using temporal course modeling-based error weighted cross-correlation model'. Together they form a unique fingerprint.

  • Cite this

    Lin, J. C., Wei, W. L., Wu, C. H., & Wang, H. M. (2014). Emotion recognition of conversational affective speech using temporal course modeling-based error weighted cross-correlation model. In 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 [7041621] (2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/APSIPA.2014.7041621