Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy

Chung-Hsien Wu, Jen Chun Lin, Wen Li Wei

研究成果: Chapter

摘要

This chapter introduces the current data fusion strategies among audiovisual signals for bimodal emotion recognition. Face detection, in the chapter, is performed based on the adaboost cascade face detector and can be used to provide initial facial position and reduce the time for error convergence in feature extraction. In the chapter, active appearance model (AAM) is employed to extract the 68 labeled facial feature points (FPs) from 5 facial regions including eyebrow, eye, nose, mouth, and facial contours for later facial animation parameters (FAPs) calculation. Three kinds of primary prosodic features are adopted, including pitch, energy, and formants F1-F5 in each speech frame for emotion recognition. Finally, a semi-coupled hidden Markov model (SC-HMM) is proposed for emotion recognition based on state-based alignment strategy for audiovisual bimodal features.

原文English
主出版物標題Emotion Recognition
主出版物子標題A Pattern Analysis Approach
發行者wiley
頁面493-513
頁數21
ISBN(電子)9781118910566
ISBN(列印)9781118130667
DOIs
出版狀態Published - 2015 一月 2

指紋

Adaptive boosting
Data fusion
Hidden Markov models
Face recognition
Animation
Feature extraction
Detectors

All Science Journal Classification (ASJC) codes

  • Engineering(all)
  • Computer Science(all)

引用此文

Wu, C-H., Lin, J. C., & Wei, W. L. (2015). Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy. 於 Emotion Recognition: A Pattern Analysis Approach (頁 493-513). wiley. https://doi.org/10.1002/9781118910566.ch19
Wu, Chung-Hsien ; Lin, Jen Chun ; Wei, Wen Li. / Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy. Emotion Recognition: A Pattern Analysis Approach. wiley, 2015. 頁 493-513
@inbook{da68bf0d33154c6e80a7de3a79c367de,
title = "Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy",
abstract = "This chapter introduces the current data fusion strategies among audiovisual signals for bimodal emotion recognition. Face detection, in the chapter, is performed based on the adaboost cascade face detector and can be used to provide initial facial position and reduce the time for error convergence in feature extraction. In the chapter, active appearance model (AAM) is employed to extract the 68 labeled facial feature points (FPs) from 5 facial regions including eyebrow, eye, nose, mouth, and facial contours for later facial animation parameters (FAPs) calculation. Three kinds of primary prosodic features are adopted, including pitch, energy, and formants F1-F5 in each speech frame for emotion recognition. Finally, a semi-coupled hidden Markov model (SC-HMM) is proposed for emotion recognition based on state-based alignment strategy for audiovisual bimodal features.",
author = "Chung-Hsien Wu and Lin, {Jen Chun} and Wei, {Wen Li}",
year = "2015",
month = "1",
day = "2",
doi = "10.1002/9781118910566.ch19",
language = "English",
isbn = "9781118130667",
pages = "493--513",
booktitle = "Emotion Recognition",
publisher = "wiley",

}

Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy. / Wu, Chung-Hsien; Lin, Jen Chun; Wei, Wen Li.

Emotion Recognition: A Pattern Analysis Approach. wiley, 2015. p. 493-513.

研究成果: Chapter

TY - CHAP

T1 - Audiovisual Emotion Recognition Using Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy

AU - Wu, Chung-Hsien

AU - Lin, Jen Chun

AU - Wei, Wen Li

PY - 2015/1/2

Y1 - 2015/1/2

N2 - This chapter introduces the current data fusion strategies among audiovisual signals for bimodal emotion recognition. Face detection, in the chapter, is performed based on the adaboost cascade face detector and can be used to provide initial facial position and reduce the time for error convergence in feature extraction. In the chapter, active appearance model (AAM) is employed to extract the 68 labeled facial feature points (FPs) from 5 facial regions including eyebrow, eye, nose, mouth, and facial contours for later facial animation parameters (FAPs) calculation. Three kinds of primary prosodic features are adopted, including pitch, energy, and formants F1-F5 in each speech frame for emotion recognition. Finally, a semi-coupled hidden Markov model (SC-HMM) is proposed for emotion recognition based on state-based alignment strategy for audiovisual bimodal features.

AB - This chapter introduces the current data fusion strategies among audiovisual signals for bimodal emotion recognition. Face detection, in the chapter, is performed based on the adaboost cascade face detector and can be used to provide initial facial position and reduce the time for error convergence in feature extraction. In the chapter, active appearance model (AAM) is employed to extract the 68 labeled facial feature points (FPs) from 5 facial regions including eyebrow, eye, nose, mouth, and facial contours for later facial animation parameters (FAPs) calculation. Three kinds of primary prosodic features are adopted, including pitch, energy, and formants F1-F5 in each speech frame for emotion recognition. Finally, a semi-coupled hidden Markov model (SC-HMM) is proposed for emotion recognition based on state-based alignment strategy for audiovisual bimodal features.

UR - http://www.scopus.com/inward/record.url?scp=85016374157&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85016374157&partnerID=8YFLogxK

U2 - 10.1002/9781118910566.ch19

DO - 10.1002/9781118910566.ch19

M3 - Chapter

AN - SCOPUS:85016374157

SN - 9781118130667

SP - 493

EP - 513

BT - Emotion Recognition

PB - wiley

ER -