Detection of mood disorder using speech emotion profiles and LSTM

Tsung Hsien Yang, Chung Hsien Wu, Kun Yi Huang, Ming Hsiang Su

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

In mood disorder diagnosis, bipolar disorder (BD) patients are often misdiagnosed as unipolar depression (UD) on initial presentation. It is crucial to establish an accurate distinction between BD and UD to make a correct and early diagnosis, leading to improvements in treatment and course of illness. To deal with this misdiagnosis problem, in this study, we experimented on eliciting subjects' emotions by watching six eliciting emotional video clips. After watching each video clips, their speech responses were collected when they were interviewing with a clinician. In mood disorder detection, speech emotions play an import role to detect manic or depressive symptoms. Therefore, speech emotion profiles (EP) are obtained by using the support vector machine (SVM) which are built via speech features adapted from selected databases using a denoising autoencoder-based method. Finally, a Long Short-Term Memory (LSTM) recurrent neural network is employed to characterize the temporal information of the EPs with respect to six emotional videos. Comparative experiments clearly show the promising advantage and efficacy of the LSTM-based approach for mood disorder detection.

Original languageEnglish
Title of host publicationProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
EditorsHsin-Min Wang, Qingzhi Hou, Yuan Wei, Tan Lee, Jianguo Wei, Lei Xie, Hui Feng, Jianwu Dang, Jianwu Dang
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509042937
DOIs
Publication statusPublished - 2017 May 2
Event10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 - Tianjin, China
Duration: 2016 Oct 172016 Oct 20

Publication series

NameProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016

Other

Other10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
CountryChina
CityTianjin
Period16-10-1716-10-20

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Detection of mood disorder using speech emotion profiles and LSTM'. Together they form a unique fingerprint.

  • Cite this

    Yang, T. H., Wu, C. H., Huang, K. Y., & Su, M. H. (2017). Detection of mood disorder using speech emotion profiles and LSTM. In H-M. Wang, Q. Hou, Y. Wei, T. Lee, J. Wei, L. Xie, H. Feng, J. Dang, & J. Dang (Eds.), Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 [7918439] (Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ISCSLP.2016.7918439