TY - JOUR
T1 - Interactional style detection for versatile dialogue response using prosodic and semantic features
AU - Liang, Wei Bin
AU - Wu, Chung Hsien
AU - Wang, Chih Hung
AU - Wang, Jhing Fa
PY - 2011
Y1 - 2011
N2 - This work presents an approach to interactional style (IS) detection for versatile responses in spoken dialogue systems (SDSs). Since speakers generally express their intents in different styles, the responses of an SDS should be versatile instead of invariable, planned responses. Moreover, the IS of dialogue turns can be affected by dialogue topics and speakers' emotional states. In this study, three base-level classifiers are employed for preliminary detection, including latent Dirichlet allocation for dialogue topic categorization, support vector machine for prosody-based emotional state identification and maximum entropy for semantic label-based emotional state identification. Finally, an artificial neural network is adopted for IS detection considering the scores estimated from the aforementioned classifiers. To evaluate the proposed approach, an SDS in a chatting domain was constructed for evaluation. The performance of IS detection can achieve 82.67%.
AB - This work presents an approach to interactional style (IS) detection for versatile responses in spoken dialogue systems (SDSs). Since speakers generally express their intents in different styles, the responses of an SDS should be versatile instead of invariable, planned responses. Moreover, the IS of dialogue turns can be affected by dialogue topics and speakers' emotional states. In this study, three base-level classifiers are employed for preliminary detection, including latent Dirichlet allocation for dialogue topic categorization, support vector machine for prosody-based emotional state identification and maximum entropy for semantic label-based emotional state identification. Finally, an artificial neural network is adopted for IS detection considering the scores estimated from the aforementioned classifiers. To evaluate the proposed approach, an SDS in a chatting domain was constructed for evaluation. The performance of IS detection can achieve 82.67%.
UR - http://www.scopus.com/inward/record.url?scp=84865757806&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84865757806&partnerID=8YFLogxK
U2 - 10.21437/interspeech.2011-445
DO - 10.21437/interspeech.2011-445
M3 - Conference article
AN - SCOPUS:84865757806
SN - 2308-457X
SP - 1345
EP - 1348
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
Y2 - 27 August 2011 through 31 August 2011
ER -