TY - JOUR
T1 - Empathetic Response Generation Based on Plug-and-Play Mechanism With Empathy Perturbation
AU - Hsu, Jia Hao
AU - Chang, Jeremy
AU - Kuo, Min Hsueh
AU - Wu, Chung Hsien
N1 - Funding Information:
This work was supported by National Science and Technology Council, Taiwan, under Grant 111-2221-E-006-150-MY3
Publisher Copyright:
© 2014 IEEE.
PY - 2023
Y1 - 2023
N2 - Spoken dialogue systems have rapidly developed but are often viewed as inhumane because they lack empathetic communication skills. In this study, a transformer-based language model (DialoGPT fine-tuned on the EmpatheticDialogues dataset) was combined with two proposed attribute models for affective and cognitive empathy to improve its performance. The affective empathy model ensures that the user sentence and system response have similar emotional valence, and the cognitive empathy model ensures that the system response is relevant to the user's input by using a DialoGPT-based reverse generation model to calculate the cross-entropy loss. A plug-and-play structure with these empathy attribute models was used to perturb the language generation model to increase response empathy without fine-tuning or retraining the generation model. Experiments indicated that the proposed model responses had substantially higher affective empathy, cognitive empathy, and BLEU scores than did the baseline model. Subjective evaluations also indicated that the responses of the proposed model had greater empathy, relevance, and fluency than did the baseline model. Moreover, the proposed model outperformed other similar models in A/B tests.
AB - Spoken dialogue systems have rapidly developed but are often viewed as inhumane because they lack empathetic communication skills. In this study, a transformer-based language model (DialoGPT fine-tuned on the EmpatheticDialogues dataset) was combined with two proposed attribute models for affective and cognitive empathy to improve its performance. The affective empathy model ensures that the user sentence and system response have similar emotional valence, and the cognitive empathy model ensures that the system response is relevant to the user's input by using a DialoGPT-based reverse generation model to calculate the cross-entropy loss. A plug-and-play structure with these empathy attribute models was used to perturb the language generation model to increase response empathy without fine-tuning or retraining the generation model. Experiments indicated that the proposed model responses had substantially higher affective empathy, cognitive empathy, and BLEU scores than did the baseline model. Subjective evaluations also indicated that the responses of the proposed model had greater empathy, relevance, and fluency than did the baseline model. Moreover, the proposed model outperformed other similar models in A/B tests.
UR - http://www.scopus.com/inward/record.url?scp=85160256273&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85160256273&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2023.3277274
DO - 10.1109/TASLP.2023.3277274
M3 - Article
AN - SCOPUS:85160256273
SN - 2329-9290
VL - 31
SP - 2032
EP - 2042
JO - IEEE/ACM Transactions on Audio Speech and Language Processing
JF - IEEE/ACM Transactions on Audio Speech and Language Processing
ER -