Imitation learning for sentence generation with dilated convolutions using adversarial training

Jian Wei Peng, Min Chun Hu, Chuan Wang Chang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this work, we consider the sentence generation problem as an imitation learning problem, which aims to learn a policy to mimic the expert. Recent works have showed that adversarial learning can be applied to imitation learning problems. However, it has been indicated that the reward signal from the discriminator is not robust in reinforcement learning (RL) based generative adversarial network (GAN), and estimating state-action value is usually computationally intractable. To deal with this problem, we propose to use two discriminators to provide two different reward signals for constructing a more general imitation learning framework that can be used for sequence generation. Monte Carlo (MC) rollout is therefore not necessary to make our algorithm computationally tractable for generating long sequences. Furthermore, our policy and discriminator networks are integrated by sharing another state encoder network constructed based on dilated convolutions instead of recurrent neural networks (RNNs). In our experiment, we show that the two reward signals control the trade-off between the quality and the diversity of the output sequences.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages435-440
Number of pages6
ISBN (Electronic)9781538692141
DOIs
Publication statusPublished - 2019 Jul
Event2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019 - Shanghai, China
Duration: 2019 Jul 82019 Jul 12

Publication series

NameProceedings - 2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019

Conference

Conference2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019
CountryChina
CityShanghai
Period19-07-0819-07-12

All Science Journal Classification (ASJC) codes

  • Media Technology
  • Computer Vision and Pattern Recognition

Fingerprint Dive into the research topics of 'Imitation learning for sentence generation with dilated convolutions using adversarial training'. Together they form a unique fingerprint.

Cite this