Proceedings Paper

Diversifying Dialog Generation via Adaptive Label Smoothing

Publisher

ASSOC COMPUTATIONAL LINGUISTICS-ACL

Keywords

-

Funding

  1. NSFC [61936010, 61876096]
  2. Guoqiang Institute of Tsinghua University [2019GQG1, 2020GQG0005]

The paper introduces an Adaptive Label Smoothing (AdaLabel) approach that adaptively estimates the target label distribution for different dialog contexts, improving the diversity of generated responses.
Neural dialogue generation models trained with the one-hot target distribution suffer from the over-confidence issue, which leads to poor generation diversity as widely reported in the literature. Although existing approaches such as label smoothing can alleviate this issue, they fail to adapt to diverse dialog contexts. In this paper, we propose an Adaptive Label Smoothing (AdaLabel) approach that can adaptively estimate a target label distribution at each time step for different contexts. The maximum probability in the predicted distribution is used to modify the soft target distribution produced by a novel light-weight bi-directional decoder module. The resulting target distribution is aware of both previous and future contexts and is adjusted to avoid over-training the dialogue model. Our model can be trained in an end-to-end manner. Extensive experiments on two benchmark datasets show that our approach outperforms various competitive baselines in producing diverse responses.
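
The mechanism the abstract outlines can be pictured as a confidence-dependent label-smoothing loss: the mass kept on the gold token tracks the model's current maximum predicted probability, and the remaining mass is distributed by an auxiliary bi-directional decoder instead of uniformly. The PyTorch sketch below illustrates this idea only; the names (adalabel_loss, aux_logits, eps_floor) and the exact form of the adaptive weight are assumptions for illustration, not the authors' implementation.

import torch
import torch.nn.functional as F

def adalabel_loss(logits, aux_logits, target, eps_floor=0.1):
    """Hypothetical sketch of an adaptive label-smoothing loss.

    logits:     (batch, vocab) scores from the main decoder
    aux_logits: (batch, vocab) scores from an auxiliary decoder that
                supplies the soft part of the target distribution
    target:     (batch,) gold token ids
    eps_floor:  assumed lower bound on the gold-token target mass
    """
    probs = F.softmax(logits, dim=-1)

    # Adaptive gold-token weight: the target mass on the gold token
    # tracks the model's current confidence (its max probability), so
    # an already-confident model is not pushed further toward the
    # one-hot extreme -- the "avoid over-training" behavior.
    p_max = probs.max(dim=-1).values.detach()
    eps = torch.clamp(p_max, min=eps_floor)            # (batch,)

    # Soft distribution over non-gold tokens from the auxiliary
    # decoder; zero out the gold token and renormalize so that eps
    # alone controls the gold token's mass.
    aux_probs = F.softmax(aux_logits, dim=-1).detach()
    aux_probs = aux_probs.scatter(1, target.unsqueeze(1), 0.0)
    aux_probs = aux_probs / (aux_probs.sum(dim=-1, keepdim=True) + 1e-12)

    # Mix: eps on the gold token, (1 - eps) spread by aux_probs.
    soft_target = (1.0 - eps).unsqueeze(1) * aux_probs
    soft_target = soft_target.scatter(1, target.unsqueeze(1), eps.unsqueeze(1))

    log_probs = F.log_softmax(logits, dim=-1)
    return -(soft_target * log_probs).sum(dim=-1).mean()

For comparison, vanilla label smoothing fixes eps and spreads the remaining (1 - eps) uniformly over the vocabulary; the sketch differs in making eps confidence-dependent and the remainder context-dependent, which is the gist of the adaptivity described in the abstract.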
