Token-level and sequence-level loss smoothing for RNN language models

Despite the effectiveness of recurrent neural network language models, their maximum likelihood estimation suffers from two limitations. First, it treats all sentences that do not match the ground truth as equally poor, ignoring the structure of the output space. Second, it suffers from "exposure bias": during training, tokens are predicted given ground-truth sequences, while at test time prediction is conditioned on generated output sequences. To overcome these limitations, we build upon the recent reward augmented maximum likelihood approach, i.e., sequence-level smoothing, which encourages the model to predict sentences close to the ground truth according to a given performance metric. We extend this approach to token-level loss smoothing and propose improvements to the sequence-level smoothing approach. Our experiments on two different tasks, image captioning and machine translation, show that token-level and sequence-level loss smoothing are complementary and significantly improve results.
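As a rough illustration of the token-level idea, the sketch below replaces the one-hot target of maximum likelihood with a distribution that also gives some weight to tokens similar to the ground truth. It is not taken from the paper: the similarity matrix, the temperature value, the three-word vocabulary, and all function names are assumptions made only for this example.

```python
import math

def log_softmax(scores):
    """Numerically stable log-softmax over a list of floats."""
    m = max(scores)
    lse = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - lse for s in scores]

def smoothed_target(similarities, temperature):
    """Turn similarity scores into a target distribution (softmax with temperature)."""
    return [math.exp(lp) for lp in log_softmax([s / temperature for s in similarities])]

def mle_token_loss(logits, target):
    """Standard maximum likelihood loss: only the ground-truth token counts."""
    return -log_softmax(logits)[target]

def smoothed_token_loss(logits, target, similarity, temperature=0.5):
    """Cross-entropy against the smoothed target instead of a one-hot target.

    similarity[i][j] is an assumed score of how close token j is to token i.
    """
    target_dist = smoothed_target(similarity[target], temperature)
    log_probs = log_softmax(logits)
    return -sum(q * lp for q, lp in zip(target_dist, log_probs))

# Hypothetical vocabulary and similarity scores, chosen only for illustration.
VOCAB = ["cat", "kitten", "car"]
SIMILARITY = [
    [1.0, 0.9, 0.1],
    [0.9, 1.0, 0.1],
    [0.1, 0.1, 1.0],
]
TARGET = VOCAB.index("cat")
NEAR_MISS_LOGITS = [0.0, 3.0, 0.0]  # model prefers "kitten"
FAR_MISS_LOGITS = [0.0, 0.0, 3.0]   # model prefers "car"

if __name__ == "__main__":
    for name, logits in [("near miss", NEAR_MISS_LOGITS), ("far miss", FAR_MISS_LOGITS)]:
        print(name,
              "MLE:", round(mle_token_loss(logits, TARGET), 3),
              "smoothed:", round(smoothed_token_loss(logits, TARGET, SIMILARITY), 3))
```

In this toy setting the maximum likelihood loss scores both wrong predictions the same, while the smoothed loss penalizes the prediction of a similar token less than the prediction of an unrelated one. As the temperature approaches zero, the smoothed target approaches the one-hot target and the two losses coincide.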

Domains

Computer Vision and Pattern Recognition [cs.CV]; Computation and Language [cs.CL]; Machine Learning [cs.LG]

Main file

paper.pdf (18.53 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01790879

Submitted on: Monday, May 14, 2018, 10:19:13 AM

Last modification on: Saturday, April 27, 2024, 3:09:45 AM

Long-term archiving on: Tuesday, September 25, 2018, 10:25:32 PM

Dates and versions

hal-01790879, version 1 (14-05-2018)

Identifiers

HAL Id: hal-01790879, version 1

Cite

Maha Elbayad, Laurent Besacier, Jakob Verbeek. Token-level and sequence-level loss smoothing for RNN language models. ACL - 56th Annual Meeting of the Association for Computational Linguistics, Jul 2018, Melbourne, Australia. pp.2094-2103. ⟨hal-01790879⟩
