
Efficient Wait-k Models for Simultaneous Machine Translation

Abstract: Simultaneous machine translation consists of starting output generation before the entire input sequence is available. Wait-k decoders offer a simple yet efficient approach to this problem: they first read k source tokens, after which they alternate between producing a target token and reading another source token. We investigate the behavior of wait-k decoding in low-resource settings for spoken corpora using IWSLT datasets. We improve the training of these models using unidirectional encoders and training across multiple values of k. Experiments with Transformer and 2D-convolutional architectures show that our wait-k models generalize well across a wide range of latency levels. We also show that the 2D-convolutional architecture is competitive with Transformers for simultaneous translation of spoken language.
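The wait-k policy described in the abstract (read k source tokens, then alternate between writing one target token and reading one source token) can be sketched as a simple decoding loop. This is an illustrative simulation, not the authors' implementation: `predict_next` is a hypothetical stand-in for the model's next-token prediction given the source prefix read so far.

```python
def wait_k_decode(source_tokens, k, predict_next, eos="<eos>"):
    """Simulate wait-k simultaneous decoding.

    predict_next(source_prefix, target_so_far) -> next target token;
    it may only condition on the source tokens read so far.
    """
    target = []
    read = min(k, len(source_tokens))  # initial phase: read k source tokens
    while True:
        token = predict_next(source_tokens[:read], target)
        if token == eos:
            break
        target.append(token)  # WRITE one target token
        if read < len(source_tokens):
            read += 1  # READ one more source token (the alternation)
    return target
```

With k equal to the source length this reduces to ordinary full-sentence decoding; with small k the model must commit to target tokens before seeing the whole input, which is the latency/quality trade-off the paper studies.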
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-02962195
Contributor: Laurent Besacier
Submitted on: Friday, October 9, 2020 - 9:04:41 AM
Last modification on: Friday, March 26, 2021 - 9:29:56 AM
Long-term archiving on: Sunday, January 10, 2021 - 6:14:18 PM


Citation

Maha Elbayad, Laurent Besacier, Jakob Verbeek. Efficient Wait-k Models for Simultaneous Machine Translation. Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shanghai (Virtual Conf), China. pp.1461--1465, ⟨10.21437/Interspeech.2020-1241⟩. ⟨hal-02962195⟩
