Survey on evaluation methods for dialogue systems

Jan Deriu; Alvaro Rodrigo; Arantxa Otegi; Guillermo Echegoyen; Sophie Rosset; Eneko Agirre; Mark Cieliebak

doi:10.1007/s10462-020-09866-x

Article Dans Une Revue Artificial Intelligence Review Année : 2020

Survey on evaluation methods for dialogue systems

, , , , (1) , ,

Jan Deriu

Fonction : Auteur

Alvaro Rodrigo

Fonction : Auteur

Arantxa Otegi

Fonction : Auteur

Guillermo Echegoyen

Fonction : Auteur

Sophie Rosset

Fonction : Auteur
PersonId : 14913
IdHAL : sophie-rosset
ORCID : 0000-0002-6865-4989
IdRef : 137157835

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Eneko Agirre

Fonction : Auteur

Mark Cieliebak

Fonction : Auteur

Résumé

In this paper, we survey the methods and concepts developed for the evaluation of dialogue systems. Evaluation, in and of itself, is a crucial part during the development process. Often, dialogue systems are evaluated by means of human evaluations and questionnaires. However, this tends to be very cost- and time-intensive. Thus, much work has been put into finding methods which allow a reduction in involvement of human labour. In this survey, we present the main concepts and methods. For this, we differentiate between the various classes of dialogue systems (task-oriented, conversational, and question-answering dialogue systems). We cover each class by introducing the main technologies developed for the dialogue systems and then present the evaluation methods regarding that class.

Mots clés

Dialogue systems Â· Evaluation metrics Â· Discourse model Â· Conversational AI Â· Chatbots

Domaines

Informatique [cs] Informatique et langage [cs.CL]

Fichier principal

s10462-020-09866-x.pdf (1.96 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Sophie Rosset : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03006231

Soumis le : lundi 18 décembre 2023-10:24:54

Dernière modification le : mercredi 7 février 2024-03:34:54

Dates et versions

hal-03006231 , version 1 (18-12-2023)

Identifiants

HAL Id : hal-03006231 , version 1
DOI : 10.1007/s10462-020-09866-x
PUBMEDCENTRAL : PMC7817575

Citer

Jan Deriu, Alvaro Rodrigo, Arantxa Otegi, Guillermo Echegoyen, Sophie Rosset, et al.. Survey on evaluation methods for dialogue systems. Artificial Intelligence Review, 2020, 54 (1), pp.755-810. ⟨10.1007/s10462-020-09866-x⟩. ⟨hal-03006231⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN GS-ENGINEERING GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT

132 Consultations

12 Téléchargements

Survey on evaluation methods for dialogue systems

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager