A Visual Analytics Approach to Finding Factors Improving Automatic Speaker Identifications

Pierrick Bruneau; Mickaël Stefas; Hervé Bredin; Johann Poignant; Thomas Tamisier; Claude Barras

Conference paper, Year: 2015

Pierrick Bruneau

Role: Author

Mickaël Stefas

Role: Author

Hervé Bredin

Role: Author
PersonId : 15856
IdHAL : hbredin
ORCID : 0000-0002-3739-925X
IdRef : 121165779

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Johann Poignant

Role: Author
PersonId : 1027647

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Thomas Tamisier

Role: Author

Claude Barras

Role: Author
PersonId : 17217
IdHAL : claude-barras
IdRef : 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

Classification quality criteria such as precision, recall, and F-measure are generally the basis for evaluating contributions in automatic speaker recognition. Specifically, comparisons are mostly carried out via mean values estimated on a set of media. While this approach is relevant for assessing improvement with respect to the state of the art, or for ranking participants in an automatic annotation challenge, it gives system designers little insight in terms of cues for improving algorithms, hypothesis formulation, and evidence display. This paper presents a design study of a visual and interactive approach to analyzing errors made by automatic annotation algorithms. A timeline-based tool emerged from prior steps of this study. A critical review, driven by user interviews, exposes caveats and refines user objectives. The next step of the study is then initiated by sketching designs that combine elements of the current prototype with principles newly identified as relevant.
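As an illustration of the point made in the abstract, the following minimal sketch computes precision, recall, and F-measure per medium and then their mean. The medium names and counts are hypothetical and are not taken from the paper; the function `precision_recall_f` is an illustrative helper, not part of the authors' tool.

```python
from statistics import mean


def precision_recall_f(tp, fp, fn):
    """Return (precision, recall, F-measure) from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


# Hypothetical (true positives, false positives, false negatives) per medium.
counts = {
    "video_a": (90, 10, 10),
    "video_b": (20, 30, 60),
    "video_c": (80, 20, 20),
}

# F-measure of each medium, then the single mean value usually reported.
per_medium_f = {name: precision_recall_f(*c)[2] for name, c in counts.items()}
mean_f = mean(per_medium_f.values())

# The mean alone does not reveal which medium drags the score down.
worst = min(per_medium_f, key=per_medium_f.get)

print(f"mean F-measure: {mean_f:.3f}")
for name, f in per_medium_f.items():
    print(f"{name}: {f:.3f}")
print(f"weakest medium: {worst}")
```

In this made-up example, one medium scores far below the other two, which the mean value alone would not show; this is the kind of per-medium detail that an error analysis tool is meant to surface.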

Keywords

Speaker identification; Visual Analytics

Domains

Computer Science [cs]; Computation and Language [cs.CL]

Contributor: Limsi Publications

https://hal.science/hal-01836455

Submitted on: Thursday, July 12, 2018, 12:35:19

Last modified on: Saturday, October 7, 2023, 21:36:20

Dates and versions

hal-01836455 , version 1 (12-07-2018)

Identifiers

HAL Id : hal-01836455 , version 1

Cite

Pierrick Bruneau, Mickaël Stefas, Hervé Bredin, Johann Poignant, Thomas Tamisier, et al. A Visual Analytics Approach to Finding Factors Improving Automatic Speaker Identifications. International Conference on Multimodal Interaction, Jan 2015, Seattle, United States. ⟨hal-01836455⟩

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN GS-ENGINEERING GS-COMPUTER-SCIENCE
