End-to-end named entity and semantic concept extraction from speech

Named entity recognition (NER) is among SLU tasks that usually extract semantic information from textual documents. Until now, NER from speech is made through a pipeline process that consists in processing first an automatic speech recognition (ASR) on the audio and then processing a NER on the ASR outputs. Such approach has some disadvantages (error propagation, metric to tune ASR systems sub-optimal in regards to the final task, reduced space search at the ASR output level,...) and it is known that more integrated approaches outperform sequential ones, when they can be applied. In this paper, we explore an end-to-end approach that directly extracts named entities from speech, though a unique neural architecture. On a such way, a joint optimization is possible for both ASR and NER. Experiments are carried on French data easily accessible, composed of data distributed in several evaluation campaigns. The results are promising since this end-to-end approach provides similar results (F-measure=0.66 on test data) than a classical pipeline approach to detect named entity categories (F-measure=0.64). Last, we also explore this approach applied to semantic concept extraction , through a slot filling task known as a spoken language understanding problem.

Mots clés

End-to-end approach Automatic speech recognition Deep learning Named entity recognition Spoken language understanding Index Terms-End-to-end approach

Domaines

Informatique et langage [cs.CL] Intelligence artificielle [cs.AI]

Fichier principal

slt_e2e_enslu.pdf (755.67 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sahar Ghannay : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01987740

Soumis le : jeudi 25 juin 2020-19:31:58

Dernière modification le : jeudi 22 février 2024-11:38:40

Dates et versions

hal-01987740 , version 1 (21-01-2019)

hal-01987740 , version 2 (25-06-2020)

Identifiants

HAL Id : hal-01987740 , version 2

Citer

Sahar Ghannay, Antoine Caubrière, Yannick Estève, Nathalie Camelin, Edwin Simonnet, et al.. End-to-end named entity and semantic concept extraction from speech. IEEE Spoken Language Technology Workshop, Dec 2018, Athens, Greece. ⟨hal-01987740v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-NANTES INSTITUT-TELECOM UNIV-AVIGNON CNRS UNIV-LEMANS EC-NANTES UNAM LIUM LS2N LS2N-TALN LIA NANTES-UNIVERSITE

350 Consultations

1317 Téléchargements