Conference papers

Where are we in Named Entity Recognition from Speech?

Abstract : Named entity recognition (NER) from speech is usually performed through a pipeline that consists of (i) processing the audio with an automatic speech recognition (ASR) system and (ii) applying a NER system to the ASR outputs. The latest data available for named entity extraction from speech in French were produced during the ETAPE evaluation campaign in 2012. Since the publication of the ETAPE campaign results, major improvements have been made to NER and ASR systems, especially with the development of neural approaches for both of these components. In addition, recent studies have shown the capability of end-to-end (E2E) approaches for NER / SLU tasks. In this paper, we study the improvements made in speech recognition and named entity recognition for pipeline approaches. For this type of system, we propose an original 3-pass approach. We also explore the capability of an E2E system to perform structured NER. Finally, we compare the performance of the ETAPE systems (state-of-the-art systems in 2012) with the performance obtained using current technologies. The results show the promise of the E2E approach, which nevertheless remains below an updated pipeline approach.
Document type : Conference papers

Contributor : Antoine Caubrière
Submitted on : Tuesday, February 11, 2020 - 5:30:32 PM
Last modification on : Friday, May 6, 2022 - 3:46:06 AM


  • HAL Id : hal-02475026, version 1


Antoine Caubrière, Sophie Rosset, Yannick Estève, Antoine Laurent, Emmanuel Morin. Where are we in Named Entity Recognition from Speech? 12th International Conference on Language Resources and Evaluation (LREC), May 2020, Marseille, France. ⟨hal-02475026⟩


