Incorporating Named Entity Recognition into the Speech Transcription Process

Mohamed Hatmi; Christine Jacquin; Emmanuel Morin; Sylvain Meignier

Communication Dans Un Congrès Année : 2013

Incorporating Named Entity Recognition into the Speech Transcription Process

(1, 2) , (1) , (1) , (2)

1
2

Mohamed Hatmi

Fonction : Auteur

Laboratoire d'Informatique de Nantes Atlantique

Laboratoire d'Informatique de l'Université du Mans

Christine Jacquin

Fonction : Auteur
PersonId : 4167
IdHAL : christine-jacquin

Laboratoire d'Informatique de Nantes Atlantique

Emmanuel Morin

Fonction : Auteur
PersonId : 3632
IdHAL : emmanuel-morin
ORCID : 0000-0001-8208-7039
IdRef : 14379373X

Laboratoire d'Informatique de Nantes Atlantique

Sylvain Meignier

Fonction : Auteur
PersonId : 11674
IdHAL : sylvain-meignier
ORCID : 0000-0001-7687-073X
IdRef : 182269086

Laboratoire d'Informatique de l'Université du Mans

Résumé

Named Entity Recognition (NER) from speech usually involves two sequential steps: transcribing the speech using Automatic Speech Recognition (ASR) and annotating the outputs of the ASR process using NER techniques. Recognizing named entities in automatic transcripts is difficult due to the presence of transcription errors and the absence of some important NER clues, such as capitalization and punctuation. In this paper, we describe a methodology for speech NER which consists of incorporating NER into the ASR process so that the ASR system generates transcripts annotated with named entities. The combination is achieved by adapting ASR language models and pre-annotating the pronunciation dictionary. We evaluate this method on ESTER 2 corpus, and show significant improvements over traditional approaches.

Mots clés

Named Entity Recognition Automatic Speech Recognition language modeling ASR vocabulary

Domaines

Informatique et langage [cs.CL]

Fichier principal

i13_3732.pdf (371.56 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

sylvain meignier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01433438

Soumis le : samedi 1 avril 2017-00:56:50

Dernière modification le : vendredi 5 janvier 2024-03:23:22

Archivage à long terme le : dimanche 2 juillet 2017-12:14:30

Dates et versions

hal-01433438 , version 1 (01-04-2017)

Identifiants

HAL Id : hal-01433438 , version 1

Citer

Mohamed Hatmi, Christine Jacquin, Emmanuel Morin, Sylvain Meignier. Incorporating Named Entity Recognition into the Speech Transcription Process. Interspeech, 2013, Lyon, France. ⟨hal-01433438⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-NANTES CNRS UNIV-LEMANS LINA LINA-TALN LIUM LIUM-LST LS2N NANTES-UNIVERSITE

394 Consultations

69 Téléchargements

Incorporating Named Entity Recognition into the Speech Transcription Process

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager