Morphosyntactic resources for automatic speech recognition

Stéphane Huet; Guillaume Gravier; Pascale Sébillot

Conference paper, Year: 2008

Stéphane Huet

Role: Author
PersonId : 10005
IdHAL : shuet
ORCID : 0000-0003-1838-3807
IdRef : 110355245

Multimedia content-based indexing

Guillaume Gravier

Role: Author
PersonId : 1046
IdHAL : guig
ORCID : 0000-0002-2266-5682
IdRef : 110355415

Multimedia content-based indexing

Pascale Sébillot

Role: Author
PersonId : 21840
IdHAL : pascale-sebillot
ORCID : 0000-0002-5429-4302
IdRef : 075988453

Multimedia content-based indexing

Abstract

Texts produced by automatic speech recognition (ASR) systems have specific characteristics, stemming from the idiosyncrasies of spoken language and from the way ASR systems work, that make them harder to exploit than conventional written texts. This paper studies the value of morphosyntactic information as a resource for ASR. We show that automatic methods can tag the output of ASR systems, reaching a tagging accuracy on automatic transcriptions similar to the 95-98% usually reported for written texts such as newspapers. We also show experimentally that tagging helps improve transcription quality when morphosyntactic information is used in a post-processing stage of speech decoding: in experiments on French broadcast news from the ESTER corpus, we obtain a significant decrease in word error rate, an improvement in sentence error rate, and the correction of a significant number of agreement errors.
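The abstract reports results as word error rate (WER) and sentence error rate (SER). As a reminder of how these two measures are conventionally defined, here is a minimal sketch, assuming whitespace tokenization and unit edit costs. It is not the scoring tool used in the paper; the function names `edit_distance` and `wer_and_ser` and the toy sentences are hypothetical and only illustrate the definitions.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences, with unit costs."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference word
                            curr[j - 1] + 1,      # insertion of a hypothesis word
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer_and_ser(pairs: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (WER, SER) for a list of (reference, hypothesis) sentences.

    WER = total word-level edit errors / total reference words.
    SER = sentences with at least one error / number of sentences.
    """
    total_errors = 0
    total_words = 0
    wrong_sentences = 0
    for ref, hyp in pairs:
        ref_tokens, hyp_tokens = ref.split(), hyp.split()
        errors = edit_distance(ref_tokens, hyp_tokens)
        total_errors += errors
        total_words += len(ref_tokens)
        wrong_sentences += 1 if errors > 0 else 0
    if total_words == 0 or not pairs:
        raise ValueError("need at least one non-empty reference sentence")
    return total_errors / total_words, wrong_sentences / len(pairs)


if __name__ == "__main__":
    # Toy data (invented): the first hypothesis contains a number-agreement
    # error ("enfant" instead of "enfants"), the second is correct.
    toy = [("les enfants jouent", "les enfant jouent"),
           ("il pleut", "il pleut")]
    wer, ser = wer_and_ser(toy)
    print(f"WER = {wer:.2f}, SER = {ser:.2f}")
```

In this toy case a single agreement error counts as one substitution among five reference words, and it makes one of the two sentences wrong. This shows why correcting agreement errors can affect both measures.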

Domains

Computation and Language [cs.CL]; Document and Text Processing

Main file

LREC08.pdf (158.65 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-02021879

Submitted on: Saturday, 16 February 2019, 21:07:51

Last modified on: Friday, 24 March 2023, 14:53:09

Dates and versions

hal-02021879 , version 1 (16-02-2019)

Identifiers

HAL Id : hal-02021879 , version 1

Cite

Stéphane Huet, Guillaume Gravier, Pascale Sébillot. Morphosyntactic resources for automatic speech recognition. 6th International Conference on Language Resources and Evaluation (LREC), 2008, Marrakech, Morocco. ⟨hal-02021879⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-INSA-R INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM
