Skip to Main content Skip to Navigation
Conference papers

Using the World Wide Web for Learning New Words in Continuous Speech Recognition Tasks: Two Case Studies

Abstract : In this paper, Web-based lexicon augmentation is addressed: using various strategies for Out-Of-Vocabulary (OOV) word learning, we discuss their relevance in two types of applications: broadcast news, or topic-specific corpora transcription. The Web-based OOV word learning is first tested on the French news corpus ESTER; the same approach is applied to a very specific corpus concerning surgical interventions. These tests allow us to assess the value of the OOV word learning methods proposed, emphasizing their strengths and weaknesses, regarding the particular type of application considered.
Document type :
Conference papers
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-01319868
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon Connect in order to contact the contributor
Submitted on : Monday, May 23, 2016 - 10:31:38 AM
Last modification on : Tuesday, January 14, 2020 - 10:38:06 AM

Identifiers

  • HAL Id : hal-01319868, version 1

Collections

Citation

Stanislas Oger, Vladimir Popescu, Georges Linarès. Using the World Wide Web for Learning New Words in Continuous Speech Recognition Tasks: Two Case Studies. SPECOM'2009, Jun 2009, St Petersbourg, Russia. ⟨hal-01319868⟩

Share

Metrics

Les métriques sont temporairement indisponibles