An Hybrid Language Model for a Continuous Dictation Prototype - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 1997

An Hybrid Language Model for a Continuous Dictation Prototype

Résumé

This paper describes the combination of a stochastic language model and a formal grammar modeled such as a unification grammar. The stochastic model is trained over 42 million words extracted from Le Monde newspaper. The stochastic model is based on smoothed 3-gram and 3-class. The 3-class model is represented by a Markov chain made up of four states. Several experiments have been done to state which values are the best for specific training and test corpus. Experiments indicate that the unification grammar reduces strongly the number of hypothesis (sentences) produced by the stochastic model.
Fichier non déposé

Dates et versions

hal-01112905 , version 1 (03-02-2015)

Identifiants

  • HAL Id : hal-01112905 , version 1

Citer

Kamel Smaïli, Imed Zitouni, François Charpillet, Jean-Paul Haton. An Hybrid Language Model for a Continuous Dictation Prototype. 5th European Conference on Speech Communication and Technology, Sep 1997, Rhodes, Greece. ⟨hal-01112905⟩
438 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More