J. Kunzmann, K. Choukri, E. Jahnke, A. Kiessling, K. Knill et al., Portability of automatic speech recognition technology to new languages: Multilinguality issues and speech/text resources, ASRU, 2001.

D. Vaufreydaz, C. Bergamini, J. F. Serignat, L. Besacier, and M. Akbar, A new methodology for speech corpora definition from internet documents, LREC, pp.423-426, 2000.
URL : https://hal.archives-ouvertes.fr/inria-00326150

T. Schultz and A. Waibel, Language-independent and language-adaptive acoustic modeling for speech recognition, Speech Communication, vol.35, issue.1-2, pp.31-51, 2001.
DOI : 10.1016/S0167-6393(00)00094-7

D. Vaufreydaz, L. Besacier, C. Bergamini, and R. Lamy, From generic to task-oriented speech recognition: French experience in the nespole! european project, ITRW Workshop on Adaptation Methods for Speech Recognition, 2001.
URL : https://hal.archives-ouvertes.fr/inria-00326171

V. Berment, Several technical issues for building new lexical bases, Workshop Papillon, 2002.

R. Rosenfeld, A maximum entropy approach to adaptive statistical language modeling. computer, Speech and Language, pp.187-228, 1996.

R. Ghani, R. Jones, and D. Mladenic, Building Minority Language Corpora by Learning to Generate Web Search Queries, Knowledge and Information Systems, vol.34, issue.1, 2001.
DOI : 10.1023/A:1007545901558

G. A. Monroe, J. C. French, and A. L. Powell, Obtaining language models of web collections using query-based sampling techniques, Proceedings of the 35th Annual Hawaii International Conference on System Sciences, 2002.
DOI : 10.1109/HICSS.2002.993982

R. Nisimura, K. Komatsu, Y. Kuroda, K. Nagatomo, A. Lee et al., Automatic ngram language model creation from web resources, Eurospeech, 2001.

]. H. Doan-nguyen, Techniques génériques d'accumulations d'ensembles lexicaux structurésstructurés`structurésà partir de ressources dictionnairiques informatisées multilingues hétérogènes, Ph.D. dissertation, INPG, issue.11, 1998.

A. Stolcke, Srilm -an extensible language modeling toolkit, International Conference Spoken Language Processing, 2002.