D. Vaufreydaz, Modélisation statistique du langage à partir d'Internet pour la reconnaissance automatique de la parole continue, Thèse de doctorat de l'Université J

X. Zhu and R. Rosenfeld, Improving Trigram Language Modelling with the World Wide Web, pp.533-536, 2001.

C. Barras, Transcriber: Development and use of a tool for assisting speech corpora production, Speech Communication, vol.33, issue.1-2, 2000.
DOI : 10.1016/S0167-6393(00)00067-4

M. Kurimo, Unsupervised segmentation of words into morphemes -Morpho Challenge 2005: Application to Automatic Speech Recognition, Interspeech'06, pp.1021-1024, 2006.

N. Abdillahi, Automatic transcription of Somali language, Interspeech, pp.289-292, 2006.

M. Afify, On the use of morphological analysis for dialectal Arabic Speech Recognition The character as an appropriate unit of processing for non-segmenting languages, NLP Annual Meeting Proc. ICSLP'2000 Algorithm, and System Development, pp.277-280731, 2000.

J. Billa, Audio Indexing of Arabic broadcast news, IEEE International Conference on Acoustics Speech and Signal Processing, pp.5-8, 2002.
DOI : 10.1109/ICASSP.2002.5743640

M. Bisani and H. Ney, Multigram-based grapheme-tophoneme conversion for LVCSR, Proceedings of the EUROSPEECH Cambodian System of Writing and Beginning Reader, pp.933-936, 1970.

L. Viet-bac-le, . Besacier, A. Khmer, I. Fiscus, and J. G. , A Post-Processing System to Yield Reduced Word Error Rates: Recogniser Output Voting Error Reduction (ROVER), Proc. IEEE ASRU Workshop 97, pp.129-132, 2006.