L. Besacier, E. Barnard, A. Karpov, and T. Schultz, Automatic speech recognition for under-resourced languages: A survey, Speech Communication, vol.56, pp.85-100, 2014.
DOI : 10.1016/j.specom.2013.07.008

URL : https://hal.archives-ouvertes.fr/hal-00953644

S. R. Maskey, A. W. Black, and L. M. Tomokiyo, Bootstrapping phonetic lexicons for language, Proc. INTERSPEECH, pp.69-72, 2004.

M. Davel and O. Martirosian, Pronunciation dictionary development in resource-scarce envionments, Proc. INTERSPEECH, pp.2851-2854, 2009.

S. S. Juan and L. Besacier, Fast bootstrapping of grapheme to phoneme system for under-resourced languages -application to the iban language, Proc. 4th Workshop on South and Southeast Asian Natural Language Processing 2013, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953784

S. S. Juan, L. Besacier, and S. Rossato, Semi-supervised g2p bootstrapping and its application to asr for a very under-resourced language: Iban, Workshop for Spoken Language Technology for Under-resourced (SLTU), 2014.

S. Thomas, S. Ganapathy, and H. Hermansky, Cross-lingual and multi-stream posterior features for low resource lvcsr systems, Proc. INTERSPEECH, pp.877-880, 2010.

T. Schultz and A. Waibel, Multilingual and crosslingual speech recognition, Proc. DARPA workshop on Broadcast News Transcription and Understanding, 1998.

L. Lu, A. Ghoshal, and S. Renals, Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.1, pp.17-27, 2014.
DOI : 10.1109/TASL.2013.2281575

L. Burget, P. Schwartz, M. Agarwal, P. Akyazi, K. Feng et al., Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4334-4337, 2010.
DOI : 10.1109/ICASSP.2010.5495646

Y. Miao, F. Metze, and A. Waibel, Subspace mixture model for low-resource speech recognition in cross-lingual settings, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7339-7343, 2013.
DOI : 10.1109/ICASSP.2013.6639088

D. Imseng, P. Motlicek, H. Bourlard, and P. N. Garner, Using out-of-language data to improve an under-resourced speech recognizer, Speech Communication, vol.56, issue.0, pp.142-151, 2014.
DOI : 10.1016/j.specom.2013.01.007

T. Schultz and A. Waibel, Fast bootstrapping of lvcsr systems with multilingual phoneme sets, Proc. Eurospeech. Citeseer, pp.371-374, 1997.

T. Schultz, Globalphone: a multilingual speech and text database developed at karlsruhe university, Proc. ICLSP, pp.345-348, 2002.

M. Statistics-department, Negeri sarawak:total population by ethnic group, sub-district and state, Statistics Department, Malaysia, Tech. Rep, 2010.

A. Adelaar, The Austronesian Languages of Asia and Madagascar: A Historical Perspective, The Austronesian Languages of Asia and Madagascar, Routledge Language Family Series, 2005.

M. S. Dryer, M. Haspelmath, and E. , Leipzig: Max Planck Institute for Evolutionary Anthropology, 2013

. Available, Available: http://www.ethnologue.com [19] S. Education-Department, ethnologue.com. [Online], 2007.

C. Barras, E. Geoffrois, Z. Wu, and M. Liberman, Transcriber:development and use of a tool for assisting speech corpora production, Proc. Speech Communication special issue on Speech Annotation and Corpus Tools, 2000.

A. Stolcke, Srilm -an extensible language modeling toolkit, Proc. of the 7th International Conference on Spoken Language Processing, pp.901-904, 2002.

T. Tan, H. Li, E. K. Tang, X. Xiao, and E. S. Chng, MASS: A Malay language LVCSR corpus resource, 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, pp.26-30, 2009.
DOI : 10.1109/ICSDA.2009.5278382

J. R. Novak, Phonetisaurus: A wfst-driven phoneticizer. available at : https, 2012.

J. R. Novak, N. Minematsu, and K. Hirose, Evaluations of an open source wfst-based phoneticezer, 2011.

T. Tan and B. Rainavo-malançon, Malay grapheme to phoneme tool for automatic speech recognition, Proc. Workshop of Malaysia and Indonesia Language Engineering, 2009.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, I. S. P. Society, p.11, 2011.

R. A. Gopinath, Maximum likelihood modeling with Gaussian distributions for classification, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), pp.661-664, 1998.
DOI : 10.1109/ICASSP.1998.675351

M. Gales, Maximum likelihood linear transformations for HMM-based speech recognition, Computer Science and Language, pp.75-98, 1998.
DOI : 10.1006/csla.1998.0043

D. Povey, L. Burget, M. Agarwal, P. Akyazi, F. Kai et al., The subspace Gaussian mixture model???A structured model for speech recognition, Computer Speech & Language, vol.25, issue.2, pp.404-439, 2011.
DOI : 10.1016/j.csl.2010.06.003

A. Rousseau, P. Deléglise, and Y. Estève, Ted-lium: An automatic speech recognition dedicated corpus, Proc. LREC. European Language Resources Association (ELRA), pp.125-129, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01434928