D. Berndt and J. Clifford, Using dynamic time warping to find patterns in time series, AAAI Workshop on Knowledge Discovery in Databases, p.94, 1994.

C.-W. Huang, Automatic closed caption alignment based on speech recognition transcripts, 2003.

G. Linarès, D. Massonié, and P. Nocéra, Scalable language model look-ahead for LVCSR, InterSpeech'05, 2005.

S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J. Bonastre et al., The ESTER Phase II Evaluation Campaign for the Rich Transcription of French Broadcast News, Proc. of the European Conf. on Speech Communication and Technology, 2005.

P. J. Jang and A. G. Hauptmann, Improving acoustic models with captioned multimedia speech, IEEE International Conference on Multimedia Computing and Systems, 1999.

L. Lamel, J. L. Gauvain, and G. Adda, Lightly supervised and unsupervised acoustic model training, Computer Speech & Language, vol.16, issue.1, pp.115-129, 2002.
DOI : 10.1006/csla.2001.0186
URL : https://hal.archives-ouvertes.fr/halshs-01252269

P. J. Moreno, C. Joerg, J.-M. Van Thong, and O. Glickman, A recursive algorithm for the forced alignment of very long audio segments, International Conference on Spoken Language Processing, 1998.

P. Nocéra, G. Linarès, and D. Massonié, Phoneme lattice based A* search algorithm for speech recognition, Text, Speech and Dialogue, 5th International Conference, 2002.
DOI : 10.1007/3-540-46154-x_41
URL : https://hal.archives-ouvertes.fr/hal-01319837

M. Eskenazi, U. Jain, V. Parikh, B. Raj, M. Ravishankar et al., The 1996 Hub-4 Sphinx-3 system, Proceedings of the 1997 ARPA Speech Recognition Workshop, pp.85-89, 1997.

P. Placeway and J. Lafferty, Cheating with imperfect transcripts, Proceedings of the Fourth International Conference on Spoken Language Processing, ICSLP '96, 1996.
DOI : 10.1109/ICSLP.1996.607220
URL : http://www.asel.udel.edu/icslp/cdrom/vol4/710/a710.pdf