D. Berndt and J. Clifford, Using dynamic time warping to find patterns in time series, Workshop on Knowledge Discovery in Databases (KDD'94, pp.359-370, 1994.

J. Bonastre, F. Wils, and S. Meignier, ALIZE, a free toolkit for speaker recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.737-740, 2005.
DOI : 10.1109/ICASSP.2005.1415219
URL : https://hal.archives-ouvertes.fr/hal-01434280

P. Brown, S. Chen, S. D. Pietra, V. D. Pietra, S. Kehler et al., Automatic speech recognition in machine-aided translation, Computer Speech & Language, vol.8, issue.3, pp.177-187, 1994.
DOI : 10.1006/csla.1994.1008

P. Cardinal, G. Boulianne, and M. Comeau, Segmentation of recordings based on partial transcriptions, Proc. Interspeech'05, pp.3345-3348, 2005.

H. Y. Chan and P. Woodland, Improving broadcast news transcription by lightly supervised discriminative training, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.737-777, 2004.
DOI : 10.1109/ICASSP.2004.1326091
URL : http://mi.eng.cam.ac.uk/reports/svr-ftp/chan_icassp2004.pdf

L. Chen, J. Gauvain, L. Lamel, and G. Adda, Dynamic language modeling for broadcast news, Proc. International Conference on Spoken Language Processing, pp.1281-1284, 2004.

L. Chen, L. Lamel, and J. Gauvain, Lightly supervised acoustic model training using consensus networks, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.

G. Chollet, Evaluation of asr systems, algorithms and databases. Speech recognition and coding: New advances and trends, pp.32-40, 1995.
DOI : 10.1007/978-3-642-57745-1_3

P. Clarkson and A. Robinson, Language model adaptation using mixtures and an exponentially decaying cache, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, p.97, 1997.
DOI : 10.1109/ICASSP.1997.596049
URL : http://svr-www.eng.cam.ac.uk/reports/svr-ftp/clarkson_icassp97.ps.gz

T. Cormen, C. Leiserson, R. Rivest, and C. Stein, Introduction to algorithms, 2001.

M. Finke and A. Waibel, Flexible transcription alignment, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, pp.34-40, 1997.
DOI : 10.1109/ASRU.1997.658974
URL : http://www.ri.cmu.edu/pub_files/pub1/finke_micheal_1997_1/finke_micheal_1997_1.pdf

J. G. Fiscus, J. Ajot, and J. S. Garofolo, The rich transcription 2007 meeting recognition evaluation. Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR'07 and RT'07, pp.373-389, 2008.
DOI : 10.1007/978-3-540-68585-2_36
URL : http://www.itl.nist.gov/iad/mig/publications/storage_paper/RT07Results-v08.pdf

S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J. Bonastre et al., The ester phase 2 based evaluation campaign for the rich transcription of french broadcast news, Proc. of the European Conference on Speech Communication and Technology (ICSLP'05, pp.1149-1152, 2005.

A. Haubold and J. R. Kender, Alignment of Speech to Highly Imperfect Text Transcriptions, Multimedia and Expo, 2007 IEEE International Conference on, pp.224-227, 2007.
DOI : 10.1109/ICME.2007.4284627

D. A. Hull, Stemming algorithms: A case study for detailed evaluation, Journal of the American Society for Information Science, vol.47, issue.1, pp.70-84, 1996.
DOI : 10.1002/(SICI)1097-4571(199601)47:1<70::AID-ASI7>3.0.CO;2-#
URL : http://www.xrce.xerox.com/people/hull/hull/./papers/jasis96.ps

O. Ibrahimov, I. K. Sethi, and N. Dimitrova, Clustering of Imperfect Transcripts Using a Novel Similarity Measure, Journal: Information Retrieval Techniques for Speech Applications, vol.1, pp.23-34, 2002.
DOI : 10.1007/3-540-45637-6_3
URL : http://iielab-secs.secs.oakland.edu/publications/oktay_SIGIR2001.pdf

R. Iyer and M. Ostendorf, Modeling long distance dependence in language: topic mixtures versus dynamic cache models, IEEE Transactions on Speech and Audio Processing, vol.7, issue.1, pp.30-39, 1999.
DOI : 10.1109/89.736328

F. Jelinek, SELF-ORGANIZED LANGUAGE MODELING FOR SPEECH RECOGNITION, Journal: Language Processing for Speech Recognition, pp.450-506, 1990.
DOI : 10.1016/B978-0-08-051584-7.50045-0
URL : http://www.aclweb.org/anthology-new/H/H91/H91-1057.pdf

T. Kemp and A. Waibel, Unsupervised training of a speech recognizer: Recent experiments, Eurospeech'99, pp.2725-2728, 1999.

E. Keogh and M. Pazzani, Derivative Dynamic Time Warping, International Conference on Data Mining (SDM'01), 2001.
DOI : 10.1137/1.9781611972719.1
URL : http://www.siam.org/proceedings/datamining/2001/dm01_01KeoghE.pdf

R. Kuhn and R. De-mori, A cache-based natural language model for speech recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.12, issue.6, pp.570-583, 1990.
DOI : 10.1109/34.56193

L. Lamel, J. Gauvain, and G. Adda, Lightly supervised and unsupervised acoustic model training, Computer Speech & Language, vol.16, issue.1, pp.115-229, 2002.
DOI : 10.1006/csla.2001.0186
URL : https://hal.archives-ouvertes.fr/halshs-01252269

B. Lecouteux and G. Linarès, Using prompts to produce quality corpus for training automatic speech recognition systems, MELECON 2008, The 14th IEEE Mediterranean Electrotechnical Conference, pp.841-846, 2008.
DOI : 10.1109/MELCON.2008.4618540
URL : https://hal.archives-ouvertes.fr/hal-01318050

B. Lecouteux, G. Linarès, F. Beaugendre, and P. Nocéra, Text island spotting in large speech databases, Interspeech'07, pp.1318-1321, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01318080

B. Lecouteux, G. Linarès, J. Bonastre, and P. Nocéra, Imperfect transcript driven speech recognition, InterSpeech'06, pp.1626-1629, 2006.
URL : https://hal.archives-ouvertes.fr/hal-01318085

B. Lecouteux, G. Linarès, Y. Estève, and J. Mauclair, System Combination by Driven Decoding, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.341-344, 2007.
DOI : 10.1109/ICASSP.2007.366919
URL : https://hal.archives-ouvertes.fr/hal-01318073

G. Linarès, P. Nocéra, D. Massonié, and D. Matrouf, The LIA Speech Recognition System: From 10xRT to 1xRT, Proc. of the 10th international conference on Text, Speech and Dialogue (TSD'07, pp.302-308, 2007.
DOI : 10.1007/978-3-540-74628-7_40

C. Martins, A. Teixeira, and J. Neto, Dynamic language modeling for a daily broadcast news transcription system, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pp.165-170, 2007.
DOI : 10.1109/ASRU.2007.4430103

D. Massonié, P. Nocéra, and G. Linarès, Scalable language model look-ahead for lvcsr, Proc. of InterSpeech'05, pp.569-572, 2005.

M. Mohri, Edit-Distance of Weighted Automata, Conference on Implementation and Application of Automata (CIAA'02, pp.1-23, 2002.
DOI : 10.1007/3-540-44977-9_1
URL : http://www.research.att.com/~mohri/postscript/wer.ps

P. J. Moreno, C. Joerg, J. V. Thong, and O. Glickman, A recursive algorithm for the forced alignment of very long audio segments, International Conference on Spoken Language Processing (ICSLP'98), 1998.

H. Ney, U. Essen, and R. Kneser, On structuring probabilistic dependencies in stochastic language modeling, Journal: Computer Speech and Language, vol.8, pp.1-38, 1994.
DOI : 10.1006/csla.1994.1001

L. Nguyen and B. Xiang, Light supervision in acoustic model training, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.185-188, 2004.

M. Paulik, C. Fügen, S. Stüker, T. Schultz, T. Schaaf et al., Document driven machine translation enhanced asr, Proc. Interspeech'05, pp.2261-2264, 2005.

M. Paulik and A. Waibel, Lightly supervised acoustic model training on epps recordings, Proc. Interspeech'08, pp.224-227, 2008.

S. Petrik and G. Kubin, Reconstructing Medical Dictations from Automatically Recognized and Non-Literal Transcripts with Phonetic Similarity Matching, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.1125-1128, 2007.
DOI : 10.1109/ICASSP.2007.367272

S. Petrik and F. Pernkopf, Automatic phonetics-driven reconstruction of medical dictations on multiple levels of segmentation, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4317-4320, 2008.
DOI : 10.1109/ICASSP.2008.4518610

P. Placeway, S. Chen, M. Eskenazi, U. Jain, V. Parikh et al., The 1996 hub-4 sphinx-3 system, Proc. of the 1997 ARPA Speech Recognition Workshop, pp.85-89, 1997.

P. Placeway and J. Lafferty, Cheating with imperfect transcripts, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.2115-2118, 1996.
DOI : 10.1109/ICSLP.1996.607220
URL : http://www.asel.udel.edu/icslp/cdrom/vol4/710/a710.pdf

M. Rouvier, G. Linarès, and B. Lecouteux, On-the-fly term spotting by phonetic filtering and request-driven decoding, 2008 IEEE Spoken Language Technology Workshop, pp.305-308, 2008.
DOI : 10.1109/SLT.2008.4777901
URL : https://hal.archives-ouvertes.fr/hal-01320210

G. Salton, Automatic Text Processing, 1988.

G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, vol.24, issue.5, pp.513-523, 1988.
DOI : 10.1016/0306-4573(88)90021-0
URL : http://www.doc.ic.ac.uk/~jmag/classic/1988.Term-weighting approaches in automatic text retrieval.pdf

T. F. Smith and M. S. Waterman, Identification of common molecular subsequences, Journal of Molecular Biology, vol.147, issue.1, pp.195-197, 1981.
DOI : 10.1016/0022-2836(81)90087-5
URL : http://www.cmb.usc.edu/papers/msw_papers/msw-042.pdf

R. Stern, Specifications of the 1996 hub-4 broadcast news evaluation, Proc. of the DARPA Speech Recognition Workshop, 1997.

P. Taylor, A. Black, and R. Caley, The architecture of the festival speech synthesis system, Proc. of the third ESCA Workshop in Speech Synthesis, pp.147-151, 1998.

B. Tshibasu-kabeya, G. Bontempi, F. Beaugendre, and G. Marechal, Aidar : Une architecture pour l'indexation de documents audio numériques, Proc. Veille Stratégique Scientifique & Technologique (VSST'06), 2006.

R. Wagner and M. Fisher, The String-to-String Correction Problem, Journal of the ACM, vol.21, issue.1, pp.168-173, 1974.
DOI : 10.1145/321796.321811

F. Wessel and H. Ney, Unsupervised training of acoustic models for large vocabulary continuous speech recognition, IEEE Transactions on Speech and Audio Processing, vol.13, issue.1, pp.23-31, 2005.
DOI : 10.1109/TSA.2004.838537

M. J. Witbrock and A. G. Hauptmann, Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents, Proceedings of the second ACM international conference on Digital libraries , DL '97, pp.30-35, 1997.
DOI : 10.1145/263690.263779

M. J. Witbrock and A. G. Hauptmann, Improving acoustic models by watching television, 1998.
DOI : 10.21236/ADA350494
URL : http://www.informedia.cs.cmu.edu/documents/aaaisss97-mjw.pdf

P. Woodland and D. Povey, Large scale discriminative training of hidden Markov models for speech recognition, Computer Speech & Language, vol.16, issue.1, pp.25-47, 2002.
DOI : 10.1006/csla.2001.0182

K. Yu, M. Gales, L. Wang, and P. C. Woodland, Unsupervised training and directed manual transcription for LVCSR, Speech Communication, vol.52, issue.7-8, pp.652-663, 2010.
DOI : 10.1016/j.specom.2010.02.014