H. Christensen, Y. Gotoh, and S. Renals, A cascaded broadcast news highlighter, IEEE transactions on audio, speech, and language processing, vol.16, issue.1, pp.151-161, 2008.

H. Duxans, X. Anguera, and D. Conejero, Audio based soccer game summarization, Broadband Multimedia Systems and Broadcasting, 2009. BMSB'09. IEEE International Symposium on, pp.1-6, 2009.

D. Jouvet, D. Langlois, M. Menacer, D. Fohr, O. Mella et al., Adaptation of speech recognition vocabularies for improved transcription of youtube videos, Journal of the International Science and General Applications, vol.1, issue.1, pp.1-9, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01873801

S. Kullback and R. A. Leibler, On information and sufficiency. The annals of mathematical statistics, vol.22, pp.79-86, 1951.

M. Leszczuk, M. Grega, A. Ko?bia-l, J. Gliwski, K. Wasieczko et al., Video summarization framework for newscasts and reports -work in progress, Multimedia Communications, Services and Security, pp.86-97, 2017.

A. Louis and A. Nenkova, Automatic summary evaluation without human models, 2008.

A. Louis and A. Nenkova, Automatically evaluating content selection in summarization without human models, Conference on Empirical Methods in Natural Language Processing, vol.1, pp.306-314, 2009.

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, 1999.

S. Maskey and J. Hirschberg, Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization, Ninth European Conference on Speech Communication and Technology, 2005.

S. Maskey and J. Hirschberg, Summarizing speech without text using hidden markov models, Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, pp.89-92, 2006.

B. Mcfee, C. Raffel, D. Liang, D. P. Ellis, M. Mcvicar et al., librosa: Audio and music signal analysis in python. In: 14th python in science conference, pp.18-25, 2015.

Z. Rafii and B. Pardo, Music/voice separation using the similarity matrix, pp.583-588, 2012.

M. Rott and P. ?erva, Speech-to-text summarization using automatic phrase extraction from recognized text

S. Text and D. , , pp.101-108, 2016.

H. Saggion, J. M. Torres-moreno, I. D. Cunha, and E. Sanjuan, Multilingual summarization evaluation without human models, Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp.1059-1067, 2010.

G. Szaszák, M. Á. Tündik, and A. Beke, Summarization of spontaneous speech using automatic speech recognition and a speech prosody based tokenizer, pp.221-227, 2016.

C. M. Taskiran, Z. Pizlo, A. Amir, D. Ponceleon, and E. J. Delp, Automated video program summarization using speech transcripts, IEEE Transactions on Multimedia, vol.8, issue.4, pp.775-791, 2006.

J. M. Torres-moreno, Automatic text summarization, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01473135

J. Torres-moreno, H. Saggion, I. Da-cunha, E. Sanjuan, and P. Velázquez-morales, Summary evaluation with and without references, Polibits, vol.42, pp.13-19, 2010.

K. Zechner, Spoken language condensation in the 21st century, Eighth European Conference on Speech Communication and Technology, 2003.

A. Zlatintsi, E. Iosif, P. Marago, and A. Potamianos, Audio salient event detection and summarization using audio and text modalities, Signal Processing Conference (EUSIPCO), pp.2311-2315, 2015.

A. Zlatintsi, P. Maragos, A. Potamianos, and G. Evangelopoulos, A saliency-based approach to audio event detection and summarization, Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European, pp.1294-1298, 2012.