A. E. Abduraman, S. A. Berrani, and B. Merialdo, An unsupervised approach for recurrent tv program structuring, Proceddings of the 9th international interactive conference on Interactive television, EuroITV '11, pp.123-126, 2011.
DOI : 10.1145/2000119.2000143

A. E. Abduraman, S. A. Berrani, and B. Merialdo, Audio/visual recurrences and decision trees for unsupervised TV program structuring, VISAPP'13, pp.701-708, 2013.
DOI : 10.1145/2072552.2072556

V. Alfred, Algorithms for finding patterns in strings, Algorithms and Complexity, vol.1, p.255, 2014.

N. Ancona, C. Cicirelli, A. Branca, and A. Distante, Goal detection in football by using support vector machines for classification, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222), pp.611-616, 2001.
DOI : 10.1109/IJCNN.2001.939092

M. Ben and G. Gravier, Unsupervised mining of audiovisually consistent segments in videos with application to structure analysis, 2011 IEEE International Conference on Multimedia and Expo, pp.1-6, 2011.
DOI : 10.1109/ICME.2011.6011951

URL : https://hal.archives-ouvertes.fr/hal-00646603

Z. Botev, J. Grotowski, and D. Kroese, Kernel density estimation via diffusion, The Annals of Statistics, vol.38, issue.5, pp.2916-2957, 2010.
DOI : 10.1214/10-AOS799

URL : http://arxiv.org/abs/1011.2602

Y. F. Chang, P. Lin, S. H. Cheng, K. H. Chan, Y. C. Zeng et al., Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific, pp.1-4, 2014.
DOI : 10.1109/APSIPA.2014.7041717

T. S. Chua, S. F. Chang, L. Chaisorn, and W. Hsu, Story boundary detection in large broadcast news video archives, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, pp.656-659, 2004.
DOI : 10.1145/1027527.1027679

G. E. Crooks, G. Hon, J. M. Chandonia, and S. E. Brenner, WebLogo: A Sequence Logo Generator, Genome Research, vol.14, issue.6, pp.1188-1190, 2004.
DOI : 10.1101/gr.849004

URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC419797

E. Dumont and G. Quénot, Automatic Story Segmentation for TV News Video Using Multiple Modalities, International Journal of Digital Multimedia Broadcasting, vol.11, issue.1, 2012.
DOI : 10.1016/S0167-6393(01)00061-9

URL : https://hal.archives-ouvertes.fr/hal-00767035

V. Gupta, P. Kenny, P. Ouellet, and T. Stafylakis, I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6334-6338, 2014.
DOI : 10.1109/ICASSP.2014.6854823

J. E. Hopcroft, Introduction to automata theory, languages, and computation, 1979.
DOI : 10.1145/568438.568455

A. Jacobs, Using Self-similarity Matrices for Structure Mining on News Video, In: Lect. Notes Artif Int, vol.42, pp.87-94, 2006.
DOI : 10.1007/11752912_11

D. B. Jayagopi, S. Ba, J. M. Odobez, and D. Gatica-perez, Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues, Proceedings of the 10th international conference on Multimodal interfaces, IMCI '08, pp.45-52, 2008.
DOI : 10.1145/1452392.1452403

P. Ji, L. Cao, X. Zhang, L. Zhang, and W. Wu, News videos anchor person detection by shot clustering, Neurocomputing, vol.123, pp.86-99, 2014.
DOI : 10.1016/j.neucom.2013.06.003

E. Kijak, G. Gravier, L. Oisel, and P. Gros, Audiovisual integration for tennis broadcast structuring, Multimedia Tools and Applications, vol.1, issue.1, pp.289-311, 2006.
DOI : 10.1007/s11042-006-0031-5

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.3587

H. Lee, J. Yu, Y. Im, J. M. Gil, and D. Park, A unified scheme of shot boundary detection and anchor shot detection in news video story parsing, Multimedia Tools and Applications, vol.17, issue.2, pp.1127-1145, 2011.
DOI : 10.1007/s11042-010-0462-x

P. Letessier, O. Buisson, and A. Joly, Scalable mining of small visual objects, Proceedings of the 20th ACM international conference on Multimedia, MM '12, pp.599-608, 2012.
DOI : 10.1145/2393347.2393431

URL : https://hal.archives-ouvertes.fr/hal-00739735

H. Li, J. Tang, S. Wu, Y. Zhang, and S. Lin, Automatic detection and analysis of player action in moving background sports video sequences, IEEE Trans. Circuits Syst. Video Technol, vol.20, issue.3, pp.351-364, 2010.

B. Mocanu, R. Tapu, and T. Zaharia, Automatic Segmentation of TV News into Stories Using Visual and Temporal Information, International Conference on Advanced Concepts for Intelligent Vision Systems, pp.648-660, 2016.
DOI : 10.1007/978-3-642-24028-7_21

URL : https://hal.archives-ouvertes.fr/hal-01451798

B. Qu, F. Vallet, J. Carrive, and G. Gravier, Content-based inference of hierarchical structural grammar for recurrent TV programs using multiple sequence alignment, 2014 IEEE International Conference on Multimedia and Expo (ICME), pp.1-6, 2014.
DOI : 10.1109/ICME.2014.6890295

URL : https://hal.archives-ouvertes.fr/hal-01026335

B. Qu, F. Vallet, J. Carrive, and G. Gravier, Content-Based Discovery of Multiple Structures from Episodes of Recurrent TV Programs Based on Grammatical Inference, International Conference on Multimedia Modelling, pp.140-154, 2015.
DOI : 10.1007/978-3-319-14445-0_13

URL : https://hal.archives-ouvertes.fr/hal-01089237

P. Sidiropoulos, V. Mezaris, I. Kompatsiaris, H. Meinedo, M. Bugalho et al., Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features, IEEE Transactions on Circuits and Systems for Video Technology, vol.21, issue.8, pp.1163-1177, 2011.
DOI : 10.1109/TCSVT.2011.2138830

A. Stuhlsatz, C. Meyer, F. Eyben, T. Zielke, G. Meier et al., Deep neural networks for acoustic emotion recognition: Raising the benchmarks, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5688-5691, 2011.
DOI : 10.1109/ICASSP.2011.5947651

J. D. Thompson, D. G. Higgins, and T. J. Gibson, CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Research, vol.22, issue.22, pp.4673-4680, 1994.
DOI : 10.1093/nar/22.22.4673

K. Thompson, Programming Techniques: Regular expression search algorithm, Communications of the ACM, vol.11, issue.6, pp.419-422, 1968.
DOI : 10.1145/363347.363387

L. Xie, P. Xu, S. F. Chang, A. Divakaran, and H. Sun, Structure analysis of soccer video with domain knowledge and hidden Markov models, Pattern Recognition Letters, vol.25, issue.7, pp.767-775, 2004.
DOI : 10.1016/j.patrec.2004.01.005

X. F. Yang, Q. Tian, and P. Xue, Efficient Short Video Repeat Identification With Application to News Video Structure Analysis, IEEE Transactions on Multimedia, vol.9, issue.3, pp.600-609, 2007.
DOI : 10.1109/TMM.2006.889352

D. Q. Zhang, C. Y. Lin, S. F. Chang, and J. R. Smith, Semantic video clustering across sources using bipartite spectral clustering, pp.117-120, 2004.

J. Zhang, J. Qiu, X. Wang, and L. Wu, Representation of the player action in sport videos, 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, pp.1-4, 2013.
DOI : 10.1109/APSIPA.2013.6694283

S. Zhu and Y. Liu, Video scene segmentation and semantic representation using a novel scheme, Multimedia Tools and Applications, vol.8, issue.4, pp.183-205, 2009.
DOI : 10.1007/s11042-008-0233-0