D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet allocation, Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.

M. Eskevich, R. Aly, D. N. Racca, R. Ordelman, S. Chen et al., The Search and Hyperlinking task at MediaEval 2014, Working notes of the MediaEval 2014 Workshop, 2014.

J. R. Finkel, T. Grenager, and C. D. Manning, Incorporating non-local information into information extraction systems by Gibbs sampling, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL '05, 2005.
DOI : 10.3115/1219840.1219885

J. Gauvain, L. Lamel, and G. Adda, The LIMSI Broadcast News transcription system, Speech Communication, vol.37, issue.1-2, pp.89-108, 2002.
DOI : 10.1016/S0167-6393(01)00061-9
URL : https://hal.archives-ouvertes.fr/hal-01434493

C. Guinaudeau, A. Simon, G. Gravier, and P. Sébillot, HITS and IRISA at MediaEval 2013: Search and hyperlinking task, Working Notes Proceedings of the MediaEval Workshop, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00906249

M. Steyvers and T. Griffiths, Probabilistic topic models, Handbook of Latent Semantic Analysis, pp.424-440, 2007.

M. Utiyama and H. Isahara, A statistical model for domain-independent text segmentation, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, ACL '01, 2001.
DOI : 10.3115/1073012.1073076