J. Pearl, Causal diagrams for empirical research, Biometrika, vol.82, issue.4, pp.669-710, 1995.

W. Peng and T. Li, On the equivalence between nonnegative tensor factorization and tensorial probabilistic latent semantic analysis, Applied Intelligence, vol.35, issue.2, pp.285-295, 2011.

J. Pessiot, Y. Kim, M. R. Amini, and P. Gallinari, Improving document clustering in a learned concept space, Information Processing & Management, vol.46, issue.2, pp.180-192, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01172634

B. Polyak, Some methods of speeding up the convergence of iteration methods, USSR Computational Mathematics and Mathematical Physics, vol.4, pp.1-17, 1964.

M. F. Porter, An algorithm for suffix stripping, Program, vol.14, issue.3, pp.130-137, 1980.

M. Rajih, P. Comon, and R. A. Harshman, Enhanced line search : a novel method to accelerate PARAFAC, SIAM Journal on Matrix Analysis and Applications, vol.30, issue.3, pp.1128-1147, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00327595

A. S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson, CNN features off-the-shelf : an astounding baseline for recognition, Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.512-519, 2014.

H. Reijner, The development of the horizon graph, proc. Vis08 Workshop From Theory to Practice : Design, Vision and Visualization, 2008.

C. P. Robert, Le choix bayésien : principes et pratique, 2006.

C. P. Robert, G. Celeux, and J. Diebolt, Bayesian estimation of hidden Markov chains : a stochastic implementation, Statistics & Probability Letters, vol.16, pp.77-83, 1993.

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh et al., ImageNet large scale visual recognition challenge, International Journal of Computer Vision, vol.115, issue.3, pp.211-252, 2015.

T. Saito, H. N. Miyamura, M. Yamamoto, H. Saito, Y. Hoshiya et al., Two-tone pseudo coloring : compact visualization for one-dimensional data, proc. InfoVis'05, pp.173-180, 2005.

J. Sanchez, F. Perronnin, T. Mensink, and J. Verbeek, Image classification with the Fisher vector : theory and practice, International Journal of Computer Vision, vol.105, issue.3, pp.222-245, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00779493

R. E. Schapire, Theoretical views of boosting and applications, Proceedings of the 10th International Conference on Algorithmic Learning Theory, pp.13-25, 1999.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of the International Conference on New Methods in Language Processing, 1994.

B. Schölkopf and A. J. Smola, Learning with kernels : support vector machines, regularization, optimization and beyond, 2002.

H. Schulz, Treevis.net : a tree visualization reference, IEEE Computer Graphics and Applications, vol.31, issue.6, pp.11-15, 2011.

S. Shalev-Shwartz, Y. Singer, N. Srebro, and A. Cotter, Pegasos : primal estimated sub-gradient solver for SVM, Mathematical Programming, vol.127, pp.3-30, 2011.

B. Shneiderman, Tree visualization with tree-maps : 2-d space-filling approach, ACM Trans. Graph, vol.11, issue.1, pp.92-99, 1992.

B. Shneiderman, The eyes have it : a task by data type taxonomy for information visualizations, proc. Visual Languages, 1996.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

J. Sivic and A. Zisserman, Video Google : a text retrieval approach to object matching in videos, Proceedings of the 9th IEEE International Conference on Computer Vision, 2003.

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Dropout : a simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, pp.1929-1958, 2014.

R. J. Steele and A. E. Raftery, Performance of Bayesian model selection criteria for Gaussian mixture models, Frontiers of Statistical Decision Making and Bayesian Analysis, vol.2, pp.113-130, 2010.

I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, Proceedings of the 30th International Conference on Machine Learning, 2013.

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going deeper with convolutions, Computer Vision and Pattern Recognition, 2015.

M. Szummer and T. Jaakkola, Partially labeled classification with Markov random walks, Advances in Neural Information Processing Systems, 2002.

A. Toselli and O. B. Widlund, Domain decomposition methods : algorithms and theory, 2005.

A. Treisman and G. Gelade, A feature-integration theory of attention, Cognitive Psychology, vol.12, pp.97-136, 1980.

T. Trouillon, J. Welbl, S. Riedel, E. Gaussier, and G. Bouchard, Complex embeddings for simple link prediction, Proceedings of the 33rd International Conference on Machine Learning, 2016.

J. Truett, J. Cornfield, and W. Kannel, A multivariate analysis of the risk of coronary heart disease in Framingham, Journal of Chronic Diseases, vol.20, issue.7, pp.511-524, 1967.