G. Amati and C. J. Rijsbergen, Probabilistic models of information retrieval based on measuring the divergence from randomness, ACM Transactions on Information Systems, vol.20, issue.4, pp.357-389, 2002.
DOI : 10.1145/582415.582416

L. Amsaleg, C. Oussama, T. Furon, S. Girard, M. E. Houle et al., Estimating Local Intrinsic Dimensionality, Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '15, 2015.
DOI : 10.1109/34.368147

URL : https://hal.archives-ouvertes.fr/hal-01159217

A. Bellogín and A. P. De-vries, Understanding Similarity Metrics in Neighbour-based Recommender Systems, Proceedings of the 2013 Conference on the Theory of Information Retrieval, ICTIR '13, pp.48-61, 2013.
DOI : 10.1145/2499178.2499186

A. Beygelzimer, S. Kakade, and J. Langford, Cover trees for nearest neighbor, Proceedings of the 23rd international conference on Machine learning , ICML '06, pp.97-104, 2006.
DOI : 10.1145/1143844.1143857

A. Clauset, C. R. Shalizi, and M. E. Newman, Power-Law Distributions in Empirical Data, SIAM Review, vol.51, issue.4, pp.661-703, 2009.
DOI : 10.1137/070710111

V. Claveau and E. Kijak, Direct vs. indirect evaluation of distributional thesauri, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers The COLING 2016 Organizing Committee, pp.1837-1848, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01394739

V. Claveau, E. Kijak, and O. Ferret, Improving distributional thesauri by exploring the graph of neighbors, International Conference on Computational Linguistics, COLING 2014, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01027545

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

URL : http://www.cs.bham.ac.uk/~pxt/IDA/lsa_ind.pdf

H. Fang, T. Tao, and C. Zhai, Diagnostic Evaluation of Information Retrieval Models, ACM Transactions on Information Systems, vol.29, issue.2, 2011.
DOI : 10.1145/1961209.1961210

URL : http://sifaka.cs.uiuc.edu/czhai/pub/tois-diag.pdf

W. Hersh, C. Buckley, T. J. Leone, and D. Hickam, OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research, Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 192?201. SIGIR '94, 1994.
DOI : 10.1007/978-1-4471-2099-5_20

M. Hoffman, F. R. Bach, D. M. Blei, J. Lafferty, C. Williams et al., Online learning for latent dirichlet allocation, Advances in Neural Information Processing Systems, pp.856-864, 2010.

M. E. Houle, X. Ma, M. Nett, and V. Oria, Dimensional Testing for Multi-step Similarity Search, 2012 IEEE 12th International Conference on Data Mining, pp.299-308, 2012.
DOI : 10.1109/ICDM.2012.91

M. E. Houle and M. Nett, Rank Cover Trees for Nearest Neighbor Search, International Conference on Similarity Search and Appli-cations (SISAP), pp.16-29, 2013.
DOI : 10.1007/978-3-642-41062-8_3

M. Houle, H. Kashima, and M. Nett, Generalized Expansion Dimension, 2012 IEEE 12th International Conference on Data Mining Workshops, pp.587-594, 2012.
DOI : 10.1109/ICDMW.2012.94

E. Levina and P. J. Bickel, Maximum likelihood estimation of intrinsic dimension, Advances in Neural Information Processing Systems (NIPS), 2004.

Y. Lv and C. Zhai, Lower-bounding term frequency normalization, Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, 2011.
DOI : 10.1145/2063576.2063584

URL : http://sifaka.cs.uiuc.edu/czhai/pub/cikm11-bm25.pdf

D. Metzler and W. Croft, Combining the language model and inference network approaches to retrieval. Information Processing and Management Special Issue on, Bayesian Networks and Information Retrieval, vol.40, issue.5, pp.735-750, 2004.
DOI : 10.1016/j.ipm.2004.05.001

T. Mikolov, W. T. Yih, and G. Zweig, Linguistic regularities in continuous space word representations, 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2013, pp.746-751, 2013.

J. M. Ponte and W. B. Croft, A language modeling approach to information retrieval, Proc. of the 21st Annual international ACM SIGIR Conference on Research and Development in information Retrieval (SIGIR '98, pp.275-281, 1998.
DOI : 10.1145/3130348.3130368

S. E. Robertson, S. Walker, and M. Hancock-beaulieu, Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive, Proc. of the 7 th Text Retrieval Conference, pp.199-210, 1998.

S. T. Roweis and L. K. Saul, Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, vol.290, issue.5500, pp.2323-2326, 2000.
DOI : 10.1126/science.290.5500.2323

URL : http://mountains.ece.umn.edu/~guille/Uruguay/2323.pdf

B. Scholkopf, A. J. Smola, and K. R. Muller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998.
DOI : 10.1007/BF02281970

A. Singhal, Modern information retrieval: a brief overview, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, vol.24, 2001.

T. Strohman, D. Metzler, H. Turtle, and W. Croft, Indri: A language-model based search engine for complex queries (extended version), Tech. rep., CIIR, 2005.

H. Turtle and W. Croft, Evaluation of an inference network-based retrieval model, ACM Transactions on Information Systems, vol.9, issue.3, pp.187-222, 1991.
DOI : 10.1145/125187.125188

URL : http://www.doc.ic.ac.uk/~jmag/classic/1991.Evaluation of an inference network-based retrieval model.pdf

J. Venna and S. Kaski, Local multidimensional scaling, Neural Networks, vol.19, issue.6-7, 2006.
DOI : 10.1016/j.neunet.2006.05.014

T. De-vries, S. Chawla, and M. E. Houle, Density-preserving projections for large-scale local anomaly detection, Knowledge and Information Systems, vol.28, issue.10, pp.25-52, 2012.
DOI : 10.1007/s007780050006

C. Zhai and J. D. Lafferty, A study of smoothing methods for language models applied to ad hoc information retrieval, Proc. of the SIGIR conference, pp.334-342, 2001.