G. Amati and C. J. Rijsbergen, Probabilistic models of information retrieval based on measuring the divergence from randomness, ACM Transactions on Information Systems, vol.20, issue.4, pp.357-389, 2002.
DOI : 10.1145/582415.582416

A. Beygelzimer, S. Kakade, and J. Langford, Cover trees for nearest neighbor, Proceedings of the 23rd international conference on Machine learning , ICML '06, pp.97-104, 2006.
DOI : 10.1145/1143844.1143857

V. Claveau and E. Kijak, « Thésaurus distributionnels pour la recherche d'information et viceversa, Revue des Sciences et Technologies de l'Information -Série Document Numérique, 2015.

V. Claveau, E. Kijak, and O. Ferret, « Improving distributional thesauri by exploring the graph of neighbors, International Conference on Computational Linguistics, COLING 2014, 2014.

T. De-vries, S. Chawla, and M. E. Houle, Density-preserving projections for large-scale local anomaly detection, Knowledge and Information Systems, vol.28, issue.10, pp.25-52, 2012.
DOI : 10.1007/s10115-011-0430-4

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.8490

O. Ferret, « Identifying Bad Semantic Neighbors for Improving Distributional Thesauri, 51 st Annual Meeting of the Association for Computational Linguistics, pp.561-571, 2013.

E. Gabrilovich and S. Markovitch, Computing semantic relatedness using wikipedia-based explicit semantic analysis, 20 th International Joint Conference on Artificial Intelligence, pp.6-12, 2007.

M. Hoffman, F. R. Bach, D. M. Blei, and . Online, Learning for Latent Dirichlet Allocation, Advances in Neural Information Processing Systems, pp.856-864, 2010.

M. E. Houle, X. Ma, M. Nett, and V. Oria, Dimensional Testing for Multi-step Similarity Search, 2012 IEEE 12th International Conference on Data Mining, pp.299-308, 2012.
DOI : 10.1109/ICDM.2012.91

M. E. Houle and M. Nett, Rank Cover Trees for Nearest Neighbor Search, International Conference on Similarity Search and Appli-cations (SISAP), pp.16-29, 2013.
DOI : 10.1007/978-3-642-41062-8_3

E. Levina and P. J. Bickel, « Maximum likelihood estimation of intrinsic dimension, Advances in Neural Information Processing Systems (NIPS), 2004.

G. A. Miller and . Wordnet, An On-Line Lexical Database, International Journal of Lexicography, 1990.

J. M. Ponte and W. B. Croft, A language modeling approach to information retrieval, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '98, pp.275-281, 1998.
DOI : 10.1145/290941.291008

S. E. Robertson, S. Walker, and M. Hancock-beaulieu, « Okapi at TREC-7 : Automatic Ad Hoc, Filtering, VLC and Interactive, Proc. of the 7 th Text Retrieval Conference, pp.199-210, 1998.

S. T. Roweis and L. K. Saul, Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, vol.290, issue.5500, pp.2323-2326, 2000.
DOI : 10.1126/science.290.5500.2323

B. Scholkopf, A. J. Smola, and K. Muller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.20, issue.5, pp.1299-1319, 1998.
DOI : 10.1007/BF02281970

A. Singhal, « Modern information retrieval : a brief overview, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 2001.

T. Strohman, D. Metzler, H. Turtle, and W. Croft, Indri : A language-model based search engine for complex queries (extended version), 2005.

H. Turtle and W. Croft, Evaluation of an inference network-based retrieval model, ACM Transactions on Information Systems, vol.9, issue.3, pp.187-222, 1991.
DOI : 10.1145/125187.125188

E. M. Voorhees, Query Expansion using Lexical-Semantic Relations, Proc. of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '94, pp.61-69, 1994.
DOI : 10.1007/978-1-4471-2099-5_7

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.407.5135

C. Zhai and J. D. Lafferty, A study of smoothing methods for language models applied to Ad Hoc information retrieval, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '01, pp.334-342, 2001.
DOI : 10.1145/383952.384019