E. M. Airoldi, W. W. Cohen, and S. E. Fienberg, Bayesian methods for frequent terms in text: Models of contagion and the Δ² statistic

G. Amati and C. J. van Rijsbergen, Probabilistic models of information retrieval based on measuring the divergence from randomness, ACM Transactions on Information Systems, vol.20, issue.4, pp.357-389, 2002.
DOI : 10.1145/582415.582416

A.-L. Barabási and R. Albert, Emergence of scaling in random networks, Science, vol.286, issue.5439, pp.509-512, 1999.

D. Chakrabarti and C. Faloutsos, Graph mining, ACM Computing Surveys, vol.38, issue.1, 2006.
DOI : 10.1145/1132952.1132954

K. W. Church, Empirical estimates of adaptation, Proceedings of the 18th conference on Computational linguistics, pp.180-186, 2000.
DOI : 10.3115/990820.990847

K. W. Church and W. A. Gale, Poisson mixtures, Natural Language Engineering, vol.1, issue.2, pp.163-190, 1995.
DOI : 10.1002/asi.4630260402

S. Clinchant and É. Gaussier, The BNB Distribution for Text Modeling, Macdonald et al. [12], pp.150-161
DOI : 10.1007/978-3-540-78646-7_16

C. Elkan, Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution, Proceedings of the 23rd international conference on Machine learning, ICML '06, pp.289-296, 2006.
DOI : 10.1145/1143844.1143881

H. Fang, T. Tao, and C. Zhai, A formal study of information retrieval heuristics, Proceedings of the 27th annual international conference on Research and development in information retrieval, SIGIR '04, 2004.
DOI : 10.1145/1008992.1009004

W. Feller, An Introduction to Probability Theory and Its Applications, 1968.

S. Harter, A probabilistic approach to automatic keyword indexing, part 1: On the distribution of speciality words in a technical literature, part 2: An algorithm for probabilistic indexing, Journal of the American Society for Information Science, vol.26, pp.197-206, 1975.

C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven, and R. W. White, Advances in Information Retrieval, 30th European Conference on IR Research Proceedings, volume 4956 of Lecture Notes in Computer Science, 2008.
DOI : 10.1007/978-3-540-78646-7

R. E. Madsen, D. Kauchak, and C. Elkan, Modeling word burstiness using the Dirichlet distribution, Proceedings of the 22nd international conference on Machine learning, ICML '05, pp.545-552, 2005.
DOI : 10.1145/1102351.1102420

S. Na, I. Kang, and J. Lee, Improving term frequency normalization for multitopical documents and application to language modeling approaches, Macdonald et al. [12], pp.382-393

S. E. Robertson and S. Walker, Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval, SIGIR '94: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, pp.232-241, 1994.
DOI : 10.1007/978-1-4471-2099-5_24

G. Salton and M. J. McGill, Introduction to Modern Information Retrieval, 1983.

A. Singhal, C. Buckley, and M. Mitra, Pivoted document length normalization, Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '96, pp.21-29, 1996.
DOI : 10.1145/243199.243206

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.128.1360

Z. Xu and R. Akella, A new probabilistic retrieval model based on the Dirichlet compound multinomial distribution, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, pp.427-434, 2008.
DOI : 10.1145/1390334.1390408

C. Zhai and J. Lafferty, A study of smoothing methods for language models applied to information retrieval, ACM Transactions on Information Systems, vol.22, issue.2, pp.179-214, 2004.
DOI : 10.1145/984321.984322