S. Mendelsohn, Patterns formed by a single shot of malt, Information World Review, 2000.

C. Boulakia, Patent mapping, ScienceCareers.org, URL http://sciencecareers.sciencemag.org, 2001.

P. Rees, Patent pictures: It's patently good news, 2004.

A. L. Porter and S. W. Cunningham, Tech Mining: Exploiting New Technologies for Competitive Advantage, 2005.
DOI : 10.1002/0471698466

H. P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal of Research and Development, vol.2, issue.2, pp.159-165, 1958.
DOI : 10.1147/rd.22.0159

G. K. Zipf, Human behavior and the principle of least effort, 1949.

W. Li, Bibliography on Zipf's law, 1997.

K. Papineni, Why inverse document frequency?, Second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies 2001, NAACL '01, pp.1-8, 2001.
DOI : 10.3115/1073336.1073340

R. K. Belew, Finding Out About: Search Engine Technology from a Cognitive Perspective, 2000.

G. Salton and C. Yang, On the specification of term values in automatic indexing, Journal of Documentation, vol.29, issue.4, pp.351-372, 1973.
DOI : 10.1108/eb026562

K. Church and W. Gale, Inverse Document Frequency (IDF): A Measure of Deviations from Poisson, Proceedings of the Third Workshop on Very Large Corpora, pp.121-130, 1995.
DOI : 10.1007/978-94-017-2390-9_18

G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, vol.24, issue.5, pp.513-523, 1988.
DOI : 10.1016/0306-4573(88)90021-0

A. J. Trippe, Patinformatics: Tasks to tools, World Patent Information, vol.25, issue.3, pp.211-221, 2003.
DOI : 10.1016/S0172-2190(03)00079-6

G. Salton and M. J. McGill, Introduction to Modern Information Retrieval, 1983.

B. Vickery and A. Vickery, Information Science in Theory and Practice, 1987.
DOI : 10.1515/9783598440083

A. J. Trippe, Visualization of chemical patents: Source titles and abstracts vs. enhanced titles and abstracts, p.223, 2002.

M. P. Sinka and D. W. Corne, Evolving better stoplists for document clustering and web intelligence, Design and Application of Hybrid Intelligent Systems, pp.1015-1023, 2003.

M. Fattori, G. Pedrazzi, and R. Turra, Text mining applied to patent mapping: a practical business case, World Patent Information, vol.25, issue.4, pp.335-342, 2003.
DOI : 10.1016/S0172-2190(03)00113-3

S. Robin, Statistical analysis of microarray data, Teaching material from Institut National Agronomique Paris-Grignon, 2004.

A. J. Trippe, A comparison of ideologies: intellectually assigned co-coding clustering vs ThemeScape automatic themematic mapping, Proceedings of the 2001 Chemical Information Conference, 2001.

A. J. Trippe, Patinformatics: Identifying haystacks from space, Searcher, vol.10, issue.28, 2002.

G. Fischer and N. Lalyre, Analysis and visualisation with host-based software – The features of STN®AnaVist™, World Patent Information, vol.28, issue.4, pp.312-318, 2006.
DOI : 10.1016/j.wpi.2006.04.007

R. T. Lo, B. He, and I. Ounis, Automatically building a stopword list for an information retrieval system, Journal of Digital Information Management, vol.3, issue.1, pp.3-8, 2005.

S. Kullback and R. A. Leibler, On Information and Sufficiency, The Annals of Mathematical Statistics, vol.22, issue.1, pp.79-86, 1951.
DOI : 10.1214/aoms/1177729694

R. K. Al-Halimi and F. W. Tompa, Using word position in documents for topic characterization, Tech, Canada, vol.36, 2003.