Skip to Main content Skip to Navigation
Conference papers

The rise and fall of stopword lists in patent searching and mapping

Abstract : Stopword lists are collections of so-called 'noise' words, which appear frequently in documents but are considered as non information-bearing. They have been introduced as a way to enhance information retrieval e.g. patent information retrieval in 1958 and were further developed until the 1970's. Since then, they have remained almost unchanged albeit they have gained importance by forming the basis for advanced tools such as text mining, mapping or clustering tools. Moreover, they are currently mostly used in static form whereas they could benefit from the latest findings of information science.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01276939
Contributor : Antoine Blanchard <>
Submitted on : Sunday, February 21, 2016 - 4:03:57 PM
Last modification on : Monday, April 27, 2020 - 10:00:04 PM

Identifiers

  • HAL Id : hal-01276939, version 1

Citation

Antoine Blanchard. The rise and fall of stopword lists in patent searching and mapping. PIUG Annual Conference, Patent Information Users Group, Inc., May 2006, Minneapolis, MN, United States. ⟨hal-01276939⟩

Share

Metrics

Record views

15

Files downloads

25