20 years of pattern mining

Arnaud Giacometti 1 Dominique H. Li 1 Patrick Marcel 1 Arnaud Soulet 1
1 BDTLN - Bases de données et traitement des langues naturelles
LIFAT - Laboratoire d'Informatique Fondamentale et Appliquée de Tours
Abstract : In 1993, Rakesh Agrawal, Tomasz Imielinski and Arun N. Swami published one of the founding papers of Pattern Mining: "Mining Association Rules between Sets of Items in Large Databases". Beyond the introduction to a new problem, it introduced a new methodology in terms of resolution and evaluation. For two decades, Pattern Mining has been one of the most active fields in Knowledge Discovery in Databases. This paper provides a bibliometric survey of the literature relying on 1,087 publications from five major international conferences: KDD, PKDD, PAKDD, ICDM and SDM. We first measured a slowdown of research dedicated to Pattern Mining while the KDD field continues to grow. Then, we quantified the main contributions with respect to languages, constraints and condensed representations to outline the current directions. We observe a sophistication of languages over the last 20 years, although association rules and itemsets are so far the most studied ones. As expected, the minimal support constraint predominates the extraction of patterns with approximately 50% of the publications. Finally, condensed representations used in 10% of the papers had relative success particularly between 2005 and 2008.
Document type :
Journal articles
Complete list of metadatas

Contributor : Arnaud Giacometti <>
Submitted on : Monday, July 6, 2015 - 3:27:46 PM
Last modification on : Tuesday, November 19, 2019 - 4:47:44 PM



Arnaud Giacometti, Dominique H. Li, Patrick Marcel, Arnaud Soulet. 20 years of pattern mining. SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining, Association for Computing Machinery (ACM), 2014, 15 (1), pp.41-50. ⟨10.1145/2594473.2594480⟩. ⟨hal-01171955⟩



Record views