The Data Problem in Data Mining - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

The Data Problem in Data Mining

Résumé

Computer science is essentially an applied or engineering science , creating tools. In Data Mining, those tools are supposed to help humans understand large amounts of data, and produce actionable insight. In this talk, I argue that for all the progress that has been made in Data Mining, in particular Pattern Mining, we are lacking understanding of key aspects of the performance and results of pattern mining algorithms. I will focus particularly on the difficulty of deriving actionable knowledge from patterns. I trace the lack of progress regarding those questions to a lack of data with varying, controlled properties, and argue that we will need to make a science of digital data generation, and use it to develop guidance to data practitioners.
Fichier principal
Vignette du fichier
invited_paper_1.pdf (37.55 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01627738 , version 1 (02-11-2017)

Identifiants

  • HAL Id : hal-01627738 , version 1

Citer

Albrecht Zimmermann. The Data Problem in Data Mining. Advances in Intelligent Data Analysis XIV - 14th International Symposium (IDA), Oct 2015, St. Étienne, France. pp.1-2. ⟨hal-01627738⟩
111 Consultations
15 Téléchargements

Partager

Gmail Facebook X LinkedIn More