SPoID: Do Not Throw Meaningful Incomplete Sequences Away! - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

SPoID: Do Not Throw Meaningful Incomplete Sequences Away!

Résumé

Industrial databases often contain a large amount of unfilled information. During the knowledge discovery process one processing step is often necessary in order to remove these incomplete data either by deleting or assessing them. When the data mining task consists in mining for frequent sequences, incomplete data are, most of the time, deleted, which leads to an important loss of information. Extracted knowledge then becomes less representative of the database. Therefore we propose a method that uses the partial information contained in incomplete records, only temporary ignoring the missing part of the record. Experiments run on various synthetic datasets show the validity of our proposal as well in terms of quality as in terms of the robustness to the rate of missing values.

Domaines

Autre
Fichier principal
Vignette du fichier
proceeding-eusflat-2007-I pages 329 - 336.pdf (618.28 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

lirmm-00173030 , version 1 (21-09-2019)

Identifiants

  • HAL Id : lirmm-00173030 , version 1

Citer

Céline Fiot, Anne Laurent, Maguelonne Teisseire. SPoID: Do Not Throw Meaningful Incomplete Sequences Away!. EUSFLAT, European Society For Fuzzy Logic and Technologies, Sep 2007, Ostrava, Czech Republic. pp.329-336. ⟨lirmm-00173030⟩
104 Consultations
26 Téléchargements

Partager

Gmail Facebook X LinkedIn More