Constrained Independence for Detecting Interesting Patterns - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Constrained Independence for Detecting Interesting Patterns

Résumé

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.
Fichier non déposé

Dates et versions

hal-01247650 , version 1 (22-12-2015)

Identifiants

Citer

Thomas Delacroix, Ahcène Boubekki, Philippe Lenca, Stéphane Lallich. Constrained Independence for Detecting Interesting Patterns. DSAA 2015 : IEEE International Conference on Data Science and Advanced Analytics, Oct 2015, Paris, France. pp.1 - 10, ⟨10.1109/DSAA.2015.7344897⟩. ⟨hal-01247650⟩
211 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More