A Novel Decomposition Algorithm for Binary Datatables: Encouraging Results on Discrimination Tasks - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

A Novel Decomposition Algorithm for Binary Datatables: Encouraging Results on Discrimination Tasks

Résumé

We present here an algorithm for decomposing any binary datatable into a set of “sufficient itemsets”, i.e. a non-redundant list of itemsets adequate for reconstructing the whole table up to a permutation of the rows. For doing so, we have replaced the “support” threshold criterion of the well-known Apriori algorithm by a “number of liberties”: the liberty count expresses how a (k+1)-level itemset is constrained by its k-level “parents”, till the level when the situation turns frozen. Our algorithm is symmetric: we take into account the absence of items as well as their presence in our itemsets. Conversely, we present a method for reconstituting the original data starting from our exact MIDOVA representation. We illustrate these points with the examples of Breast Cancer and Mushroom datasets from UCI Repository. We validate our approach by deriving a learning classifier approach and applying it to three discrimination problems drawn from the above-mentioned repository.
Fichier non déposé

Dates et versions

hal-00460310 , version 1 (26-02-2010)

Identifiants

  • HAL Id : hal-00460310 , version 1

Citer

Martine Cadot, Alain Lelu. A Novel Decomposition Algorithm for Binary Datatables: Encouraging Results on Discrimination Tasks. Fourth International Conference on Research Challenges in Information Science - RCIS 2010, May 2010, Nice, France. pp.57-68. ⟨hal-00460310⟩
109 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More