Computing iceberg concept lattices with Titanic

Abstract : We introduce the notion of iceberg concept lattices and show their use in knowledge discovery in databases. Iceberg lattices are a conceptual clustering method, which is well suited for analyzing very large databases. They also serve as a condensed representation of frequent itemsets, as starting point for computing bases of association rules, and as a visualization method for association rules. Iceberg concept lattices are based on the theory of Formal Concept Analysis, a mathematical theory with applications in data analysis, information retrieval, and knowledge discovery. We present a new algorithm called TITANIC for computing (iceberg) concept lattices. It is based on data mining techniques with a level-wise approach. In fact, TITANIC can be used for a more general problem: Computing arbitrary closure systems when the closure operator comes along with a so-called weight function. The use of weight functions for computing closure systems has not been discussed in the literature up to now. Applications providing such a weight function include association rule mining, functional dependencies in databases, conceptual clustering, and ontology engineering. The algorithm is experimentally evaluated and compared with Ganter's Next-Closure algorithm. The evaluation shows an important gain in efficiency, especially for weakly correlated data.
Type de document :
Article dans une revue
Data and Knowledge Engineering, Elsevier, 2002, 42 (2), pp.189-222. <10.1016/S0169-023X(02)00057-5>
Liste complète des métadonnées


https://hal.archives-ouvertes.fr/hal-00578830
Contributeur : Nicolas Pasquier <>
Soumis le : mardi 22 mars 2011 - 14:15:37
Dernière modification le : mardi 25 octobre 2016 - 16:58:08
Document(s) archivé(s) le : jeudi 23 juin 2011 - 02:43:21

Fichier

Stumme_et_al._-_2002.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

Citation

Gerd Stumme, Rafik Taouil, Yves Bastide, Nicolas Pasquier, Lotfi Lakhal. Computing iceberg concept lattices with Titanic. Data and Knowledge Engineering, Elsevier, 2002, 42 (2), pp.189-222. <10.1016/S0169-023X(02)00057-5>. <hal-00578830>

Partager

Métriques

Consultations de
la notice

438

Téléchargements du document

617