
Efficiently Finding Conceptual Clustering Models with Integer Linear Programming

Abstract : Conceptual clustering combines two long-standing machine learning tasks: the unsupervised grouping of similar instances and their description by symbolic concepts. In this paper, we decouple the problems of finding descriptions and forming clusters by first mining formal concepts (i.e. closed itemsets), and then searching for the best k clusters that can be described with those itemsets. Most existing approaches that perform the two steps separately are heuristic and produce results of varying quality. Instead, we address the problem of finding an optimal constrained conceptual clustering by using integer linear programming techniques. Most other generic approaches to this problem tend to scale poorly. Our approach takes advantage of both techniques: the general framework of integer linear programming and the high-speed specialized approaches of data mining. Experiments performed on UCI datasets show that our approach efficiently finds clusterings of consistently high quality.
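
The two-step idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a hypothetical toy dataset, enumerates formal concepts naively, and uses brute-force search as a stand-in for an integer linear programming solver. The objective (total description length of the chosen concepts) and the requirement that the k extents partition the data are assumptions made for illustration only.

```python
from itertools import combinations

# Hypothetical toy data: transaction id -> set of items.
DATA = {
    0: {"a", "b"},
    1: {"a", "b", "c"},
    2: {"a", "c"},
    3: {"d", "e"},
    4: {"d", "e", "f"},
}


def extent(itemset, data):
    """Transactions that contain every item of `itemset`."""
    return frozenset(t for t, items in data.items() if itemset <= items)


def intent(tids, data):
    """Items shared by all transactions in the non-empty collection `tids`."""
    return frozenset(set.intersection(*(data[t] for t in tids)))


def formal_concepts(data):
    """Step 1: map each concept extent to its closed itemset.

    Closes every non-empty subset of transactions. This is exponential,
    so it is only suitable for toy data; a dedicated closed-itemset
    miner would be used in practice.
    """
    concepts = {}
    tids = list(data)
    for r in range(1, len(tids) + 1):
        for subset in combinations(tids, r):
            closed_items = intent(subset, data)
            concepts[extent(closed_items, data)] = closed_items
    return concepts


def best_clustering(data, k):
    """Step 2: choose k concepts whose extents partition the transactions.

    Brute force replaces the ILP solver here. The score (sum of
    description lengths) is an assumed objective, not taken from the paper.
    Returns (None, -1) when no valid clustering with k clusters exists.
    """
    concepts = list(formal_concepts(data).items())
    all_tids = frozenset(data)
    best, best_score = None, -1
    for combo in combinations(concepts, k):
        extents = [ext for ext, _ in combo]
        # Sizes summing to n plus full coverage implies pairwise disjointness.
        if sum(len(e) for e in extents) != len(all_tids):
            continue
        if frozenset().union(*extents) != all_tids:
            continue
        score = sum(len(items) for _, items in combo)
        if score > best_score:
            best, best_score = combo, score
    return best, best_score


if __name__ == "__main__":
    clusters, score = best_clustering(DATA, 2)
    for ext, items in clusters:
        print(sorted(ext), "described by", sorted(items))
```

In an ILP formulation of the same selection problem, each candidate concept would typically get a binary variable, with one equality constraint per transaction and one cardinality constraint fixing the number of chosen concepts to k. The exhaustive loop above only mimics that search on a scale where enumeration is feasible.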

https://hal.archives-ouvertes.fr/hal-01597804
Contributor : Albrecht Zimmermann
Submitted on : Thursday, September 28, 2017 - 5:24:46 PM
Last modification on : Tuesday, March 3, 2020 - 3:44:03 PM
Document(s) archived on : Friday, December 29, 2017 - 3:45:37 PM

File

ijcai16.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01597804, version 1

Citation

Abdelkader Ouali, Samir Loudni, Yahia Lebbah, Patrice Boizumault, Albrecht Zimmermann, et al. Efficiently Finding Conceptual Clustering Models with Integer Linear Programming. 25th International Joint Conferences on Artificial Intelligence, Jul 2016, New York, United States. ⟨hal-01597804⟩
