The Cluster Description Problem -Complexity Results, Formulations and Approximations - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

The Cluster Description Problem -Complexity Results, Formulations and Approximations

S S Ravi
  • Fonction : Auteur
  • PersonId : 1031702

Résumé

Consider the situation where you are given an existing k-way clustering π. A challenge for explainable AI is to find a compact and distinct explanation of each cluster which in this paper is assumed to use instance-level descriptors/tags from a common dictionary. Since the descriptors/tags were not given to the clustering method, this is not a semi-supervised learning situation. We show that the feasibility problem of testing whether any distinct description (not necessarily the most compact) exists is generally intractable for just two clusters. This means that unless P = NP, there cannot exist an efficient algorithm for the cluster description problem. Hence, we explore ILP formulations for smaller problems and a relaxed but restricted setting that leads to a polynomial time algorithm for larger problems. We explore several extensions to the basic setting such as the ability to ignore some instances and composition constraints on the descriptions of the clusters. We show our formulation's usefulness on Twitter data where the communities were found using social connectivity (i.e. follower relation) but the explanation of the communities is based on behavioral properties of the nodes (i.e. hashtag usage) not available to the clustering method.
Fichier principal
Vignette du fichier
7857-the-cluster-description-problem-complexity-results-formulations-and-approximations.pdf (3 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02060574 , version 1 (07-03-2019)

Identifiants

  • HAL Id : hal-02060574 , version 1

Citer

Ian Davidson, Antoine Gourru, S S Ravi. The Cluster Description Problem -Complexity Results, Formulations and Approximations. Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada. ⟨hal-02060574⟩
55 Consultations
88 Téléchargements

Partager

Gmail Facebook X LinkedIn More