Fully Dynamic k -Center Clustering - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Fully Dynamic k -Center Clustering

Résumé

Static and dynamic clustering algorithms are a fundamental tool in any machine learning library. Most of the efforts in developing dynamic machine learning and data mining algorithms have been focusing on the sliding window model (where at any given point in time only the most recent data items are retained) or more simplistic models. However, in many real-world applications one might need to deal with arbitrary deletions and insertions. For example, one might need to remove data items that are not necessarily the oldest ones, because they have been flagged as containing inappropriate content or due to privacy concerns. Clustering trajectory data might also require to deal with more general update operations. We develop a (2 +)-approximation algorithm for the k-center clustering problem with "small" amortized cost under the fully dynamic adversarial model. In such a model, points can be added or removed arbitrarily, provided that the adversary does not have access to the random choices of our algorithm. The amortized cost of our algorithm is poly-logarithmic when the ratio between the maximum and minimum distance between any two points in input is bounded by a polynomial, while k and are constant. Our theoretical results are complemented with an extensive experimental evaluation on dynamic data from Twitter, Flickr, as well as trajectory data, demonstrating the effectiveness of our approach.
Fichier principal
Vignette du fichier
p579-chan.pdf (1.69 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02315471 , version 1 (14-10-2019)

Identifiants

Citer

T-H. Hubert Chan, Arnaud Guerqin, Mauro Sozio. Fully Dynamic k -Center Clustering. 2018 World Wide Web Conference, Apr 2018, Lyon, France. pp.579-587, ⟨10.1145/3178876.3186124⟩. ⟨hal-02315471⟩
172 Consultations
251 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More