Improved time series clustering based on new geometric frameworks
Abstract
Most existing methods for time series clustering rely on distances computed from the entire raw data, using either the Euclidean distance or the Dynamic Time Warping distance. In this work, we propose embedding the time series into higher-dimensional spaces to obtain geometric representations of the time series themselves. In particular, embeddings in $\mathbb{R}^{n \times p}$, on the Stiefel manifold, and on the unit sphere are analyzed for their performance with several well-known clustering algorithms. The gain brought by the geometric representation is illustrated on a large benchmark of datasets. We show, first, that embedding the time series in higher-dimensional spaces gives better results than classical approaches and, second, that the embedding on the Stiefel manifold, in conjunction with UMAP and the HDBSCAN clustering algorithm, is the recommended framework for time series clustering.
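The abstract does not say how the three embeddings are built, so the following is only a minimal sketch under stated assumptions, not the authors' construction. It assumes a delay (lagged-window) matrix as the $\mathbb{R}^{n \times p}$ representation, Frobenius normalization as the map to the unit sphere, and the orthonormal factor of a thin QR decomposition as the point on the Stiefel manifold. The function names `delay_matrix`, `to_sphere`, and `to_stiefel` are hypothetical. The UMAP and HDBSCAN steps are left out; they would be applied to the resulting representations.

```python
import numpy as np


def delay_matrix(series, p):
    """Assumed R^(n x p) embedding: stack p lagged copies of a 1-D series as columns."""
    x = np.asarray(series, dtype=float)
    n = x.size - p + 1
    if n < p:
        raise ValueError("series too short for the requested number of columns")
    return np.column_stack([x[j:j + n] for j in range(p)])


def to_sphere(matrix):
    """Assumed sphere embedding: rescale the matrix to unit Frobenius norm."""
    norm = np.linalg.norm(matrix)
    if norm == 0:
        raise ValueError("cannot normalize an all-zero matrix")
    return matrix / norm


def to_stiefel(matrix):
    """Assumed Stiefel embedding: orthonormal factor Q of a thin QR decomposition."""
    q, r = np.linalg.qr(matrix)  # default 'reduced' mode: q has shape (n, p)
    # Flip column signs so that diag(R) is non-negative, which makes Q deterministic.
    signs = np.sign(np.diag(r))
    signs[signs == 0] = 1.0
    return q * signs


# Toy series (illustrative data only, not from the paper's benchmark).
rng = np.random.default_rng(0)
series = np.sin(np.linspace(0, 6 * np.pi, 50)) + 0.1 * rng.standard_normal(50)

X = delay_matrix(series, p=3)  # point in R^(48 x 3)
S = to_sphere(X)               # same shape, unit Frobenius norm
Q = to_stiefel(X)              # same shape, orthonormal columns
```

Under these assumptions, `Q.T @ Q` equals the identity up to rounding, which is the defining property of a point on the Stiefel manifold, and `S` has norm one. A distance suited to each space would then feed the clustering step.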
Main file
ArticleJournal_Elsevier_Pattern_recognition__Copy_.pdf (2.03 MB)
Origin: Files produced by the author(s)