Wasserstein Discriminant Analysis

Rémi Flamary 1, 2 Marco Cuturi 3 Nicolas Courty 4 Alain Rakotomamonjy 5
4 OBELIX - Environment observation with complex imagery
UBS - Université de Bretagne Sud, IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
5 DocApp - LITIS - Equipe Apprentissage
LITIS - Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes
Abstract : Wasserstein Discriminant Analysis (WDA) is a new supervised method that can improve classification of high-dimensional data by computing a suitable linear map onto a lower dimensional subspace. Following the blueprint of classical Lin- ear Discriminant Analysis (LDA), WDA selects the projection matrix that maxi- mizes the ratio of two quantities: the dispersion of projected points coming from different classes, divided by the dispersion of projected points coming from the same class. To quantify dispersion, WDA uses regularized Wasserstein distances, rather than cross-variance measures which have been usually considered, notably in LDA. Thanks to the the underlying principles of optimal transport, WDA is able to capture both global (at distribution scale) and local (at samples scale) interac- tions between classes. Regularized Wasserstein distances can be computed using the Sinkhorn matrix scaling algorithm; We show that the optimization of WDA can be tackled using automatic differentiation of Sinkhorn iterations. Numerical experiments show promising results both in terms of prediction and visualization on toy examples and real life datasets such as MNIST and on deep features ob- tained from a subset of the Caltech dataset.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-02112754
Contributor : Alain Rakotomamonjy <>
Submitted on : Friday, April 26, 2019 - 10:25:07 PM
Last modification on : Friday, May 3, 2019 - 9:30:19 AM

Links full text

Identifiers

Citation

Rémi Flamary, Marco Cuturi, Nicolas Courty, Alain Rakotomamonjy. Wasserstein Discriminant Analysis. Machine Learning, Springer Verlag, 2018, 107 (12), pp.1923-1945. ⟨10.1007/s10994-018-5717-1⟩. ⟨hal-02112754⟩

Share

Metrics

Record views

45