Conference paper. Year: 2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Abstract

Optimal transport distances have found many applications in machine learning for their capacity to compare non-parametric probability distributions. Yet their algorithmic complexity generally prevents their direct use on large-scale datasets. Among the possible strategies to alleviate this issue, practitioners can rely on computing estimates of these distances over subsets of data, i.e. minibatches. While this strategy is computationally appealing, we highlight in this paper some of its limits, arguing that it can lead to undesirable smoothing effects. As an alternative, we suggest that the same minibatch strategy coupled with unbalanced optimal transport can yield more robust behavior. We discuss the associated theoretical properties, such as unbiased estimators, existence of gradients and concentration bounds. Our experimental study shows that in challenging problems associated with domain adaptation, the use of unbalanced optimal transport leads to significantly better results, competing with or surpassing recent baselines.
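
The minibatch strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes entropic regularization with KL-relaxed marginals solved by standard scaling iterations, a squared Euclidean ground cost, and uniform weights on each minibatch. The function names (`unbalanced_sinkhorn`, `minibatch_uot_cost`) and all default parameters are hypothetical, and the value returned is only the average transport cost term, not the full unbalanced objective with its marginal penalties.

```python
import numpy as np

def unbalanced_sinkhorn(a, b, M, reg=1.0, reg_m=1.0, n_iter=200):
    """Entropic unbalanced OT plan with KL-relaxed marginals.

    a, b  : nonnegative weight vectors (need not be matched exactly)
    M     : ground cost matrix of shape (len(a), len(b))
    reg   : entropic regularization strength (assumed value)
    reg_m : marginal relaxation strength (assumed value)
    """
    K = np.exp(-M / reg)              # Gibbs kernel
    u = np.ones_like(a)
    v = np.ones_like(b)
    power = reg_m / (reg_m + reg)     # exponent < 1 softens the marginal constraints
    for _ in range(n_iter):
        u = (a / (K @ v)) ** power
        v = (b / (K.T @ u)) ** power
    return u[:, None] * K * v[None, :]

def minibatch_uot_cost(X, Y, batch_size=16, n_batches=20, seed=0, **kw):
    """Average the transport cost <P, M> over random pairs of minibatches."""
    rng = np.random.default_rng(seed)
    costs = []
    for _ in range(n_batches):
        # Draw one minibatch from each dataset, without replacement.
        xb = X[rng.choice(len(X), batch_size, replace=False)]
        yb = Y[rng.choice(len(Y), batch_size, replace=False)]
        # Squared Euclidean ground cost between the two minibatches.
        M = ((xb[:, None, :] - yb[None, :, :]) ** 2).sum(-1)
        a = np.full(batch_size, 1.0 / batch_size)
        b = np.full(batch_size, 1.0 / batch_size)
        P = unbalanced_sinkhorn(a, b, M, **kw)
        costs.append((P * M).sum())
    return float(np.mean(costs))
```

Because the marginals are only penalized rather than enforced, the plan computed on each minibatch is free to transport less mass between poorly matched samples, which is the intuition behind the more robust behavior the abstract refers to.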

Dates and versions

hal-03264020, version 1 (17-06-2021)

Cite

Kilian Fatras, Thibault Séjourné, Nicolas Courty, Rémi Flamary. Unbalanced minibatch Optimal Transport; applications to Domain Adaptation. International Conference on Machine Learning, Jul 2021, online, France. ⟨hal-03264020⟩