Debiasing Stochastic Gradient Descent to handle missing values

Conference paper · Year: 2020

Abstract

The stochastic gradient algorithm is a key ingredient of many machine learning methods and is particularly well suited to large-scale learning. However, a major caveat of large data is its incompleteness. We propose an averaged stochastic gradient algorithm that handles missing values in linear models. This approach has the merit of requiring no modeling of the data distribution and of accounting for heterogeneous missing proportions. In both streaming and finite-sample settings, we prove that this algorithm achieves a convergence rate of $\mathcal{O}(\frac{1}{n})$ at iteration $n$, the same as without missing values. We show the convergence behavior and the relevance of the algorithm not only on synthetic data but also on real data sets, including data collected from a medical registry.
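To make the idea concrete, below is a minimal NumPy sketch of an averaged SGD iteration for least squares with a debiased gradient, assuming covariates are missing completely at random with known per-feature observation probabilities `p` (heterogeneous), inputs coded as NaN, and zero imputation. The function name, step-size choice, and variable names are illustrative assumptions, not the authors' reference implementation.

```python
import numpy as np

def debiased_avg_sgd(X_nan, y, p, n_passes=1, step=0.01):
    """Averaged SGD for least squares with NaN-coded missing covariates.

    Sketch only: assumes coordinate-wise MCAR missingness with known
    observation probabilities p[j], so that the per-sample gradient
    below is an unbiased estimate of the complete-data gradient
    (x x^T) beta - y x.
    """
    n, d = X_nan.shape
    X = np.nan_to_num(X_nan)      # zero-impute missing entries
    inv_p = 1.0 / p               # P^{-1} as a vector
    beta = np.zeros(d)
    beta_bar = np.zeros(d)        # Polyak-Ruppert average of the iterates
    t = 0
    for _ in range(n_passes):
        for i in np.random.permutation(n):
            x = X[i]
            # Debiased gradient: the first term rescales observed entries
            # by 1/p; the second removes the bias this creates on the
            # diagonal of the estimated second-moment matrix.
            g = inv_p * x * (x @ (inv_p * beta) - y[i]) \
                - (1.0 - p) * inv_p**2 * x**2 * beta
            beta -= step * g
            t += 1
            beta_bar += (beta - beta_bar) / t   # running average
    return beta_bar
```

Under the assumptions above, the expectation of `g` over the missingness mask equals the complete-data gradient, and returning the averaged iterate `beta_bar` (rather than the last one) is what yields the $\mathcal{O}(\frac{1}{n})$ rate stated in the abstract.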

Dates and versions

hal-02483651 , version 1 (21-02-2020)
hal-02483651 , version 2 (04-06-2020)

Identifiers

Cite

Aude Sportisse, Claire Boyer, Aymeric Dieuleveut, Julie Josse. Debiasing Stochastic Gradient Descent to handle missing values. NeurIPS 2020 - 34th Conference on Neural Information Processing Systems, Dec 2020, Vancouver, Canada. ⟨hal-02483651v2⟩
