Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

Geneviève Robin; Hoi-To Wai; Julie Josse; Olga Klopp; Éric Moulines

Communication Dans Un Congrès Année : 2018

Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

(1) , (2) , (3) , (4) , (3)

1
2
3
4

Geneviève Robin

Fonction : Auteur
PersonId : 15158
IdHAL : genevieve-robin
ORCID : 0000-0002-6264-0842

Modélisation en pharmacologie de population

Hoi-To Wai

Fonction : Auteur

Arizona State University [Tempe]

Julie Josse

Fonction : Auteur
PersonId : 993919

Centre de Mathématiques Appliquées - Ecole Polytechnique

Olga Klopp

Fonction : Auteur

Modélisation aléatoire de Paris X

Éric Moulines

Fonction : Auteur
PersonId : 1350242
ORCID : 0000-0002-2058-0693
IdRef : 076452476

Centre de Mathématiques Appliquées - Ecole Polytechnique

Résumé

Many applications of machine learning involve the analysis of large data frames-matrices collecting heterogeneous measurements (binary, numerical, counts, etc.) across samples-with missing values. Low-rank models, as studied by Udell et al. [30], are popular in this framework for tasks such as visualization, clustering and missing value imputation. Yet, available methods with statistical guarantees and efficient optimization do not allow explicit modeling of main additive effects such as row and column, or covariate effects. In this paper, we introduce a low-rank interaction and sparse additive effects (LORIS) model which combines matrix regression on a dictionary and low-rank design, to estimate main effects and interactions simultaneously. We provide statistical guarantees in the form of upper bounds on the estimation error of both components. Then, we introduce a mixed coordinate gradient descent (MCGD) method which provably converges sub-linearly to an optimal solution and is computationally efficient for large scale data sets. We show on simulated and survey data that the method has a clear advantage over current practices, which consist in dealing separately with additive effects in a preprocessing step.

Domaines

Autres [stat.ML]

Fichier principal

LORIS_2018_supplementary.pdf (539.31 Ko)

imput_loris_softImpute_cat_variables.pdf (4.68 Ko)

imput_loris_softImpute_quant_variables.pdf (4.59 Ko)

imputation.pdf (4.44 Ko)

imputation2.pdf (4.55 Ko)

impute_loris_softImpute_all_var.pdf (4.5 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Geneviève Robin : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01959188

Soumis le : vendredi 5 avril 2019-16:59:50

Dernière modification le : mardi 2 avril 2024-15:40:02

Dates et versions

hal-01959188 , version 1 (19-12-2018)

hal-01959188 , version 2 (05-04-2019)

Identifiants

HAL Id : hal-01959188 , version 2
ARXIV : 1812.08398

Citer

Geneviève Robin, Hoi-To Wai, Julie Josse, Olga Klopp, Éric Moulines. Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Dec 2018, Montréal, Canada. ⟨hal-01959188v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X CNRS INRIA X-CMAP X-DEP-MATHA CMAP INRIA2 UNIV-PARIS-SACLAY MODALX UNIV-PARIS-LUMIERES UNIV-PARIS-NANTERRE GS-COMPUTER-SCIENCE

135 Consultations

66 Téléchargements

Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager