Non-asymptotic oracle inequalities for the Lasso in high-dimensional mixture of experts - HAL open archive
Preprint / working paper, Year: 2020


Abstract

Mixture of experts (MoE) models provide a well-principled finite mixture construction for prediction, allowing the gating network (mixture weights) to learn from the predictors (explanatory variables) together with the experts' network (mixture component densities). We investigate the estimation properties of MoE models in a high-dimensional setting, where the number of predictors is much larger than the sample size, and for which the literature lacks computational and, especially, theoretical results. We consider the class of finite MoE models with softmax gating functions and Gaussian regression experts, and focus on the theoretical properties of their l1-regularized estimation via the Lasso. We provide a lower bound on the regularization parameter of the Lasso penalty that ensures an l1-oracle inequality is satisfied by the Lasso estimator with respect to the Kullback--Leibler loss. We further state an l1-ball oracle inequality for the l1-penalized maximum likelihood estimator obtained by model selection.
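To fix ideas, the objective studied in the abstract can be sketched numerically: the penalized negative log-likelihood of a softmax-gated MoE with Gaussian regression experts, plus an l1 (Lasso) penalty on the gating and expert coefficients. This is a minimal illustrative sketch, not the paper's estimator; all function and variable names are hypothetical.

```python
import numpy as np

def moe_penalized_nll(X, y, W, B, sigma, lam):
    """Sketch of the l1-penalized negative log-likelihood of a
    softmax-gated MoE with Gaussian regression experts.

    X : (n, p) predictors;  y : (n,) responses
    W : (K, p) gating coefficients;  B : (K, p) expert regression coefficients
    sigma : (K,) expert standard deviations;  lam : Lasso tuning parameter
    (All names are hypothetical; this is not the paper's implementation.)
    """
    # Softmax gating weights pi_k(x), stabilized against overflow.
    G = X @ W.T
    G -= G.max(axis=1, keepdims=True)
    pi = np.exp(G) / np.exp(G).sum(axis=1, keepdims=True)   # (n, K)

    # Gaussian expert densities N(y; x^T beta_k, sigma_k^2).
    mu = X @ B.T                                            # (n, K)
    dens = np.exp(-0.5 * ((y[:, None] - mu) / sigma) ** 2) / (
        np.sqrt(2 * np.pi) * sigma)

    # Average negative log-likelihood of the mixture.
    nll = -np.log((pi * dens).sum(axis=1)).mean()

    # l1 (Lasso) penalty on gating and expert coefficients.
    penalty = lam * (np.abs(W).sum() + np.abs(B).sum())
    return nll + penalty
```

The paper's oracle inequalities concern the minimizer of this kind of objective; the lower bound on `lam` is what guarantees the Kullback--Leibler risk bound.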

Dates and versions

hal-02957892 , version 1 (05-10-2020)

Identifiers

Cite

TrungTin Nguyen, Hien D. Nguyen, Faicel Chamroukhi, Geoffrey J. McLachlan. Non-asymptotic oracle inequalities for the Lasso in high-dimensional mixture of experts. 2020. ⟨hal-02957892⟩