Online Multi-task Learning with Hard Constraints

Gabor Lugosi; Omiros Papaspiliopoulos; Gilles Stoltz

Preprint, Working Paper. Year: 2009

Gabor Lugosi

Role: Author
PersonId: 832564

Institució Catalana de Recerca i Estudis Avançats = Catalan Institution for Research and Advanced Studies

Omiros Papaspiliopoulos

Role: Author
PersonId: 858212

Institució Catalana de Recerca i Estudis Avançats = Catalan Institution for Research and Advanced Studies

Gilles Stoltz

Role: Author
PersonId: 738739
IdHAL: gilles-stoltz
ORCID: 0000-0003-1240-1007
IdRef: 091575419

Département de Mathématiques et Applications - ENS Paris

Groupement de Recherche et d'Etudes en Gestion à HEC

Abstract

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by requiring that the M-tuple of actions taken by the decision maker satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an online shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.

Domains

Machine Learning [stat.ML]; Theory [stat.TH]; Statistics [math.ST]; Other [stat.ML]; Learning [cs.LG]

Main file

LugPapSto-MultiTask.pdf (181.11 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-00362643

Submitted on: Wednesday, 18 February 2009, 20:28:54

Last modified on: Friday, 19 April 2024, 16:18:54

Long-term archiving on: Tuesday, 8 June 2010, 22:41:00

Dates and versions

hal-00362643 , version 1 (18-02-2009)

hal-00362643 , version 2 (20-03-2009)

Identifiers

HAL Id : hal-00362643 , version 1
ARXIV : 0902.3526

Cite

Gabor Lugosi, Omiros Papaspiliopoulos, Gilles Stoltz. Online Multi-task Learning with Hard Constraints. 2009. ⟨hal-00362643v1⟩
