Efficient Model Selection for Regularized Classification by Exploiting Unlabeled Data

Georgios Balikas; Ioannis Partalas; Eric Gaussier; Rohit Babbar; Massih-Reza Amini

doi:10.1007/978-3-319-24465-5_3

Communication Dans Un Congrès Année : 2015

Efficient Model Selection for Regularized Classification by Exploiting Unlabeled Data

(1) , (2) , (1) , (3) , (1)

1
2
3

Georgios Balikas

Fonction : Auteur

Analyse de données, Modélisation et Apprentissage automatique [Grenoble]

Ioannis Partalas

Fonction : Auteur

VISEO

Eric Gaussier

Fonction : Auteur
PersonId : 182833
IdHAL : eric-gaussier
ORCID : 0000-0002-8858-3233
IdRef : 074308297

Analyse de données, Modélisation et Apprentissage automatique [Grenoble]

Rohit Babbar

Fonction : Auteur

Max Planck Institute for Intelligent Systems [Tübingen]

Massih-Reza Amini

Fonction : Auteur
PersonId : 747054
IdHAL : massih-reza-amini
ORCID : 0000-0001-9032-4233
IdRef : 132277042

Analyse de données, Modélisation et Apprentissage automatique [Grenoble]

Résumé

Hyper-parameter tuning is a resource-intensive task when optimizing classification models. The commonly used k-fold cross validation can become intractable in large scale settings when a classifier has to learn billions of parameters. At the same time, in real-world, one often encounters multi-class classification scenarios with only a few labeled examples; model selection approaches often offer little improvement in such cases and the default values of learners are used. We propose bounds for classification on accuracy and macro measures (precision, recall, F1) that motivate efficient schemes for model selection and can benefit from the existence of unlabeled data. We demonstrate the advantages of those schemes by comparing them with k-fold cross validation and hold-out estimation in the setting of large scale classification.

Domaines

Informatique [cs] Intelligence artificielle [cs.AI] Recherche d'information [cs.IR]

Fichier principal

Quantification_IDA15.pdf (510.35 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Georgios Balikas : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01237129

Soumis le : mercredi 2 décembre 2015-17:11:56

Dernière modification le : jeudi 4 avril 2024-21:21:58

Archivage à long terme le : samedi 29 avril 2017-05:11:58

Dates et versions

hal-01237129 , version 1 (02-12-2015)

Identifiants

HAL Id : hal-01237129 , version 1
DOI : 10.1007/978-3-319-24465-5_3

Citer

Georgios Balikas, Ioannis Partalas, Eric Gaussier, Rohit Babbar, Massih-Reza Amini. Efficient Model Selection for Regularized Classification by Exploiting Unlabeled Data. 14th International Symposium on Intelligent Data Analysis, IDA, Oct 2015, Saint-Etienne, France. ⟨10.1007/978-3-319-24465-5_3⟩. ⟨hal-01237129⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_SIDCH LIG_SIDCH_APTIKAL

263 Consultations

171 Téléchargements

Efficient Model Selection for Regularized Classification by Exploiting Unlabeled Data

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager