On the use of binary stochastic autoencoders for multi-label classification under the zero-one loss

Denis Lecoeuche; Alex Aussem; Maxime Gasse

doi:10.1016/j.procs.2018.10.506

Article Dans Une Revue Procedia Computer Science Année : 2018

On the use of binary stochastic autoencoders for multi-label classification under the zero-one loss

, (1) , (1)

Denis Lecoeuche

Fonction : Auteur

Alex Aussem

Fonction : Auteur
PersonId : 7691
IdHAL : alexandre-aussem
IdRef : 137566514

Data Mining and Machine Learning

Maxime Gasse

Fonction : Auteur
PersonId : 2936
IdHAL : mgasse
ORCID : 0000-0001-6982-062X
IdRef : 203611365

Data Mining and Machine Learning

Résumé

Multi-label classification is a challenging problem when the number of labels is large. One simple strategy that appeared in the recent literature is to embed the labels in a latent binary subspace with autoencoders and then train binary classifiers to predict each latent binary variable independently. Latent predictions are afterwards fed to the decoder to provide the final label estimate. The goal is not only to reduce the classification time, but also to capture implicitly some useful information on the dependency structure of the labels. Despite being pleasingly simple, we show that this technique has some shortcomings and that information on the latent variables dependencies has to be incorporated into the learning process to solve the MLC task efficiently under the zero-one loss. Our contribution is two-fold: i) we propose a "volume-preserving" neural-based binary stochastic autoencoder (BSAE) that guarantees that the maximum a posteriori probability (MAP) solution in the latent space is decoded as the Bayes-optimal solution in the original multi-label space for the zero-one loss, and ii) we apply the method to identify a factorization of the latent variables into a product of conditionally independent terms to facilitate the estimation of the MAP solution. Our experiments on multiple datasets confirm our hypothesis that basic autoencoders do not necessarily disentangle the dependency structure of the label space, and that exploiting latent variables dependencies brings about significant gains in terms of zero-one loss

Domaines

Réseau de neurones [cs.NE]

Alexandre Aussem : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02042711

Soumis le : mercredi 20 février 2019-15:25:59

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-02042711 , version 1 (20-02-2019)

Identifiants

HAL Id : hal-02042711 , version 1
DOI : 10.1016/j.procs.2018.10.506

Citer

Denis Lecoeuche, Alex Aussem, Maxime Gasse. On the use of binary stochastic autoencoders for multi-label classification under the zero-one loss. Procedia Computer Science, 2018, 144, pp.71-80. ⟨10.1016/j.procs.2018.10.506⟩. ⟨hal-02042711⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INSA-GROUPE UDL

116 Consultations

0 Téléchargements

On the use of binary stochastic autoencoders for multi-label classification under the zero-one loss

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager