Robust Multi-Output Learning with Highly Incomplete Data via Restricted Boltzmann Machines

Giancarlo Fissore; Aurelien Decelle; Cyril Furtlehner; Yufei Han

Conference paper, Year: 2020

Giancarlo Fissore

Role: Author

Laboratoire de Recherche en Informatique

TAckling the Underspecified

Aurelien Decelle

Role: Author

Laboratoire de Recherche en Informatique

TAckling the Underspecified

Cyril Furtlehner

Role: Author

Laboratoire de Recherche en Informatique

TAckling the Underspecified

Yufei Han

Role: Author

Abstract

In a standard multi-output classification scenario, both the features and the labels of the training data are partially observed. This problem arises frequently in industrial data analytics services because of sensor or database failures, crowd-sourcing, and noisy communication channels. Classic methods for multi-output classification with incomplete supervision usually decompose the problem into an imputation stage, which reconstructs the missing training information, and a learning stage, which builds a classifier on the imputed training set. These methods fail to fully leverage the dependencies between features and labels. To take full advantage of these dependencies, we consider a purely probabilistic setting in which feature imputation and multi-label classification are solved jointly. We show that a simple Restricted Boltzmann Machine can be trained with an adapted algorithm based on mean-field equations to efficiently solve inductive and transductive learning problems in which both features and labels are missing at random. The effectiveness of the approach is demonstrated empirically on various datasets, with particular focus on a real-world Internet-of-Things security dataset.
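
As a rough illustration of the idea of joint imputation with mean-field equations, the sketch below fills in the missing entries of a single binary visible vector (features and labels concatenated) for an RBM whose parameters are already given. It iterates the naive mean-field fixed-point equations m_h = sigmoid(b + W^T m_v) and m_v = sigmoid(a + W m_h) while keeping the observed units clamped. This is not the authors' training algorithm, which the abstract does not detail; the function name `mean_field_impute`, the parameter names `W`, `a`, `b`, the 0.5 initialization, and the fixed iteration count are all assumptions made for this example.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, applied elementwise."""
    return 1.0 / (1.0 + np.exp(-x))


def mean_field_impute(v, observed, W, a, b, n_iter=50):
    """Estimate missing visible units of a binary RBM by naive mean-field iteration.

    Hypothetical helper for illustration only (not taken from the paper).

    v        : (n_visible,) array of 0/1 values; entries where `observed`
               is False are ignored
    observed : (n_visible,) boolean mask, True where the value is known
    W        : (n_visible, n_hidden) weight matrix
    a, b     : visible and hidden bias vectors
    """
    # Start every missing unit at the uninformative value 0.5.
    m_v = np.where(observed, v, 0.5)
    for _ in range(n_iter):
        m_h = sigmoid(b + m_v @ W)          # hidden mean activations
        m_new = sigmoid(a + W @ m_h)        # visible mean activations
        m_v = np.where(observed, v, m_new)  # keep observed units clamped
    return m_v


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_visible, n_hidden = 5, 3              # e.g. 3 features + 2 labels
    W = rng.normal(size=(n_visible, n_hidden))
    a = np.zeros(n_visible)
    b = np.zeros(n_hidden)

    v = np.array([1.0, 0.0, 1.0, 0.0, 0.0])
    observed = np.array([True, True, True, False, False])  # labels missing
    print(mean_field_impute(v, observed, W, a, b))
```

The same routine treats a missing feature and a missing label identically, which is the sense in which imputation and classification are handled by one model: thresholding the returned values at the label positions gives predicted labels, while the values at feature positions serve as imputations.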

Domains

Machine Learning [cs.LG]; Disordered Systems and Neural Networks [cond-mat.dis-nn]

Contributor: Aurélien Decelle

https://hal.science/hal-02420824

Submitted on: Friday, December 20, 2019, 10:23:16

Last modified on: Monday, February 12, 2024, 09:44:03

Dates and versions

hal-02420824, version 1 (20-12-2019)

Identifiers

HAL Id: hal-02420824, version 1
arXiv: 1912.09382

Cite

Giancarlo Fissore, Aurelien Decelle, Cyril Furtlehner, Yufei Han. Robust Multi-Output Learning with Highly Incomplete Data via Restricted Boltzmann Machines. European Starting AI Researchers' Symposium 2020, Aug 2020, Santiago de Compostela, Spain. ⟨hal-02420824⟩

Collections

CNRS INRIA UMR8623 CENTRALESUPELEC INRIA2 LRI-AO UNIV-PARIS-SACLAY LISN GS-ENGINEERING GS-COMPUTER-SCIENCE GS-LIFE-SCIENCES-HEALTH LISN-AO
