Probabilistic Graphical Model Structure Learning: Application to Multi-Label Classification

Maxime Gasse 1
1 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In this thesis, we address the specific problem of probabilistic graphical model structure learning, that is, finding the most efficient structure to represent a probability distribution, given only a sample set D ∼ p(v). In the first part, we review the main families of probabilistic graphical models from the literature, from the most common (directed, undirected) to the most advanced ones (chained, mixed etc.). Then we study particularly the problem of learning the structure of directed graphs (Bayesian networks), and we propose a new hybrid structure learning method, H2PC (Hybrid Hybrid Parents and Children), which combines a constraint-based approach (statistical independence tests) with a score-based approach (posterior probability of the structure). In the second part, we address the multi-label classification problem, which aims at assigning a set of categories (binary vector y) to a given object (vector x). In this context, probabilistic graphical models provide convenient means of encoding p(y|x), particularly for the purpose of minimizing general loss functions. We review the main approaches based on PGMs for multi-label classification (Probabilistic Classifier Chain, Conditional Dependency Network, Bayesian Network Classifier, Conditional Random Field, Sum-Product Network), and propose a generic approach inspired from constraint-based structure learning methods to identify the unique partition of the label set into irreducible label factors (ILFs), that is, the irreducible factorization of p(y|x) into disjoint marginal distributions. We establish several theoretical results to characterize the ILFs based on the compositional graphoid axioms, and obtain three generic procedures under various assumptions about the conditional independence properties of the joint distribution p(x, y). Our conclusions are supported by carefully designed multi-label classification experiments, under the F-loss and the zero-one loss functions.
Complete list of metadatas

https://hal.archives-ouvertes.fr/tel-01442613
Contributor : Maxime Gasse <>
Submitted on : Friday, January 20, 2017 - 5:28:55 PM
Last modification on : Wednesday, October 31, 2018 - 12:24:25 PM
Long-term archiving on: Friday, April 21, 2017 - 4:42:28 PM

Identifiers

  • HAL Id : tel-01442613, version 1

Citation

Maxime Gasse. Probabilistic Graphical Model Structure Learning: Application to Multi-Label Classification. Artificial Intelligence [cs.AI]. Université Lyon 1 - Claude Bernard, 2017. English. ⟨NNT : 2017LYSE1003⟩. ⟨tel-01442613v1⟩

Share

Metrics

Record views

121

Files downloads

294