Probabilistic Graphical Model Structure Learning : Application to Multi-Label Classification

Maxime Gasse 1, 2
Abstract : In this thesis, we address the specific problem of probabilistic graphical model structure learning, that is, finding the most efficient structure to represent a probability distribution, given only a sample set D ∼ p(v). In the first part, we review the main families of probabilistic graphical models from the literature, from the most common (directed, undirected) to the most advanced ones (chained, mixed etc.). Then we study particularly the problem of learning the structure of directed graphs (Bayesian networks), and we propose a new hybrid structure learning method, H2PC (Hybrid Hybrid Parents and Children), which combines a constraint-based approach (statistical independence tests) with a score-based approach (posterior probability of the structure). In the second part, we address the multi-label classification problem, which aims at assigning a set of categories (binary vector y P (0, 1)m) to a given object (vector x P Rd). In this context, probabilistic graphical models provide convenient means of encoding p(y|x), particularly for the purpose of minimizing general loss functions. We review the main approaches based on PGMs for multi-label classification (Probabilistic Classifier Chain, Conditional Dependency Network, Bayesian Network Classifier, Conditional Random Field, Sum-Product Network), and propose a generic approach inspired from constraint-based structure learning methods to identify the unique partition of the label set into irreducible label factors (ILFs), that is, the irreducible factorization of p(y|x) into disjoint marginal distributions. We establish several theoretical results to characterize the ILFs based on the compositional graphoid axioms, and obtain three generic procedures under various assumptions about the conditional independence properties of the joint distribution p(x, y). Our conclusions are supported by carefully designed multi-label classification experiments, under the F-loss and the zero-one loss functions
Document type :
Theses
Complete list of metadatas

Cited literature [231 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/tel-01442613
Contributor : Abes Star <>
Submitted on : Monday, August 28, 2017 - 4:48:51 PM
Last modification on : Thursday, November 21, 2019 - 2:06:42 AM

File

TH2017GASSEMAXIME.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01442613, version 3

Citation

Maxime Gasse. Probabilistic Graphical Model Structure Learning : Application to Multi-Label Classification. Other [cs.OH]. Université de Lyon, 2017. English. ⟨NNT : 2017LYSE1003⟩. ⟨tel-01442613v3⟩

Share

Metrics

Record views

618

Files downloads

1446