835 articles – 1515 references  [version française]
HAL: hal-00568028, version 1

Detailed view  Export this paper
PGM 2010, Helsinki : Finlande (2010)
Sub-quadratic Markov tree mixture learning based on randomizations of the Chow-Liu algorithm
Sourour Ammar 1, Philippe Leray 1, François Schnitzler 2, Louis Wehenkel 2
(2010)

The present work analyzes different randomized methods to learn Markov tree mixtures for density estimation in very high-dimensional discrete spaces (very large number n of discrete variables) when the sample size (N ) is very small compared to n. Several sub-quadratic relaxations of the Chow-Liu algorithm are proposed, weakening its search procedure. We first study naîve randomizations and then gradually increase the deterministic behavior of the algorithms by trying to focus on the most interesting edges, either by retaining the best edges between models, or by inferring promising relationships between variables. We compare these methods to totally random tree generation and randomization based on bootstrap-resampling (bagging), of respectively linear and quadratic complexity. Our results show that randomization becomes increasingly more interesting for smaller N/n ratios, and that methods based on simultaneously discovering and exploiting the problem structure are promising in this context.
1:  Laboratoire d'Informatique de Nantes Atlantique (LINA)
CNRS : UMR6241 – Université de Nantes – École Nationale Supérieure des Mines - Nantes
2:  Department of Electrical Engineering and Computer Science (Institut Montefiore)
Université de Liège
Computer Science/Artificial Intelligence

Computer Science/Learning
Attached file list to this document: 
PDF
PGM2010_final_version.pdf(530.2 KB)