835 articles – 1517 references  [version française]
HAL: hal-00568028, version 1

Short view  Export this paper
Sub-quadratic Markov tree mixture learning based on randomizations of the Chow-Liu algorithm
Ammar S., Leray P., Schnitzler F., Wehenkel L.
Dans Proceedings of the Fifth European Workshop on Probabilistic Graphical Models - PGM 2010, Helsinki : Finlande (2010) - http://hal.archives-ouvertes.fr/hal-00568028
Conference proceedings
Computer Science/Artificial Intelligence
Computer Science/Learning
Sub-quadratic Markov tree mixture learning based on randomizations of the Chow-Liu algorithm
Sourour Ammar () 1, Philippe Leray () 1, François Schnitzler () 2, Louis Wehenkel () 2
1:  Laboratoire d'Informatique de Nantes Atlantique (LINA)
http://www.sciences.univ-nantes.fr/lina
CNRS : UMR6241 – Université de Nantes – École Nationale Supérieure des Mines - Nantes
LINA - Faculté des Sciences 2 rue de la Houssinière - BP 92208 44322 NANTES CEDEX 3
France
2:  Department of Electrical Engineering and Computer Science (Institut Montefiore)
http://www.montefiore.ulg.ac.be/
Université de Liège
Sart-Tilman, Bldg. B28, Parking P32 B-4000 Liège
Belgium
The present work analyzes different randomized methods to learn Markov tree mixtures for density estimation in very high-dimensional discrete spaces (very large number n of discrete variables) when the sample size (N ) is very small compared to n. Several sub-quadratic relaxations of the Chow-Liu algorithm are proposed, weakening its search procedure. We first study naîve randomizations and then gradually increase the deterministic behavior of the algorithms by trying to focus on the most interesting edges, either by retaining the best edges between models, or by inferring promising relationships between variables. We compare these methods to totally random tree generation and randomization based on bootstrap-resampling (bagging), of respectively linear and quadratic complexity. Our results show that randomization becomes increasingly more interesting for smaller N/n ratios, and that methods based on simultaneously discovering and exploiting the problem structure are promising in this context.
English
2010

Proceedings of the Fifth European Workshop on Probabilistic Graphical Models
international
2010
17-25

PGM 2010
2010-09-13
2010-09-15
Helsinki
Finland

Attached file list to this document: 
PDF
PGM2010_final_version.pdf(530.2 KB)