Proximal Methods for Hierarchical Sparse Coding

Sparse coding consists in representing signals as sparse linear combinations of atoms selected from a dictionary. We consider an extension of this framework where the atoms are further assumed to be embedded in a tree. This is achieved using a recently introduced tree-structured sparse regularization norm, which has proven useful in several applications. This norm leads to regularized problems that are difficult to optimize, and we propose in this paper efficient algorithms for solving them. More precisely, we show that the proximal operator associated with this norm is computable exactly via a dual approach that can be viewed as the composition of elementary proximal operators. Our procedure has a complexity linear, or close to linear, in the number of atoms, and allows the use of accelerated gradient techniques to solve the tree-structured sparse approximation problem at the same computational cost as traditional ones using the L1-norm. Our method is efficient and scales gracefully to millions of variables, which we illustrate in two types of applications: first, we consider fixed hierarchical dictionaries of wavelets to denoise natural images. Then, we apply our optimization tools in the context of dictionary learning, where learned dictionary elements naturally organize in a prespecified arborescent structure, leading to a better performance in reconstruction of natural image patches. When applied to text documents, our method learns hierarchies of topics, thus providing a competitive alternative to probabilistic topic models.

Mots clés

sparse coding dictionary learning structured sparsity Convex optimization matrix factorization proximal methods

Domaines

Machine Learning [stat.ML]

Fichier principal

hal_jenatton11a.pdf (560.02 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Rodolphe Jenatton : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00516723

Soumis le : mardi 5 juillet 2011-10:40:48

Dernière modification le : vendredi 19 avril 2024-16:18:55

Archivage à long terme le : jeudi 6 octobre 2011-02:21:30

Dates et versions

inria-00516723 , version 1 (10-09-2010)

inria-00516723 , version 2 (16-09-2010)

inria-00516723 , version 3 (09-03-2011)

inria-00516723 , version 4 (05-07-2011)

Identifiants

HAL Id : inria-00516723 , version 4
ARXIV : 1009.2139

Citer

Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski, Francis Bach. Proximal Methods for Hierarchical Sparse Coding. Journal of Machine Learning Research, 2011, 12, pp.2297-2334. ⟨inria-00516723v4⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-RENNES1 CNRS INRIA IRISA INRIA2 PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

389 Consultations

684 Téléchargements