Learning with tree tensor networks: complexity estimates and model selection - Archive ouverte HAL
Journal article in Bernoulli, 2022

Learning with tree tensor networks: complexity estimates and model selection

Abstract

In this paper, we propose a model selection method for tree tensor networks in an empirical risk minimization framework and analyze its performance over a wide range of smoothness classes. Tree tensor networks, or tree-based tensor formats, are prominent model classes for the approximation of high-dimensional functions in numerical analysis and data science. They correspond to sum-product neural networks with a sparse connectivity associated with a dimension partition tree T, widths given by a tuple r of tensor ranks, and multilinear activation functions (or units). The approximation power of these model classes has been proved to be optimal (or near-optimal) for classical smoothness classes. However, in an empirical risk minimization framework with a limited number of observations, the dimension tree T and the ranks r should be selected carefully to balance estimation and approximation errors. We propose a complexity-based model selection strategy à la Barron, Birgé and Massart. Given a family of model classes associated with different trees, ranks, tensor-product feature spaces and sparsity patterns for sparse tensor networks, a model is selected by minimizing a penalized empirical risk, with a penalty depending on the complexity of the model class. After deriving bounds on the metric entropy of tree tensor networks with bounded parameters, we deduce a form of the penalty from bounds on suprema of empirical processes. This choice of penalty yields a risk bound for the predictor associated with the selected model. In a least-squares setting, after deriving fast rates of convergence of the risk, we show that the proposed strategy is (nearly) minimax adaptive to a wide range of smoothness classes, including Sobolev and Besov spaces (with isotropic, anisotropic or dominating mixed smoothness) and analytic functions. We discuss the role of sparsity of the tensor network in obtaining optimal performance in several regimes.
In practice, the amplitude of the penalty is calibrated with a slope heuristics method. Numerical experiments in a least-squares regression setting illustrate the performance of the strategy for the approximation of multivariate functions, and of univariate functions identified with tensors through tensorization (quantization).
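The selection principle described above (minimize empirical risk plus a complexity-dependent penalty, with its amplitude calibrated by a slope heuristic) can be illustrated on a toy example. The sketch below is not the paper's method: it replaces tree tensor networks with simple polynomial models, uses the number of parameters as a stand-in for model-class complexity, and estimates the minimal penalty slope from the near-linear decrease of the empirical risk over deliberately over-complex models; the target function and all constants are hypothetical choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy least-squares regression data (hypothetical target function).
n = 200
x = rng.uniform(-1.0, 1.0, size=n)
y = np.cos(3.0 * x) + 0.1 * rng.standard_normal(n)

def empirical_risk(deg):
    """Empirical risk of the least-squares fit in the polynomial model of degree `deg`."""
    coeffs = np.polyfit(x, y, deg)
    resid = y - np.polyval(coeffs, x)
    return float(np.mean(resid ** 2))

degrees = np.arange(0, 15)
risks = np.array([empirical_risk(d) for d in degrees])
complexity = degrees + 1  # parameter count, standing in for model-class complexity

# Slope heuristic (illustrative form): for over-complex models the empirical
# risk decreases roughly linearly in the complexity; estimate that minimal
# slope kappa_min from the large-complexity models ...
large = complexity >= 8
kappa_min = -np.polyfit(complexity[large], risks[large], 1)[0]

# ... then penalize with twice the minimal slope and select the minimizer
# of the penalized empirical risk.
penalty = 2.0 * max(kappa_min, 0.0) * complexity
selected = int(degrees[np.argmin(risks + penalty)])
```

The factor 2 in front of the estimated minimal slope is the usual slope-heuristics prescription; in the paper the penalty is instead derived from metric entropy bounds for tree tensor networks, with only its amplitude calibrated from data.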

Dates and versions

hal-02889655 , version 1 (04-07-2020)

Identifiers

Cite

Bertrand Michel, Anthony Nouy. Learning with tree tensor networks: complexity estimates and model selection. Bernoulli, 2022, 28 (2), pp. 910-936. ⟨10.3150/21-BEJ1371⟩. ⟨hal-02889655⟩
