Representation Learning of Compositional Data

Abstract : We consider the problem of learning a low dimensional representation for compositional data. Compositional data consists of a collection of nonnegative data that sum to a constant value. Since the parts of the collection are statistically dependent, many standard tools cannot be directly applied. Instead, compositional data must be first transformed before analysis. Focusing on principal component analysis (PCA), we propose an approach that allows low dimensional representation learning directly from the original data. Our approach combines the benefits of the log-ratio transformation from compositional data analysis and exponential family PCA. A key tool in its derivation is a generalization of the scaled Bregman theorem, that relates the perspective transform of a Bregman divergence to the Bregman divergence of a perspective transform and a remainder conformal divergence. Our proposed approach includes a convenient surrogate (upper bound) loss of the exponential family PCA which has an easy to optimize form. We also derive the corresponding form for nonlinear autoencoders. Experiments on simulated data and microbiome data show the promise of our method.
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01945508
Contributor : Marta Avalos <>
Submitted on : Wednesday, December 5, 2018 - 12:37:39 PM
Last modification on : Thursday, February 7, 2019 - 4:43:36 PM
Document(s) archivé(s) le : Wednesday, March 6, 2019 - 1:54:02 PM

File

7902-representation-learning-o...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01945508, version 1

Collections

Citation

Marta Avalos Fernandez, Richard Nock, Cheng Ong, Julien Rouar, Ke Sun. Representation Learning of Compositional Data. S. Bengio; H. Wallach; H. Larochelle; K. Grauman; N. Cesa-Bianchi; R. Garnett. NIPS 2018 - Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada. 31, 2018, Advances in Neural Information Processing Systems 31 (NIPS 2018) pre-proceedings. 〈https://nips.cc/Conferences/2018/Schedule?showEvent=11645〉. 〈hal-01945508〉

Share

Metrics

Record views

77

Files downloads

71