Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Multi-view cluster aggregation and splitting, with an application to multi-omic breast cancer data

Abstract : Multi-view data, which represent distinct but related groupings of variables, can be useful for identifying relevant and robust clustering structures among observations. A large number of multi-view classification algorithms have been proposed in the fields of computer science and ge-nomics; in this work, we instead focus on the task of merging or splitting an existing hard or fuzzy cluster partition based on multi-view data. This work is specifically motivated by an application involving multi-omic breast cancer data from The Cancer Genome Atlas, where multiple molecular profiles (gene expression, miRNA expression, methylation, and copy number alterations) are used to further subdivide the five currently accepted intrinsic tumor subtypes into clinically distinct subgroups of patients. In addition, we investigate the performance of the proposed multi-view splitting and aggregation algorithms, as compared to single-and concatenated-view alternatives , in a set of simulations. The multi-view splitting and aggregation algorithms developed in this work are implemented in the maskmeans R package.
Complete list of metadatas

Cited literature [39 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01916941
Contributor : Cathy Maugis-Rabusseau <>
Submitted on : Thursday, November 8, 2018 - 9:39:55 PM
Last modification on : Friday, April 10, 2020 - 5:27:06 PM
Document(s) archivé(s) le : Saturday, February 9, 2019 - 2:47:59 PM

File

Multiview_Preprint.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01916941, version 1

Citation

Antoine Godichon-Baggioni, Cathy Maugis-Rabusseau, Andrea Rau. Multi-view cluster aggregation and splitting, with an application to multi-omic breast cancer data. 2018. ⟨hal-01916941⟩

Share

Metrics

Record views

268

Files downloads

227