Skip to Main content Skip to Navigation
Journal articles

Clustering formal concepts to discover biologically relevant knowledge from gene expression data.

Abstract : The production of high-throughput gene expression data has generated a crucial need for bioinformatics tools to generate biologically interesting hypotheses. Whereas many tools are available for extracting global patterns, less attention has been focused on local pattern discovery. We propose here an original way to discover knowledge from gene expression data by means of the so-called formal concepts which hold in derived Boolean gene expression datasets. We first encoded the over-expression properties of genes in human cells using human SAGE data. It has given rise to a Boolean matrix from which we extracted the complete collection of formal concepts, i.e., all the largest sets of over-expressed genes associated to a largest set of biological situations in which their over-expression is observed. Complete collections of such patterns tend to be huge. Since their interpretation is a time-consuming task, we propose a new method to rapidly visualize clusters of formal concepts. This designates a reasonable number of Quasi-Synexpression-Groups (QSGs) for further analysis. The interest of our approach is illustrated using human SAGE data and interpreting one of the extracted QSGs. The assessment of its biological relevancy leads to the formulation of both previously proposed and new biological hypotheses.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00294025
Contributor : Nathalie Laviron <>
Submitted on : Tuesday, July 8, 2008 - 11:57:00 AM
Last modification on : Thursday, November 21, 2019 - 2:40:32 AM

Identifiers

  • HAL Id : hal-00294025, version 1
  • PUBMED : 18391238

Citation

Sylvain Blachon, Ruggero G Pensa, Jérémy Besson, Céline Robardet, Jean-François Boulicaut, et al.. Clustering formal concepts to discover biologically relevant knowledge from gene expression data.. In Silico Biology, IOS Press, 2007, 7 (4-5), pp.467-83. ⟨hal-00294025⟩

Share

Metrics

Record views

360