Mining gene expression data with pattern structures in formal concept analysis

Abstract : This paper addresses the important problem of efficiently mining numerical data with formal concept analysis (FCA). Classically, the only way to apply FCA is to binarize the data, thanks to a so-called scaling procedure. This may either involve loss of information, or produce large and dense binary data known as hard to process. In the context of gene expression data analysis, we propose and compare two FCA-based methods for mining numerical data and we show that they are equivalent. The first one relies on a particular scaling, encoding all possible intervals of attribute values, and uses standard FCA techniques. The second one relies on pattern structures without a priori transformation, and is shown to be more computationally efficient and to provide more readable results. Experiments with real-world gene expression data are discussed and give a practical basis for the comparison and evaluation of the methods.
Document type :
Journal articles
Complete list of metadatas

Cited literature [39 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00541100
Contributor : Mehdi Kaytoue <>
Submitted on : Wednesday, November 23, 2011 - 7:39:25 PM
Last modification on : Wednesday, October 16, 2019 - 1:16:34 AM
Long-term archiving on : Friday, November 16, 2012 - 11:50:55 AM

Files

is_kknd09.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00541100, version 1

Collections

Citation

Mehdi Kaytoue, Sergei O. Kuznetsov, Amedeo Napoli, Sébastien Duplessis. Mining gene expression data with pattern structures in formal concept analysis. Information Sciences, Elsevier, 2011, 181 (10), pp.1989-2001. ⟨hal-00541100⟩

Share

Metrics

Record views

604

Files downloads

795