Mining association rule bases from integrated genomic data and annotations - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Mining association rule bases from integrated genomic data and annotations

Résumé

During the last decade, several clustering and association rule mining techniques have been applied to identify groups of co-regulated genes in gene expression data. Nowadays, integrating biological knowledge and gene expression data into a single framework has become a major challenge to improve the relevance of mined patterns and simplify their interpretation by the biologists. The GenMiner approach was developed for mining association rules showing gene groups that are both co-expressed (sharing similar expression profiles) and co-annotated (sharing the same annotations such as function, regulatory mechanism, etc.) from such integrated datasets. It combines a new nomalized discretization method, called NorDi, and the JClose algorithm to extract minimal non-redundant association rules only. Compared with classical Apriori based approaches, GenMiner improves the extraction applicability for these datasets and reduces the number of association rules by suppressing redundant rules that are uninformative and useless. We present a new Java implementation of GenMiner and experimental results obtained from microarray datasets with integrated biological knowledge (bio-ontologies, descriptions of regulation pathways and literature). These results show that GenMiner requires less memory than Apriori based approaches and that it improves the relevance of extracted rules. Moreover, association rules obtained revealed significant co-annotated and co-expressed gene patterns showing important biological relationships supported by recent biological literature.
Fichier principal
Vignette du fichier
Martinez_Pasquier_Pasquier_-_2008_-_Mining_association_rule_bases_from_integrate.pdf (89.84 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00361729 , version 1 (25-04-2010)

Identifiants

  • HAL Id : hal-00361729 , version 1

Citer

Ricardo Martinez, Nicolas Pasquier, Claude R. Pasquier. Mining association rule bases from integrated genomic data and annotations. CIBB international conference on Computational Intelligence methods for Bioinformatics and Biostatistics, Oct 2008, Salerno, Italy. pp.33-43. ⟨hal-00361729⟩
87 Consultations
207 Téléchargements

Partager

Gmail Facebook X LinkedIn More