| HAL : hal-00177059, version 2 |
| arXiv : 0710.1203 |
| Fiche détaillée | Récupérer au format |
|
|
| Nature inspired cooperative strategies for optimization (NICSO 2007), Krasnogor, N; Nicosia, G; Pavone, M; Pelta, D (Ed.) (2008) 431-442 |
|
|
| Versions disponibles : | v1 (05-10-2007) | v2 (06-10-2007) |
|
|
|
|
| Semantic distillation: a method for clustering objects by their contextual specificity |
|
|
| Thomas Sierocinski 1Antony Le Béchec 2 |
|
|
| (2008) |
|
|
| Techniques for data-mining, latent semantic analysis, contextual search of databases, etc.\ have long ago been developed by computer scientists working on information retrieval (IR). Experimental scientists, from all disciplines, having to analyse large collections of raw experimental data (astronomical, physical, biological, etc.) have developed powerful methods for their statistical analysis and for clustering, categorising, and classifying objects. Finally, physicists have developed a theory of quantum measurement, unifying the logical, algebraic, and probabilistic aspects of queries into a single formalism. The purpose of this paper is twofold: first to show that when formulated at an abstract level, problems from IR, from statistical data analysis, and from physical measurement theories are very similar and hence can profitably be cross-fertilised, and, secondly, to propose a novel method of fuzzy hierarchical clustering, termed \textit{semantic distillation} --- strongly inspired from the theory of quantum measurement ---, we developed to analyse raw data coming from various types of experiments on DNA arrays. We illustrate the method by analysing DNA arrays experiments and clustering the genes of the array according to their specificity. |
|
|
|
|
|
|
|
|
|
|
| 1 : | Institut de Recherche Mathématique de Rennes (IRMAR) |
| CNRS : UMR6625 – Université de Rennes 1 – École normale supérieure de Cachan - ENS Cachan – Institut National des Sciences Appliquées (INSA) : - RENNES – Université de Rennes II - Haute Bretagne | |
| 2 : | Détoxication et réparation tissulaire |
| INSERM : U620 – Université de Rennes 1 – IFR140 | |
|
|
|
|
|
|
|
|
| Domaine | : | Mathématiques/Probabilités Statistiques/Machine Learning Mathématiques/Statistiques Statistiques/Théorie Informatique/Base de données Sciences du Vivant/Bio-Informatique, Biologie Systémique Informatique/Bio-informatique |
|
|
| Quantum information retrieval – semantic distillation – DNA microarray – quantum and fuzzy logic |
|
|
| Liste des fichiers attachés à ce document : | ||||||||||
|
|
|
| hal-00177059, version 2 | |
| http://hal.archives-ouvertes.fr/hal-00177059 | |
| oai:hal.archives-ouvertes.fr:hal-00177059 | |
| Contributeur : Dimitri Petritis | |
| Soumis le : Samedi 6 Octobre 2007, 11:35:38 | |
| Dernière modification le : Vendredi 19 Février 2010, 16:01:29 | |