A general approach to account for dependence in large-scale multiple testing

Chloé Friguet

Journal article. Journal de la Société Française de Statistique. Year: 2012


Role: Author
PersonId: 183883
IdHAL: chloefriguet
ORCID: 0000-0003-2827-0283
IdRef: 148745504

Laboratoire de Mathématiques de Bretagne Atlantique

Abstract

Data generated by high-throughput biotechnologies are characterized by high dimension and heterogeneity. Such data call into question the usual, tried and tested inference approaches. Motivated by issues raised in the analysis of gene expression data, I focus on the impact of dependence on the properties of multiple testing procedures in high dimension. This article presents the main results: after introducing the issues raised by dependence among variables, it examines more closely the impact of dependence on the error rates and on the procedures developed to control them. This leads to the description of an innovative methodology based on a factor structure that models data heterogeneity and provides a general framework for dealing with dependence in multiple testing. The proposed framework reduces the variability of error rates and consequently yields large improvements in the power and stability of simultaneous inference relative to existing multiple testing procedures. The estimation of the model parameters in a high-dimensional setting and the determination of the number of factors to include in the model are also discussed. These results are then illustrated on real data from microarray experiments, analyzed using the R package FAMT. This paper is an extended written version of my oral presentation on the same topic at the 44th Journées de Statistique, organized by the French Statistical Society (SFdS) in Brussels, Belgium, in 2012, on the occasion of receiving the Marie-Jeanne Laurent-Duhamel prize.
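The effect of dependence on error-rate variability described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's method and does not use the FAMT package. It assumes a single common factor shared by all test statistics, two-sided z-tests, and the Benjamini-Hochberg step-up procedure as the error-controlling procedure; the function names and all parameter values (`loading`, `m`, `m1`, `effect`, `n_rep`) are hypothetical choices made for illustration only.

```python
import math
import random
import statistics


def normal_sf(x):
    """Upper tail probability of the standard normal distribution."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def benjamini_hochberg(pvalues, alpha=0.05):
    """Return the set of indices rejected by the BH step-up procedure."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    k = 0  # largest rank whose p-value falls under the BH line
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            k = rank
    return set(order[:k])


def simulate_fdp(loading, m=200, m1=20, effect=3.0, alpha=0.05,
                 n_rep=400, seed=0):
    """Simulate false discovery proportions under a one-factor model.

    Assumed model (illustrative): statistic j = mean_j + loading * Z
    + sqrt(1 - loading^2) * e_j, with Z a factor shared by all tests.
    The first m1 hypotheses are false nulls with mean `effect`.
    loading = 0 gives independent statistics.
    """
    rng = random.Random(seed)
    specific_sd = math.sqrt(1.0 - loading ** 2)
    fdps = []
    for _ in range(n_rep):
        factor = rng.gauss(0.0, 1.0)  # common factor for this replicate
        pvals = []
        for j in range(m):
            mean = effect if j < m1 else 0.0
            stat = mean + loading * factor + specific_sd * rng.gauss(0.0, 1.0)
            pvals.append(2.0 * normal_sf(abs(stat)))  # two-sided p-value
        rejected = benjamini_hochberg(pvals, alpha)
        false_rejections = sum(1 for j in rejected if j >= m1)
        fdps.append(false_rejections / max(len(rejected), 1))
    return fdps


if __name__ == "__main__":
    for loading in (0.0, 0.8):
        fdp = simulate_fdp(loading)
        print(f"loading={loading}: mean FDP={statistics.mean(fdp):.3f}, "
              f"sd FDP={statistics.pstdev(fdp):.3f}")
```

Comparing the standard deviation of the false discovery proportion for `loading=0.0` and `loading=0.8` gives a rough sense of how a shared factor makes the realized error rate less stable from one dataset to the next, which is the instability that a factor-adjusted analysis aims to reduce.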

Keywords

Multiple testing; Dependence; High dimension; Error rates; Factor analysis; Proportion of null hypotheses

Domains

Methodology [stat.ME]; Statistics [math.ST]; Statistics Theory [stat.TH]

Main file

sfds_jsfds_153-2_100-122.pdf (969.88 KB)

Origin: Publisher files allowed on an open archive


https://hal.science/hal-00880140

Submitted on: Tuesday, 5 November 2013, 14:43:52

Last modified on: Thursday, 14 March 2024, 03:10:15

Long-term archiving on: Thursday, 6 February 2014, 04:37:17

Dates and versions

hal-00880140, version 1 (05-11-2013)

Identifiers

HAL Id: hal-00880140, version 1

Cite

Chloé Friguet. A general approach to account for dependence in large-scale multiple testing. Journal de la Société Française de Statistique, 2012, 153 (2), pp.100-122. ⟨hal-00880140⟩


Collections

UNIV-BREST UNIV-RENNES1 IRMAR CNRS UBS UR1-MATH-STIC UNIV-RENNES IBNM UR1-MATH-NUM

