Ancestry informative markers for fine-scale individual assignment to worldwide populations - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Medical Genetics Année : 2010

Ancestry informative markers for fine-scale individual assignment to worldwide populations

Résumé

The analysis of large-scale genetic data from thousands of individuals has revealed the fact that subtle population genetic structure can be detected at levels that were previously unimaginable. Using the Human Genome Diversity Panel as reference (51 populations - 650,000 SNPs), we describe a systematic evaluation of the resolution that can be achieved for the inference of genetic ancestry, even when small panels of genetic markers are used. Leveraging the power of Principal Components Analysis (PCA), we undertake a comprehensive investigation of human population structure around the world. We dissect the problem into hierarchical steps, proposing a decision tree for the prediction of individual ancestry. A complete leave-one-out validation experiment demonstrates that, using all available SNPs, assignment of individuals to their self-reported populations of origin is essentially perfect. Ancestry informative genetic markers are selected using two different metrics (In and correlation with PCA scores). Performing a thorough crossvalidation experiment, we show that, in most cases here, the number of SNPs needed for ancestry inference can be successfully reduced to less than 0.1% of the original 650,000 while retaining close to 100% accuracy. This reduction can be achieved using a clustering-based redundancy removal algorithm which we also introduce. The applicability of our suggested SNP panels is tested on HapMap Phase 3 populations. The methods we describe, in combination with the increasingly more comprehensive databases of human genetic variation, open new horizons in a variety of fields, ranging from the study of human evolution and population history, to medical genetics and forensics.
Fichier principal
Vignette du fichier
PEER_stage2_10.1136%2Fjmg.2010.078212.pdf (3.95 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00573484 , version 1 (04-03-2011)

Identifiants

Citer

Peristera Paschou, Jamey Lewis, Asif Javed, Petros Drineas. Ancestry informative markers for fine-scale individual assignment to worldwide populations. Journal of Medical Genetics, 2010, 47 (12), pp.835. ⟨10.1136/jmg.2010.078212⟩. ⟨hal-00573484⟩

Collections

PEER
37 Consultations
2579 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More