Dynamic read mapping and online consensus calling for better variant detection

Abstract : Variant detection from high-throughput sequencing data is an essential step in identification of alleles involved in complex diseases and cancer. To deal with these massive data, elaborated sequence analysis pipelines are employed. A core component of such pipelines is a read mapping module whose accuracy strongly affects the quality of resulting variant calls. We propose a dynamic read mapping approach that significantly improves read alignment accuracy. The general idea of dynamic mapping is to continuously update the reference sequence on the basis of previously computed read alignments. Even though this concept already appeared in the literature, we believe that our work provides the first comprehensive analysis of this approach. To evaluate the benefit of dynamic mapping, we developed a software pipeline (http://github.com/karel-brinda/dymas) that mimics different dynamic mapping scenarios. The pipeline was applied to compare dynamic mapping with the conventional static mapping and, on the other hand, with the so-called iterative referencing – a computationally expensive procedure computing an optimal modification of the reference that maximizes the overall quality of all alignments. We conclude that in all alternatives, dynamic mapping results in a much better accuracy than static mapping, approaching the accuracy of iterative referencing. To correct the reference sequence in the course of dynamic mapping, we developed an online consensus caller named Ococo (http://github.com/karel-brinda/ococo). Ococo is the first consensus caller capable to process input reads in the online fashion. Finally, we provide conclusions about the feasibility of dynamic mapping and discuss main obstacles that have to be overcome to implement it. We also review a wide range of possible applications of dynamic mapping with a special emphasis on variant detection.
Type de document :
Pré-publication, Document de travail
2016
Liste complète des métadonnées


https://hal.archives-ouvertes.fr/hal-01323188
Contributeur : Karel Břinda <>
Soumis le : lundi 30 mai 2016 - 11:25:05
Dernière modification le : mardi 11 octobre 2016 - 15:09:37
Document(s) archivé(s) le : mercredi 31 août 2016 - 10:34:07

Fichier

main.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01323188, version 1

Citation

Karel Břinda, Valentina Boeva, Gregory Kucherov. Dynamic read mapping and online consensus calling for better variant detection. 2016. <hal-01323188>

Partager

Métriques

Consultations de
la notice

334

Téléchargements du document

158