MATAM: reconstruction of phylogenetic marker genes from short sequencing reads in metagenomes

Pierre Pericard 1, 2 Yoann Dufresne 1, 2 Loïc Couderc 3, 1 Samuel Blanquart 2 Hélène Touzet 1, 2
2 BONSAI - Bioinformatics and Sequence Analysis
Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189, CNRS - Centre National de la Recherche Scientifique
Abstract : Motivation: Advances in the sequencing of uncultured environmental samples, dubbed metagenomics, raise a growing need for accurate taxonomic assignment. Accurate identification of organisms present within a community is essential to understanding even the most elementary ecosystems. However, current high-throughput sequencing technologies generate short reads which partially cover full-length marker genes and this poses difficult bioinformatic challenges for taxonomy identification at high resolution. Results: We designed MATAM, a software dedicated to the fast and accurate targeted assembly of short reads sequenced from a genomic marker of interest. The method implements a stepwise process based on construction and analysis of a read overlap graph. It is applied to the assembly of 16S rRNA markers and is validated on simulated, synthetic and genuine metagenomes. We show that MATAM outperforms other available methods in terms of low error rates and recovered fractions and is suitable to provide improved assemblies for precise taxonomic assignments.
Type de document :
Article dans une revue
Bioinformatics, Oxford University Press (OUP), 2017, 〈10.1093/bioinformatics/btx644〉
Liste complète des métadonnées

Littérature citée [32 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01646297
Contributeur : Samuel Blanquart <>
Soumis le : mardi 16 janvier 2018 - 13:54:28
Dernière modification le : mardi 2 octobre 2018 - 16:13:13
Document(s) archivé(s) le : mardi 8 mai 2018 - 00:57:50

Fichier

main.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Pierre Pericard, Yoann Dufresne, Loïc Couderc, Samuel Blanquart, Hélène Touzet. MATAM: reconstruction of phylogenetic marker genes from short sequencing reads in metagenomes. Bioinformatics, Oxford University Press (OUP), 2017, 〈10.1093/bioinformatics/btx644〉. 〈hal-01646297〉

Partager

Métriques

Consultations de la notice

302

Téléchargements de fichiers

76