Parallel and memory-efficient reads indexing for genome assembly - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Parallel and memory-efficient reads indexing for genome assembly

Résumé

As genomes, transcriptomes and meta-genomes are being sequenced at a faster pace than ever, there is a pressing need for efficient genome assembly methods. Two practical issues in assembly are heavy memory usage and long execution time during the read indexing phase. In this article, a parallel and memory-efficient method is proposed for reads indexing prior to assembly. Specifically, a hash-based structure that stores a reduced amount of read information is designed. Erroneous entries are filtered on the fly during index construction. A prototype implementation has been designed and applied to actual Illumina short reads. Benchmark evaluation shows that this indexing method requires significantly less memory than those from popular assemblers.
Fichier principal
Vignette du fichier
CP144.pdf (243.29 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00637536 , version 1 (02-11-2011)

Identifiants

  • HAL Id : inria-00637536 , version 1

Citer

Rayan Chikhi, Guillaume Chapuis, Dominique Lavenier. Parallel and memory-efficient reads indexing for genome assembly. Parallel Bio-Computing 2011, Sep 2011, torun, Poland. ⟨inria-00637536⟩
291 Consultations
387 Téléchargements

Partager

Gmail Facebook X LinkedIn More