Parallel Position Weight Matrices Algorithms

Mathieu Giraud 1, 2 Jean-Stéphane Varré 1, 2
2 BONSAI - Bioinformatics and Sequence Analysis
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe
Abstract : Position Weight Matrices (PWMs) are broadly used in computational biology. The basic problems, Scan and MultipleScan, aim to find all the occurrences of a given PWM or a set of PWMs in long sequences. Some other PWM tasks share a common NP-hard subproblem, ScoreDistribution. The existing algorithms rely on the enumeration on a large set of scores or words, and they are mostly not suitable for parallelization. We propose a new algorithm, BucketScoreDistribution, that is both very efficient and suitable for parallelization. We bound the error induced by this algorithm. We realized a GPU prototype for Scan, MultipleScan and BucketScoreDistribution with the CUDA libraries, and report for the different problems speedups larger than 10× on several Nvidia cards.
Liste complète des métadonnées

Littérature citée [38 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00623404
Contributeur : Mathieu Giraud <>
Soumis le : mercredi 14 septembre 2011 - 10:48:57
Dernière modification le : jeudi 21 février 2019 - 10:52:54
Document(s) archivé(s) le : mardi 13 novembre 2012 - 10:45:58

Fichier

parallelcomputing.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Mathieu Giraud, Jean-Stéphane Varré. Parallel Position Weight Matrices Algorithms. Parallel Computing, Elsevier, 2011, 37, pp.466-478. 〈10.1016/j.parco.2010.10.001〉. 〈hal-00623404〉

Partager

Métriques

Consultations de la notice

466

Téléchargements de fichiers

634