Sparse approaches for the exact distribution of patterns in long multi-states sequences generated by a Markov source - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2010

Sparse approaches for the exact distribution of patterns in long multi-states sequences generated by a Markov source

Résumé

We present two novel approaches for the computation of the exact distribution of a pattern in a long sequence. Both approaches take into account the sparse structure of the problem. The first approach relies on a partial recursion computing the largest eigenvalue of the the transition matrix of a Markov chain embedding. The second approach uses fast Taylor expansions of an exact bivariate rational reconstruction of the distribution. We illustrate the interest of both approaches on a simple toy-example and two biological applications: the transcription factors of the Human Chromosome 5 and the PROSITE signatures of functional motifs in proteins. On these examples our methods demonstrate their complementarity and their hability to extend the domain of feasibility for exact computations in pattern problems to a new level.
Fichier principal
Vignette du fichier
symbnum_pattern_sparse.pdf (395.98 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00492738 , version 1 (16-06-2010)
hal-00492738 , version 2 (06-12-2011)
hal-00492738 , version 3 (30-04-2012)
hal-00492738 , version 4 (05-06-2012)

Identifiants

Citer

Grégory Nuel, Jean-Guillaume Dumas. Sparse approaches for the exact distribution of patterns in long multi-states sequences generated by a Markov source. 2010. ⟨hal-00492738v1⟩
429 Consultations
220 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More