ParaMiner: a Generic Pattern Mining Algorithm for Multi-Core Architectures

Abstract : In this paper, we present Para Miner which is a generic and parallel algorithm for closed pattern mining. Para Miner is built on the principles of pattern enumeration in strongly accessible set systems. Its efficiency is due to a novel dataset reduction technique (that we call EL-reduction), combined with novel technique for performing dataset reduction in a parallel execution on a multi-core architecture. We illustrate Para Miner's genericity by using this algorithm to solve three different pattern mining problems: the frequent itemset mining problem, the mining frequent connected relational graphs problem and the mining gradual itemsets problem. In this paper, we prove the soundness and the completeness of Para Miner. Furthermore, our experiments show that despite being a generic algorithm, Para Miner can compete with specialized state of the art algorithms designed for the pattern mining problems mentioned above. Besides, for the particular problem of gradual itemset mining, Para Miner outperforms the state of the art algorithm by two orders of magnitude.
Type de document :
Article dans une revue
Data Mining and Knowledge Discovery, Springer Verlag, 2014, 28 (3), pp.595-633. <http://link.springer.com/article/10.1007%2Fs10618-013-0313-2>. <10.1007/s10618-013-0313-2>
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-00923535
Contributeur : Alexandre Termier <>
Soumis le : vendredi 3 janvier 2014 - 15:33:19
Dernière modification le : vendredi 19 décembre 2014 - 08:53:58

Identifiants

Citation

Benjamin Negrevergne, Alexandre Termier, Marie-Christine Rousset, Jean-François Mehaut. ParaMiner: a Generic Pattern Mining Algorithm for Multi-Core Architectures. Data Mining and Knowledge Discovery, Springer Verlag, 2014, 28 (3), pp.595-633. <http://link.springer.com/article/10.1007%2Fs10618-013-0313-2>. <10.1007/s10618-013-0313-2>. <hal-00923535>

Partager

Métriques

Consultations de la notice

190