Maximally Informative k-Itemset Mining from Massively Distributed Data Streams - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Maximally Informative k-Itemset Mining from Massively Distributed Data Streams

Résumé

We address the problem of mining maximally informative k-itemsets (miki) in data streams based on joint entropy. We propose PentroS, a highly scalable parallel miki mining algorithm. PentroS renders the mining process of large volumes of incoming data very efficient. It is designed to take into account the continuous aspect of data streams, particularly by reducing the computations of need for updating the miki results after arrival/departure of transactions to/from the sliding window. PentroS has been extensively evaluated using massive real-world data streams. Our experimental results confirm the effectiveness of our proposal which allows excellent throughput with high itemset length.
Fichier principal
Vignette du fichier
ACM_SAC_2018.pdf (356.05 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01711990 , version 1 (19-02-2018)

Identifiants

Citer

Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia. Maximally Informative k-Itemset Mining from Massively Distributed Data Streams. SAC: Symposium on Applied Computing, Apr 2018, Pau, France. pp.502-509, ⟨10.1145/3167132.3167187⟩. ⟨hal-01711990⟩
333 Consultations
312 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More