SPEED : Mining Maxirnal Sequential Patterns over Data Strearns - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

SPEED : Mining Maxirnal Sequential Patterns over Data Strearns

Résumé

Many recent real-world applications, such as network traffic monitoring, intrusion detection systems, sensor network data analysis, click stream mining and dynamic tracing of financial transactions, call for studying a new kind of data. Called stream data, this model is, in fact, a continuous, potentially infinite ow of information as opposed to finite, statically stored data sets extensively studied by researchers of the data mining community. An important application is to mine data streams for interesting patterns or anomalies as they happen. For data stream applications, the volume of data is usually too huge to be stored on permanent devices, main memory or to be scanned thoroughly more than once. We thus need to introduce approximations when executing queries and performing mining tasks over rapid data streams. In this paper we propose a new approach, called Speed (Sequential Patterns Efficient Extraction in Data streams), to identify frequent maximal sequential patterns in a data stream. To the best of our knowledge this is the first approach defined for mining sequential patterns in streaming data. The main originality of our mining method is that we use a novel data structure to maintain frequent sequential patterns coupled with a fast pruning strategy. At any time, users can issue requests for frequent maximal sequences over an arbitrary time interval. Furthermore, our approach produces an approximate support answer with an assurance that it will not bypass a user-defined frequency error threshold. Finally the proposed method is analyzed by a series of experiments on different datasets.
Fichier principal
Vignette du fichier
I3E-IS06-SPEED.pdf (168.4 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-00134387 , version 1 (01-03-2007)

Identifiants

Citer

Chedy Raïssi, Pascal Poncelet, Maguelonne Teisseire. SPEED : Mining Maxirnal Sequential Patterns over Data Strearns. IS: Intelligent Systems, Sep 2006, London, United Kingdom. pp.546-552, ⟨10.1109/IS.2006.348478⟩. ⟨hal-00134387⟩
185 Consultations
166 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More