GO-SPADE: Mining Sequential Patterns over Datasets with Consecutive Repetitions - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2003

GO-SPADE: Mining Sequential Patterns over Datasets with Consecutive Repetitions

Résumé

Databases of sequences can contain consecutive repetitions of items. This is the case in particular when some items represent discretized quantitative values. We show that on such databases, a typical algorithm like the SPADE algorithm tends to loose its efficiency. SPADE is based on the used of lists containing the localization of the occurrences of a pattern in the sequences and these lists are not appropriated in the case of data with repetitions. We introduce the concept of generalized occurrences and the corresponding primitive operators to manipulate them. We present an algorithm called GO-SPADE that extends SPADE to incorporate generalized occurrences. Finally we present experiments showing that GO-SPADE can handle sequences containing consecutive repetitions at nearly no extra cost.

Dates et versions

hal-01588170 , version 1 (15-09-2017)

Identifiants

Citer

Marion Leleu, Christophe Rigotti, Jean-François Boulicaut, G Euvrard. GO-SPADE: Mining Sequential Patterns over Datasets with Consecutive Repetitions. International Workshop on Machine Learning and Data Mining in Pattern Recognition, MLDM'03, Jul 2003, Leipzig, Germany. pp.293-306, ⟨10.1007/3-540-45065-3_26⟩. ⟨hal-01588170⟩
78 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More