Conference papers

Healthcare trajectory mining by combining multidimensional component and itemsets

Abstract : Sequential pattern mining is an approach to extract correlations among temporal data. Many different methods have been proposed to enumerate either sequences of set-valued data (i.e., itemsets) or sequences containing multidimensional items. However, in many real-world scenarios, data sequences are described as events carrying both multidimensional and set-valued information. These rich heterogeneous descriptions cannot be exploited by traditional approaches. For example, in the healthcare domain, hospitalizations are defined as sequences of multidimensional attributes (e.g. Hospital or Diagnosis) associated with sets of medical procedures (e.g. {Radiography, Appendectomy}). In this paper we propose a new approach called MMISP (Mining Multi-dimensional-Itemset Sequential Patterns) to extract patterns from sequences including both multidimensional and set-valued data. The novelties of the proposal lie in: (i) the way in which the data can be efficiently compressed; (ii) the ability to reuse a state-of-the-art sequential pattern mining algorithm; and (iii) the extraction of a new kind of pattern. We introduce, as a case study, experiments on real data aggregated from a regional healthcare system and we point out the usefulness of the extracted patterns. Additional experiments on synthetic data highlight the efficiency and scalability of our approach.
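
The kind of data the abstract describes can be illustrated with a minimal Python sketch. It is not the MMISP algorithm and does not reproduce the paper's compression step; it only models an event as a tuple of dimension values plus an itemset, and counts how many sequences contain a given pattern. The names `Event`, `event_matches`, `contains` and `support`, the use of `None` as a wildcard for a dimension, and the three toy trajectories are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Event:
    """One hospitalization: dimension values plus a set of procedures.

    In a pattern, a dimension set to None acts as a wildcard
    (an assumption of this sketch, not taken from the paper).
    """
    dims: Tuple[Optional[str], ...]   # e.g. (hospital, diagnosis)
    items: FrozenSet[str]             # e.g. {"Radiography", "Appendectomy"}


def event_matches(pattern: Event, event: Event) -> bool:
    """A pattern event matches if every non-wildcard dimension is equal
    and the pattern's itemset is a subset of the event's itemset."""
    if len(pattern.dims) != len(event.dims):
        return False
    dims_ok = all(p is None or p == e for p, e in zip(pattern.dims, event.dims))
    return dims_ok and pattern.items <= event.items


def contains(sequence: Sequence[Event], pattern: Sequence[Event]) -> bool:
    """Order-preserving containment: greedily match pattern events
    against the sequence from left to right."""
    pos = 0
    for event in sequence:
        if pos < len(pattern) and event_matches(pattern[pos], event):
            pos += 1
    return pos == len(pattern)


def support(database: Sequence[Sequence[Event]], pattern: Sequence[Event]) -> int:
    """Number of sequences in the database that contain the pattern."""
    return sum(1 for seq in database if contains(seq, pattern))


# Toy patient trajectories (made-up data, for illustration only).
database = [
    [Event(("H1", "Appendicitis"), frozenset({"Radiography", "Appendectomy"})),
     Event(("H2", "Checkup"), frozenset({"Radiography"}))],
    [Event(("H1", "Appendicitis"), frozenset({"Radiography"}))],
    [Event(("H2", "Appendicitis"), frozenset({"Appendectomy", "Radiography"})),
     Event(("H2", "Checkup"), frozenset({"Blood test"}))],
]

# "An appendicitis stay in any hospital that includes an appendectomy."
single = [Event((None, "Appendicitis"), frozenset({"Appendectomy"}))]

# "An appendicitis stay with radiography, later followed by a checkup at H2."
followed = [
    Event((None, "Appendicitis"), frozenset({"Radiography"})),
    Event(("H2", "Checkup"), frozenset()),
]

if __name__ == "__main__":
    print(support(database, single))
    print(support(database, followed))
```

A pattern miner would enumerate candidate patterns and keep those whose support reaches a chosen threshold; this sketch only covers the containment test that such a threshold is applied to.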
Document type :
Conference papers

Cited literature [18 references]
Contributor : Import Ws Irstea
Submitted on : Monday, March 18, 2013 - 2:25:21 PM
Last modification on : Thursday, July 2, 2020 - 1:59:01 PM
Long-term archiving on : Sunday, April 2, 2017 - 2:16:03 PM


Files produced by the author(s)


  • HAL Id : hal-00801813, version 1
  • IRSTEA : PUB00037628


Elias Egho, Chedy Raïssi, Dino Ienco, Nicolas Jay, Amedeo Napoli, et al. Healthcare trajectory mining by combining multidimensional component and itemsets. ECML-PKDD 2012, Sep 2012, Bristol, United Kingdom. pp. 116-127. ⟨hal-00801813⟩


