A Data Mining-Based OLAP Aggregation of Complex Data: Application on XML Documents

Abstract : Nowadays, most organizations deal with complex data having different formats and coming from different sources. The XML formalism is evolving and becoming a promising solution for modelling and warehousing these data in decision support systems. Nevertheless, classical OLAP tools are still not capable to analyze such data. In this paper, we associate OLAP and data mining to cope advanced analysis on complex data. We provide a generalized OLAP operator, called OpAC, based on the AHC. OpAC is adapted for all types of data since it deals with data cubes modelled within XML. Our operator enables significant aggregates of facts expressing semantic similarities. Evaluation criteria of aggregates' partitions are proposed in order to assist the choice of the best partition. Furthermore, we developed a Web application for our operator. We also provide performance experiments and drive a case study on XML documents dealing with the breast cancer researches domain.
Complete list of metadatas

Cited literature [29 references]  Display  Hide  Download

Contributor : Sabine Loudcher <>
Submitted on : Monday, April 26, 2010 - 3:38:07 PM
Last modification on : Wednesday, April 3, 2019 - 1:09:03 AM
Long-term archiving on : Thursday, June 30, 2011 - 12:16:16 PM


Files produced by the author(s)


  • HAL Id : halshs-00476497, version 1



Riadh Ben Messaoud, Omar Boussaïd, Sabine Loudcher Rabaseda. A Data Mining-Based OLAP Aggregation of Complex Data: Application on XML Documents. International Journal of Data Warehousing and Mining, 2006, 2 (4), pp.1-26. ⟨halshs-00476497⟩



Record views


Files downloads