Entrepôts, Représentation et Ingénierie des Connaissances
Publications du laboratoire

Recherche approfondie

par Année
par Auteur
par Thème
par Type
--------------------
- A Data Mining-Based OLAP Aggregation of Complex Data: Application on XML Documents hal link

Auteur(s): Ben Messaoud Riadh, Boussaid O., Loudcher S.

(Article) Publié: International Journal Of Data Warehousing And Mining, vol. 2 p.1-26 (2006)


Ref HAL: halshs-00476497_v1
Exporter : BibTex | endNote
Résumé:

Nowadays, most organizations deal with complex data having different formats and coming from different sources. The XML formalism is evolving and becoming a promising solution for modelling and warehousing these data in decision support systems. Nevertheless, classical OLAP tools are still not capable to analyze such data. In this paper, we associate OLAP and data mining to cope advanced analysis on complex data. We provide a generalized OLAP operator, called OpAC, based on the AHC. OpAC is adapted for all types of data since it deals with data cubes modelled within XML. Our operator enables significant aggregates of facts expressing semantic similarities. Evaluation criteria of aggregates' partitions are proposed in order to assist the choice of the best partition. Furthermore, we developed a Web application for our operator. We also provide performance experiments and drive a case study on XML documents dealing with the breast cancer researches domain.