A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining - Archive ouverte HAL Access content directly
Book Sections Year : 2009

A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining

Abstract

A text mining process using association rules generates a very large number of rules. According to experts of the domain, most of these rules basically convey a common knowledge, i.e. rules which associate terms that experts may likely relate to each other. In order to focus on the result interpretation and discover new knowledge units, it is necessary to define criteria for classifying the extracted rules. Most of the rule classification methods are based on numerical quality measures. In this chapter, we introduce two classification methods: The first one is based on a classical numerical approach, i.e. using quality measures, and the other one is based on domain knowledge. We propose the second original approach in order to classify association rules according to qualitative criteria using domain model as background knowledge. Hence, we extend the classical numerical approach in an effort to combine data mining and semantic techniques for post mining and selection of association rules. We mined a corpus of texts in molecular biology and present the results of both approaches, compare them, and give a discussion on the benefits of taking into account a knowledge domain model of the data.
Fichier principal
Vignette du fichier
PM06Cherfi-final.pdf (307.48 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

inria-00437237 , version 1 (30-11-2009)

Identifiers

  • HAL Id : inria-00437237 , version 1

Cite

Hacène Cherfi, Amedeo Napoli, Yannick Toussaint. A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining. Yanchang Zhao and Chengqi Zhang and Longbing Cao. Post-Mining of Association Rules: Techniques for Effective Knowledge Extraction, IGI Global, 2009, 978-1605664040. ⟨inria-00437237⟩
134 View
110 Download

Share

Gmail Facebook X LinkedIn More