Conference paper, Year: 2006

A proximity measure and a clustering method for concept extraction in an ontology building perspective

Abstract

In this paper, we study the problem of clustering textual units to help an expert build a specialized ontology. This work was carried out within a French project, BIOTIM, which handles botany corpora. Building an ontology, whether automatically or semi-automatically, is a difficult task. We focus on one of the main steps of that process: structuring the textual units occurring in the texts into classes likely to represent concepts of the domain. The approach we propose relies on a new non-symmetrical measure for evaluating the semantic proximity between lemmas, taking into account the contexts in which they occur in the documents. We also present an unsupervised classification algorithm designed for this task and this kind of data. The first experiments performed on botanical data have given relevant results.
No file deposited

Dates and versions

hal-00084782, version 1 (10-07-2006)

Identifiers

  • HAL Id: hal-00084782, version 1

Cite

Guillaume Cleuziou, Sylvie Billot, Stanislas Lew, Lionel Martin, Christel Vrain. A proximity measure and a clustering method for concept extraction in an ontology building perspective. 16th International Symposium on Methodologies for Intelligent Systems (ISMIS'2006), 2006, European Union. pp.697-706. ⟨hal-00084782⟩