Conference paper, year: 2005

Using NLP to build the hypertextual network of a back-of-the-book index

Abstract

Relying on the idea that back-of-the-book indexes are traditional devices for navigating large documents, we have developed a method for building a hypertextual network that supports navigation within a document. Building such a hypertextual network requires selecting a list of descriptors, identifying the relevant text segments to associate with each descriptor, and finally ranking the descriptors and reference segments in order of relevance. We propose a specific document segmentation method and a relevance measure for information ranking. The algorithms are tested on four corpora (of different types and domains) without human intervention or any semantic knowledge.
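The three steps named in the abstract (descriptor list, segment association, relevance ranking) can be illustrated with a minimal sketch. It is not the authors' method: the paragraph-based segmentation, the occurrences-per-word relevance score, and the names `split_segments`, `count_occurrences`, and `build_index` are all assumptions chosen for illustration, and the descriptor list is taken as given rather than extracted.

```python
import re


def split_segments(text):
    """Stand-in segmentation (assumption): one segment per blank-line-separated paragraph."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def count_occurrences(descriptor, segment):
    """Case-insensitive whole-word count of a descriptor in a segment."""
    pattern = r"\b" + re.escape(descriptor) + r"\b"
    return len(re.findall(pattern, segment, flags=re.IGNORECASE))


def build_index(text, descriptors):
    """Return [(descriptor, [segment indices])], both levels ranked by a stand-in relevance."""
    segments = split_segments(text)
    index = {}
    for d in descriptors:
        scored = []
        for i, seg in enumerate(segments):
            n = count_occurrences(d, seg)
            if n:
                # Stand-in relevance (assumption): occurrences per word of the segment.
                scored.append((n / len(seg.split()), i))
        # Most relevant segments first; ties are broken by document order.
        scored.sort(key=lambda s: (-s[0], s[1]))
        index[d] = [i for _, i in scored]
    # Stand-in descriptor ranking (assumption): more reference segments first, then alphabetical.
    ranked = sorted(index, key=lambda d: (-len(index[d]), d))
    # Descriptors that match no segment are left out of the index.
    return [(d, index[d]) for d in ranked if index[d]]
```

Calling `build_index(document_text, ["index", "terms"])` yields, for each descriptor found in the text, the indices of its reference segments from most to least relevant under these assumed scores. A real system would replace both the segmentation and the scoring with the method and measure the paper proposes.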
Main file
RANLP05-aitelmekki-nazarenko-VF.pdf (112.98 KB)

Dates and versions

hal-00098036 , version 1 (23-09-2006)

Cite

Touria Aït El Mekki, Adeline Nazarenko. Using NLP to build the hypertextual network of a back-of-the-book index. 2005, pp.316-320. ⟨hal-00098036⟩