Skip to Main content Skip to Navigation
Conference papers

A semi-automatic approach for building ontologies from a collection of structured web documents

Abstract : Many collections of structured documents are available on the web. The collection generally describes the characteristics of entities from a single type, where each page describes one entity. These documents are adequate knowledge sources for building ontologies. As they benefit from a strong and shared layout, they contain less well written text than plain text files but their architecture is very meaningful. Classical linguistic-based methods for identifying concepts and relations are no longer appropriate for analyzing them. The approach we propose in this paper exploits various properties of such documents, combining layout/formatting analysis and linguistic analysis, and using semantic annotation.
Complete list of metadatas

Cited literature [5 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01264565
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Friday, January 29, 2016 - 1:49:28 PM
Last modification on : Tuesday, September 1, 2020 - 7:08:03 PM
Long-term archiving on: : Friday, November 11, 2016 - 8:19:31 PM

File

kamel_12473.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01264565, version 1
  • OATAO : 12473

Citation

Mouna Kamel, Nathalie Aussenac-Gilles, Davide Buscaldi, Catherine Comparot. A semi-automatic approach for building ontologies from a collection of structured web documents. 7th International Conference on Knowledge Capture (K-CAP 2013), Jun 2013, Banff, Canada. pp. 139-140. ⟨hal-01264565⟩

Share

Metrics

Record views

249

Files downloads

215