Automatic Ontology Population from Product Catalogs

Abstract : In this paper we present an approach to ontology population from heterogeneous documents that describe commercial products in varied wording and diverse styles. Its originality lies in the generation and progressive refinement of semantic annotations that identify the types of the products and their features, even though the initial information is of very poor quality. Documents are annotated using an ontology. The annotation process starts from an initial set of known instances, built from terminological elements added to the ontology. Our approach first applies semi-automated annotation techniques to a small dataset and then uses machine learning techniques to fully annotate the entire dataset. This work was motivated by specific application needs. Experiments were conducted on real-world datasets in the toy domain.
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01115513
Contributor : Chantal Reynaud
Submitted on : Wednesday, February 11, 2015 - 11:34:02 AM
Last modification on : Wednesday, November 14, 2018 - 12:52:02 PM

Identifiers

  • HAL Id : hal-01115513, version 1

Citation

Céline Alec, Chantal Reynaud-Delaître, Brigitte Safar, Zied Sellami, Uriel Berdugo. Automatic Ontology Population from Product Catalogs. Knowledge Engineering and Knowledge Management - 19th International Conference (EKAW), Nov 2014, Linköping, Sweden. pp.1-12. ⟨hal-01115513⟩
