Skip to Main content Skip to Navigation
Conference papers

An Integrated Approach for Large-Scale Relation Extraction from the Web

Abstract : Deriving knowledge from information stored in unstructured documents is a major challenge. More specifically, binary relationships representing facts between entities can be extracted to populate semantic triple stores or large knowledge bases. The main constraint of all knowledge extraction approaches is to find a trade-off between quality and scalability. Thus, we propose in this paper SPIDER, a novel integrated system for extracting binary relationships at large scale. Through series of experiments, we show the benefit of our approach, which in general, outperforms existing systems both in terms of quality (precision and the number of discovered facts) and scalability.
Document type :
Conference papers
Complete list of metadatas
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Wednesday, June 29, 2016 - 3:52:08 PM
Last modification on : Wednesday, July 8, 2020 - 12:43:36 PM

Links full text



Naimdjon Takhirov, Fabien Duchateau, Trond Aalberg, Solvberg Ingeborg. An Integrated Approach for Large-Scale Relation Extraction from the Web. Asia-Pacific Web Conference, Apr 2013, Sydney, Australia. pp.163-175, ⟨10.1007/978-3-642-37401-2_18⟩. ⟨hal-01339299⟩



Record views