An Integrated Approach for Large-Scale Relation Extraction from the Web

Abstract : Deriving knowledge from information stored in unstructured documents is a major challenge. More specifically, binary relationships representing facts between entities can be extracted to populate semantic triple stores or large knowledge bases. The main constraint of all knowledge extraction approaches is to find a trade-off between quality and scalability. Thus, we propose in this paper SPIDER, a novel integrated system for extracting binary relationships at large scale. Through series of experiments, we show the benefit of our approach, which in general, outperforms existing systems both in terms of quality (precision and the number of discovered facts) and scalability.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01339299
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Wednesday, June 29, 2016 - 3:52:08 PM
Last modification on : Thursday, November 21, 2019 - 2:23:22 AM

Links full text

Identifiers

Citation

Naimdjon Takhirov, Fabien Duchateau, Trond Aalberg, Solvberg Ingeborg. An Integrated Approach for Large-Scale Relation Extraction from the Web. Asia-Pacific Web Conference, Apr 2013, Sydney, Australia. pp.163-175, ⟨10.1007/978-3-642-37401-2_18⟩. ⟨hal-01339299⟩

Share

Metrics

Record views

139