An Integrated Approach for Large-Scale Relation Extraction from the Web

Naimdjon Takhirov; Fabien Duchateau; Trond Aalberg; Solvberg Ingeborg

doi:10.1007/978-3-642-37401-2_18

Communication Dans Un Congrès Année : 2013

An Integrated Approach for Large-Scale Relation Extraction from the Web

(1) , (2) , (1) , (1)

1
2

Naimdjon Takhirov

Fonction : Auteur

Department of Computer and Information Science [Trondheim]

Fabien Duchateau

Fonction : Auteur
PersonId : 4098
IdHAL : fabien-duchateau
IdRef : 142567302

Base de Données

Trond Aalberg

Fonction : Auteur

Department of Computer and Information Science [Trondheim]

Solvberg Ingeborg

Fonction : Auteur

Department of Computer and Information Science [Trondheim]

Résumé

Deriving knowledge from information stored in unstructured documents is a major challenge. More specifically, binary relationships representing facts between entities can be extracted to populate semantic triple stores or large knowledge bases. The main constraint of all knowledge extraction approaches is to find a trade-off between quality and scalability. Thus, we propose in this paper SPIDER, a novel integrated system for extracting binary relationships at large scale. Through series of experiments, we show the benefit of our approach, which in general, outperforms existing systems both in terms of quality (precision and the number of discovered facts) and scalability.

Domaines

Informatique [cs]

Équipe gestionnaire des publications SI LIRIS : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01339299

Soumis le : mercredi 29 juin 2016-15:52:08

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-01339299 , version 1 (29-06-2016)

Identifiants

HAL Id : hal-01339299 , version 1
DOI : 10.1007/978-3-642-37401-2_18

Citer

Naimdjon Takhirov, Fabien Duchateau, Trond Aalberg, Solvberg Ingeborg. An Integrated Approach for Large-Scale Relation Extraction from the Web. Asia-Pacific Web Conference, Apr 2013, Sydney, Australia. pp.163-175, ⟨10.1007/978-3-642-37401-2_18⟩. ⟨hal-01339299⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LABEXIMU INSA-GROUPE UDL

77 Consultations

0 Téléchargements

An Integrated Approach for Large-Scale Relation Extraction from the Web

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager