Entity Resolution in the Web of Data - Archive ouverte HAL Accéder directement au contenu
Ouvrages Année : 2015

Entity Resolution in the Web of Data

Résumé

In recent years, several knowledge bases have been built to enable large-scale knowledge sharing, but also an entity-centric Web search, mixing both structured data and text querying. These knowledge bases offer machine-readable descriptions of real-world entities, e.g., persons, places, published on the Web as Linked Data. However, due to the different information extraction tools and curation policies employed by knowledge bases, multiple, complementary and sometimes conflicting descriptions of the same real-world entities may be provided. Entity resolution aims to identify different descriptions that refer to the same entity appearing either within or across knowledge bases. The objective of this book is to present the new entity resolution challenges stemming from the openness of the Web of data in describing entities by an unbounded number of knowledge bases, the semantic and structural diversity of the descriptions provided across domains even for the same real-world entities, as well as the autonomy of knowledge bases in terms of adopted processes for creating and curating entity descriptions. The scale, diversity, and graph structuring of entity descriptions in the Web of data essentially challenge how two descriptions can be effectively compared for similarity, but also how resolution algorithms can efficiently avoid examining pairwise all descriptions. The book covers a wide spectrum of entity resolution issues at the Web scale, including basic concepts and data structures, main resolution tasks and workflows, as well as state-of-the-art algorithmic techniques and experimental trade-offs.

Dates et versions

hal-01191691 , version 1 (02-09-2015)

Identifiants

Citer

Vassilis Christophides, Vasilis Efthymiou, Kostas Stefanidis. Entity Resolution in the Web of Data. Morgan & Claypool, 5 (3), pp.1-122, 2015, Synthesis Lectures on the Semantic Web: Theory and Technology, Ying Ding, Indiana University Paul Groth, Elsevier Labs, ⟨10.2200/S00655ED1V01Y201507WBE013⟩. ⟨hal-01191691⟩

Collections

INRIA INRIA2
1762 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More