ADEL: ADaptable Entity Linking : A Hybrid Approach to Link Entities with Linked Data for Information Extraction - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Semantic Web – Interoperability, Usability, Applicability Année : 2017

ADEL: ADaptable Entity Linking : A Hybrid Approach to Link Entities with Linked Data for Information Extraction

Julien Plu
  • Fonction : Auteur
  • PersonId : 1125442
Giuseppe Rizzo
  • Fonction : Auteur
  • PersonId : 1125443
Raphaël Troncy

Résumé

Four main challenges can cause numerous difficulties when developing an entity linking system: i) the kind of textual documents to annotate (such as social media posts, video subtitles or news articles); ii) the number of types used to categorise an entity (such as Person, Location, Organization, Date or Role); iii) the knowledge base used to disambiguate the extracted mentions (such as DBpedia, Wikidata or Musicbrainz); iv) the language used in the documents. Among these four challenges, being agnostic to the knowledge base and in particular to its coverage, whether it is encyclopedic like DBpedia or domain-specific like Musicbrainz, is arguably the most challenging one. We propose to tackle those four challenges and in order to be knowledge base agnostic, we propose a method that enables to index the data independently of the schema and vocabulary being used. More precisely, we design our index such that each entity has at least two information: a label and a popularity score such as a prior probability or a Pagerank score. This results in a framework named ADEL, an entity recognition and linking system based on a hybrid linguistic, information retrieval, and semantics-based methods. ADEL is a modular framework that is independent to the kind of text to be processed and to the knowledge base used as referent for disambiguating entities. We thoroughly evaluate the framework on six benchmark datasets: OKE2015, OKE2016, NEEL2014, NEEL2015, NEEL2016 and AIDA. Our evaluation shows that ADEL outperforms state-of-the-art systems in terms of extraction and entity typing. It also shows that our indexing approach allows to generate an accurate set of candidates from any knowledge base that makes use of linked data, respecting the required information for each entity, in a minimum of time and with a minimal size.
Fichier principal
Vignette du fichier
publi-5616.pdf (700.53 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03560444 , version 1 (07-02-2022)

Identifiants

  • HAL Id : hal-03560444 , version 1

Citer

Julien Plu, Giuseppe Rizzo, Raphaël Troncy. ADEL: ADaptable Entity Linking : A Hybrid Approach to Link Entities with Linked Data for Information Extraction. Semantic Web – Interoperability, Usability, Applicability, 2017, pp.1-5. ⟨hal-03560444⟩

Collections

EURECOM ANR
59 Consultations
81 Téléchargements

Partager

Gmail Facebook X LinkedIn More