Population of a Knowledge Base for News Metadata from Unstructured Text and Web Data - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Population of a Knowledge Base for News Metadata from Unstructured Text and Web Data

Résumé

We present a practical use case of knowl- edge base (KB) population at the French news agency AFP. The target KB instances are en- tities relevant for news production and con- tent enrichment. In order to acquire uniquely identified entities over news wires, i.e. tex- tual data, and integrate the resulting KB in the Linked Data framework, a series of data mod- els need to be aligned: Web data resources are harvested for creating a wide coverage entity database, which is in turn used to link entities to their mentions in French news wires. Fi- nally, the extracted entities are selected for in- stantiation in the target KB. We describe our methodology along with the resources created and used for the target KB population.
Fichier principal
Vignette du fichier
naacl12akbc.pdf (190.2 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00699297 , version 1 (20-05-2012)

Identifiants

  • HAL Id : hal-00699297 , version 1

Citer

Rosa Stern, Benoît Sagot. Population of a Knowledge Base for News Metadata from Unstructured Text and Web Data. AKBC-WEKEX 2012 - The Knowledge Extraction Workshop at NAACL-HLT 2012, Jun 2012, Montréal, Canada. ⟨hal-00699297⟩
246 Consultations
142 Téléchargements

Partager

Gmail Facebook X LinkedIn More