| HAL : hal-00699297, version 1 |
|
|
| AKBC-WEKEX 2012 - The Knowledge Extraction Workshop at NAACL-HLT 2012, Montréal, Canada (2012) |
|
|
|
|
| Population of a Knowledge Base for News Metadata from Unstructured Text and Web Data |
|
|
| Rosa Stern (1, 2), Benoît Sagot (1) |
|
|
| (2012) |
|
|
| We present a practical use case of knowledge base (KB) population at the French news agency AFP. The target KB instances are entities relevant for news production and content enrichment. In order to acquire uniquely identified entities over news wires, i.e. textual data, and integrate the resulting KB in the Linked Data framework, a series of data models need to be aligned: Web data resources are harvested for creating a wide coverage entity database, which is in turn used to link entities to their mentions in French news wires. Finally, the extracted entities are selected for instantiation in the target KB. We describe our methodology along with the resources created and used for the target KB population. |
|
| 1 : | ALPAGE (INRIA Rocquencourt) |
| INRIA – Université Paris VII - Paris Diderot | |
| 2 : | Medialab AFP (Medialab AFP) |
| Agence France-Presse | |
|
| Domain | : | Computer Science/Computation and Language |
|
|
| entity linking – web data extraction – knowledge base population |
|
|
| http://hal.archives-ouvertes.fr/hal-00699297 | |
| oai:hal.archives-ouvertes.fr:hal-00699297 | |
| Contributor: Rosa Stern | |
| Submitted on: Sunday 20 May 2012, 15:34:44 | |
| Last modified on: Friday 25 May 2012, 10:50:01 | |