The Wikipedia XML Corpus - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

The Wikipedia XML Corpus

Résumé

This article presents the general Wikipedia XML Collection developped for Structured Information Retrieval and Structured Machine Learning. This collection has been built from the Wikipedia Enclyclopedia. We detail particularly here which parts of this collection have been used during INEX 2006 for the Ad-hoc track and for the XML Mining track. Note that other tracks of INEX - multimedia track for example - have also been based on this collection.

Dates et versions

hal-01335922 , version 1 (22-06-2016)

Identifiants

Citer

Ludovic Denoyer, Patrick Gallinari. The Wikipedia XML Corpus. Advances in XML Information Retrieval and Evaluation: Fifth Workshop of the INitiative for the Evaluation of XML Retrieval (INEX'06), Dec 2006, Dagstuhl, Germany. pp.12-19, ⟨10.1007/978-3-540-73888-6_2⟩. ⟨hal-01335922⟩
94 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More