LSHTC: A Benchmark for Large-Scale Text Classification

Abstract : LSHTC is a series of challenges which aims to assess the performance of classification systems in large-scale classification in a a large number of classes (up to hundreds of thousands). This paper describes the dataset that have been released along the LSHTC series. The paper details the construction of the datsets and the design of the tracks as well as the evaluation measures that we implemented and a quick overview of the results. All of these datasets are available online and runs may still be submitted on the online server of the challenges.
Keywords : ClassY
Liste complète des métadonnées
Contributeur : Thierry Artieres <>
Soumis le : mercredi 24 janvier 2018 - 08:21:56
Dernière modification le : jeudi 21 mars 2019 - 14:18:47

Lien texte intégral


  • HAL Id : hal-01691460, version 1
  • ARXIV : 1503.08581


Ioannis Partalas, Aris Kosmopoulos, Nicolas Baskiotis, Thierry Artières, George Paliouras, et al.. LSHTC: A Benchmark for Large-Scale Text Classification. 2015. 〈hal-01691460〉



Consultations de la notice