Conference Paper. Year: 2010

UJM at INEX 2009 XML Mining Track

Christine Largeron
Christophe Moulin
  • Role: Author
  • PersonId : 854129
Mathias Géry
  • Role: Author
  • PersonId : 843869

Abstract

This paper reports the experiments we carried out for the INEX 2009 XML Mining track, which consists of developing categorization methods for multi-labeled XML documents. We represent XML documents as vectors of indexed terms. The purpose of our experiments is twofold. First, we aim to compare strategies that reduce the index size using an improved feature selection criterion, CCD. Second, we compare a thresholding strategy that we proposed, MCut, with the common RCut and PCut strategies. The index size was reduced in such a way that the results were worse than expected. However, we obtained good improvements with the MCut thresholding strategy.
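
To make the thresholding comparison concrete, here is a minimal Python sketch of two of the strategies named in the abstract. It is an illustration under stated assumptions, not the authors' implementation. RCut is written in its usual rank-based form (keep the t best-scoring categories per document). The abstract does not define MCut, so the version below assumes a maximum-gap rule: the threshold is placed in the middle of the largest gap between consecutive sorted scores. The function names, the `t` parameter, and the example scores are all hypothetical.

```python
from typing import Dict, List


def rcut(scores: Dict[str, float], t: int = 1) -> List[str]:
    """Rank-based cut: keep the t highest-scoring categories for one document."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:t]


def mcut(scores: Dict[str, float]) -> List[str]:
    """Maximum-gap cut (ASSUMED definition, not taken from the abstract):
    put the threshold in the middle of the largest gap between
    consecutive scores sorted in decreasing order."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    if len(ranked) < 2:
        return ranked
    # Differences between each score and the next lower one.
    gaps = [scores[ranked[i]] - scores[ranked[i + 1]]
            for i in range(len(ranked) - 1)]
    # Index of the first largest gap.
    cut = max(range(len(gaps)), key=gaps.__getitem__)
    threshold = (scores[ranked[cut]] + scores[ranked[cut + 1]]) / 2
    return [c for c in ranked if scores[c] > threshold]


# Hypothetical classifier scores for a single document.
doc_scores = {"sports": 0.91, "politics": 0.85, "science": 0.30, "art": 0.12}

print(rcut(doc_scores, t=1))
print(mcut(doc_scores))
```

With these made-up scores, RCut with t = 1 keeps only the top category, while the gap-based rule keeps both high-scoring categories because the largest drop occurs after the second one. This is the kind of per-document adaptivity that a fixed rank cannot provide.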

Main file
paper_54.pdf (150.53 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00526610, version 1 (15-10-2010)

Cite

Christine Largeron, Christophe Moulin, Mathias Géry. UJM at INEX 2009 XML Mining Track. Focused Retrieval and Evaluation, 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, Dec 2009, Brisbane, Australia. pp.426-433, ⟨10.1007/978-3-642-14556-8⟩. ⟨hal-00526610⟩