An Information Theoretic Approach to Automatic Query Expansion - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue ACM Transactions on Information Systems Année : 2001

An Information Theoretic Approach to Automatic Query Expansion

Claudio Carpineto
  • Fonction : Auteur
  • PersonId : 950284
Giovanni Romano
  • Fonction : Auteur
  • PersonId : 883087
Brigitte Bigi

Résumé

Techniques for automatic query expansion from top retrieved documents have shown promise for improving retrieval effectiveness on large collections; however, they often rely on an empirical ground, and there is a shortage of cross-system comparisons. Using ideas from Information Theory, we present a computationally simple and theoretically justified method for assigning scores to candidate expansion terms. Such scores are used to select and weight expansion terms within Rocchio's framework for query reweigthing. We compare ranking with information-theoretic query expansion versus ranking with other query expansion techniques, showing that the former achieves better retrieval effectiveness on several performance measures. We also discuss the effect on retrieval effectiveness of the main parameters involved in automatic query expansion, such as data sparseness, query difficulty, number of selected documents, and number of selected terms, pointing out interesting relationships.
Fichier non déposé

Dates et versions

hal-01392277 , version 1 (04-11-2016)

Identifiants

  • HAL Id : hal-01392277 , version 1

Citer

Claudio Carpineto, Renato de Mori, Giovanni Romano, Brigitte Bigi. An Information Theoretic Approach to Automatic Query Expansion. ACM Transactions on Information Systems, 2001, 19 (1), http://dl.acm.org/citation.cfm?doid=366836.366860. ⟨hal-01392277⟩

Collections

UNIV-AVIGNON LIA
72 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More