Language-independent Query Representation for IR Model Parameter Estimation on Unlabeled Collections - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Language-independent Query Representation for IR Model Parameter Estimation on Unlabeled Collections

Résumé

We study here the problem of estimating the parameters of standard IR models (as BM25 or language models) on new collections without any relevance judgments, by using collections with already available relevance judgements. We propose different query representations that allow mapping queries (with and without relevance judgments, from different collections, potentially in different languages) into a common space. We then introduce a kernel regression approach to learn the parameters of standard IR models individually for each query in the new, unlabeled collection. Our experiments, conducted on standard English and Indian IR collections, show that our approach can be used to efficiently tune, query by query, standard IR models to new collections, potentially written in different languages. In particular, the versions of the standard IR models we obtain not only outperform the versions with default parameters, but can also outperform the versions in which the parameter values have been optimized globally over a set of queries with target relevance judgements.
Fichier non déposé

Dates et versions

hal-01236589 , version 1 (01-12-2015)

Identifiants

  • HAL Id : hal-01236589 , version 1

Citer

Parantapa Goswami, Massih-Reza Amini, Eric Gaussier. Language-independent Query Representation for IR Model Parameter Estimation on Unlabeled Collections. ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR 2015), Sep 2015, Northampton, Massachusetts, United States. ⟨hal-01236589⟩
90 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More