Learning to Adaptively Rank Document Retrieval System Configurations - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue ACM Transactions on Information Systems Année : 2018

Learning to Adaptively Rank Document Retrieval System Configurations

Résumé

Modern Information Retrieval (IR) systems become more and more complex, involving a large number of parameters. For example, a system may choose from a set of possible retrieval models (BM25, language model, etc.), or various query expansion parameters, whose values greatly influence the overall retrieval effectiveness. Traditionally, these parameters are set at system level based on training queries, and the same parameters are then used for different queries. We observe that it may not be easy to set all these parameters separately since they can be dependent. In addition, a global setting for all queries may not best fit all individual queries with different characteristics. The parameters should be set according to these characteristics. In this paper, we propose a novel approach to tackle this problem by dealing with the entire system configurations (i.e. a set of parameters representing an IR system behaviour) instead of selecting a single parameter at a time. The selection of the best configuration is cast as a problem of ranking different possible configurations given a query. We apply learning-to-rank approaches for this task. We exploit both the query features and the system configuration features in the learning-to-rank method so that the selection of configuration is query-dependent. The experiments we conducted on four TREC ad-hoc collections show that this approach can significantly outperform the traditional method to tune system configuration globally (i.e grid search), and leads to higher effectiveness than the top performing systems of the TREC tracks. We also perform an ablation analysis on the impact of different features on the model learning capability and show that query expansion features are among the most important for adaptive systems.
Fichier principal
Vignette du fichier
deveaud_22661.pdf (2.12 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02092955 , version 1 (08-04-2019)

Identifiants

Citer

Romain Deveaud, Josiane Mothe, Md Zia Ullah, Jian-Yun Nie. Learning to Adaptively Rank Document Retrieval System Configurations. ACM Transactions on Information Systems, 2018, 37 (1), pp.1-41. ⟨10.1145/3231937⟩. ⟨hal-02092955⟩
88 Consultations
469 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More