Conference papers

Human-Based Query Difficulty Prediction

Abstract : The purpose of an automatic query difficulty predictor is to decide whether an information retrieval system is able to provide the most appropriate answer for a current query. Researchers have investigated many types of automatic query difficulty predictors. These are mostly related to how search engines process queries and documents: they are based on the inner workings of searching/ranking system functions, and therefore they do not provide any really insightful explanation as to the reasons for the difficulty, and they neglect user-oriented aspects. In this paper we study whether humans can provide useful explanations, or reasons, of why they think a query will be easy or difficult for a search engine. We run two experiments with variations in the TREC reference collection, the amount of information available about the query, and the method of annotation generation. We examine the correlation between the human prediction, the reasons they provide, the automatic prediction, and the actual system effectiveness. The main findings of this study are twofold. First, we confirm the result of previous studies stating that human predictions correlate only weakly with system effectiveness. Second, and probably more important, after analyzing the reasons given by the annotators we find that: (i) overall, the reasons seem coherent, sensible, and informative; (ii) humans have an accurate picture of some query or term characteristics; and (iii) yet, they cannot reliably predict system/query difficulty.
Keywords : SIGEVI

Contributor : Open Archive Toulouse Archive Ouverte (oatao)
Submitted on : Monday, February 19, 2018 - 4:00:11 PM
Last modification on : Thursday, March 18, 2021 - 2:18:36 PM
Long-term archiving on : Monday, May 7, 2018 - 11:58:50 AM




  • HAL Id : hal-01712541, version 1
  • OATAO : 18854


Adrian-Gabriel Chifu, Sébastien Déjean, Stefano Mizzaro, Josiane Mothe. Human-Based Query Difficulty Prediction. 39th European Colloquium on Information Retrieval (ECIR 2017), Apr 2017, Aberdeen, Scotland, United Kingdom. pp. 343-356. ⟨hal-01712541⟩


