Studying the Variability of System Setting Effectiveness by Data Analytics and Visualization - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Studying the Variability of System Setting Effectiveness by Data Analytics and Visualization

Résumé

Search engines differ from their modules and parameters; defining the optimal system setting is challenging the more because of the complexity of a retrieval stream. The main goal of this study is to determine which are the most important system components and parameters in system setting, thus which ones should be tuned as the first priority. We carry out an extensive analysis of 20, 000 different system settings applied to three TREC ad-hoc collections. Our analysis includes zooming in and out the data using various data analysis methods such as ANOVA, CART, and data visualization. We found that the query expansion model is the most significant component that changes the system effectiveness, consistently across collections. Zooming in the queries, we show that the most significant component changes to the retrieval model when considering easy queries only. The results of our study are directly re-usable for the system designers and for system tuning.
Fichier principal
Vignette du fichier
Dejean_26250.pdf (1.34 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02930098 , version 1 (04-09-2020)

Identifiants

Citer

Sébastien Dejean, Josiane Mothe, Md Zia Ullah. Studying the Variability of System Setting Effectiveness by Data Analytics and Visualization. Conference and Labs of the Evaluation Forum, Living Labs (CLEF 2019), Sep 2019, Lugano, Switzerland. pp.62-74, ⟨10.1007/978-3-030-28577-7_3⟩. ⟨hal-02930098⟩
46 Consultations
39 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More