A Theoretical Analysis of Pseudo-Relevance Feedback Models

Stéphane Clinchant; Éric Gaussier

Communication Dans Un Congrès Année : 2013

A Theoretical Analysis of Pseudo-Relevance Feedback Models

(1) , (2)

1
2

Stéphane Clinchant

Fonction : Auteur

Xerox Research Centre Europe [Meylan]

Éric Gaussier

Fonction : Auteur
PersonId : 182833
IdHAL : eric-gaussier
ORCID : 0000-0002-8858-3233
IdRef : 074308297

Analyse de données, Modélisation et Apprentissage automatique [Grenoble]

Résumé

Our goal in this study is to compare several widely used pseudo-relevance feedback (PRF) models and understand what explains their respective behavior. To do so, we first analyze how different PRF models behave through the characteristics of the terms they select and through their performance on two widely used test collections. This analysis reveals that several well-known models surprisingly tend to select very common terms, with low IDF (inverse document frequency). We then introduce several conditions PRF models should satisfy regarding both the terms they select and the way they weigh them, prior to study whether standard PRF models satisfy these conditions or not. This study reveals that most models are deficient with respect to at least one condition, and that this deficiency explains the results of our analysis of the behavior of the models, as well as some of the results reported on the respective performance of PRF models. Based on the PRF conditions, we finally propose possible corrections for the simple mixture model. The PRF models obtained after these corrections outperform their standard version and yield state-of-the-art PRF models which confirms the validity of our theoretical analysis.

Domaines

Recherche d'information [cs.IR]

Maria-Irina Nicolae : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00952994

Soumis le : vendredi 28 février 2014-09:50:41

Dernière modification le : jeudi 4 avril 2024-20:54:21

Dates et versions

hal-00952994 , version 1 (28-02-2014)

Identifiants

HAL Id : hal-00952994 , version 1

Citer

Stéphane Clinchant, Éric Gaussier. A Theoretical Analysis of Pseudo-Relevance Feedback Models. International Conference on the Theory of Information Retrieval, 2013, Denmark. pp.6. ⟨hal-00952994⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_SIDCH LIG_SIDCH_APTIKAL

70 Consultations

0 Téléchargements

A Theoretical Analysis of Pseudo-Relevance Feedback Models

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager