Addressing Different Evaluation Environments for Information Retrieval through Pivot Systems
Abstract
Classical evaluations of Information Retrieval systems, under the Cranfield paradigm, compare several systems within a single evaluation environment, defined by its settings (document collection, topics, assessments and evaluation measures). In this paper, we propose a framework to compare systems across several evaluation environments. To achieve this goal, we investigate the use of pivot systems, which allow an indirect comparison of systems across evaluation environments by computing Result Deltas, i.e. the differences between their evaluation measure values. We detail the proposed pivot-based methodology, define the characteristics of a pivot, and present experiments to validate our proposal (and in particular the pivot characteristics). Using the 2018 and 2020 CLEF eHealth evaluation campaigns (Goeuriot et al., 2020), we create altered environments that differ in their topic sets. We explore the behaviour of the metrics and pivots by measuring the correlation between the result deltas, and by comparing the ranking of systems obtained through the pivots with the official ranking of the systems. Our experiments show that correlations can vary greatly according to the chosen pivot and metric. We show that some pivot/metric pairs achieve high correlation values across the altered environments, with a ranking of systems similar to the official one.
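The core computation described above can be sketched in a few lines. The snippet below is an illustrative toy example, not the paper's implementation: the system names, scores, and the single-metric setting are invented for demonstration. It computes Result Deltas (each system's score minus the pivot's score) in two hypothetical environments, then measures the agreement between the delta-based orderings with a simple Kendall's tau.

```python
from itertools import combinations

# Hypothetical scores (system -> evaluation measure value) in two
# evaluation environments; all names and values are illustrative.
env_a = {"pivot": 0.50, "sys1": 0.62, "sys2": 0.45, "sys3": 0.58}
env_b = {"pivot": 0.41, "sys1": 0.55, "sys2": 0.33, "sys3": 0.49}

def result_deltas(env, pivot="pivot"):
    """Result Delta: a system's score minus the pivot system's score."""
    return {s: v - env[pivot] for s, v in env.items() if s != pivot}

def kendall_tau(scores_x, scores_y):
    """Kendall's tau between the rankings induced by two score dicts."""
    systems = sorted(scores_x)
    concordant = discordant = 0
    for a, b in combinations(systems, 2):
        sign = (scores_x[a] - scores_x[b]) * (scores_y[a] - scores_y[b])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(systems) * (len(systems) - 1) / 2
    return (concordant - discordant) / n_pairs

deltas_a = result_deltas(env_a)
deltas_b = result_deltas(env_b)
# Agreement between the delta-based orderings of the two environments.
tau = kendall_tau(deltas_a, deltas_b)
```

A high tau would indicate that, relative to this pivot, the systems order consistently across the two environments, which is the kind of stability the pivot characteristics are meant to capture.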
Domains
Information Retrieval [cs.IR]
Origin: Files produced by the author(s)