Transferring knowledge with source selection to learn IR functions on unlabeled collections

Parantapa Goswami; Massih-Reza Amini; Éric Gaussier

Communication Dans Un Congrès Année : 2013

Transferring knowledge with source selection to learn IR functions on unlabeled collections

(1) , (1) , (2)

1
2

Parantapa Goswami

Fonction : Auteur
PersonId : 947970

Laboratoire d'Informatique de Grenoble

Massih-Reza Amini

Fonction : Auteur
PersonId : 747054
IdHAL : massih-reza-amini
ORCID : 0000-0001-9032-4233
IdRef : 132277042

Laboratoire d'Informatique de Grenoble

Éric Gaussier

Fonction : Auteur
PersonId : 182833
IdHAL : eric-gaussier
ORCID : 0000-0002-8858-3233
IdRef : 074308297

Analyse de données, Modélisation et Apprentissage automatique [Grenoble]

Résumé

We investigate the problem of learning an IR function on a collection without relevance judgements (called target collection) by transferring knowledge from a selected source collection with relevance judgements. To do so, we first construct, for each query in the target collection, relative relevance judgment pairs using information from the source collection closest to the query (selection and transfer steps), and then learn an IR function from the obtained pairs in the target collection (self-learning step). For the transfer step, the relevance information in the source collection is summarized as a grid that provides, for each term frequency and document frequency values of a word in a document, an empirical estimate of the relevance of the document. The self-learning step iteratively assigns pairwise preferences to documents in the target collection using the scores of the former learned function. We show the effectiveness of our approach through a series of extensive experiments on CLEF and several collections from TREC used either as target or source datasets. Our experiments show the importance of selecting the source collection prior to transfer information to the target collection, and demonstrate that the proposed approach yields results consistently and significantly above state-of-the-art IR functions.

Mots clés

Domain adaptation knowledge transfer source selection learning to rank transductive learning

Domaines

Recherche d'information [cs.IR] Apprentissage [cs.LG]

Parantapa Goswami : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00881597

Soumis le : vendredi 8 novembre 2013-15:11:11

Dernière modification le : jeudi 4 avril 2024-18:26:00

Dates et versions

hal-00881597 , version 1 (08-11-2013)

Identifiants

HAL Id : hal-00881597 , version 1

Citer

Parantapa Goswami, Massih-Reza Amini, Éric Gaussier. Transferring knowledge with source selection to learn IR functions on unlabeled collections. ACM International Conference on Information and Knowledge Management, Oct 2013, San Francisco, United States. pp.2315-2320. ⟨hal-00881597⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_SIDCH LIG_SIDCH_APTIKAL

100 Consultations

0 Téléchargements

Transferring knowledge with source selection to learn IR functions on unlabeled collections

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager