Transferring knowledge with source selection to learn IR functions on unlabeled collections

Abstract : We investigate the problem of learning an IR function on a collection without relevance judgements (called target collection) by transferring knowledge from a selected source collection with relevance judgements. To do so, we first construct, for each query in the target collection, relative relevance judgment pairs using information from the source collection closest to the query (selection and transfer steps), and then learn an IR function from the obtained pairs in the target collection (self-learning step). For the transfer step, the relevance information in the source collection is summarized as a grid that provides, for each term frequency and document frequency values of a word in a document, an empirical estimate of the relevance of the document. The self-learning step iteratively assigns pairwise preferences to documents in the target collection using the scores of the former learned function. We show the effectiveness of our approach through a series of extensive experiments on CLEF and several collections from TREC used either as target or source datasets. Our experiments show the importance of selecting the source collection prior to transfer information to the target collection, and demonstrate that the proposed approach yields results consistently and significantly above state-of-the-art IR functions.
Type de document :
Communication dans un congrès
ACM International Conference on Information and Knowledge Management, Oct 2013, San Francisco, United States. pp.2315-2320, 2013
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-00881597
Contributeur : Parantapa Goswami <>
Soumis le : vendredi 8 novembre 2013 - 15:11:11
Dernière modification le : mardi 28 octobre 2014 - 18:34:03

Identifiants

  • HAL Id : hal-00881597, version 1

Collections

LIG | UGA

Citation

Parantapa Goswami, Massih-Reza Amini, Éric Gaussier. Transferring knowledge with source selection to learn IR functions on unlabeled collections. ACM International Conference on Information and Knowledge Management, Oct 2013, San Francisco, United States. pp.2315-2320, 2013. <hal-00881597>

Partager

Métriques

Consultations de la notice

59