Skip to Main content Skip to Navigation
Conference papers

Transferring knowledge with source selection to learn IR functions on unlabeled collections

Abstract : We investigate the problem of learning an IR function on a collection without relevance judgements (called target collection) by transferring knowledge from a selected source collection with relevance judgements. To do so, we first construct, for each query in the target collection, relative relevance judgment pairs using information from the source collection closest to the query (selection and transfer steps), and then learn an IR function from the obtained pairs in the target collection (self-learning step). For the transfer step, the relevance information in the source collection is summarized as a grid that provides, for each term frequency and document frequency values of a word in a document, an empirical estimate of the relevance of the document. The self-learning step iteratively assigns pairwise preferences to documents in the target collection using the scores of the former learned function. We show the effectiveness of our approach through a series of extensive experiments on CLEF and several collections from TREC used either as target or source datasets. Our experiments show the importance of selecting the source collection prior to transfer information to the target collection, and demonstrate that the proposed approach yields results consistently and significantly above state-of-the-art IR functions.
Complete list of metadata
Contributor : Parantapa Goswami <>
Submitted on : Friday, November 8, 2013 - 3:11:11 PM
Last modification on : Friday, November 20, 2020 - 2:54:16 PM


  • HAL Id : hal-00881597, version 1




Parantapa Goswami, Massih-Reza Amini, Éric Gaussier. Transferring knowledge with source selection to learn IR functions on unlabeled collections. ACM International Conference on Information and Knowledge Management, Oct 2013, San Francisco, United States. pp.2315-2320. ⟨hal-00881597⟩



Record views