Conference papers

Modèles d'information pour la recherche multilingue

Bo Li, Éric Gaussier *
* Corresponding author
Abstract : This paper presents well-founded cross-language extensions of two recently introduced models in the information-based family for information retrieval: the LL (log-logistic) and SPL (smoothed power law) models of Clinchant et al. (2010). The extensions are based on (a) a generalization of the notion of information used in the information-based family, (b) a generalization of the random variables used in this family, and (c) the direct expansion of query terms with their translations. We first review these extensions from a theoretical point of view and then assess them experimentally. Experimental comparisons between these extensions and existing cross-language information retrieval (CLIR) systems, on three collections and three language pairs, show that the cross-language extension of the LL model provides a state-of-the-art CLIR system, yielding the best performance overall.

https://hal.archives-ouvertes.fr/hal-00741697
Contributor : Eric Gaussier
Submitted on : Monday, October 15, 2012 - 10:03:46 AM
Last modification on : Monday, April 20, 2020 - 11:24:01 AM
Document(s) archived on : Saturday, December 17, 2016 - 12:58:02 AM

File

Li-coria2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00741697, version 1

Citation

Bo Li, Éric Gaussier. Modèles d'information pour la recherche multilingue. CORIA 2012 - COnférence en Recherche d'Information et Applications, Mar 2012, Bordeaux, France. pp.9-24. ⟨hal-00741697⟩
