Accurate and Effective Latent Concept Modeling for Ad Hoc Information Retrieval

Abstract : A keyword query is the representation of the information need of a user, and is the result of a complex cognitive process which often results in under-specification. We propose an unsupervised method namely Latent Concept Modeling (LCM) for mining and modeling latent search concepts in order to recreate the conceptual view of the original information need. We use Latent Dirichlet Allocation (LDA) to exhibit highly-specific query-related topics from pseudo-relevant feedback documents. We define these topics as the latent concepts of the user query. We perform a thorough evaluation of our approach over two large ad-hoc TREC collections. Our findings reveal that the proposed method accurately models latent concepts, while being very effective in a query expansion retrieval setting.
Document type :
Journal articles
Complete list of metadatas

Cited literature [44 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01002716
Contributor : Romain Deveaud <>
Submitted on : Friday, June 6, 2014 - 3:59:42 PM
Last modification on : Tuesday, April 2, 2019 - 2:03:25 AM
Long-term archiving on : Saturday, September 6, 2014 - 12:21:08 PM

File

DN.pdf
Files produced by the author(s)

Identifiers

Citation

Romain Deveaud, Eric Sanjuan, Patrice Bellot. Accurate and Effective Latent Concept Modeling for Ad Hoc Information Retrieval. Revue des Sciences et Technologies de l'Information - Série Document Numérique, Lavoisier, 2014, pp.61-84. ⟨10.3166/DN.17.1.61-84⟩. ⟨hal-01002716⟩

Share

Metrics

Record views

1586

Files downloads

1963