Conference papers

Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture

Abstract : Users fulfill their information needs by expressing them as search queries and running those queries in available search engines. Mining query logs from search engines enables the automatic extraction of search tasks by clustering related queries into groups, each representing a search task. The extraction of search tasks is crucial for multiple user-supporting applications such as query recommendation, query term prediction, and results ranking depending on search tasks. Most existing search task extraction methods use graph-based or nonparametric models, which grow as the query log size increases. Deep clustering methods offer a parametric alternative, but most deep clustering architectures fail to exploit recurrent neural networks for learning text data representations. We propose a recurrent deep clustering model for extracting search tasks from query logs. The proposed architecture leverages self-training and dual recurrent encoders for learning suitable latent representations of user queries, outperforming previous deep clustering methods. It is also a parametric approach that offers the possibility of having a fixed-size architecture for analyzing increasingly large search query logs.

https://hal.archives-ouvertes.fr/hal-03195934
Contributor : Gilles Hubert
Submitted on : Monday, April 12, 2021 - 11:49:34 AM
Last modification on : Tuesday, October 19, 2021 - 2:24:23 PM

Citation

Luis Eduardo Lugo Martinez, Jose G. Moreno, Gilles Hubert. Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture. 43rd European Conference on Information Retrieval (ECIR 2021), Mar 2021, Lucca, Italy. pp.391-404, ⟨10.1007/978-3-030-72113-8_26⟩. ⟨hal-03195934⟩
