Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture

Luis Eduardo Lugo Martinez; Jose G. Moreno; Gilles Hubert

doi:10.1007/978-3-030-72113-8_26

Communication Dans Un Congrès Année : 2021

Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture

(1) , (1) , (1)

Luis Eduardo Lugo Martinez

Fonction : Auteur
PersonId : 739684
IdHAL : luis-lugo-m

Recherche d’Information et Synthèse d’Information

Jose G. Moreno

Fonction : Auteur
PersonId : 743396
IdHAL : jose-g-moreno
ORCID : 0000-0002-8852-5797
IdRef : 190544007

Recherche d’Information et Synthèse d’Information

Gilles Hubert

Fonction : Auteur
PersonId : 737483
IdHAL : ghubert
ORCID : 0000-0003-3494-7561
IdRef : 031979890

Recherche d’Information et Synthèse d’Information

Résumé

Users fulfill their information needs by expressing them using search queries and running the queries in available search engines. The mining of query logs from search engines enables the automatic extraction of search tasks by clustering related queries into groups representing search tasks. The extraction of search tasks is crucial for multiple user supporting applications like query recommendation, query term prediction, and results ranking depending on search tasks. Most existing search task extraction methods use graph-based or nonparametric models, which grow as the query log size increases. Deep clustering methods offer a parametric alternative, but most deep clustering architectures fail to exploit recurrent neural networks for learning text data representations. We propose a recurrent deep clustering model for extracting search tasks from query logs. The proposed architecture leverages self-training and dual recurrent encoders for learning suitable latent representations of user queries, outperforming previous deep clustering methods. It is also a parametric approach that offers the possibility of having a fixed-sized architecture for analyzing increasingly large search query logs.

Mots clés

Search task extraction Deep clustering Recurrent neural networks

Domaines

Recherche d'information [cs.IR] Sciences de l'information et de la communication

Gilles HUBERT : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03195934

Soumis le : lundi 12 avril 2021-11:49:34

Dernière modification le : lundi 20 novembre 2023-11:44:23

Dates et versions

hal-03195934 , version 1 (12-04-2021)

Identifiants

HAL Id : hal-03195934 , version 1
DOI : 10.1007/978-3-030-72113-8_26

Citer

Luis Eduardo Lugo Martinez, Jose G. Moreno, Gilles Hubert. Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture. 43rd European Conference on Information Retrieval (ECIR 2021), Mar 2021, Lucca, Italy. pp.391-404, ⟨10.1007/978-3-030-72113-8_26⟩. ⟨hal-03195934⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS SMS UT1-CAPITOLE IRIT IRIT-IRIS IRIT-GD IRIT-UT3 TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

95 Consultations

0 Téléchargements

Extracting Search Tasks from Query Logs Using a Recurrent Deep Clustering Architecture

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager