Segmenting Search Query Logs by Learning to Detect Search Task Boundaries

Luis Eduardo Lugo Martinez; Jose G. Moreno; Gilles Hubert

doi:10.1145/3397271.3401257

Communication Dans Un Congrès Année : 2020

Segmenting Search Query Logs by Learning to Detect Search Task Boundaries

(1) , (1) , (1)

Luis Eduardo Lugo Martinez

Fonction : Auteur
PersonId : 739684
IdHAL : luis-lugo-m

Recherche d’Information et Synthèse d’Information

Jose G. Moreno

Fonction : Auteur
PersonId : 743396
IdHAL : jose-g-moreno
ORCID : 0000-0002-8852-5797
IdRef : 190544007

Recherche d’Information et Synthèse d’Information

Gilles Hubert

Fonction : Auteur
PersonId : 737483
IdHAL : ghubert
ORCID : 0000-0003-3494-7561
IdRef : 031979890

Recherche d’Information et Synthèse d’Information

Résumé

To fulfill their information needs, users submit sets of related queries to available search engines. Query logs record users' activities along with timestamps and additional search-related information. The analysis of those chronological query logs enables the modeling of search tasks from user interactions. Previous research works rely on clicked URLs and surrounding queries to determine if adjacent queries are part of the same search tasks to segment the query logs properly. However, waiting for clicked URLs or future adjacent queries could render the use of these methods unfeasible in user supporting applications that require model results on the fly. Therefore, we propose a model for sequential search log segmentation. The proposed model uses only query pairs and their time span, generating results suited for on the fly user supporting applications, with improved accuracy over existing search segmentation approaches. We also show the advantages of fine-tuning the proposed model for adjusting the architecture to a small annotated collection.

Mots clés

Information need Search task segmentation Recurrent neural net-work

Domaines

Informatique et langage [cs.CL]

Fichier principal

task_seg.pdf (634.17 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Gilles HUBERT : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02959155

Soumis le : jeudi 19 novembre 2020-15:26:03

Dernière modification le : mercredi 17 janvier 2024-16:14:38

Archivage à long terme le : samedi 20 février 2021-20:06:13

Dates et versions

hal-02959155 , version 1 (19-11-2020)

Identifiants

HAL Id : hal-02959155 , version 1
DOI : 10.1145/3397271.3401257

Citer

Luis Eduardo Lugo Martinez, Jose G. Moreno, Gilles Hubert. Segmenting Search Query Logs by Learning to Detect Search Task Boundaries. SIGIR 2020: 43rd International ACM SIGIR conference on research and development in Information Retrieval, Jul 2020, Virtual Event China, China. pp.2037-2040, ⟨10.1145/3397271.3401257⟩. ⟨hal-02959155⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS SMS UT1-CAPITOLE IRIT IRIT-IRIS IRIT-GD IRIT-UT3 TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

90 Consultations

142 Téléchargements

Segmenting Search Query Logs by Learning to Detect Search Task Boundaries

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager