GTE-Rank: a Time-aware Search Engine to Answer Time-sensitive Queries - HAL open archive
Journal article, Information Processing & Management (IPM), Year: 2016

GTE-Rank: a Time-aware Search Engine to Answer Time-sensitive Queries

Abstract

In the web environment, most of the queries issued by users are implicit by nature. Inferring the different temporal intents of this type of query improves the temporal aspect of web search results. Previous work on this problem has usually focused on news queries, where retrieving the most recent results related to the query is usually sufficient to meet the user's information needs. However, few works have studied the importance of time in queries such as “Philip Seymour Hoffman”, where the results may require no recency at all. In this work, we focus on this type of query, named “time-sensitive queries”, where the results should preferably come from a diversified time span, not necessarily the most recent one. Unlike related work, we follow a content-based approach to identify the most important time periods of the query and integrate time into a re-ranking model to boost the retrieval of documents whose contents match the query time period. For that purpose, we define a linear combination of topical and temporal scores, which reflects the relevance of any web document in both the topical and temporal dimensions, thus helping to improve the effectiveness of the ranked results across different types of queries. Our approach relies on a novel temporal similarity measure that is capable of determining the most important dates for a query while filtering out the non-relevant ones. Through extensive experimental evaluation over web corpora, we show that our model offers promising results compared to baseline approaches. As a result of our investigation, we publicly provide a set of web services and a web search interface so that the system can be graphically explored by the research community.
File not deposited

Dates and versions

hal-01496659, version 1 (27-03-2017)

Identifiers

  • HAL Id: hal-01496659, version 1

Cite

Ricardo Campos, Gaël Dias, Célia Nunes, Alípio Jorge. GTE-Rank: a Time-aware Search Engine to Answer Time-sensitive Queries. Information Processing & Management (IPM), 2016. ⟨hal-01496659⟩
