LIA/LINA at the INEX 2012 Tweet Contextualization track

Romain Deveaud 1 Florian Boudin 2
2 TALN
LINA - Laboratoire d'Informatique de Nantes Atlantique
Abstract : In this paper we describe our participation in the INEX 2012 Tweet Contextualization track and present our contributions. We combined Information Retrieval, Automatic Summarization and Topic Modeling techniques to provide the context of each tweet. We first formulate a specific query using hashtags and important words in the Tweets to retrieve the most relevant Wikipedia articles. Then, we segment the articles into sentences and compute several measures for each sentence, in order to estimate their contextual relevance to the topics expressed by the Tweets. Finally, the best scored sentences are used to form the context. Official results suggest that our methods performed very well compared to other participants.
Complete list of metadatas

Cited literature [5 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00755496
Contributor : Florian Boudin <>
Submitted on : Wednesday, November 21, 2012 - 2:11:06 PM
Last modification on : Saturday, March 23, 2019 - 1:22:02 AM
Long-term archiving on : Saturday, December 17, 2016 - 12:43:42 PM

File

CLEF2012wn-INEX-DeveaudEt2012....
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00755496, version 1

Citation

Romain Deveaud, Florian Boudin. LIA/LINA at the INEX 2012 Tweet Contextualization track. INitiative for the Evaluation of XML Retrieval (INEX), Sep 2012, Rome, Italy. pp.n/a. ⟨hal-00755496⟩

Share

Metrics

Record views

421

Files downloads

230