LIA/LINA at the INEX 2012 Tweet Contextualization track

Romain Deveaud 1 Florian Boudin 2
2 TALN
LINA - Laboratoire d'Informatique de Nantes Atlantique
Abstract: This paper describes our participation in the INEX 2012 Tweet Contextualization track and presents our contributions. We combine Information Retrieval, Automatic Summarization and Topic Modeling techniques to provide the context of each tweet. We first formulate a specific query from the hashtags and important words of the tweet to retrieve the most relevant Wikipedia articles. We then segment the articles into sentences and compute several measures for each sentence to estimate its contextual relevance to the topics expressed by the tweet. Finally, the highest-scoring sentences are used to form the context. Official results suggest that our methods performed very well compared to those of other participants.
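The pipeline outlined in the abstract (query formulation, sentence segmentation, sentence scoring, selection) can be illustrated with a minimal sketch. It is not the authors' system. The function names, the stopword list, the double weight given to hashtags and the single term-overlap score are all assumptions made for illustration. The paper computes several measures per sentence and retrieves the Wikipedia articles itself, whereas here the articles are simply passed in as strings.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "and", "are", "for", "in", "is", "it", "of", "the", "to", "was"}


def tokenize(text):
    """Lowercase the text and return its alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_query(tweet):
    """Build a weighted bag of words from a tweet.

    Hashtags are counted once as ordinary words and once more as hashtags,
    so they end up with twice the weight (an assumed weighting).
    """
    hashtags = [h.lower() for h in re.findall(r"#(\w+)", tweet)]
    words = [t for t in tokenize(tweet) if t not in STOPWORDS]
    return Counter(words) + Counter(hashtags)


def split_sentences(text):
    """Naive sentence segmentation on ., ! or ? followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def score_sentence(query, sentence):
    """Sum the query weights of the distinct terms in the sentence,
    normalized by the square root of the sentence length."""
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    overlap = sum(query[t] for t in set(tokens))
    return overlap / len(tokens) ** 0.5


def contextualize(tweet, articles, max_sentences=2):
    """Return the highest-scoring sentences with a non-zero score."""
    query = build_query(tweet)
    scored = []
    for article in articles:
        for sentence in split_sentences(article):
            scored.append((score_sentence(query, sentence), sentence))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [s for score, s in scored if score > 0][:max_sentences]


if __name__ == "__main__":
    tweet = "Results of the #INEX tweet contextualization track are out"
    articles = [
        "INEX is an evaluation forum for XML retrieval. It was founded in 2002.",
        "Rome is the capital of Italy. The city hosts many conferences.",
    ]
    print(contextualize(tweet, articles))
```

A real system would replace the overlap score with a combination of several relevance measures and would use a search engine over a Wikipedia index for the retrieval step.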
Document type:
Conference paper
INitiative for the Evaluation of XML Retrieval (INEX), Sep 2012, Rome, Italy. pp.n/a, 2012


https://hal.archives-ouvertes.fr/hal-00755496
Contributor: Florian Boudin
Submitted on: Wednesday, November 21, 2012 - 14:11:06
Last modified on: Thursday, April 5, 2018 - 10:37:00
Document(s) archived on: Saturday, December 17, 2016 - 12:43:42

File

CLEF2012wn-INEX-DeveaudEt2012....
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00755496, version 1

Citation

Romain Deveaud, Florian Boudin. LIA/LINA at the INEX 2012 Tweet Contextualization track. INitiative for the Evaluation of XML Retrieval (INEX), Sep 2012, Rome, Italy. pp.n/a, 2012. 〈hal-00755496〉

Metrics

Record views: 413

File downloads: 224