Skip to Main content Skip to Navigation
Conference papers

Tweet Data Mining: the Cultural Microblog Contextualization Data Set

Abstract : This paper presents an overview of the data set that was used for the Cultural Microblog Contextualization Workshop at CLEF 2016 and more specifically for the task 1: tweet contextualization. In this paper we first present a descriptive analysis of the data: we consider the variables or features associated with the tweets and analyse them. Then we also analyse the tweet textual content. The results of this work correspond to a first step toward data quality checking. It can also useful in order to understand better the data and its usefulness for some tasks or case studies.
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Friday, December 22, 2017 - 10:58:08 AM
Last modification on : Wednesday, June 9, 2021 - 1:10:03 PM


Files produced by the author(s)


  • HAL Id : hal-01671365, version 1
  • OATAO : 18770


Yassine Rkha Chaham, Clémentine Scohy, Sébastien Dejean, Josiane Mothe. Tweet Data Mining: the Cultural Microblog Contextualization Data Set. Conference and Labs of the Evaluation forum (CLEF 2016), Sep 2016, Evora, Portugal. pp. 1246-1259. ⟨hal-01671365⟩



Record views


Files downloads