Tweet Data Mining: the Cultural Microblog Contextualization Data Set

Abstract : This paper presents an overview of the data set that was used for the Cultural Microblog Contextualization Workshop at CLEF 2016 and more specifically for the task 1: tweet contextualization. In this paper we first present a descriptive analysis of the data: we consider the variables or features associated with the tweets and analyse them. Then we also analyse the tweet textual content. The results of this work correspond to a first step toward data quality checking. It can also useful in order to understand better the data and its usefulness for some tasks or case studies.
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01671365
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Friday, December 22, 2017 - 10:58:08 AM
Last modification on : Thursday, October 17, 2019 - 8:53:13 AM

File

chaham_18770.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01671365, version 1
  • OATAO : 18770

Citation

Yassine Rkha Chaham, Clémentine Scohy, Sébastien Déjean, Josiane Mothe. Tweet Data Mining: the Cultural Microblog Contextualization Data Set. Conference and Labs of the Evaluation forum (CLEF 2016), Sep 2016, Evora, Portugal. pp. 1246-1259. ⟨hal-01671365⟩

Share

Metrics

Record views

79

Files downloads

51