Mine your own business! Mine other's news! - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Mine your own business! Mine other's news!

Résumé

Major media companies such as The Financial Times, the Wall Street Journal or Reuters generate huge amounts of textual news data on a daily basis. Mining frequent patterns in this mass of information is critical for knowledge workers such as financial analysts, stock traders or economists. Using existing frequent pattern mining (FPM) algorithms for the analysis of news data is difficult because of the size and lack of structuring of the free text news content. In this article, we demonstrate a comprehensive Streaming TEmporAl Data (STEAD) analysis framework for mining frequent patterns in financial news. In this demonstration, we show how the mining task is supported by the use of a Time-Aware Content Summarization algorithm (TACS). This summary generates a concise representation of large volume of data by taking into account the expert's peculiar interest while preserving the news arrival temporal information which is essential for FPM algorithms. We experimented the whole framework on a set of news data from Reuters.
Fichier principal
Vignette du fichier
EDBT_TOI_Summarization.pdf (260.47 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00466855 , version 1 (25-03-2010)

Identifiants

Citer

Quang-Khai Pham, Régis Saint-Paul, Boualem Benatallah, Noureddine Mouaddib, Guillaume Raschia. Mine your own business! Mine other's news!. 11th International Conference on Extending Database technology: Advances in database technology (EDBT), Mar 2008, Nantes, France. pp.725-729, ⟨10.1145/1353343.1353436⟩. ⟨hal-00466855⟩
117 Consultations
177 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More