A Study of Synthetic Oversampling for Twitter Imbalanced Sentiment Analysis - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

A Study of Synthetic Oversampling for Twitter Imbalanced Sentiment Analysis

Résumé

The majority of Twitter sentiment analysis systems implicitly assume that the class distribution is balanced while in practice it is usually skewed. We argue that Twitter opinion mining using learning methods should be addressed in the framework of imbalanced learning. In this work, we present a study of synthetic oversampling techniques for tweet-polarity classification. The experiments we conducted on three publicly available datasets show that these methods can improve the recognition of the minority class as well as the geometric mean criterion.
Fichier principal
Vignette du fichier
paper_dmnlp_16.pdf (330.07 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01504684 , version 1 (10-04-2017)

Identifiants

  • HAL Id : hal-01504684 , version 1

Citer

Julien Ah-Pine, Edmundo-Pavel Soriano-Morales. A Study of Synthetic Oversampling for Twitter Imbalanced Sentiment Analysis. Workshop on Interactions between Data Mining and Natural Language Processing (DMNLP 2016), Sep 2016, Riva del Garda, Italy. ⟨hal-01504684⟩
730 Consultations
842 Téléchargements

Partager

Gmail Facebook X LinkedIn More