A Study of Synthetic Oversampling for Twitter Imbalanced Sentiment Analysis
Résumé
The majority of Twitter sentiment analysis systems implicitly assume that the class distribution is balanced while in practice it is usually skewed. We argue that Twitter opinion mining using learning methods should be addressed in the framework of imbalanced learning. In this work, we present a study of synthetic oversampling techniques for tweet-polarity classification. The experiments we conducted on three publicly available datasets show that these methods can improve the recognition of the minority class as well as the geometric mean criterion.
Origine : Fichiers produits par l'(les) auteur(s)
Loading...