Datasets for Aspect-Based Sentiment Analysis in French
Résumé
Aspect Based Sentiment Analysis (ABSA) is the task of mining and summarizing opinions from text about specific entities and their aspects. This article describes two datasets for the development and testing of ABSA systems for French which comprise user reviews annotated with relevant entities, aspects and polarity values. The first dataset contains 457~restaurant reviews (2365~sentences) for training and testing ABSA systems, while the second contains 162~museum reviews (655~sentences) for testing in this previously unseen domain for which training data is not available. Both datasets were built as part of SemEval-2016 Task~5 ``Aspect-Based Sentiment Analysis'' where seven different languages were represented, and are publicly available for research purposes.
This article provides examples and statistics by annotation type, summarizes the annotation guidelines and discusses their cross-lingual applicability. It also explains how the data was used for evaluation in the SemEval ABSA task and briefly presents the results obtained for French.
Origine : Fichiers produits par l'(les) auteur(s)
Loading...