Conference paper. Year: 2017

Demographic Word Embeddings for Racism Detection on Twitter

Abstract

Most social media platforms grant users freedom of speech by allowing them to freely express their thoughts, beliefs, and opinions. Although this represents incredible and unique communication opportunities, it also presents important challenges. Online racism is one such challenge. In this study, we present a supervised learning strategy to detect racist language on Twitter based on word embeddings that incorporate demographic (age, gender, and location) information. Our methodology achieves reasonable classification accuracy on a gold-standard dataset (F1 = 76.3%) and significantly improves on the classification performance of demographic-agnostic models.
No file deposited

Dates and versions

hal-01586739 , version 1 (13-09-2017)

Identifiers

  • HAL Id : hal-01586739 , version 1

Cite

Mohammed Hasanuzzaman, Gaël Dias, Andy Way. Demographic Word Embeddings for Racism Detection on Twitter. 8th International Joint Conference on Natural Language Processing (IJCNLP 2017), 2017, Taipei, Taiwan. ⟨hal-01586739⟩