Demographic Word Embeddings for Racism Detection on Twitter

Mohammed Hasanuzzaman; Gaël Dias; Andy Way

Communication Dans Un Congrès Année : 2017

Demographic Word Embeddings for Racism Detection on Twitter

(1) , (1) , (2)

1
2

Mohammed Hasanuzzaman

Fonction : Auteur

Equipe Hultech - Laboratoire GREYC - UMR6072

Gaël Dias

Fonction : Auteur
PersonId : 3735
IdHAL : gael-dias
ORCID : 0000-0002-5840-1603
IdRef : 113779747

Equipe Hultech - Laboratoire GREYC - UMR6072

Andy Way

Fonction : Auteur
PersonId : 880461

National Centre for Language Technology

Résumé

Most social media platforms grant usersfreedom of speech by allowing them tofreely express their thoughts, beliefs, andopinions. Although this represents incredible and unique communication opportunities, it also presents important challenges. Online racism is such an example. In this study, we present a super-vised learning strategy to detect racist language on Twitter based on word embeddings that incorporate demographic (Age, Gender, and Location) information. Our methodology achieves reasonable classification accuracy over a gold standard dataset (F1=76.3%) and significantly improves over classification performance of demographic-agnostic model

Domaines

Traitement du texte et du document Apprentissage [cs.LG] Réseaux sociaux et d'information [cs.SI]

Gaël Dias : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01586739

Soumis le : mercredi 13 septembre 2017-11:05:32

Dernière modification le : mercredi 20 mars 2024-16:20:04

Dates et versions

hal-01586739 , version 1 (13-09-2017)

Identifiants

HAL Id : hal-01586739 , version 1

Citer

Mohammed Hasanuzzaman, Gaël Dias, Andy Way. Demographic Word Embeddings for Racism Detection on Twitter. 8th International Joint Conference on Natural Language Processing (IJCNLP 2017), 2017, Taipei, Taiwan. ⟨hal-01586739⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS GREYC GREYC-HULTECH COMUE-NORMANDIE ENSICAEN UNICAEN

89 Consultations

0 Téléchargements

Demographic Word Embeddings for Racism Detection on Twitter

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager