Stacked Gender Prediction from Tweet Texts and Images Notebook for PAN at CLEF 2018 - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Stacked Gender Prediction from Tweet Texts and Images Notebook for PAN at CLEF 2018

Résumé

This paper describes our participation at the PAN 2018 Author Profiling shared task. Given texts and images from some Twitter's authors, the goal is to estimate their genders. We considered all the languages (Arabic, English and Spanish) and all the prediction types (only from texts, only from images and combined). The final submitted system is a stacked classifier composed of two main parts. The first one, based on previous PAN Author Profiling editions, concerns gender prediction from texts. It consists in a pipeline of preprocessing, word n-grams from 1 to 2, TF-IDF with sublinear weighting, Linear Support Vector classification and probability calibration. The second part is formed by different layers of classifiers used for gender estimation from images: four base classifiers (object detection, face recognition, colour histograms, local binary patterns) in the first layer, a meta classifier in the second layer and an aggregation classifier as third layer. Finally, the two gender predictions, from texts and images, feed into the last layer classifier that provides the combined gender predictions.
Fichier principal
Vignette du fichier
Ciccone_paper_111_vf.pdf (296.38 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02013987 , version 1 (11-02-2019)

Identifiants

  • HAL Id : hal-02013987 , version 1

Citer

Giovanni Ciccone, Arthur Sultan, Léa Laporte, Elod Egyed-Zsigmond, Alaa Alhamzeh, et al.. Stacked Gender Prediction from Tweet Texts and Images Notebook for PAN at CLEF 2018. CLEF 2018 - Conference and Labs of the Evaluation, Sep 2018, Avignon, France. 11p. ⟨hal-02013987⟩
169 Consultations
203 Téléchargements

Partager

Gmail Facebook X LinkedIn More