Conference paper, Year: 2019

The SMarT Classifier for Arabic Fine-Grained Dialect Identification

Abstract

This paper describes the approach adopted by the SMarT research group to build a dialect identification system for the MADAR shared task on Arabic fine-grained dialect identification. We experimented with several approaches and finally chose a Multinomial Naïve Bayes classifier based on word and character n-grams, with language model probabilities as additional features. We achieved a macro accuracy of 67.73% and a macro-averaged F1-score of 67.31%.
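To make the classifier described in the abstract concrete, here is a minimal sketch of a Multinomial Naïve Bayes model over word and character n-grams, written in plain Python. It is not the authors' implementation: the n-gram ranges, the add-one smoothing, the function and class names, and the toy training strings with placeholder labels are all assumptions made for illustration, and the language model probability features mentioned in the abstract are left out.

```python
import math
from collections import Counter, defaultdict


def extract_features(sentence, word_n=(1, 2), char_n=(1, 3)):
    """Count word n-grams and character n-grams in one sentence.

    The n-gram ranges are illustrative defaults, not the paper's settings.
    """
    feats = Counter()
    words = sentence.split()
    for n in range(word_n[0], word_n[1] + 1):
        for i in range(len(words) - n + 1):
            feats["w:" + " ".join(words[i:i + n])] += 1
    for n in range(char_n[0], char_n[1] + 1):
        for i in range(len(sentence) - n + 1):
            feats["c:" + sentence[i:i + n]] += 1
    return feats


class MultinomialNB:
    """Multinomial Naive Bayes with add-alpha smoothing (alpha is an assumption)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, sentences, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for sentence, label in zip(sentences, labels):
            feats = extract_features(sentence)
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        # Total feature mass per class, used in the smoothed likelihood.
        self.totals = {
            c: sum(self.feature_counts[c].values()) for c in self.class_counts
        }
        return self

    def log_scores(self, sentence):
        """Return log P(class) + sum of count * log P(feature | class)."""
        feats = extract_features(sentence)
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for c, n_c in self.class_counts.items():
            score = math.log(n_c / n_docs)
            denom = self.totals[c] + self.alpha * vocab_size
            for f, count in feats.items():
                if f in self.vocab:  # skip features never seen in training
                    num = self.feature_counts[c][f] + self.alpha
                    score += count * math.log(num / denom)
            scores[c] = score
        return scores

    def predict(self, sentence):
        scores = self.log_scores(sentence)
        return max(scores, key=scores.get)


# Toy, made-up strings with placeholder labels (not MADAR data).
train_sentences = ["aaa bbb aaa", "aaa ccc", "xxx yyy", "yyy zzz xxx"]
train_labels = ["dialect_a", "dialect_a", "dialect_b", "dialect_b"]

clf = MultinomialNB().fit(train_sentences, train_labels)
prediction_a = clf.predict("aaa bbb")
prediction_b = clf.predict("xxx zzz")
```

Character n-grams let such a model pick up sub-word cues (affixes, spelling variants) that whole-word features miss, which is one plausible reason to combine both feature types for closely related dialects.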
Main file
SmartSubmissionMADAR.pdf (117.25 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02166384, version 1 (26-06-2019)

Identifiers

  • HAL Id: hal-02166384, version 1

Cite

Karima Meftouh, Karima Abidi, Salima Harrat, Kamel Smaïli. The SMarT Classifier for Arabic Fine-Grained Dialect Identification. The Fourth Arabic Natural Language Processing Workshop co-located with ACL, Aug 2019, Florence, Italy. ⟨hal-02166384⟩
135 views
368 downloads