Cross-Dialectal Arabic Processing - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Cross-Dialectal Arabic Processing

Résumé

We present, in this paper an Arabic multi-dialect study including dialects from both the Maghreb and the Middle-east that we compare to the Modern Standard Arabic (MSA). Three dialects from Maghreb are concerned by this study: two from Algeria and one from Tunisia and two dialects from Middle-east (Syria and Palestine). The resources which have been built from scratch have lead to a collection of a multi-dialect parallel resource. Furthermore, this collection has been aligned by hand with a MSA corpus. We conducted several analytical studies in order to understand the relationship between these vernacular languages. For this, we studied the closeness between all the pairs of dialects and MSA in terms of Hellinger distance. We also performed an experiment of dialect identification. This experiment showed that neighbouring dialects as expected tend to be confused, making difficult their identification. Because the Arabic dialects are different from one region to another which make the communication between people difficult, we conducted cross-lingual machine translation between all the pairs of dialects and also with MSA. Several interesting conclusions have been carried out from this experiment.
Fichier principal
Vignette du fichier
cicling2015Smaili.pdf (250.83 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01261598 , version 1 (25-01-2016)

Identifiants

Citer

Salima Harrat, Karima Meftouh, Mourad Abbas, Salma Jamoussi, Motaz Saad, et al.. Cross-Dialectal Arabic Processing. International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2015, cairo, Egypt. ⟨10.1007/978-3-319-18111-0_47⟩. ⟨hal-01261598⟩
277 Consultations
1476 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More