Arabizi transliteration of Algerian Arabic dialect into Modern Standard Arabic - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Arabizi transliteration of Algerian Arabic dialect into Modern Standard Arabic

Résumé

Machine transliteration is a very important research area in the field of machine translation. Neural Machine transliteration (NMTR) is a new approach to machine transliteration that has shown promising results. However research on NMTR of Arabic has just begun to give results while no research has been done on neural transliteration of Arabic dialect written in Latin letters known by “Arabizi”. In the current paper, we propose a me-thod of applying a neural transliteration based on a character-level for transliterating the Arabizi to Arabic script. Our method is composed of two important steps: 1) An Arabizi corpus construction 2) A character-based neural transliteration of Arabizi to Arabic. The evaluations were performed on in-ternal and external dataset. The best precision obtained is 73.66% on the internal dataset and 45.35% on the external one. We also conduct the same experiments for Statistical Machine Transliteration (SMTR), which has largely been studied in the literature, albeit NMTR obtains substantial improvements (up to 2.18%) over SMTR.
Fichier non déposé

Dates et versions

hal-01570289 , version 1 (28-07-2017)

Identifiants

  • HAL Id : hal-01570289 , version 1

Citer

Imane Guellil, Faiçal Azouaou, Mourad Abbas, Sadat Fatiha. Arabizi transliteration of Algerian Arabic dialect into Modern Standard Arabic. Social MT 2017/ First workshop on Social Media and User Generated Content Machine Translation, May 2017, Prague, Czech Republic. ⟨hal-01570289⟩
319 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More