A Romanization System and WebMAUS Aligner for Arabic Varieties - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

A Romanization System and WebMAUS Aligner for Arabic Varieties

Résumé

This paper presents the results of an ongoing collaboration to develop an Arabic variety-independent romanization system that aims to homogenize and simplify the romanization of the Arabic script, and introduces an Arabic variety-independent WebMAUS service offering a free to use forced-alignment service fully integrated within the WebMAUS services. We present the rationale for developing such a system, highlighting the need for a detailed romanization system with graphemes corresponding to the phonemic short and long vowels/consonants in Arabic varieties. We describe how the acoustic model was created, followed by several hands-on recipes for applying the forced alignment webservice either online or programatically. Finally, we discuss some of the issues we faced during the development of the system.
Fichier principal
Vignette du fichier
2022.lrec-1.789.pdf (206.78 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03729122 , version 1 (20-07-2022)

Identifiants

  • HAL Id : hal-03729122 , version 1

Citer

Jalal Al-Tamimi, Florian Schiel, Ghada Khattab, Navdeep Sokhey, Djegdjiga Amazouz, et al.. A Romanization System and WebMAUS Aligner for Arabic Varieties. 13th Conference on Language Resources and Evaluation (LREC 2022), 2022, Marseille, France. ⟨hal-03729122⟩
104 Consultations
127 Téléchargements

Partager

Gmail Facebook X LinkedIn More