ALIF editor for generating Arabic normalized lexicons

Abstract : the development of a normalized morpho-syntactic Arabic lexicon is not an easy task. In fact, many norms allow the structuration and representation of lexical data. The adoption of a stable standard will guarantee the interoperability and interchangeability of lexical resources. Still, research work that deals with normalization for Arabic lexical resources is not well developed yet, especially for some standards such as the TEI (Text Encoding Initiative). In this context, we aim at creating an Arabic lexicon editor with a constraint checker based on both the ISO standard LMF (Lexical Markup Framework) and the TEI guidelines. To develop this editor, we use a linguistic approach composed of several steps. The editor's prototype named ALIF can guarantee the construction of two types of output lexicon files: one in LMF and the other in TEI. The evaluation of this system is based upon a lexical database that contains all the derived and inflected forms generated from a lexicon of 10 000 canonical verbs. The results obtained were encouraging despite some flaws related to exceptional cases of difficult words.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [11 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01469966
Contributor : Kais Haddar <>
Submitted on : Thursday, February 16, 2017 - 9:13:53 PM
Last modification on : Thursday, April 4, 2019 - 10:18:06 AM
Document(s) archivé(s) le : Thursday, May 18, 2017 - 12:40:29 AM

File

ICICS.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01469966, version 1

Collections

CMB

Citation

Samia Ben Ismail, Hajer Maraoui, Kais Haddar, Laurent Romary. ALIF editor for generating Arabic normalized lexicons . The International Conference on Information and Communication Systems, IEEE Jordan Section, Jordan University of Science and Technology, Apr 2017, Irbid, Jordan. ⟨hal-01469966⟩

Share

Metrics

Record views

86

Files downloads

118