Conference Paper Year: 2016

Recognition and TEI annotation of Arabic Events Using Transducers

Abstract

Arabic Named Entity (ANE) recognition is an important task that identifies relevant entities in textual resources and classifies them into predefined categories. ANEs of the category Event have become a new challenge for NLP applications. Their emergence is closely tied to the evolution of the Web, which regularly generates new articles about events in free resources such as Wikipedia. Recognizing and annotating them nevertheless requires a powerful formalism and a standard in order to produce structured output. In this paper, we propose a method to recognize and annotate ANEs of the Event category. The proposed method is based on finite-state transducers and follows the TEI recommendation. These transducers are grouped into a cascade generated by the CasSys tool, available under the Unitex linguistic platform. Our corpora are extracted from Arabic Wikipedia using the Kiwix tool. The results obtained are satisfactory according to the calculated measures.
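The cascade idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' Unitex/CasSys implementation: it approximates each transducer with a Python regular-expression rewrite, uses English toy sentences for readability, and assumes a hypothetical trigger list (`EVENT_TRIGGERS`) and a TEI-style tag choice (`<name type="event">`, `<date>`) that the abstract does not specify. The point is only to show how a later stage can build on the markup produced by an earlier one.

```python
import re

# Hypothetical trigger words (assumption): the paper's actual dictionaries
# and transducer graphs are not given in the abstract.
EVENT_TRIGGERS = ("Battle", "Festival", "Conference")


def tag_dates(text):
    """Stage 1: wrap four-digit years in a TEI-style <date> element."""
    return re.sub(r"\b(\d{4})\b", r'<date when="\1">\1</date>', text)


def tag_events(text):
    """Stage 2: wrap 'Trigger of Name' in <name type="event">.

    An immediately following <date> element produced by stage 1 is
    absorbed into the event, which is what makes this a cascade.
    """
    triggers = "|".join(EVENT_TRIGGERS)
    pattern = (
        rf"\b((?:{triggers}) of [A-Z][a-z]+"
        rf'(?: <date when="\d{{4}}">\d{{4}}</date>)?)'
    )
    return re.sub(pattern, r'<name type="event">\1</name>', text)


# The cascade is an ordered list of stages; each consumes the previous output.
CASCADE = [tag_dates, tag_events]


def run_cascade(text):
    for stage in CASCADE:
        text = stage(text)
    return text


if __name__ == "__main__":
    print(run_cascade("The Battle of Hastings 1066 was decisive."))
```

Ordering matters in this sketch: because dates are tagged first, the event stage can match on the `<date>` markup rather than on raw digits, mirroring the way a cascade lets later transducers rely on earlier annotations.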
No file deposited

Dates and versions

hal-01291336, version 1 (21-03-2016)

Identifiers

  • HAL Id: hal-01291336, version 1

Cite

Fatma Ben Mesmia, Nathalie Friburger, Kais Haddar, Denis Maurel. Recognition and TEI annotation of Arabic Events Using Transducers. 17th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2016, Konya, Turkey. ⟨hal-01291336⟩