Encoding prototype of Al-Hadith Al-Shareef in TEI

Abstract : The standardization of Al-Hadith Al-Shareef can guarantee the interoperability and interchangeability with other textual sources and takes the processing of Al-Hadith corpus to a higher level. Still, research works on Hadith corpora had not previously considered the standardization as real objective, especially for some standards such as TEI (Text Encoding Initiative). In this context, we aim at the standardization of Al-Hadith Al-Shareef on the basis of the TEI guidelines. To achieve this objective, we elaborated a TEI model that we customized for Hadith structure. Then we developed a prototype allowing the encoding of Hadith text. This prototype analyses Hadith texts and automatically generates a standardized version of the Hadith in TEI format. The evaluation of the TEI model and the prototype is based on Hadith corpus collected from Sahih Bukhari. The obtained results were encouraging despite some flaws related to exceptional cases of Hadith structure.
Type de document :
Communication dans un congrès
ICALP 2017 - The 6th International Conference on Arabic Language Processing, Oct 2017, Fes, Morocco. pp.14, 2017
Liste complète des métadonnées

Littérature citée [16 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01574543
Contributeur : Hajer Maraoui <>
Soumis le : mardi 15 août 2017 - 14:49:47
Dernière modification le : vendredi 18 mai 2018 - 09:40:02

Fichier

Article ICALP 2017.pdf
Fichiers produits par l'(les) auteur(s)

Licence


Copyright (Tous droits réservés)

Identifiants

  • HAL Id : hal-01574543, version 1
  • INERIS : ICALP'17

Collections

Citation

Hajer Maraoui, Kais Haddar, Laurent Romary. Encoding prototype of Al-Hadith Al-Shareef in TEI. ICALP 2017 - The 6th International Conference on Arabic Language Processing, Oct 2017, Fes, Morocco. pp.14, 2017. 〈hal-01574543〉

Partager

Métriques

Consultations de la notice

209

Téléchargements de fichiers

217