Encoding prototype of Al-Hadith Al-Shareef in TEI

Abstract : The standardization of Al-Hadith Al-Shareef can guarantee the interoperability and interchangeability with other textual sources and takes the processing of Al-Hadith corpus to a higher level. Still, research works on Hadith corpora had not previously considered the standardization as real objective, especially for some standards such as TEI (Text Encoding Initiative). In this context, we aim at the standardization of Al-Hadith Al-Shareef on the basis of the TEI guidelines. To achieve this objective, we elaborated a TEI model that we customized for Hadith structure. Then we developed a prototype allowing the encoding of Hadith text. This prototype analyses Hadith texts and automatically generates a standardized version of the Hadith in TEI format. The evaluation of the TEI model and the prototype is based on Hadith corpus collected from Sahih Bukhari. The obtained results were encouraging despite some flaws related to exceptional cases of Hadith structure.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [16 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01574543
Contributor : Hajer Maraoui <>
Submitted on : Tuesday, August 15, 2017 - 2:49:47 PM
Last modification on : Friday, March 22, 2019 - 2:22:12 PM

File

Article ICALP 2017.pdf
Files produced by the author(s)

Licence


Copyright

Identifiers

  • HAL Id : hal-01574543, version 1
  • INERIS : ICALP'17

Collections

Citation

Hajer Maraoui, Kais Haddar, Laurent Romary. Encoding prototype of Al-Hadith Al-Shareef in TEI. ICALP 2017 - The 6th International Conference on Arabic Language Processing, Oct 2017, Fes, Morocco. pp.14. ⟨hal-01574543⟩

Share

Metrics

Record views

240

Files downloads

255