Skip to Main content Skip to Navigation
Conference papers

Encoding prototype of Al-Hadith Al-Shareef in TEI

Abstract : The standardization of Al-Hadith Al-Shareef can guarantee the interoperability and interchangeability with other textual sources and takes the processing of Al-Hadith corpus to a higher level. Still, research works on Hadith corpora had not previously considered the standardization as real objective, especially for some standards such as TEI (Text Encoding Initiative). In this context, we aim at the standardization of Al-Hadith Al-Shareef on the basis of the TEI guidelines. To achieve this objective, we elaborated a TEI model that we customized for Hadith structure. Then we developed a prototype allowing the encoding of Hadith text. This prototype analyses Hadith texts and automatically generates a standardized version of the Hadith in TEI format. The evaluation of the TEI model and the prototype is based on Hadith corpus collected from Sahih Bukhari. The obtained results were encouraging despite some flaws related to exceptional cases of Hadith structure.
Document type :
Conference papers
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download
Contributor : Hajer Maraoui <>
Submitted on : Tuesday, August 15, 2017 - 2:49:47 PM
Last modification on : Tuesday, June 23, 2020 - 12:30:04 PM


Article ICALP 2017.pdf
Files produced by the author(s)




  • HAL Id : hal-01574543, version 1



Hajer Maraoui, Kais Haddar, Laurent Romary. Encoding prototype of Al-Hadith Al-Shareef in TEI. ICALP 2017 - The 6th International Conference on Arabic Language Processing, Oct 2017, Fes, Morocco. pp.14. ⟨hal-01574543⟩



Record views


Files downloads