Segmentation tool for hadith corpus to generate TEI encoding

Abstract : A segmentation tool for a hadith corpus is necessary to prepare the TEI hadith encoding process. In this context, we aim to develop a tool allowing the segmentation of hadith text from Sahih al-Bukhari corpus. To achieve this objective, we start by identifying different hadith structures. Then, we elaborate an automatic processing tool for hadith segmentation. This tool will be integrated in a prototype allowing the TEI encoding process. The experimentation and the evaluation of this tool is based on Sahih al-Bukhari corpus. The obtained results were encouraging despite some flaws related to exceptional cases of hadith structure.
Type de document :
Communication dans un congrès
4th International Conference on Advanced Intelligent Systems and Informatics (AISI’18), Sep 2018, Cairo, Egypt
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01794105
Contributeur : Hajer Maraoui <>
Soumis le : jeudi 17 mai 2018 - 11:41:38
Dernière modification le : jeudi 5 juillet 2018 - 11:18:36
Document(s) archivé(s) le : mercredi 26 septembre 2018 - 01:28:09

Fichier

Article AISI 2018.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01794105, version 1

Collections

Citation

Hajer Maraoui, Kais Haddar, Laurent Romary. Segmentation tool for hadith corpus to generate TEI encoding. 4th International Conference on Advanced Intelligent Systems and Informatics (AISI’18), Sep 2018, Cairo, Egypt. 〈hal-01794105〉

Partager

Métriques

Consultations de la notice

103

Téléchargements de fichiers

149