Segmentation tool for hadith corpus to generate TEI encoding - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Segmentation tool for hadith corpus to generate TEI encoding

Résumé

A segmentation tool for a hadith corpus is necessary to prepare the TEI hadith encoding process. In this context, we aim to develop a tool allowing the segmentation of hadith text from Sahih al-Bukhari corpus. To achieve this objective, we start by identifying different hadith structures. Then, we elaborate an automatic processing tool for hadith segmentation. This tool will be integrated in a prototype allowing the TEI encoding process. The experimentation and the evaluation of this tool is based on Sahih al-Bukhari corpus. The obtained results were encouraging despite some flaws related to exceptional cases of hadith structure.
Fichier principal
Vignette du fichier
Article AISI 2018.pdf (221.17 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01794105 , version 1 (17-05-2018)

Identifiants

  • HAL Id : hal-01794105 , version 1

Citer

Hajer Maraoui, Kais Haddar, Laurent Romary. Segmentation tool for hadith corpus to generate TEI encoding. 4th International Conference on Advanced Intelligent Systems and Informatics (AISI’18), Sep 2018, Cairo, Egypt. ⟨hal-01794105⟩

Collections

INRIA INRIA2
189 Consultations
1155 Téléchargements

Partager

Gmail Facebook X LinkedIn More