A logical representation of Arabic questions toward automatic passage extraction from the Web

Abstract: With the rapid growth of Arabic electronic data on the web, information extraction, one of the major challenges of question answering, is essential for building document corpora. Corpus construction is itself an active research topic across several major areas of Natural Language Processing (NLP), such as Information Retrieval (IR), Question Answering (QA), and Automatic Summarization (AS). Generally, a question-answering system provides various passages to answer user questions. To make these passages truly informative, the system needs access to an underlying knowledge base, which requires the construction of a corpus. The aim of our research is to build an Arabic question-answering system. Analyzing the question is the first step; the next is to retrieve a passage from the web that can serve as an appropriate answer. In this paper, we propose a method to analyze the question and retrieve the answer passage in Arabic. For question analysis, five factual question types are processed. We also experiment with generating a logical representation from the declarative form of each question. Several studies dealing with logic-based approaches to question answering have been carried out for languages other than Arabic. This representation is very promising because it later helps in selecting a justifiable answer. The proportion of questions correctly analyzed and translated into logical form reached 64%. The automatically generated text passages achieved 87% accuracy and a c@1 score of 98%.
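For readers unfamiliar with the c@1 measure reported alongside accuracy, the sketch below shows how it is commonly defined in question-answering evaluation: unanswered questions are credited at the system's observed accuracy rather than counted as wrong. This is a minimal illustration, assuming the standard definition; the abstract does not give the formula or the underlying counts, so the function names and the example numbers are hypothetical.

```python
def accuracy(n_correct: int, n_total: int) -> float:
    """Fraction of all questions that were answered correctly."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    return n_correct / n_total


def c_at_1(n_correct: int, n_unanswered: int, n_total: int) -> float:
    """c@1 = (n_R + n_U * n_R / n) / n, where n_R is the number of
    correctly answered questions, n_U the number left unanswered,
    and n the total number of questions."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    if n_correct + n_unanswered > n_total:
        raise ValueError("counts exceed the total number of questions")
    return (n_correct + n_unanswered * n_correct / n_total) / n_total


# Hypothetical counts, not taken from the paper:
# 10 questions, 8 answered correctly, 1 left unanswered, 1 answered wrongly.
print(accuracy(8, 10))
print(c_at_1(8, 1, 10))
```

When no question is left unanswered, c@1 reduces to plain accuracy, which is why the two scores can differ only for systems that sometimes abstain.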
Document type:
Journal article
International Journal of Speech Technology, Springer Verlag, 2017, 20 (2), pp.339 - 353. 〈10.1007/s10772-017-9411-7〉

https://hal.archives-ouvertes.fr/hal-01794688
Contributor: Patrice Bellot
Submitted on: Monday, May 21, 2018 - 21:57:37
Last modified on: Wednesday, September 12, 2018 - 01:27:12
Document(s) archived on: Tuesday, September 25, 2018 - 00:54:47

File

FINAL VERSION CORRIGEE ARTICLE...
Files produced by the author(s)

Citation

Patrice Bellot, Wided Bakari, Mahmoud Neji. A logical representation of Arabic questions toward automatic passage extraction from the Web. International Journal of Speech Technology, Springer Verlag, 2017, 20 (2), pp.339 - 353. 〈10.1007/s10772-017-9411-7〉. 〈hal-01794688〉
