A Corpus for Hybrid Question Answering Systems

Brigitte Grau; Anne-Laure Ligozat

doi:10.1145/3184558.3191540

Communication Dans Un Congrès Année : 2018

A Corpus for Hybrid Question Answering Systems

(1, 2) , (2, 1)

1
2

Brigitte Grau

Fonction : Auteur
PersonId : 177137
IdHAL : brigitte-grau

Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Anne-Laure Ligozat

Fonction : Auteur
PersonId : 11451
IdHAL : anne-laure-ligozat
ORCID : 0000-0002-2188-3426
IdRef : 112991440

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise

Résumé

Question answering has been the focus of a lot of researches and evaluation campaigns, either for text-based systems (TREC and CLEF evaluation campaigns for example), or for knowledge-based systems (QALD, BioASQ). Few systems have effectively combined both types of resources and methods in order to exploit the fruitful- ness of merging the two kinds of information repositories. The only evaluation QA track that focuses on hybrid QA is QALD since 2014. As it is a recent task, few annotated data are available (around 150 questions). In this paper, we present a question answering dataset that was constructed to develop and evaluate hybrid question an- swering systems. In order to create this corpus, we collected several textual corpora and augmented them with entities and relations of a knowledge base by retrieving paths in the knowledge base which allow to answer the questions. The resulting corpus contains 4300 question-answer pairs and 1600 have a true link with DBpedia.

Mots clés

Hybrid Question Answering Corpus

Domaines

Informatique [cs] Informatique et langage [cs.CL]

Fichier principal

HQA0790-grauA.pdf (489.87 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Brigitte Grau : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02284465

Soumis le : mercredi 11 septembre 2019-18:31:46

Dernière modification le : samedi 7 octobre 2023-21:36:20

Archivage à long terme le : samedi 8 février 2020-02:27:29

Dates et versions

hal-02284465 , version 1 (11-09-2019)

Identifiants

HAL Id : hal-02284465 , version 1
DOI : 10.1145/3184558.3191540

Citer

Brigitte Grau, Anne-Laure Ligozat. A Corpus for Hybrid Question Answering Systems. Workshop on Hybrid Question Answering with Structured and Unstructured Knowledge, Apr 2018, Lyon - FR, France. pp.1081-1086, ⟨10.1145/3184558.3191540⟩. ⟨hal-02284465⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN GS-ENGINEERING GS-COMPUTER-SCIENCE ENSIIE

19 Consultations

191 Téléchargements

A Corpus for Hybrid Question Answering Systems

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager