Hard Time Parsing Questions: Building a QuestionBank for French

Abstract : We present the French Question Bank, a treebank of 2600 questions. We show that classical parsing model performance drop while the inclusion of this data set is highly beneficial without harming the parsing of non-question data. when facing out-of-domain data with strong structural divergences. Two thirds being aligned with the English QuestionBank (Judge et al., 2006) and being freely available, this treebank will prove useful to build robust NLP systems.
Type de document :
Communication dans un congrès
Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016, Portorož, Slovenia. Proceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC 2016)
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01457184
Contributeur : Marie Candito <>
Soumis le : mardi 9 mai 2017 - 17:14:16
Dernière modification le : mercredi 10 mai 2017 - 01:07:36
Document(s) archivé(s) le : jeudi 10 août 2017 - 13:38:04

Fichier

lrec2016_QuestionBank.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01457184, version 2

Collections

Citation

Djamé Seddah, Marie Candito. Hard Time Parsing Questions: Building a QuestionBank for French. Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016, Portorož, Slovenia. Proceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC 2016). 〈hal-01457184v2〉

Partager

Métriques

Consultations de la notice

133

Téléchargements de fichiers

106