A French Fairy Tale Corpus syntactically and semantically annotated. - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

A French Fairy Tale Corpus syntactically and semantically annotated.

Résumé

Fairy tales, folktales and more generally children stories have lately attracted the Natural Language Processing (NLP) community. As such, very few corpora exist and linguistic resources are lacking. The work presented in this paper aims at filling this gap by presenting a syntactically and semantically annotated corpus. It focuses on the linguistic analysis of a Fairy Tales Corpus, and provides the description of the syntactic and semantic resources developed for Information Extraction. Resources include syntactic dependency relation annotation for 120 verbs; referential annotation, which is concerned with annotating each anaphoric occurrence and Proper Name with the most specific noun in the text; ontology matching for a substantial part of the nouns in the corpus; semantic role labelling for 41 verbs using the FrameNet database. The article also sums up previous analyses of this corpus and indicates possible uses of this corpus for the NLP community.
Fichier principal
Vignette du fichier
LREC2012final.pdf (69.79 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00768592 , version 1 (22-12-2012)

Identifiants

  • HAL Id : hal-00768592 , version 1

Citer

Ismaïl El Maarouf, Jeanne Villaneau. A French Fairy Tale Corpus syntactically and semantically annotated.. LREC'12, May 2012, Istanbul, Turkey. pp.978-2-9517408-7-7. ⟨hal-00768592⟩
296 Consultations
257 Téléchargements

Partager

Gmail Facebook X LinkedIn More