"A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

"A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels

Résumé

In this paper, we apply named entity recognition techniques to a corpus of literary texts, i.e. French novels from the 18 th , 19 th and 20 th century. We obtain results that are usable but could be improved by using advanced annotation techniques. We discuss the use of active learning in this context, as well as the different applications that could be derived from this kind of annotation. In particular, we show that the automatic annotation of large literary corpora makes it possible to check whether traditional classifications exhibit specific structural patterns that could be identified automatically.
Fichier principal
Vignette du fichier
char_corpora2019-rabu2019.pdf (250.43 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02265134 , version 1 (08-08-2019)

Identifiants

  • HAL Id : hal-02265134 , version 1

Citer

Benjamin Rabu, Frédérique Mélanie-Becquet, Thierry Poibeau. "A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels. International Conference on Corpus Linguistics 2019, Jun 2019, Saint Petersbourg, Russia. ⟨hal-02265134⟩
196 Consultations
199 Téléchargements

Partager

Gmail Facebook X LinkedIn More