"A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels

Abstract: In this paper, we apply named entity recognition techniques to a corpus of literary texts, namely French novels from the 18th, 19th and 20th centuries. The results we obtain are usable but could be improved with more advanced annotation techniques. We discuss the use of active learning in this context, as well as the applications that could be derived from this kind of annotation. In particular, we show that the automatic annotation of large literary corpora makes it possible to check whether traditional classifications exhibit specific structural patterns that could be identified automatically.

https://hal.archives-ouvertes.fr/hal-02265134
Contributor: Thierry Poibeau
Submitted on: Thursday, August 8, 2019 - 3:28:29 PM
Last modification on: Sunday, August 11, 2019 - 1:08:32 AM

File

char_corpora2019-rabu2019.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-02265134, version 1

Citation

B Rabu, F Mélanie, Thierry Poibeau. "A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels. International Conference on Corpus Linguistics 2019, Jun 2019, Saint Petersburg, Russia. ⟨hal-02265134⟩