Conference papers

TCOF-POS : un corpus libre de français parlé annoté en morphosyntaxe

Abstract : This article details the creation of TCOF-POS, the first freely available corpus of spontaneous spoken French. We present the methodology followed to obtain the best possible quality in the final resource. The corpus is already freely available and can be used as a training and validation corpus for NLP tools, as well as a study corpus for linguistic research. We also present the results obtained by two POS taggers trained on the corpus.

https://hal.archives-ouvertes.fr/hal-00709187
Contributor : Karën Fort
Submitted on : Monday, June 18, 2012 - 10:25:37 AM
Last modification on : Saturday, March 28, 2020 - 2:20:38 AM
Document(s) archived on : Wednesday, September 19, 2012 - 2:35:25 AM

File

TALN2012_CBKFBS_Oral_FinaleSou...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00709187, version 1

Citation

Christophe Benzitoun, Karen Fort, Benoît Sagot. TCOF-POS : un corpus libre de français parlé annoté en morphosyntaxe. JEP-TALN 2012 - Journées d'Études sur la Parole et conférence annuelle du Traitement Automatique des Langues Naturelles, Jun 2012, Grenoble, France. pp.99-112. ⟨hal-00709187⟩
