Skip to Main content Skip to Navigation
Conference papers

TCOF-POS : un corpus libre de français parlé annoté en morphosyntaxe

Abstract : This article details the creation of TCOF-POS, the first freely available corpus of spontaneous spoken French. We present here the methodology that was followed in order to obtain the best possible quality in the final resource. This corpus already is freely available and can be used as a training/validation corpus for NLP tools, as well as a study corpus for linguistic research. We also present the results obtained by two POS-taggers trained on the corpus.
Document type :
Conference papers
Complete list of metadata

Cited literature [15 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00709187
Contributor : Karën Fort <>
Submitted on : Monday, June 18, 2012 - 10:25:37 AM
Last modification on : Thursday, February 25, 2021 - 9:54:05 AM
Long-term archiving on: : Wednesday, September 19, 2012 - 2:35:25 AM

File

TALN2012_CBKFBS_Oral_FinaleSou...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00709187, version 1

Citation

Christophe Benzitoun, Karen Fort, Benoît Sagot. TCOF-POS : un corpus libre de français parlé annoté en morphosyntaxe. JEP-TALN 2012 - Journées d'Études sur la Parole et conférence annuelle du Traitement Automatique des Langues Naturelles, Jun 2012, Grenoble, France. pp.99-112. ⟨hal-00709187⟩

Share

Metrics

Record views

1809

Files downloads

903