Skip to Main content Skip to Navigation
Conference papers

FrSemCor: Annotating a French corpus with supersenses

Abstract : French, as many languages, lacks semantically annotated corpus data. Our aim is to provide the linguistic and NLP research communities with a gold standard sense-annotated corpus of French, using WordNet Unique Beginners as semantic tags, thus allowing for interoperability. In this paper, we report on the first phase of the project, which focused on the annotation of common nouns. The resulting dataset consists of more than 12,000 French noun tokens which were annotated in double blind and adjudicated according to a carefully redefined set of supersenses. The resource is released online under a Creative Commons Licence.
Document type :
Conference papers
Complete list of metadata

Cited literature [22 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02511929
Contributor : Lucie Barque <>
Submitted on : Thursday, March 19, 2020 - 11:00:33 AM
Last modification on : Tuesday, January 5, 2021 - 5:28:07 PM
Long-term archiving on: : Saturday, June 20, 2020 - 1:45:20 PM

File

Fr_SemCor_LREC2020.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02511929, version 1

Citation

L Barque, P Haas, R Huyghe, D Tribout, M Candito, et al.. FrSemCor: Annotating a French corpus with supersenses. LREC-2020, May 2020, Marseille, France. ⟨hal-02511929⟩

Share

Metrics

Record views

363

Files downloads

176