Skip to Main content Skip to Navigation

Acquisition and enrichment of morphological and morphosemantic knowledge from the French Wiktionary

Abstract : We present two approaches to automatically acquire morphologically related words from Wiktionary. Starting with related words explicitly mentioned in the dictionary, we propose a method based on orthographic similarity to detect new derived words from the entries' definitions with an overall accuracy of 93.5%. Using word pairs from the initial lexicon as patterns of formal analogies to filter new derived words enables us to rise the accuracy up to 99%, while extending the lexicon's size by 56%. In a last experiment, we show that it is possible to semantically type the morphological definitions, focusing on the detection of process nominals.
Document type :
Conference papers
Complete list of metadatas

Cited literature [38 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01111869
Contributor : Franck Sajous <>
Submitted on : Tuesday, February 3, 2015 - 7:11:34 PM
Last modification on : Tuesday, July 9, 2019 - 10:12:56 AM
Document(s) archivé(s) le : Wednesday, May 27, 2015 - 3:20:49 PM

File

HathoutEtAl2014-COLING_LGLP.pd...
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01111869, version 1

Citation

Nabil Hathout, Franck Sajous, Basilio Calderone. Acquisition and enrichment of morphological and morphosemantic knowledge from the French Wiktionary. Workshop on Lexical and Grammatical Resources for Language Processing, COLING 2014, 2014, Dublin, Ireland. pp.65-74. ⟨hal-01111869⟩

Share

Metrics

Record views

251

Files downloads

414