
Building a free French wordnet from multilingual resources

Abstract : This paper describes the automatic construction of a freely available wordnet for French (WOLF), based on Princeton WordNet (PWN), from various multilingual resources. Polysemous words were handled with an approach in which a parallel corpus in five languages was word-aligned and the extracted multilingual lexicon was disambiguated using the existing wordnets for these languages. For monosemous words, by contrast, a bilingual approach sufficed to acquire equivalents: bilingual lexicons were extracted from Wikipedia and thesauri. The results obtained from each resource were merged and ranked according to the number of resources yielding the same literal. The merged wordnet was evaluated automatically against the French WordNet (FREWN), and a sample of the generated synsets was also evaluated manually. The precision obtained indicates that the approach is very promising, and applications that will use the resulting wordnet are already planned.
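
The merging and ranking step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `merge_and_rank`, the resource names, the synset identifiers and the sample literals are all hypothetical, and the sketch assumes that each resource simply proposes (synset, literal) pairs and that a literal's rank is the number of distinct resources proposing it for that synset.

```python
from collections import defaultdict


def merge_and_rank(candidates_by_resource):
    """Merge candidate literals from several resources and rank them.

    candidates_by_resource maps a resource name to an iterable of
    (synset_id, literal) pairs. All names here are illustrative only.
    Returns a dict: synset_id -> list of (literal, support) pairs,
    sorted by decreasing support, then alphabetically.
    """
    # Record which distinct resources proposed each (synset, literal) pair.
    support = defaultdict(set)
    for resource, pairs in candidates_by_resource.items():
        for synset_id, literal in pairs:
            support[(synset_id, literal)].add(resource)

    # Group the literals by synset, keeping the number of supporting resources.
    merged = defaultdict(list)
    for (synset_id, literal), resources in support.items():
        merged[synset_id].append((literal, len(resources)))

    # Rank: literals confirmed by more resources come first.
    for literals in merged.values():
        literals.sort(key=lambda item: (-item[1], item[0]))
    return dict(merged)


# Hypothetical sample data (synset ids and resource names are made up).
candidates = {
    "aligned_corpus": [("syn-001", "chien"), ("syn-002", "chat")],
    "wikipedia": [("syn-001", "chien"), ("syn-001", "clébard")],
    "thesaurus": [("syn-001", "chien"), ("syn-002", "chat"), ("syn-002", "matou")],
}
ranked = merge_and_rank(candidates)
for synset_id in sorted(ranked):
    print(synset_id, ranked[synset_id])
```

Counting distinct resources rather than raw occurrences keeps a literal that appears several times in one resource from outranking a literal confirmed independently by several resources; whether the paper applies exactly this rule is an assumption of the sketch.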
Document type : Conference papers

Cited literature [18 references]
Contributor : Benoît Sagot
Submitted on : Monday, August 15, 2011 - 11:34:04 AM
Last modification on : Friday, January 21, 2022 - 3:21:44 AM
Long-term archiving on : Friday, November 25, 2011 - 11:11:56 AM


Files produced by the author(s)


  • HAL Id : inria-00614708, version 1



Benoît Sagot, Darja Fišer. Building a free French wordnet from multilingual resources. OntoLex, May 2008, Marrakech, Morocco. ⟨inria-00614708⟩


