Building a free French wordnet from multilingual resources - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Building a free French wordnet from multilingual resources

Résumé

This paper describes automatic construction a freely-available wordnet for French (WOLF) based on Princeton WordNet (PWN) by using various multilingual resources. Polysemous words were dealt with an approach in which a parallel corpus for five languages was word-aligned and the extracted multilingual lexicon was disambiguated with the existing wordnets for these languages. On the other hand, a bilingual approach sufficed to acquire equivalents for monosemous words. Bilingual lexicons were extracted from Wikipedia and thesauri. The results obtained from each resource were merged and ranked according to the number of resources yielding the same literal. Automatic evaluation of the merged wordnet was performed with the French WordNet (FREWN). Manual evaluation was also carried out on a sample of the generated synsets. Precision shows that the presented approach has proved to be very promising and applications to use the created wordnet are already intended.
Fichier principal
Vignette du fichier
Ontolex08.pdf (276.59 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00614708 , version 1 (15-08-2011)

Identifiants

  • HAL Id : inria-00614708 , version 1

Citer

Benoît Sagot, Darja Fišer. Building a free French wordnet from multilingual resources. OntoLex, May 2008, Marrakech, Morocco. ⟨inria-00614708⟩
904 Consultations
876 Téléchargements

Partager

Gmail Facebook X LinkedIn More