| Publication type: |
 |
Conference proceedings |
 |
| Subject: |
 |
|
 |
| Title: |
 |
Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary |
 |
| Author(s): |
 |
Franck Sajous 1, Emmanuel Navarro 2, Bruno Gaume 1, Laurent Prévot 3, Yannick Chudy 1 |
 |
| Laboratory: |
 |
|
 |
| Abstract: |
 |
The lack of large-scale, freely available and durable lexical resources, and the consequences for NLP, is widely acknowledged but the attempts to cope with usual bottlenecks preventing their development often result in dead-ends. This article introduces a language-independent, semi-automatic and endogenous method for enriching lexical resources, based on collaborative editing and random walks through existing lexical relationships, and shows how this approach enables us to overcome recurrent impediments. It compares the impact of using different data sources and similarity measures on the task of improving synonymy networks. Finally, it defines an architecture for applying the presented method to Wiktionary and explains how it has been implemented. |
 |
| Fulltext language: |
 |
English |
 |
|
| Book title: |
 |
Advances in Natural Language Processing |
 |
| Audience: |
 |
international |
 |
| Publication date: |
 |
2010-09-01 |
 |
| Volume: |
 |
6233 |
 |
| Page, identifiant, ...: |
 |
332-344 |
 |
| Commercial editor: |
 |
Springer Berlin/Heidelberg |
 |
| Scientifics editor: |
 |
Hrafn Loftsson, Eiríkur Rögnvaldsson and Sigrún Helgadóttir |
 |
| Serie: |
 |
Lecture Notes in Computer Science |
 |
|
| Conference or book title: |
 |
7th International Conference on NLP, IceTAL 2010 |
 |
| Conference date: |
 |
2010-08-16 |
 |
| Conference date (end): |
 |
2010-08-18 |
 |
| City: |
 |
Reykjavik |
 |
| Country: |
 |
Iceland |
 |
|
| Keyword(s): |
 |
Collaboratively Constructed Lexical Resources – Endogenous Enrichment – Crowdsourcing – Wiktionary – Random Walks. |
 |
|